Unison Consulting Logo

Unison Consulting

Senior Data Engineer (Quantexa, Spark ,Scala, Elastic Search)

Posted 7 Days Ago
Be an Early Applicant
Singapore
Senior level
Singapore
Senior level
As a Senior Data Engineer, you will design, develop, and optimize big data solutions using Apache Spark, Scala, and Elasticsearch. You will implement data transformation processes, collaborate with teams to meet data requirements, and ensure data quality and integrity. The role requires optimizing job performance, deploying data engineering solutions on OpenShift, and monitoring data pipelines.
The summary above was generated by AI

Description

We are seeking a talented and experienced Senior Data Engineer (Quantexa)with expertise in Hadoop, Scala, Spark, Elastic, Open Shift Container Platform (OCP) and DevOps practices. Elasticsearch to join our team. As a Data Engineer, you will play a crucial role in designing, developing, and optimizing big data solutions using Apache Spark, Scala, and Elasticsearch. You will collaborate with cross-functional teams to build scalable and efficient data processing pipelines and search applications. Knowledge and experience in the Compliance / AML domain will be a plus. Working experience with Quantexa tool is a must.

Responsibilities:

·        Implement data transformation, aggregation, and enrichment processes to support various data analytics and machine learning initiatives

·        Collaborate with cross-functional teams to understand data requirements and translate them into effective data engineering solutions

·        Design, develop, and implement Spark Scala applications and data processing pipelines to process large volumes of structured and unstructured data

·        Integrate Elasticsearch with Spark to enable efficient indexing, querying, and retrieval of data

·        Optimize and tune Spark jobs for performance and scalability, ensuring efficient data processing and indexing in Elasticsearch

·        Implement data transformations, aggregations, and computations using Spark RDDs, DataFrames, and Datasets, and integrate them with Elasticsearch

·        Develop and maintain scalable and fault-tolerant Spark applications, adhering to industry best practices and coding standards

·        Troubleshoot and resolve issues related to data processing, performance, and data quality in the Spark-Elasticsearch integration

·        Monitor and analyze job performance metrics, identify bottlenecks, and propose optimizations in both Spark and Elasticsearch components

·        Ensure data quality and integrity throughout the data processing lifecycle

·        Design and deploy data engineering solutions on OpenShift Container Platform (OCP) using containerization and orchestration techniques

·        Optimize data engineering workflows for containerized deployment and efficient resource utilization

·        Collaborate with DevOps teams to streamline deployment processes, implement CI/CD pipelines, and ensure platform stability

·        Implement data governance practices, data lineage, and metadata management to ensure data accuracy, traceability, and compliance

·        Monitor and optimize data pipeline performance, troubleshoot issues, and implement necessary enhancements

·        Implement monitoring and logging mechanisms to ensure the health, availability, and performance of the data infrastructure

·        Document data engineering processes, workflows, and infrastructure configurations for knowledge sharing and reference

Requirements
  1. More than 5 years of experience as a Data Engineer
  2. · Bachelor's or Master's degree in Computer Science, Software Engineering, or a related discipline
  3. · Possession of Quantexa certification as a Data Engineer or Data Architect, with proficiency in the tool
  4. · Demonstrated experience as a Data Engineer, utilizing Hadoop, Spark, and data processing technologies in large-scale environments
  5. · Expertise in the Scala programming language and familiarity with functional programming principles
  6. · Prior experience with the Quantexa tool is highly desirable
  7. · Comprehensive understanding of Apache Spark architecture, including RDDs, DataFrames, and Spark SQL
  8. · Advanced proficiency in designing and developing data infrastructure utilizing Hadoop, Spark, and associated tools (HDFS, Hive, Pig, etc.)
  9. · Experience with containerization platforms such as OpenShift Container Platform (OCP) and container orchestration via Kubernetes
  10. · Proficiency in programming languages commonly employed in data engineering, including Spark, Python, Scala, or Java
  11. · Knowledge of DevOps methodologies, CI/CD pipelines, and infrastructure automation tools (e.g., Docker, Jenkins, Ansible, BitBucket)
  12. · Experience with Graphana, Prometheus, and Splunk will be considered an added advantage
  13. · Background in integrating and utilizing Elasticsearch for data indexing and search applications
  14. · Solid understanding of Elasticsearch data modeling, indexing strategies, and query optimization techniques
  15. · Experience with distributed computing, parallel processing, and handling large datasets
  16. · Proficient in performance tuning and optimization methods for Spark applications and Elasticsearch queries
  17. · Strong problem-solving and analytical capabilities with the capacity to debug and resolve intricate issues
  18. · Familiarity with version control systems (e.g., Git) and collaborative development environments

Top Skills

Java
Python
Scala
Spark

Unison Consulting Singapore Office

1 Changi Business Park Crescent, , Plaza 8 #03-06 Tower A, Singapore, , Singapore, 486025

Unison Consulting Singapore Office

#12-00, 63 Market Street, Bank of Singapore Center, Singapore, , Singapore, 048942

Similar Jobs

Be an Early Applicant
2 Days Ago
Singapore, SGP
32,902 Employees
Mid level
32,902 Employees
Mid level
Information Technology
The Data Engineer (Python) will be responsible for ingesting data from various sources, curating data assets, collaborating with teams, deploying ML models, and architecting data pipelines to enable informed decision-making. The role focuses on enhancing the user experience through data-driven solutions.
Be an Early Applicant
3 Days Ago
Singapore, SGP
96 Employees
Junior
96 Employees
Junior
Information Technology • Consulting
As a Junior Data Engineer, you will design, develop, and maintain data pipelines using Hadoop and Spark. You will collaborate with teams to ensure data quality, and implement CI/CD processes with DevOps. The role requires optimizing workflows for containerized deployment and mentoring junior members.
Be an Early Applicant
8 Days Ago
Singapore, SGP
96 Employees
Senior level
96 Employees
Senior level
Information Technology • Consulting
The Data Engineer will design, implement, and optimize data pipelines using Google Cloud Platform tools. Key responsibilities include developing ETL processes, managing data workflows, ensuring data quality, and collaborating with stakeholders to fulfill data requirements.

What you need to know about the Singapore Tech Scene

The digital revolution has driven a constant demand for tech professionals across industries like software development, data analytics and cybersecurity. In Singapore, one of the largest cities in Southeast Asia, the demand for tech talent is so high that the government continues to invest millions into programs designed to develop a talent pipeline directly from universities while also scaling efforts in pre-employment training and mid-career upskilling to expand and elevate its workforce.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account