Unison Consulting

Quantexa Data Engineer

Posted 3 Days Ago
In-Office
Singapore
Senior level
The Quantexa Data Engineer will design and optimize big data solutions, build data processing pipelines, and integrate Elasticsearch with Spark, collaborating with teams to ensure data quality and governance.
The summary above was generated by AI

We are seeking a talented and experienced Data Engineer with expertise in Hadoop, Scala, Spark, Elasticsearch, OpenShift Container Platform (OCP), and DevOps practices to join our team. As a Data Engineer, you will play a crucial role in designing, developing, and optimizing big data solutions using Apache Spark, Scala, and Elasticsearch. You will collaborate with cross-functional teams to build scalable and efficient data processing pipelines and search applications. Knowledge of and experience in the Compliance / AML domain will be a plus. Working experience with the Quantexa tool is a must.

Responsibilities:

  • Implement data transformation, aggregation, and enrichment processes to support various data analytics and machine learning initiatives
  • Collaborate with cross-functional teams to understand data requirements and translate them into effective data engineering solutions
  • Design, develop, and implement Spark Scala applications and data processing pipelines to process large volumes of structured and unstructured data
  • Integrate Elasticsearch with Spark to enable efficient indexing, querying, and retrieval of data
  • Optimize and tune Spark jobs for performance and scalability, ensuring efficient data processing and indexing in Elasticsearch
  • Implement data transformations, aggregations, and computations using Spark RDDs, DataFrames, and Datasets, and integrate them with Elasticsearch
  • Develop and maintain scalable and fault-tolerant Spark applications, adhering to industry best practices and coding standards
  • Troubleshoot and resolve issues related to data processing, performance, and data quality in the Spark-Elasticsearch integration
  • Monitor and analyze job performance metrics, identify bottlenecks, and propose optimizations in both Spark and Elasticsearch components
  • Ensure data quality and integrity throughout the data processing lifecycle
  • Design and deploy data engineering solutions on OpenShift Container Platform (OCP) using containerization and orchestration techniques
  • Optimize data engineering workflows for containerized deployment and efficient resource utilization
  • Collaborate with DevOps teams to streamline deployment processes, implement CI/CD pipelines, and ensure platform stability
  • Implement data governance practices, data lineage, and metadata management to ensure data accuracy, traceability, and compliance
  • Monitor and optimize data pipeline performance, troubleshoot issues, and implement necessary enhancements
  • Implement monitoring and logging mechanisms to ensure the health, availability, and performance of the data infrastructure
  • Document data engineering processes, workflows, and infrastructure configurations for knowledge sharing and reference

Requirements:

  • Bachelor's or Master's degree in Computer Science, Software Engineering, or a related field
  • Must be a Quantexa-certified data engineer or data architect and proficient with the tool
  • Proven experience as a Data Engineer, working with Hadoop, Spark, and data processing technologies in large-scale environments
  • Proficiency in Scala programming language and familiarity with functional programming concepts
  • Experience with the Quantexa tool is highly preferred
  • In-depth understanding of Apache Spark architecture, RDDs, DataFrames, and Spark SQL
  • Strong expertise in designing and developing data infrastructure using Hadoop, Spark, and related tools (HDFS, Hive, Pig, etc.)
  • Experience with containerization platforms such as OpenShift Container Platform (OCP) and container orchestration using Kubernetes
  • Proficiency in languages and frameworks commonly used in data engineering, such as Spark, Python, Scala, or Java
  • Knowledge of DevOps practices, CI/CD pipelines, and infrastructure automation tools (e.g., Docker, Jenkins, Ansible, Bitbucket)
  • Experience with Grafana, Prometheus, or Splunk will be an added benefit
  • Experience integrating and working with Elasticsearch for data indexing and search applications
  • Solid understanding of Elasticsearch data modeling, indexing strategies, and query optimization
  • Experience with distributed computing, parallel processing, and working with large datasets
  • Proficient in performance tuning and optimization techniques for Spark applications and Elasticsearch queries
  • Strong problem-solving and analytical skills with the ability to debug and resolve complex issues
  • Familiarity with version control systems (e.g., Git) and collaborative development workflows
  • Excellent communication and teamwork skills with the ability to work effectively in cross-functional teams
  • Experience with cloud platforms (e.g., AWS, Azure, GCP) and their data services is a plus

Top Skills

Ansible
AWS
Azure
Bitbucket
DevOps
Docker
Elastic
Elasticsearch
GCP
Grafana
Hadoop
Jenkins
OpenShift Container Platform
Prometheus
Scala
Spark
Splunk

Unison Consulting Singapore Office

1 Changi Business Park Crescent, Plaza 8 #03-06 Tower A, Singapore 486025

Unison Consulting Singapore Office

#12-00, 63 Market Street, Bank of Singapore Center, Singapore 048942
