OKX

Senior/Staff Engineer - Stability Engineering Platform

Posted 5 Days Ago
Singapore
Senior level
As a Senior/Staff Engineer, you will enhance service stability through technology, implement SLO/SLA standards, and develop stability assurance platforms while fostering best practices and team learning.
The summary above was generated by AI

OKX will be prioritising applicants who have a current right to work in Singapore, and do not require OKX's sponsorship of a visa.

 

Who We Are

At OKX, we believe that the future will be reshaped by crypto, ultimately contributing to every individual's freedom. OKX is a leading crypto exchange and the developer of OKX Wallet, giving millions access to crypto trading and decentralized crypto applications (dApps). OKX is also trusted by hundreds of large institutions seeking access to crypto markets. We are safe and reliable, backed by our Proof of Reserves. Across our multiple offices globally, we are united by our core principles: We Before Me, Do the Right Thing, and Get Things Done. These shared values drive our culture, shape our processes, and foster a friendly, rewarding, and diverse environment for every OK-er. OKX is part of OKG, a group that brings the value of blockchain to users around the world through our leading products OKX, OKX Wallet, OKLink and more.

 

About The Team

The Service Stability Engineering Team envisions service stability as one of the core competitive strengths of the company's products. By building end-to-end, link-level risk management capabilities, the team aims to achieve sustainable automatic identification and analysis of stability risks, transforming from "reactive governance" to "proactive governance." This approach shifts more stability-related matters forward and addresses them early, preventing issues before they arise and enhancing user experience.

 

What You’ll Be Doing 

  • Conduct research on leveraging technology for rapid issue detection, root cause analysis, and fault recovery to achieve the 1-5-10 stability objective.
  • Define and implement SLO/SLA standards, ensuring business stability through a goal-driven approach.
  • Continuously develop and enhance stability assurance platforms, including inspection systems, root cause analysis frameworks, and risk management repositories, to improve the accuracy and efficiency of issue detection, diagnosis, and resolution.
  • Establish and enforce stability best practices, ensuring that product design and development adhere to stability principles.
  • Stay abreast of industry-leading technologies, foster team learning and capability building, and drive the adoption and iteration of advanced technological solutions as needed.


What We Look For In You 

  • Bachelor's degree or higher in Computer Science or a related field, with over seven years of experience in software development and architecture. 
  • Proficiency in Java and hands-on experience with the Spring Cloud microservices technology stack, with strong coding standards and algorithmic skills.
  • Skilled in data computation and analysis tools, including Flink, Elasticsearch, ClickHouse, SkyWalking, Prometheus/VictoriaMetrics, and Python.
  • Strong problem-identification, analytical, and resolution skills, with clear logical reasoning and a holistic architectural mindset.
  • Product-oriented mindset with a solid understanding of software development processes, failure analysis, and incident management, leveraging tools to enhance problem resolution.


Nice-To-Haves

  • Experience in foundational infrastructure and framework development.
  • Experience in RAG (Retrieval-Augmented Generation) and Agent development and optimization is a plus.
  • Experience in stability assurance, inspection systems, root cause diagnosis platforms, or chaos engineering practices is highly desirable.
  • Proficiency in speaking, reading and writing in both English and Mandarin to collaborate effectively with global and cross-functional team members.


Perks & Benefits

  • Competitive total compensation package

  • L&D programs and education subsidies for employees' growth and development

  • Various team building programs and company events

  • Wellness and meal allowances

  • Comprehensive healthcare schemes for employees and dependents 

  • More that we would love to tell you about along the way!

Top Skills

ClickHouse
Elasticsearch
Flink
Java
Prometheus
Python
SkyWalking
Spring Cloud
VictoriaMetrics
