Lead the design and implementation of an enterprise operations automation platform that integrates AWS and Alibaba Cloud, with a focus on efficiency and reliability.
OKX will be prioritising applicants who have a current right to work in Singapore, and do not require OKX's sponsorship of a visa.
At OKX, we believe that crypto will reshape the future and ultimately contribute to every individual's freedom. OKX is a leading crypto exchange and the developer of OKX Wallet, giving millions access to crypto trading and decentralized crypto applications (dApps). OKX is also a brand trusted by hundreds of large institutions seeking access to crypto markets. We are safe and reliable, backed by our Proof of Reserves. Across our offices worldwide, we are united by our core principles: We Before Me, Do the Right Thing, and Get Things Done. These shared values drive our culture, shape our processes, and foster a friendly, rewarding, and diverse environment for every OK-er. OKX is part of OKG, a group that brings the value of blockchain to users around the world through its leading products, including OKX, OKX Wallet, and OKLink.
We are looking for a Senior Staff Engineer who will lead the design and implementation of an enterprise-level operations automation platform in a multi-cloud environment, integrating AWS and Alibaba Cloud resources to build a standardized, intelligent operational framework that enhances efficiency and reliability.
- Automation Platform Design & Development
  - Lead the architecture design of a cross-cloud (AWS/Alibaba Cloud) operations automation platform, covering core modules such as resource orchestration, monitoring/alerting, self-healing, and cost optimization.
  - Develop unified operational APIs and a visual console, integrating AWS SDK/Boto3 and Alibaba Cloud OpenAPI/SDK to standardize cross-cloud resource operations.
- Toolchain Integration & Optimization
  - Build end-to-end resource lifecycle management using IaC tools (Terraform, AWS CloudFormation, Alibaba Cloud ROS), enabling one-click environment provisioning and teardown.
  - Integrate CI/CD pipelines (GitLab, cloud-native toolchains) to automate application deployment, configuration changes, and database migrations.
- Intelligent Operations Capability Development
  - Design an automated operations rule engine, leveraging AI/ML (e.g., anomaly detection, root cause analysis) for predictive fault resolution (e.g., AWS Lambda + CloudWatch event-triggered remediation).
  - Build a knowledge base system to document SOPs and enable automated execution (e.g., Alibaba Cloud OOS).
- Multi-Cloud Coordination & Standardization
  - Design a unified operations model across AWS and Alibaba Cloud, abstracting common interfaces to address multi-cloud differences (e.g., aligning ECS and EC2 instance management strategies).
  - Establish operational standards and drive configuration standardization/automated validation across dev, test, and production environments.
- Security & Compliance Governance
  - Embed security baseline checks to automatically scan cloud configurations (e.g., security group rules, IAM policies, Alibaba Cloud RAM permissions) and generate compliance reports.
  - Automate approval workflows for sensitive operations (e.g., Alibaba Cloud ActionTrail and AWS CloudTrail log-triggered approval tickets).
- Cost Optimization Framework
  - Develop resource utilization analysis tools, leveraging AWS Cost Explorer and Alibaba Cloud Cost Management APIs to generate automated optimization recommendations (e.g., idle resource cleanup, scaling policy tuning).
  - Design FinOps automation solutions for budget alerts, cost allocation, and multi-dimensional cost visualization.
- Technical Skills
  - Proficient in at least one programming language (Python/Go/Java), with experience in large-scale operations platform development and familiarity with microservices architecture (Spring Cloud/Dubbo) and full-stack technologies.
  - Deep understanding of AWS and Alibaba Cloud core service APIs, cloud-native technologies (Serverless, K8s Operator), and DevOps toolchains (Ansible, Prometheus).
  - Skilled in automated testing frameworks to ensure platform stability.
- Experience Requirements
  - 5+ years in DevOps/operations development, with proven experience in designing and deploying enterprise-level automation platforms (e.g., CMDB, operations middleware).
  - Hands-on experience with AWS/Aliyun hybrid cloud automation tools, including cross-cloud resource synchronization and federated authentication (e.g., Alibaba Cloud RAM SSO, AWS IAM Identity Center).
- Soft Skills
  - Product-oriented mindset, capable of designing user-friendly and efficient operational features.
  - Strong cross-team collaboration skills to drive adoption of automation platforms across development, operations, and security teams.
- Certifications & Education
  - AWS Certified DevOps Engineer or Alibaba Cloud ACP/ACE (DevOps track) certifications preferred.
  - Bachelor’s degree or higher in Computer Science, Software Engineering, or related fields.
Perks & Benefits
- Competitive total compensation package
- Learning and development (L&D) programs and education subsidies to support employees' growth
- Various team building programs and company events
- Wellness and meal allowances
- Comprehensive healthcare schemes for employees and dependants
- More that we would love to tell you about along the way!
Top Skills
Alibaba Cloud
Alibaba Cloud ROS
Ansible
AWS
AWS CloudFormation
GitLab
Go
Java
Prometheus
Python
Terraform