OKX

Senior Staff Engineer - Operations Automation (AWS/Alibaba)

Posted 22 Days Ago
Singapore
Senior level
Lead the design and implementation of an enterprise operations automation platform that integrates AWS and Alibaba Cloud, with a focus on efficiency and reliability.
The summary above was generated by AI
OKX will be prioritising applicants who have a current right to work in Singapore, and do not require OKX's sponsorship of a visa.
 
Who We Are
At OKX, we believe that the future will be reshaped by crypto, and ultimately contribute to every individual's freedom. OKX is a leading crypto exchange, and the developer of OKX Wallet, giving millions access to crypto trading and decentralized crypto applications (dApps). OKX is also a brand trusted by hundreds of large institutions seeking access to crypto markets. We are safe and reliable, backed by our Proof of Reserves. Across our multiple offices globally, we are united by our core principles: We Before Me, Do the Right Thing, and Get Things Done. These shared values drive our culture, shape our processes, and foster a friendly, rewarding, and diverse environment for every OK-er. OKX is part of OKG, a group that brings the value of blockchain to users around the world through our leading products: OKX, OKX Wallet, OKLink and more.
About the Opportunity
We are looking for a Senior Staff Engineer who will lead the design and implementation of an enterprise-level operations automation platform in a multi-cloud environment, integrating AWS and Alibaba Cloud resources to build a standardized, intelligent operational framework that enhances efficiency and reliability.
 
What You'll Be Doing
  1. Automation Platform Design & Development
    1. Lead the architecture design of a cross-cloud (AWS/Alibaba Cloud) operations automation platform, covering core modules such as resource orchestration, monitoring/alerting, self-healing, and cost optimization.
    2. Develop unified operational APIs and a visual console, integrating AWS SDK/Boto3 and Alibaba Cloud OpenAPI/SDK to standardize cross-cloud resource operations.
  2. Toolchain Integration & Optimization
    1. Build end-to-end resource lifecycle management using IaC tools (Terraform, AWS CloudFormation, Alibaba Cloud ROS), enabling one-click environment provisioning and teardown.
    2. Integrate CI/CD pipelines (GitLab, cloud-native toolchains) to automate application deployment, configuration changes, and database migrations.
  3. Intelligent Operations Capability Development
    1. Design an automated operations rule engine, leveraging AI/ML (e.g., anomaly detection, root cause analysis) for predictive fault resolution (e.g., AWS Lambda + CloudWatch event-triggered remediation).
    2. Build a knowledge base system to document SOPs and enable automated execution (e.g., Alibaba Cloud OOS).
  4. Multi-Cloud Coordination & Standardization
    1. Design a unified operations model across AWS and Alibaba Cloud, abstracting common interfaces to address multi-cloud differences (e.g., aligning ECS and EC2 instance management strategies).
    2. Establish operational standards and drive configuration standardization/automated validation across dev, test, and production environments.
  5. Security & Compliance Governance
    1. Embed security baseline checks to automatically scan cloud configurations (e.g., security group rules, IAM policies, Alibaba Cloud RAM permissions) and generate compliance reports.
    2. Automate approval workflows for sensitive operations (e.g., approval tickets triggered by Alibaba Cloud ActionTrail and AWS CloudTrail logs).
  6. Cost Optimization Framework
    1. Develop resource utilization analysis tools, leveraging AWS Cost Explorer and Alibaba Cloud Cost Management APIs to generate automated optimization recommendations (e.g., idle resource cleanup, scaling policy tuning).
    2. Design FinOps automation solutions for budget alerts, cost allocation, and multi-dimensional cost visualization.
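
As an illustration of the "unified operations model" described under Multi-Cloud Coordination & Standardization, the following is a minimal sketch only, not OKX's actual design. All class and function names (Instance, ComputeProvider, InMemoryProvider, stop_all_running) are hypothetical, and the in-memory adapter stands in for real adapters that would wrap Boto3 (EC2) and the Alibaba Cloud SDK (ECS). The lowercase and capitalized state strings are illustrative examples of a provider difference that an adapter would normalize.

```python
from __future__ import annotations

from abc import ABC, abstractmethod
from dataclasses import dataclass


@dataclass(frozen=True)
class Instance:
    """Provider-neutral view of a compute instance (hypothetical model)."""
    provider: str      # e.g. "aws" or "alibaba"
    instance_id: str
    state: str         # normalized to lowercase, e.g. "running" or "stopped"


class ComputeProvider(ABC):
    """Common interface that platform code targets (hypothetical)."""

    @abstractmethod
    def list_instances(self) -> list[Instance]: ...

    @abstractmethod
    def stop_instance(self, instance_id: str) -> Instance: ...


class InMemoryProvider(ComputeProvider):
    """Stand-in adapter backed by a dict instead of a real cloud SDK."""

    def __init__(self, provider: str, raw_states: dict[str, str]):
        self._provider = provider
        self._raw = dict(raw_states)  # instance_id -> provider-specific state

    def _to_instance(self, instance_id: str) -> Instance:
        # Normalize provider-specific casing into one shared vocabulary.
        state = self._raw[instance_id].lower()
        return Instance(self._provider, instance_id, state)

    def list_instances(self) -> list[Instance]:
        return [self._to_instance(i) for i in sorted(self._raw)]

    def stop_instance(self, instance_id: str) -> Instance:
        self._raw[instance_id] = "stopped"
        return self._to_instance(instance_id)


def stop_all_running(providers: list[ComputeProvider]) -> list[Instance]:
    """Apply one operational policy across every cloud via the shared interface."""
    stopped = []
    for provider in providers:
        for inst in provider.list_instances():
            if inst.state == "running":
                stopped.append(provider.stop_instance(inst.instance_id))
    return stopped
```

With this shape, a policy such as stop_all_running is written once against ComputeProvider, and each cloud's differences stay inside its own adapter.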
What We Look For In You
  1. Technical Skills
    1. Proficient in at least one programming language (Python/Go/Java), with experience in large-scale operations platform development and familiarity with microservices architecture (Spring Cloud/Dubbo) and full-stack technologies.
    2. Deep understanding of AWS and Alibaba Cloud core service APIs, cloud-native technologies (Serverless, K8s Operator), and DevOps toolchains (Ansible, Prometheus).
    3. Skilled in automated testing frameworks to ensure platform stability.
  2. Experience Requirements
    1. 5+ years in DevOps/operations development, with proven experience in designing and deploying enterprise-level automation platforms (e.g., CMDB, operations middleware).
    2. Hands-on experience with AWS/Alibaba Cloud hybrid cloud automation tools, including cross-cloud resource synchronization and federated authentication (e.g., Alibaba Cloud RAM SSO, AWS IAM Identity Center).
  3. Soft Skills
    1. Product-oriented mindset, capable of designing user-friendly and efficient operational features.
    2. Strong cross-team collaboration skills to drive adoption of automation platforms across development, operations, and security teams.
  4. Certifications & Education
    1. AWS Certified DevOps Engineer or Alibaba Cloud ACP/ACE (DevOps track) certifications preferred.
    2. Bachelor’s degree or higher in Computer Science, Software Engineering, or related fields.
Perks & Benefits
  • Competitive total compensation package
  • Learning and development (L&D) programs and education subsidies to support employees' growth
  • Various team building programs and company events
  • Wellness and meal allowances
  • Comprehensive healthcare schemes for employees and dependants
  • More that we would love to tell you about along the way!

Top Skills

Alibaba Cloud
Alibaba Cloud ROS
Ansible
AWS
AWS CloudFormation
GitLab
Go
Java
Prometheus
Python
Terraform
