The Senior Staff Engineer will manage hybrid cloud infrastructure on AWS and Alibaba Cloud, focusing on architecture, operations, cost optimization, security, and cross-team collaboration.
OKX will be prioritising applicants who have a current right to work in Singapore, and do not require OKX's sponsorship of a visa.
At OKX, we believe that the future will be reshaped by crypto, and ultimately contribute to every individual's freedom. OKX is a leading crypto exchange, and the developer of OKX Wallet, giving millions access to crypto trading and decentralized crypto applications (dApps). OKX is also a trusted brand by hundreds of large institutions seeking access to crypto markets. We are safe and reliable, backed by our Proof of Reserves. Across our multiple offices globally, we are united by our core principles: We Before Me, Do the Right Thing, and Get Things Done. These shared values drive our culture, shape our processes, and foster a friendly, rewarding, and diverse environment for every OK-er. OKX is part of OKG, a group that brings the value of Blockchain to users around the world, through our leading products OKX, OKX Wallet, OKLink and more.
We are currently seeking a Senior Staff Engineer to join our Singapore team. You be will responsible for the full lifecycle management of enterprise-level hybrid cloud infrastructure, leading unified orchestration, operations, and cost optimisation of AWS and Alibaba Cloud resources, ensuring high availability, high performance, and compliance.
- Cloud Platform Architecture & Operations
- Plan, deploy, monitor, and maintain AWS services (EC2, S3, VPC, Lambda, EKS, etc.) and Alibaba Cloud services (ECS, OSS, VPC, Function Compute, ACK, etc.).
- Design highly available, auto-scaling cloud architectures, optimizing network (e.g., Alibaba Cloud CEN, AWS Direct Connect), storage, and compute resource configurations.
- Monitoring & Incident Management
- Implement full-stack monitoring and alerting using cloud-native tools (AWS CloudWatch, Alibaba Cloud CloudMonitor) and open-source solutions (Prometheus+Grafana, ELK).
- Lead critical incident response, perform root cause analysis, and implement preventive measures (e.g., resource contention, misconfigurations, network latency).
- Cost Optimisation & Resource Management
- Analyse cloud resource usage, reduce costs via reserved instances, auto-scaling, and storage lifecycle policies (e.g., AWS S3 Intelligent-Tiering, Alibaba Cloud OSS Archive).
- Establish resource quota management strategies to prevent waste and overspending.
- Security & Compliance
- Implement cloud security baselines (security groups, IAM policies, Alibaba Cloud RAM permissions, AWS Security Hub), conduct regular security audits, and remediate vulnerabilities.
- Design granular access controls using AWS IAM and Alibaba Cloud RAM, and enforce database auditing (e.g., AWS CloudTrail + Alibaba Cloud DAS).
- Cross-Team Collaboration & Knowledge Sharing
- Collaborate with development teams to optimize application architectures and provide cloud-native solutions (Server-less, Microservices).
- Document operational procedures (SOP manuals) and lead internal technical training sessions.
- Technical Skills
- Mastery of core services (compute/storage/network/security) on AWS or Alibaba Cloud, with familiarity in the other platform.
- Proficient in Linux/Windows system operations and automation tools (Shell/Python/Ansible).
- Hands-on experience with containerized operations (Kubernetes, ECS/EKS, ACK) and cloud-native technologies (e.g., Service Mesh).
- Experience Requirements
- 5+ years of operations experience, with at least 3 years focused on public cloud (AWS/Alibaba Cloud) environments managing 100+ instances.
- Experience in building cloud platforms from scratch, hybrid cloud architecture design, or large-scale migration projects (e.g., IDC-to-cloud) is preferred.
- Soft Skills
- Strong problem-solving skills with the ability to handle high-pressure operational challenges.
- Excellent communication skills to collaborate with development, testing, and security teams.
- Certifications & Education
- AWS Certified SysOps Administrator or Alibaba Cloud ACP/ACE certifications are preferred.
- Bachelor’s degree or higher in Computer Science, Network Engineering, or related fields.
- Familiarity with multi-cloud management platforms (AWS, Alibaba Cloud, Azure) or FinOps cost optimisation methodologies.
- Experience in cloud security practices, including Web Application Firewall (WAF) and DDoS protection (Alibaba Cloud Anti-DDoS Premium, AWS Shield).
- Exposure to big data/AI operations (e.g., Alibaba Cloud MaxCompute, AWS EMR).
- Team leadership experience is preferred.
- Competitive total compensation package
- L&D programs and Education subsidy for employees' growth and development
- Various team building programs and company events
- Wellness and meal allowances
- Comprehensive healthcare schemes for employees and dependants
- More that we love to tell you along the process!
Information collected and processed as part of the recruitment process of any job application you choose to submit is subject to OKX's Candidate Privacy Notice.
Top Skills
Ack
Alibaba Cloud
Ansible
AWS
Cloudmonitor
Cloudwatch
Ec2
Ecs
Eks
Elk
Function Compute
Grafana
Kubernetes
Lambda
Oss
Prometheus
Python
S3
Shell
Vpc
Similar Jobs
Fintech • Mobile • Payments • Software • Financial Services
Oversee operational aspects of dispute management, focusing on automation initiatives to enhance efficiency, compliance, and resolution accuracy.
Top Skills:
Data AnalysisLeanProcess MappingSix Sigma
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
The Sales Operations Manager will be responsible for optimizing sales processes, analyzing pipeline data, and providing insights to drive business growth and customer success.
Top Skills:
Bi Reporting ToolsCrm SystemsPower BI
Fintech • Mobile • Payments • Software • Financial Services
The Payment Operations Reconciliation Senior Specialist performs bank account and payment reconciliations, resolves discrepancies, conducts month-end closing activities, and collaborates with various teams to enhance reconciliation processes.
Top Skills:
Excel
What you need to know about the Singapore Tech Scene
The digital revolution has driven a constant demand for tech professionals across industries like software development, data analytics and cybersecurity. In Singapore, one of the largest cities in Southeast Asia, the demand for tech talent is so high that the government continues to invest millions into programs designed to develop a talent pipeline directly from universities while also scaling efforts in pre-employment training and mid-career upskilling to expand and elevate its workforce.


