GXS Bank

Senior Site Reliability Engineer

Posted Yesterday
Singapore
Senior level
The Senior Site Reliability Engineer will focus on building software platforms for the reliable and scalable management of Digibank services. Responsibilities include working with Kubernetes on AWS, using Infrastructure as Code with Terraform and Ansible, troubleshooting incidents, and driving the adoption of self-healing systems. The role emphasizes collaboration with development teams to enhance reliability and scale, plus performance and cost optimization for infrastructure.

About the Team:

Our team treats infrastructure and operations as software engineering problems. We are responsible for building and progressing software platforms that enable the provisioning and management of all Digibank services in safe, reliable, and scalable ways. We consistently challenge the status quo and use new technologies to build platforms and tooling for engineering teams. Join us and make significant decisions with a huge impact on building modern banking technology.


About the Role:

We treat infrastructure and operations as software engineering problems. Our mission is to build and progress software platforms that enable the provisioning and management of all Digibank services in safe, reliable, and scalable ways. We consistently challenge the status quo and use new technologies to build platforms and tooling for engineering teams. In this role, you will make significant decisions with a huge impact on building modern banking technology. You will be part of a team responsible for designing and architecting new solutions and for finding creative ways to optimize existing ones, improving agility in managing the infrastructure of hundreds of microservices in a stable and reliable way.

If you are:

  • A strong believer in automating DevOps and SRE concerns such as infrastructure provisioning, deployment, observability, the incident lifecycle, and uptime SLAs

  • Bold enough to challenge, open to being challenged, and curious to learn and grow

This is the right place for you!

Roles and Responsibilities:

  • Working with Kubernetes clusters hosted in AWS

  • Using Infrastructure as Code tooling such as Terraform and Ansible to manage AWS, Azure, and Kubernetes resources

  • Engage with the development teams throughout the life cycle to help develop software for reliability and scale, and coach teams on SRE best practices

  • Troubleshoot priority incidents, facilitate blameless post-mortems and ensure permanent closure of incidents

  • Perform analytics on previous incidents and usage patterns to better predict issues and take proactive actions

  • Build and drive adoption for greater self-healing and resiliency patterns

  • Design automated software and product upgrades, change management, and release management solutions

  • Design, code, test and deliver software to automate manual operational work. Own your tools and services end to end.

  • Performance and cost optimization for infrastructure

  • Be part of an on-call rotation for the team’s tooling and 24x7 support coverage as needed

  • Succeed, fail, and learn together with other talented people. We believe in an environment that provides opportunities for growth, and we see learning as an outcome of failure that brings us closer to the next breakthrough.

Qualifications:

  • Bachelor's degree in information systems, information technology, computer science, or similar.

  • 5-7+ years of professional experience.

  • Experience administering Kubernetes clusters.

  • Experience managing Infrastructure as Code using Terraform.

  • Direct production operations experience in a cloud environment.

  • Experience contributing to technology and product strategy.

  • Experience leading capability-building initiatives across diverse areas such as infrastructure and operations automation, observability, incident management, architecting HA systems, and other core engineering.

  • Demonstrated experience in driving operational efficiency and transparency of a growing engineering organization.

Top Skills

AWS
Kubernetes
Terraform

GXS Bank Singapore Office

Singapore, Singapore


