Thought Machine Logo

Thought Machine

Site Reliability Engineer

Reposted 13 Days Ago
Be an Early Applicant
Hybrid
Singapore, SGP
Mid level
Hybrid
Singapore, SGP
Mid level
The Site Reliability Engineer at Thought Machine will maintain production systems, support product teams, implement disaster recovery strategies, and collaborate on engineering challenges to enhance the reliability of SaaS products.
The summary above was generated by AI

Thought Machine's mission is bold – to properly and permanently rid the world's banks of legacy technology. To achieve this, we have developed the foundations of modern banking through core and payments technology which run natively in the cloud. What we are attempting is hard and means we need great people working together to build great technology.

We have grown rapidly in the past few years – growing our team to more than 550 individuals across offices in London, New York, Singapore, Sydney and our newly established Engineering Hub in Lisbon. We have raised more than £500m in funding and our investors include Molten Ventures, Eurazeo, Intesa Sanpaolo, Temasek, Nyca Partners, JPMorgan Chase Strategic Investments, Standard Chartered Ventures, and more.

We have created a culture that enables our team to produce the best work in the industry while ensuring we have fun along the way. We're regularly cited as having a fantastic workplace culture and have been recognised by Sifted magazine as having one of the highest Glassdoor ratings for a UK fintech company and the industry's most generous employee share package. Named one of the world's most innovative fintechs by Global Finance Magazine, we were also recognised by the Financial Times as one of Europe's fastest-growing companies for two consecutive years—and a UK Best Employer for 2026.

Thought Machine’s Site Reliability Engineers are the guardians of mission-critical systems for the world's most influential financial institutions. As a member of our elite, globally distributed team, you'll be entrusted with running and maintaining the robust production infrastructure that powers our customers' cutting-edge Core Banking and Payments platforms. This is an opportunity to make a tangible impact on the global financial landscape while collaborating with brilliant minds to solve complex engineering challenges.

The team is deeply involved in tackling the technical challenges of executing Thought Machine’s growth ambitions - expect to be working with senior stakeholders in the organisation, our customers, and working on programmes and initiatives that are critical to the success of the company.

Duties:

  • Supporting the product engineering teams in building highly fault-tolerant, scalable applications by participating in design discussions, engaging in RFCs and code reviews.

  • Contributing to the execution of department strategies such as implementing disaster recovery, backup, redundancy, and capacity planning activities.

  • Participating in a global on-call rotation responsible for identifying and fixing bottlenecks in SaaS customer environments.

  • Regular maintenance of production systems that host Vault products.

  • Contributing to the evolution of our SaaS products by building features that foster exceptional reliability and an unparalleled user experience.

  • Implementing and testing DR strategies to ensure the highest level of resilience and fault tolerance of the platform.

  • Maintaining high-quality written documentation of assets, processes and runbooks that are used by the team in their day-to-day operations.

  • Collaborating effectively with team members, actively participating in knowledge sharing, and continuously growing your own technical understanding of Vault Products.

Requirements:

  • You have experience successfully delivering engineering tasks and projects with a focus on reliability and scalability.

  • You possess a good understanding of design patterns relevant to hosting and networking architectures.

  • You proactively champion product development, driven by a desire to build truly exceptional products, not just solve immediate challenges.

  • You have a strong background working in either Python, Golang or Java, having used one of these programming languages to build production level software.

  • You have experience working with Kubernetes or other container orchestration systems.

  • You have experience with automation/configuration management, e.g. Terraform, Puppet, Chef, Ansible.

  • You have a good understanding of one or more of the following areas: Database Administration, Networking, Observability Tools (such as Prometheus, Jaeger) or automation infrastructure.

  • You have solid experience working with either GCP or AWS.

Benefits:

  • Highly competitive salary

  • Bonus incentive

  • Healthcare

  • 25 days holiday and public holidays

  • Competitive maternity and paternity leave

  • $1,500 SGD per year flexible spend benefit

  • All the latest tech you need

  • A talented and experienced team as your colleagues

  • An environment where we encourage learning and progress

We actively hire candidates who demonstrate technical excellence in their field and welcome people of all ages and backgrounds, providing everyone with equal access to professional development. You are encouraged to apply even if your experience doesn't accurately match the job description. We also encourage applications from those with different abilities, including candidates with ADHD, autism, dyslexia or dyspraxia.

Thought Machine Singapore, Singapore, SGP Office

Raffles Place, Singapore, Singapore

Similar Jobs

2 Days Ago
In-Office
Singapore, SGP
Expert/Leader
Expert/Leader
Fintech
Lead APAC virtualization and SRE platform strategy, engineering governance, modernization, and 24x7 operations. Drive standardization, automation (Ansible, Terraform, PowerShell, IaC), observability (Dynatrace/Splunk), cloud integration (Azure/AWS), security, audit readiness, and migration of branch workloads to regional platforms while presenting roadmaps and risks to executive governance.
Top Skills: AnsibleAWSAzureCi/CdConverged Hyperconverged InfrastructureDynatraceEsxiHyper-VInfrastructure As Code (Iac)PowershellPowerstoreRed Hat OpenshiftSplunkTerraformVcfVmware Vsphere
10 Days Ago
In-Office
Singapore, SGP
Expert/Leader
Expert/Leader
Fintech • Information Technology • Software • Financial Services
Lead and manage global production support and SRE teams to improve reliability, reduce MTTR, enforce SOPs, automate manual toil, implement observability and SLIs/SLOs, drive root cause analysis, and coordinate incident/problem management and audit responses.
Top Skills: AixAlertingCloudJavaMonitoringObservabilityOpenshiftOracleRdbmsSlis/SlosUnix
10 Days Ago
In-Office
Singapore, SGP
Senior level
Senior level
Financial Services
Own and improve the production trading environment: monitor and troubleshoot large-scale trading systems and exchange connectivity, build DevOps tooling for deployment, configuration, and monitoring, coordinate incidents and changes with traders and risk teams, manage operational risk, document procedures, and mentor other SREs.
Top Skills: C++Configuration ManagementDevOpsEthernetLinuxLldpMonitoringMulticastPythonRoutingShell ScriptingVlan Tagging

What you need to know about the Singapore Tech Scene

The digital revolution has driven a constant demand for tech professionals across industries like software development, data analytics and cybersecurity. In Singapore, one of the largest cities in Southeast Asia, the demand for tech talent is so high that the government continues to invest millions into programs designed to develop a talent pipeline directly from universities while also scaling efforts in pre-employment training and mid-career upskilling to expand and elevate its workforce.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account