Design, build, and operate highly available on-prem/cloud infrastructure using IaC (Terraform/Ansible), implement networking, storage, VMs, security and DR, instrument telemetry, define SRE metrics (SLO/SLI), troubleshoot incidents, and perform capacity planning while collaborating with developers and stakeholders.
You will be part of a dynamic team responsible for exploring, designing, managing and optimising our on-premised cloud infrastructure platforms and services. You will be working with a team of cloud infrastructure engineers, responsible for implementing cloud networking, storages, virtual machines and infra security solutions. You must have a good understanding of cloud infra technologies, architecture and site reliability engineering (SRE).
Responsibilities
- Design, develop and deploy a highly available, reliable and scalable cloud infrastructure platform and services. The job scopes involve implementing effective infra solutions for cloud networks, storages, virtual machines, infra security and disaster recovery.
- Develop and maintain infrastructure using infra-as-code tools like Terraform or Ansible to ensure repeatable, automated and version-controlled deployments.
- Build in-systems telemetry to analyse and optimise their performance and reliability.
- Implement security measures to ensure infrastructure meets organisation security standards and compliance.
- Troubleshoot cloud infrastructure incidents to identify root cause and implement resolutions.
- Define, implement and track SRE metrics, including SLO, SLI and error budgets to improve cloud systems reliability.
- Collaborate with developers and infra stakeholders to understand their needs.
- Perform capacity planning to ensure that the infrastructure is scalable for future demands.
Requirements (Minimum Qualifications)
- Background in Computer Science, Computer or Electrical Engineering, Information Technology or a STEM qualification with relevant experience
- Knowledge in IT Infrastructure (i.e. networks, systems, storage) and Infra Operations
- Proficient in infra-as-code and scripting tools like Ansible, Terraform, Linux shell scripts, Powershell, etc.
- Understanding of public cloud services like AWS, Azure or GCP.
- System operations and administration experience on enterprise systems.
- At least 2 years of relevant working experience, including scripting and programming.
Nice-to-haves
- Cloud certifications, such as AWS, GCP certificates.
- Familiar with metrics/logging systems such as Elastic Stack and Prometheus/Grafana.
As CSIT is an agency under the Ministry of Defence (Singapore), only Singapore Citizens will be considered.
Centre for Strategic Infocomm Technologies Singapore Office
Similar Jobs
Information Technology • Security • Consulting • Cybersecurity
The IAM Engineer will design and manage identity solutions, configure identity brokering products, implement authentication protocols, and automate tasks in a Microsoft environment.
Top Skills:
AdfsAWSAzureCi/CdKerberosMicrosoft Active DirectoryMicrosoft EntraMicrosoft Graph ApiOauthPkiPowershellPythonSAML
Financial Services
Manage and oversee network infrastructure and cloud environments, ensuring compliance with security standards, while providing support and project management across teams.
Top Skills:
APIsCloud ServicesData CentreFirewallsLanLoad BalancersRoutersSwitchesVMwareWan
Artificial Intelligence • Hardware • Information Technology • Machine Learning
Lead a team to design and implement CMOS device solutions for NAND products, improving yield and reliability while managing cross-department collaborations and fostering team development.
Top Skills:
AICmosNand
What you need to know about the Singapore Tech Scene
The digital revolution has driven a constant demand for tech professionals across industries like software development, data analytics and cybersecurity. In Singapore, one of the largest cities in Southeast Asia, the demand for tech talent is so high that the government continues to invest millions into programs designed to develop a talent pipeline directly from universities while also scaling efforts in pre-employment training and mid-career upskilling to expand and elevate its workforce.



.jpeg)