Gcore

DevOps Engineer (AI Inference)

Posted Yesterday
In-Office or Remote
Hiring Remotely in Singapore, SGP
Mid level
Design, deploy, and maintain on-prem infrastructure for scalable AI inference workloads. Build GPU scheduling, model deployment pipelines, monitoring/observability, and collaborate with ML and platform teams to integrate inference runtimes and test performance at scale.
The summary above was generated by AI
Company Description

This position is available only under an employment (labor) agreement in Singapore.

The world’s digital experiences run on something invisible: the infrastructure and software that keep them fast, reliable, and secure. At Gcore, you’ll help design and deliver that foundation for an AI-driven world. 

We’re a global provider of infrastructure and software solutions for AI, cloud, network, and security, powering everything from real-time communication and streaming to enterprise AI and secure web applications. With 210+ edge locations, 50+ cloud regions, and thousands of GPUs, your work here can reach users and businesses across the globe. 

You’ll collaborate with leading technology partners such as Intel, NVIDIA, Dell, and Equinix, and work on platforms that power digital products used around the world. Our vision is simple: to connect the world to AI, anywhere, anytime. 

Want to work on technology that goes beyond a single product or industry? Join a global team of 550+ professionals building infrastructure and software that supports the entire digital ecosystem.

We are looking for a talented DevOps Engineer to join our AI Inference Operations Team.

Job Description

As a DevOps Engineer, you will be responsible for designing, deploying, and maintaining infrastructure and services that enable scalable and secure AI inference workloads on-premises.

What You Will Do

  • Design, develop, and maintain infrastructure for AI inference workloads, including GPU scheduling, model deployment pipelines, and data access patterns in on-prem environments
  • Build and manage monitoring and observability tools for AI inference platforms, including dashboards, alerts, and runbooks for model health and system performance
  • Collaborate with ML engineers and platform teams to design system architecture for AI workloads, integrate inference runtimes, and test performance at scale

Qualifications

What We're Looking For

  • Strong understanding of Kubernetes architecture, including CNI, CSI, operators, ingress/gateway, and control plane components.
  • Hands-on experience operating and troubleshooting production Kubernetes clusters.
  • Strong Linux and networking troubleshooting skills, including DNS, routing, firewalling, TLS, MTU, connectivity, and performance issues.
  • Ability to develop automation and operational tooling using Python, Go, or Bash.
  • Experience with Terraform, Ansible, or similar IaC/configuration management tools.
  • Experience with VictoriaMetrics/Grafana or similar monitoring, alerting, and troubleshooting tools.
  • Strong experience with Git-based workflows and CI/CD pipelines.

Preferred Qualifications

  • Familiarity with Cluster API or similar Kubernetes cluster lifecycle management technologies.
  • Hands-on operation or administration of Slurm clusters.
  • Knowledge of Argo CD, GitOps workflows, Helm, or Helmfile.
  • Background working with managed platforms, PaaS, or cloud services.
  • Exposure to bare metal, GPU, HPC, or other high-performance computing environments.

Nice to Have

  • Familiarity with the NVIDIA GPU stack, RDMA/InfiniBand, or high-performance networking.
  • Knowledge of OpenStack or similar cloud infrastructure platforms.
  • Hands-on experience developing Kubernetes operators or controllers.

Additional Information

Benefits 

At Gcore, we want you to do your best work and enjoy the journey. Our benefits are designed to support your growth, well-being, and life beyond work: 

  • Competitive compensation
  • Flexible working hours and hybrid or remote options, depending on your role 
  • Work from anywhere in the world for up to 45 days per year 
  • Private medical insurance for you and your family* 
  • Extra paid vacation and sick leave days* 
  • Support for life’s important moments and celebrations 
  • Language courses to help you connect and grow 
  • Modern, welcoming offices with snacks, drinks, and entertainment* 
  • Team sports and social activities* 

*Benefits may vary depending on your location. 

Equal Opportunity Employer 

We provide equal opportunity to all applicants without regard to race, color, religion, sex, sexual orientation, age, gender identity, gender expression, national origin, disability, or any other legally protected characteristic.

