Exa (exa.ai)

Software Engineer, Infrastructure

Reposted 3 Days Ago
In-Office
Singapore, SGP
Mid level
The role involves building large-scale infrastructure for a search engine, focusing on GPU clusters, Kubernetes orchestration, and optimizing performance across systems.
The summary above was generated by AI

Exa is building a search engine from scratch to serve every AI application. We build massive-scale infrastructure to crawl the web, train state-of-the-art embedding models to index it, and develop high-performance vector databases in Rust to search over it. We also own a $5M H200 GPU cluster that regularly lights up tens of thousands of machines.

Our Infrastructure Team builds the underlying tooling and infrastructure that powers all of Exa's systems. Basically, we need more infra engineers to build the machine that builds the machine, so that we can move as fast as possible as an engineering org. That could mean building GPU cluster orchestration in Kubernetes, map-reduce batch jobs on Ray, or the best observability tooling in the world.


Desired Experience

  • You have experience designing and operating large-scale infrastructure: GPU clusters, large Kubernetes clusters, or cloud batch job systems

  • You bring an obsessive mindset, always thinking about reliability, observability, and optimization across the entire stack

Example Projects

  • Build the Kubernetes orchestration on a $20M GPU cluster

  • Scale our AWS batch job system to handle map-reduce jobs across tens of thousands of machines

  • Design GPU scheduling software so we max out our cluster utilization

  • Build observability into our production systems

This is an in-person opportunity in Singapore. We’re happy to sponsor international candidates. In addition to premium healthcare benefits (medical, dental, vision), we also offer fertility benefits and a monthly wellness stipend to all of our employees.

Top Skills

AWS
GPU Clusters
Kubernetes
Ray
Rust


