Exa (exa.ai)

Software Engineer, Infrastructure

Posted Yesterday
In-Office
Singapore
Mid level
The role involves building large-scale infrastructure for a search engine, focusing on GPU clusters, Kubernetes orchestration, and optimizing performance across systems.
The summary above was generated by AI

Exa is building a search engine from scratch to serve every AI application. We build massive-scale infrastructure to crawl the web, train state-of-the-art embedding models to index it, and develop high-performance vector databases in Rust to search over it. We also own a $5m H200 GPU cluster and routinely run batch jobs on tens of thousands of machines. This isn't your average startup :)

Our Infrastructure Team builds the underlying tooling and infrastructure that powers all of Exa's systems. We need more infra engineers to build the machine that builds the machine, so that we can move as fast as possible as an engineering org. That could mean building GPU cluster orchestration in Kubernetes, map-reduce batch jobs on Ray, or the best observability tooling in the world.

Desired Experience

  • You have experience designing and operating large-scale infrastructure: GPU clusters, large Kubernetes clusters, or cloud batch job systems.

  • You bring an obsessive mindset, always thinking about reliability, observability, and optimization across the entire stack.

Example Projects

  • Build the Kubernetes orchestration on a $20m GPU cluster

  • Scale our AWS batch job system to handle map-reduce jobs across tens of thousands of machines

  • Design GPU scheduling software so we max out our cluster utilization

  • Build observability into our production systems

This is an in-person opportunity in Singapore.

Top Skills

AWS
GPU Clusters
Kubernetes
Ray
Rust


