Exa (exa.ai)

Software Engineer, Infrastructure

Reposted 15 Hours Ago
In-Office
Singapore
Mid level
The role involves building large-scale infrastructure for a search engine, focusing on GPU clusters, Kubernetes orchestration, and optimizing performance across systems.
The summary above was generated by AI

Exa is building a search engine from scratch to serve every AI application. We build massive-scale infrastructure to crawl the web, train state-of-the-art embedding models to index it, and develop highly performant vector databases in Rust to search over it. We also own a $5M H200 GPU cluster that regularly lights up tens of thousands of machines.

Our Infrastructure Team builds the underlying tooling and infrastructure that power all of Exa's systems. Basically, we need more infra engineers to build the machine that builds the machine so that we can move as fast as possible as an engineering org. That could mean building GPU cluster orchestration in Kubernetes, map-reduce batch jobs on Ray, or the best observability tooling in the world.


Desired Experience

  • You have experience designing and operating large-scale infrastructure: GPU clusters, large Kubernetes clusters, or cloud batch job systems.

  • You bring an obsessive mindset, always thinking about reliability, observability, and optimization across the entire stack.

Example Projects

  • Build the Kubernetes orchestration on a $20M GPU cluster

  • Scale our AWS batch job system to handle map-reduce jobs over tens of thousands of machines

  • Design GPU scheduling software so we max out our cluster utilization

  • Build observability into our production systems

This is an in-person opportunity in Singapore. We’re happy to sponsor international candidates.

Top Skills

AWS
GPU Clusters
Kubernetes
Ray
Rust
