HUD (YC W25) does RL and evaluations for frontier AI agents. Our HUD agentic RL platform is used by frontier labs, Fortune 500 companies and startups. We have grown revenue and raised funding from YC, A16Z and other leading VCs to scale fast.
About the roleWe're looking for an experienced senior infrastructure engineer to help build our platform for RL at scale.
ResponsibilitiesBuild out HUD's existing RL and evaluations framework
Optimize our RL and evaluation infrastructure at frontier scale
Technical Skills
Experience with AWS, Kubernetes, Docker, Redis, Linux, Python, PostgreSQL
Systems design, performance security, CI/CD management experience preferred
You may be a good fit if you:
Have hands-on experience with scalable infrastructure design and implementation
Have contributed to large-scale system architecture projects
Built reliable, high-performance distributed systems
Worked with containerized applications and orchestration platforms
Strong candidates may have:
Startup experience in early-stage technology companies with ability to work independently in fast-paced environments
Strong communication skills for remote collaboration across time zones
Deep familiarity with current AI tools and LLM capabilities
Understanding of LLM evaluation/RL frameworks and methodologies
Evidence of rapid learning and adaptability in technical environments
We prioritize technical aptitude and learning potential over years of experience. Motivated candidates are encouraged to apply even if they don't meet all criteria.
Team & Company DetailsTeam Size: ~15 people currently, mostly full-time in-person, but some remote.
Our team: Our team includes 4 international Olympiad medallists (IOI, ILO, IPhO), serial AI startup founders, and researchers with publications at ICLR, NeurIPS etc
Company stage: We have received tens of millions in venture funding, plus very strong demand and revenue growth beyond that. We are scaling profitably and fast to meet demand.
Employment: Fulltime.
Location: On-site only, for now. You can join the team in the San Francisco Bay Area or Singapore offices.
Visa Sponsorship: We provide support for relocation and visas for strong full-time candidates to USA or Singapore.
Timeline: Applications are rolling. The process should involve 2 technical interviews and a 1-week work trial.
You will have unlimited access* to API credits for leading providers like OpenAI, Anthropic, Gemini, Cursor etc. *By unlimited, we mean no one on our token usage leaderboard has ever hit a limit. So we have no idea what the limit is.
Due to high volume, we may not actively respond to every application, but feel free to contact us at [email protected] or elsewhere if we missed your application!

