Software Engineer, Infrastructure & DevOps
Train large-language models (LLMs) to write production-grade infrastructure and DevOps code
Help teach AI how to write, debug, and optimize infrastructure code like a top-tier DevOps engineer.
Compare and rank Terraform or IaC code snippets, explaining which is more reliable, efficient, or scalable.
Refactor or repair AI-generated infrastructure setups for correctness, security, and clarity.
Provide structured feedback (edits, test outcomes, architectural notes) that feeds into the RLHF pipeline.
End result: The model learns to reason about DevOps the way you do—smart, scalable, and safe.
RLHF in one line
Generate IaC or backend code ➜ expert engineers rank, edit, and explain ➜ convert feedback into reward signals ➜ reinforcement learning tunes the model to think like a real infra engineer.
What Is Needed
4+ years of professional software-engineering experience, ideally in backend, infrastructure, or DevOps.
Extreme attention to detail and excellent written communication—you’ll be writing a lot of thoughtful code reviews and architectural justifications.
Fluency with cloud infrastructure, especially AWS.
Hands-on experience with Terraform and Infrastructure as Code practices.
Strong instincts for performance, cost optimization, and security.
Comfortable in a low-oversight, async-first remote environment.
You enjoy reading documentation and specs—seriously.
What Is Not Needed
No prior RLHF or AI-training experience required.
No machine learning expertise needed—just deep infra/DevOps experience and clear thinking.
Tech Stack
We are especially looking for backend engineers working in Infra or Terraform-heavy environments.
Experience with any of the following is highly valued:
Terraform, AWS, GCP, or other cloud providers
Node.js or backend scripting
CI/CD pipelines
Infrastructure as Code (IaC) best practices
Logistics
Location: Fully remote (work from anywhere)
Hours: Minimum 15 hrs/week, can go up to 40 hrs/week
Engagement: 1099 contract