The role involves post-training LLMs, aligning models for function calling and tool use, operating Model Context Protocol (MCP) servers for checkpoint routing, and building evaluation pipelines.
Job Responsibilities
1. Perform advanced post-training of large language models (e.g., SFT, RLHF/RLAIF, continual pretraining).
2. Align models for reliable JSON-schema function calls and external tool usage.
3. Design, deploy, and operate Model Context Protocol (MCP) servers that handle checkpoint routing, manage context windows, and enforce safety gates.
4. Run distributed training and inference with DeepSpeed/FSDP, LoRA/QLoRA, and mixed precision, and tune performance on vLLM or Triton clusters.
5. Build offline and live evaluation pipelines covering alignment, factuality, grounding, and hallucination.
Qualifications
1. Bachelor’s or Master’s degree in Computer Science, Artificial Intelligence, Machine Learning, or a related field.
2. 3+ years of experience in developing and optimizing large language models.
3. Proven track record in implementing advanced post-training techniques (SFT, RLHF, RLAIF, continual pretraining).
4. Hands-on experience with distributed training frameworks (DeepSpeed, FSDP) and optimization techniques (LoRA, QLoRA, mixed precision).
5. Familiarity with model alignment, JSON-schema function calls, and external tool integration.
6. Experience in building and maintaining evaluation pipelines for model performance assessment.
7. Proficiency in Python and relevant machine learning frameworks (e.g., PyTorch, TensorFlow).
8. Strong understanding of distributed systems and high-performance computing.
9. Experience with model deployment and inference optimization on vLLM or Triton clusters.
10. Knowledge of JSON-schema and API development.
Top Skills
DeepSpeed
FSDP
LoRA
Python
PyTorch
QLoRA
TensorFlow
Triton
vLLM