Jabil

Senior / Staff SLM & VLM Engineer — Post-Training, Tool Calling & Agents

Posted 9 Days Ago
In-Office
Singapore, SGP
Senior level
Lead R&D of Small Language Models and Vision-Language Models, focusing on training, optimization, tool calling systems, and data pipeline automation.
The summary above was generated by AI
At Jabil (NYSE: JBL), we are proud to be a trusted partner for the world's top brands, offering comprehensive engineering, supply chain, and manufacturing solutions. With 60 years of experience across industries and a vast network of over 100 sites worldwide, Jabil combines global reach with local expertise to deliver both scalable and customized solutions. Our commitment extends beyond business success as we strive to build sustainable processes that minimize environmental impact and foster vibrant and diverse communities around the globe.

 

Job Summary 

We are looking for a highly capable engineer/researcher to lead the R&D of Small Language Models (SLMs) and Vision-Language Models (VLMs) for edge / low-latency and cost-efficient production scenarios. You will own the continuous pretraining, supervised instruction tuning (SFT), and compression/distillation pipelines, and work closely with platform teams to deliver reliable, measurable improvements in inference efficiency, tool-use success rate, and overall model quality.  

Key Responsibilities 

1) SLM/VLM Training: Continuous Pretraining & Instruction Tuning (SFT) 

  • Conduct continuous pretraining and SFT for SLMs and VLMs to improve task performance and domain adaptation. 

  • Build reproducible training workflows in PyTorch, including data processing, training, evaluation, and model versioning.  

2) Compression, Distillation & Edge/Low-Latency Inference Optimization 

  • Design and implement efficient compression strategies for SLM/VLM, including knowledge distillation, pruning, and quantization-oriented training or post-training optimization. 

  • Optimize model serving and inference for low-latency / edge scenarios by improving throughput and cost-per-token via techniques such as quantization, caching/KV optimizations, batching strategies, and decoding-time optimizations.  

3) Tool Calling System: Catalog, Routing, Validation, Fallback & Observability 

  • Architect and implement a production-grade tool calling (function calling) framework covering:

      • Tool cataloging and metadata/schema design

      • Tool selection/routing and argument construction

      • Parameter validation, result verification, and safe fallback/retry strategies

      • Call-chain tracing, monitoring, and observability to improve success rate and ROI

4) RL & Reward Modeling for Alignment and Tool-Use Reliability 

  • Apply post-training methods such as PPO / DPO / GRPO-like optimization and reward modeling to align the model toward objectives including:

      • semantic understanding

      • tool-use success rate

      • content generation quality and consistency

  • Support both offline and online iteration loops, including policy evaluation, regression checks, and safe deployment gating.  

5) Data Pipeline Automation (Collection, Cleaning, Curation) 

  • Design automated pipelines for data collection, filtering, cleaning, de-duplication, labeling/weak supervision, and dataset version management to continuously improve training quality. 

  • Ensure datasets support both SFT and preference/RL style post-training.  

6) Rigorous Evaluation, Testing & Iteration 

  • Build robust evaluation mechanisms: offline benchmarks, task suites for tool-use, regression tests, and reliability metrics. 

  • Drive rapid iteration through A/B comparisons, ablations, and failure analysis, improving both quality and efficiency over time.  

Required Qualifications 

  • Strong software engineering skills in Python and C++, including experience building ML training/evaluation pipelines in PyTorch. 

  • Hands-on experience in model efficiency and inference optimization (e.g., distillation, quantization, pruning, serving optimization).  

  • Experience with high-performance computing and acceleration: CUDA and/or SIMD, profiling and performance tuning.  

  • Ability to read and reproduce key ideas from recent papers and implement algorithms with strong experimental discipline. 

  • Ability to communicate effectively in both Chinese (Mandarin) and English, as the successful candidate will liaise with our counterparts in China.

BE AWARE OF FRAUD: When applying for a job at Jabil, you will be contacted through our official job portal with a jabil.com e-mail address, by a direct phone call from a member of the Jabil team, or by direct e-mail from a jabil.com e-mail address. Jabil does not request payments for interviews or at any other point during the hiring process. Jabil will not ask for your personal identifying information such as a social security number, birth certificate, financial institution, driver’s license number or passport information over the phone or via e-mail. If you believe you are a victim of identity theft, contact your local police department. Any scam job listing should be reported to the website where it was posted.

Jabil, including its subsidiaries, is an equal opportunity employer and considers qualified applicants for employment without regard to race, color, religion, national origin, sex, sexual orientation, gender identity, age, disability, genetic information, veteran status, or any other characteristic protected by law.

 

Accessibility Accommodation  

If you are a qualified individual with a disability, you have the right to request a reasonable accommodation if you are unable or limited in your ability to use or access the Jabil.com/Careers site as a result of your disability. You can request a reasonable accommodation by sending an e-mail to [email protected] with the nature of your request and contact information. Please do not direct any other general employment-related questions to this e-mail address. Only inquiries concerning a request for reasonable accommodation will receive a response.

 

#whereyoubelong