The role involves developing and optimizing large-scale language models, deploying solutions, and engaging in advanced research techniques for improvements. Collaboration across teams and staying updated with industry advancements are also key responsibilities.
Responsibilities
- Engage in the development and optimization of large-scale pre-training language models, including model architecture design, parallel training strategies, and performance improvements.
- Drive research and implementation of advanced LLM post-training techniques, including chain-of-thought tuning, preference alignment, and RL for reasoning.
- Develop and optimize data collection pipelines for model training, including data de-duplication, cleaning, and verification.
- Design and implement solutions for model deployment, including inference optimization and scaling strategies.
- Collaborate with cross-functional teams to apply LLM capabilities in various business scenarios, such as materials science.
- Stay current with the latest developments in the field and contribute to the company's technical roadmap.
Qualifications
- Master's or Ph.D. in Computer Science, AI, or related field.
- 5+ years of experience in machine learning, with specific focus on NLP and LLMs.
- Strong understanding of transformer architectures and modern LLM frameworks(BERT, GPT, T5).
- Extensive experience with deep learning frameworks (PyTorch, TensorFlow, JAX).
- Strong programming skills in Python and proficiency with ML tools (Hugging Face, DeepSpeed).
- Proven track record in training and optimizing large-scale language models (10B+ parameters) is preferred.
- Experience with distributed training systems (Megatron) and optimization techniques is preferred.
Top Skills
Deepspeed
Hugging Face
Jax
Megatron
Python
PyTorch
TensorFlow
Patsnap Singapore Office
47 Scotts Road - Goldbell Towers, #11-03, Singapore, 228233
Similar Jobs
Cloud • Information Technology • Security • Software
Conduct research and develop algorithms for AI and ML projects, enhancing software tools for performance and reliability.
Top Skills:
AIMlSoftware Engineering
Cloud • Information Technology • Security • Software • Cybersecurity
Responsible for building and expanding the global network, collaborating with various teams for datacenter management and ensuring effective processes are implemented for infrastructure operations.
Top Skills:
AnsibleApacheArista EosBgpChefCisco IosCisco Nx-OsCwdmDwdmHaproxyJuniper JunosLinuxNginxPuppetSaltstackVarnish
Financial Services
As an Infrastructure Engineer III, you will manage Oracle databases, resolve issues, and improve systems and applications while supporting the database team.
Top Skills:
AWSGrafanaOraclePythonSplunkSQL
What you need to know about the Singapore Tech Scene
The digital revolution has driven a constant demand for tech professionals across industries like software development, data analytics and cybersecurity. In Singapore, one of the largest cities in Southeast Asia, the demand for tech talent is so high that the government continues to invest millions into programs designed to develop a talent pipeline directly from universities while also scaling efforts in pre-employment training and mid-career upskilling to expand and elevate its workforce.