Conduct research on RL algorithms for multimodal models, design and develop RL infrastructure and strategies, and explore new RL paradigms.
Business Unit
Technology Engineering Group (TEG) is responsible for supporting the company and its business groups on technology and operational platforms, as well as the construction and operation of R&D management and data centers. TEG provides users with a full range of customer services. As the operator of the largest networking, devices, and data center in Asia, TEG also leads the Tencent Technology Committee in strengthening infrastructure R&D through internal and distributed open source collaboration, constructing new platforms and supporting business innovation.
What the Role Entails
- Conduct research on RL algorithms for multimodal models, including diffusion models for image, video, and 3D generation, autoregressive models for multimodal understanding, and potentially unified multimodal frameworks.
- Design and develop RL infrastructure and reward modeling strategies to enable efficient large-scale training, improve training stability, and mitigate reward hacking and related failure modes.
- Explore next-generation RL paradigms that learn more directly and effectively from environment feedback.
- Currently enrolled as a PhD student in Computer Science or a closely related field.
- Demonstrated strong research capability, with publications in top-tier conferences such as ICML, NeurIPS, ICLR, CVPR, ICCV, ECCV, SIGGRAPH.
- Strong hands-on programming skills, with solid experience in deep learning system implementation, model training and inference optimization, CPU/GPU acceleration, and distributed training and inference.
- Prior experience with diffusion models, autoregressive models, and/or text-to-image or text-to-video generation is highly preferred.
- Participation in ACM/NOIP is a strong plus.
As an equal opportunity employer, we firmly believe that diverse voices fuel our innovation and allow us to better serve our users and the community. We foster an environment where every employee of Tencent feels supported and inspired to achieve individual and common goals.
Top Skills
Autoregressive Models
CPU/GPU Acceleration
Deep Learning
Diffusion Models
Distributed Training
Multimodal Models
Reinforcement Learning
