The intern will conduct research on RL algorithms, design training infrastructures, and explore new RL paradigms for multimodal models.
Business UnitWhat the Role EntailsResponsibilities:
1. Conduct research on RL algorithms for multimodal models, including diffusion models for image, video, and 3D generation, autoregressive models for multimodal understanding, and potentially unified multimodal frameworks.
2. Design and develop RL infrastructure and reward modeling strategies to enable efficient large-scale training, improve training stability, and mitigate reward hacking and related failure modes.
3. Explore next-generation RL paradigms that more directly and effectively learn from environment feedback.Who We Look ForRequirements:
1. Currently enrolled as a PhD student in Computer Science or a closely related field.
2. Demonstrated strong research capability, with publications in top-tier conferences such as ICML, NeurIPS, ICLR, CVPR, ICCV, ECCV, SIGGRAPH.
3. Strong hands-on programming skills, with solid experience in deep learning system implementation, model training and inference optimization, CPU/GPU acceleration, and distributed training and inference.
4. Prior experience with diffusion models, autoregressive models, and/or text-to-image or text-to-video generation is highly preferred.
5. Participation in ACM/NOIP is a strong plus.Equal Employment Opportunity at Tencent
As an equal opportunity employer, we firmly believe that diverse voices fuel our innovation and allow us to better serve our users and the community. We foster an environment where every employee of Tencent feels supported and inspired to achieve individual and common goals.
Top Skills
Autoregressive Models
Cpu
Deep Learning
Diffusion Models
Distributed Training
Gpu
Reinforcement Learning
Similar Jobs
Gaming • Software • Metaverse
Conduct research on RL algorithms for multimodal models, design and develop RL infrastructure and strategies, and explore new RL paradigms.
Top Skills:
Autoregressive ModelsCpu/Gpu AccelerationDeep LearningDiffusion ModelsDistributed TrainingMultimodal ModelsReinforcement Learning
Gaming • Software • Metaverse
Conduct research on reinforcement learning (RL) algorithms for multimodal models, design RL infrastructure, and explore new RL paradigms.
Top Skills:
Cpu/Gpu AccelerationDeep LearningDistributed TrainingMultimodal ModelsReinforcement Learning
An Hour Ago
Artificial Intelligence • Hardware • Information Technology • Machine Learning
As a Package Development Engineer, support chip-package interaction assessments, collaborate with cross-functional teams, and execute integration activities. Assist in semiconductor packaging and quality issue investigations while learning technical skills under mentorship.
Top Skills:
DoeFmeaExcelMicrosoft PowerpointMicrosoft WordPackage DesignSemiconductor Manufacturing
What you need to know about the Singapore Tech Scene
The digital revolution has driven a constant demand for tech professionals across industries like software development, data analytics and cybersecurity. In Singapore, one of the largest cities in Southeast Asia, the demand for tech talent is so high that the government continues to invest millions into programs designed to develop a talent pipeline directly from universities while also scaling efforts in pre-employment training and mid-career upskilling to expand and elevate its workforce.

.jpeg)