Bitdeer Group Logo

Bitdeer Group

Research Engineering, Inference

Posted Yesterday
Be an Early Applicant
In-Office
Singapore, SGP
Mid level
In-Office
Singapore, SGP
Mid level
The role focuses on ML systems research for optimizing large-scale inference, extending serving stacks, and improving AI inference infrastructure on GPUs.
The summary above was generated by AI

About Bitdeer:

Bitdeer is a world-leading technology company for Bitcoin mining and AI cloud.
Bitdeer is committed to providing comprehensive Bitcoin mining solutions for its customers. Apart from designing industry-leading ASIC chips and manufacturing mining rigs, the Group handles complex processes involved in computing across the value chain. This includes equipment procurement, transport logistics, datacenter design and construction, equipment management, and network and facility operations. Bitdeer also offers advanced cloud capabilities to customers with a high demand for artificial intelligence.
Headquartered in Singapore, Bitdeer operates globally with a diversified 3 GW energy portfolio, and deploys Bitcoin mining and HPC datacenters in the United States, Bhutan, Norway, Canada, Malaysia, and Ethiopia.

About Bitdeer AI Lab:

Bitdeer AI Lab is a frontier AI lab under Bitdeer, a global-leading computing power solutions provider. Guided by long-termism, we are committed to exploring the frontiers of artificial intelligence with the ambition, courage, and determination to build technologies that can truly change the world.

We believe that transformative breakthroughs in AI require both long-horizon thinking and relentless execution. Our mission is twofold: first, to effectively transform energy into intelligence; second, to push the limits of intelligence by rethinking AI systems and architectures that can learn more efficiently, reason more deeply, and scale more effectively.

Our vision is to create intelligence that learns more like humans do: efficiently, adaptively, and recursively, turning finite parameters and finite compute into unbounded potential. We pursue this work with a deep sense of purpose, believing that the most meaningful advances in AI will not only push the frontier of research, but also reshape the future of the world.

Our lab is equipped with thousands of cutting-edge GPUs dedicated to AI research, and we are committed to continuously investing in and expanding our computational infrastructure to support world-class research and engineering in artificial intelligence.

We are looking for exceptional talent to join us, helping build the inference foundation for frontier AI models.

    What you will be responsible for:

    This role is centered on ML systems research for large-scale inference optimization. You will focus on improving and extending SGLang- and vLLM-based serving stacks, optimizing inference for both open-source LLMs and reasoning models as well as future in-house models developed by Bitdeer AI Lab. Your work will span GPU kernel-level optimization, high-performance inter-node communication, and end-to-end serving system design on real production clusters of hundreds to thousands of GPUs, helping define the next generation of efficient AI inference infrastructure.

      How you will stand out:

      • Strong interest in ML systems research for large-scale inference optimization on hundreds to thousands of GPUs.
      • Hands-on experience with or strong familiarity with SGLang and vLLM, especially for large-scale model serving and inference optimization.
      • Understanding of inference optimization for open-source LLMs and reasoning models, with the ability to establish strong serving baselines for frontier model architectures.
      • Ability to work closely with research teams to develop efficient inference solutions for future in-house opensource models, and to translate new model ideas into practical deployment from an early stage.
      • Strong understanding of key inference metrics such as latency, throughput, stability, and GPU efficiency, and practical experience optimizing them through batching, scheduling, KV cache, memory usage, and runtime execution.
      • Ability to identify, analyze, and resolve bottlenecks in large-scale inference workloads through ML systems research, experimentation, and performance optimization.
      • Familiarity with distributed inference communication patterns and networking fundamentals, including tensor/pipeline parallelism communication, collective operations (e.g., NCCL), and awareness of interconnect topologies (NVLink, InfiniBand) and their impact on serving performance at scale.

      What you will experience working with us:

      • A culture that values authenticity and diversity of thoughts and backgrounds;
      • An inclusive and respectable environment with open workspaces and exciting start-up spirit;
      • Fast-growing company with the chance to network with industrial pioneers and enthusiasts;
      • Ability to contribute directly and make an impact on the future of the digital asset industry;
      • Involvement in new projects, developing processes/systems;
      • Personal accountability, autonomy, fast growth, and learning opportunities;
      • Attractive welfare benefits and developmental opportunities such as training and mentoring.

      Top Skills

      Gpus
      Infiniband
      Nccl
      Nvlink
      Sglang
      Vllm

      Bitdeer Group Singapore Office

      Singapore, Singapore

      Similar Jobs

      4 Hours Ago
      Remote or Hybrid
      Singapore, SGP
      Senior level
      Senior level
      Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
      As the CISO Solutions GTM for APAC, you will translate global CISO strategies into regional plans and facilitate cross-functional alignment. This includes activating regional teams, measuring KPIs, and representing the CISO persona at strategic meetings.
      Top Skills: CloudEnterprise TechnologySaaS
      4 Hours Ago
      Remote or Hybrid
      Singapore, SGP
      Senior level
      Senior level
      Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
      The Director leads pre-sales teams in Financial Services, Telecommunications, and Public Sector, driving business growth and team development in CRM solutions.
      Top Skills: Ai-Enhanced TechnologyCrm SolutionsServicenow Platform
      4 Hours Ago
      Remote or Hybrid
      Singapore, SGP
      Senior level
      Senior level
      Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
      The Solution Sales Executive will lead strategies for ServiceNow's ITAM and ITOM products, working with account teams and customers to drive digital transformations and sales solutions.
      Top Skills: AIIt Asset ManagementIt Operations ManagementPortfolio Management

      What you need to know about the Singapore Tech Scene

      The digital revolution has driven a constant demand for tech professionals across industries like software development, data analytics and cybersecurity. In Singapore, one of the largest cities in Southeast Asia, the demand for tech talent is so high that the government continues to invest millions into programs designed to develop a talent pipeline directly from universities while also scaling efforts in pre-employment training and mid-career upskilling to expand and elevate its workforce.

      Sign up now Access later

      Create Free Account

      Please log in or sign up to report this job.

      Create Free Account