Bitdeer Group Logo

Bitdeer Group

LLM Data Strategy Expert (Annealing / SFT)

Reposted An Hour Ago
Be an Early Applicant
In-Office
Singapore, SGP
Mid level
In-Office
Singapore, SGP
Mid level
The role focuses on building data foundations for LLMs, overseeing data pipelines, sourcing, cleaning, validating, and optimizing data for AI models.
The summary above was generated by AI

About Bitdeer:

Bitdeer is a world-leading technology company for Bitcoin mining and AI cloud.
Bitdeer is committed to providing comprehensive Bitcoin mining solutions for its customers. Apart from designing industry-leading ASIC chips and manufacturing mining rigs, the Group handles complex processes involved in computing across the value chain. This includes equipment procurement, transport logistics, datacenter design and construction, equipment management, and network and facility operations. Bitdeer also offers advanced cloud capabilities to customers with a high demand for artificial intelligence.
Headquartered in Singapore, Bitdeer operates globally with a diversified 3 GW energy portfolio, and deploys Bitcoin mining and HPC datacenters in the United States, Bhutan, Norway, Canada, Malaysia, and Ethiopia.

About Bitdeer AI Lab

Bitdeer AI Lab is a frontier AI lab under Bitdeer, a global-leading computing power solutions provider. Guided by long-termism, we are committed to exploring the frontiers of artificial intelligence with the ambition, courage, and determination to build technologies that can truly change the world.

We believe that transformative breakthroughs in AI require both long-horizon thinking and relentless execution. Our mission is twofold: first, to effectively transform energy into intelligence; second, to push the limits of intelligence by rethinking AI systems and architectures that can learn more efficiently, reason more deeply, and scale more effectively.

Our vision is to create intelligence that learns more like humans do: efficiently, adaptively, and recursively, turning finite parameters and finite compute into unbounded potential. We pursue this work with a deep sense of purpose, believing that the most meaningful advances in AI will not only push the frontier of research, but also reshape the future of the world.

Our lab is equipped with thousands of cutting-edge GPUs dedicated to AI research, and we are committed to continuously investing in and expanding our computational infrastructure to support world-class research and engineering in artificial intelligence.

What you will be responsible for:

While pre-training defines the knowledge boundaries of a model, Annealing and SFT determine its capability ceiling. The core mission of this role is to orchestrate the fine-grained reshaping of model capabilities through the design of Task Mixtures and Curriculum Learning, and to establish causal attribution between data decisions and training dynamics.

Responsibilities:

  • SFT Architecture Design: Define Task Mixtures and difficulty gradients (Curriculum) across reasoning, coding, and Agent domains. Proactively account for the impact of SFT on the signal-to-noise ratio in RLHF/DPO to prevent capability overfitting.
  • Annealing Strategy: Design the data mixture for the annealing phase, optimizing the proportion of high-density task data. Balance generalizability with specialized capabilities and quantitatively monitor Capability Drift.
  • Synthetic Data Amplification: Build synthetic data pipelines for verifiable domains (e.g., Math/Code). Design execution-based verifiers to drive the scaling of model capabilities.
  • Closed-Loop Attribution System: Establish a robust feedback loop: Data Distribution → Training Dynamics → Model Behavior. Accurately pinpoint data noise or distribution collapse from Eval Regressions.

How you will stand out:

  • Hands-on Expertise: Proven experience in designing SFT Mixtures, with a deep understanding of how data distributions impact surface-level model behaviors.
  • Data Intuition: Sharp intuition for identifying Reasoning Hallucinations and Distribution Collapse; ability to reverse-engineer data mixture issues from loss spikes/fluctuations.
  • Engineering Foundation: Proficiency in large-scale data processing and inference pipelines (e.g., vLLM, SGLang, Ray).
  • Bonus Points: Proven track record in implementing Annealing strategies, practical insights into DPO data evolution, or in-depth research in Data-centric AI.

What you will experience working with us:

  • A culture that values authenticity and diversity of thoughts and backgrounds;
  • An inclusive and respectable environment with open workspaces and exciting start-up spirit;
  • Fast-growing company with the chance to network with industrial pioneers and enthusiasts;
  • Ability to contribute directly and make an impact on the future of the digital asset industry;
  • Involvement in new projects, developing processes/systems;
  • Personal accountability, autonomy, fast growth, and learning opportunities;
  • Attractive welfare benefits and developmental opportunities such as training and mentoring.

Bitdeer Group Singapore, Singapore, SGP Office

Singapore, Singapore, Singapore

Similar Jobs

Junior
eCommerce • Fashion • Retail • Sales • Wearables • Design
The analyst will manage lease accounting operations, ensure compliance with accounting standards, assist in tax filings, and support audits. They will also implement efficiency improvements within accounting processes.
Top Skills: Ai-Enabled ProcessesAutomation ToolsIfrsMS OfficeSAPUs Gaap
An Hour Ago
In-Office or Remote
Singapore, SGP
Senior level
Senior level
Artificial Intelligence • Fintech • Payments • Business Intelligence • Financial Services • Generative AI
As a Staff Product Designer, you'll develop and maintain design systems, conduct component audits, ensure team alignment, advocate for design system usage, and enhance design architecture based on feedback and trends.
Top Skills: CSSFigmaHTML
An Hour Ago
In-Office or Remote
Singapore, SGP
Senior level
Senior level
Artificial Intelligence • Fintech • Payments • Business Intelligence • Financial Services • Generative AI
Lead analytics for regulatory reports, standardize data usage, build test frameworks, analyze data anomalies, and mentor colleagues in compliance domains.
Top Skills: LookerPythonRShellSQLTableau

What you need to know about the Singapore Tech Scene

The digital revolution has driven a constant demand for tech professionals across industries like software development, data analytics and cybersecurity. In Singapore, one of the largest cities in Southeast Asia, the demand for tech talent is so high that the government continues to invest millions into programs designed to develop a talent pipeline directly from universities while also scaling efforts in pre-employment training and mid-career upskilling to expand and elevate its workforce.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account