Lead end-to-end LLM pipeline for customer service scheduling: data prep, prompt design, RAG systems, multi-agent architectures, multi-GPU deployment, evaluation pipelines, and chatbot integration to improve model quality and decision-making.
Binance is a leading global blockchain ecosystem behind the world’s largest cryptocurrency exchange by trading volume and registered users. We are trusted by over 250 million people in 100+ countries for our industry-leading security, user fund transparency, trading engine speed, deep liquidity, and an unmatched portfolio of digital-asset products. Binance offerings range from trading and finance to education, research, payments, institutional services, Web3 features, and more. We leverage the power of digital assets and blockchain to build an inclusive financial ecosystem to advance the freedom of money and improve financial access for people around the world.
We are seeking a highly skilled professional to join our team, focusing on advancing customer service scheduling optimization through innovative AI solutions. This role involves researching and implementing cutting-edge algorithms to enhance scheduling systems, leveraging business domain knowledge to elevate the impact of AI products. The successful candidate will develop and refine Large Language Models (LLMs) to extract actionable insights, improve business decision-making, and optimize prompt design for more accurate outputs. Additionally, the role includes creating scalable and robust LLM/RAG frameworks tailored to customer service scheduling, fostering innovation and maintaining a competitive market edge.
Responsibilities:
- Own the full LLM pipeline from data preparation to production real case usage.
- Design, iterate and optimize prompts (zero-/few-shot, chain-of-thought, tool-calling, etc.) to maximize model utility and safety across products and languages.
- Build and maintain Retrieval-Augmented Generation (RAG) QA/search systems that connect to multi-source knowledge bases.
- Familiar with vLLM/SGLang inference architectures and have proven experience deploying and operating LLM services on multi‑GPU or cluster environments.
- Design, implement and operate multi‑agent LLM architectures (e.g. LangGraph, CrewAI, AutoGen) including task decomposition, agent orchestration, memory sharing and tool‑calling workflows.
- Develop evaluation pipelines (automatic metrics & human feedback) to measure prompt and model quality, bias, and hallucination rates.
- Collaborate with product and CS teams to integrate AI models into conversational Chatbot in different scenarios.
- Track cutting-edge research, author tech blogs, and keep improve current architecture.
Qualifications:
- Master’s degree or higher in Computer Science, Data Science or related field..
- 2+ years of deep-learning/NLP experience, including 1+ year practical LLM work (SFT, DPO, RAG, quantization, inference optimization, etc.).
- Demonstrated prompt engineering & tuning expertise (few-shot design, structured prompting, prefix-/p-tuning, reward re-ranking, safety filtering).
- Practical experience building and deploying multi‑agent LLM workflows, with understanding of agent‑orchestrator patterns, shared memory, long‑horizon planning and guard‑rail design.
- Clean coding practices, good English communication skills, and a passion for rapid learning.
- Excellent self-driven and ownership with good deliverables.
- Eager to learn, be curious about AI new technologies
- Good communication and collaboration skills
Similar Jobs
Artificial Intelligence • Marketing Tech • Sales • Software
Lead and scale the engineering organization for a regulated digital-asset custody platform. Own hiring, org design, engineering operations (on-call, incidents, release hygiene), delivery against roadmaps, audit and regulator readiness (SOC 2, SAMA, ISO 27001), and cross-functional alignment. Amplify an existing small, specialized team to meet regulatory and institutional requirements while preserving culture and retention.
Top Skills:
Aurora PostgresAWSBitcoinClickhouseEthereumGithub ActionsGoHsmKubernetesMpc/TssRustSolanaTemporalTerraformThreshold Signing ProtocolsTypescript
Artificial Intelligence • Marketing Tech • Sales • Software
Own the cryptographic core: select and evolve MPC/TSS schemes, author protocol reviews, implement and review Rust/Go production crypto, lead audit responses, prepare incident fixes, publish and represent externally, and teach/mentor engineers while scaling the crypto function.
Top Skills:
Bls ThresholdCggmp21Dkls23FrostGg18Gg20GoLattice-Based ThresholdMpcMulti-Party-EcdsaRaccoonRustSparkleTss
Artificial Intelligence • Marketing Tech • Sales • Software
Build and operate production custody systems: signing service, key management, chain integrations, and transaction pipelines. Own end-to-end subsystems, integrate new chains, collaborate with cryptography on threshold signing and key recovery, write post-mortems, and produce clear async design docs and PRs.
Top Skills:
Account Abstraction)Aurora PostgresAWSBitcoin (UtxoCggmp21ClickhouseCloudhsmCmp)Eip-1559Ethereum (Eip-712FrostGg20Github ActionsGoKubernetesLedger VaultMoveMpcPostgresPsbt)RustRust (For Chain Integration)SolanaSolidityTemporalTerraformThreshold Signing (Gg18TssTypescriptYubihsm
What you need to know about the Singapore Tech Scene
The digital revolution has driven a constant demand for tech professionals across industries like software development, data analytics and cybersecurity. In Singapore, one of the largest cities in Southeast Asia, the demand for tech talent is so high that the government continues to invest millions into programs designed to develop a talent pipeline directly from universities while also scaling efforts in pre-employment training and mid-career upskilling to expand and elevate its workforce.

