Optimize and compress large language and vision models for on-device inference. Build distillation and hardware-specific compilation pipelines, and benchmark performance across NPU/GPU architectures for efficient edge deployment.
We are looking for an AI Specialist Engineer to enhance the performance of large language and vision models for on-device inference. Your expertise will be crucial in developing and deploying cutting-edge AI solutions, ensuring optimal efficiency across diverse hardware architectures.
Responsibilities:
- Compress and optimize large language and vision models for on-device inference.
- Develop pipelines for model distillation and hardware-specific compilation.
- Benchmark performance across various NPU/GPU architectures.
Qualifications:
- Expertise in model distillation, pruning, and 4-bit/8-bit quantization techniques.
- Hands-on experience with TensorRT, ONNX Runtime, and edge deployment.
- Strong C++ and Python skills.
Similar Jobs
Agency • Artificial Intelligence • Blockchain • Web3
Design and run adversarial tests on LLMs and multimodal agents, implement guardrails and real-time filters for autonomous tools, and develop constitutional AI principles and RLHF alignment pipelines to ensure safe AI deployments.
Top Skills:
Adversarial MlConstitutional AiGuardrailsJailbreak TaxonomiesLlmsMultimodal AgentsPrompt EngineeringReal-Time FilteringRed-Teaming FrameworksRlhf
Artificial Intelligence • Software
Seeking a Semiconductor Reverse Engineering Specialist to develop AI-driven solutions, collaborating on workflows and knowledge systems while ensuring technical accuracy and validation of AI outputs.
Top Skills:
AIReverse EngineeringSemiconductor
5 Hours Ago
eCommerce • Fashion • Retail • Sales • Wearables • Design
The analyst will manage lease accounting operations, ensure compliance with accounting standards, assist in tax filings, and support audits. They will also implement efficiency improvements within accounting processes.
Top Skills:
Ai-Enabled ProcessesAutomation ToolsIfrsMS OfficeSAPUs Gaap
What you need to know about the Singapore Tech Scene
The digital revolution has driven a constant demand for tech professionals across industries like software development, data analytics and cybersecurity. In Singapore, one of the largest cities in Southeast Asia, the demand for tech talent is so high that the government continues to invest millions into programs designed to develop a talent pipeline directly from universities while also scaling efforts in pre-employment training and mid-career upskilling to expand and elevate its workforce.


