SMC Cloud Logo

SMC Cloud

Senior Network Engineer, AI Infrastructure

Posted 2 Days Ago
Be an Early Applicant
In-Office
Singapore
Senior level
In-Office
Singapore
Senior level
The Senior Network Engineer will design, configure, and deploy AI infrastructures, optimize network performance, and support operational reliability for AI clusters.
The summary above was generated by AI

ROLES AND RESPONSIBILITIES

Firmus Technologies is seeking a skilled Senior Network Engineer specialising in AI networks to join our Cloud Architecture and Software Defined Infrastructure team.

The ideal candidate will play a crucial role in network design, configuration, and deployment for AI infrastructure projects. This role offers an exciting opportunity to work at the forefront of AI networking technology and contribute to the growth of AI infrastructure.

  • Primary responsibilities will include design and building bespoke AI infrastructure for new and existing customers.
  • Support operational and reliability aspects of large-scale AI clusters with a focus on performance at scale, real-time monitoring, logging, and alerting.
  • Provide specialist network engineering support to ensure optimal operation of network software and hardware.
  • Develop high quality automation and scripts to operate network infrastructure at scale.
  • Engage in and improve the whole lifecycle of services – from inception and design through deployment, operation, and refinement.
  • Improve internal tooling by identifying automation opportunities to drive speed and scale in our capabilities.
  • Be the subject matter expert for networking-related escalations.
  • Provide feedback to internal teams such as opening bugs, documenting workarounds, and suggesting improvements.

SKILLS AND EXPERIENCE

  • B.Sc in Computer Science/Electrical/Mechanical Engineering or equivalent experience.
  • Hands-on experience in solving problems in large-scale RDMA over Converged Ethernet (RoCE) or InfiniBand network environments.
  • Strong hands-on experience in Linux-based platforms.
  • In-depth knowledge of network protocols and tools and management of security measures for network infrastructure.
  • Familiarity with data path hardware acceleration protocols and interfaces such as RDMA, RoCE, InfiniBand etc.
  • Familiarity with Infrastructure as Code practices. Experience in developing IaC to support automation.
  • Experience in using network automation tools such as Terraform, Ansible, Puppet, and Python scripts.
  • Familiarity with Linux networking, using device API and firewall policy management.
  • Experience with switching and routing network protocols.
  • Fast and independent self-learner with outstanding technical skills.
  • Driven and focused on customer needs and satisfaction.
  • Self-motivated with excellent leadership skills.
  • Strong written, verbal, and listening skills are essential.

KEY COMPETENCIES

  • CCIE or equivalent networking certifications and certification in Linux systems.
  • 5+ years of experience with AI, HPC, or parallel network architectures.
  • Proficiency in Infrastructure as Code (IaC) tools (e.g. Ansible, Netbox, Python scripts).
  • Understanding of how MPI, RDMA, and NCCL works, as well as an understanding of how job schedules (SLRUM, PBS) work.
  • Proven knowledge of Python or Bash.
  • Professional Services/Infrastructure Specialists delivery experience.

LOCATION

Singapore


EMPLOYMENT BASIS

Full-Time


Top Skills

Ai Infrastructure
Ansible
Infiniband
Infrastructure As Code (Iac)
Linux
Network Design
Puppet
Python
Rdma
Terraform
HQ

SMC Cloud Singapore Office

Singapore

Similar Jobs

An Hour Ago
Remote or Hybrid
2 Locations
Mid level
Mid level
Cloud • Information Technology • Security • Software • Cybersecurity
The Technical Support Engineer will provide advanced support to enterprise customers, troubleshoot technical issues involving various protocols and services, and create technical documentation.
Top Skills: AnycastApacheBashCachingCorsDnsHTTPIamIisJavaScriptLoad BalancingMs SqlMySQLNginxPostgresPythonSsl/TlsTcp/Udp
An Hour Ago
Remote or Hybrid
Singapore, SGP
Mid level
Mid level
Cloud • Information Technology • Security • Software • Cybersecurity
As a Customer Success Manager, you will manage customer relationships, ensure value realization from Cloudflare solutions, and collaborate with internal teams to drive customer satisfaction, focusing on iGaming and Entertainment industries.
Top Skills: B2B SaasGainsight
An Hour Ago
Remote or Hybrid
2 Locations
Senior level
Senior level
Cloud • Information Technology • Security • Software • Cybersecurity
The Premium Technical Support Engineer will provide high-quality support for CDN services, troubleshoot technical issues, and collaborate with customers and teams to enhance product performance.
Top Skills: BashBgpCdnCurlDdos MitigationDigDnsFirewall RulesGitHttp/SIptablesJavaScriptMs SqlMySQLOpensslPostgresPythonSsl/TlsTcp/IpTraceroute

What you need to know about the Singapore Tech Scene

The digital revolution has driven a constant demand for tech professionals across industries like software development, data analytics and cybersecurity. In Singapore, one of the largest cities in Southeast Asia, the demand for tech talent is so high that the government continues to invest millions into programs designed to develop a talent pipeline directly from universities while also scaling efforts in pre-employment training and mid-career upskilling to expand and elevate its workforce.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account