As a Hardware Systems Engineer, you'll troubleshoot and maintain Cloudflare's server fleet, validate firmware updates, and enhance automation tools.
Available Locations: Bengaluru
About the department
Cloudflare's Infrastructure group is responsible for building our global network. Our Hardware Engineering team helps research, develop, test, and deploy new equipment enabling 20% of the world's internet traffic to be served smoothly. Deployed across 330 cities in 120+ countries, the hardware we select helps improve the security, reliability, and performance of the Internet.
About the Role
We need to make thoughtful infrastructure choices affecting a significant portion of the Internet. Hardware we work with includes servers and components, as well as PDUs and network hardware. As a Hardware Systems Engineer, you will work with colleagues on the Hardware Engineering, Product, and Hardware Sourcing teams to troubleshoot and maintain Cloudflare's worldwide fleet of storage and compute servers.
What you'll do
- Work with software teams to validate bug fixes and assess performance of new firmware revisions
- Validate and deploy firmware updates to the fleet, monitoring the progress of the rollout for compliance and reliability
- Work with server and component vendors to obtain, debug, and maintain the latest updates
- Work with our Site Reliability Engineering teams to triage hardware problem reports
- Support our Data Centre Engineering teams in resolving hardware issues
- Develop and maintain automation tools to update firmware on servers and components in Cloudflare's fleet
- Communicate your results and updates through blog posts, internal talks, and tickets
Examples of desirable skills, knowledge and experience
- Bachelor's degree in Computer Engineering, Electrical Engineering, or Computer Science
- Desire to learn about the Cloudflare hardware used by 20% of all websites
- Desire to learn how a diverse server fleet is managed at scale
- Desire to learn the tools Cloudflare uses to maintain and monitor our hardware
- Knowledge of Bash, Python, and basic Linux task automation
- Knowledge of x86 server hardware, including motherboards, CPUs, memory, storage, and firmware updates. Knowledge of other platforms such as Arm is a bonus.
- Knowledge of configuration management principles; in particular, we use Salt to manage our fleet
- Knowledge of Redfish, IPMI and server remote management protocols
- Knowledge of running mission-critical production systems
Bonus Points
- Familiarity with server hardware architecture
- Knowledge of debugging server hardware faults and the ability to engage with our sourcing team and vendors to improve quality
- Experience managing large fleets comprising thousands of servers
- Experience with observability and monitoring tools such as Prometheus and Grafana, and the ability to observe trends over time
- Experience with software development tools and processes such as Git, Bitbucket, TeamCity, and Jira
Top Skills
Bash
Bitbucket
Git
Grafana
IPMI
Jira
Linux
Prometheus
Python
Redfish
Salt
TeamCity
X86 Server Hardware
Cloudflare Singapore Office
182 Cecil St, #35-01 Frasers Tower, Singapore, 069547