As an Edge Systems Reliability Engineer, you will develop software, manage distributed systems, and optimize operations to ensure high reliability and performance of Cloudflare's Edge platform.
Available Locations: Bengaluru
About The Team
Infrastructure Engineering is responsible for the world's most reliable, observable, performant, and safe network ecosystem. Our customers rely on our products and systems to safely modify, troubleshoot, and release products without external impact.
Our external customers rely on us to provide seamless and predictable incident, traffic, policy management, resulting in the fastest and safest network services in the world.
We are accountable for the overall performance of internal and external facing services, guiding our product teams to optimal configurations and maximum efficiency. From the moment that a packet enters the Cloudflare ecosystem, we know exactly what its expected purpose and behavior is and we are capable of determining and exposing anomalous behavior.
The Cloudflare network makes it possible to solve challenges at massive scale and efficiency which would be impossible for almost any other organization.
What You'll Do
We are growing quickly and focused on building an extraordinary company. This is a systems reliability engineering role and is a superb opportunity to be part of a high performing team and help to support Cloudflare's mission and help build a better internet.
You will build services and APIs to constantly improve availability, performance and uptime.
You may be a good fit for our team if you have:
About The Team
Infrastructure Engineering is responsible for the world's most reliable, observable, performant, and safe network ecosystem. Our customers rely on our products and systems to safely modify, troubleshoot, and release products without external impact.
Our external customers rely on us to provide seamless and predictable incident, traffic, policy management, resulting in the fastest and safest network services in the world.
We are accountable for the overall performance of internal and external facing services, guiding our product teams to optimal configurations and maximum efficiency. From the moment that a packet enters the Cloudflare ecosystem, we know exactly what its expected purpose and behavior is and we are capable of determining and exposing anomalous behavior.
The Cloudflare network makes it possible to solve challenges at massive scale and efficiency which would be impossible for almost any other organization.
What You'll Do
- Develop Software: Design, write, and deliver software that improves Cloudflare's Edge platform
- Work on large scale systems: Scale and evolve systems through software and automation to improve reliability and velocity
- Maintain and manage distributed systems: Manage and be part of the on-call rotation that supports the largest distributed edge system in the world.
- Document, Propose and Implement: Collaborate with other engineers to design and implement scalable solutions that support our growing user base.
- Guide and mentor: Participate in the constant cycle of knowledge sharing and mentoring.
- Optimize and Automate: Research and introduce cutting-edge technologies. Develop and maintain sustainable tools that work on an extremely large scale.
- Open Source: Contribute to open-source
We are growing quickly and focused on building an extraordinary company. This is a systems reliability engineering role and is a superb opportunity to be part of a high performing team and help to support Cloudflare's mission and help build a better internet.
You will build services and APIs to constantly improve availability, performance and uptime.
You may be a good fit for our team if you have:
- Up to 8 years of experience managing distributed systems
- Proficiency in distributed Linux/Unix environments
- Proficiency in high-level programming (e.g., Golang, Python)
- Proficiency in configuration management (e.g., Saltstack, Chef, Puppet, Ansible)
- Proficiency in networking protocols Layer 3-7 of the OSI model
- Experience in performance analysis, debugging, and troubleshooting
- Experience in SQL databases (e.g., Postgres, MySQL)
- Experienced with being part of a rotation that tends to high priority reliability objectives
- Experience in load balancing and reverse proxies (e.g., Nginx)
- Familiarity with Key/Value stores (e.g., Redis)
- Familiarity with Internet working and BGP
- Exquisite written and verbal communication skills
- Strong bias for action
- Experience with continuous integration and delivery (CI/CD)
- Experience working in a 24/7/365 service environment
- Experience with high-bandwidth transit Internet working and routing
- Passion for tooling and automation
Top Skills
Ansible
Bgp
Chef
Go
MySQL
Nginx
Postgres
Puppet
Python
Redis
Saltstack
SQL
Cloudflare Singapore Office
Cloudflare Singapore Office
182 Cecil St, #35-01 Frasers Tower, Singapore, 069547
Similar Jobs at Cloudflare
Cloud • Information Technology • Security • Software • Cybersecurity
Join the NetOS team at Cloudflare, contributing to network software infrastructure, managing SONiC, and supporting network systems with a focus on security and software development.
Top Skills:
C++GnmiGoPrometheus ExportersPythonRedis
Cloud • Information Technology • Security • Software • Cybersecurity
As a Red Team Engineer, you'll develop adversarial emulation capabilities, enhance Cloudflare's security, and partner with various teams to innovate and execute Red Team strategies.
Top Skills:
APIsCloud SystemsCloudflare ProductsLinuxmacOSNetworking HardwareServerSoftware DevelopmentWeb ApplicationsWindows
Cloud • Information Technology • Security • Software • Cybersecurity
The Office Operations Manager leads daily operations, manages vendors, ensures workplace compliance, tracks metrics, and enhances employee experiences in a co-working environment.
Top Skills:
Google Workspace
What you need to know about the Singapore Tech Scene
The digital revolution has driven a constant demand for tech professionals across industries like software development, data analytics and cybersecurity. In Singapore, one of the largest cities in Southeast Asia, the demand for tech talent is so high that the government continues to invest millions into programs designed to develop a talent pipeline directly from universities while also scaling efforts in pre-employment training and mid-career upskilling to expand and elevate its workforce.