The Senior Site Reliability Engineer at TextNow will maintain and scale production services, improve reliability, write automation code, and collaborate with development teams for optimal infrastructure performance.
We believe communication belongs to everyone. We exist to democratize phone service. TextNow is evolving the way the world connects and that's because we're made up of people with curious minds who bring an optimistic, yet critical lens into the work we do. We're the largest provider of free phone service in the nation. And we're just getting started.
Join us in our mission to break down barriers to communication and free the flow of conversation for people everywhere.
TextNow is looking for motivated Senior Site Reliability Engineer to own infrastructure, monitoring, logging, ci/cd, reliability and everything in between!
What You'll Do
- Ensure System Reliability: Design, build, and maintain scalable, resilient, and highly available systems to support TextNow’s infrastructure and services.
- Automation & Infrastructure as Code: Develop and maintain automation using Terraform, Ansible, and other tools to enable efficient deployment, scaling, and operations of cloud-based systems (AWS preferred).
- Incident Response & On-Call Support: Participate in an on-call rotation, troubleshoot issues, and drive incident resolution to minimize downtime and improve system performance. Conduct post-mortems and implement corrective actions to enhance reliability.
- Performance Monitoring & Optimization: Implement and improve observability tools, logging, and monitoring solutions to identify and mitigate potential system issues proactively.
- Collaboration & Cross-Team Engagement: Work closely with software engineers, DevOps, and product teams to align technical efforts with business objectives and improve system reliability from development to production.
- Continuous Improvement: Identify areas for improvement in architecture, automation, and operational practices. Contribute to the design and implementation of new SRE best practices.
You'll be a great fit if you have:
- Experienced in SRE/DevOps: You have 5+ years of experience in an operationally focused role, such as SRE, DevOps, or Infrastructure Engineering, with a deep understanding of reliability, scalability, and performance optimization.
- Proficient with Key Technologies: Hands-on experience with AWS, GitHub, Terraform, Ansible, or similar tools to build and manage cloud infrastructure efficiently.
- Incident Management Expert: You are comfortable handling production incidents, analyzing root causes, and implementing long-term fixes to prevent recurrence.
- Automation & Observability Focused: Passionate about reducing toil through scripting and automation while ensuring robust observability using logging, metrics, and monitoring tools.
- Collaborative & Impact-Driven: You enjoy working cross-functionally with engineers, product teams, and leadership to drive meaningful improvements to system reliability.
More about TextNow
Our culture
We’re proud of the culture that we’ve built at TextNow, but one of the most common questions we hear is ‘how do you continue to sustain it as the world and the company continues to change?’ The reality is that we’re only able to keep up because each and every TextNovian contributes to our culture through being involved, by living our values, sharing feedback, embracing change and more!
Our values
Customer Obsessed
We strive to have a deep understanding of our customers.
Do Right By Our People
We treat each other with fairness, respect, and integrity.
Accept the Challenge
We adopt a "Yes, We Can" mindset to achieve ambitious goals.
Act Like an Owner
We treat this company like it's our own... because it is!
Give a Damn
We are deeply committed and passionate about our work and achieving results.
Our benefits and more
This is a brief overview of the benefits that TextNow offers its employees. More complete details can be found in TextNow’s Benefit Guide and legal plan documents, which are available to employees on or shortly after their start date with TextNow. The benefits listed herein are for illustrative purposes only and may change from time-to-time in TextNow’s sole discretion.
Free phone service
Strong work life blend
Flexible work arrangements (work-from-home, remote, or access to one of our office spaces)
Employee stock options
Unlimited vacation
12 paid holidays per year
Competitive pay
Health, dental, and vision benefits
Short-term & long-term disability
$750 annual wellness benefit or healthcare spending account
RRSP matching (Canada) | 401(K) (USA)
Parental leave for eligible employees
Learning & Development opportunities
We travel a few times a year for various team events, company-wide off-sites, and more
More information about our total rewards package will be available during the hiring process.
Dogfooding & Customer Obsession
At TextNow, every employee gets to actively use our app for calling and texting. Dogfooding helps us experience what customers do, to spot issues early, and drive better design, developer, and user experiences.
Diversity and Inclusion
Our aim is to make every person who joins TextNow feel like they belong, that they’re valued, and that they’re able to be their authentic selves at work.
We’re all accountable for creating an inclusive culture and a sense of belonging for one another. By doing this together, we’ll make TextNow better for everyone.
Equal opportunity
We are an equal opportunity employer and are committed to creating an inclusive environment for all employees. We consider all qualified applicants without regard to race, color, religion, sex, gender identity or expression, sexual orientation, age, disability, or any other protected characteristic.
Applicants who require reasonable accommodation during the hiring process may contact our Talent Team.
TextNow Candidate Policy
By submitting an application to TextNow, you agree to the collection, use, and disclosure of your personal information in accordance with the TextNow Candidate Policy
AI usage
We use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact our Talent Team.
Top Skills
Ansible
AWS
Bash
Docker
Go
Kubernetes
Linux
Mariadb
Puppet
Python
Redis
Ruby
Terraform
Similar Jobs
Artificial Intelligence • Information Technology • Software • Automation
As a Senior Site Reliability Engineer, you'll design and maintain Kubernetes-based infrastructure, ensure system reliability, and automate operational tasks while working with cross-functional teams to enhance platform capabilities.
Top Skills:
CloudFormationDatadogDockerElk StackGoGrafanaKubernetesNew RelicPrometheusPythonTerraform
Artificial Intelligence
The Senior Site Reliability Engineer will ensure high availability of core services, optimize system performance, manage cloud infrastructure, and collaborate with teams to solve engineering challenges.
Top Skills:
ArgocdAtlantisAWSGCPGithub ActionsGoHelmKubernetesPythonTerraform
Artificial Intelligence • Cloud • Information Technology • Software
The Senior Site Reliability Engineer is responsible for managing AI infrastructure, ensuring reliability through scalability, incident response, and collaboration with suppliers, focusing on Kubernetes and advanced GPU services.
Top Skills:
AnsibleBashGrafanaKubernetesPrometheusPython
What you need to know about the Singapore Tech Scene
The digital revolution has driven a constant demand for tech professionals across industries like software development, data analytics and cybersecurity. In Singapore, one of the largest cities in Southeast Asia, the demand for tech talent is so high that the government continues to invest millions into programs designed to develop a talent pipeline directly from universities while also scaling efforts in pre-employment training and mid-career upskilling to expand and elevate its workforce.


