TextNow

Site Reliability Engineer

Reposted 4 Days Ago

Be an Early Applicant

In-Office or Remote

Hiring Remotely in Open Hall, Subd. F, NL

Senior level

In-Office or Remote

Hiring Remotely in Open Hall, Subd. F, NL

Senior level

The Senior Site Reliability Engineer at TextNow will maintain and scale production services, improve reliability, write automation code, and collaborate with development teams for optimal infrastructure performance.

The summary above was generated by AI

We believe communication belongs to everyone. We exist to democratize phone service. TextNow is evolving the way the world connects and that's because we're made up of people with curious minds who bring an optimistic, yet critical lens into the work we do. We're the largest provider of free phone service in the nation. And we're just getting started.

Join us in our mission to break down barriers to communication and free the flow of conversation for people everywhere.

TextNow is looking for motivated Site Reliability Engineer to own infrastructure, monitoring, logging, ci/cd, reliability and everything in between!

This role is about impact at scale. You’ll shape how TextNow builds and operates its systems in an AI-first environment where intelligent tooling is embedded into everyday engineering practice. Using AI is not optional, it’s expected. From design and architecture to implementation, testing, debugging, documentation, and operational analysis, you will actively leverage AI tools to increase velocity, improve code quality, and make better technical decisions. We provide a robust suite of AI-powered development tools and workflows to support you, and we expect you to continuously evolve how you use them to raise the bar for efficiency, clarity, and product excellence across the organization.

What You'll Do

Ensure System Reliability: Design, build, and maintain scalable, resilient, and highly available systems to support TextNow’s infrastructure and services.
Automation & Infrastructure as Code: Develop and maintain automation using Terraform, Ansible, and other tools to enable efficient deployment, scaling, and operations of cloud-based systems (AWS preferred).
Incident Response & On-Call Support: Participate in an on-call rotation, troubleshoot issues, and drive incident resolution to minimize downtime and improve system performance. Conduct post-mortems and implement corrective actions to enhance reliability.
Performance Monitoring & Optimization: Implement and improve observability tools, logging, and monitoring solutions to identify and mitigate potential system issues proactively.
Collaboration & Cross-Team Engagement: Work closely with software engineers, DevOps, and product teams to align technical efforts with business objectives and improve system reliability from development to production.
Continuous Improvement: Identify areas for improvement in architecture, automation, and operational practices. Contribute to the design and implementation of new SRE best practices.

You'll be a great fit if you have:

Experienced in SRE/DevOps: You have 5+ years of experience in an operationally focused role, such as SRE, DevOps, or Infrastructure Engineering, with a deep understanding of reliability, scalability, and performance optimization.
Proficient with Key Technologies: Hands-on experience with AWS, GitHub, Terraform, Ansible, or similar tools to build and manage cloud infrastructure efficiently.
Incident Management Expert: You are comfortable handling production incidents, analyzing root causes, and implementing long-term fixes to prevent recurrence.
Automation & Observability Focused: Passionate about reducing toil through scripting and automation while ensuring robust observability using logging, metrics, and monitoring tools.
Collaborative & Impact-Driven: You enjoy working cross-functionally with engineers, product teams, and leadership to drive meaningful improvements to system reliability.

AI usage: We use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact our Talent Team.

More about TextNow

Our culture

We’re proud of the culture that we’ve built at TextNow, but one of the most common questions we hear is ‘how do you continue to sustain it as the world and the company continues to change?’ The reality is that we’re only able to keep up because each and every TextNovian contributes to our culture through being involved, by living our values, sharing feedback, embracing change and more! 

Our values

Customer Obsessed

We strive to have a deep understanding of our customers.

Do Right By Our People

We treat each other with fairness, respect, and integrity.

Accept the Challenge

We adopt a "Yes, We Can" mindset to achieve ambitious goals.

Act Like an Owner

We treat this company like it's our own... because it is!

Give a Damn

We are deeply committed and passionate about our work and achieving results.

Our benefits and more

This is a brief overview of the benefits that TextNow offers its employees. More complete details can be found in TextNow’s Benefit Guide and legal plan documents, which are available to employees on or shortly after their start date with TextNow. The benefits listed herein are for illustrative purposes only and may change from time-to-time in TextNow’s sole discretion.

Free phone service

Strong work life blend 

Flexible work arrangements (work-from-home, remote, or access to one of our office spaces)

Employee stock options 

Unlimited vacation 

12 paid holidays per year

Competitive pay

Health, dental, and vision benefits

Short-term & long-term disability

$750 annual wellness benefit or healthcare spending account

RRSP matching (Canada) | 401(K) (USA)

Parental leave for eligible employees

Learning & Development opportunities

We travel a few times a year for various team events, company-wide off-sites, and more

More information about our total rewards package will be available during the hiring process.

Dogfooding & Customer Obsession

At TextNow, every employee gets to actively use our app for calling and texting. Dogfooding helps us experience what customers do, to spot issues early, and drive better design, developer, and user experiences.

Diversity and Inclusion

Our aim is to make every person who joins TextNow feel like they belong, that they’re valued, and that they’re able to be their authentic selves at work. 

We’re all accountable for creating an inclusive culture and a sense of belonging for one another. By doing this together, we’ll make TextNow better for everyone. 

Equal opportunity

We are an equal opportunity employer and are committed to creating an inclusive environment for all employees. We consider all qualified applicants without regard to race, color, religion, sex, gender identity or expression, sexual orientation, age, disability, or any other protected characteristic.

Applicants who require reasonable accommodation during the hiring process may contact our Talent Team.

TextNow Candidate Policy

By submitting an application to TextNow, you agree to the collection, use, and disclosure of your personal information in accordance with the TextNow Candidate Policy

We use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact our Talent Team.

Top Skills

Ansible

AWS

Bash

Docker

Kubernetes

Linux

Mariadb

Puppet

Python

Redis

Ruby

Terraform

Similar Jobs

Coalition

Site Reliability Engineer

14 Days Ago

Remote

Canada

Mid level

Insurance • Cybersecurity

The Site Reliability Engineer II role involves building and operating infrastructure, automating deployment, ensuring system reliability, and enabling developers through self-service tools. Key responsibilities include improving observability and mentoring junior engineers.

Top Skills: AWSEcsGithub ActionsGoKafkaKinesisKubernetesPythonTerraform

Tyk

Site Reliability Engineer

15 Days Ago

Remote

Canada

Senior level

Cloud • Software

Operate, maintain and improve the global Tyk Cloud platform: run production Kubernetes clusters, manage cloud infrastructure, automate operations, run on-call incident response, create monitoring and dashboards, conduct post-incident analysis, document SRE processes, and drive reliability, efficiency and multi-region/multi-cloud expansion.

Top Skills: Kubernetes,Containers,Aws,Eks,Linux,Terraform,Helm,Go,Python,Mongodb,Redis,Prometheus,Grafana,Thanos,Logging Collection And Analysis Systems,Rancher,Gcp,Azure,Dns,Tcp/Ip,Http,Tls,Udp,Infrastructure As Code (Iac)

Finite State

Senior Site Reliability Engineer

16 Days Ago

Easy Apply

Remote

Easy Apply

Senior level

Security • Cybersecurity

Lead the design and implementation of observability, SLO/SLA frameworks, and AI-enabled infrastructure automation. Architect scalable AWS infrastructure, improve incident management and on-call practices, and drive organization-wide adoption of telemetry and reliability standards.

Top Skills: Ai-Assisted ToolingAWSCi/CdClaudeCodexCursorGrafanaHoneycombInfrastructure-As-CodeObservabilityPulumiSupabaseTelemetryTerraformVercel

What you need to know about the Singapore Tech Scene

The digital revolution has driven a constant demand for tech professionals across industries like software development, data analytics and cybersecurity. In Singapore, one of the largest cities in Southeast Asia, the demand for tech talent is so high that the government continues to invest millions into programs designed to develop a talent pipeline directly from universities while also scaling efforts in pre-employment training and mid-career upskilling to expand and elevate its workforce.