Site Reliability Engineer

Sonos, Inc.

California, United States

This job has been expired

At Sonos we want to create the ultimate listening experience for our customers and know that it starts by listening to each other. As part of the Sonos team, you’ll collaborate with people of all styles, skill sets, and backgrounds to realize our vision while fostering a community where everyone feels included and empowered to do the best work of their lives.

Are you excited about the scale of millions of devices in millions of homes? Do you want to work on the cutting edge of IoT and the rising wave of the connected home? Do you want to be part of the team that makes the magic happen in connecting the smartest speakers on the planet to streaming music services, voice services and other devices in the home?

We are seeking a software engineer with a passion for operations to embed on a cross-functional software development team. This software engineer will ensure our cloud services are designed with high availability, scalability, real-time monitoring and team supportability in mind.

In this role, you will draw upon a diverse set of technical expertise; including customer facing service architecture, troubleshooting and debugging skills, cloud computing, DevOps tooling, coding/scripting and infrastructure engineering to deliver exceptional always-on customer experiences. Beyond your technical ability, you will also have an excellent combination of innate curiosity and root cause focus, cross-group collaboration and communication skills, relationship building ability and planning skills. You strive to maintain an unwavering focus on Quality of Service; executing with high accountability and have a drive to improve, evolve and revolutionize the systems you manage with your team. You will have a sense of urgency to get things done efficiently, and you can do so independently as well as part of the team.

About You

You’re not like everyone else.

You bring a unique perspective to the table. Transparency tops your list of values. Your smarts and creativity are off the charts, matched only by your humility. You want to collaborate with a team of diverse talent. You proactively contribute to a culture of respect and inclusion.

You enjoy a challenge.

Inquisitive and focused, you see every challenge as an opportunity. You’re ambitious and unafraid to make mistakes because you learn from them and bounce back quickly. You don’t stop until you get it right. “Impossible” isn’t in your vocabulary. You’re more interested in creating the future than waiting for it.

What You’ll Do


  • Design for operations; from local development to production.
  • Design for security, high availability, reliability and quality of service.
  • Build delivery pipelines that ensure quality on every check-in.
  • Look for opportunities to create efficiencies through automation.
  • Make production deployments a routine event that any team member can do.
  • Actively contribute to our ability to measure the quality of our software.
  • Collaborate with DevOps Engineering to improve, contribute to and evolve the platform.


  • Build up the operational knowledge of the team to ensure everyone can support the full service stack.
  • Measure and optimize the service: tune alerts, right-size capacity, identify availability, performance and security opportunities.
  • Participate in 24×7 shared on-call rotation for production issues
  • Help define incident response procedures for production systems.
  • Debug software at the code and infrastructure level.
  • Participate in and encourage root cause analysis, use data to identify the scope and scale of impact.

Skills You’ll Need

  • Undergraduate degree in CS, a related technical field, or commensurate related work experience.
  • Expertise in designing, analyzing and troubleshooting large-scale distributed systems.
  • Scripting proficiency in shell and Python.
  • Software development experience (Java preferred).
  • Experience with cloud orchestration – focused on open source/Linux based systems (AWS, Google Cloud, Azure, Heroku).
  • Experience with configuration management and orchestration tools (Ansible, Puppet, Chef, Salt, Terraform).
  • Experience in service instrumentation, observability and monitoring production workloads.
  • Experience working with Unix/Linux systems from kernel to shell and beyond including working with system libraries, file systems, and client-server protocols.
  • Experience with containerized workloads in Docker and Kubernetes.
  • Scrum/Agile Methodology experience.
  • Familiarity with commercial software development practices (version control, defect tracking, product schedules and deliverables).
  • Caching layer technologies (Elasticache / memcached, redis) and CDN services such as Akamai and Apigee.
  • NoSQL experience (Cassandra, DynamoDB, MongoDB, etc).
  • Networking and security experience preferred.

Your profile will be reviewed and you’ll hear from us once we have an update. At Sonos we take the time to hire right and appreciate your patience.

Notice to U.S. Job Applicants: Sonos is an equal opportunity employer. We evaluate qualified applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, veteran status, and other legally protected characteristics.

Follow the links to review the EEO is the Law poster and its supplement. The pay transparency policy is available here. Sonos is committed to working with and providing reasonable accommodations to individuals with disabilities. If you need a reasonable accommodation because of a disability for any part of the employment process, please send an e-mail to and let us know the nature of your request and your contact information.