Site Reliability Engineer - Peloton Interactive｜Meet.jobs

Salary

137k - 178k USD Annually

Required skills

Job description

ABOUT THE ROLE

At Peloton, we view Platform as a Product. A phenomenal platform unlocks speed of development and learning. It allows us to scale easily, enabling our engineers to maximize attention on new features and capabilities. A key to crafting a phenomenal platform is data-driven insights and understanding where we should focus our attention to create the best outcomes for our members. Platform at Peloton is a force-multiplier that enables Peloton to move faster and scale safely with minimal effort. Core to this mission is creation of the best developer experience in the tech industry for the entire spectrum of Peloton's technology. We work across an incredible range of technology domains: hardware, firmware, web, mobile, backend, data, messaging, content, streaming, and machine learning. We get to apply these to create a platform of products loved by millions of customers all over the world.

Peloton is looking for a Site Reliability Engineer with a content and music operations focus to work with teams across the Content Business Solutions Group to help build and maintain a monitorable, performant, reliable, and highly-scalable deployment platform. We are a growing team of engineers tackling exciting problems to handle thousands of nodes and pods spread across many deployments.

YOUR DAILY IMPACT AT PELOTON

Automatic, fast auto scaling for music ingestion and media asset management
Host a critical infrastructure that ensures that our developers have the best experience possible on hundreds of pods across multiple clusters
Provide a platform for machine learning (and other exciting workloads) Allow developers to move quickly and experiment, without getting in the way
Promote standard methodologies for building and operating highly reliable systems
Serve as domain expert in observability and monitoring
Consult in system design to meet reliability and capacity requirements
Automate everything, from infrastructure down to day-to-day tasks.
Conduct timely post-mortems of infrastructure incidents
Assist with all aspects of operational security and compliance
Seek out potential threats to security and reliability and advocate solutions
We work with Amazon Web Services, C#, Python, Nginx, Jenkins, and Terraform and more

YOU BRING TO PELOTON

Experience maintaining scalable and stable Kubernetes clusters.
Knowledge of best practices when it comes to the observability and monitoring required of running Kubernetes at scale.
Knowledge of standard processes in regards to securing a Kubernetes cluster and its deployments at scale
A passion for helping development teams make the transition to a container-native world
Experience with CI/CD Systems such as for example: Jenkins, ArgoCD, Harness, Tekton, etc.
Experience deployment infrastructure using Infrastructure as Code utilities such as Terraform or Pulumi
Know when to triage and when to dive down into a root-cause analysis
Passion for reliable, scalable, observable software with a strong sense of ownership
Experience with a programming language like Python, Golang, Java, C#, etc.

#LI-HYBRID

#LI-SW2

Peloton Interactive focuses on Hardware, Retail, Fitness, Video Streaming, and Android. Their company has offices in New York City. They have a very large team that's between 1001-5000 employees. To date, Peloton Interactive has raised $1.041B of funding; their latest round was closed on August 2018 at a valuation of $4.125B.

You can view their website at http://www.onepeloton.com or find them on Twitter, Facebook, and LinkedIn.

All Jobs

Referrer

Employer

Column

Log in

Sign up

Site Reliability Engineer - Peloton Interactive｜Meet.jobs

Salary

137k - 178k USD Annually

Required skills

Job description

Peloton Interactive