Site Reliability Engineer - Peloton Interactive|Meet.jobs

Salary

137k - 178k USD Annually

Required skills

    Job description

    ABOUT THE ROLE

    At Peloton, we view Platform as a Product. A phenomenal platform unlocks speed of development and learning. It allows us to scale easily, enabling our engineers to maximize attention on new features and capabilities. A key to crafting a phenomenal platform is data-driven insights and understanding where we should focus our attention to create the best outcomes for our members. Platform at Peloton is a force-multiplier that enables Peloton to move faster and scale safely with minimal effort. Core to this mission is creation of the best developer experience in the tech industry for the entire spectrum of Peloton's technology. We work across an incredible range of technology domains: hardware, firmware, web, mobile, backend, data, messaging, content, streaming, and machine learning. We get to apply these to create a platform of products loved by millions of customers all over the world.

    Peloton is looking for a Site Reliability Engineer with a content and music operations focus to work with teams across the Content Business Solutions Group to help build and maintain a monitorable, performant, reliable, and highly-scalable deployment platform. We are a growing team of engineers tackling exciting problems to handle thousands of nodes and pods spread across many deployments.

    YOUR DAILY IMPACT AT PELOTON

    • Automatic, fast auto scaling for music ingestion and media asset management
    • Host a critical infrastructure that ensures that our developers have the best experience possible on hundreds of pods across multiple clusters
    • Provide a platform for machine learning (and other exciting workloads) Allow developers to move quickly and experiment, without getting in the way
    • Promote standard methodologies for building and operating highly reliable systems
    • Serve as domain expert in observability and monitoring
    • Consult in system design to meet reliability and capacity requirements
    • Automate everything, from infrastructure down to day-to-day tasks.
    • Conduct timely post-mortems of infrastructure incidents
    • Assist with all aspects of operational security and compliance
    • Seek out potential threats to security and reliability and advocate solutions
    • We work with Amazon Web Services, C#, Python, Nginx, Jenkins, and Terraform and more

    YOU BRING TO PELOTON

    • Experience maintaining scalable and stable Kubernetes clusters.
    • Knowledge of best practices when it comes to the observability and monitoring required of running Kubernetes at scale.
    • Knowledge of standard processes in regards to securing a Kubernetes cluster and its deployments at scale
    • A passion for helping development teams make the transition to a container-native world
    • Experience with CI/CD Systems such as for example: Jenkins, ArgoCD, Harness, Tekton, etc.
    • Experience deployment infrastructure using Infrastructure as Code utilities such as Terraform or Pulumi
    • Know when to triage and when to dive down into a root-cause analysis
    • Passion for reliable, scalable, observable software with a strong sense of ownership
    • Experience with a programming language like Python, Golang, Java, C#, etc.

    #LI-HYBRID

    #LI-SW2

    Peloton Interactive focuses on Hardware, Retail, Fitness, Video Streaming, and Android. Their company has offices in New York City. They have a very large team that's between 1001-5000 employees. To date, Peloton Interactive has raised $1.041B of funding; their latest round was closed on August 2018 at a valuation of $4.125B.

    You can view their website at http://www.onepeloton.com or find them on Twitter, Facebook, and LinkedIn.

    Peloton Interactive