Netflix Logo

Netflix

Site Reliability Engineer L4/L5, Games Engineering

Posted 3 Days Ago
Remote
Hiring Remotely in USA
100K-720K
Mid level
Remote
Hiring Remotely in USA
100K-720K
Mid level
As a Site Reliability Engineer at Netflix, you'll enhance gaming platform reliability, manage incidents, build detection tools, and improve operational excellence.
The summary above was generated by AI

Netflix is one of the world's leading entertainment services, with over 300 million paid memberships in over 190 countries enjoying TV series, films and games across a wide variety of genres and languages. Members can play, pause and resume watching as much as they want, anytime, anywhere, and can change their plans at any time.

Netflix has revolutionized the global entertainment industry and is on the path to achieving our next big objective: reinventing video games on a global scale. You will be a key engineer in strengthening our platform’s foundation while empowering various teams to move faster and more confidently. 

This role sits at the intersection of infrastructure, reliability, developer tools, and game studio partners - helping to build systems that scale with our users and tools that delight our engineers. We do this through cross-functional engagement with other engineering teams, managing incidents when they happen, and promoting reliability and resilience practices throughout the organization. This role is rewarding for people who have a passion for leveraging technology and working collaboratively with others to solve business problems.  Our team is seeking individuals with a broad set of technical skills who have seen how large distributed systems break!  

If this excites you, we invite you to bring your unique career and life experiences to enrich the culture and diversity of our team. Even if you are unsure if you fit the criteria listed below, we encourage you to apply.

Responsibilities:

  • Increase Netflix Games’ reliability using a strategic and operationally focused mindset to identify and solve problems across multiple domains

  • We command incidents to restore service for customers quickly. You will participate in our on-call rotation to bring a reliable game platform to both our players and developers. 

  • Build and adopt tools to help the platform detect problems and remediate issues faster.

  • Influence technical roadmaps to drive better operational excellence via observability, system correctness, and higher quality

  • Build dashboards and insights products to help both engineers and senior leadership understand our platform health. 

  • Form and maintain relationships with internal and external partners

We Value:

  • The ability to develop alignment and cultivate relationships to drive impact

  • Curiosity about how complex sociotechnical systems successfully operate at scale when failure is inevitable

  • Collaboration, continuous improvement, and iteration as the path forward

  • A desire to grow expertise, influence, and educate others

  • Being decisive and exercising good judgment when in crisis mode  

Our Work:

  • Develop automated solutions to reduce toil for our games catalog 

  • Operationalize learnings by collaborating cross-functionally with multiple engineering teams

  • Identify, assess, and mitigate risks associated with our systems, applications, and infrastructure. 

  • Proactively recognize sources of instability in distributed systems and analyze how complex systems fail from a reliability and resilience perspective.

  • Improve services’ availability, reliability, and observability and reduce human toil with tooling and automation.

  • Lead incident response and post-incident reviews, contributing to failure analysis and implementing preventive measures. 

You may be a good fit if you have:

  • Development experience with Java, JavaScript/Node.js, Python, or Go

  • Knowledge of cloud platforms (i.e. AWS, GCP, etc.) and microservices architecture, or bare metal platforms and debugging Unix/Linux systems (engineering fundamentals, networking, storage, operating systems)

  • Understand networking concepts and application protocols, especially TCP/UDP/IP, BGP, HTTP/S, TURN, and DNS

  • Knowledge of distributed analytic processing technologies (Presto/Trino, Spark SQL, etc)

  • Experience in risk management and incident management

  • Experience working in the games industry

  • Strong writing and presentation skills

Be sure to review our culture page and long-term view to learn more about the unique Netflix culture and the opportunity to be part of our team. We are a geographically distributed team looking for applicants based anywhere in the US.

We are an equal-opportunity employer and celebrate diversity. We recognize that diversity of thought and background builds stronger teams and approach diversity and inclusion seriously and thoughtfully. We do not discriminate based on race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

At Netflix, we carefully consider a wide range of compensation factors to determine your personal top of market. We rely on market indicators to determine compensation and consider your specific job family, background, skills, and experience to get it right. These considerations can cause your compensation to vary and will also be dependent on your location.

The overall market range for roles in this area of Netflix is typically $100,000 - $720,000.

This market range is based on total compensation (vs. only base salary), which is in line with our compensation philosophy. Netflix is a unique culture and environment. Learn more here.

Inclusion is a Netflix value and we strive to host a meaningful interview experience for all candidates. If you want an accommodation/adjustment for a disability or any other reason during the hiring process, please send a request to your recruiting partner.

We are an equal-opportunity employer and celebrate diversity, recognizing that diversity builds stronger teams. We approach diversity and inclusion seriously and thoughtfully. We do not discriminate on the basis of race, religion, color, ancestry, national origin, caste, sex, sexual orientation, gender, gender identity or expression, age, disability, medical condition, pregnancy, genetic makeup, marital status, or military service.

Job is open for no less than 7 days and will be removed when the position is filled.

Top Skills

AWS
GCP
Go
Java
JavaScript
Linux
Node.js
Presto
Python
Spark Sql
Trino
Unix

Similar Jobs

2 Hours Ago
Remote or Hybrid
United States
Junior
Junior
Fintech • Legal Tech • Payments • Sales • Software
Conduct user research to identify pain points and deliver user-centered solutions. Collaborate with cross-functional teams to translate insights into actionable designs.
Top Skills: DovetailFigmaMazeQualtricsReductUsertestingZoom
2 Hours Ago
Remote or Hybrid
2 Locations
176K-200K Annually
Senior level
176K-200K Annually
Senior level
Fintech • Machine Learning • Payments • Software • Financial Services
Lead technical program management for data-intensive solutions, ensure execution of cloud software delivery, and collaborate across teams to drive business impact.
Top Skills: AgileAWSCloud Computing
2 Hours Ago
Remote or Hybrid
United States
145K-190K Annually
Senior level
145K-190K Annually
Senior level
Big Data • Fintech • Information Technology • Insurance • Financial Services
As a Developer in Underwriting Technology, you will own initiatives, prioritize product backlog, engage stakeholders, provide technical guidance, and ensure strategic alignment.
Top Skills: Agile MethodologiesArchitecture DesignAWSCloud ComputingSoftware Development

What you need to know about the Charlotte Tech Scene

Ranked among the hottest tech cities in 2024 by CompTIA, Charlotte is quickly cementing its place as a major U.S. tech hub. Home to more than 90,000 tech workers, the city’s ecosystem is primed for continued growth, fueled by billions in annual funding from heavyweights like Microsoft and RevTech Labs, which has created thousands of fintech jobs and made the city a go-to for tech pros looking for their next big opportunity.

Key Facts About Charlotte Tech

  • Number of Tech Workers: 90,859; 6.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Lowe’s, Bank of America, TIAA, Microsoft, Honeywell
  • Key Industries: Fintech, artificial intelligence, cybersecurity, cloud computing, e-commerce
  • Funding Landscape: $3.1 billion in venture capital funding in 2024 (CED)
  • Notable Investors: Microsoft, Google, Falfurrias Management Partners, RevTech Labs Foundation
  • Research Centers and Universities: University of North Carolina at Charlotte, Northeastern University, North Carolina Research Campus

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account