Distributed Systems Engineer (L5 + L6), Compute Runtime
Netflix
United States
At Netflix, our mission is to entertain the world. Together, we are writing the next episode - pushing the boundaries of storytelling, global fandom and making the unimaginable a reality. We are a dream team obsessed with the uncomfortable excitement of discovering what happens when you merge creativity, intuition and cutting-edge technology. Come be a part of what’s next.
The Opportunity
The Cloud Infrastructure Organization’s mission is to increase the fleet-wide agility, scalability, reliability, and efficiency of our Platform. We are responsible for one of the largest Cloud Compute footprints built on AWS and are constantly seeking to push boundaries through innovation to delight hundreds of millions of Netflix users daily.
We are seeking a passionate engineer to join the team responsible for developing and maintaining the software that runs on our compute fleet. The team owns the Kubernetes data plane software, the container runtime, the way we download and run containers, and the maintenance of the underlying compute node’s operating system. The team is responsible for enabling containers to run on our compute nodes as efficiently and reliably as possible. It partners with other teams within Netflix to ensure we stay ahead of the needs of the business and enable other teams to innovate on top of our platform.
You will be joining a team of stunning engineers who tackle some of the largest cloud compute challenges in the industry. You will have a direct line of sight between the work you do and the entertainment of our 300M+ members. This team provides the foundational infrastructure that drives almost all computing at Netflix. The infrastructure enables Netflix’s website, video playback, video encoding work, AI/ML workloads, and numerous other business-critical functions. We free every engineer to have an impact in our “context, not control” culture and to directly contribute to the success of Netflix’s business.
Some examples of the team’s work can be found in tech blogs on “noisy neighbor” detection and mount lock contention.
What You Will Do
- Build and maintain the software that runs our Kubernetes container orchestration platform
- Architect and design innovative solutions to support new workloads and features, and improve the reliability and performance of existing workloads
- Develop and maintain Kubernetes and containerd customizations and plugins
- Contribute to the upstream containerd and Kubernetes projects
- Debug performance and operational problems observed with container workloads
- Continuously and proactively increase efficiency and optimize our compute platform
Qualifications
- Minimum of 5 years of experience evolving Compute infrastructure for a large organization; total 8+ years of software development experience
- Experience supporting containers and related runtimes as a service (e.g. Kubernetes kubelet, containerd, runc, NRI plugins, etc.)
- Experience debugging system performance issues in a Linux environment
- Excellent operational and troubleshooting skills
- Experience designing large-scale distributed systems, preferably a compute orchestration system like Kubernetes
- Proficiency in Go, Java, or C/C++
- Understanding of networking concepts (TCP, IPv4, sockets, host and service networking in a containerized environment)
- Ability to thrive in ambiguity in a “context not control” environment while working with high velocity
- Excellent communication and collaboration skills
Preferred Qualifications
- Track record of successful contributions to open source projects
- Linux kernel development experience
- Experience managing compute infrastructure for AI/ML workloads
Netflix provides comprehensive benefits including Health Plans, Mental Health support, a 401(k) Retirement Plan with employer match, Stock Option Program, Disability Programs, Health Savings and Flexible Spending Accounts, Family-forming benefits, and Life and Serious Injury Benefits. We also offer paid leave of absence programs. Full-time hourly employees accrue 35 days annually for paid time off to be used for vacation, holidays, and sick paid time off. Full-time salaried employees are immediately entitled to flexible time off. See more details about our Benefits here.
Netflix is a unique culture and environment. Learn more here.
Inclusion is a Netflix value and we strive to host a meaningful interview experience for all candidates. If you want an accommodation/adjustment for a disability or any other reason during the hiring process, please send a request to your recruiting partner.
We are an equal-opportunity employer and celebrate diversity, recognizing that diversity builds stronger teams. We approach diversity and inclusion seriously and thoughtfully. We do not discriminate on the basis of race, religion, color, ancestry, national origin, caste, sex, sexual orientation, gender, gender identity or expression, age, disability, medical condition, pregnancy, genetic makeup, marital status, or military service.
Job is open for no less than 7 days and will be removed when the position is filled.
- Seniority level: Not Applicable
- Employment type: Full-time
- Job function: Other
- Industries: Entertainment Providers