NVIDIA Logo

NVIDIA

Senior Systems Software Engineer, Containers and Kubernetes

Posted 4 Days Ago
Be an Early Applicant
In-Office or Remote
2 Locations
184K-357K
Senior level
In-Office or Remote
2 Locations
184K-357K
Senior level
Develop and operate GPU infrastructure management systems across Clouds, focusing on Kubernetes operators and integrations with datacenter software for enterprise solutions.
The summary above was generated by AI

NVIDIA is looking for outstanding software and systems engineers to help us develop and operate our enterprise GPU infrastructure management systems across Clouds. In this role, you will work closely with the broader NVIDIA team to operate, design and build infrastructure management systems, Kubernetes operators, and end-to-end HPC integration solutions that combine GPUs with the rest of the datacenter software management ecosystem. We are focused on supporting NVIDIA products across HPC, Cloud, and enterprise on both bare metal and virtualized platforms as the role of GPUs in all of these environments expands. Your contributions will span many aspects of GPU systems management, including Cloud provisioning, observability, operations and incident response. The systems you operate will support single-node developer systems through large clusters with thousands of nodes deployed on multiple Cloud providers.

To succeed, you must have a strong system and software development background, familiarity with modern distributed systems especially the Cloud-native ecosystem, and a proven work ethic. This is a dynamic work environment with many exciting opportunities awaiting. NVIDIA GPUs are central to many hot enterprise, cloud, and datacenter trends, come join us as we craft the future of accelerated computing and AI.

What you'll be doing:

  • Enable GPU provisioning and life-cycle with state-of-the-art Cloud-Native open-source ecosystem solutions, including Kubernetes, Docker, Prometheus, TerraForm and Crossplane.

  • Develop, maintain and/or operate robust, scalable Go programs in a Kubernetes environment.

  • Develop the next-generation multi-cloud infrastructure management systems to support GenAI.

  • Support internal and external users through bug fixes, documentation, and feature improvements.

  • Maintain high-quality products through robust test coverage and Day 2 capabilities.

What we need to see:

  • BS or higher in Computer Science or equivalent experience.

  • 8+ years of meaningful industry experience with a strong Kubernetes and SRE background

  • Deep understanding and execution skills of all aspects of the software development lifecycle

  • Experience with OpenAPI and Kubernetes Custom Resource Definitions

  • Business level English, outstanding written and verbal interpersonal skills

  • Strong motivation and commitment to learn new skills

  • Ability to manage time in a fast, heavily multitasked environment

Ways to stand out from the crowd:

  • Open-Source contributions to the Cloud-Native community and an understanding of AI and LLM principles

  • Strong experience with GitHub/GitLab CI/CD pipelines and application configuration.

  • Strong knowledge of container technologies, orchestration frameworks and observability systems.

  • Exposure to GPU programming with CUDA and familiarity with Kubernetes internals. Experience in developing Kubernetes operators.

  • Experience with managing and operating HPC schedulers and/or working across multiple Cloud providers.

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people on the planet working for us. If you're creative and autonomous, we want to hear from you!

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 184,000 USD - 287,500 USD for Level 4, and 224,000 USD - 356,500 USD for Level 5.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until October 10, 2025.NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Top Skills

Crossplane
Cuda
Docker
Go
Kubernetes
Prometheus
Terraform

Similar Jobs

Yesterday
In-Office or Remote
7 Locations
224K-426K
Senior level
224K-426K
Senior level
Artificial Intelligence • Computer Vision • Hardware • Robotics • Metaverse
The role involves building platform software with open-source container runtimes and Kubernetes, requiring strong programming skills and systems software experience.
Top Skills: CContainersGoKubernetesRust
12 Minutes Ago
Remote or Hybrid
Fort Walton Beach, FL, USA
78K-132K Annually
Mid level
78K-132K Annually
Mid level
Aerospace • Hardware • Information Technology • Security • Software • Cybersecurity • Defense
The Master Scheduler coordinates and schedules projects, manages budgets, mitigates risks, and ensures on-time delivery while maintaining high-quality standards.
Top Skills: ExcelMs ProjectProject Management Software
19 Minutes Ago
Remote
Pennsylvania, USA
124K-191K Annually
Senior level
124K-191K Annually
Senior level
Healthtech • Logistics • Pharmaceutical
The Director will lead AI initiatives, manage a technical team, and drive innovation in AI/ML solutions to generate business insights.
Top Skills: AIAzureDatabricksMlMlopsNeo4JPythonR

What you need to know about the Colorado Tech Scene

With a business-friendly climate and research universities like CU Boulder and Colorado State, Colorado has made a name for itself as a startup ecosystem. The state boasts a skilled workforce and high quality of life thanks to its affordable housing, vibrant cultural scene and unparalleled opportunities for outdoor recreation. Colorado is also home to the National Renewable Energy Laboratory, helping cement its status as a hub for renewable energy innovation.

Key Facts About Colorado Tech

  • Number of Tech Workers: 260,000; 8.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Lockheed Martin, Century Link, Comcast, BAE Systems, Level 3
  • Key Industries: Software, artificial intelligence, aerospace, e-commerce, fintech, healthtech
  • Funding Landscape: $4.9 billion in VC funding in 2024 (Pitchbook)
  • Notable Investors: Access Venture Partners, Ridgeline Ventures, Techstars, Blackhorn Ventures
  • Research Centers and Universities: Colorado School of Mines, University of Colorado Boulder, University of Denver, Colorado State University, Mesa Laboratory, Space Science Institute, National Center for Atmospheric Research, National Renewable Energy Laboratory, Gottlieb Institute

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account