NVIDIA Logo

NVIDIA

Senior System Firmware Engineer, RAS - Platform Software

Reposted Yesterday
Be an Early Applicant
Remote
2 Locations
184K-357K
Senior level
Remote
2 Locations
184K-357K
Senior level
Design and implement RAS firmware for NVIDIA’s Arm Data Center products, collaborating with cross-functional teams and debugging system issues.
The summary above was generated by AI

We are looking for a: Sr Software Engineer, RAS Firmware - Platform Software. NVIDIA’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern deep learning — the next era of computing — with the GPU acting as the brain of computers, robots, and self-driving cars that can perceive and understand the world. Today, we are increasingly known as “the AI computing company.” We're looking to grow our company and establish teams with the most thoughtful people in the world. NVIDIA DGX systems deliver the world's leading solutions for enterprise AI infrastructure at scale.  

We are looking for a talented and experienced Datacenter CPU RAS (Reliability, Availability, and Serviceability) firmware engineer. As the CPU RAS Firmware Engineer, you will be responsible for designing and implementing firmware level changes. You will work closely with cross-functional teams, including hardware engineers, system architects, and software developers, to create designs that meet stringent reliability requirements and deliver exceptional customer experiences. Are you ready to change the next generation of computing? Join us at the forefront of technological advancement.  

What you’ll be doing:  

  • Design and develop RAS firmware for NVIDIA’s Arm Data Center products.  

  • Triaging and debugging system, SOC, board, RAS firmware/UEFI related issues on customer, reference, and production platforms.  

  • Working closely with hardware, firmware, and software teams to design features and debug issues.  

  • Engage with customer partners to root cause & resolve customer platform issues.  

  • Engage and coordinate with Quality & Reliability team to support manufacturing/RMA failure issues.  

  • Debug and resolve hardware & firmware issues during the SOC bring up phase.  

  • Work with NVIDIA partners on RAS firmware related issues to improve their use of NVIDIA products.  

  • Contribute to all phases of product development, from product definition and architecture and design, through implementation, debugging, testing and early customer support.  

What we need to see:  

  • BS, MS, or PhD in EE/CS or related field of education (or equivalent experience).

  • 8+ years of experience  

  • Demonstrated experience as a post-silicon debug Engineer, Hardware Test Engineer, and/or similar role.  

  • Familiarity with Linux, Ubuntu and RTOS bring up and ARM based platforms.  

  • Understanding of datacenter server platforms and firmware.  

  • Solid understanding of programming languages such as C, Python and Perl.  

  • Excellent problem-solving skills and attention to details.  

  • Possess excellent written and oral communication skills, good work ethic, high sense of team-work, love to produce quality work and commitment to finish your tasks every single day. 

  • Self-starter who loves to find creative solutions to complicated problems.  

Ways to stand out from the crowd:  

  • Familiar with ARM UEFI firmware development.  

  • Background with Linux kernel development, specifically writing device drivers.  

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people on the planet working for us. If you're creative and autonomous, we want to hear from you!  

The base salary range is 184,000 USD - 356,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Top Skills

Arm
C
Linux
Perl
Python
Rtos
Ubuntu

Similar Jobs

35 Minutes Ago
Remote
Hybrid
Pleasanton, CA, USA
157K-196K Annually
Senior level
157K-196K Annually
Senior level
Cloud • Fintech • Information Technology • Machine Learning • Software • App development • Generative AI
As a Senior Software Engineer at BlackLine, you will design and develop scalable cloud-based backend services, improve technical specifications, and ensure software quality and performance in a collaborative environment.
Top Skills: AgileApigeeAWSAzureElastic SearchGCPJavaKafkaMicroservicesNifiNo-SqlOktaRabbitMQRestful ApisServerless ArchitectureSQL
4 Hours Ago
Remote
Hybrid
Santa Clara, CA, USA
Senior level
Senior level
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
The role involves designing and implementing infrastructure for AI workloads, improving platform reliability, and mentoring colleagues in best practices of software engineering.
Top Skills: AnsibleGitlab CiGoHelmJ2EeJavaKubernetesLinuxPrometheusPythonSplunk
4 Hours Ago
Remote
Hybrid
Addison, IL, USA
Senior level
Senior level
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
The Principal Platform Architect advises clients on achieving business outcomes using ServiceNow, focusing on technical governance and solution design while building relationships with business leaders and guiding architecture personnel.
Top Skills: Ai TechnologiesAmazon Web ServicesAzureOracle CloudSalesforceServicenow PlatformWorkday

What you need to know about the Colorado Tech Scene

With a business-friendly climate and research universities like CU Boulder and Colorado State, Colorado has made a name for itself as a startup ecosystem. The state boasts a skilled workforce and high quality of life thanks to its affordable housing, vibrant cultural scene and unparalleled opportunities for outdoor recreation. Colorado is also home to the National Renewable Energy Laboratory, helping cement its status as a hub for renewable energy innovation.

Key Facts About Colorado Tech

  • Number of Tech Workers: 260,000; 8.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Lockheed Martin, Century Link, Comcast, BAE Systems, Level 3
  • Key Industries: Software, artificial intelligence, aerospace, e-commerce, fintech, healthtech
  • Funding Landscape: $4.9 billion in VC funding in 2024 (Pitchbook)
  • Notable Investors: Access Venture Partners, Ridgeline Ventures, Techstars, Blackhorn Ventures
  • Research Centers and Universities: Colorado School of Mines, University of Colorado Boulder, University of Denver, Colorado State University, Mesa Laboratory, Space Science Institute, National Center for Atmospheric Research, National Renewable Energy Laboratory, Gottlieb Institute

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account