Mirantis is the Kubernetes-native AI infrastructure company, enabling organizations to build and operate scalable, secure, and sovereign infrastructure for modern AI, machine learning, and data-intensive applications. By combining open source innovation with deep expertise in Kubernetes orchestration, Mirantis empowers platform engineering teams to deliver composable, production-ready developer platforms across any environment—on-premises, in the cloud, at the edge, or in sovereign data centers. As enterprises navigate the growing complexity of AI-driven workloads, Mirantis delivers the automation, GPU orchestration, and policy-driven control needed to manage infrastructure with confidence and agility. Committed to open standards and freedom from lock-in, Mirantis ensures that customers retain full control of their infrastructure strategy.
https://www.mirantis.com/
Job DescriptionThe Dedicated L2 Technical Support Engineer is responsible for solving complex and high-impact technical issues within large-scale customer cloud environments. This role focuses on advanced troubleshooting, incident response, and problem management for one of Mirantis’ largest customers, with a strong emphasis on OpenStack- and Kubernetes-based platforms.
This engineer acts as a technical leader during their shift, maintaining continuous situational awareness, coordinating cross-functional resources, and driving issues to resolution under pressure. They are expected to lead incident calls, communicate clearly and decisively with customers, and take ownership of escalations from initial detection through full resolution.
Main Responsibilities:
Troubleshoot private cloud deployments based on OpenStack, Kubernetes, Mirantis Container Runtime (MCR), and related cloud technologies, detecting, reporting, and resolving complex issues across the stack.
Provide high-tier support for critical product issues escalated by peers or management, including leading high-severity incident calls and coordinating resolution efforts across multiple teams.
Act as a shift-level technical leader, maintaining awareness of platform health, responding to alerts, and proactively managing emerging issues within customer environments.
Perform cluster upgrades and lifecycle operations as new releases become available, ensuring minimal disruption and clear communication throughout the process.
Communicate urgently, clearly, and in detail with customers during incidents, providing accurate status updates and guiding them through troubleshooting and resolution.
Own escalations end-to-end by routing issues to the appropriate teams, including OpenStack, Ceph and storage, networking, hardware, and infrastructure, while maintaining accountability and follow-through.
Lead incident management efforts by structuring outage calls, distinguishing root cause from contributing factors, documenting findings, and driving corrective actions to completion.
Reproduce customer issues in internal lab environments, validate defects, and provide detailed diagnostics and reproduction steps to development teams.
Work closely with engineering teams to review customer issues, suggest improvements, identify potential product defects, and track fixes through delivery.
Qualifications
High school diploma or equivalent required; four-year degree preferred or equivalent experience with three or more years in a senior or Tier 3 systems administration role.
Strong experience with OpenStack core services including Nova, Neutron, Cinder, Keystone, and Glance.
Hands-on knowledge of Neutron networking, including OVS and OVN, VLAN and VXLAN, and SDN concepts.
Experience with Cinder storage backends, including Ceph RBD and LVM.
Strong Ceph experience, including troubleshooting cluster health, OSD and MON issues, performance tuning, and recovery operations.
Working knowledge of Kubernetes architecture and troubleshooting containerized workloads.
Advanced Linux system administration skills.
Strong understanding of networking fundamentals.
Excellent written and verbal English communication skills.
Preferred Qualifications
Experience with container technologies such as Docker and Kubernetes; certifications are a plus.
Experience with databases and messaging systems including MySQL, Galera, PostgreSQL, RabbitMQ, InfluxDB, Elasticsearch, Cassandra, and Zookeeper.
Experience with monitoring and observability tools such as Grafana, Kibana, and Prometheus.
Experience with automation and configuration management tools including Ansible, Salt, Puppet, Chef, and Helm.
What does Mirantis offer you?
Work with an established Silicon Valley leader in the cloud infrastructure industry.
Work with exceptionally passionate, talented and engaging colleagues, helping Fortune 500 and Global 2000 customers implement next-generation cloud technologies.
Be a part of cutting-edge, open-source innovation.
Thrive in the high-energy environment of a young company where openness, collaboration, risk-taking, and continuous growth are valued.
Receive a competitive compensation package with strong benefits plan
We are a Leader for Container Management in G2 (#2 after AWS)!
Top Skills
Similar Jobs
What you need to know about the Colorado Tech Scene
Key Facts About Colorado Tech
- Number of Tech Workers: 260,000; 8.5% of overall workforce (2024 CompTIA survey)
- Major Tech Employers: Lockheed Martin, Century Link, Comcast, BAE Systems, Level 3
- Key Industries: Software, artificial intelligence, aerospace, e-commerce, fintech, healthtech
- Funding Landscape: $4.9 billion in VC funding in 2024 (Pitchbook)
- Notable Investors: Access Venture Partners, Ridgeline Ventures, Techstars, Blackhorn Ventures
- Research Centers and Universities: Colorado School of Mines, University of Colorado Boulder, University of Denver, Colorado State University, Mesa Laboratory, Space Science Institute, National Center for Atmospheric Research, National Renewable Energy Laboratory, Gottlieb Institute


