Anduril Logo

Anduril

Senior Site Reliability Engineer

Posted 6 Days Ago
Be an Early Applicant
In-Office
Costa Mesa, CA
166K-220K Annually
Senior level
In-Office
Costa Mesa, CA
166K-220K Annually
Senior level
As a Senior Site Reliability Engineer, you will build and operate infrastructure, manage CI/CD pipelines, automate systems, and collaborate across teams for digital shipbuilding. Your role will emphasize security, reliability, and performance optimization in deploying machine learning models and applications.
The summary above was generated by AI

Anduril Industries is a defense technology company with a mission to transform U.S. and allied military capabilities with advanced technology. By bringing the expertise, technology, and business model of the 21st century’s most innovative companies to the defense industry, Anduril is changing how military systems are designed, built and sold. Anduril’s family of systems is powered by Lattice OS, an AI-powered operating system that turns thousands of data streams into a realtime, 3D command and control center. As the world enters an era of strategic competition, Anduril is committed to bringing cutting-edge autonomy, AI, computer vision, sensor fusion, and networking technology to the military in months, not years.

ABOUT THE TEAM

Anduril Maritime delivers platforms, systems, and integrated effects in the maritime domain. Our autonomous vehicles (sub-surface and surface) are the cornerstone of these capabilities, and we continually strive to push the boundaries of the possible in terms of endurance, autonomy and mission capability. The Maritime team develops and maintains core products and payloads, and adapts and applies those products to serve a wide variety of defense, IC and commercial customers in US and international markets.

ABOUT THE JOB

As a Senior Site Reliability Engineer on the Maritime Digital Shipbuilding team, you will build and operate the infrastructure that keeps our digital production systems running at full speed. You’ll develop and manage CI/CD pipelines, automate infrastructure with code, and deploy applications and machine learning models across cloud and edge environments with security, traceability, and reliability in mind.
You’ll work closely with software, data, and operations engineers to turn designs into working systems—streamlining development, improving performance, and keeping production stable as we scale. You’ll also collaborate with digital, manufacturing, and corporate technology teams across Anduril in a high-tech, fast-paced culture of innovation focused on solving real problems and delivering results.
If you’re driven to build systems that last, thrive on deep technical challenges, and want to see your work directly shape how we design, build, and sustain complex platforms, you’ll be helping build the future of digital shipbuilding and the next generation of maritime vehicles.

WHAT YOU'LL DO
  • Build and Manage CI/CD Pipelines: Develop and maintain CI/CD pipelines using tools like GitHub Actions and Jfrog Artifactory to ensure seamless integration and deployment of machine learning models and applications.
  • Infrastructure as Code (IaC): Utilize Terraform and Ansible to automate infrastructure provisioning and management on cloud platforms such as Azure, AWS, or Google Cloud Platform (GCP).
  • Containerization and Orchestration: Implement containerization solutions with Docker and manage container orchestration using Kubernetes to ensure reliable deployment and scaling of applications.
  • Model Management and Deployment: Set up and maintain model registries and feature stores (e.g., MLflow, Kubeflow), and manage deployment pipelines for both batch and real-time inference.
  • Monitoring and Logging: Establish comprehensive monitoring and logging solutions using tools like ELK Stack (Elasticsearch, Logstash, Kibana), Prometheus, and Grafana to ensure the smooth operation of deployment environments.
  • Collaborate with Cross-Functional Teams: Work closely with development, data science, and operations teams to foster collaboration and ensure the efficient and effective deployment of machine learning models.
  • Optimize Performance: Utilize parallel computing frameworks such as CUDA and OpenCL to accelerate high-performance computing tasks, ensuring timely processing of large datasets and complex simulations.
REQUIRED QUALIFICATIONS
  • Advanced proficiency in programming languages (Python for scripting and integration).
  • Experience with CI/CD tools like GitHub Actions, Jfrog Artifactory, and Git.
  • Proficiency with IaC tools (Terraform, Ansible).
  • Experience with cloud platforms (Azure, AWS, GCP).
  • Proficiency in containerization (Docker) and container orchestration (Kubernetes).
  • Knowledge of model registries and feature stores (e.g., MLflow, Kubeflow).
  • Experience with logging and monitoring tools (ELK Stack, Prometheus, Grafana).
  • Understanding of parallel computing frameworks (CUDA, OpenCL).
  • Strong collaboration skills and proficiency with collaborative tools (JIRA, Confluence).
  • Eligible to obtain and maintain an active U.S. Secret security clearance.
PREFERRED QUALIFICATIONS
  • Previous experience in a manufacturing or industrial setting.
  • Familiarity with observability concepts and tools.
  • Knowledge of security best practices for DevOps and MLOps.
US Salary Range
$166,000$220,000 USD

 

The salary range for this role is an estimate based on a wide range of compensation factors, inclusive of base salary only. Actual salary offer may vary based on (but not limited to) work experience, education and/or training, critical skills, and/or business considerations. Highly competitive equity grants are included in the majority of full time offers; and are considered part of Anduril's total compensation package. Additionally, Anduril offers top-tier benefits for full-time employees, including: 

Healthcare Benefits 

  • US Roles: Comprehensive medical, dental, and vision plans at little to no cost to you. 
  • UK & AUS Roles: We cover full cost of medical insurance premiums for you and your dependents. 
  • IE Roles: We offer an annual contribution toward your private health insurance for you and your dependents. 

Additional Benefits 

  • Income Protection: Anduril covers life and disability insurance for all employees. 
  • Generous time off: Highly competitive PTO plans with a holiday hiatus in December. Caregiver & Wellness Leave is available to care for family members, bond with a new baby, or address your own medical needs. 
  • Family Planning & Parenting Support: Coverage for fertility treatments (e.g., IVF, preservation), adoption, and gestational carriers, along with resources to support you and your partner from planning to parenting. 
  • Mental Health Resources: Access free mental health resources 24/7, including therapy and life coaching. Additional work-life services, such as legal and financial support, are also available. 
  • Professional Development: Annual reimbursement for professional development 
  • Commuter Benefits: Company-funded commuter benefits based on your region. 
  • Relocation Assistance: Available depending on role eligibility. 

Retirement Savings Plan 

  • US Roles: Traditional 401(k), Roth, and after-tax (mega backdoor Roth) options. 
  • UK & IE Roles: Pension plan with employer match. 
  • AUS Roles: Superannuation plan. 

The recruiter assigned to this role can share more information about the specific compensation and benefit details associated with this role during the hiring process. 

To view Anduril's candidate data privacy policy, please visit https://anduril.com/applicant-privacy-notice/. 

Top Skills

Ansible
AWS
Azure
C++
Confluence
Cuda
Docker
Elk Stack
Github Actions
Google Cloud Platform
Grafana
Jfrog Artifactory
JIRA
Kubeflow
Kubernetes
Mlflow
Opencl
Prometheus
Python
Terraform

Similar Jobs at Anduril

Yesterday
In-Office
Irvine, CA, USA
191K-287K Annually
Senior level
191K-287K Annually
Senior level
Aerospace • Artificial Intelligence • Hardware • Robotics • Security • Software • Defense
The Senior Site Reliability Engineer designs, deploys, and maintains cloud infrastructure, enhances system resilience and leads complex projects, collaborating with teams and promoting best practices.
Top Skills: ArgocdAWSAzureC++DockerGoHelmKubernetesPythonRustTerraform
23 Days Ago
In-Office
Costa Mesa, CA, USA
191K-208K Annually
Senior level
191K-208K Annually
Senior level
Aerospace • Artificial Intelligence • Hardware • Robotics • Security • Software • Defense
As a Site Reliability Engineer, develop solutions for deployment engineers, ensure scalable system delivery, and improve operational capabilities for military technologies.
Top Skills: C++Cloud TechnologiesCybersecurityGoNetworkingPythonRust
25 Minutes Ago
In-Office
Costa Mesa, CA, USA
146K-220K Annually
Mid level
146K-220K Annually
Mid level
Aerospace • Artificial Intelligence • Hardware • Robotics • Security • Software • Defense
As a Systems Engineer, you'll design and support the integration of subsystems for Maritime's Autonomous Underwater Vehicles, ensuring system performance and collaborating across departments.
Top Skills: Autonomous Underwater VehiclesComplex Robotic SystemsComputer ScienceCybersecurityEngineeringMechatronicsRobotic PerceptionRobotics

What you need to know about the Colorado Tech Scene

With a business-friendly climate and research universities like CU Boulder and Colorado State, Colorado has made a name for itself as a startup ecosystem. The state boasts a skilled workforce and high quality of life thanks to its affordable housing, vibrant cultural scene and unparalleled opportunities for outdoor recreation. Colorado is also home to the National Renewable Energy Laboratory, helping cement its status as a hub for renewable energy innovation.

Key Facts About Colorado Tech

  • Number of Tech Workers: 260,000; 8.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Lockheed Martin, Century Link, Comcast, BAE Systems, Level 3
  • Key Industries: Software, artificial intelligence, aerospace, e-commerce, fintech, healthtech
  • Funding Landscape: $4.9 billion in VC funding in 2024 (Pitchbook)
  • Notable Investors: Access Venture Partners, Ridgeline Ventures, Techstars, Blackhorn Ventures
  • Research Centers and Universities: Colorado School of Mines, University of Colorado Boulder, University of Denver, Colorado State University, Mesa Laboratory, Space Science Institute, National Center for Atmospheric Research, National Renewable Energy Laboratory, Gottlieb Institute

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account