We are seeking a skilled Principal Site Reliability Engineer (SRE) to support and maintain Claroty's FedRAMP-compliant deployment in AWS GovCloud for public sector customers. The SRE will be responsible for ensuring high availability, security, and compliance of cloud-based environments while driving automation, monitoring, and incident response best practices.
We’re growing and looking to hire an individual who embodies our core values: People First, Customer Obsession, Strive for Excellence, and Integrity.
About Claroty
Claroty has redefined cyber-physical systems (CPS) protection with an unrivaled industry-centric platform built to secure mission-critical infrastructure. The Claroty Platform provides the deepest asset visibility and the broadest, built-for-CPS solution set in the market, comprising exposure management, network protection, secure access, and threat detection, whether in the cloud with Claroty xDome or on-premises with Claroty Continuous Threat Detection (CTD). Backed by award-winning threat research and a breadth of technology alliances, The Claroty Platform enables organizations to effectively reduce CPS risk, with the fastest time-to-value and lower total cost of ownership. Our solutions are deployed by over 1,000 organizations at thousands of sites across all seven continents.
A Great Place to Work® certified company, Claroty is headquartered in New York City with employees across the Americas, Europe, Asia-Pacific, and Tel Aviv. The company is widely recognized as the industry leader in CPS protection: it is backed by the world’s largest investment firms and industrial automation vendors, has been recognized by KLAS Research as Best in KLAS for Healthcare IoT Security five years in a row, and has ranked on the Forbes Cloud 100 and Deloitte Technology Fast 500 for multiple consecutive years.
Responsibilities
- AWS GovCloud Operations: Manage and optimize Claroty’s cloud-based infrastructure in AWS GovCloud, ensuring FedRAMP compliance and high availability.
- Reliability & Performance: Monitor and enhance system performance, scalability, and reliability through observability tools, automation, and best practices.
- Security & Compliance: Implement and maintain security controls aligned with FedRAMP, NIST 800-53, and other federal cybersecurity standards.
- Infrastructure as Code (IaC): Develop and manage infrastructure automation using Terraform and Ansible.
- CI/CD & Automation: Enhance DevSecOps pipelines, automate deployments, and improve system resilience through tools like GitLab CI/CD, Jenkins, and Kubernetes.
- Incident Response & Monitoring: Implement and manage monitoring solutions (Prometheus, Grafana, ELK Stack), respond to incidents, and conduct post-mortems.
- Networking & Security: Configure and maintain VPCs, VPNs, security groups, and firewalls in AWS GovCloud, ensuring compliance with FedRAMP requirements.
- GOV Production Gatekeeper: Manage rollout strategy for new technologies and oversee their execution to ensure minimal disruption to existing systems.
- GOV Production On-Call: Act as the first line of response for critical incidents, assessing issues, triaging, and coordinating with the team to prevent further problems and swiftly restore services.
- Monitor Production Performance and Degradation: Monitor system performance metrics closely and detect any degradation early to prevent outages and disruptions.
- Production Maintenance: Conduct regular infrastructure upgrades to accommodate changes, developments, and advancements in the technological landscape.
- Manage Release Flow: Oversee the release of updates and new functionalities, ensuring a seamless transition while handling any potential negative impacts on production.
- Collaboration: Work closely with DevOps, security teams, developers, and federal stakeholders to maintain a compliant and secure cloud environment.
Requirements
- U.S. Citizenship (required for working in GovCloud environments).
- 6-8+ years of experience in SRE, DevOps, or Cloud Engineering roles.
- Hands-on experience with AWS GovCloud, including EC2, EKS, MSK, S3, RDS, IAM, CloudTrail, and CloudWatch.
- Strong expertise in Infrastructure as Code (Terraform, Ansible).
- Experience with FedRAMP, NIST 800-53, and cloud security best practices.
- Proficiency in Kubernetes, Docker, and container orchestration.
- Knowledge of Linux system administration and scripting (Python, Bash).
- Experience with logging, monitoring, and observability tools in a cloud-native environment.
- Strong troubleshooting, problem-solving, and automation mindset.
Cross-Functional Leadership and Execution
Beyond technical expertise, this role requires a high level of autonomy and ownership. The ideal candidate:
- Leads end-to-end tasks with minimal oversight—from planning through execution and validation.
- Is an all-around player who understands the broader technical and organizational context and knows how to drive tasks to completion, even when requirements are implicit.
- Executes with excellence, consistently delivering high-quality outcomes that meet or exceed expectations.
- Communicates clearly and effectively across all levels of the organization—from engineers to leadership—adapting messages appropriately.
- Demonstrates strong collaboration skills and a team-first attitude, contributing to a healthy team culture while operating with a principal-level mindset and performance.