Claroty Logo

Claroty

Principal Site Reliability Engineer (SRE) – Claroty FedRAMP AWS GovCloud (Public Sector)

Posted 2 Days Ago
In-Office or Remote
Hiring Remotely in Washington, DC
Senior level
In-Office or Remote
Hiring Remotely in Washington, DC
Senior level
The Principal Site Reliability Engineer oversees AWS GovCloud infrastructure, ensuring compliance with FedRAMP, enhancing system performance, and managing incident responses using automation tools.
The summary above was generated by AI
Description

We are seeking a skilled Principal Site Reliability Engineer (SRE) to support and maintain Claroty's FedRAMP-compliant deployment in AWS GovCloud for public sector customers. The SRE will be responsible for ensuring high availability, security, and compliance of cloud-based environments while driving automation, monitoring, and incident response best practices.

We’re growing and looking to hire an individual who embodies our core values: People First, Customer Obsession, Strive for Excellence, and Integrity.

About Claroty:   

Claroty has redefined cyber-physical systems (CPS) protection with an unrivaled industry-centric platform built to secure mission-critical infrastructure. The Claroty Platform provides the deepest asset visibility and the broadest, built-for-CPS solution set in the market comprising exposure management, network protection, secure access, and threat detection – whether in the cloud with Claroty xDome or on-premise with Claroty Continuous Threat Detection (CTD). Backed by award-winning threat research and a breadth of technology alliances, The Claroty Platform enables organizations to effectively reduce CPS risk, with the fastest time-to-value and lower total cost of ownership. Our solutions are deployed by over 1,000 organizations at thousands of sites across all seven continents.

A Great Place to Work® certified company, Claroty is headquartered in New York City with employees across the Americas, Europe, Asia-Pacific, and Tel Aviv. The company is widely recognized as the industry leader in CPS protection, with backing from the world’s largest investment firms and industrial automation vendors, recognized by KLAS Research as Best in KLAS for Healthcare IoT Security five years in a row, and ranking on the Forbes Cloud 100 and Deloitte Technology Fast 500 multiple consecutive years. 


Responsibilities
  • AWS GovCloud Operations: Manage and optimize Claroty’s cloud-based infrastructure in AWS GovCloud, ensuring FedRAMP compliance and high availability.
  • Reliability & Performance: Monitor and enhance system performance, scalability, and reliability through observability tools, automation, and best practices.
  • Security & Compliance: Implement and maintain security controls aligned with FedRAMP, NIST 800-53, and other federal cybersecurity standards.
  • Infrastructure as Code (IaC): Develop and manage infrastructure automation using Terraform and Ansible.
  • CI/CD & Automation: Enhance DevSecOps pipelines, automate deployments, and improve system resilience through tools like GitLab CI/CD, Jenkins, and Kubernetes.
  • Incident Response & Monitoring: Implement and manage monitoring solutions (Prometheus, Grafana, ELK Stack), respond to incidents, and conduct post-mortems.
  • Networking & Security: Configure and maintain VPCs, VPNs, security groups, and firewalls in AWS GovCloud, ensuring compliance with FedRAMP requirements.
  • GOV Production Gatekeeper: Manage rollout strategy for new technologies and oversee their execution to ensure minimal disruption to existing systems.
  • GOV Production On-Call: Act as the first line of response for critical incidents, assessing issues, triaging, and coordinating with the team to prevent further problems and swiftly restore services.
  • Monitor Production Performance and Degradation: Monitor system performance metrics closely and detect any degradation early to prevent outages and disruptions.
  • Production Maintenance: Conduct regular infrastructure upgrades to accommodate changes, developments, and advancements in the technological landscape.
  • Manage Release Flow: Oversee the release of updates and new functionalities, ensuring a seamless transition while handling any potential negative impacts on production.
  • Collaboration: Work closely with DevOps, security teams, developers, and federal stakeholders to maintain a compliant and secure cloud environment.

Requirements
  • U.S. Citizenship (required for working in GovCloud environments).
  • 6-8+ years of experience in SRE, DevOps, or Cloud Engineering roles.
  • Hands-on experience with AWS GovCloud, including EC2, EKS, MSK, S3, RDS, IAM, CloudTrail, and CloudWatch.
  • Strong expertise in Infrastructure as Code (Terraform, Ansible).
  • Experience with FedRAMP, NIST 800-53, and cloud security best practices.
  • Proficiency in Kubernetes, Docker, and container orchestration.
  • Knowledge of Linux system administration and scripting (Python, Bash).
  • Experience with logging, monitoring, and observability tools in a cloud-native environment.
  • Strong troubleshooting, problem-solving, and automation mindset.

Cross-Functional Leadership and Execution

Beyond technical expertise, this role requires a high level of autonomy and ownership. The ideal candidate:

  • Leads end-to-end tasks with minimal oversight—from planning through execution and validation.
  • Is an all-around player who understands the broader technical and organizational context and knows how to drive tasks to completion, even when requirements are implicit.
  • Executes with excellence, consistently delivering high-quality outcomes that meet or exceed expectations.
  • Communicates clearly and effectively across all levels of the organization—from engineers to leadership—adapting messages appropriately.
  • Demonstrates strong collaboration skills and team-first attitude, contributing to a healthy team culture while operating at a principal-level mindset and performance.

Top Skills

Ansible
Aws Govcloud
Bash
Cloudtrail
Cloudwatch
Ec2
Eks
Elk Stack
Gitlab Ci/Cd
Grafana
Iam
Jenkins
Kubernetes
Msk
Prometheus
Python
Rds
S3
Terraform

Similar Jobs

8 Days Ago
Remote or Hybrid
USA
110K-137K Annually
Senior level
110K-137K Annually
Senior level
Artificial Intelligence • Healthtech • Machine Learning • Natural Language Processing • Software
The Site Reliability Engineer will manage AWS infrastructure, implementing cloud strategies, automation tools, and ensuring reliability, security, and cost-efficiency.
Top Skills: AnsibleAnsibleApi GatewayArgo CdAWSAws CdkAws CloudtrailAws CloudwatchBashCloudFormationCloudfrontDockerDocumentdbEc2EksGitlabGrafanaHashicorp VaultHelmKubernetesLambdaLokiMimirNew RelicPrometheusPythonRdsS3Secrets ManagerSsmTempoTerraform
7 Days Ago
Easy Apply
Remote or Hybrid
United States
Easy Apply
Senior level
Senior level
Automotive • Fintech • Hardware • Payments • Travel • Financial Services
The Senior Site Reliability Engineer will automate deployment, monitoring, and incident response, collaborating with teams to improve infrastructure and manage cloud technologies.
Top Skills: AnsibleAWSBashJavaPowershellPythonSql Server 2019TerraformWindows Server 2019
5 Days Ago
In-Office or Remote
2 Locations
Senior level
Senior level
Artificial Intelligence • Machine Learning • On-Demand
The Senior Site Reliability Engineer will manage cloud infrastructure, design CI/CD pipelines, automate workflows, and enhance observability and monitoring systems in a collaborative environment.
Top Skills: AWSCi/CdElasticsearchGithub ActionsGrafanaJenkinsKubernetesRedisSpinnakerSQL ServerTerraform

What you need to know about the Colorado Tech Scene

With a business-friendly climate and research universities like CU Boulder and Colorado State, Colorado has made a name for itself as a startup ecosystem. The state boasts a skilled workforce and high quality of life thanks to its affordable housing, vibrant cultural scene and unparalleled opportunities for outdoor recreation. Colorado is also home to the National Renewable Energy Laboratory, helping cement its status as a hub for renewable energy innovation.

Key Facts About Colorado Tech

  • Number of Tech Workers: 260,000; 8.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Lockheed Martin, Century Link, Comcast, BAE Systems, Level 3
  • Key Industries: Software, artificial intelligence, aerospace, e-commerce, fintech, healthtech
  • Funding Landscape: $4.9 billion in VC funding in 2024 (Pitchbook)
  • Notable Investors: Access Venture Partners, Ridgeline Ventures, Techstars, Blackhorn Ventures
  • Research Centers and Universities: Colorado School of Mines, University of Colorado Boulder, University of Denver, Colorado State University, Mesa Laboratory, Space Science Institute, National Center for Atmospheric Research, National Renewable Energy Laboratory, Gottlieb Institute

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account