The Public Sector Site Reliability Engineer will manage cloud infrastructure, ensure compliance with regulations, implement observability tools, lead incident response, and enhance automation in federal environments.
At Unstructured, we’re building the backbone of generative AI—helping federal agencies transform PDFs, HTML, Word docs, images, and more into secure, high-performance data pipelines that scale. Our tools are trusted by nearly half of the Fortune 500 and downloaded more than 38 million times in the open-source community.
We’re expanding our federal/public sector practice, and we’re hiring a Public Sector Site Reliability Engineer (SRE) to help design, scale, and secure the systems that power the next generation of AI-driven workloads for government.
What You’ll Own & Drive
🔐 Mission-Grade Reliability & Security
Design, build, and manage secure, highly available, and scalable cloud infrastructure for federal environments.
Ensure compliance with FedRAMP, FISMA, and other relevant security and regulatory frameworks.
Develop infrastructure as code (IaC) with Terraform, Pulumi, or similar tools for repeatable, compliant deployments.
Build and maintain automated CI/CD pipelines that move fast without sacrificing security or stability.
📊 Full Observability in Sensitive Environments
Implement and maintain monitoring, logging, and alerting (Prometheus, Grafana, Datadog, Elastic).
Enable real-time visibility and rapid response for mission-critical workloads.
Partner with engineering and program teams for high-assurance rollouts.
Lead capacity planning, deployment strategies, and resilient architecture design for federal networks.
🔥 Incident Response & Continuous Improvement
Lead incident response and root-cause analysis with a blameless, systems-thinking approach.
Drive postmortems and reliability improvements.
Enhance developer experience with secure automation and streamlined workflows.
Help teams iterate quickly while maintaining compliance and operational excellence.
What You Bring
5–9 years of experience managing software deployed to US government or Department of Defense (DoD) networks.
Active SECRET clearance required; TS/SCI strongly preferred.
Expertise with AWS GovCloud and/or Azure Government.
Deep experience with Kubernetes, Docker, and container orchestration at scale.
Strong Linux systems and networking fundamentals.
Scripting/automation: Python, Bash, or Go. IaC: Terraform, Pulumi, Ansible (or similar).
Strong grasp of monitoring, logging, and observability best practices.
Travel required up to 20%.
Bonus Points
ML infrastructure or real-time data pipelines experience.
Serverless or event-driven architectures.
Contributions to open-source DevOps/SRE projects.
Hands-on work with US government security/compliance in cloud-native settings.
Unstructured values service and encourages veterans of the US military and civilian agencies to apply for this role.
Why You’ll Love It Here
Mission Impact: Power critical AI workloads in the public sector.
Big Technical Challenges: High-assurance problems at the edge of AI, data, and cloud.
Elite Team: Sharp, low-ego engineers who value execution and learning.
Innovation + Security: Build cutting-edge systems with rigorous reliability for federal use cases.
Top Skills
Ansible
AWS GovCloud
Azure Government
Bash
Datadog
Docker
Elastic
Go
Grafana
Kubernetes
Prometheus
Pulumi
Python
Terraform