The Site Reliability Engineer will ensure service quality and availability, manage AWS infrastructure, implement SRE best practices, and collaborate with teams.
Are you motivated by an incredible sense of purpose in doing work that helps keep people safe? Are you passionate about innovating on cutting edge technology to develop robust architecture principles, operability guidelines, progressive scaling methodologies, and implementing other sophisticated techniques to reliably operate infrastructure at scale? Do you have an appetite for securing systems, streamlining efficiency, automating away toil, and proactively eliminating problems before they occur? If so, this position is a perfect opportunity for you to join the Everbridge Federal Platform team.
As part of the Everbridge Federal Platform team, you will play a critical role in ensuring the overall service quality and availability of Everbridge's solutions. This includes designing, deploying, managing services at scale, evangelizing both SRE best practices, and helping to push the boundaries of the latest technology. The platforms that you will support are critical to the delivery of time sensitive information to help keep people safe and businesses running. We are dedicated, passionate people who are committed to customer service and doing the right thing.
What You'll Do:
- Keep people safe and businesses running.
- Be an integral member of the team implementing our platform in a DoD IL4 cloud environment.
- Maintain infrastructure from conception to completion within AWS. Including services such as VPCs, EC2, Transit Gateways, IAM roles and policies, Route53, S3, SGs, NACLs
- Build upon the operational availability, security, scalability, efficiency, monitoring, instrumentation, and overall service reliability of Everbridge's solutions.
- Collaborate across Agile teams with Architects, Developers, Quality, Data, Security, and other engineers on designing and implementing highly reliable solutions.
- Research and implement SRE and best practices and by creating automation, cross-functional collaboration, and data-driven decisions to reinforce the integrity and reliability of our systems.
- Participate in a rotating on-call rotation to resolve production escalations
What You'll Bring:
- 2+ years of technical AWS experience, managing and owning systems in a production environment
- 1+ years of Kubernetes experience (EKS, AKS, GKE, Self-managed)
- 2+ years of Terraform or similar IaC experience
- 2+ years of experience with MongoDB or ElasticSearch/ELK administration
- 2+ years of experience with application development or writing automation in Java
- Experience with the following tooling: GitLab CICD, Packer, Docker, EKS, Kubernetes, Spinnaker, Helm, Argo, Jenkins
- Experience with Telemetry tools such as Datadog, SumoLogic, Grafana, Prometheus
- Experience with configuration management tools such as Salt, Ansible, AWS user_data
- Experience with a DevOps/SRE production environment
- Experience with Agile practices
- UNIX/Linux experience
- Experience working on DoD programs
- Currently hold a Secret Clearance or a be a US citizen with the ability to obtain a Secret Clearance
- Must have or be able to obtain and maintain DoD 8140 “Intermediate” level or higher certification (formally DoD 8170 IAM Level II)
The reasonably estimated salary for this role at Everbridge ranges from $84,000 - $112,000 and may also include variable compensation. Actual compensation is based on factors such as the candidate's skills, qualifications, and experience. In addition, Everbridge offers a wide range of best in class, comprehensive and inclusive employee benefits for this role including healthcare, dental, parental planning, and mental health benefits, disability income benefits, life and AD&D insurance, a 401(k) plan and match, paid time off, and fitness reimbursements.
Fair Chance Statement US & Canada
We are committed to providing equal employment opportunities in compliance with all applicable Federal, Provincial/State and Local laws, including the California Fair Chance Act and any local County Fair Chance Ordinance (or local equivalent). Pursuant to these and other relevant regulations, we consider qualified applicants with criminal histories in a manner consistent with the law.
For roles subject to background checks, the following material job duties may be affected by an applicant’s criminal history:
- Access to sensitive or confidential information, such as financial records, proprietary data, or client information.
- Management of cash, company funds, or other valuable assets.
- Work in environments requiring heightened security measures.
- Compliance with contractual or regulatory requirements specific to the position.
We evaluate each applicant's criminal history individually, considering its nature, timing, and relevance to the specific job duties, while maintaining our commitment to fair hiring practices and promoting workplace equity.
About Everbridge
Everbridge empowers enterprises and government organizations to anticipate, mitigate, respond to, and recover stronger from critical events. In today’s unpredictable world, resilient organizations minimize impact to people and operations, absorb stress, and return to productivity faster when deploying critical event management (CEM) technology. Everbridge digitizes organizational resilience by combining intelligent automation with the industry’s most comprehensive risk data to Keep People Safe and Organizations Running™. For more information, visit www.everbridge.com, read the company blog, and follow on Twitter. Everbridge… Empowering Resilience
Everbridge is an Equal Opportunity/Affirmative Action Employer. All qualified Applicants will receive consideration for employment without regard to race, creed, color, religion, or sex including sexual orientation and gender identity, national origin, disability, protected Veteran Status, or any other characteristic protected by applicable federal, state, or local law.
Top Skills
Ansible
Argo
AWS
Datadog
Docker
Elasticsearch
Gitlab Cicd
Grafana
Helm
Java
Jenkins
Kubernetes
Linux
MongoDB
Packer
Prometheus
Salt
Spinnaker
Sumologic
Terraform
Unix
Similar Jobs
Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
As an SRE Manager, you will oversee a team ensuring operational excellence in a production environment, manage incident responses, and lead infrastructure planning efforts.
Top Skills:
FreebsdLinuxStorage Area NetworksVMware
AdTech • Big Data • Digital Media • Software
The Principal Site Reliability Engineer will lead technical initiatives, enhance operational reliability, and mentor teams while focusing on automation and infrastructure improvements.
Top Skills:
AnsibleArgo CdAws EcrCi/CdGitGithub ActionsJenkinsKubernetesNexusPuppetTerraform
Artificial Intelligence • Fintech • Information Technology • Software • Data Privacy
The Principal Site Reliability Engineer ensures that SaaS products are fast and stable, focusing on automating processes, monitoring systems, and collaborating across teams to enhance product performance and reliability.
Top Skills:
AnsibleAppdynamicsAzureAzure DevopsBashC# .NetCosmosDatadogDynatraceHarnessIdera Sql Diagnostic ManagerJavaJenkinsKubernetesNew RelicPowershellPythonRedgate Sql MonitorSolarwinds Database Performance AnalyzerSQLTerraform
What you need to know about the Colorado Tech Scene
With a business-friendly climate and research universities like CU Boulder and Colorado State, Colorado has made a name for itself as a startup ecosystem. The state boasts a skilled workforce and high quality of life thanks to its affordable housing, vibrant cultural scene and unparalleled opportunities for outdoor recreation. Colorado is also home to the National Renewable Energy Laboratory, helping cement its status as a hub for renewable energy innovation.
Key Facts About Colorado Tech
- Number of Tech Workers: 260,000; 8.5% of overall workforce (2024 CompTIA survey)
- Major Tech Employers: Lockheed Martin, Century Link, Comcast, BAE Systems, Level 3
- Key Industries: Software, artificial intelligence, aerospace, e-commerce, fintech, healthtech
- Funding Landscape: $4.9 billion in VC funding in 2024 (Pitchbook)
- Notable Investors: Access Venture Partners, Ridgeline Ventures, Techstars, Blackhorn Ventures
- Research Centers and Universities: Colorado School of Mines, University of Colorado Boulder, University of Denver, Colorado State University, Mesa Laboratory, Space Science Institute, National Center for Atmospheric Research, National Renewable Energy Laboratory, Gottlieb Institute