Consensus Cloud Solutions

Site Reliability Engineer I

Reposted 14 Days Ago
Remote
Hiring Remotely in US
180K-200K
Senior level
The Site Reliability Engineer I will automate and streamline software delivery, manage cloud infrastructure on AWS, and optimize CI/CD pipelines while collaborating with engineering teams to ensure reliability and performance.
The summary above was generated by AI

Consensus Cloud Solutions is a publicly traded, leading digital cloud fax and interoperability solutions organization in the United States and globally, focusing on connecting and empowering healthcare providers, payers, care teams, and technology innovators to unify multiple systems that wouldn’t otherwise talk to each other. Consensus is a trailblazer in our industry and believes that data transformation will reshape the world of healthcare.

Founded over 25 years ago, Consensus leverages its technology heritage to move from simple digital documents to advanced healthcare standards (HL7/FHIR) for secure data transport, as well as Natural Language Processing (NLP) and Artificial Intelligence (AI) to convert unstructured data into structured, analytics-ready data, helping users unveil information that is meaningful and actionable for better patient care.

Consensus leads the industry in data exchange solutions and we’re only getting started! With exciting new initiatives on the horizon, we are continuing our strategic expansion and we are looking to add to our diverse team of innovators. 

Now is the ideal time to join us in our mission to solve healthcare’s biggest challenges, and work collaboratively with a diverse team of like-minded self-starters and partners to accomplish it. 

Consensus Cloud Solutions is an Equal Opportunity Employer. We celebrate diversity and are committed to creating an inclusive and equitable environment for all employees. We offer many remote and hybrid career opportunities.

How you will impact the organization…

Reporting to the Director, Infrastructure Operations, the SRE I (Site Reliability Engineer I) with a strong DevOps focus is a key member of a team responsible for supporting the tooling, pipelines, frameworks, and other technologies that underpin the many platforms deployed within the company’s infrastructure. This role blends software engineering principles with deep DevOps expertise to automate and streamline the entire software delivery lifecycle. Additionally, the SRE I will be an expert in Infrastructure as Code (IaC), AWS Cloud infrastructure design and best practices, and CI/CD platforms and processes, driving the automation and optimization of our operations.

As SRE I, they will partner closely with Engineering and Information Security peers on developing infrastructure solutions that follow established best practices and design patterns. They will also contribute to the continued development of RFCs, standards, and frameworks for IaC, automation, and supporting tools. Responsibilities also include the development of internal tooling, modules, and libraries used by technology teams to both implement new projects and maintain and enhance existing platforms with a focus on automation, resiliency, availability, scalability, and performance that meet business needs within appropriate cost constraints, primarily leveraging open-source technologies and frameworks.

This position will champion a DevOps culture and practices, providing expert full-stack support to software engineering teams (Java, Python, Node, Go, etc.) by integrating DevOps methodologies into their development and deployment workflows. Strong documentation skills and the ability to mentor other team members in DevOps, IaC, CI/CD, and operational best practices are essential.


The value you will deliver…

  • Lead the design, development, and maintenance of secure, scalable, resilient, and cost-effective cloud infrastructure solutions on AWS through a DevOps approach, leveraging the existing IaC framework (based on Python, Terraform, and Terragrunt) that manages AWS resources; champion IaC best practices while ensuring adherence to standards for security, reliability, performance, cost optimization, and operational excellence.
  • Design, implement, manage, and optimize robust CI/CD pipelines using tools like GitHub Actions and AWS CodePipeline for both infrastructure and applications; maintain deep expertise in GitHub.
  • Design, develop, and implement new tooling, applications, and platforms to improve and upgrade the capabilities of the IaC and automation platforms, and support infrastructure.
  • Provide expert DevOps-focused full-stack guidance and support to software engineering teams (using common languages such as Java, Python, Node, Go, etc.) to integrate DevOps practices, automate builds/deployments, identify/resolve reliability/performance bottlenecks, and establish comprehensive documentation. 
  • Champion and implement DevOps best practices across teams, fostering a culture of collaboration, automation, and continuous improvement, as well as providing mentorship and leadership in DevOps methodologies, IaC, CI/CD, and cloud technologies.
  • Participate in grooming and prioritizing development efforts in extending and supporting the IaC, tooling, and infrastructure support application platforms.
  • Partner with other teams across the technology group to propose and draft RFCs and standards for development of best practices and design patterns for applications and platforms using IaC and the established deployment pipelines and tooling.
  • Research, propose, and implement solutions to improve and upgrade cloud-based resources, infrastructure, and systems, ensuring they are performant, efficient, and resilient.
  • Initiate reviews to ensure existing platforms are resilient, efficient, and cost-effective.
  • Monitor ticket queues, provide timely and accurate updates, and resolve feature request and development tickets.
  • Monitor and respond to requests and questions in Slack channels, providing guidance to and assisting troubleshooting for developers and team members.
  • Create tickets and participate in deployments following Change Management procedures.
  • Participate in a 24/7 on-call rotation to respond to and resolve production incidents; lead and contribute to blameless postmortems.
  • Define and manage Service Level Objectives (SLOs) and Service Level Indicators (SLIs).
  • Evaluate, implement, and support Open Source frameworks and projects (e.g., ECS/Docker, Prometheus, Grafana, ELK stack, Kafka).
  • Update and maintain documentation for troubleshooting, and Methods of Procedure for deployments.
  • Ensure systems and users follow security standards and established policy, and support audit processes.
  • Light travel to summit meetings or conferences may be required.
  • Perform other duties and responsibilities as required, assigned, or requested. Consensus reserves the right to add or change duties at any time.

What you will bring to the table…

  • 6+ years of hands-on experience managing and automating UNIX/Linux system environments within a DevOps context.
  • 5+ years of experience in a DevOps Engineer or SRE role with a strong DevOps focus, emphasizing infrastructure automation, CI/CD pipeline development, and cloud services.
  • 4+ years of experience designing and implementing infrastructure as code within the AWS ecosystem using Terraform.
  • A security clearance or the ability to obtain a security clearance is required.
  • Mastery in DevOps discipline and processes, including building and managing CI/CD pipelines supporting Infrastructure as Code frameworks such as Terraform, and Continuous Delivery Tools such as AWS CodePipeline, GitHub Actions, Jenkins, Git, Artifactory, etc.
  • Expert-level proficiency with Terraform and Terragrunt for managing AWS infrastructure as code.
  • Deep expertise in GitHub and GitHub Actions for CI/CD, including design, troubleshooting, and support.
  • Strong experience with AWS Cloud services (e.g., EC2, S3, RDS, VPC, IAM, Lambda, EKS/ECS, CloudWatch) and infrastructure design best practices, applied within a DevOps model.
  • Mastery of observability, monitoring, metrics and alerting at scale across regionally and globally resilient and distributed platforms leveraging common open source frameworks such as Prometheus, Thanos, OpenTelemetry, Grafana, etc.
  • Experience providing DevOps-centric support for applications developed in Java, Python, and Angular, including build automation, deployment pipelines, and observability.
  • Expert level proficiency in at least one scripting language (e.g., Python, Bash, Perl) and one programming language (e.g., Java, Go, Node). (Code samples and/or GitHub links to prior work desirable).
  • Mastery of Containerization (Docker), and strong familiarity with the container ecosystem, especially Amazon ECS.
  • Mastery of configuration automation tool sets such as AWS Config and/or SSM, Puppet, Ansible, Chef, etc., including solid knowledge of the concepts and practices surrounding such solutions.
  • Hands-on experience with APM tools such as Zipkin, Jaeger, OpenTelemetry, New Relic, etc.
  • Proficient with Jira, Confluence, and the Git toolset.
  • Hands-on experience with Agile/Scrum & Waterfall process environments.
  • Experience implementing and supporting a variety of Open Source frameworks and projects relevant to DevOps and SRE.
  • Consistently exhibits personal accountability for outcomes to all team members, peers, and stakeholders.
  • Able to prioritize and manage multiple projects simultaneously in order to meet deadlines.
  • Self-starter able to work independently with minimal supervision, with strong organization and communication skills to ensure alignment with team and project goals.
  • Driven to learn and stay abreast of the latest technologies and DevOps best practices.
  • Strong analytical and problem-solving skills with a proactive, blameless, and detail-oriented approach.
  • Excellent communication and collaboration skills, essential for fostering a DevOps culture and working effectively across teams; ability to mentor others.

You will stand out if you also have…

  • Experience with PCI, HITRUST, FedRAMP/GovCloud, and/or similar certification methodologies.
  • Experience with migrating and educating teams to newer SDLC and DevOps concepts.
  • Experience with APM/Observability and advanced DevOps/SRE concepts and methodologies.
  • Proven experience mentoring team members in DevOps practices.
  • Active, transferable U.S. Security clearance at the Public Trust level or higher preferred.

Additional details…

  • Location requirements: Fully remote within the U.S.
  • Travel requirements: Up to 10% travel
  • Physical requirements: Must be able to sit for long periods and handle long periods of screen time
  • Technology requirements: Reliable, high-speed internet
  • Eligible for sponsorship: No
  • Security clearance: Ability to achieve and maintain a security clearance with the U.S. Government is required

The salary range for this role is $180,000 - $200,000 base USD annually. The total compensation package for this position is negotiable and may also include annual performance bonus, ESPP, enhanced time off packages and benefits. This job doesn't have an expiration date and will remain open until a qualified candidate is hired. 

We are not accepting agency submissions for this role.

To learn more about us, visit consensus.com.

Top Skills

Ansible
AWS
AWS CodePipeline
Bash
Chef
Confluence
Docker
ELK Stack
GitHub Actions
Go
Grafana
Jaeger
Java
Jenkins
Jira
Kafka
New Relic
Node.js
OpenTelemetry
Perl
Prometheus
Puppet
Python
Terraform
Terragrunt
Zipkin
