Focused Logo

Focused

Staff SRE - Observability

Reposted 25 Days Ago
Be an Early Applicant
Easy Apply
In-Office
Denver, CO
160K-200K Annually
Mid level
Easy Apply
In-Office
Denver, CO
160K-200K Annually
Mid level
The role involves designing and implementing OpenTelemetry solutions, optimizing observability infrastructure, and supporting SRE practices and cloud deployments.
The summary above was generated by AI

Who we are:

At Focused, we move quickly to deliver quality software that achieves client outcomes and meets their customer’s needs. We strategically partner with our clients to leverage our expertise in design and software, while our clients bring their own domain expertise. We work with a variety of clients from different industries, collaborating as we get new products to market, modernizing legacy systems, or helping teams learn the skills they need to be successful.   

Our values:

  • Listen first • We are experts in product practices but life long learners in the domain of our customers. We research, collaborate, and understand. 
  • Learn why • We ask questions and talk to users to understand problem spaces, objectives, and goals, which allows us to deeply invest and drive towards the outcomes of our clients. 
  • Love your craft • We love diving into a variety of domains and solving problems.  We take pride in delivering value, in communicating progress, and guiding our clients to success.

We are seeking an experienced Staff Observability Consultant with deep expertise in OpenTelemetry and strong Platform Engineering capabilities to help organizations implement, optimize, and scale their observability infrastructure. This role requires a seasoned consultant who can design comprehensive telemetry strategies, implement distributed tracing solutions, establish robust monitoring practices, and interface closely with clients on the observability journey.

Key Responsibilities:

OpenTelemetry & Observability

  • Design and implement end-to-end OpenTelemetry solutions across diverse technology stacks
  • Configure and deploy OpenTelemetry Collectors for efficient data collection, processing, sampling, and routing
  • Establish telemetry pipelines for metrics, traces, and logs across microservices architectures
  • Optimize collector configurations for performance, reliability, and cost-effectiveness

Platform Engineering & Infrastructure

  • Augment existing infrastructure with with integrated observability solutions
  • Implement Infrastructure as Code (IaC) solutions using Terraform, Pulumi, CloudFormation, etc.
  • Architect and manage Kubernetes clusters with comprehensive monitoring and logging
  • Build CI/CD pipelines with embedded observability and automated testing

Site Reliability Engineering (SRE)

  • Establish and maintain Service Level Indicators (SLIs), Objectives (SLOs), and Agreements (SLAs)
  • Implement error budgets, toil reduction strategies, and capacity planning
  • Support incident response procedures and post-mortem processes

Cloud & DevOps Engineering

  • Deploy and manage observability infrastructure across AWS, GCP, and Azure
  • Establish security, compliance, and governance frameworks for telemetry data
  • Experience automating Agent Evaluations in CI/CD pipelines and observability backends.

Required Qualifications:

Core Observability & OpenTelemetry

  • 3-7 years of experience in observability, monitoring, and distributed systems
  • Deep hands-on experience with OpenTelemetry ecosystem, including SDKs, APIs, and specifications
  • Proficiency with OpenTelemetry Collector configuration, processors, exporters, and receivers
  • Strong understanding of telemetry data models, semantic conventions, and instrumentation best practices

Platform Engineering & DevOps

  • 5+ years of Platform Engineering or DevOps experience with focus on site reliability, observability, and incident response
  • Proficiency with Infrastructure as Code tools (Terraform, Pulumi, CloudFormation, CDK)
  • Strong experience with CI/CD platforms (GitHub Actions, GitLab CI, Jenkins, ArgoCD)

Cloud & Infrastructure

  • Hands-on experience with major cloud providers (AWS, GCP, Azure) and their observability services
  • Experience with container technologies (Docker, Podman) and container registries
  • Knowledge of networking, security, load balancing, and distributed systems concepts

Site Reliability Engineering

  • Experience implementing SRE practices including error budgets and toil metrics
  • Proficiency in incident management, on-call procedures, and post-mortem culture
  • Experience with capacity planning, performance optimization, and scalability design

Programming & Automation

  • Proficiency in multiple programming languages preferred (Go, Python, Java, Node.js, Rust)
  • Strong scripting and automation skills (Bash, Python, PowerShell)
  • Understanding of software engineering best practices and testing methodologies

Preferred Qualifications (Exceptional Candidates)

AI & Agentic Frameworks

  • Understanding of Large Language Models (LLMs) and their application in DevOps
  • Knowledge of vector databases, embeddings, and retrieval-augmented generation (RAG)
  • Experience with AI/ML model deployment and monitoring in production environments

Leadership & Communication

  • Strong technical writing and documentation skills
  • Ability to present complex technical concepts to diverse stakeholders
  • A passion for knowledge sharing

Key Competencies

  • Systems thinking and ability to design holistic observability solutions
  • Strong analytical and troubleshooting skills for complex distributed systems
  • Curiosity about emerging technologies, particularly AI applications in operations
  • Adaptability to rapidly evolving cloud-native and observability technologies
  • Collaborative mindset with focus on enabling developer productivity and system reliability

What Sets Exceptional Candidates Apart:

  • Experience with Honeycomb
  • Contributions to open-source observability or AI framework projects
  • Track record of implementing platform engineering solutions that significantly improved developer experience
  • Experience scaling observability infrastructure to handle high event volume

What to know before you apply: 

  • This role will require being in the Denver office three days per week and up to 20% travel within the United States.
  • Focused is unable to sponsor or take over sponsorship of the employment Visa process at this time.
  • The Denver base salary range for this role is $160,000 - $200,000.

Top Skills

AWS
Azure
CloudFormation
Docker
GCP
Go
Java
Kubernetes
Node.js
Opentelemetry
Pulumi
Python
Rust
Terraform

Focused Denver, Colorado, USA Office

1425 Market St, Denver, CO, United States, 80202

Similar Jobs

2 Hours Ago
Hybrid
Denver, CO, USA
80K-100K Annually
Junior
80K-100K Annually
Junior
Information Technology • Insurance • Software
The Legal Counsel drafts, reviews, and negotiates various commercial contracts, assists with litigation, and ensures compliance with laws related to software and intellectual property. They work collaboratively with the sales team and manage contract workflows.
Top Skills: Salesforce
2 Hours Ago
In-Office
Westminster, CO, USA
131K-219K Annually
Senior level
131K-219K Annually
Senior level
Aerospace • Artificial Intelligence • Computer Vision • Software • Analytics • Defense • Big Data Analytics
The Lead Integration and Test Systems Engineer will oversee space systems integration, lead a small team, manage budgets and schedules, and ensure successful spacecraft testing activities.
Top Skills: CclCecilPythonStol
2 Hours Ago
Hybrid
Rifle, CO, USA
21-30 Hourly
Entry level
21-30 Hourly
Entry level
Fintech • Financial Services
The Associate Personal Banker will enhance customer experiences, assist with account openings, handle service requests, and connect clients to bank services. Compliance with mortgage licensing is required.

What you need to know about the Colorado Tech Scene

With a business-friendly climate and research universities like CU Boulder and Colorado State, Colorado has made a name for itself as a startup ecosystem. The state boasts a skilled workforce and high quality of life thanks to its affordable housing, vibrant cultural scene and unparalleled opportunities for outdoor recreation. Colorado is also home to the National Renewable Energy Laboratory, helping cement its status as a hub for renewable energy innovation.

Key Facts About Colorado Tech

  • Number of Tech Workers: 260,000; 8.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Lockheed Martin, Century Link, Comcast, BAE Systems, Level 3
  • Key Industries: Software, artificial intelligence, aerospace, e-commerce, fintech, healthtech
  • Funding Landscape: $4.9 billion in VC funding in 2024 (Pitchbook)
  • Notable Investors: Access Venture Partners, Ridgeline Ventures, Techstars, Blackhorn Ventures
  • Research Centers and Universities: Colorado School of Mines, University of Colorado Boulder, University of Denver, Colorado State University, Mesa Laboratory, Space Science Institute, National Center for Atmospheric Research, National Renewable Energy Laboratory, Gottlieb Institute

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account