Plume Design, Inc

Site Reliability Engineer

Posted 2 Hours Ago
Remote
Hiring Remotely in United States
Senior level
The Site Reliability Engineer will develop tools, manage infrastructure, ensure reliability, implement CI/CD pipelines, and optimize cloud resources.

Life at Plume

At Plume, we believe that technology isn't about moving faster, it's about making life’s moments better. Which is why we’ve built the world's first, and only, open and hardware-independent service delivery platform for smart homes, small businesses, enterprises, and beyond. Our SaaS platform uses WiFi, advanced AI, and machine learning to create the future of connected spaces—and human experiences—at massive scale.

We now deliver services to over 60 million locations globally and have managed over 3 billion devices on our platform. We’re expanding rapidly, pioneering a new category, and we achieved our Series F funding in just four years. Our customers include many of the world's largest Internet Service Providers (ISPs) who look to Plume to help them evolve their smart home offerings while gleaning insights from their own data. 

With a bias for action and a love for being trailblazers, the team at Plume embodies a combination of relentless curiosity and imaginative innovation. We challenge ourselves to think in ways that other companies don't, work to do what should be done (rather than what can), and if we can’t do it exceptionally well, we don’t do it. It’s how we've assembled a team of world-class builders, thinkers, and doers. And it’s how we’re reinventing what’s possible every day.

We are seeking an experienced Site Reliability Engineer (SRE) to join our engineering team. In this role, you will develop and implement tools, processes, and automations that help product engineers do their work while ensuring the stability and reliability of our systems. This role requires skills in both software engineering and cloud-native infrastructure, as well as excellent communication skills and a collaborative spirit.

What You’ll Do:

  • Implement, manage, and maintain scalable and reliable infrastructure using infrastructure-as-code (IaC) tools.
  • Develop and implement observability solutions to help engineers ensure high availability and performance of all services.
  • Design, build, and maintain CI/CD pipelines to streamline the deployment process.
  • Collaborate with development teams to ensure services are designed with operability and reliability in mind.
  • Participate in a global on-call rotation to provide support for critical production systems.
  • Drive down operational toil through automation and process improvements.
  • Manage and optimize cloud resources for cost efficiency and performance.

What You’ll Bring:

  • Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent experience.
  • 5+ years of experience as a Site Reliability Engineer, DevOps Engineer, or similar role.
  • Expertise in one or more programming languages (e.g., Python, Go).
  • Experience with one or more cloud computing platforms (e.g., AWS, GCP).
  • Proficiency with configuration management and IaC tools (e.g., Terraform, Salt).
  • Proficiency with Kubernetes-based environments.
  • Experience with monitoring and logging tools (e.g., Prometheus, Grafana, OpenSearch).
  • Excellent technical communication skills.

Preferred:

  • Experience with large-scale distributed systems.
  • Knowledge of networking and security best practices.
  • Familiarity with database technologies (SQL and NoSQL).

About Plume

As the creator of the only open, hardware-independent, cloud-controlled experience platform for ISPs and their subscribers, Plume partners with over 400 ISP customers, including some of the world’s largest such as Comcast, Charter, Liberty Global, and J:COM. 

Using OpenSync, the most widely supported open-source, silicon-to-cloud framework for smart spaces, Plume’s software-defined network allows ISPs to decouple their service offerings from hardware and rapidly curate and deliver new services over a multi-vendor, open-platform architecture.  

Plume is an equal opportunity workplace that maintains a continuing policy of nondiscrimination in all employment practices and decisions, ensuring equal employment opportunities for all qualified individuals without regard to race, color, creed, religion, sex, national origin, age, physical or mental disability, sexual orientation, gender identity, marital status, pregnancy, childbirth or related individual conditions, medical conditions (as defined by state law), military or veteran status, or any other characteristic protected by federal, state or local law.

Top Skills

AWS
GCP
Go
Grafana
Kubernetes
OpenSearch
Prometheus
Python
Salt
Terraform
