Plume Design, Inc Logo

Plume Design, Inc

Senior Incident and Problem Manager

Reposted 13 Days Ago
Remote
Hiring Remotely in United States
108K-127K
Senior level
Remote
Hiring Remotely in United States
108K-127K
Senior level
The Incident & Problem Manager leads incident responses and problem management for production systems, ensuring continuous improvement and minimal service disruption.
The summary above was generated by AI

Life at Plume

At Plume, we believe that technology isn't about moving faster, it's about making life’s moments better. Which is why we’ve built the world's first, and only, open and hardware-independent service delivery platform for smart homes, small businesses, enterprises, and beyond. Our SaaS platform uses WiFi, advanced AI, and machine learning to create the future of connected spaces—and human experiences—at massive scale.

We now deliver services to over 60 million locations globally and have managed over 3 billion devices on our platform. We’re expanding rapidly, pioneering a new category, and we achieved our Series F funding in just four years. Our customers include many of the world's largest Internet Service Providers (ISPs) who look to Plume to help them evolve their smart home offerings while gleaning insights from their own data. 

With a bias for action and a love for being trailblazers, the team at Plume embodies a combination of relentless curiosity and imaginative innovation. We challenge ourselves to think in ways that other companies don't, work to do what should be done (rather than what can), and if we can’t do it exceptionally well, we don’t do it. It’s how we've assembled a team of world-class builders, thinkers, and doers. And it’s how we’re reinventing what’s possible every day.

Incident & Problem Manager 

What You’ll Do 

As the Incident & Problem Manager, you will be responsible for ensuring the stability, efficiency, and continuous improvement of Plume’s global production systems. You will lead high-impact incident response efforts and manage problem resolution processes to minimize service disruptions, prevent recurrence, and drive long-term operational improvements. 

Incident Management Responsibilities: 

  • Manage and coordinate global incidents across Plume’s production systems. 
  • Own the incident lifecycle: classification, prioritization, escalation, resolution, and closure. 
  • Engage and coordinate cross-functional teams during major incidents (P0, P1, P2). 
  • Ensure real-time response and minimize downtime through rapid decision-making and structured execution. 
  • Provide timely and clear communication updates to internal teams, leadership, and customers throughout the incident lifecycle. 
  • Act as the central liaison between business, engineering, and operations teams during high-severity incidents. 
  • Maintain metrics and trends related to incident volumes, root causes, SLA adherence, and MTTR. 
  • Lead the post-incident review process (RCA) and ensure action items are identified and followed through. 
  • Leverage monitoring and alerting tools to proactively detect and respond to incidents. 

Problem Management Responsibilities: 

  • Identify and log problems based on incident trends, major incident reviews, and other system data. 
  • Categorize, prioritize, and manage problem records from investigation through resolution.
  • Conduct in-depth root cause analyses (RCA), utilizing techniques like 5 Whys or Kepner-Tregoe. 
  • Collaborate with engineering, operations, and DevOps to drive permanent fixes. 
  • Document workarounds and maintain a Known Error Database (KEDB) to support incident resolution. 
  • Track and communicate problem status, progress, and risks to relevant stakeholders. 
  • Feed insights into the Continual Service Improvement (CSI) process and identify systemic improvements across the organization. 

What You’ll Bring 

Professional Experience & Knowledge: 

  • 5+ years of senior-level experience in ITIL-based Incident and Problem Management, ideally in SaaS, networking, or cloud-native environments. 
  • Deep understanding of ITIL v3/v4 frameworks (certification preferred). 
  • Proven ability to establish and optimize incident/problem management processes in high-availability, high-pressure environments. 
  • Strong background in root cause analysis, trend identification, and service improvement planning. 
  • Technical knowledge of cloud-based infrastructure, WiFi and networking technologies, APIs, monitoring systems, and common SaaS architectures. 
  • Proficient with ITSM platforms like ServiceNow, Jira Service Management, or similar tools. 

Personal Attributes & Soft Skills: 

  • Confident leader with the ability to guide teams and make decisions under pressure.
  • Calm, structured, and effective in high-stress, high-stakes situations.
  • Analytical thinker with a data-driven and strategic mindset.
  • Clear and concise communicator, capable of translating complex technical issues into actionable insights for both technical teams and executive stakeholders. 
  • Collaborative team player with strong cross-functional influence—even without direct authority.
  • Ownership-driven and proactive, with a strong sense of accountability and urgency.
  • Willingness to support on-call rotation, including evenings and weekends, as needed.

Total Compensation package would include: anticipated base compensation range of 108,000.00 - 127,000.00 + bonus + equity + benefits.  Benefits include: a 401k plan and a company match, basic life insurance plus unparalleled health, dental, vision and other benefits and perks. Please see here for more details. An employee’s base salary and its position within the range may depend on a number of factors including job related knowledge, education, skills, experience and other business related considerations. Published ranges are provided in good faith at the time of posting.

Kindly note that this is a REMOTE position, with a requirement to work in EST.

About Plume

As the creator of the only open, hardware-independent, cloud-controlled experience platform for ISPs and their subscribers, Plume partners with over 350 ISP customers, including some of the world’s largest such as Comcast, Charter, Liberty Global, and J:COM. 

Using OpenSync, the most widely supported open-source, silicon-to-cloud framework for smart spaces, Plume’s software-defined network allows ISPs to decouple their service offerings from hardware and rapidly curate and deliver new services over a multi-vendor, open-platform architecture.  

Backed by investors such as Insight Partners and SoftBank Vision Fund 2, Plume is now valued at $2.6B, having added over $500M in funding in 2021 alone.

Plume is an equal opportunity workplace that maintains a continuing policy of nondiscrimination in all employment practices and decisions, ensuring equal employment opportunities for all qualified individuals without regard to race, color, creed, religion, sex, national origin, age, physical or mental disability, sexual orientation, gender identity, marital status, pregnancy, childbirth or related individual conditions, medical conditions (as defined by state law), military or veteran status, or any other characteristic protected by federal, state or local law.

Top Skills

APIs
Cloud-Based Infrastructure
Itil
Jira Service Management
Monitoring Systems
Networking Technologies
Saas Architectures
Servicenow
Wifi

Similar Jobs

8 Hours Ago
Remote or Hybrid
Santa Clara, CA, USA
103K-175K Annually
Mid level
103K-175K Annually
Mid level
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
The Senior Technical Support Engineer resolves technical issues for customers using ServiceNow's platform, ensuring excellent support experiences and collaboration for complex problems.
Top Skills: AIJavaJavaScriptServicenow
12 Hours Ago
In-Office or Remote
New York, NY, USA
120K-150K
Senior level
120K-150K
Senior level
Healthtech • Insurance • Software
Lead healthcare program and data implementations, ensuring successful delivery and enhancing implementation processes with a focus on client expectations.
Top Skills: Healthcare Data StandardsProject Management Tools
Junior
Digital Media • eCommerce • Gaming • Mobile • News + Entertainment
The DVD/Blu-ray Quality Control Technician reviews German language audio and subtitles for quality assurance, ensuring adherence to standards and effective communication across departments.
Top Skills: Blu-RayDvdExcelMicrosoft Office Word

What you need to know about the Colorado Tech Scene

With a business-friendly climate and research universities like CU Boulder and Colorado State, Colorado has made a name for itself as a startup ecosystem. The state boasts a skilled workforce and high quality of life thanks to its affordable housing, vibrant cultural scene and unparalleled opportunities for outdoor recreation. Colorado is also home to the National Renewable Energy Laboratory, helping cement its status as a hub for renewable energy innovation.

Key Facts About Colorado Tech

  • Number of Tech Workers: 260,000; 8.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Lockheed Martin, Century Link, Comcast, BAE Systems, Level 3
  • Key Industries: Software, artificial intelligence, aerospace, e-commerce, fintech, healthtech
  • Funding Landscape: $4.9 billion in VC funding in 2024 (Pitchbook)
  • Notable Investors: Access Venture Partners, Ridgeline Ventures, Techstars, Blackhorn Ventures
  • Research Centers and Universities: Colorado School of Mines, University of Colorado Boulder, University of Denver, Colorado State University, Mesa Laboratory, Space Science Institute, National Center for Atmospheric Research, National Renewable Energy Laboratory, Gottlieb Institute

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account