Bespoke Labs Logo

Bespoke Labs

DevOps / Site Reliability Engineer

Posted 2 Days Ago
Remote
Hiring Remotely in USA
Mid level
Remote
Hiring Remotely in USA
Mid level
As a DevOps/Site Reliability Engineer, you will manage cloud infrastructure, CI/CD pipelines, and improve system reliability and performance while supporting AI data pipelines.
The summary above was generated by AI

About Bespoke Labs

Bespoke Labs is an AI research and data company building the datasets, benchmarks, and evaluation infrastructure that power frontier AI models. We're backed by leading investors, trusted by top AI labs, and have research accepted at venues like ICLR 2026. Our team is small, moves fast, and has an outsized impact on how the next generation of AI is built.

The Role

We're looking for a mid-level DevOps / Site Reliability Engineer to own and scale our cloud infrastructure. You'll work closely with engineering and ML teams to keep our systems reliable, observable, and fast — directly supporting the infrastructure that powers AI data pipelines at scale.

What You'll Do

  • Own cloud infrastructure on AWS — EC2, EKS, RDS, S3, IAM, VPC

  • Manage Kubernetes clusters and container orchestration end-to-end

  • Build and maintain CI/CD pipelines using GitHub Actions or similar

  • Implement monitoring, alerting, and observability stacks (Prometheus, Grafana, or DataDog)

  • Improve reliability, performance, and security of production systems

  • Automate infrastructure with Terraform or similar IaC tools

  • Debug and resolve issues across complex, distributed systems

  • Participate in design reviews and help raise the infrastructure bar

What We're Looking For

  • 3–5 years in DevOps, SRE, or infrastructure engineering

  • Strong AWS experience — EKS, EC2, RDS, S3, IAM

  • Kubernetes — deployment, scaling, troubleshooting in production

  • CI/CD pipelines — GitHub Actions, ArgoCD, or similar

  • Infrastructure as Code — Terraform, Pulumi, or CDK

  • Python or Go scripting

  • Experience working in production environments with real users

  • Comfort with ambiguity and ability to operate autonomously

Nice to Have

  • Experience supporting ML training workloads or GPU clusters

  • Familiarity with distributed computing or large-scale data pipelines

  • Prior work at an AI, ML, or data company

  • Open-source contributions or published technical writing

What We Offer

  • Competitive compensation and meaningful equity

  • Direct impact on frontier AI model training and evaluation infrastructure

  • Flexible, remote-friendly environment with low bureaucracy

  • A small, high-caliber team with deep AI research expertise

  • Health, wellness, and learning & development benefits

Similar Jobs

24 Days Ago
Remote
United States
Senior level
Senior level
Logistics • Software • Transportation
Lead and mentor teams in DevOps and SRE, architect scalable Azure Cloud infrastructure, implement CI/CD and IaC, ensure database reliability, and drive cross-functional collaboration.
Top Skills: Azure CloudAzure DevopsCi/CdCosmosdbDockerElkGrafanaKubernetesMySQLPostgresPrometheusRedisSQL ServerTerraform
16 Days Ago
Remote or Hybrid
United States
154K-199K Annually
Senior level
154K-199K Annually
Senior level
3D Printing • Aerospace • Hardware • Robotics • Software
Lead the reliability and scalability of BRINC's production systems, building secure cloud infrastructure and improving incident response. Collaborate with teams for optimal system performance.
Top Skills: AWSInfrastructure As CodeJavaScriptNode.jsPython
16 Days Ago
In-Office or Remote
2 Locations
Senior level
Senior level
Healthtech
The SRE will design and implement platform solutions, maintain cloud environments, monitor and troubleshoot production issues, and automate tasks to improve efficiency.
Top Skills: AnsibleAWSDockerGCPGitIacLinuxMySQLPHPTerraform

What you need to know about the Colorado Tech Scene

With a business-friendly climate and research universities like CU Boulder and Colorado State, Colorado has made a name for itself as a startup ecosystem. The state boasts a skilled workforce and high quality of life thanks to its affordable housing, vibrant cultural scene and unparalleled opportunities for outdoor recreation. Colorado is also home to the National Renewable Energy Laboratory, helping cement its status as a hub for renewable energy innovation.

Key Facts About Colorado Tech

  • Number of Tech Workers: 260,000; 8.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Lockheed Martin, Century Link, Comcast, BAE Systems, Level 3
  • Key Industries: Software, artificial intelligence, aerospace, e-commerce, fintech, healthtech
  • Funding Landscape: $4.9 billion in VC funding in 2024 (Pitchbook)
  • Notable Investors: Access Venture Partners, Ridgeline Ventures, Techstars, Blackhorn Ventures
  • Research Centers and Universities: Colorado School of Mines, University of Colorado Boulder, University of Denver, Colorado State University, Mesa Laboratory, Space Science Institute, National Center for Atmospheric Research, National Renewable Energy Laboratory, Gottlieb Institute

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account