2K Logo

2K

Senior Site Reliability Engineer

Posted An Hour Ago
Be an Early Applicant
Hybrid
Austin, TX
Senior level
Hybrid
Austin, TX
Senior level
Lead design, build, and operation of multi-cloud hybrid infrastructure and Kubernetes platforms. Drive observability, SLI/SLOs, incident response, automation, CI/CD hardening, secrets/policy-as-code, and promote SRE practices across studios.
The summary above was generated by AI
Who We Are

At 2K, we create some of the most iconic and culture-shaping video games in entertainment, including NBA® 2K, one of the top-selling franchises in the world, and legendary titles like BioShock®, Borderlands®, Mafia, Sid Meier’s Civilization®, and XCOM®, as well as fan favorites WWE® 2K, TopSpin®, and PGA TOUR® 2K. We build unforgettable experiences by pushing the boundaries of creativity, authenticity and innovation across every genre.

Our portfolio is brought to life by some of the most influential game development studios in the world. Visual Concepts, Firaxis Games, Hangar 13, Cat Daddy Games, 31st Union, Cloud Chamber, Gearbox, HB Studios, and 2K SportsLab create world-class experiences across platforms. But what truly powers 2K is our people. We believe the best ideas come from teams that feel empowered, supported, and inspired. As an equal opportunity employer, we are committed to fostering a diverse, inclusive workplace where people are encouraged to come as they are and do their best work.

The Team

The 2K SRE team owns the infrastructure behind every player connection—All 2K game services, account platforms, CI/CD pipelines, and developer tooling spanning AWS, GCP, and on-premises data centers across multiple global regions. Global launch windows and live-service events push systems to their limits, and this team is expected to hold the line.

Post-mortems here focus on systems, not people. Automation is the default answer to repetitive work. The infrastructure keeps millions of players connected, and the team takes that seriously!

The Role

The Senior SRE at 2K is a hands-on technical leader—shaping production infrastructure across multiple clouds and regions while partnering with network engineers, systems architects, and game studio developers. This is an ownership role: driving technical direction, influencing reliability from architecture review through production operation, and closing the gap between what engineering ships and what players experience.

What You'll Do

Platform & Infrastructure

  • Design, build, and operate scalable multi-cloud and hybrid infrastructure using Terraform, Pulumi, and GitOps workflows (ArgoCD, Flux).

  • Own Kubernetes platforms (EKS, GKE) end-to-end cluster lifecycle, multi-tenancy, networking (Istio, Cilium), and autoscaling.

  • Push progressive delivery patterns (blue/green, canary) across game service deployments.

Observability & Reliability

  • Build and run the full observability stack: Prometheus + Grafana + Datadog.

  • Define SLI/SLO/error budget policies and build alerting that cuts through the noise.

  • Lead chaos engineering exercises to surface failure modes before players encounter them.

  • Drive incident response and post-mortems with a focus on systemic fixes and real follow-through.

Automation, Security & Developer Experience

  • Eliminate toil through self-service provisioning, automated remediation, and intelligent scaling.

  • Harden CI/CD pipelines (GitHub Actions, Jenkins, ArgoCD).

  • Embed security at the platform layer through secrets management (PasswordState, 1Password, and AWS Secrets Manager) and policy-as-code (OPA/Gatekeeper).

Leadership

  • Promote SRE practices across 2K studios through reliability reviews, runbooks, and embedded collaboration.

  • Shape architectural decisions and author engineering RFCs that move the platform forward.

Required Qualifications
  • Experience: 5+ years in SRE, Platform Engineering, or equivalent infrastructure work at production scale.

  • Kubernetes: Deep experience in cloud environments (EKS or GKE preferred), including networking, storage, and multi-cluster patterns.

  • Infrastructure as Code (IaC): Strong proficiency with Terraform and/or Pulumi; hands-on with Helm, Terragrunt, and GitOps tooling (ArgoCD or GitHub Actions).

  • Environments: Experience with modern and legacy tech, including AWS, GCP, VMware, and Bare metal servers.

  • Configuration Management: Server configuration using Ansible, Puppet, and AWS Systems Manager.

  • Observability: Experience with Datadog, Prometheus + Grafana, and OpenTelemetry; fluency in operationalizing SLI/SLO/error budgets inside engineering teams.

  • Software Engineering: Production-quality code in Go, Python, or TypeScript for tools, automation, and internal libraries.

  • Systems & Networking: Solid understanding of Linux internals, TCP/IP networking, DNS, and TLS proven enough to debug at the system level.

  • Incident Management: Incident response and post-mortem leadership with a track record of systemic follow-through.

Preferred Qualifications
  • Live-service game or large-scale consumer internet experience dealing with millions of concurrent users.

  • Deep knowledge of Service mesh (Istio, Cilium) and advanced Kubernetes networking.

  • Experience with FinOps and managing resources efficiently at cloud scale.

  • Experience with AI and Agentic Development.

  • Cloud certifications (AWS Solutions Architect, GCP Professional Cloud Architect, CKA/CKS, or equivalent).

  • Experience mentoring SREs or leading reliability working groups.

As an equal opportunity employer, we are committed to ensuring that qualified individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, to perform their essential job functions, and to receive other benefits and privileges of employment. Please contact us if you need reasonable accommodation.

Please note that 2K Games and its studios never uses instant messaging apps or personal email accounts to contact prospective employees or conduct interviews and when emailing, only use 2K.com accounts.

#LI-Hybrid

Similar Jobs at 2K

4 Days Ago
Hybrid
Mid level
Mid level
Gaming • Information Technology • Mobile • Software • Esports
Build, support, and improve infrastructure, CI/CD pipelines, and automation to ensure reliable, scalable development and production environments. Improve developer workflows, contribute to observability and incident response, partner with teams on troubleshooting, maintain security and operational standards, and participate in on-call rotations.
Top Skills: AlertingAWSCi/CdContainersDeployment AutomationGCPInfrastructure As Code (Iac)KubernetesLoggingMonitoringObservability
5 Days Ago
Hybrid
Senior level
Senior level
Gaming • Information Technology • Mobile • Software • Esports
The Manager of Engineering will lead a team, optimize development processes, enforce high coding standards, and ensure technical quality through mentorship while driving engineering initiatives.
Top Skills: Ai-Augmented DevelopmentArchitecture ReviewsBackend PlatformsLow-Latency SystemsSoftware DevelopmentTdd
13 Days Ago
Hybrid
Senior level
Senior level
Gaming • Information Technology • Mobile • Software • Esports
Create high-quality real-time particle systems and materials in Unreal Engine 5, produce memory-efficient texture assets and basic 3D models, collaborate with art and technical leads to meet style, performance, and platform constraints for console and multiplatform game titles.
Top Skills: 3Ds MaxAdobe PhotoshopEmbergenHoudiniMayaPlaystationReal-Time Particle SystemsSubstance DesignerUnreal Engine 5Uv MappingXbox

What you need to know about the Colorado Tech Scene

With a business-friendly climate and research universities like CU Boulder and Colorado State, Colorado has made a name for itself as a startup ecosystem. The state boasts a skilled workforce and high quality of life thanks to its affordable housing, vibrant cultural scene and unparalleled opportunities for outdoor recreation. Colorado is also home to the National Renewable Energy Laboratory, helping cement its status as a hub for renewable energy innovation.

Key Facts About Colorado Tech

  • Number of Tech Workers: 260,000; 8.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Lockheed Martin, Century Link, Comcast, BAE Systems, Level 3
  • Key Industries: Software, artificial intelligence, aerospace, e-commerce, fintech, healthtech
  • Funding Landscape: $4.9 billion in VC funding in 2024 (Pitchbook)
  • Notable Investors: Access Venture Partners, Ridgeline Ventures, Techstars, Blackhorn Ventures
  • Research Centers and Universities: Colorado School of Mines, University of Colorado Boulder, University of Denver, Colorado State University, Mesa Laboratory, Space Science Institute, National Center for Atmospheric Research, National Renewable Energy Laboratory, Gottlieb Institute

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account