Rackspace Technology Logo

Rackspace Technology

AI Model Serving Specialist

Posted 4 Days Ago
Be an Early Applicant
Remote
Hiring Remotely in United States
82K-141K Annually
Mid level
Remote
Hiring Remotely in United States
82K-141K Annually
Mid level
Enable and support enterprise customers in deploying, optimizing AI workloads, and integrating model-serving platforms for secure, efficient inference services.
The summary above was generated by AI
Role Purpose

Enable enterprise customers to operationalize AI workloads by deploying and optimizing model-serving platforms (e.g., NVIDIA Triton, vLLM, KServe) within Rackspace’s Private Cloud and Hybrid environments. This role bridges AI engineering and platform operations, ensuring secure, scalable, and cost-efficient inference services.

Key Responsibilities : -
Model Deployment & Optimization 
Package and deploy ML/LLM models on Triton, vLLM, or KServe within Kubernetes clusters.
Tune performance (batching, KV-cache, TensorRT optimizations) for latency and throughput SLAs.

Platform Integration 
Work with VMware VCF9, NSX-T, and vSAN ESA to ensure GPU resource allocation and multi-tenancy.
Implement RBAC, encryption, and compliance controls for sovereign/private cloud customers.

API & Service Enablement 
Integrate models with Rackspace’s Unified Inference API and API Gateway for multi-tenant routing.
Support RAG and agentic workflows by connecting to vector databases and context stores.

Observability & FinOps 
Configure telemetry for GPU utilization, request tracing, and error monitoring.
Collaborate with FinOps to enable usage metering and chargeback reporting.

Customer Engineering Support 
Assist solution architects in onboarding customers, creating reference patterns for BFSI, Healthcare, and other verticals.
Provide troubleshooting and performance benchmarking guidance.

Continuous Improvement 
Stay current with emerging model-serving frameworks and GPU acceleration techniques.
Contribute to reusable Helm charts, operators, and automation scripts.

Required Skills & Experience

  • Hands-on experience with NVIDIA Triton, vLLM, or similar serving stacks.
  • Strong knowledge of Kubernetes, GPU scheduling, and CUDA/MIG.
  • Familiarity with VMware VCF9, NSX-T networking, and vSAN storage classes.
  • Proficiency in Python and containerization (Docker).
  • Understanding of observability stacks (Prometheus, Grafana) and FinOps principles.
  • Exposure to RAG architectures, vector DBs, and secure multi-tenant environments.
  • Excellent problem-solving and customer-facing communication skills.

Preferred Certifications

  • NVIDIA Certified Professional (AI/ML)
  • Kubernetes Administrator (CKA)
  • VMware VCF Specialist
  • Rackspace AI Foundations (internal)

KPI's

  • Model deployment success rate and SLA compliance.
  • Latency/throughput benchmarks per SKU.
  • Customer satisfaction (NPS) for AI services.
  • Efficiency in GPU utilization and cost optimization.

Physical Demands

  • General office environment: no special physical demands required.
  • May require long periods of sitting and viewing a computer monitor.
  • Schedule flexibility to include working weekends and/or evenings and holidays as required by the business for 24/7 operations.

Travel

  • As per business needs

Sponsorship

  • This role is not sponsorship eligible
  • Candidates need to be legally allowed to work in the US for any employer

#LI-VM1
#LI-US



"Remote postings are limited to candidates residing within the country specified in the posting location"

About Rackspace Technology
We are the multicloud solutions experts. We combine our expertise with the world’s leading technologies — across applications, data and security — to deliver end-to-end solutions. We have a proven record of advising customers based on their business challenges, designing solutions that scale, building and managing those solutions, and optimizing returns into the future. Named a best place to work, year after year according to Fortune, Forbes and Glassdoor, we attract and develop world-class talent. Join us on our mission to embrace technology, empower customers and deliver the future.
 
 
More on Rackspace Technology
Though we’re all different, Rackers thrive through our connection to a central goal: to be a valued member of a winning team on an inspiring mission. We bring our whole selves to work every day. And we embrace the notion that unique perspectives fuel innovation and enable us to best serve our customers and communities around the globe. We welcome you to apply today and want you to know that we are committed to offering equal employment opportunity without regard to age, color, disability, gender reassignment or identity or expression, genetic information, marital or civil partner status, pregnancy or maternity status, military or veteran status, nationality, ethnic or national origin, race, religion or belief, sexual orientation, or any legally protected characteristic. If you have a disability or special need that requires accommodation, please let us know.

Top Skills

Cuda
Docker
Grafana
Kubernetes
Nsx-T
Nvidia Triton
Prometheus
Python
Vllm
Vmware Vcf9
Vsan

Similar Jobs

An Hour Ago
Remote
United States
146K-178K Annually
Senior level
146K-178K Annually
Senior level
Aerospace • Information Technology • Cybersecurity • Defense • Manufacturing
The Software Security Engineer will automate security assessments, analyze security risks, and support compliance for open-source software in the enterprise.
Top Skills: Automation ToolsCotsOpen SourceSoftware Composition Analysis
An Hour Ago
Remote or Hybrid
United States
46K-86K Annually
Mid level
46K-86K Annually
Mid level
Artificial Intelligence • Cloud • Sales • Security • Software • Cybersecurity • Data Privacy
The IT Auditor manages and conducts internal and external audits, assesses security risks, identifies control gaps, and provides audit training. Requires collaboration with compliance teams and effective communication of audit results to management.
Top Skills: AWSAzureGCPGrc ToolsIsoJIRASalesforceSnowSoc
An Hour Ago
Remote or Hybrid
United States
91K-169K Annually
Senior level
91K-169K Annually
Senior level
Artificial Intelligence • Cloud • Sales • Security • Software • Cybersecurity • Data Privacy
The Engagement Manager oversees public sector projects, manages sales processes, ensures client satisfaction, and leads teams while supporting customer success and maximizing revenue growth.
Top Skills: SaaSSoftware Delivery Models

What you need to know about the Colorado Tech Scene

With a business-friendly climate and research universities like CU Boulder and Colorado State, Colorado has made a name for itself as a startup ecosystem. The state boasts a skilled workforce and high quality of life thanks to its affordable housing, vibrant cultural scene and unparalleled opportunities for outdoor recreation. Colorado is also home to the National Renewable Energy Laboratory, helping cement its status as a hub for renewable energy innovation.

Key Facts About Colorado Tech

  • Number of Tech Workers: 260,000; 8.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Lockheed Martin, Century Link, Comcast, BAE Systems, Level 3
  • Key Industries: Software, artificial intelligence, aerospace, e-commerce, fintech, healthtech
  • Funding Landscape: $4.9 billion in VC funding in 2024 (Pitchbook)
  • Notable Investors: Access Venture Partners, Ridgeline Ventures, Techstars, Blackhorn Ventures
  • Research Centers and Universities: Colorado School of Mines, University of Colorado Boulder, University of Denver, Colorado State University, Mesa Laboratory, Space Science Institute, National Center for Atmospheric Research, National Renewable Energy Laboratory, Gottlieb Institute

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account