adaption

AI Systems & Inference Frameworks Engineer

Posted Yesterday
Remote or Hybrid
Hiring Remotely in United States
Mid level
Design and build LLM inference stack, optimize performance and cost efficiency, and collaborate on model execution and infrastructure.
The summary above was generated by AI
About Us

Most AI is frozen in place: it doesn't adapt to the world. We think that's backwards. Our mandate is to build efficient intelligence that evolves in real time. Our vision is AI systems that are flexible, personalized, and accessible to everyone. We believe efficiency is what makes this possible: it's how we expand access and ensure innovation benefits the many, not the few. We believe in talent density: bringing together the best and most driven individuals to push the boundaries of continual adaptation. We're looking for builders and creative thinkers ready to shape the next era of intelligence.

The Role

You’ll work directly with our founders to design and build the inference and optimization systems that power our core product. This role bridges research and production, combining deep exploration of inference techniques with hands-on ownership of scalable, high-performance serving infrastructure. You’ll own the full lifecycle of LLM inference—from experimentation and performance analysis to deployment and iteration in production—thriving in a zero-to-one environment and helping define the technical foundations of our inference stack.

Responsibilities
  • Inference Research & Systems: design and build our LLM inference stack from zero to one, exploring and implementing advanced techniques for low-latency, high-throughput serving of language and multimodal models.

  • Frameworks & Optimization: develop and optimize inference using modern frameworks (e.g., vLLM, SGLang, TensorRT-LLM), experimenting with batching strategies, KV-cache management, parallelism, and GPU utilization to push performance and cost efficiency.

  • Software–Hardware Co-Design: collaborate closely with founders and model developers to analyze bottlenecks across the stack, co-optimizing model execution, infrastructure, and deployment pipelines.

Qualifications
  • Strong experience building and optimizing LLM inference systems in production or research environments

  • Hands-on expertise with inference frameworks such as vLLM, SGLang, TensorRT-LLM, or similar

  • Deep performance mindset with experience in GPU-backed systems, latency/throughput optimization, and resource efficiency

  • Solid understanding of transformer inference, serving architectures, and KV-cache–based execution

  • Strong programming skills in Python; experience with CUDA, Triton, or C++ a plus

  • Comfort working in ambiguous, zero-to-one environments and driving research ideas into production systems

  • Nice to have: experience with model quantization or pruning, speculative decoding, multimodal inference, open-source contributions, or prior work in systems or ML research labs

Above all, we're looking for great teammates who make work feel lighter and aren't afraid to go out on a limb with bold ideas. You don't need to be perfect, but you do need to be adaptable. We encourage you to apply, even if you don't check every box.

Benefits
  • Flexible work: In-person collaboration in the Bay Area, a distributed global-first team, and quarterly offsites.

  • Adaption Passport: Annual travel stipend to explore a country you've never visited. We're building intelligence that evolves alongside you, so we encourage you to keep expanding your horizons.

  • Lunch Stipend: Weekly meal allowance for take-out or grocery delivery.

  • Well-Being: Comprehensive medical benefits and generous paid time off.

Top Skills

C++
CUDA
Python
Triton
