Luma AI Logo

Luma AI

Research Scientist / Engineer – Performance Optimization

Reposted 12 Days Ago
Be an Early Applicant
Remote
Hiring Remotely in United States
180K-250K
Senior level
Remote
Hiring Remotely in United States
180K-250K
Senior level
As a Research Engineer, you'll optimize and implement models for data processing and inference, and work on performance enhancements in distributed systems using PyTorch and CUDA.
The summary above was generated by AI
About the Role

The Performance Optimization team at Luma is dedicated to maximizing the efficiency and performance of our AI models. Working closely with both research and engineering teams, this group ensures that our cutting-edge multimodal models can be trained efficiently and deployed at scale while maintaining the highest quality standards.

Responsibilities
  • Profile and optimize GPU/CPU/Accelerator code for maximum utilization and minimal latency

  • Write high-performance PyTorch, Triton, CUDA, deferring to custom PyTorch operations if necessary

  • Develop fused kernels and leverage tensor cores and modern hardware features for optimal hardware utilization on different hardware platforms

  • Optimize model architectures and implementations for distributed multi-node production deployment

  • Build performance monitoring and analysis tools and automation

  • Research and implement cutting-edge optimization techniques for transformer model

Experience
  • Expert-level proficiency in Triton/CUDA programming and GPU optimization

  • Strong PyTorch skills

  • Experience with PyTorch kernel development and custom operations

  • Proficiency with profiling tools (NVIDIA Nsight, torch profiler, custom tooling)

  • Deep understanding of transformer architectures and attention mechanisms

  • (Preferred) Experience with compilers/exporters such as torch.compile, TensorRT, ONNX, XLA

  • (Preferred) Experience optimizing inference workloads for latency and throughput

  • (Preferred) Experience with Triton compiler and kernel fusion techniques

  • (Preferred) Knowledge of warp-level intrinsics and advanced CUDA optimization

  • (Preferred) Background in compiler optimization or hardware-software co-design

Compensation

  • The pay range for this position in California is $180,000 - $250,000yr; however, base pay offered may vary depending on job-related knowledge, skills, candidate location, and experience. We also offer competitive equity packages in the form of stock options and a comprehensive benefits plan. 

Your applications are reviewed by real people.

Top Skills

C++
Cuda
PyTorch
Triton

Similar Jobs

34 Minutes Ago
Easy Apply
Remote or Hybrid
USA
Easy Apply
176K-264K Annually
Senior level
176K-264K Annually
Senior level
Healthtech • Information Technology • Software • Telehealth
Lead and scale the Interop Platform team, ensuring high quality and reliability of APIs and data systems while driving product strategy and team development.
Top Skills: AIAPIsCloud MicroservicesData PlatformsHipaaScalable Systems
35 Minutes Ago
Remote or Hybrid
United States
128K-160K
Senior level
128K-160K
Senior level
Digital Media • Gaming • Information Technology • Software • Sports • Esports • Big Data Analytics
As a Senior Site Reliability Engineer, you'll manage and optimize applications, implement infrastructure, design monitoring systems, and ensure continuous deployment.
Top Skills: BashDockerGoGoogle Cloud PlatformKubernetesPythonTerraform
36 Minutes Ago
Remote or Hybrid
New Jersey, USA
70K-88K
Junior
70K-88K
Junior
Digital Media • Gaming • Information Technology • Software • Sports • Esports • Big Data Analytics
Lead marketing campaigns for the Customer Retention & Monetization Team, manage A/B testing, analyze results, and coordinate with stakeholders for optimal performance.
Top Skills: BrazeGoogle SheetsGoogle SlidesHTMLLiquidSnowflakeSQLTableau

What you need to know about the Colorado Tech Scene

With a business-friendly climate and research universities like CU Boulder and Colorado State, Colorado has made a name for itself as a startup ecosystem. The state boasts a skilled workforce and high quality of life thanks to its affordable housing, vibrant cultural scene and unparalleled opportunities for outdoor recreation. Colorado is also home to the National Renewable Energy Laboratory, helping cement its status as a hub for renewable energy innovation.

Key Facts About Colorado Tech

  • Number of Tech Workers: 260,000; 8.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Lockheed Martin, Century Link, Comcast, BAE Systems, Level 3
  • Key Industries: Software, artificial intelligence, aerospace, e-commerce, fintech, healthtech
  • Funding Landscape: $4.9 billion in VC funding in 2024 (Pitchbook)
  • Notable Investors: Access Venture Partners, Ridgeline Ventures, Techstars, Blackhorn Ventures
  • Research Centers and Universities: Colorado School of Mines, University of Colorado Boulder, University of Denver, Colorado State University, Mesa Laboratory, Space Science Institute, National Center for Atmospheric Research, National Renewable Energy Laboratory, Gottlieb Institute

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account