AssemblyAI Logo

AssemblyAI

Senior Research Engineer

Reposted 16 Days Ago
Be an Early Applicant
Easy Apply
Remote
Hiring Remotely in USA
240K-275K Annually
Senior level
Easy Apply
Remote
Hiring Remotely in USA
240K-275K Annually
Senior level
This role involves optimizing large-scale distributed training and inference systems, implementing deep learning optimizations, and collaborating across teams to enhance AI models.
The summary above was generated by AI
About AssemblyAI

At AssemblyAI, we’re building at the forefront of Speech AI, creating powerful models for speech-to-text and speech understanding available through a straightforward API. With more than 200,000 developers building on our API and over 5,000 paying customers, AssemblyAI is helping unlock and support the next generation of powerful, meaningful products built with AI. 

Progress in AI is moving at an unprecedented pace– and our team is made up of experts in AI research that are focused on making sure that our customers are able to stay on the cutting edge, with production-ready AI models that are constantly updating and improving as our team continues to improve accuracy, latency, and what’s possible with Speech AI. Our models consistently rank highest in industry benchmarks for accuracy, outperforming models from Google and Amazon, and up to 30% fewer hallucinations than OpenAI’s Whisper. Our models power more than 2 billion end-user experiences each day, helping companies better understand customer feedback, run more productive meetings with automated meeting notes, and helping improve childhood literacy via ed tech tools. 

We’ve raised funding by leading investors including Accel, Insight Partners, Y Combinator’s AI Fund, Patrick and John Collision, Nat Friedman, and Daniel Gross. We’re a remote team looking to build one of the next great AI companies, and are looking for driven, talented people to help us get there!

About the Role

We are seeking a highly skilled Senior Research Engineer to collaborate closely with both Research and Engineering teams. The role involves diagnosing and resolving bottlenecks across large-scale distributed training, data processing, and inference systems, while also driving optimizations for existing high-performance pipelines.

The ideal candidate possesses a deep understanding of modern deep learning systems, combined with strong engineering expertise in areas such as layer-level optimization, large-scale distributed training, streaming, low-latency and asynchronous inference, inference compilers, and advanced parallelization techniques.

This is a cross-functional role requiring strong technical rigor, attention to detail, intellectual curiosity, and excellent communication skills. The position is embedded within the Research team and is responsible for developing and refining the technical foundation that enables cutting-edge research and translates its outcomes into production, bridging research and production engineering.

What You'll Do
  • Investigate and mitigate performance bottlenecks in large-scale distributed training and inference systems.
  • Develop and implement both low-level (operator/kernel) and high-level (system/architecture) optimization strategies.
  • Translate research models and prototypes into highly optimized, production-ready inference systems.
  • Explore and integrate inference compilers such as TensorRT, ONNX Runtime, AWS Neuron and Inferentia, or similar technologies.
  • Design, test, and deploy scalable solutions for parallel and distributed workloads on heterogeneous hardware.
  • Facilitate knowledge transfer and bidirectional support between Research and Engineering teams, ensuring alignment of priorities and solutions.
What You'll Need
  • Strong expertise in the Python ecosystem and major ML frameworks (PyTorch, JAX).
  • Experience with lower-level programming (C++ or Rust preferred).
  • Deep understanding of GPU acceleration (CUDA, profiling, kernel-level optimization); TPU experience is a strong plus.
  • Proven ability to accelerate deep learning workloads using compiler frameworks, graph optimizations, and parallelization strategies.
  • Solid understanding of the deep learning lifecycle: model design, large-scale training, data processing pipelines, and inference deployment.
  • Strong debugging, profiling, and optimization skills in large-scale distributed environments.
  • Excellent communication and collaboration skills, with the ability to clearly prioritize and articulate impact-driven technical solutions.

Pay Transparency:

AssemblyAI strives to recruit and retain exceptional talent from diverse backgrounds while ensuring pay equity across our team. Our salary ranges are set to be competitive for our size, stage, and industry, and reflect just one component of the full compensation, benefits, and rewards we offer.

Salary determinations consider a variety of factors, including relevant experience, technical depth, skills demonstrated during the interview process, and maintaining internal equity with peers on the team. The range shared below represents a general expectation for the posted position. However, we are open to considering candidates who may fall above or below the outlined experience level—in those cases, we will communicate any adjustments to the expected salary range.

The range provided applies to candidates located in the United States. For candidates outside of the U.S., compensation ranges may differ; any adjustments will be communicated throughout the interview process.

Salary range: $210,000 - $309,000 

The expected base compensation for this role is listed above. Our total compensation package includes competitive equity grants, 100% employer-paid benefits, and the flexibility of being fully remote.

Working at AssemblyAI

We are a small but mighty group of startup veterans and experienced AI researchers with over 20 years of expertise in Machine Learning, Speech Recognition, and NLP. As a fully remote team, we’re looking for people to join our team who are ambitious, curious, and lead with integrity. We’re still in the early days of AI and of AssemblyAI’s journey, and are looking for teammates who won’t just fit in, but will help us define and build our company culture. 

We’re committed to creating a space where our employees can bring their full selves to work and have equal opportunity to succeed. No matter your race, gender identity or expression, sexual orientation, religion, origin, ability, age, veteran status, if joining this mission speaks to you, we encourage you to apply!

Using AI to Interview:

If you’re selected for an interview, please review this resource to better understand how AssemblyAI approaches the use of AI in our interview process.

Keep Exploring AssemblyAI:

Check us out on YouTube!

Learn more about AI models for speech recognition

Core Transcription | Audio Intelligence | LeMUR | Try the Playground

Our $50M Series C fundraise

Top Skills

Aws Neuron
C++
Cuda
Inferentia
Jax
Onnx Runtime
Python
PyTorch
Rust
Tensorrt

Similar Jobs

7 Days Ago
Remote
2 Locations
150K-250K Annually
Senior level
150K-250K Annually
Senior level
Artificial Intelligence • App development
The Senior Research Engineer will lead projects in AI safety, focusing on machine learning, high-performance computing, or technical leadership. Responsibilities include detecting model deception, preventing misuse, and developing frameworks for research acceleration.
Top Skills: JaxKubernetesPythonPyTorchSlurm
7 Days Ago
Remote or Hybrid
US
155K-228K Annually
Senior level
155K-228K Annually
Senior level
Artificial Intelligence • Big Data • Computer Vision • Machine Learning • Natural Language Processing • Software • Cybersecurity
As a Senior AI Engineer at Dataminr, you'll deploy and optimize large-scale LLMs, collaborate with cross-functional teams, and solve complex AI challenges.
Top Skills: AWSDatabricksGenerative AiKubernetesLlmsPythonPyTorchTransformers
23 Days Ago
Remote
30 Locations
154K-216K Annually
Senior level
154K-216K Annually
Senior level
Artificial Intelligence • Software
Lead the development of differentiable CFD solvers for multi-GPU systems. Ensure code reproducibility and scalability in simulations of chemical reactor designs. Validate CFD capabilities and collaborate with AI developers for optimized reactor designs while producing technical documentation.
Top Skills: CfdCudaDockerHpcJaxKubernetesMpiPythonPyTorch

What you need to know about the Colorado Tech Scene

With a business-friendly climate and research universities like CU Boulder and Colorado State, Colorado has made a name for itself as a startup ecosystem. The state boasts a skilled workforce and high quality of life thanks to its affordable housing, vibrant cultural scene and unparalleled opportunities for outdoor recreation. Colorado is also home to the National Renewable Energy Laboratory, helping cement its status as a hub for renewable energy innovation.

Key Facts About Colorado Tech

  • Number of Tech Workers: 260,000; 8.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Lockheed Martin, Century Link, Comcast, BAE Systems, Level 3
  • Key Industries: Software, artificial intelligence, aerospace, e-commerce, fintech, healthtech
  • Funding Landscape: $4.9 billion in VC funding in 2024 (Pitchbook)
  • Notable Investors: Access Venture Partners, Ridgeline Ventures, Techstars, Blackhorn Ventures
  • Research Centers and Universities: Colorado School of Mines, University of Colorado Boulder, University of Denver, Colorado State University, Mesa Laboratory, Space Science Institute, National Center for Atmospheric Research, National Renewable Energy Laboratory, Gottlieb Institute

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account