Motional Logo

Motional

Senior Backend Engineer, Data Mining

Reposted 4 Days Ago
Remote or Hybrid
4 Locations
159K-207K Annually
Senior level
Remote or Hybrid
4 Locations
159K-207K Annually
Senior level
As a Senior Backend Engineer, you'll architect and enhance the OmniTag engine for data mining, optimize multimodal data pipelines, and ensure production reliability, directly impacting the efficiency of ML workflows.
The summary above was generated by AI

Mission Summary:
At Motional, we're transforming how autonomous vehicles discover critical intelligence hidden within petabytes of multimodal sensor data. Our next-generation autonomous driving stack depends on finding the rare edge cases, long-tail scenarios, and model errors that matter most. OmniTag, our ML-powered multimodal data mining framework, is the engine that powers this discovery.
As a Senior Backend Engineer on the Data Mining team, you'll architect and own the production systems that enable data scientists and ML engineers to rapidly mine, analyze, and extract insights from billions of data points across cameras, LiDAR, radar, and other modalities. You won't maintain a platform, you'll evolve its core foundation, ensuring OmniTag scales to support Motional's most ambitious autonomy challenges. Your work directly impacts the quality and speed at which we improve our perception and planning models.


What You'll Do:

  • Architect the OmniTag Engine: Design and build the high-throughput, low-latency backend systems that execute billion-scale inference across Ray/Spark, transforming raw sensor data into unified multimodal representations. You'll optimize for both query latency and resource efficiency in a cost-sensitive, cloud-based environment.
  • Scale Multimodal Data Pipelines: Own the complete data journey - from ingestion, normalization, and preprocessing of heterogeneous modalities (image, video, LiDAR, audio) through encoding, indexing, and cached embedding storage. Ensure pipelines are robust, observable, and meet the SLOs expected by downstream ML teams.
  • Evolve the Vector Search and Retrieval Engine: Enhance our in-house billion-scale vector search engine to power RAG-driven few-shot dataset creation. Optimize embedding storage, retrieval performance, and filtering across billions of examples to enable rapid interactive mining workflows.
  • Own Data Quality and Observability: Build comprehensive monitoring, logging, and alerting for multimodal data preprocessing pipelines. Develop data validation frameworks that catch regressions in data alignment, normalization, or encoding quality—critical for maintaining model performance.
  • Collaborate on Encoder-Decoder Adaptation: Work closely with ML engineers to support domain-specific fine-tuning workflows, model versioning, and A/B testing of new encoders and decoders. Ensure the backend infrastructure enables rapid experimentation with emerging open-source multimodal foundation models.
  • Drive Production Reliability: Establish patterns for graceful degradation, fault tolerance, and cost optimization. Operate OmniTag as a mission-critical data platform serving the entire ML organization, with a focus on reliability, debuggability, and operational excellence.

What We're Looking For (Must-Haves):

  • BS in Computer Science or a related field, or equivalent professional experience
  • 6+ years designing, building, and operating large-scale distributed systems in production environments
  • Deep, hands-on expertise with Ray or Spark (or both) for distributed data processing and large-scale inference workloads
  • Expert-level Python proficiency with strong software engineering fundamentals: testing (unit, integration, and end-to-end), CI/CD pipelines, containerization, and code review practices
  • Proven experience optimizing and scaling production data pipelines that process terabytes or petabytes of data
  • Strong SQL and data manipulation skills; comfort with both structured and semi-structured data
  • Experience with cloud infrastructure (AWS preferred: S3, EC2, EKS, EMR, IAM) and infrastructure-as-code patterns
  • Demonstrated track record of shipping robust, well-tested, production-grade systems and mentoring junior engineers

Bonus Points (Nice-to-Haves):

  • MS/PhD in Computer Science, Machine Learning, or a related field.
  • Experience building or scaling vector databases, large-scale information retrieval systems, or similarity search engines.
  • Hands-on work with multimodal machine learning models, foundation models (LLMs/VLMs), or embeddings-based systems.
  • Familiarity with ML frameworks (PyTorch, JAX) and the ecosystem around multimodal models.
  • Production experience with workflow orchestration (Airflow, Kubeflow, Dagster) and stream processing (Kafka, Flink).
  • Understanding of model serving patterns, feature stores, or ML ops infrastructure.
  • Domain knowledge in autonomous driving, computer vision, or sensor fusion.
  • Experience with ML-based data mining, active learning, or contrastive learning approaches.

We encourage a hybrid schedule with in-office time at one of our locations in Boston, Pittsburgh, or Las Vegas to support collaboration, or this role can be fully remote.

The salary range for this role is an estimate based on a wide range of compensation factors including but not limited to specific skills, experience and expertise, role location, certifications, licenses, and business needs. The estimated compensation range listed in this job posting reflects base salary only. This role may include additional forms of compensation such as a bonus or company equity. The recruiter assigned to this role can share more information about the specific compensation and benefit details associated with this role during the hiring process. 

Candidates for certain positions are eligible to participate in Motional’s benefits program. Motional’s benefits include but are not limited to medical, dental, vision, 401k with a company match, health saving accounts, life insurance, pet insurance, and more.

Salary Range
$159,000$207,000 USD

Motional is a driverless technology company making autonomous vehicles a safe, reliable, and accessible reality. We’re driven by something more.

Our journey is always people first.

We aren't just developing driverless cars; we're creating safer roadways, more equitable transportation options, and making our communities better places to live, work, and connect. Our team is made up of engineers, researchers, innovators, dreamers and doers, who are creating a technology with the potential to transform the way we move.

Higher purpose, greater impact.

We’re creating first-of-its-kind technology that will transform transportation. To do so successfully, we must design for everyone in our cities and on our roads. We believe in building a great place to work through a progressive, global culture that is diverse, inclusive, and ensures people feel valued at every level of the organization. Diversity helps us to see the world differently; it’s not only good for our business, it’s the right thing to do.  

Scale up, not starting up.

Our team is behind some of the industry's largest leaps forward, including the first fully-autonomous cross-country drive in the U.S, the launch of the world's first robotaxi pilot, and operation of the world's longest-standing public robotaxi fleet. We’re driven to scale; we’re moving towards commercialization of our technology, and we need team members who are ready to embrace change and challenges.

Formed as a joint venture between Hyundai Motor Group and Aptiv, Motional is fundamentally changing how people move through their lives. Headquartered in Boston, Motional has operations in the U.S and Asia. For more information, visit  www.Motional.com and follow us on Twitter, LinkedIn, Instagram and YouTube.

Motional AD Inc. is an EOE. We celebrate diversity and are committed to creating an inclusive environment for all employees. To comply with Federal Law, we participate in E-Verify. All newly-hired employees are queried through this electronic system established by the DHS and the SSA to verify their identity and employment eligibility.

Top Skills

Airflow
AWS
Ci/Cd
Jax
Python
PyTorch
Ray
Spark
SQL

Similar Jobs

23 Minutes Ago
Remote or Hybrid
2 Locations
173K-321K Annually
Senior level
173K-321K Annually
Senior level
Artificial Intelligence • Cloud • Sales • Security • Software • Cybersecurity • Data Privacy
The Director of Product Management will lead the strategy and execution for non-human identity security, defining the roadmap, driving adoption, managing a product team, and collaborating across functions to ensure product success.
Top Skills: AIAWSGoogle Cloud PlatformAzureMl
2 Hours Ago
Remote or Hybrid
United States
74K-220K Annually
Mid level
74K-220K Annually
Mid level
Agency • Artificial Intelligence • Mobile • Software • Consulting • Design
The Design Lead will create digital products focusing on collaboration, iterative design, and user testing while demonstrating a strong portfolio of work.
Top Skills: BrandingCodingPrototyping ToolsUser ExperienceUser Interface DesignVisual Design
2 Hours Ago
Remote or Hybrid
United States
94K-294K Annually
Senior level
94K-294K Annually
Senior level
Agency • Artificial Intelligence • Mobile • Software • Consulting • Design
The Senior Design Lead focuses on creating great digital products through collaboration, attention to detail, and participation in all project phases from strategy to launch.
Top Skills: Prototyping Tools

What you need to know about the Colorado Tech Scene

With a business-friendly climate and research universities like CU Boulder and Colorado State, Colorado has made a name for itself as a startup ecosystem. The state boasts a skilled workforce and high quality of life thanks to its affordable housing, vibrant cultural scene and unparalleled opportunities for outdoor recreation. Colorado is also home to the National Renewable Energy Laboratory, helping cement its status as a hub for renewable energy innovation.

Key Facts About Colorado Tech

  • Number of Tech Workers: 260,000; 8.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Lockheed Martin, Century Link, Comcast, BAE Systems, Level 3
  • Key Industries: Software, artificial intelligence, aerospace, e-commerce, fintech, healthtech
  • Funding Landscape: $4.9 billion in VC funding in 2024 (Pitchbook)
  • Notable Investors: Access Venture Partners, Ridgeline Ventures, Techstars, Blackhorn Ventures
  • Research Centers and Universities: Colorado School of Mines, University of Colorado Boulder, University of Denver, Colorado State University, Mesa Laboratory, Space Science Institute, National Center for Atmospheric Research, National Renewable Energy Laboratory, Gottlieb Institute

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account