Serve Robotics Logo

Serve Robotics

Lead Machine Learning Engineer

Posted 14 Hours Ago
Be an Early Applicant
In-Office or Remote
Hiring Remotely in USA
225K-260K Annually
Senior level
In-Office or Remote
Hiring Remotely in USA
225K-260K Annually
Senior level
Develop and scale machine learning training systems for robotics data, optimize training pipelines, and collaborate with ML researchers to enhance autonomy models.
The summary above was generated by AI

At Serve Robotics, we’re reimagining how things move in cities. Our personable sidewalk robot is our vision for the future. It’s designed to take deliveries away from congested streets, make deliveries available to more people, and benefit local businesses.

The Serve fleet has been delighting merchants, customers, and pedestrians along the way in Los Angeles, Miami, Dallas, Atlanta and Chicago while doing commercial deliveries. We’re looking for talented individuals who will grow robotic deliveries from surprising novelty to efficient ubiquity.

Who We Are

We are tech industry veterans in software, hardware, and design who are pooling our skills to build the future we want to live in. We are solving real-world problems leveraging robotics, machine learning and computer vision, among other disciplines, with a mindful eye towards the end-to-end user experience. Our team is agile, diverse, and driven. We believe that the best way to solve complicated dynamic problems is collaboratively and respectfully.

This role develops and scales large-scale machine learning training systems for multimodal robotics data, enabling the creation of high-performance autonomy models. By optimizing distributed training pipelines, neural network architectures, and data processing workflows, the position improves training efficiency, accelerates model iteration, and maximizes GPU utilization. The role collaborates closely with ML researchers and infrastructure teams, influencing the design, deployment, and performance of end-to-end autonomy models and the large-scale data pipelines that support them.

Responsibilities

  • Design and maintain training systems that can process and learn from petabyte-scale multimodal datasets (e.g., video and point cloud data). This includes ensuring data is efficiently loaded, distributed, and processed across large GPU clusters.

  • Identify and resolve bottlenecks in the training pipeline, including data loading, preprocessing, model computation, and inter-node communication, to maximize GPU utilization and reduce training time.

  • Work with the ML team to develop and refine neural network architectures suitable for autonomy tasks, particularly those handling high-dimensional and sequential sensor data.

  • Create and adjust loss functions and training strategies that help the model learn effectively from complex multimodal inputs and improve autonomy performance.

  • Configure, monitor, and maintain large-scale distributed training jobs across multiple machines and GPUs, ensuring stability, fault tolerance, and efficient resource usage.

  • Implement scalable systems to preprocess, transform, and augment large robotics datasets so that they are suitable for model training.

  • Work closely with ML scientists and other engineers to integrate new models, experiments, and training approaches into the production training pipeline.

  • Analyze training metrics, model outputs, and experiment logs to assess model performance and guide improvements in architecture, data usage, or training strategies.

  • Develop tools and workflows that allow teams to run experiments, track results, and iterate quickly on new model ideas or training approaches.

Qualifications

  • Master’s or PhD in Computer Science, Robotics, Electrical Engineering, Machine Learning, or a closely related technical discipline.

  • Minimum of 5 years of professional experience developing, training, and deploying machine learning models in production environments.

  • Hands-on experience training machine learning models across multiple GPUs or compute nodes, including familiarity with distributed training frameworks and large dataset handling.

  • Strong programming skills in Python for implementing machine learning models, data pipelines, and training workflows.

  • Solid knowledge of core concepts such as neural networks, optimization algorithms, loss functions, model evaluation, and training methodologies.

What Makes You Stand out

  • Experience identifying and resolving training bottlenecks related to compute utilization, memory usage, and data throughput in machine learning systems.

  • Experience training machine learning models on robotics or autonomous driving datasets involving multimodal sensor inputs such as camera video, LiDAR point clouds, radar, or telemetry data.

  • Experience developing models that combine multiple data modalities (e.g., images, point clouds, and structured sensor data) into a unified learning system.

  • Peer-reviewed publications or significant research contributions in machine learning, robotics, or related areas.

*Please note: The listed base salary range applies to candidates based in the US. Compensation may vary depending on location, experience, and role alignment. We are open to qualified candidates working remotely in Canada

  • Canada - ALL: $177k - $215k CAD

Top Skills

Distributed Training Frameworks
Machine Learning
Neural Networks
Python

Similar Jobs

4 Days Ago
In-Office or Remote
92K-164K Annually
Expert/Leader
92K-164K Annually
Expert/Leader
Artificial Intelligence • Big Data • Healthtech • Information Technology • Machine Learning • Software • Analytics
Lead the design, development, and deployment of AI and machine learning solutions, mentor teams, and drive innovations in healthcare data analysis.
Top Skills: HadoopAzurePythonPyTorchSparkTensorFlow
19 Days Ago
Easy Apply
Remote
USA
Easy Apply
Senior level
Senior level
Software
Lead the design, development, and deployment of scalable machine learning and AI systems for enterprise clients, guiding architecture and mentoring engineers.
Top Skills: AIAWSAzureDockerGCPKubernetesMachine LearningPythonPyTorchTensorFlow
2 Days Ago
In-Office or Remote
113K-193K Annually
Expert/Leader
113K-193K Annually
Expert/Leader
Artificial Intelligence • Big Data • Healthtech • Information Technology • Machine Learning • Software • Analytics
Lead AI ML Engineer responsible for building AI systems, ensuring model quality, and driving architecture and governance standards while collaborating cross-functionally.
Top Skills: Apache AirflowAWSAzure AiDockerGrafanaKubeflowKubernetesMlflowOpentelemetryPrometheusPythonPyTorchScikit LearnSQLTensorFlowVertex AiXgboost

What you need to know about the Colorado Tech Scene

With a business-friendly climate and research universities like CU Boulder and Colorado State, Colorado has made a name for itself as a startup ecosystem. The state boasts a skilled workforce and high quality of life thanks to its affordable housing, vibrant cultural scene and unparalleled opportunities for outdoor recreation. Colorado is also home to the National Renewable Energy Laboratory, helping cement its status as a hub for renewable energy innovation.

Key Facts About Colorado Tech

  • Number of Tech Workers: 260,000; 8.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Lockheed Martin, Century Link, Comcast, BAE Systems, Level 3
  • Key Industries: Software, artificial intelligence, aerospace, e-commerce, fintech, healthtech
  • Funding Landscape: $4.9 billion in VC funding in 2024 (Pitchbook)
  • Notable Investors: Access Venture Partners, Ridgeline Ventures, Techstars, Blackhorn Ventures
  • Research Centers and Universities: Colorado School of Mines, University of Colorado Boulder, University of Denver, Colorado State University, Mesa Laboratory, Space Science Institute, National Center for Atmospheric Research, National Renewable Energy Laboratory, Gottlieb Institute

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account