Iambic Therapeutics Logo

Iambic Therapeutics

Software Engineer II, Data

Posted Yesterday
Remote
Hiring Remotely in United States
152K-190K Annually
Senior level
Remote
Hiring Remotely in United States
152K-190K Annually
Senior level
The Data Engineer will design and optimize data pipelines for AI model training, improve data storage, and enhance Python-based workflows, collaborating with ML engineers on performance.
The summary above was generated by AI

JOB SUMMARY

We’re seeking a Data Engineer to build and optimize data pipelines for AI model training. You’ll work with large datasets, enhance data storage, and improve Python-based workflows. Collaborating closely with ML engineers, you’ll enhance the performance of Python-based data workflows. Ideal candidates have experience with ETL systems, orchestration tools, and multi-terabyte data processing. Familiarity with AWS, Kubernetes, and data lake technologies is a plus. This role is remote, with preference for candidates on the East Coast.

KEY RESPONSIBILITIES

  • Design and improve data pipelines that process large, multi-modal datasets from a variety of internal and external sources into training datasets for AI models.

  • Evolve our data storage layer to support analytics, schema evolution, reproducibility, and efficient data access.

  • Collaborate with ML engineers to improve the performance and reliability of Python-based data processing workflows.

QUALIFICATIONS

  • Minimum of 8 years of related experience with a Bachelor’s degree; or 6 years and a Master’s degree; or a PhD with 3 years experience; or equivalent experience.

  • Proven ability to design flexible, maintainable ETL systems.

  • Experience with data pipeline orchestration tools such as Prefect, Airflow, Argo, Databricks, or Spark.

  • Understanding of the ML model lifecycle; prior work with scientific or ML workflows is a plus.

  • Hands-on experience with multi-terabyte scale data processing.

  • Familiarity with AWS; Kubernetes experience is a bonus.

  • Knowledge of data lake technologies such as Parquet, Iceberg, AWS Glue etc.

  • Strong Python software engineering skills.

  • Pragmatic mindset — able to evaluate tradeoffs find solutions that empower ML researchers to move quickly.

  • Background in bioinformatics or chemistry is a plus.

ABOUT IAMBIC THERAPEUTICS

Iambic is a clinical-stage life-science and technology company developing novel medicines using its AI-driven discovery and development platform. Based in San Diego and founded in 2020, Iambic has assembled a world-class team that unites pioneering AI experts and experienced drug hunters. The Iambic platform has demonstrated delivery of new drug candidates to human clinical trials with unprecedented speed and across multiple target classes and mechanisms of action. Iambic is advancing a pipeline of potential best-in-class and first-in-class clinical assets, both internally and in partnership, to address urgent unmet patient need. Learn more about the Iambic team, platform, pipeline, and partnerships at iambic.ai.

MISSION & CORE VALUES

Our mission is to deliver better medicines through innovations in AI-based discovery technologies. The culture and work at Iambic Therapeutics are profoundly strengthened by the diversity of our people and our differences in background, culture, national origin, religion, sexual orientation, and life experiences. We are committed to building an inclusive environment where a diverse group of talented humans work together to discover therapeutics and create technologies.

PAY AND BENEFITS

We offer industry leading competitive pay, company paid healthcare, flexible spending accounts, voluntary life insurance, 401K matching, and uncapped vacation to our team. We are in a brand-new state-of-the art facility in beautiful San Diego with an onsite gym, dining, and easy access to great places to live and play.

Top Skills

Airflow
Argo
AWS
Aws Glue
Databricks
Etl Systems
Iceberg
Kubernetes
Parquet
Prefect
Python
Spark

Similar Jobs

3 Days Ago
Remote
United States of America
111K-231K Annually
Mid level
111K-231K Annually
Mid level
AdTech • Digital Media • Information Technology • Other
Design, build, and maintain scalable data pipelines and backend services using GCP, handle data processing, and ensure reliability and quality.
Top Skills: AirflowBigQueryCi/CdCloud ComposerDataflowDataprocDockerGCPGoJavaKubernetesLinuxPub/SubPythonTerraformUnix
8 Days Ago
Remote
USA
106K-155K Annually
Mid level
106K-155K Annually
Mid level
Financial Services
As a Software Engineer II, you'll develop and maintain data pipelines, implement data integration strategies, and collaborate with team members to deliver data solutions. Responsibilities include coding, testing, and troubleshooting while engaging in continual learning.
Top Skills: AirflowSparkAWSDatabricksDbtPower BIPythonSQL
10 Days Ago
Remote
US
158K-213K Annually
Senior level
158K-213K Annually
Senior level
Software
As a Software Development Engineer II, you will build internal data tools, support engineers, and enhance data processing systems on AWS.
Top Skills: AirflowAWSNode.jsPythonSpark

What you need to know about the Colorado Tech Scene

With a business-friendly climate and research universities like CU Boulder and Colorado State, Colorado has made a name for itself as a startup ecosystem. The state boasts a skilled workforce and high quality of life thanks to its affordable housing, vibrant cultural scene and unparalleled opportunities for outdoor recreation. Colorado is also home to the National Renewable Energy Laboratory, helping cement its status as a hub for renewable energy innovation.

Key Facts About Colorado Tech

  • Number of Tech Workers: 260,000; 8.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Lockheed Martin, Century Link, Comcast, BAE Systems, Level 3
  • Key Industries: Software, artificial intelligence, aerospace, e-commerce, fintech, healthtech
  • Funding Landscape: $4.9 billion in VC funding in 2024 (Pitchbook)
  • Notable Investors: Access Venture Partners, Ridgeline Ventures, Techstars, Blackhorn Ventures
  • Research Centers and Universities: Colorado School of Mines, University of Colorado Boulder, University of Denver, Colorado State University, Mesa Laboratory, Space Science Institute, National Center for Atmospheric Research, National Renewable Energy Laboratory, Gottlieb Institute

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account