Axle Informatics

Data Platform Engineer

Posted 24 Days Ago
Remote
Hiring Remotely in USA
125K-150K Annually
Mid level
The Data Platform Engineer will develop and maintain the core data infrastructure for health research, focusing on data pipelines, orchestration, and quality systems. Responsibilities include coding, data modeling, and supporting data ingestion and transformation processes.
The summary above was generated by AI

(ID: 2026-1524)

 

Axle is a bioscience and information technology company that offers advancements in translational research, biomedical informatics, and data science applications to research centers and healthcare organizations nationally and abroad. With experts in biomedical science, software engineering, and program management, we focus on developing and applying research tools and techniques to empower decision-making and accelerate research discoveries. We work with some of the top research organizations and facilities in the country including multiple institutes at the National Institutes of Health (NIH).


Benefits We Offer:

  • 100% Medical, Dental & Vision Coverage for Employees
  • Paid Time Off and Paid Holidays
  • 401K match up to 5%
  • Educational Benefits for Career Growth
  • Employee Referral Bonus
  • Flexible Spending Accounts:
    • Healthcare (FSA)
    • Parking Reimbursement Account (PRK)
    • Dependent Care Assistance Program (DCAP)
    • Transportation Reimbursement Account (TRN)

About the Mission
Join the team at the forefront of revolutionizing medical research in the United States. We are building and maintaining the foundational infrastructure of the National Clinical Cohort Collaborative (N3C)—the nation’s largest and most significant public repository of harmonized electronic health record (EHR) data.

What began as a critical response to the COVID-19 pandemic has evolved into a multi-disease, terabyte-scale data resource that enables researchers across the country to accelerate discovery and improve public health outcomes. The platform integrates EHRs, claims, registries, and other data sources in a secure, regulated environment to support thousands of scientists.

This role is an opportunity to contribute to the core data platform that makes this research possible.


The Role
We are seeking a mid-level Data Platform Engineer to help build and operate the core data infrastructure that powers large-scale, regulated healthcare and research datasets. This role is ideal for an engineer who has moved beyond “entry level,” understands how production systems behave, and wants to grow into owning complex pipelines, orchestration logic, and platform reliability.

You’ll work alongside senior engineers and informatics experts to design, implement, and maintain ingestion, transformation, orchestration, and data quality systems that are reliable, observable, and secure.


What You’ll Do

Build Production-Grade Data Systems

  • Write clean, modular, well-tested Python code for data pipelines and platform services.
  • Use decorators, context managers, and unit tests to ensure correctness and maintainability.
  • Contribute to shared libraries and reusable components across the platform.


Design and Maintain Data Models

  • Implement relational data models aligned with medallion architectures (bronze/silver/gold).
  • Support schema evolution and backward-compatible changes.
  • Work with modern table formats such as Apache Iceberg.

Data Orchestration & Ingestion

  • Build and maintain data workflows using Dagster (preferred) or Airflow.
  • Manage sensors, schedules, and complex job dependencies.
  • Implement ingestion pipelines using Airbyte or similar ELT tools.

Transformation & Data Quality

  • Implement idempotent transformation logic using SQLMesh/Tobiko (preferred) or dbt.
  • Add data quality checks and validation gates using frameworks like Great Expectations.
  • Partner with upstream and downstream users to diagnose and resolve data issues.


Containerization & CI/CD

  • Build, debug, and optimize Docker images for local and production environments.
  • Contribute to CI/CD pipelines supporting automated testing and deployment.
  • Follow modern Git workflows including branching strategies, pull requests, and code reviews.

Infrastructure, Cloud & Security

  • Read and modify infrastructure-as-code using Terraform.
  • Work with AWS primitives (S3, Lambda, Glue, Fargate), with a focus on portability and migration toward open-source, cloud-agnostic alternatives.
  • Apply least-privilege and identity-based access concepts (OIDC/IAM).
  • Operate comfortably within regulated environments (HIPAA, FedRAMP).

Documentation & Collaboration

  • Document data flows, system architecture, and operational procedures clearly.
  • Collaborate closely with senior engineers, informaticists, and project stakeholders.
  • Participate in design reviews and contribute ideas for improving platform reliability and scalability.

What You’ll Bring
Required

  • 2–4 years of experience in Data Engineering or Backend Software Engineering.
  • Strong proficiency in Python and SQL.
  • Solid understanding of relational theory and data modeling.
  • Experience working with orchestration tools (Dagster, Airflow, or similar).
  • Familiarity with containerization and Docker-based workflows.
  • Experience working with version control, CI/CD, and collaborative development practices.
  • Ability to write clear technical documentation.

Nice to Have

  • Experience with Iceberg, Airbyte, Great Expectations, SQLMesh, or dbt.
  • Prior work on regulated data platforms (healthcare, government, finance).
  • Interest in data platform architecture and long-term system evolution.

 


Disclaimer: The above description is meant to illustrate the general nature of work and level of effort being performed by individuals assigned to this position or job description. It is not intended as a complete list of all skills, responsibilities, duties, and/or assignments required. Individuals may be required to perform duties outside of their position, job description, or responsibilities as needed.

The diversity of Axle’s employees is a tremendous asset. We are firmly committed to providing equal opportunity in all aspects of employment and will not tolerate any illegal discrimination or harassment based on age, race, gender, religion, national origin, disability, marital status, covered veteran status, sexual orientation, status with respect to public assistance, or other characteristics protected under state, federal, or local law. We also work to deter those who aid, abet, or induce discrimination or coerce others to discriminate.

Accessibility: If you need an accommodation as part of the employment process please contact: [email protected]

This role has a market-competitive salary with an anticipated base compensation range listed below. Actual salaries will vary depending on a candidate’s experience, qualifications, skills, and location.

Salary Range
$125,000 to $150,000 USD

Top Skills

Airflow
AWS (S3, Lambda, Glue, Fargate)
Dagster
dbt
Docker
Great Expectations
Python
SQL
SQLMesh
Terraform
