MDCalc Logo

MDCalc

Senior Data Engineer

Posted 2 Days Ago
Remote
Hiring Remotely in USA
Senior level
Remote
Hiring Remotely in USA
Senior level
Design and maintain data pipelines for analytics and operational decision-making. Collaborate with teams to ensure reliable data systems and governance, utilizing Python and SQL.
The summary above was generated by AI
The Opportunity

Since 2005, MDCalc has been an essential part of the clinician’s workflow to help achieve better patient outcomes. Actively used by more than 65% of physicians worldwide, MDCalc is the most broadly used medical reference – at the point-of-care – for clinical decision tools and content, and one of only four references used by >50% of US HCPs. These evidence-based tools and content are used by millions of medical professionals globally and support 50+ specialties and cover 200+ patient conditions.

To continue accelerating this growth, we are expanding the Engineering team with a Senior Data Engineer who will help build and scale the data infrastructure that powers decision-making across the company. This is an opportunity for an experienced data engineer who enjoys working close to product and business teams, building reliable data systems, and transforming complex data into actionable insights.

This role will help define how data moves through MDCalc’s platform, designing the pipelines and architecture that enable reliable analytics, product insights, and data-driven decision making across the organization.

The Role

As a Senior Data Engineer at MDCalc, you will design, build, and maintain the data pipelines and infrastructure that support analytics, product insights, and operational decision-making across the company. A key part of this role is managing how data moves across systems, shaping and transforming it through robust ETL/ELT pipelines so it can be reliably used by downstream analytics, product, and business applications.

You will work closely with product, engineering, and business stakeholders to ensure data is reliable, accessible, and structured for effective use. This includes building programmatic data pipelines, primarily in Python, to extract, transform, and deliver data across MDCalc’s systems and data platform.

You will also contribute to the architecture of MDCalc’s data platform, helping define how data is structured and delivered across the organization. As a senior individual contributor, you will help establish best practices for data modeling, pipeline development, and data governance.

The responsibilities of this individual include the following, but are not limited to:

  • Design, build, and maintain scalable data pipelines and ELT/ETL workflows that support analytics, operational reporting, and business intelligence use cases

  • Build programmatic data pipelines (primarily in Python) that extract data from application and third-party systems, transform it into usable formats, and deliver it to downstream data platforms and consumers

  • Own and improve core data models and transformations to ensure data is accurate, well-structured, and easy for stakeholders to use

  • Partner with Product, Engineering, and Analytics teams to understand data needs and translate them into reliable data solutions

  • Develop and maintain systems that move data across the platform, ensuring it is properly shaped, structured, and available for downstream analysis and product use cases

  • Help shape and maintain the architecture of MDCalc’s modern data stack, including warehousing, orchestration, transformation, and monitoring

  • Improve data quality, observability, and reliability through testing, validation, and proactive monitoring practices

  • Support the ingestion and integration of data from a variety of application, product, and third-party sources

  • Establish and reinforce best practices around data governance, documentation, naming conventions, and maintainability

  • Identify and drive opportunities to improve performance, scalability, and efficiency across our data systems

  • Design efficient data workflows that query, transform, and deliver datasets to downstream systems and stakeholders

  • Contribute to technical direction and architectural decisions as a senior member of the team

  • Serve as a thought partner to teammates and cross-functional stakeholders on how to best leverage data across the business

Your Background
  • 5+ years experience in data engineering

  • Strong SQL skills and experience building and optimizing data models for analytical use cases

  • Experience building and maintaining reliable data pipelines in a modern cloud data environment

  • Strong proficiency in Python or a comparable programming language commonly used in data engineering

  • Experience building programmatic ETL/ELT pipelines using Python or similar tools to move and transform data across systems

  • Experience working with data warehouses such as Snowflake

  • Experience with transformation and orchestration tools such as dbt, Airflow, Dagster, or similar tools

  • Strong understanding of data architecture, data modeling, and pipeline design best practices

  • Ability to operate independently, prioritize effectively, and drive work forward in a fast-moving environment

What MDCalc offers:
  • Ability to make a true difference in medicine: MDCalc is the most broadly used medical reference used by 65% of physicians worldwide.

  • Medical, Dental, & Vision coverage, with option to extend to your dependents

  • Company-sponsored short-term insurance

  • Fully-paid 8 week parental leave, after 6 months of employment

  • Company-sponsored 401k, after 3 months of employment

  • Unlimited vacation for salaried roles - we trust you to take the time you need

  • Tri-annual company offsites to connect, reflect, and plan together

  • Work from home monthly stipend

  • Hybrid work environment with a great team office in Greenwich Village, NYC

  • A culture of fun and motivated team members who believe in a greater mission here at

Top Skills

Airflow
Dagster
Dbt
Python
Snowflake
SQL

Similar Jobs

2 Days Ago
In-Office or Remote
92K-164K Annually
Senior level
92K-164K Annually
Senior level
Artificial Intelligence • Big Data • Healthtech • Information Technology • Machine Learning • Software • Analytics
Design and maintain data pipelines and models, optimize data processing on cloud platforms, and enforce data governance while collaborating with various stakeholders.
Top Skills: Apache AirflowAWSAzureDatabricksDockerGCPKubernetesMicrosoft PurviewPysparkPythonSnowflakeSQL
3 Days Ago
Remote
United States
150K-175K Annually
Senior level
150K-175K Annually
Senior level
Software
The Senior Data Engineer will lead data architecture design, build reliable data pipelines, mentor team members, ensure data quality, and optimize ETL processes.
Top Skills: AirflowAWSBigQueryCloudFormationDagsterKafkaPrefectPysparkPythonRedshiftSnowflakeSparkSQLTerraform
12 Hours Ago
Remote or Hybrid
United States
102K-169K Annually
Senior level
102K-169K Annually
Senior level
Artificial Intelligence • Automotive • Greentech • Information Technology • Machine Learning • Software • Cybersecurity
The Senior Data Engineer will design and implement data architectures and pipelines, ensure data quality, automate processes, and collaborate with stakeholders to provide actionable insights using AI and big data technologies.
Top Skills: AWSAzureHadoopJavaKafkaPythonScalaSnowflakeSparkSQL

What you need to know about the Colorado Tech Scene

With a business-friendly climate and research universities like CU Boulder and Colorado State, Colorado has made a name for itself as a startup ecosystem. The state boasts a skilled workforce and high quality of life thanks to its affordable housing, vibrant cultural scene and unparalleled opportunities for outdoor recreation. Colorado is also home to the National Renewable Energy Laboratory, helping cement its status as a hub for renewable energy innovation.

Key Facts About Colorado Tech

  • Number of Tech Workers: 260,000; 8.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Lockheed Martin, Century Link, Comcast, BAE Systems, Level 3
  • Key Industries: Software, artificial intelligence, aerospace, e-commerce, fintech, healthtech
  • Funding Landscape: $4.9 billion in VC funding in 2024 (Pitchbook)
  • Notable Investors: Access Venture Partners, Ridgeline Ventures, Techstars, Blackhorn Ventures
  • Research Centers and Universities: Colorado School of Mines, University of Colorado Boulder, University of Denver, Colorado State University, Mesa Laboratory, Space Science Institute, National Center for Atmospheric Research, National Renewable Energy Laboratory, Gottlieb Institute

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account