Zócalo Health Logo

Zócalo Health

Senior Data Engineer

Posted Yesterday
Remote
Hiring Remotely in USA
160K-180K Annually
Senior level
Remote
Hiring Remotely in USA
160K-180K Annually
Senior level
Design, build, and operate production-grade data ingestion pipelines and dbt transformations into a Databricks lakehouse. Establish data quality, monitoring, and observability. Partner with Product and Engineering to enable metrics, dashboards, and longitudinal patient analytics supporting clinical and operational decision-making.
The summary above was generated by AI

Senior Data Engineer

at Zócalo Health 

Remote (Full Time) 

Compensation: $160,000 - $180,000 (per year)


About Us

Zócalo Health is a tech-enabled, community-oriented primary care organization serving people who have historically been underserved by the one-size-fits-all healthcare system. We partner with health plans, providers, and community organizations to deliver culturally competent primary care, behavioral health, and social care.

Our model is built for populations with high medical and social complexity, where fragmented care drives poor outcomes and unnecessary cost. We combine local, community-based teams with virtual care and modern technology to deliver coordinated, whole-person care where members live and receive support.

Founded in 2021, Zócalo Health is backed by leading healthcare and mission-aligned investors and is scaling rapidly across states and populations. We are building a durable care platform designed to perform in constrained healthcare environments and to lead the shift toward accountable, value-based care.


Role Description

The Senior Data Engineer will join Zócalo Health as we build the data platform that powers analytics, product measurement, and operational visibility across the company. This is a hands-on building role at a foundational stage: you will design and ship the pipelines, ingestion frameworks, and data models that the rest of the company depends on.

The primary focus of this role is establishing a scalable, durable data platform. This includes laying the groundwork for longer-term initiatives such as the longitudinal patient record, population-level analytics, and product instrumentation. You will partner closely with Engineering and Product to ensure the data platform supports roadmap priorities and outcome measurement as the company grows.

This position reports to the Principal Data Engineer and partners closely with Engineering and Product.


In your first 12 months, you will:

  • Build and operate production-grade ingestion pipelines from core clinical, operational, and third-party systems into our Databricks lakehouse
  • Develop and maintain dbt models that turn raw data into clean, well-documented, analytics-ready datasets
  • Establish data quality, testing, and monitoring practices that make pipelines reliable and trustworthy
  • Help shape ingestion patterns and architecture standards alongside the Principal Data Engineer
  • Enable company-wide metrics for care outcomes and operations
  • Collaborate with cross-functional leads to develop and iterate on a suite of core operational dashboards, ensuring teams have the self-service tools they need to track company metrics and outcomes.

The Senior Data Engineer will contribute in the following ways:

  • Design, build, and operate production data pipelines across clinical, operational, and third-party systems using API-based ingestion, Change Data Capture (CDC), and event- or webhook-driven patterns
  • Build and maintain transformation layers in dbt, including tests, documentation, and reusable models
  • Develop and refine core analytical and longitudinal data models used across the company
  • Implement testing, monitoring, and observability to ensure data quality, pipeline reliability, and system performance
  • Apply strong engineering fundamentals to improve the scalability, performance, and cost-efficiency of data systems on AWS and Databricks
  • Partner with Product to support metric definitions, outcome measurement, and reporting needs
  • Contribute to engineering standards, code review, and a culture of knowledge sharing and continuous improvement
  • Partner with business, product, and engineering stakeholders to design and build intuitive data visualizations and dashboards that drive actionable insights and program visibility.

Core Technologies (current and planned)

  • Cloud: AWS
  • Lakehouse / data platform: Databricks
  • Transformations: dbt
  • Languages: SQL and Python (primary languages for ingestion and transformation)
  • Ingestion patterns: API-based ingestion, Change Data Capture (CDC), and event- or webhook-driven pipelines, including frameworks such as PySpark and Spark Structured Streaming on Databricks
  • Orchestration: workflow orchestration (e.g., Databricks Workflows or Airflow)

Qualifications

  • 5+ years of experience in data or backend engineering roles with significant data platform responsibility
  • Hands-on experience building and operating production-grade data pipelines and ingestion frameworks
  • Strong proficiency in SQL and Python for data ingestion, processing, and transformation
  • Experience with a cloud data platform; experience with AWS and Databricks (or a comparable Spark-based lakehouse) strongly preferred
  • Experience building SQL-based transformation workflows; hands-on experience with dbt preferred
  • Strong computer science fundamentals, including comfort reasoning about distributed systems and data processing at scale
  • Ability to diagnose and resolve performance, reliability, and data quality issues in complex systems
  • Strong ownership mindset and comfort operating in ambiguous, fast-growing environments
  • Clear communicator able to partner effectively with technical and non-technical stakeholders
  • Experience building dashboards or analytical outputs used by executives and frontline teams

Preferred Qualifications

  • Experience working with healthcare, care delivery, or other regulated data environments
  • Familiarity with HIPAA requirements and handling of sensitive health or customer data
  • Experience building streaming data pipelines or event-driven architectures
  • Experience implementing data observability, lineage, or quality monitoring tools
  • Familiarity with AI-assisted development tools and automation in engineering workflows
  • Early-stage startup experience strongly preferred

What you can expect from Zócalo Health

  • Equity compensation package
  • Comprehensive benefits including medical, dental, and vision
  • 401k
  • Flexible PTO policy - take the time you need to recharge
  • $1,000 home office stipend
  • We provide the equipment needed for this role.
  • Opportunity for rapid career progression with plenty of room for personal growth.

You must be authorized to work in the United States. Remote Work can be done from anywhere in the U.S.


At Zócalo Health Inc., we see diversity and inclusion as a source of strength in transforming healthcare. We believe building trust and innovation are best achieved through diverse perspectives. To us, acceptance and respect are rooted in an understanding that people do not experience things in the same way, including our healthcare system. Individuals seeking employment at Zócalo Health are considered without regard to race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

Similar Jobs

3 Days Ago
Remote or Hybrid
Denver, CO, USA
124K-280K Annually
Senior level
124K-280K Annually
Senior level
Artificial Intelligence • Professional Services • Business Intelligence • Consulting • Cybersecurity • Generative AI
Lead data engineering efforts within Technology Consulting: design data architecture and pipelines, implement AWS/Redshift and ETL solutions, support BI (QlikView/Oracle BI), coach teams, manage client relationships and SLAs, apply systems thinking to optimize outcomes and validate solutions with stakeholders.
Top Skills: AWSDatastageDb2ETLJavaManaged ServicesOracle BiPythonQlikviewRedshiftSlasSQL ServerWorkload Orchestration And Scheduling
4 Days Ago
Remote or Hybrid
Denver, CO, USA
99K-232K Annually
Senior level
99K-232K Annually
Senior level
Artificial Intelligence • Professional Services • Business Intelligence • Consulting • Cybersecurity • Generative AI
Lead data engineering engagements to design, build, and maintain ETL/ELT pipelines and cloud data architectures. Manage client accounts and mentor teams, leverage tools like DataStage, AWS/Redshift, DB2/SQL Server, GoldenGate, and BI/visualization platforms to deliver analytics, performance tuning, and scalable reporting solutions.
Top Skills: AWSBirtCdcDatastageDb2Etl/EltGlueGoldengateJavaPythonQlikviewRedshiftS3SpotfireSQL Server
13 Days Ago
Easy Apply
Remote or Hybrid
United States
Easy Apply
120K-201K Annually
Senior level
120K-201K Annually
Senior level
Artificial Intelligence • Cloud • Computer Vision • Hardware • Internet of Things • Software
Design, build, and maintain scalable Spark-based ETL pipelines and computed tables in a central data lake. Integrate structured and unstructured IoT, sensor, and external data for analytics, model training, and dashboards. Collaborate with Data Science, Analytics, and ML teams to ensure reliable, high-quality customer-facing datasets.
Top Skills: AirflowAWSAzureDagsterData LakeDatabricksDelta LakeETLGCPGitGitPrefectPysparkPythonRest ApisSparksqlSQL

What you need to know about the Colorado Tech Scene

With a business-friendly climate and research universities like CU Boulder and Colorado State, Colorado has made a name for itself as a startup ecosystem. The state boasts a skilled workforce and high quality of life thanks to its affordable housing, vibrant cultural scene and unparalleled opportunities for outdoor recreation. Colorado is also home to the National Renewable Energy Laboratory, helping cement its status as a hub for renewable energy innovation.

Key Facts About Colorado Tech

  • Number of Tech Workers: 260,000; 8.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Lockheed Martin, Century Link, Comcast, BAE Systems, Level 3
  • Key Industries: Software, artificial intelligence, aerospace, e-commerce, fintech, healthtech
  • Funding Landscape: $4.9 billion in VC funding in 2024 (Pitchbook)
  • Notable Investors: Access Venture Partners, Ridgeline Ventures, Techstars, Blackhorn Ventures
  • Research Centers and Universities: Colorado School of Mines, University of Colorado Boulder, University of Denver, Colorado State University, Mesa Laboratory, Space Science Institute, National Center for Atmospheric Research, National Renewable Energy Laboratory, Gottlieb Institute

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account