Senior Data Engineer - Databricks

SugarAI

Posted Yesterday

Remote or Hybrid

Hiring Remotely in US

155K-185K Annually

Senior level

About SugarAI

SugarAI is redefining CRM for the age of AI.
We’re delivering on the original promise of CRM—turning fragmented customer and revenue signals into clear, prioritized action. Instead of more dashboards or surface-level insights, we help teams focus on what matters most and know exactly what to do next.
More than two decades after our founding, we’re entering a new chapter with clarity and momentum—building intelligent, intuitive solutions that work within the flow of how teams actually sell and serve. We’re focused on solving complex, real-world challenges where relationships, context, and precision make all the difference.
Our global team is united by a shared commitment to impact, ownership, and continuous growth. We create an environment where thoughtful ideas move quickly, where people are trusted to lead, and where flexibility supports how great work gets done.
If you’re excited to help shape what’s next in AI-driven CRM—and build technology that drives real outcomes—we’d love to meet you.

Where You Fit In:

The Sugar Predict platform powers revenue intelligence for mid-market enterprises by fusing ERP and CRM data into actionable insights. As a Senior Data Engineer, you will own the Databricks pipelines that make this possible, driving production reliability, cost efficiency, and platform growth through customer onboarding and legacy modernization. You will work closely with ML engineers, product teams, and the Enterprise Architecture team to ensure the data backbone behind Sugar Predict is always fast, clean, and ready to deliver at global scale.

Impact You Will Make in the Role:

Own Databricks production support for the Sugar Predict data platform, including monitoring, alerting, and incident response across all production data flows

Maintain and report on SLA performance metrics for data pipeline delivery, ensuring visibility into platform health and accountability across internal and external stakeholders

Identify and implement pipeline optimizations that reduce Databricks compute costs, improve throughput, and reduce processing windows while tracking impacts through measurable KPIs

Migrate legacy ETL/ELT pipelines to Databricks, building automation tooling to reduce manual intervention and ensure uninterrupted data delivery during transitions

Support new customer onboarding by provisioning, validating, and hardening tenant data pipelines that deliver reliable, isolated data from day one

Design and build high-performance Databricks pipelines that ingest, transform, and serve ERP and CRM data at scale across both Azure and AWS environments

Own the Delta Lake architecture including schema design, partitioning strategies, data quality enforcement, and incremental processing patterns

Enforce data security best practices across Databricks environments, including role-based access control, secrets management, and compliance requirements for enterprise CRM and ERP data

Implement data quality monitoring and observability across pipeline health and ML model inputs, ensuring data integrity that directly supports Sugar Predict prediction accuracy

Apply and enforce multi-tenant data isolation patterns ensuring reliable, secure data delivery across Sugar Predict enterprise customers

Partner with the Enterprise Architecture team to ensure Sugar Predict data pipelines integrate seamlessly with the broader SugarAI product ecosystem

Support a globally distributed operation through on-call rotation and after-hours incident response, meeting SLAs across multiple time zones

Maintain technical documentation, runbooks, and architectural decision records, contributing to team knowledge sharing and operational readiness across on-call and incident response scenarios

Apply CI/CD best practices to data pipeline development, including version control, automated testing, and deployment tooling to ensure reliable and repeatable pipeline delivery

What You Will Bring:

4+ years of data engineering experience

At least 2 years on Databricks or the Apache Spark ecosystem across Azure and/or AWS

Proficiency in PySpark, SQL, and Python, with a strong track record of building and operating production-grade pipelines under SLA constraints

Hands-on experience with Delta Lake including schema evolution, ACID transactions, optimize/vacuum lifecycle, and both incremental and streaming processing patterns

Hands-on experience with pipeline performance tuning and compute optimization in production Databricks environments

Solid working knowledge of PostgreSQL including query optimization, schema design, and use as a source or sink in production data pipelines

Experience supporting and maintaining legacy ETL tooling (SSIS, Informatica, custom Python/SQL pipelines, or similar) in production

Experience supporting large-scale multi-tenant architectures with a focus on tenant isolation, per-tenant performance, and data privacy, including navigating tools and platforms that default to single-tenant assumptions

Proven ability to work collaboratively across data science, product, and infrastructure teams, owning end-to-end delivery in a cross-functional environment

Strong understanding of data governance, security, and compliance principles, including access control, data privacy, and protection of sensitive enterprise data across multi-tenant environments

Preferred Qualifications/Experience:

Experience operating Databricks workspaces across both Azure and AWS, including cost governance, cluster management, and cross-cloud data access

Experience optimizing Databricks workloads in a Serverless environment, including compute cost governance and performance tuning for serverless compute

Experience with Microsoft SQL Server in a data engineering or ETL context

Exposure to ML feature engineering or feature stores (Databricks Feature Store, Feast, or similar) supporting predictive analytics

Experience with customer onboarding automation or IaC patterns for provisioning tenant data pipelines at scale

Databricks Certified Data Engineer Associate or Professional certification

We understand that no candidate is perfectly qualified for any job. Experience comes in different forms; many skills are transferable; and passion goes a long way. Even more important than your resume is a clear demonstration of dedication, impact, and the ability to thrive in a fluid and collaborative environment. We want you to learn new things in this role, and we encourage you to apply if your experience is close to what we’re looking for. We also know that diversity of background and thought makes for better problem solving and more creative thinking, which is why we're dedicated to adding new perspectives to the team.

Benefits and Perks:

Beyond a stellar work environment, friendly people, and inspiring work, we have some sweet benefits and perks:

· Excellent healthcare package for you and your family

· Savings and Investment – 401(k) match

· Unlimited Paid Time Off

· Paid Parental Leave

· Online Legal Services (Rocket Lawyer)

· Financial Planning Services (Origin)

· Discounted Pet Insurance (Embrace Pet Insurance)

· Corporate Benefit Program (Working Advantage). This benefit offers you exclusive travel and entertainment offers and special discounts that are not available to the general public

· Health and Wellness Reimbursement Program

· Travel Discounts

· Educational Resources - Career & Personal Development Program

· Employee Referral Bonus Program

· We are a merit-based company - many opportunities to learn, excel and grow your career!

If you require a reasonable accommodation to search for a job opening or submit an application, please call +1 (877) 842-7276 with your request and contact information.

Our company uses E-Verify to confirm the employment eligibility of all newly hired employees. To learn more about E-Verify, including your rights and responsibilities, please visit www.dhs.gov/E-Verify.

Denver, CO, United States
