Applied Materials Logo

Applied Materials

Databricks Machine Learning (ML) Administrator

Reposted 17 Days Ago
In-Office or Remote
Hiring Remotely in Ontario, ON
Senior level
In-Office or Remote
Hiring Remotely in Ontario, ON
Senior level
The Databricks ML Administrator will manage ML environments, oversee governance, secure operations, and ensure reliable model training and deployment.
The summary above was generated by AI

Who We Are

Applied Materials is a global leader in materials engineering solutions used to produce virtually every new chip and advanced display in the world. We design, build and service cutting-edge equipment that helps our customers manufacture display and semiconductor chips – the brains of devices we use every day. As the foundation of the global electronics industry, Applied enables the exciting technologies that literally connect our world – like AI and IoT. If you want to push the boundaries of materials science and engineering to create next generation technology, join us to deliver material innovation that changes the world. 

What We Offer

Location:

Home / Mobile,CAN-ONTARIO-001

You’ll benefit from a supportive work culture that encourages you to learn, develop, and grow your career as you take on challenges and drive innovative solutions for our customers. We empower our team to push the boundaries of what is possible—while learning every day in a supportive leading global company. Visit our Careers website to learn more. 

At Applied Materials, we care about the health and wellbeing of our employees. We’re committed to providing programs and support that encourage personal and professional growth and care for you at work, at home, or wherever you may go. Learn more about our benefits

We are seeking an experienced Databricks Machine Learning (ML) Administrator to own the end‑to‑end administration, governance, and secure operations of our ML environments on Databricks. In this role, you will configure and manage ML compute, enforce access and governance for MLflow assets (experiments and model registry), and ensure reliable model training, deployment, and serving at scale. You will partner closely with Data Engineering, ML Engineering, Security, and FinOps to deliver a robust, compliant, and cost‑efficient ML platform.

Key Responsibilities

Platform Operations & Compute

  • Deploy, configure, and maintain Databricks ML clusters (CPU/GPU), SQL Warehouses, and cluster policies optimized for ML workloads; apply autoscaling, pools, and runtime selection (including Databricks Runtime for ML).
  • Administer Jobs and Pipelines that orchestrate training, evaluation, and batch/real‑time scoring; manage run‑as identities and default privileges to meet least‑privilege requirements.
  • Establish and enforce compute access controls (attach/restart/manage) and workspace object permissions; standardize policies to prevent configuration drift.

ML Lifecycle Governance (MLflow & Serving)

  • Govern MLflow Experiments and Registered Models with fine‑grained permissions (read/edit/manage), standardizing experiment tracking, model versioning, stage transitions, and approvals.
  • Operate and secure model serving endpoints, including permissions for view, query, and manage actions; implement change control for deployments.

Data Access & Unity Catalog Alignment

  • Coordinate with data governance to implement metastore, catalog, schema, and table‑level permissions that support feature engineering, training, and evaluation while safeguarding sensitive data.
  • Apply enterprise identity and access management patterns across account and workspace scopes (users, groups, service principals) using SCIM/SSO standards.

Security, Compliance & Auditability

  • Enforce workspace object ACLs, compute isolation modes, secret handling, and log‑access controls for ML clusters; implement Spark ACL settings per policy.
  • Operationalize system tables/audit logs and usage analytics to meet regulatory and internal control requirements; partner with Security/GRC for periodic reviews.

Reliability, Monitoring & Incident Response

  • Monitor cluster health, job success/failure, serving endpoint SLOs, and capacity; establish alerting and incident runbooks for ML infrastructure.
  • Lead post‑incident reviews and continuous improvement for platform reliability and developer productivity.

Cost Management & FinOps

  • Implement and iterate compute policies, budget policies, and usage dashboards to optimize GPU/CPU consumption for ML training and serving.

Enablement & Best Practices

  • Define and evangelize ML platform standards: environment baselines, cluster policies, experiment hygiene, model promotion flows, and serving change‑management.
  • Partner with ML teams to align platform features (AutoML, Feature/Vector stores, model serving) to use cases and performance targets.

Required Qualifications

  • 5+ years administering Databricks or similar ML/data platforms (e.g., Spark‑based platforms) with hands‑on experience in workspace administration, compute policies, and MLflow governance.
  • Proven expertise managing Databricks permissions (workspaces, clusters, jobs, experiments, registered models, serving endpoints) via UI, REST/CLI.
  • Strong understanding of Unity Catalog concepts and implementing catalog/schema/table access for ML workflows.
  • Working knowledge of Python/Scala sufficient to understand notebooks, init scripts, and operational tooling (no application development required).
  • Experience with SSO/SCIM, enterprise identity providers, and group‑based access patterns across account and workspace scopes.
  • Familiarity with audit logging, system tables, and cost‑management techniques in Databricks.

Preferred Qualifications

  • Databricks Platform Administrator accreditation (or equivalent) and experience with serverless/SQL warehouses, cluster pools, and model serving.
  • Experience operationalizing run‑as service principals for jobs and pipelines and separating ownership vs. execution permissions.
  • Exposure to infrastructure‑as‑code (e.g., Terraform) for permissions/policies and environment baselining.
  • Understanding of data protection controls (masking, row/column access) and secure handling of secrets and logs in ML contexts.

Tools & Technologies You Will Use

  • Databricks Workspace & Account Console, Unity Catalog, Jobs, Pipelines, MLflow, Model Serving, Databricks Runtime for ML, SQL Warehouses.
  • Databricks CLI/REST APIs for permissions and automation; optional IaC (Terraform) for policy/permission as code.

Additional Information

Time Type:

Full time

Employee Type:

Assignee / Regular

Travel:

Yes, 20% of the Time

Relocation Eligible:

No

Applied Materials is an Equal Opportunity Employer. Qualified applicants will receive consideration for employment without regard to race, color, national origin, citizenship, ancestry, religion, creed, sex, sexual orientation, gender identity, age, disability, veteran or military status, or any other basis prohibited by law.

Similar Jobs

An Hour Ago
Easy Apply
Remote
Easy Apply
136K-199K Annually
Senior level
136K-199K Annually
Senior level
Big Data • Fintech • Mobile • Payments • Financial Services
The Key Account Manager will manage a portfolio of strategic accounts, driving growth through contract negotiations, marketing strategies, and cross-functional team collaboration to optimize partner performance.
Top Skills: Business Intelligence ToolsSalesforce
An Hour Ago
Easy Apply
Remote
Easy Apply
191K-271K Annually
Senior level
191K-271K Annually
Senior level
Big Data • Fintech • Mobile • Payments • Financial Services
Lead analytics and insights for Revenue team, monitoring trends, guiding strategic initiatives, and developing a high-performing team to drive organizational growth.
Top Skills: Ai ToolsData Visualization ToolsPythonRSQL
An Hour Ago
Easy Apply
Remote
Easy Apply
102K-142K Annually
Mid level
102K-142K Annually
Mid level
Big Data • Fintech • Mobile • Payments • Financial Services
Manage and enhance a centralized People knowledge ecosystem, leveraging AI for automation, improving employee support experiences, and driving cross-functional initiatives.
Top Skills: AIKnowledge Management Systems

What you need to know about the Colorado Tech Scene

With a business-friendly climate and research universities like CU Boulder and Colorado State, Colorado has made a name for itself as a startup ecosystem. The state boasts a skilled workforce and high quality of life thanks to its affordable housing, vibrant cultural scene and unparalleled opportunities for outdoor recreation. Colorado is also home to the National Renewable Energy Laboratory, helping cement its status as a hub for renewable energy innovation.

Key Facts About Colorado Tech

  • Number of Tech Workers: 260,000; 8.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Lockheed Martin, Century Link, Comcast, BAE Systems, Level 3
  • Key Industries: Software, artificial intelligence, aerospace, e-commerce, fintech, healthtech
  • Funding Landscape: $4.9 billion in VC funding in 2024 (Pitchbook)
  • Notable Investors: Access Venture Partners, Ridgeline Ventures, Techstars, Blackhorn Ventures
  • Research Centers and Universities: Colorado School of Mines, University of Colorado Boulder, University of Denver, Colorado State University, Mesa Laboratory, Space Science Institute, National Center for Atmospheric Research, National Renewable Energy Laboratory, Gottlieb Institute

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account