Upbound Logo

Upbound

Data Engineer - AI (REMOTE)

Posted 2 Days Ago
Remote
3 Locations
Senior level
Remote
3 Locations
Senior level
Lead architect and development of AI-driven data infrastructures. Design data pipelines and systems for ML model training and semantic search capabilities.
The summary above was generated by AI

Upbound is the company behind Crossplane, the open source project which started the control plane revolution in the cloud native community. Upbound is redefining how modern infrastructure is built. As the creators of Crossplane and the pioneers of the Intelligent Control Plane, we are leading the shift toward agentic infrastructure: platforms that reason, adapt, and operate alongside AI-native systems.


We're seeking an exceptional Principal Data Engineer to serve as the technical leader for data infrastructure supporting Upbound's current product suite in addition to our AI initiatives and intelligent control plane capabilities. In this role, you'll architect and drive the development of sophisticated data platforms that power AI-driven features. You'll design solutions that leverage control planes and our Marketplace as a knowledge store, building RAG systems and semantic search capabilities that help users discover and implement infrastructure patterns at scale. You'll create data pipelines that process infrastructure telemetry, configuration data, and usage patterns to train models that make our control planes more intelligent and autonomous. This is an opportunity to work at the intersection of cloud-native infrastructure and artificial intelligence, directly impacting how enterprises manage their infrastructure through Upbound's platform.

What You'll DoTechnical Leadership & Architecture
  • Define and drive the technical vision for data platforms that support AI-powered features in Crossplane and Upbound Spaces
  • Lead the design of data pipelines that transform infrastructure and data into training datasets for ML models
  • Architect vector search and RAG systems that leverage Crossplane Control Planes & Upbound Marketplace as a knowledge store
  • Build data infrastructure that processes resources, extensions, and compositions for semantic search
  • Establish frameworks for collecting, processing, and analyzing infrastructure configuration data
  • Design data pipelines that handle Crossplane-specific data
  • Create infrastructure for indexing and searching Upbound Marketplace content, documentation, and community patterns
  • Develop metrics and monitoring for AI features integrated with Upbound's control plane architecture
Product Development & Strategy
  • Design data systems that power AI agents for infrastructure provisioning & operations, helping users generate and optimize Crossplane compositions
  • Create feature engineering platforms that extract signals from control plane operations, resource status, and reconciliation patterns
  • Implement data infrastructure for training models that predict infrastructure failures, optimize resource allocation, and suggest configuration improvements
  • Drive the development of knowledge graph representations of infrastructure dependencies and relationships
What You'll Bring
  • 10+ years of software/data engineering experience with at least 4 years in technical leadership roles
  • Proven track record building data platforms that support production systems at scale
  • Deep expertise in both traditional data engineering (Spark, Airflow, data lakes) and ML-specific infrastructure (feature stores, model serving)
  • Experience with vector databases (Pinecone, Weaviate, Qdrant, Milvus, pgvector, Opensearch, ElasticSearch)
  • Demonstrated experience with LLM applications, including RAG architectures and semantic search implementations
  • Understanding of Kubernetes, cloud-native architectures, and infrastructure-as-code principles
Technical Expertise
  • Strong understanding of data requirements for AI/ML systems: training pipelines, feature stores, and inference infrastructure
  • Hands-on experience building knowledge bases and semantic search systems for technical documentation and code
  • Experience with embedding models for code and technical documentation
  • Knowledge of time-series data processing for infrastructure metrics and events
  • Understanding of graph databases and their application to infrastructure dependency modeling
Leadership Qualities
  • Exceptional technical judgment with the ability to navigate both the AI and cloud-native landscapes
  • Demonstrate a positive attitude and foster an environment of experimentation and innovation
  • Strong ability to translate infrastructure management problems into data engineering solutions
  • Passion for making infrastructure management more intelligent and accessible through AI
  • Deep empathy for platform engineers and understanding of their operational challenges

A plus if you:

  • Have direct experience with Crossplane and Upbound products
  • Experience building AI features for developer tools or infrastructure platforms
  • Understanding of enterprise compliance requirements for infrastructure platforms
  • Knowledge of policy engines 

#LI-REMOTE

#LI-REMOTE

Why Upbound?

At Upbound, you’ll help shape the systems and strategies that drive predictable, scalable growth in a product-led company embracing usage-based models. If you're excited to build from the ground up, work with cutting-edge cloud technologies, and directly impact how revenue is generated and scaled—this is your seat at the table.

About Upbound

Upbound is pioneering infrastructure platforms for the Agentic AI Era, serving Fortune 500 companies and platform engineers across more than 100 countries. The company empowers infrastructure and platform teams with Intelligent Control Planes - based on Kubernetes and Crossplane - that provision, operate, and adapt so platforms are ready for both humans and AI agents. Upbound is the creator and primary maintainer of Crossplane, the popular open-source framework for building cloud-native control planes, with over 100 million downloads and adoption by more than 1,000 teams worldwide. A Series B startup backed by GV (formerly Google Ventures), Altimeter Capital, and Intel Capital, Upbound has raised $69M to date. For more information, visit www.upbound.io.


Top Skills

Airflow
Elasticsearch
Kubernetes
Milvus
Opensearch
Pgvector
Pinecone
Qdrant
Spark
Vector Databases
Weaviate

Similar Jobs

Yesterday
In-Office or Remote
99 Locations
Junior
Junior
HR Tech • Other • Professional Services
As a Software Developer, you will create training data for AI models, audit tool usage, write test cases, and improve technical outputs. No prior AI experience is necessary, but coding skills are critical.
Top Skills: APIsC++GoJavaJavaScriptJSONPythonRuby
2 Minutes Ago
Remote or Hybrid
Montréal, QC, CAN
Senior level
Senior level
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Responsible for selling service offerings to enhance customer adoption of products, developing strategies, managing opportunities, and collaborating with account teams and partners.
Top Skills: Ai-Enhanced TechnologySaas Software
2 Minutes Ago
Remote or Hybrid
Toronto, ON, CAN
10-15
Expert/Leader
10-15
Expert/Leader
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
The Director will lead ServiceNow's government relations and public policy strategy in Canada, focusing on AI, cybersecurity, and data governance, providing strategic insights and advocacy.
Top Skills: Ai GovernanceData RegulationDigital Transformation

What you need to know about the Colorado Tech Scene

With a business-friendly climate and research universities like CU Boulder and Colorado State, Colorado has made a name for itself as a startup ecosystem. The state boasts a skilled workforce and high quality of life thanks to its affordable housing, vibrant cultural scene and unparalleled opportunities for outdoor recreation. Colorado is also home to the National Renewable Energy Laboratory, helping cement its status as a hub for renewable energy innovation.

Key Facts About Colorado Tech

  • Number of Tech Workers: 260,000; 8.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Lockheed Martin, Century Link, Comcast, BAE Systems, Level 3
  • Key Industries: Software, artificial intelligence, aerospace, e-commerce, fintech, healthtech
  • Funding Landscape: $4.9 billion in VC funding in 2024 (Pitchbook)
  • Notable Investors: Access Venture Partners, Ridgeline Ventures, Techstars, Blackhorn Ventures
  • Research Centers and Universities: Colorado School of Mines, University of Colorado Boulder, University of Denver, Colorado State University, Mesa Laboratory, Space Science Institute, National Center for Atmospheric Research, National Renewable Energy Laboratory, Gottlieb Institute

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account