TechBiz Global Logo

TechBiz Global

Senior AI Data Engineer

Posted Yesterday
Be an Early Applicant
Remote
Hiring Remotely in Greece
Senior level
Remote
Hiring Remotely in Greece
Senior level
Design, build, and scale ETL/ELT and real-time data pipelines for AI workloads (RAG, fine-tuning, batch inference). Transform unstructured data into vectorized formats, manage feature stores and vector databases, enforce data quality/governance, integrate event systems (Kafka), and collaborate with ML and engineering teams.
The summary above was generated by AI

At TechBiz Global, we are providing recruitment service to our TOP clients from our portfolio.

We are currently looking for a dedicated Senior AI Data Engineer to join one of our clients' teams. If you're looking for an exciting opportunity to grow in an innovative environment, this could be the perfect fit for you.

 

Responsibilities:

 

▪ Design, build, and scale robust ETL/ELT pipelines optimized for AI workloads, including RAG, fine-tuning, and batch inference.

▪ Transform unstructured data sources such as PDFs, logs, and transcripts into structured and vectorized formats suitable for LLM consumption.

▪ Maintain and automate the data-to-model lifecycle, ensuring AI knowledge bases remain synchronized with changing business data.

▪ Develop and maintain real-time feature pipelines that support low-latency AI and machine learning applications.

▪ Integrate data platforms with Kafka and other event-driven systems to enable real-time processing and AI-driven responses.

▪ Manage and optimize Feature Stores to ensure consistency between model training and production environments.

▪ Implement automated data quality controls and validation processes to ensure the reliability and accuracy of AI training and inference data.

▪ Establish and maintain data lineage frameworks to provide traceability, auditability, and regulatory compliance across data workflows.

▪ Enforce data security, privacy, and governance standards, including PII protection and compliance with industry regulations.

▪ Manage data movement and synchronization across on-premises systems, cloud platforms, and data warehouses.

▪ Optimize data storage and retrieval strategies for Vector Databases to support high-performance RAG and AI search workloads.

▪ Collaborate with Data Scientists, ML Engineers, Software Engineers, and business stakeholders to deliver scalable AI data solutions.


Job requirements

10+ years of experience in Data Engineering or Backend Engineering with a strong focus on data platforms and pipelines.

▪ 2+ years of hands-on experience supporting AI/ML data pipelines, including data preparation for machine learning and generative AI applications.

▪ Expert-level proficiency in Python and SQL; experience with Java or Scala is an advantage.

▪ Strong experience building and maintaining real-time data streaming solutions using Apache Kafka, Flink, or Spark Streaming.

▪ Hands-on experience with modern data orchestration and transformation tools such as Airflow, dbt, and Prefect.

▪ Experience working with Vector Databases and Feature Stores to support AI and machine learning workloads.

▪ Strong knowledge of cloud-based data services on AWS, Azure, or GCP, including services such as Glue, Kinesis, Data Factory, or Dataflow.

▪ Experience deploying and managing data workloads in Kubernetes (K8s) environments.

▪ Proven experience handling sensitive data within regulated industries such as Fintech, Healthcare, or other compliance-driven environments.

▪ Strong understanding of data quality, governance, security, and privacy best practices.

▪ Bachelor's degree in Computer Science, Software Engineering, Information Systems, or a related technical field. Equivalent practical experience will also be considered.

▪ Excellent problem-solving skills and the ability to collaborate effectively with cross-functional engineering, data, and AI teams.

Similar Jobs

5 Days Ago
Remote or Hybrid
Senior level
Senior level
Other
The Senior Data Engineer designs and maintains data pipelines and AI workflows, optimizing ETL processes for BI and LLM applications.
Top Skills: SparkAWSAzure Machine LearningHadoopKafkaPlsqlPythonRedshiftSnowflakeSQL
2 Days Ago
Remote or Hybrid
Mid level
Mid level
Big Data • Food • Hardware • Machine Learning • Retail • Automation • Manufacturing
You will manage change initiatives for the o9 project, ensuring effective communication, engagement, and adoption across functions, while training teams and tracking success metrics.
Top Skills: O9 Planning TransformationSap S/4 Hana
Senior level
Big Data • Food • Hardware • Machine Learning • Retail • Automation • Manufacturing
Assess information security risks, test security systems, implement cybersecurity technologies, and support day-to-day security operations. Apply policies, standards, and governance; manage third-party providers as needed; deliver security training; and support global information security lead in awareness campaigns and compliance activities while communicating with technical teams and leadership.

What you need to know about the Colorado Tech Scene

With a business-friendly climate and research universities like CU Boulder and Colorado State, Colorado has made a name for itself as a startup ecosystem. The state boasts a skilled workforce and high quality of life thanks to its affordable housing, vibrant cultural scene and unparalleled opportunities for outdoor recreation. Colorado is also home to the National Renewable Energy Laboratory, helping cement its status as a hub for renewable energy innovation.

Key Facts About Colorado Tech

  • Number of Tech Workers: 260,000; 8.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Lockheed Martin, Century Link, Comcast, BAE Systems, Level 3
  • Key Industries: Software, artificial intelligence, aerospace, e-commerce, fintech, healthtech
  • Funding Landscape: $4.9 billion in VC funding in 2024 (Pitchbook)
  • Notable Investors: Access Venture Partners, Ridgeline Ventures, Techstars, Blackhorn Ventures
  • Research Centers and Universities: Colorado School of Mines, University of Colorado Boulder, University of Denver, Colorado State University, Mesa Laboratory, Space Science Institute, National Center for Atmospheric Research, National Renewable Energy Laboratory, Gottlieb Institute

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account