Tiger Analytics Logo

Tiger Analytics

Gen AI Data Engineer

Posted 23 Days Ago
Remote
Hiring Remotely in United States
Expert/Leader
Remote
Hiring Remotely in United States
Expert/Leader
The Gen AI Data Engineer will design and build distributed data systems, develop data pipelines, manage data infrastructure, and integrate technologies for real-time and batch processing, contributing to scalable analytics solutions.
The summary above was generated by AI

Tiger Analytics is looking for experienced Machine Learning Engineers with Gen AI experience to join our fast-growing advanced analytics consulting firm. Our employees bring deep expertise in Machine Learning, Data Science, and AI. We are the trusted analytics partner for multiple Fortune 500 companies, enabling them to generate business value from data. Our business value and leadership has been recognized by various market research firms, including Forrester and Gartner.

We are looking for top-notch talent as we continue to build the best global analytics consulting team in the world. You will be responsible for:

Technical Skills Required:

Programming Languages: Proficiency in Python, SQL, and PySpark.

Data Warehousing: Experience with Snowflake, NOSQL and Neo4j.

Data Pipelines: Proficiency with Apache Airflow.

Cloud Platforms: Familiarity with AWS (S3, RDS, Lambda, AWS batch, SageMaker processing Job, CloudFormation, etc.) or GCP (Vertex AI RAG, Data pipeline, Bigquery, GKE)

Operating Systems: Experience with Linux.

Batch/Realtime Pipelines: Experience in building and deploying various pipelines.

Version Control: Experience with GitHub.

Development Tools: Proficiency with VS Code.

Engineering Practices: Skills in testing, deployment automation, DevOps/SysOps.

Communication: Strong presentation and communication skills.

Collaboration: Experience working with onshore/offshore teams.


Requirements

Desired Skills:

·        Big Data Technologies: Experience with Hadoop and Spark.

Data Visualization: Proficiency with Streamlit and dashboards.

·        APIs: Experience in building and maintaining internal APIs.

·        Machine Learning: Basic understanding of ML concepts.

·        Generative AI: Familiarity with generative AI tools and techniques.

Additional Expertise:

·        Knowledge Graphs: Experience with creation and retrieval.

·        Vector Databases: Proficiency in managing vector databases.

·        Data Persistence: Ability to develop and maintain multiple forms of data persistence and retrieval methods (RDMBS, Vector Databases, buckets, graph databases, knowledge graphs, etc.).

·        Cloud Technologies: Experience with AWS, especially SageMaker, Lambda, OpenSearch.

·        Automation Tools: Experience with Airflow DAGs, AutoSys, and CronJobs.

·        Unstructured Data Management: Experience in managing data in unstructured forms (audio, video, image, text, etc.).

·        CI/CD: Expertise in continuous integration and deployment using Jenkins and GitHub Actions.

·        Infrastructure as Code: Advanced skills in Terraform and CloudFormation.

·        Containerization: Knowledge of Docker and Kubernetes.

·        Monitoring and Optimization: Proven ability to monitor system performance, reliability, and security, and optimize them as needed.

·        Security Best Practices: In-depth understanding of security best practices in cloud environments.

·        Scalability: Experience in designing and managing scalable infrastructure.

·        Disaster Recovery: Knowledge of disaster recovery and business continuity planning.

·        Problem-Solving: Excellent analytical and problem-solving abilities.

·        Adaptability: Ability to stay up-to-date with the latest industry trends and adapt to new technologies and methodologies.

·        Team Collaboration: Proven ability to work well in a team environment and contribute to a positive, collaborative culture.

GenAI Engineer Specific Skills:

·        Industry Experience: 8+ years of experience in data engineering, platform engineering, or related fields, with deep expertise in designing and building distributed data systems and large-scale data warehouses.

·        Data Platforms: Proven track record of architecting data platforms capable of processing petabytes of data and supporting real-time and batch ingestion processes.

·        Data Pipelines: Strong experience in building robust data pipelines for document ingestion, indexing, and retrieval to support scalable RAG solutions. Proficiency in information retrieval systems and vector search technologies (e.g., FAISS, Pinecone, Elasticsearch, Milvus).

·        Graph Algorithms: Experience with graphs/graph algorithms, LLMs, optimization algorithms, relational databases, and diverse data formats.

·        Data Infrastructure: Proficient in infrastructure and architecture for optimal extraction, transformation, and loading of data from various data sources.

·        Data Curation: Hands-on experience in curating and collecting data from a variety of traditional and non-traditional sources.

·        Ontologies: Experience in building ontologies in the knowledge retrieval space, schema-level constructs (including higher-level classes, punning, property inheritance), and Open Cypher.

·        Integration: Experience in integrating external databases, APIs, and knowledge graphs into RAG systems to improve contextualization and response generation.

·        Experimentation: Conduct experiments to evaluate the effectiveness of RAG workflows, analyze results, and iterate to achieve optimal performance.


Benefits

This position offers an excellent opportunity for significant career development in a fast-growing and challenging entrepreneurial environment with a high degree of individual responsibility.

Top Skills

Apache Airflow
AWS
CloudFormation
Docker
GCP
Git
Github Actions
Hadoop
Jenkins
Kubernetes
Linux
Neo4J
NoSQL
Pyspark
Python
Snowflake
Spark
SQL
Streamlit
Terraform
Vs Code

Similar Jobs

10 Hours Ago
Remote
United States
50K-130K Annually
Senior level
50K-130K Annually
Senior level
Software
Lead the development of Big Data solutions and machine learning systems, mentor engineers, and maintain high standards of engineering excellence.
Top Skills: AWSAzureCi/CdDeltaDockerGCPHudiIcebergKubernetesPysparkSpark
9 Hours Ago
Remote or Hybrid
United States
Senior level
Senior level
Cloud • Information Technology • Security • Software • Cybersecurity
Lead and manage the US Enterprise Majors sales team, focusing on strategic account management, recruiting, training, and exceeding sales targets.
Top Skills: Application DeliveryB2B Enterprise SoftwareEdge ComputingNetworkingSaaSSecurity
9 Hours Ago
Easy Apply
In-Office or Remote
2 Locations
Easy Apply
100K-180K Annually
Senior level
100K-180K Annually
Senior level
Fintech • Payments • Financial Services
The Technical Accountant will lead on complex accounting issues, ensure compliance with GAAP, and collaborate across teams to integrate accounting considerations into business decisions.
Top Skills: U.S. Gaap

What you need to know about the Colorado Tech Scene

With a business-friendly climate and research universities like CU Boulder and Colorado State, Colorado has made a name for itself as a startup ecosystem. The state boasts a skilled workforce and high quality of life thanks to its affordable housing, vibrant cultural scene and unparalleled opportunities for outdoor recreation. Colorado is also home to the National Renewable Energy Laboratory, helping cement its status as a hub for renewable energy innovation.

Key Facts About Colorado Tech

  • Number of Tech Workers: 260,000; 8.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Lockheed Martin, Century Link, Comcast, BAE Systems, Level 3
  • Key Industries: Software, artificial intelligence, aerospace, e-commerce, fintech, healthtech
  • Funding Landscape: $4.9 billion in VC funding in 2024 (Pitchbook)
  • Notable Investors: Access Venture Partners, Ridgeline Ventures, Techstars, Blackhorn Ventures
  • Research Centers and Universities: Colorado School of Mines, University of Colorado Boulder, University of Denver, Colorado State University, Mesa Laboratory, Space Science Institute, National Center for Atmospheric Research, National Renewable Energy Laboratory, Gottlieb Institute

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account