Do you want to work on cutting-edge projects with the world’s best IT engineers? Do you wish you could control which projects to work on and choose your own pay rate? Are you interested in the future of work and how the cloud will form teams? If so, the Gigster Talent Network is for you.
Our clients rely on our Network for two main areas: Software Development and Cloud Services. In some cases, they need help building great new products; in others, they want our expertise in migrating, maintaining, and optimizing their cloud solutions.
At Gigster, whether working with entrepreneurs to realize ‘the next great vision’ or with Fortune 500 companies to deliver a big product launch, we build really cool enterprise software on cutting-edge technology.
The Role:
We are seeking an experienced Data Engineer with deep expertise in data transformation at scale, particularly in integrating and processing data from third-party public APIs. This role is critical to enhancing and maintaining data pipelines that feed into Natural Language Processing (NLP) models.
What you’ll do:
Design, build, and optimize scalable ETL/ELT data pipelines using Apache Spark, Apache Kafka, and orchestration tools such as Prefect or Airflow
Integrate external data sources and public APIs with internal data systems
Work with large-scale datasets to support NLP model training and inference
Analyze existing pipelines and recommend enhancements for performance, reliability, and scalability
Collaborate with cross-functional teams, including data scientists and ML engineers
Own the end-to-end engineering process, from planning and technical design to implementation
Regularly report progress and outcomes to client stakeholders
Qualifications:
Proficiency in Python and experience with data transformation and data engineering best practices
Strong experience with Apache Spark, Apache Kafka, and Google Cloud Platform (GCP)
Hands-on experience with workflow orchestration tools (e.g., Prefect, Airflow)
Demonstrated experience working with large datasets and real-time data processing
Experience building and maintaining ETL/ELT pipelines for analytical or machine learning use cases
Self-motivated, with excellent communication and project ownership skills
Preferred Qualifications:
Familiarity with financial services data or regulated data environments
Experience with Snowflake or Google BigQuery
Experience with PostgreSQL and GCS (Google Cloud Storage)
Exposure to NLP workflows and data requirements for machine learning models
Logistics:
- This is a part-time, short-term contract of 4 to 6 weeks
- Preferred location: Remote US