Atreides Logo

Atreides

Senior QA Automation Engineer (Canada)

Reposted Yesterday
Be an Early Applicant
In-Office or Remote
Hiring Remotely in Victoria, BC
Senior level
In-Office or Remote
Hiring Remotely in Victoria, BC
Senior level
As a Senior QA Automation Engineer, you will develop automated test frameworks, validate data pipelines, and maintain data quality for ETL processes, collaborating closely with data engineering teams.
The summary above was generated by AI

Job Title: Senior QA Automation Data Engineer (Remote CAN)

 

Company Overview: Atreides helps organizations transform large and complex multi-modal datasets into information-rich geo-spatial data subscriptions that can be used across a wide spectrum of use cases. Currently, Atreides focuses on providing high-fidelity data solutions to enable customers to derive insights quickly.  

 

We are a fast-moving, high-performance startup. We value a diverse team and believe inclusion drives better performance. We trust our team with autonomy, believing it leads to better results and job satisfaction. With a mission-driven mindset and entrepreneurial spirit, we are building something new and helping unlock the power of massive-scale data to make the world safer, stronger, and more prosperous. 

 

Team Overview: We are a passionate team of technologists, data scientists, and analysts with backgrounds in operational intelligence, law enforcement, large multinationals, and cybersecurity operations.  We obsess about designing products that will change the way global companies, governments and nonprofits protect themselves from external threats and global adversaries.  

 

Position Overview: We are seeking a QA Automation Data Engineer to ensure the correctness, performance, and reliability of our data pipelines, data lakes, and enrichment systems. In this role, you will design, implement, and maintain automated validation frameworks for our large-scale data workflows. You will work closely with data engineers, analysts, and platform engineers to embed test coverage and data quality controls directly into the CI/CD lifecycle of our ETL and geospatial data pipelines. 

You should be deeply familiar with test automation in data contexts, including schema evolution validation, edge case generation, null/duplicate detection, statistical drift analysis, and pipeline integration testing. This is not a manual QA role — you will write code, define test frameworks, and help enforce reliability through automation. 

 

Team Principles: 

At Atreides, we believe that teams work best when they: 

  • Remain curious and passionate in all aspects of our work 
  • Promote clear, direct, and transparent communication 
  • Embrace the 'measure twice, cut once' philosophy 
  • Value and encourage diverse ideas and technologies 
  • Lead with empathy in all interactions 

 

Responsibilities: 

  • Develop automated test harnesses for validating Spark pipelines, Iceberg table transformations, and Python-based data flows. 
  • Implement validation suites for data schema enforcement, contract testing, and null/duplication/anomaly checks. 
  • Design test cases for validating geospatial data processing pipelines (e.g., geometry validation, bounding box edge cases). 
  • Integrate data pipeline validation with CI/CD tooling. 
  • Monitor and alert on data quality regressions using metric-driven validation (e.g., row count deltas, join key sparsity, referential integrity). 
  • Write and maintain mock data generators and property-based test cases for data edge cases and corner conditions. 
  • Contribute to team standards for testing strategy, coverage thresholds, and release readiness gates. 
  • Collaborate with data engineers on pipeline observability and reproducibility strategies. 
  • Participate in root cause analysis and post-mortems for failed data releases or quality incidents. 
  • Document infrastructure design, data engineering processes, and maintain comprehensive documentation. 

 

Desired Qualifications: 

  • 5+ years of experience in data engineering or data QA roles with automation focus. 
  • Strong proficiency in Python and PySpark, including writing testable, modular data code. 
  • Experience with Apache Iceberg, Delta Lake, or Hudi, including schema evolution and partitioning. 
  • Familiarity with data validation libraries (e.g., Great Expectations, Deequ, Soda SQL) or homegrown equivalents. 
  • Understanding of geospatial formats (e.g., GeoParquetGeoJSON, Shapefiles) and related edge cases. 
  • Experience with test automation frameworks such as pytest, hypothesis, unittest, and integration with CI pipelines. 
  • Familiarity with cloud-native data infrastructure, especially AWS (Glue, S3, Athena, EMR). 
  • Knowledge of data lineage, data contracts, and observability tools is a plus. 
  • Strong communication skills and the ability to work cross-functionally with engineers and analysts. 

 

You’ll Succeed If You 

  • Enjoy catching issues before they hit production and designing coverage to prevent them. 
  • Believe that data quality is a first-class concern, not an afterthought. 
  • Thrive in environments where automated tests are part of the engineering pipeline, not separate from it. 
  • Can bridge the gap between engineering practices and analytics/ML testing needs. 
  • Have experience debugging distributed failures (e.g., skewed partitions, schema mismatches, memory pressure). 

 

Compensation and Benefits: 



  • Competitive salary 


  • Comprehensive health, dental, and vision insurance plans


  • Flexible hybrid work environment 


  • Additional benefits like flexible hours, work travel opportunities, competitive vacationtime and parental leave   

 

While meeting all of these criteria would be ideal, we understand that some candidates may meet most, but not all. If you're passionate, curious and ready to "work smart and get things done," we'd love to hear from you. 

Top Skills

Apache Iceberg
Athena
AWS
Deequ
Delta Lake
Emr
Glue
Great Expectations
Hudi
Pyspark
Pytest
Python
S3
Soda Sql

Similar Jobs

An Hour Ago
Remote or Hybrid
Canada
Senior level
Senior level
Digital Media • Gaming • Information Technology • Software • Sports • Esports • Big Data Analytics
The Senior Software Engineer will develop software solutions, enhance tools and systems, mentor junior engineers, and manage infrastructure deployments.
Top Skills: .Net CoreAWSBambooC#DatadogElk StackFluxGCPGitopsInfrastructure As CodeJenkinsKafkaKubernetesPythonTerraformTypescript
2 Hours Ago
Easy Apply
Remote or Hybrid
2 Locations
Easy Apply
119K-170K Annually
Mid level
119K-170K Annually
Mid level
Artificial Intelligence • Information Technology • Machine Learning • Natural Language Processing • Productivity • Software • Generative AI
As a Sales Engineer I, you will collaborate with Account Executives to demonstrate and architect solutions using Superhuman's AI productivity tools, enhancing customer workflows and value delivery.
Top Skills: Ai TechnologiesCodaGmailSalesforce
13 Hours Ago
Easy Apply
Remote or Hybrid
2 Locations
Easy Apply
166K-220K Annually
Senior level
166K-220K Annually
Senior level
Artificial Intelligence • Information Technology • Machine Learning • Natural Language Processing • Productivity • Software • Generative AI
Lead technical relationships with partners, guiding workshops and developing POCs to enhance platform adoption and joint value creation.
Top Skills: Software EngineeringSolution Architecture

What you need to know about the Colorado Tech Scene

With a business-friendly climate and research universities like CU Boulder and Colorado State, Colorado has made a name for itself as a startup ecosystem. The state boasts a skilled workforce and high quality of life thanks to its affordable housing, vibrant cultural scene and unparalleled opportunities for outdoor recreation. Colorado is also home to the National Renewable Energy Laboratory, helping cement its status as a hub for renewable energy innovation.

Key Facts About Colorado Tech

  • Number of Tech Workers: 260,000; 8.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Lockheed Martin, Century Link, Comcast, BAE Systems, Level 3
  • Key Industries: Software, artificial intelligence, aerospace, e-commerce, fintech, healthtech
  • Funding Landscape: $4.9 billion in VC funding in 2024 (Pitchbook)
  • Notable Investors: Access Venture Partners, Ridgeline Ventures, Techstars, Blackhorn Ventures
  • Research Centers and Universities: Colorado School of Mines, University of Colorado Boulder, University of Denver, Colorado State University, Mesa Laboratory, Space Science Institute, National Center for Atmospheric Research, National Renewable Energy Laboratory, Gottlieb Institute

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account