Cogstate

Data Engineer

Posted 4 Days Ago
Remote
Hiring Remotely in United States
120K-135K Annually
Mid level
The Data Engineer will build and maintain data infrastructure, design data pipelines, and develop reusable frameworks, leveraging technologies like Azure Databricks and PySpark for data processing and transformation.
The summary above was generated by AI

At Cogstate, we’re advancing the science of brain health - making it faster, easier, and more accurate to assess cognition across clinical trials, healthcare settings, and everyday life.

Our digital cognitive assessments are trusted by researchers, clinicians, and pharmaceutical partners around the world, helping to drive breakthroughs in neuroscience and improve outcomes for people living with neurological conditions. Founded on decades of cognitive science and backed by rigorous validation, Cogstate’s assessments are used in more than 150 countries and over 2,000 clinical trials.

Our global team of experts - spanning psychology, data science, operations, and technology - works together to solve complex challenges in brain health assessment, always with a patient-first mindset. Whether we’re supporting a multinational Alzheimer’s trial or developing tools to bring cognitive testing into routine care, our work is meaningful, collaborative, and constantly evolving. 

At Cogstate, we’re not just imagining the future of brain health - we’re building it.

That’s why we’re seeking a Data Engineer responsible for building and maintaining Cogstate’s data infrastructure using best-practice engineering approaches. The position plays a central role in establishing and maintaining data pipelines and reporting tables using Azure Databricks, working closely with other members of the data and scientific services team and serving as a point of contact for the data platform and associated data reporting.

Core Responsibilities

  • Understand Cogstate data sources and develop data pipelines using Databricks to bring all data into the data lake. 
  • Design, develop, implement, and tune large-scale distributed systems and pipelines that process large volumes of data, focusing on scalability, low latency, and fault tolerance in every system.
  • Develop scalable and reusable frameworks for ingesting data into Azure Databricks, incorporating standards and best practices into engineering solutions.
  • Databricks engineering - query tuning, performance tuning, troubleshooting, and debugging pipelines. 
  • Apply a deep understanding of ETL/ELT design methodologies, architecture, strategy, and tactics to complex ETL solutions, including CI/CD.
  • Develop high-performance scripts in PySpark to meet enterprise data, BI, data visualization, and analytics needs.
  • Data processing/transformation using various technologies such as Apache Spark, SQL, Python/Scala and Azure cloud services. 
  • Manage code versions in source control and coordinate changes across teams using GitHub.
  • Participate in architecture design and discussions, provide logical and physical data design, and database modelling 
  • Be part of the Agile team to ensure availability of data to internal and external users.
  • Organize and manage data shares.
  • Solve complex data issues around data integration, data quality, and other data processing incidents 
  • Work with business system owners to resolve source data issues and refine transformation rules 

Qualifications

  • BS/BA in Computer Science, Data Science, or a related field or relevant experience
  • 2+ years in implementing data engineering solutions in PySpark in Databricks
  • Knowledge of relational databases and Apache Spark.
  • Strong knowledge of Databricks configuration, troubleshooting and performance tuning.
  • Testing, automation, and orchestration, including GitHub and Azure Functions.
  • Experience with development tools for CI/CD.
  • Deep expertise in programming languages for data processes (PySpark, Python, Scala).
  • Experience writing complex SQL transformations against relational databases such as SQL Server.

What’s In It For You

  • Remote Work Practices: Cogstate is a virtual-first company. Cogstate employees can work from anywhere Cogstate is registered to do business within the United States, Australia, or the United Kingdom!
  • Generous Paid Time-off: Cogstate employees receive 20 days of vacation leave, 10 days of personal leave and 10 paid public holidays.
  • 401(k) Matching: As you invest in yourself and your future, Cogstate invests in you too: we match up to 3% of your yearly salary in Cogstate’s 401(k) program.
  • Competitive Salary: We offer competitive base salaries plus additional earning opportunities based on the position.
  • Health, Dental & Vision Coverage: We've invested in comprehensive health & dental insurance options with competitive company contributions to help when you need it most. We also offer free vision insurance for all full-time employees.
  • Short-Term & Long-Term Disability Life Insurance: 100% employer-sponsored
  • Pre-Tax Benefits: Healthcare and Dependent Care Flexible Spending Accounts
  • Learning & Development Opportunities: Cogstate offers a robust learning program from mentorships to assistance with programs to improve knowledge or obtain certifications in applicable areas of interest.

Wage Range
$120,000 - $135,000 USD

Our Culture
We bring our whole selves to work every day. We’re courageous and we deliver together. We’re passionate individuals who enjoy working together. We’re brave enough and care enough to have the right conversations to get the best outcome and are famous for our can-do attitude. We see challenges as opportunities and move with pace to achieve our goals.

If you’re ready to help us in our journey to optimize the measurement of brain health around the world, please apply now!

Applicants with disabilities may be entitled to reasonable accommodation under the terms of the Americans with Disabilities Act and certain state or local laws. A reasonable accommodation is a change in the way things are normally done which will ensure an equal employment opportunity without imposing undue hardship on the company. If you need assistance in applying please email [email protected].

Privacy Notice for Job Applicants

Cogstate is committed to protecting your personal data. We collect and process your information for recruitment purposes in compliance with applicable laws, including the Australian Privacy Principles (APPs), the UK General Data Protection Regulation (UK GDPR), California Consumer Privacy Act (CCPA), Virginia Consumer Data Protection Act (VCDPA), Colorado Privacy Act (CPA), and similar laws in other jurisdictions.

For more information on how we collect, use, and protect your data, and your rights under these laws, see Cogstate's privacy policy.


Top Skills

Spark
Azure Cloud Services
Azure Databricks
Git
Pyspark
Python
Scala
SQL
