Sayari Logo

Sayari

Data Engineer (Remote, US)

Posted 3 Days Ago
Remote
Hiring Remotely in United States
100K Annually
Junior
Remote
Hiring Remotely in United States
100K Annually
Junior
The Data Engineer will build ETL pipelines, enhance entity resolution processes, and collaborate with product and development teams to maintain data infrastructure.
The summary above was generated by AI

About Sayari: 

Sayari is the transparency company providing the public and private sectors with immediate visibility into complex commercial relationships by delivering the largest commercially available collection of corporate and trade data as a dynamic model of global ownership and trade activity. Sayari’s solutions harness this model to enable risk resilience, complex investigations, and clear-eyed business decisions. Sayari is headquartered in Washington, D.C., and its solutions are used by thousands of frontline analysts in over 35 countries.


Our company culture is defined by a dedication to our mission of using open data to enhance visibility into global commercial and financial networks, a passion for finding novel approaches to complex problems, and an understanding that diverse perspectives create optimal outcomes. We embrace cross-team collaboration, encourage training and learning opportunities, and reward initiative and innovation. If you like working with supportive, high-performing, and curious teams, Sayari is the place for you.


POSITION DESCRIPTION

Sayari provides instant access to structured business information from hundreds of millions of corporate, legal, and trade records for a variety of use cases. As a member of Sayari's data team you will work with our Product and Software Engineering to build the graph that underlies Sayari’s products. 



Please note that we cannot provide H1B and/or Visa Sponsorship for this role at this time.




Job Responsibilities:

  • Build and maintain ETL pipelines to process and export record data to Sayari Graph application
  • Develop and improve entity resolution processes
  • Implement logic to calculate and export risk information
  • Work with product team and other development teams to collect and refine requirements
  • Run and maintain regular data releases 

Required Skills & Experience:

  • Expertise with Python or a JVM programming language (e.g. Java, Scala)
  • Expertise with SQL (e.g., Postgres) databases
  • 2+ years of experience designing, maintaining, and orchestrating ETL pipelines (e.g., Apache Spark, Apache Airflow) in cloud based environments (e.g., GCP, AWS, or Azure).

Desired Skills & Experience:

  • Experience with entity resolution, graph theory, and/or distributed computing
  • Experience with Kubernetes
  • Experience working as part of an agile development team using Scrum, Kanban, or similar

Benefits: 

·       100% fully paid medical, vision, and dental for employees and their dependents

·       Generous time off; we observe all US federal holidays, close our office for a winter break (12/24-12/31), in addition to granting 18 PTO days and 10 sick days

·       Outstanding compensation package; competitive commissions for revenue roles and quarterly bonuses for non-revenue positions

·       A strong commitment to diversity, equity, and inclusion

·       Eligibility to participate in additional benefits such as 401k match up to 5%, 100% paid life insurance (up to $100,000 coverage),, and parental leave

·       A collaborative and positive culture - your team will be as smart and driven as you

·       Limitless growth and learning opportunities

 

Sayari is an equal opportunity employer and strongly encourages diverse candidates to apply. We believe diversity and inclusion mean our team members should reflect the diversity of the United States. No employee or applicant will face discrimination or harassment based on race, color, ethnicity, religion, age, gender, gender identity or expression, sexual orientation, disability status, veteran status, genetics, or political affiliation. We strongly encourage applicants of all backgrounds to apply.

Top Skills

Apache Airflow
Spark
AWS
Azure
GCP
Java
Kubernetes
Postgres
Python
Scala
SQL

Similar Jobs

6 Days Ago
Easy Apply
Remote
United States
Easy Apply
108K-122K Annually
Junior
108K-122K Annually
Junior
Insurance
As a Data Engineer II, you will design and maintain data solutions, manage data pipelines, collaborate with teams for data architecture, and educate others on data practices.
Top Skills: AirflowAivenArcgisBigQueryCloud FunctionsComposerDataformDebeziumFivetranGcp GcsGcp Vertex AiGoGCPKafkaNuxtPostgresPythonRSQLTailwindTerraformVuejsWebpack
12 Days Ago
Easy Apply
Remote
Hybrid
3 Locations
Easy Apply
130K-170K
Senior level
130K-170K
Senior level
AdTech • Big Data • Information Technology • Marketing Tech • Sales • Software
As a Senior Data Engineer, you will develop and maintain ETL pipelines, design scalable systems, troubleshoot production issues, and mentor team members while ensuring best practices are followed.
Top Skills: AirflowBigQueryDataflowGoogle Cloud PlatformKubernetesPub/SubPythonSQL
2 Days Ago
Remote
Chicago, IL, USA
91K-111K Annually
Mid level
91K-111K Annually
Mid level
Fintech
As a Data Engineer II, you'll develop scalable data solutions, manage ETL pipelines, optimize SQL queries, and ensure data quality and governance.
Top Skills: GCPNode.jsPythonSQLTerraformTypescript

What you need to know about the Colorado Tech Scene

With a business-friendly climate and research universities like CU Boulder and Colorado State, Colorado has made a name for itself as a startup ecosystem. The state boasts a skilled workforce and high quality of life thanks to its affordable housing, vibrant cultural scene and unparalleled opportunities for outdoor recreation. Colorado is also home to the National Renewable Energy Laboratory, helping cement its status as a hub for renewable energy innovation.

Key Facts About Colorado Tech

  • Number of Tech Workers: 260,000; 8.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Lockheed Martin, Century Link, Comcast, BAE Systems, Level 3
  • Key Industries: Software, artificial intelligence, aerospace, e-commerce, fintech, healthtech
  • Funding Landscape: $4.9 billion in VC funding in 2024 (Pitchbook)
  • Notable Investors: Access Venture Partners, Ridgeline Ventures, Techstars, Blackhorn Ventures
  • Research Centers and Universities: Colorado School of Mines, University of Colorado Boulder, University of Denver, Colorado State University, Mesa Laboratory, Space Science Institute, National Center for Atmospheric Research, National Renewable Energy Laboratory, Gottlieb Institute

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account