Yahoo Logo

Yahoo

Sr. Data Engineer

Reposted 17 Days Ago
United States of America
128K-267K Annually
Entry level
United States of America
128K-267K Annually
Entry level
The Data Engineer will develop and improve data infrastructures and pipelines for machine learning and analytics. Responsibilities include designing systems for efficient data processing, collaborating with teams to implement algorithms, and troubleshooting data issues, while managing large volumes of data at petabyte scale.
The summary above was generated by AI

Yahoo Mail is the ultimate consumer inbox with hundreds of millions of users. It’s the best way to access your email and stay organized from a computer, phone or tablet. With its beautiful design and lightning fast speed, Yahoo Mail makes reading, organizing, and sending emails easier than ever.

A Little About Us

Yahoo makes the world’s daily habits inspiring and entertaining. By creating highly personalized experiences for our users, we keep people connected to what matters most to them, across devices and around the world. Yahoo’s vast businesses span across Search, Communications, Media, and many other verticals.

 

Yahoo generates terabytes of data every day and it is critical to collect, manage and process data at petabyte scale to provide timely and accurate insights to executives, sales, product managers and product developers on all aspects of user interaction. 

The Mail Analytics Engineering team at Yahoo is responsible for building mission critical data systems, pipelines, warehouses, analytics systems, and Machine Learning/AI/data mining programs for the Communications business, which includes Yahoo Mail, with 200M monthly active users. We are constantly pushing the envelope of data platforms due to the insane amount of data we need to harness. 

A Lot About You

As part of the Mail Analytics Engineering team, you will be working on data engineering infrastructures, pipelines and next generation Machine Learning- and AI-based data infrastructure, supporting new functionalities on existing platforms, and mining data for analytics insights and product features. 

Our Big Data footprints are among the largest few in the world, at double-digit petabyte scale. Developing this infrastructure presents many technical challenges in the areas of efficient query processing, large-scale stream processing, machine learning and modeling, as well as satisfying complex business rules.

If you are someone who is passionate about harnessing data at insane scale, enjoys working with new technologies, setting up petabyte data infrastructures and implementing new machine learning solutions and metrics systems, we want to hear from you!

Your Day

  • Develop new or improve existing data infrastructures for data processing machine learning, and deep learning using your core expertise

  • Work with other engineers to implement algorithms and systems in an efficient way

  • Take end to end ownership of Machine Learning-based distributed data systems - from data and training pipelines, to real time data serving engines.

  • Develop complex queries, very large volume data pipelines, and analytics applications

  • Develop complex queries and software programs to solve analytics and data mining problems

  • Interact with data analysts, data scientists, product managers, and software engineers to understand business problems, technical requirements to deliver data solutions

  • Prototype new metrics or data systems

  • Lead data investigations to troubleshoot data issues that arise along the data pipelines

  • Maintenance and improvement of released systems

  • Engineering consulting on large and complex warehouse data

You Must Have

  • BS/MS/PhD in Computer Science/Electrical Engineering, or related engineering disciplines, ideally with specialization in Data Engineering or Machine Learning

  • 6+ years of hands-on experience in relevant fields, including data engineering

  • Strong fundamentals: algorithms, distributed computing, data structure, database

  • Fluency with: Python/Java/SQL

  • Self-driven, challenge-loving, detail oriented, teamwork spirit, excellent communication skills, ability to multitask and manage expectations

Preferred

  • Experience in Hadoop technologies (Map/Reduce, Pig, Hive, HBase, Storm, Spark, Kafka, Oozie).

  • Experience with Google Cloud Platform (BiqQuery, Dataproc, Dataflow, etc.) a big plus

  • Experience with machine learning algorithms, NLP, and/or statistical methods a big plus

  • Experience in any of: machine learning, analytics, data mining, or data mart and warehouse

  • Experience with Deep Learning platforms (Tensorflow/Keras/Spark MLlib) and SQL/Unix/Shell

#LI-FM1

The material job duties and responsibilities of this role include those listed above as well as adhering to Yahoo policies; exercising sound judgment; working effectively, safely and inclusively with others; exhibiting trustworthiness and meeting expectations; and safeguarding business operations and brand integrity.

At Yahoo, we offer flexible hybrid work options that our employees love! While most roles don’t require regular office attendance, you may occasionally be asked to attend in-person events or team sessions. You’ll always get notice to make arrangements. Your recruiter will let you know if a specific job requires regular attendance at a Yahoo office or facility. If you have any questions about how this applies to the role, just ask the recruiter!

Yahoo is proud to be an equal opportunity workplace. All qualified applicants will receive consideration for employment without regard to, and will not be discriminated against based on age, race, gender, color, religion, national origin, sexual orientation, gender identity, veteran status, disability or any other protected category. Yahoo will consider for employment qualified applicants with criminal histories in a manner consistent with applicable law. Yahoo is dedicated to providing an accessible environment for all candidates during the application process and for employees during their employment. If you need accessibility assistance and/or a reasonable accommodation due to a disability, please submit a request via the Accommodation Request Form (www.yahooinc.com/careers/contact-us.html) or call +1.866.772.3182. Requests and calls received for non-disability related issues, such as following up on an application, will not receive a response.

We believe that a diverse and inclusive workplace strengthens Yahoo and deepens our relationships. When you support everyone to be their best selves, they spark discovery, innovation and creativity. Among other efforts, our 11 employee resource groups (ERGs) enhance a culture of belonging with programs, events and fellowship that help educate, support and create a workplace where all feel welcome.

The compensation for this position ranges from $128,250.00 - $266,875.00/yr and will vary depending on factors such as your location, skills and experience.The compensation package may also include incentive compensation opportunities in the form of discretionary annual bonus or commissions. Our comprehensive benefits include healthcare, a great 401k, backup childcare, education stipends and much (much) more.

Currently work for Yahoo? Please apply on our internal career site.

Similar Jobs

4 Days Ago
Hybrid
2 Locations
144K-181K Annually
Mid level
144K-181K Annually
Mid level
Fintech • Machine Learning • Payments • Software • Financial Services
As a Senior Data Engineer, you'll design, develop, and support technical solutions while collaborating with Agile teams and utilizing big data technologies.
Top Skills: AWSCassandraEmrGurobiHadoopHiveJavaKafkaLinuxMapreduceMongodbMySQLNoSQLOpen Source RdbmsPythonRedshiftScalaSnowflakeSparkUnix
3 Days Ago
Hybrid
Plano, TX, USA
144K-165K Annually
Senior level
144K-165K Annually
Senior level
Fintech • Machine Learning • Payments • Software • Financial Services
As a Senior Data Engineer, you'll design and build scalable data pipelines using emerging technologies, impacting thousands of auto dealerships while enhancing analytics capabilities.
Top Skills: AWSCi/CdDynamoDBFlinkKafkaOpensearchPythonRedshiftSnowflakeSparkSQLUnix/Linux
12 Hours Ago
Hybrid
Boston, MA, USA
117K-146K
Senior level
117K-146K
Senior level
Digital Media • Gaming • Information Technology • Software • Sports • Esports • Big Data Analytics
As a Senior Data Engineer, you'll design data solutions, ensure data quality, collaborate with teams, and support compliance efforts in data governance.
Top Skills: AWSBitbucketConfluenceDatabricksDatadogJIRAKafkaNoSQLPagerdutyPythonSigmaSnowflakeTerraform

What you need to know about the Colorado Tech Scene

With a business-friendly climate and research universities like CU Boulder and Colorado State, Colorado has made a name for itself as a startup ecosystem. The state boasts a skilled workforce and high quality of life thanks to its affordable housing, vibrant cultural scene and unparalleled opportunities for outdoor recreation. Colorado is also home to the National Renewable Energy Laboratory, helping cement its status as a hub for renewable energy innovation.

Key Facts About Colorado Tech

  • Number of Tech Workers: 260,000; 8.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Lockheed Martin, Century Link, Comcast, BAE Systems, Level 3
  • Key Industries: Software, artificial intelligence, aerospace, e-commerce, fintech, healthtech
  • Funding Landscape: $4.9 billion in VC funding in 2024 (Pitchbook)
  • Notable Investors: Access Venture Partners, Ridgeline Ventures, Techstars, Blackhorn Ventures
  • Research Centers and Universities: Colorado School of Mines, University of Colorado Boulder, University of Denver, Colorado State University, Mesa Laboratory, Space Science Institute, National Center for Atmospheric Research, National Renewable Energy Laboratory, Gottlieb Institute

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account