Binance Logo

Binance

Senior Backend Engineer (Big Data) / Big Data Infrastructure Engineer

Posted 5 Days Ago
In-Office or Remote
Hiring Remotely in Georgia, USA
Senior level
In-Office or Remote
Hiring Remotely in Georgia, USA
Senior level
Design, develop, and maintain high-performance, highly available backend services and APIs for Big Data products using Java/Python. Manage full service lifecycle, capacity planning, profiling/benchmarking, troubleshooting, security, and system architecture. Contribute to open-source big data projects and build scalable tools for global operations.
The summary above was generated by AI
Binance is a leading global blockchain ecosystem behind the world’s largest cryptocurrency exchange by trading volume and registered users. We are trusted by 300+ million people in 100+ countries for our industry-leading security, user fund transparency, trading engine speed, deep liquidity, and an unmatched portfolio of digital-asset products. Binance offerings range from trading and finance to education, research, payments, institutional services, Web3 features, and more. We leverage the power of digital assets and blockchain to build an inclusive financial ecosystem to advance the freedom of money and improve financial access for people around the world.

About the Team

We are building an enterprise-grade big data infrastructure platform, similar to AWS EMR, Databricks Runtime, and Google Dataproc. Our mission is to provide a highly scalable, cloud-native, and reliable computing platform for large-scale data processing and AI workloads.

This role focuses on the framework development, performance optimization, stability engineering, and cloud-native transformation of open-source distributed computing engines such as Hadoop, YARN, HDFS, and Spark. This is NOT a data development, ETL development, or business analytics role.

 

Responsibilities

  • Design, develop, and maintain enterprise-grade big data infrastructure platforms (EMR-like Platform).
  • Develop and enhance distributed computing frameworks, including Hadoop, Spark, YARN, and HDFS.
  • Build service capabilities for big data engines such as Spark, Hive, Presto, and Flink.
  • Drive cloud-native transformation by integrating big data engines with Kubernetes and related ecosystem components.
  • Optimize cluster performance, resource utilization, scheduling efficiency, and workload throughput.
  • Improve platform stability, reliability, and observability for large-scale production environments.
  • Troubleshoot and resolve complex performance bottlenecks and distributed system issues across large-scale clusters.
  • Contribute to the evolution of big data platform architecture and infrastructure best practices.

Requirements

    • Bachelor's degree or above in Computer Science, Software Engineering, or a related field.
    • Strong experience in distributed systems and big data infrastructure development.
    • Hands-on experience with Hadoop, Spark, HDFS, YARN, Hive, Flink, Presto, or related open-source computing frameworks.
    • Experience building or operating enterprise-grade big data platforms such as AWS EMR, Databricks Runtime, Google Dataproc, Huawei MRS, Alibaba Cloud E-MapReduce, Tencent Cloud EMR, or similar platforms.
    • Experience with Kubernetes and cloud-native deployment of distributed computing engines (e.g., Spark Operator).
    • Experience operating or developing large-scale clusters (1,000+ nodes) is highly preferred.
    • Strong understanding of distributed system architecture, performance tuning, resource scheduling, and stability engineering.
    • Proficiency in Java, Scala, Go, or C++, with solid software engineering fundamentals.
    •  

Preferred Qualifications

  • Contributor or Committer experience in Apache Hadoop, Spark, Flink, or related open-source communities.
  • Experience designing cloud-native big data infrastructure or containerized compute platforms.
  • Experience optimizing large-scale production clusters for performance, stability, and cost efficiency.
  • Familiarity with modern observability, monitoring, and distributed system troubleshooting.

Ideal Background

    Candidates may come from teams such as:

    • AWS EMR
    • Databricks Runtime
    • Google Dataproc
    • Alibaba Cloud E-MapReduce
    • Tencent Cloud EMR
    • Huawei MRS
    • Apache Hadoop / Spark Committer
    • Spark Core Development
    • Large-scale Distributed Systems or Cloud Infrastructure Teams

Why Binance
• Shape the future with the world’s leading blockchain ecosystem
• Collaborate with world-class talent in a user-centric global organization with a flat structure
• Tackle unique, fast-paced projects with autonomy in an innovative environment
• Thrive in a results-driven workplace with opportunities for career growth and continuous learning
• Competitive salary and company benefits
• Work-from-home arrangement (the arrangement may vary depending on the work nature of the business team)

Binance is committed to being an equal opportunity employer. We believe that having a diverse workforce is fundamental to our success.
By submitting a job application, you confirm that you have read and agree to our Candidate Privacy Notice.

Similar Jobs

13 Minutes Ago
In-Office or Remote
Georgia, USA
Mid level
Mid level
Professional Services • Real Estate • Consulting
Manage and support delivery of data centre projects through lifecycle: research, planning, scheduling, progress tracking, change control, reporting, stakeholder liaison, and meeting coordination.
Top Skills: Ms ProjectPrimavera
16 Minutes Ago
In-Office or Remote
Georgia, USA
Mid level
Mid level
Blockchain • Fintech • Software • Cryptocurrency • Metaverse
Lead end-to-end QA for Fiat/BigPay: plan tests, design and execute cases across App/Web/Backend/Admin, maintain Python pytest+k6 automation framework, drive feature and edge-case testing, and collaborate cross-functionally to close quality loops.
Top Skills: K6PytestPython
6 Hours Ago
In-Office or Remote
Georgia, USA
Entry level
Entry level
Blockchain • Fintech • Software • Cryptocurrency • Metaverse
Early-career Product Manager in analytics/strategy focusing on behavioral data analysis, A/B testing, funnel diagnostics, and recommendation ranking optimization. Produce strategy docs and PRDs, collaborate with algorithm teams, develop labels/taxonomies, and deliver actionable insights from experiments and data extraction.
Top Skills: A/B TestingData WarehousesExplore-Exploit)PandasPythonRecommender Systems (Ctr/CvrSQL

What you need to know about the Colorado Tech Scene

With a business-friendly climate and research universities like CU Boulder and Colorado State, Colorado has made a name for itself as a startup ecosystem. The state boasts a skilled workforce and high quality of life thanks to its affordable housing, vibrant cultural scene and unparalleled opportunities for outdoor recreation. Colorado is also home to the National Renewable Energy Laboratory, helping cement its status as a hub for renewable energy innovation.

Key Facts About Colorado Tech

  • Number of Tech Workers: 260,000; 8.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Lockheed Martin, Century Link, Comcast, BAE Systems, Level 3
  • Key Industries: Software, artificial intelligence, aerospace, e-commerce, fintech, healthtech
  • Funding Landscape: $4.9 billion in VC funding in 2024 (Pitchbook)
  • Notable Investors: Access Venture Partners, Ridgeline Ventures, Techstars, Blackhorn Ventures
  • Research Centers and Universities: Colorado School of Mines, University of Colorado Boulder, University of Denver, Colorado State University, Mesa Laboratory, Space Science Institute, National Center for Atmospheric Research, National Renewable Energy Laboratory, Gottlieb Institute

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account