This role serves as a crucial bridge between our complex business data and advanced AI models. You will lead teams developing statistical, machine learning, and AI solutions for Gas Power stakeholders. Your core mission involves deep exploratory analysis, strategic curation, and rigorous management of business-specific data to generate high-quality, reliable input for Large Language Model (LLM) applications.
You will contribute to deploying modern machine learning, operational research, and semantic analysis methods to derive insights and achieve Gas Power's strategic objectives. Importantly, this position focuses not on building models from scratch, but on ensuring that models developed by our central AI Foundry effectively understand and address specific business challenges. This role is ideal for a data expert passionate about uncovering hidden context and enabling transformative business solutions.Job Description
As a Staff Data Scientist, you will be part of a data science or cross-disciplinary team developing innovative solutions, typically involving large, complex data sets to achieve business outcomes. These teams will include statisticians, computer scientists, software developers, engineers, product managers, and functional stakeholders. In addition to hands on development, the Staff Data Scientist will lead extended team members from the Emerging Technology Guild and functional DT teams to develop and operationalize data science solutions are ready for scale-up.
- Perform comprehensive exploratory data analysis (EDA) on diverse and complex business datasets, using statistical analysis, Natural Language Processing (NLP), and unsupervised clustering techniques to uncover patterns, identify quality issues, and extract meaningful insights.
- Collaborate closely with business Subject Matter Experts (SMEs) to translate their deep domain knowledge into structured, AI-ready datasets for use in prompt engineering, Retrieval-Augmented Generation (RAG), and model fine-tuning.
- Develop and prepare "golden datasets" that serve as pristine examples of our business processes, significantly reducing the iteration time for prompt engineering and AI development teams.
- Design, create, and maintain a suite of data benchmarks that represent our core business use cases. These benchmarks will be the definitive standard for evaluating the real-world performance of AI methods within our BU.
- Establish and enforce rigorous data quality standards and validation protocols, ensuring the accuracy, relevance, and integrity of all data used in our GenAI applications.
- Proactively identify and document potential data biases, working with stakeholders to develop mitigation strategies that promote responsible and fair AI outcomes.
- Serve as the primary steward for the BU’s curated AI datasets, defining and implementing a clear data management strategy that includes versioning, access controls, and a lifecycle management plan.
- Create and maintain comprehensive documentation for all curated datasets (e.g., "datasheets for datasets"), detailing their origin, schema, limitations, and intended use to ensure transparency and reusability.
- Continuously survey the BU's data landscape to identify new high-value data sources and champion their integration into our Generative AI ecosystem.
- Act as the primary data liaison between the Business Unit, prompt engineers, and the central AI Foundry.
- Rigorously test and validate the effectiveness of generalized tools and methods provided by the AI Foundry against your BU-specific data benchmarks.
- Provide precise, data-driven feedback and recommendations to the Foundry, collaborating to refine and enhance central AI capabilities to ensure they meet our specific business needs.
- Communicate methods, findings, and hypotheses with stakeholders
Qualifications
Required Qualifications:
- Bachelor’s or Master’s degree in a quantitative field such as Data Science, Computer Science, Statistics, Economics, or a related discipline.
- 3-5+ years of professional experience as a Data Scientist, Data Analyst, or in a similar role with a heavy emphasis on data exploration, manipulation, and preparation.
- Strong proficiency in Python and core data science libraries (e.g., pandas, NumPy, scikit-learn, spaCy, NLTK).
- Demonstrated experience with a wide range of exploratory data analysis and unsupervised machine learning techniques (e.g., clustering, topic modeling, dimensionality reduction).
- Proven ability to work with messy, unstructured, and semi-structured data, especially text.
- Exceptional communication and interpersonal skills, with a talent for translating complex technical concepts to non-technical audiences and building strong relationships with business stakeholders.
Preferred Qualifications:
- Hands-on experience preparing data specifically for Generative AI systems (e.g., creating datasets for RAG, few-shot prompting, or supervised fine-tuning).
- Familiarity with the architecture of modern LLMs and the role of vector databases (e.g., Pinecone, Milvus, Weaviate).
- Experience in establishing data quality frameworks, data governance policies, or data management best practices.
- Prior experience working in a federated analytics or data science model where collaboration between central and business-embedded teams was required.
- Domain expertise relevant to our Business Unit
GE Vernova offers a great work environment, professional development, challenging careers, and competitive compensation. GE Vernova is an Equal Opportunity Employer. Employment decisions are made without regard to race, color, religion, national or ethnic origin, sex, sexual orientation, gender identity or expression, age, disability, protected veteran status or other characteristics protected by law.
GE Vernova will only employ those who are legally authorized to work in the United States for this opening. Any offer of employment is conditioned upon the successful completion of a drug screen (as applicable).
Relocation Assistance Provided: No
For candidates applying to a U.S. based position, the pay range for this position is between $103,900.00 and $173,100.00. The Company pays a geographic differential of 110%, 120% or 130% of salary in certain areas. The specific pay offered may be influenced by a variety of factors, including the candidate’s experience, education, and skill set.
Bonus eligibility: ineligible.
This posting is expected to remain open for at least seven days after it was posted on November 20, 2025.
Available benefits include medical, dental, vision, and prescription drug coverage; access to Health Coach from GE Vernova, a 24/7 nurse-based resource; and access to the Employee Assistance Program, providing 24/7 confidential assessment, counseling and referral services. Retirement benefits include the GE Vernova Retirement Savings Plan, a tax-advantaged 401(k) savings opportunity with company matching contributions and company retirement contributions, as well as access to Fidelity resources and financial planning consultants. Other benefits include tuition assistance, adoption assistance, paid parental leave, disability benefits, life insurance, 12 paid holidays, and permissive time off.
GE Vernova Inc. or its affiliates (collectively or individually, “GE Vernova”) sponsor certain employee benefit plans or programs GE Vernova reserves the right to terminate, amend, suspend, replace, or modify its benefit plans and programs at any time and for any reason, in its sole discretion. No individual has a vested right to any benefit under a GE Vernova welfare benefit plan or program. This document does not create a contract of employment with any individual.
Top Skills
Similar Jobs
What you need to know about the Colorado Tech Scene
Key Facts About Colorado Tech
- Number of Tech Workers: 260,000; 8.5% of overall workforce (2024 CompTIA survey)
- Major Tech Employers: Lockheed Martin, Century Link, Comcast, BAE Systems, Level 3
- Key Industries: Software, artificial intelligence, aerospace, e-commerce, fintech, healthtech
- Funding Landscape: $4.9 billion in VC funding in 2024 (Pitchbook)
- Notable Investors: Access Venture Partners, Ridgeline Ventures, Techstars, Blackhorn Ventures
- Research Centers and Universities: Colorado School of Mines, University of Colorado Boulder, University of Denver, Colorado State University, Mesa Laboratory, Space Science Institute, National Center for Atmospheric Research, National Renewable Energy Laboratory, Gottlieb Institute



