Valo Health

Staff Data Scientist in Epidemiology and Patient Data Products

Reposted 22 Days Ago
Remote or Hybrid
Hiring Remotely in USA
153K-200K Annually
Senior level
The Staff Data Scientist will lead studies using real-world healthcare data to drive drug discovery, collaborating with cross-functional teams and generating patient-centered insights.
The summary above was generated by AI
About Us

Valo Health is a human-centric, AI-enabled biotechnology company working to make new drugs for patients faster. The company’s Opal Computational Platform transforms drug discovery and development through a unique combination of real-world data, AI, human translational models and predictive chemistry. 

Our talented team of biologists, chemists and engineers, armed with advanced AI/ML tools, works together to break down traditional R&D silos and accelerate the speed and scale of drug discovery and development.

Valo is committed to hiring diverse talent, prioritizing growth and development, fostering an inclusive environment, and creating opportunities for people with different experiences, backgrounds, and voices to work together. We embrace new ways of learning, solve complex problems and welcome diverse perspectives that can help us advance patient-centric innovation.

Valo is headquartered in Lexington, MA, with additional offices in New York, NY and Tel Aviv, Israel. To learn more, visit www.valohealth.com.

About the Role

As a Staff Data Scientist in Epidemiology and Patient Data Products, you will be a core member of a team of data scientists advancing the discovery and development of new medicines. In this role, under the guidance of epidemiology program leads, you will answer research questions using large real-world healthcare databases to inform the identification of biological molecules for effective drug development. To do so, you will work in partnership with colleagues in machine learning, statistical genetics, and computational biology to develop solutions to challenging computational problems. Successful candidates will work with a diverse set of scientists and domain experts and engage with external partners in ways that cut across traditional industry boundaries in an innovative startup environment.

What You’ll Do

  • As a senior member of our cardiometabolic team, you will lead real-world data studies (e.g., using electronic medical records) from end to end to generate causal evidence for projects in drug discovery and development.
  • Translate research questions into observational study designs to generate patient-centric insights from statistical models. Examples include the following:
    • Curating clinical and non-clinical variables for machine learning models
    • Executing trajectory modeling techniques using real-world data
    • Translating machine learning results into patient profiles
    • Executing post-hoc longitudinal analyses among patient profiles of interest
  • Be comfortable with scientific uncertainty and embrace curiosity and creative solutions. Many of the challenges we’re trying to address do not have known solutions or clear processes to arrive at answers.
  • Work with a diverse array of data spanning electronic medical records, sequencing, multi-omics data, and other data modalities using R and Python in cloud environments.
  • Use your technical knowledge and intuition to articulate and break down large problems into solvable pieces. There are a lot of problems to solve; you’ll need to distinguish those that are critical-path today from those that can wait.
  • Collaborate with drug discovery and clinical development teams to help ensure the relevance and impact of the insights generated by you and your teammates.
  • Be a dynamic and active team member, championing and adopting shared coding standards, participating in code review, and providing regular updates on your work and input into the work of your colleagues.

What You Bring

  • MPH or MS with 5+ years, or PhD in epidemiology or biostatistics with 3+ years, of work-related experience applying epidemiological, statistical, and/or machine learning methods to real-world datasets.
  • Must have 3+ years of experience developing and executing robust analytical strategies, including cohort and case-control study design, using health care databases including electronic health records, administrative claims databases, and/or patient registries.
  • Experience leading epidemiologic projects from end to end: translating research questions into observational study designs, contrasting strengths and weaknesses of different study designs and statistical approaches, and generating patient-centric insights from statistical models.
  • Extensive experience with causal approaches applied to observational studies, including propensity score methods, bias adjustment, and covariate selection and adjustment.
  • Advanced knowledge of biostatistical approaches, including inferential and predictive modeling, and comfort implementing unsupervised machine learning algorithms on real-world health care databases.
  • Must have experience conducting data manipulation and statistical analysis in Python and/or R programming languages.
  • Comfortable working in ambiguous problem spaces; experience working in a start-up or agile work environment as part of cross-functional project teams.
  • Ability to lead and facilitate meetings and work collaboratively on multi-disciplinary project teams.
  • Exceptional time management, with the ability to prioritize multiple tasks and deliver products on time, every time.
  • Enthusiastic about documentation, ensuring that all analyses are clear and reproducible with thorough documentation of key assumptions and decision points.

You May Also Bring

  • Research experience in obesity, cardiometabolic, and/or neurodegenerative therapeutic areas
  • Experience developing and maintaining machine learning pipelines and translating machine learning output into meaningful insights for diverse audiences
  • Familiarity with or exposure to traditional drug discovery and development processes and approaches
  • Hands-on experience curating structured health data and working with health data from outside the U.S.

Remote Salary Range
$153,000-$200,000 USD
CA Salary Range
$180,540-$236,000 USD

Compensation for the role will depend on a number of factors, including a candidate’s qualifications, skills, competencies, and experience. Valo Health currently offers healthcare coverage, an annual incentive program, retirement benefits and a broad range of other benefits. Compensation and benefits information is based on Valo Health's good faith estimate as of the date of publication and may be modified in the future.

Please note: At this time, we are only able to consider candidates who currently have permanent US work authorization without the need for immediate or future sponsorship.

Top Skills

AI
Computational Biology
Machine Learning
Python
R
Statistical Genetics
