The Semantic Data Modeler will develop semantic models, integrate NLP workflows, ensure data quality for knowledge graphs, and collaborate across teams.
At IMO Health, Semantic Data Modelers are key members of our ontology-driven graph engineering team, helping to build and maintain a virtualized, intelligent, and scalable medical terminology platform. Your work will empower over 740,000 clinicians by enhancing how healthcare data is structured, delivered, and understood.
We are seeking an experienced Semantic Data Modeler to join our team, focusing on the development and application of Knowledge Graphs integrated with Data Science and Natural Language Processing (NLP) workflows. In this critical role, you will contribute to the design and implementation of semantic models, bridging the gap between raw data sources and actionable clinical insights. You will utilize your skills in NLP and data analysis to enrich our knowledge graph, ensuring high data quality and accessibility for semantic enrichment and clinical interoperability initiatives. This position requires strong technical proficiency and exceptional collaboration skills, working closely with staff semantic engineers, clinicians, and content teams.
WHAT YOU’LL DO:
- Semantic Model Design: Contribute to the design, development, and iterative refinement of complex semantic data models, with a focus on ontologies, knowledge graphs, and property graphs.
- Requirement Translation: Assist senior team members in translating intricate, cross-functional business needs into formal, scalable knowledge graph structures, ensuring tight alignment with the enterprise data strategy.
- Governance and Standards: Document semantic assets, including detailed entity definitions, relationship types, axioms, constraints, and data lineage, ensuring adherence to established best practices and fostering consistency across the organization.
- Graph Data Analysis & Feature Engineering: Utilize data science methodologies and Python scripting to conduct exploratory data analysis (EDA) on graph structure, identify data patterns, perform feature engineering, and support advanced analytics based on the knowledge graph.
- Data-Driven NLP Integration: Apply Natural Language Processing (NLP) techniques and tooling to unstructured clinical text and data streams to identify, extract, and map new entities, attributes, and relationships directly into the semantic models, effectively structuring raw data for machine learning consumption.
- Robust Data Quality: Implement and maintain data quality frameworks, validation rules (e.g., using SHACL), and transformation logic within the semantic layer to ensure the accuracy, reliability, and consistency of the knowledge graph.
- Cross-Functional Partnership: Partner closely with Staff semantic engineers, clinicians, content teams, and business leaders to understand domain knowledge and requirements, and ensure semantic solutions effectively meet organizational objectives.
- Research & Evaluation: Assist in researching, evaluating, and utilizing new technologies, methodologies, and best practices in semantic modeling, knowledge graph technologies, and NLP to drive continuous process improvement.
- Knowledge Sharing: Proactively share technical expertise and knowledge with peers and cross-functional teams.
WHAT YOU’LL NEED:
- BA/BS in a STEM field (e.g., Computer Science, Data Science, Bioinformatics) with 3-5 years of hands-on work experience in data modeling or data engineering.
- Proven hands-on experience in semantic modeling concepts (ontologies, knowledge graphs, property graphs).
- Expertise in Python for data manipulation, analysis, and pipeline development, including libraries like Pandas/NumPy.
- Strong understanding of statistical and machine learning concepts (e.g., classification, clustering, regression) and their application to graph-based data.
- Demonstrated experience with NLP technologies and libraries (e.g., NLTK, spaCy, Gensim, Hugging Face) for text extraction, named entity recognition, or relationship extraction.
- Strong working knowledge of graph database platforms (e.g., Amazon Neptune, Neo4j, etc.) and graph query languages (e.g., SPARQL or Gremlin).
- Familiarity with semantic web standards like OWL, RDFS, and SHACL.
- Experience with relational databases (SQL) and general data warehousing concepts.
- Ability to communicate complex technical concepts effectively to both technical and non-technical stakeholders.
- Experience in an Agile/Scrum environment, iteratively developing and deploying data solutions.
NICE TO HAVE:
- Practical experience with AWS data services (e.g., Glue, Sagemaker) and ETL/ELT methodologies.
- Understanding healthcare ontologies and standards like SNOMED-CT, LOINC, RxNorm, and ICD-10.
Top Skills
Amazon Neptune
Gensim
Hugging Face
Neo4J
Nltk
Numpy
Owl
Pandas
Python
Rdfs
Shacl
Spacy
SQL
Similar Jobs
Big Data • Fintech • Information Technology • Business Intelligence • Financial Services • Cybersecurity • Big Data Analytics
Seeking a seasoned sales professional to drive new business sales in fraud solutions for retail, eCommerce, and hospitality sectors. Requires 10+ years of experience, proven success in closing high-value deals, and proficiency in Salesforce.
Top Skills:
Ai ToolsSalesforce
Fintech • Financial Services
The Senior Sales Enablement Manager will define sales processes, develop training programs, coach the sales team, manage sales content, and drive technology mastery for improved performance.
Top Skills:
PowerPointSalesforce
Fintech • Financial Services
The Analyst, Sales Operations will drive sales success through analytics, reporting, and collaboration across departments, focusing on capacity planning, sales pipeline optimization, and compensation administration.
Top Skills:
ExcelSalesforceSQLTableau
What you need to know about the Colorado Tech Scene
With a business-friendly climate and research universities like CU Boulder and Colorado State, Colorado has made a name for itself as a startup ecosystem. The state boasts a skilled workforce and high quality of life thanks to its affordable housing, vibrant cultural scene and unparalleled opportunities for outdoor recreation. Colorado is also home to the National Renewable Energy Laboratory, helping cement its status as a hub for renewable energy innovation.
Key Facts About Colorado Tech
- Number of Tech Workers: 260,000; 8.5% of overall workforce (2024 CompTIA survey)
- Major Tech Employers: Lockheed Martin, Century Link, Comcast, BAE Systems, Level 3
- Key Industries: Software, artificial intelligence, aerospace, e-commerce, fintech, healthtech
- Funding Landscape: $4.9 billion in VC funding in 2024 (Pitchbook)
- Notable Investors: Access Venture Partners, Ridgeline Ventures, Techstars, Blackhorn Ventures
- Research Centers and Universities: Colorado School of Mines, University of Colorado Boulder, University of Denver, Colorado State University, Mesa Laboratory, Space Science Institute, National Center for Atmospheric Research, National Renewable Energy Laboratory, Gottlieb Institute