SNOMED Ontology Parser

A Python tool for extracting medical concepts from text and mapping them to the SNOMED-CT ontology hierarchy.

Overview

This repository provides functionality to:

Extract medical concepts from text documents
Map extracted concepts to SNOMED-CT ontology
Analyze concept distributions across different hierarchical levels
Visualize concept relationships and distributions
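
To make "concept distributions across hierarchical levels" concrete, here is a minimal sketch of one way to compute such a distribution. It is not taken from this repository's code: the function names and the toy hierarchy (with made-up labels instead of real SNOMED-CT IDs) are assumptions for illustration. A concept's level is taken to be its shortest distance from the root, which is one reasonable choice when a concept can have several parents.

```python
from collections import Counter, deque


def concept_depths(children, root):
    """Return the shortest distance from `root` to every reachable concept."""
    depths = {root: 0}
    queue = deque([root])
    while queue:
        node = queue.popleft()
        for child in children.get(node, ()):
            if child not in depths:  # first visit in BFS is the shortest path
                depths[child] = depths[node] + 1
                queue.append(child)
    return depths


def level_distribution(extracted, depths):
    """Count how many extracted concepts fall on each hierarchy level."""
    return Counter(depths[c] for c in extracted if c in depths)


# Toy parent -> children map; labels are hypothetical, not SNOMED-CT codes.
toy_children = {
    "root": ["finding", "body_structure"],
    "finding": ["disorder"],
    "disorder": ["sleep_disorder"],
    "sleep_disorder": ["narcolepsy"],
    "body_structure": ["gene"],
}

depths = concept_depths(toy_children, "root")
dist = level_distribution(["narcolepsy", "gene", "disorder"], depths)
print(sorted(dist.items()))  # [(2, 2), (4, 1)]
```

In a real run the child map would come from the loaded ontology rather than a hand-written dictionary.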

Prerequisites

1. Required Python Packages

Install the following packages with their specific versions:

scispacy (v0.5.1)
medspacy (v0.2.0.0)
Owlready2 (v0.37) - Required for pymedtermino2
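
Because the versions are pinned, it can help to confirm what is actually installed before running the notebook. The sketch below uses only the standard library; the helper name check_versions is an assumption, and the comparison is a strict string match against the versions listed above.

```python
from importlib.metadata import PackageNotFoundError, version

# Versions listed in this README.
REQUIRED = {"scispacy": "0.5.1", "medspacy": "0.2.0.0", "Owlready2": "0.37"}


def check_versions(required, get_version=version):
    """Return {package: installed_version_or_None} for every mismatch."""
    problems = {}
    for name, wanted in required.items():
        try:
            found = get_version(name)
        except PackageNotFoundError:
            found = None  # package is not installed at all
        if found != wanted:
            problems[name] = found
    return problems


if __name__ == "__main__":
    for name, found in check_versions(REQUIRED).items():
        print(f"{name}: expected {REQUIRED[name]}, found {found}")
```

An empty result means all three packages match the pinned versions.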

2. UMLS and SNOMED-CT Access

Apply for UMLS access at the UMLS website
Download the required UMLS and SNOMED-CT files
Follow the setup instructions in the pymedtermino2 documentation
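
As a rough sketch of what the pymedtermino2 setup involves, the code below locates a UMLS archive and imports it into a local SQLite quadstore with Owlready2's import_umls. The archive naming pattern (umls-*.zip), the ./data location, the database file name, and both helper names are assumptions; the pymedtermino2 documentation remains the authoritative reference. The import can take a long time and needs several gigabytes of disk space.

```python
from pathlib import Path


def find_umls_archive(data_dir="./data"):
    """Return the umls-*.zip with the latest name under data_dir, or None."""
    matches = sorted(Path(data_dir).glob("umls-*.zip"))
    return matches[-1] if matches else None


def build_pym_database(archive, db_path="pym.sqlite"):
    """Import SNOMED CT (US edition) and UMLS CUIs into an Owlready2 quadstore."""
    # Imported lazily so this module loads even without Owlready2 installed.
    from owlready2 import default_world
    from owlready2.pymedtermino2.umls import import_umls

    default_world.set_backend(filename=str(db_path))
    import_umls(str(archive), terminologies=["SNOMEDCT_US", "CUI"])
    default_world.save()


if __name__ == "__main__":
    archive = find_umls_archive()
    if archive is None:
        print("No UMLS archive found in ./data")
    else:
        build_pym_database(archive)
```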

Installation

Clone this repository:

git clone https://github.com/yourusername/snomed-ontology-parser.git
cd snomed-ontology-parser

Create and activate the conda environment:

conda env create -f environment.yml
conda activate snomed-parser

Usage

Running the Analysis

Place your UMLS data files in the ./data directory
Run the Jupyter notebook:

jupyter notebook concept_distribution.ipynb

Note: If you encounter a locked pym.sqlite file error, use the provided script:

bash remove_sql_lock.sh

Finding Concept IDs

To look up specific concept IDs, use the SNOMED CT Browser. Note that the browser may use an older version of the SNOMED ontology, so its results can differ from your local data.
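
Concept IDs can also be checked against the local database. The sketch below assumes a pym.sqlite file already built with pymedtermino2 and that the SNOMEDCT_US terminology was imported; load_snomed and describe are hypothetical helpers, not part of this repository. describe only relies on a concept exposing label and parents, and on each parent exposing name.

```python
def load_snomed(db_path="pym.sqlite"):
    """Open an existing pymedtermino2 database and return the SNOMED CT terminology."""
    # Imported lazily so this module loads even without Owlready2 installed.
    from owlready2 import default_world

    default_world.set_backend(filename=db_path)
    pym = default_world.get_ontology("http://PYM/").load()
    return pym["SNOMEDCT_US"]


def describe(terminology, code):
    """Summarise one concept: its code, first label, and parent codes."""
    concept = terminology[code]
    labels = list(concept.label)
    return {
        "code": code,
        "label": labels[0] if labels else None,
        "parents": [parent.name for parent in concept.parents],
    }
```

With a database in place, describe(load_snomed(), some_code) returns a small dictionary that can be compared with what the browser shows for the same ID.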

Example

Input text:

Alterations in the hypocretin receptor 2 and preprohypocretin genes produce narcolepsy in some animals.
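
For orientation, the following sketch shows how a sentence like this could be turned into candidate UMLS concepts with scispacy's entity linker. It is not necessarily how this repository's extractor works: the model name en_core_sci_sm, the score threshold, and the helper names are assumptions. The first call downloads the linker's knowledge base, which is large.

```python
def select_candidates(kb_ents, min_score=0.8, top_k=1):
    """Keep the best-scoring (concept_id, score) pairs at or above a threshold."""
    kept = [(cui, score) for cui, score in kb_ents if score >= min_score]
    kept.sort(key=lambda pair: pair[1], reverse=True)
    return kept[:top_k]


def extract_concepts(text, model="en_core_sci_sm"):
    """Map each detected entity text to its top UMLS candidates."""
    # Imported lazily so this module loads without spaCy or scispacy installed.
    import spacy
    from scispacy.linking import EntityLinker  # noqa: F401 (registers "scispacy_linker")

    nlp = spacy.load(model)
    nlp.add_pipe("scispacy_linker", config={"linker_name": "umls"})
    doc = nlp(text)
    # ent._.kb_ents is a list of (CUI, similarity score) pairs set by the linker.
    return {ent.text: select_candidates(ent._.kb_ents) for ent in doc.ents}


if __name__ == "__main__":
    sentence = (
        "Alterations in the hypocretin receptor 2 and preprohypocretin genes "
        "produce narcolepsy in some animals."
    )
    for mention, candidates in extract_concepts(sentence).items():
        print(mention, candidates)
```

The linker returns UMLS CUIs, so a further step is needed to map them to SNOMED-CT codes in the local ontology.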

Output visualization: [figure omitted]

Project Structure

snomed-ontology-parser/
├── src/
│   ├── main.py              # Main application entry point
│   ├── concept_extractor.py # Extracts medical concepts from text
│   ├── concept_analyzer.py  # Analyzes concept distributions
│   └── data_loader.py       # Handles UMLS data loading
├── data/                    # Directory for UMLS data files
├── concept_distribution.ipynb
├── environment.yml
└── remove_sql_lock.sh
