Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/sempwn/medical-text-nlp

Tutorial for applying machine learning to text data within healthcare
https://github.com/sempwn/medical-text-nlp

Last synced: 3 months ago
JSON representation

Tutorial for applying machine learning to text data within healthcare

Awesome Lists containing this project

README

        

# medical-text-nlp

[![Binder](https://mybinder.org/badge_logo.svg)](https://mybinder.org/v2/gh/sempwn/medical-text-nlp/HEAD?labpath=slides%2Fnlp_overview_slides.ipynb)
[![Open All Collab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/sempwn/medical-text-nlp/blob/master/slides/nlp_overview_slides.ipynb)

Tutorial for applying machine learning to text data within healthcare

## Getting started

These instructions will get you a copy of the notebook up and running on your local machine.

### Prerequisites

The easiest way to get up and running is to install [Anaconda](https://www.anaconda.com/download/_), which provides `python`, numerous scientific libraries as well as `jupyter`. Once [Anaconda](https://www.anaconda.com/download/) is installed, navigate to directory for this repository and run `jupyter` via the command line (using the example directory `/Projects/medical-text-nlp/`):

```bash
cd /Projects/medical-text-nlp/
jupyter notebook
```
You can also launch `jupyter` through the [Anaconda Navigator](https://www.anaconda.com/distribution/#feature-desktop-gui).

### Installing

The repository requires the following libraries to be installed via the command-line or within a `conda` terminal

```bash
conda install scikit-learn
conda install pandas
conda install -c anaconda nltk
conda install -c conda-forge wordcloud
```

### Opening notebook

Once the jupyter session is running navigate to the notebook [tutorial.ipynb](nbs/tutorial.ipynb)

### (Optional) setting up using docker

For enhanced replicability a [Docker environment](https://docs.docker.com/get-started/) is also provided with the repo, along with a [requirements.txt](requirements.txt) file specifying the Python libraries required. In order to set up the project within this framework run through the following steps

Install [Docker Desktop](https://www.docker.com/products/docker-desktop)

Build the container by running the following command
```docker build --pull --rm -f "Dockerfile.dockerfile" -t medicaltextnlp:latest "."```

Run docker
```docker run --rm -d -p 8888:8888/tcp medicaltextnlp:latest```

Navigate to the following in a web browser
```http://localhost:8888/```

You will need an access token which can be found in the terminal logs. For more information read the [Jupyter Docker Stacks](https://jupyter-docker-stacks.readthedocs.io/en/latest/index.html) docs.

## License

This project is licensed under the MIT License - see the [LICENSE.md](LICENSE.md) file for details