Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/jhrcook/protein-language-models

Experimenting with protein language model predictions
https://github.com/jhrcook/protein-language-models

data-analysis protein-language-model variant-effect-prediction

Last synced: 7 days ago
JSON representation

Experimenting with protein language model predictions

Awesome Lists containing this project

README

        

# Protein language models

## Setup

```bash
pyenv local 3.11
python -m venv .env
source .env/bin/activate
pip install --upgrade pip
pip install -r requirements.txt
```

```bash
pre-commit install
```

## Data preparation

Run the following script to download and prepare the raw data:

```bash
./prepare_data.py
```

Data sources:

downloaded AlphaMissesnse predictions:
downloaded the file: "AlphaMissense_aa_substitutions.tsv.gz"

ESM1b paper:
Downloaded ESM1b:
downloaded the file: "ALL_hum_isoforms_ESM1b_LLR.zip"

Copied them to "raw-data/"