Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/shtoneyan/gopher

Toolset for training quantitative sequence to function models.
https://github.com/shtoneyan/gopher

Last synced: 3 months ago
JSON representation

Toolset for training quantitative sequence to function models.

Awesome Lists containing this project

README

        

**GOPHER**: **G**en**O**mic **P**rofile-model compre**H**ensive **E**valuato**R**

## Installation

```
$ pip install bio-gopher
```
Note that for proper installation, numpy needs to be installed before pyBigWig.

This repository contains scripts for data preprocessing, training deep learning models for DNA sequence to epigenetic function prediction and evaluation of models.

The repo contains a set of tutorial jupyter notebooks that illustrate these steps on a toy dataset. The two notebooks below are required prerequisites for the rest of tutorials:
- preprocessing/preprocessing/quant_dataset_tutorial.ipynb
- tutorials/train_model.ipynb

To replicate the results of the manuscript run the scripts in the analyzis directory. As a prerequisite download and unzip dataset.zip, trained_models.zip from zenodo https://doi.org/10.5281/zenodo.6464031 within the git repo. These contain test sets and pre-trained models. The analysis scripts can be ran in any order as long as paper_run_evaluate.py is ran first, in order to produce model evaluations which is required for further steps.