https://github.com/applicativesystem/miniprot-protein-annotator

miniprot protein alignment analyzer

bioinformatics genome-alignment genome-analysis genome-annotation genome-bioinformatics genome-informatics protein-alignment

# miniprot-protein-annotator

- A protein-coding region annotator that takes a miniprot alignment file in PAF or GFF format and writes a FASTA file containing the sequences of the aligned regions, extracted from the corresponding input FASTA file.
- Implemented for fast parsing, so you can process as many aligned regions as you want.
- The extracted sequences can also be used to build a protein tokenizer for machine learning.

```
# Align the protein sequences to the genome with miniprot, for example:
miniprot --gff genome.fasta protein.fasta > sample.gff
```
- Then run the annotator to extract all complete coding regions, passing the GFF alignment file, the input FASTA file, and the output FASTA path:
```
generatingAlignments("/home/gaurav/Desktop/final_code_push/multi.gff",
"/home/gaurav/Desktop/final_code_push/multi.fasta",
"/home/gaurav/Desktop/final_code_push/multiout.fasta")
```
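
The implementation behind `generatingAlignments` is not shown in this README. As a rough illustration only, the extraction step could be sketched in Python as below. The function names `read_fasta` and `extract_regions` are hypothetical, and the sketch assumes a standard nine-column GFF3 file with 1-based inclusive coordinates, `CDS` features as the regions of interest, and sequence IDs that match the first word of each FASTA header.

```python
from pathlib import Path

# Translation table used to reverse-complement minus-strand regions.
COMPLEMENT = str.maketrans("ACGTNacgtn", "TGCANtgcan")


def read_fasta(path):
    """Return {sequence_id: sequence} for a FASTA file (hypothetical helper)."""
    records, name, chunks = {}, None, []
    for line in Path(path).read_text().splitlines():
        if line.startswith(">"):
            if name is not None:
                records[name] = "".join(chunks)
            # Use the first word of the header as the sequence ID.
            name, chunks = line[1:].split()[0], []
        elif line.strip():
            chunks.append(line.strip())
    if name is not None:
        records[name] = "".join(chunks)
    return records


def extract_regions(gff_path, fasta_path, out_path, feature="CDS"):
    """Write one FASTA record per matching GFF feature; return the record count."""
    genome = read_fasta(fasta_path)
    written = 0
    with open(gff_path) as gff, open(out_path, "w") as out:
        for line in gff:
            # Skip comment/pragma lines (including miniprot's "##PAF" lines) and blanks.
            if line.startswith("#") or not line.strip():
                continue
            fields = line.rstrip("\n").split("\t")
            if len(fields) < 9 or fields[2] != feature:
                continue
            seqid, strand = fields[0], fields[6]
            start, end = int(fields[3]), int(fields[4])
            if seqid not in genome:
                continue
            # GFF coordinates are 1-based and inclusive.
            seq = genome[seqid][start - 1:end]
            if strand == "-":
                seq = seq.translate(COMPLEMENT)[::-1]
            out.write(f">{seqid}:{start}-{end}({strand})\n{seq}\n")
            written += 1
    return written
```

With this sketch, `extract_regions("multi.gff", "multi.fasta", "multiout.fasta")` would write each `CDS` interval as its own record. Joining the intervals that belong to the same parent transcript into a single coding sequence is left out here for brevity.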
Gaurav Sablok \
University of Potsdam \
Potsdam, Germany