https://github.com/codecreatede/mongodb-genome

A MongoDB implementation for PacBio HiFi genomes and genome annotations.
bioinformatics database database-management database-schema genome-analysis genome-bioinformatics genome-informatics mongodb mongodb-database python3

# mongodb-genome

- A Python class that prepares genome files for MongoDB Atlas.
- Because the genome annotation IDs are the same across records, a list-based nested iteration is implemented so that the records can be inserted into MongoDB with `.insertMany()`.
- The UML diagram for the class is shown below.
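
The list-based preparation mentioned above can be sketched roughly as follows. This is an illustration only, not the repository's code: the function name `gff_to_documents`, the document layout (`annotation_id`, `features`), and the sample records are all assumptions. It groups GFF3 feature lines that share an `ID` attribute into one document each, giving a list that is ready for a bulk insert.

```python
from collections import defaultdict


def gff_to_documents(gff_lines):
    """Group GFF3 feature lines by their ID attribute (hypothetical helper)."""
    grouped = defaultdict(list)
    for line in gff_lines:
        # Skip blank lines and GFF comment/directive lines.
        if not line.strip() or line.startswith("#"):
            continue
        cols = line.rstrip("\n").split("\t")
        if len(cols) != 9:
            continue  # not a well-formed 9-column GFF3 record
        seqid, _source, ftype, start, end, _score, strand, _phase, attrs = cols
        # Column 9 holds semicolon-separated key=value pairs.
        attributes = dict(
            pair.split("=", 1) for pair in attrs.split(";") if "=" in pair
        )
        feature_id = attributes.get("ID", "unknown")
        grouped[feature_id].append(
            {
                "seqid": seqid,
                "type": ftype,
                "start": int(start),
                "end": int(end),
                "strand": strand,
            }
        )
    # One document per annotation ID, each with a nested list of features.
    return [
        {"annotation_id": fid, "features": feats}
        for fid, feats in grouped.items()
    ]


# Made-up sample records, for illustration only.
sample = [
    "##gff-version 3",
    "chr1\tsrc\tgene\t1\t100\t.\t+\t.\tID=gene1",
    "chr1\tsrc\texon\t1\t40\t.\t+\t.\tID=exon1;Parent=gene1",
    "chr1\tsrc\texon\t60\t100\t.\t+\t.\tID=exon1;Parent=gene1",
]
documents = gff_to_documents(sample)
```

With PyMongo, a list like `documents` could then be passed to `collection.insert_many(documents)`, the Python counterpart of the shell's `.insertMany()`.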



```
# If the input files live in a specific directory, change into it first.
import os
os.chdir("path/dir/fasta/gff")

# Prepare the GFF annotations and the FASTA index.
mongodbprepare("samplefile.gff", prepare="yes")
fastaindex("samplefile.fasta", prepare="yes")

# Parse exons and introns from the GFF file.
exonparse("samplefile.gff", prepare="yes")
intronparse("samplefile.gff", prepare="yes")

# Pair the GFF features with the FASTA sequences.
exonseq("samplefile.gff", "samplefile.fasta", mongoexonprepareseq)
intronseq("samplefile.gff", "samplefile.fasta", mongointronprepareseq)

# Parse the GO slim file.
goparsemongo("GO_slim_sample.txt")
```
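
To give a sense of what a call such as `exonseq` has to do with a GFF and a FASTA file, here is a minimal sketch under stated assumptions. It is not the repository's implementation: `read_fasta`, `extract_features`, the output fields, and the sample data are hypothetical. It relies only on the fact that GFF coordinates are 1-based and inclusive, and it ignores strand (no reverse complement is taken).

```python
def read_fasta(lines):
    """Read FASTA lines into a dict of {sequence name: sequence} (hypothetical helper)."""
    chunks, name = {}, None
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # Use the first word of the header as the sequence name.
            name = line[1:].split()[0]
            chunks[name] = []
        elif name is not None:
            chunks[name].append(line)
    return {key: "".join(parts) for key, parts in chunks.items()}


def extract_features(gff_lines, sequences, feature_type="exon"):
    """Slice the sequence of each matching GFF feature (hypothetical helper)."""
    docs = []
    for line in gff_lines:
        if not line.strip() or line.startswith("#"):
            continue
        cols = line.rstrip("\n").split("\t")
        if len(cols) != 9 or cols[2] != feature_type:
            continue
        seqid, start, end = cols[0], int(cols[3]), int(cols[4])
        docs.append(
            {
                "seqid": seqid,
                "start": start,
                "end": end,
                # GFF is 1-based inclusive, Python slices are 0-based half-open.
                "sequence": sequences[seqid][start - 1:end],
            }
        )
    return docs


# Made-up sample data, for illustration only.
fasta = [">chr1 sample", "ACGTACGTAC", "GTTTGGGCCC"]
gff = [
    "chr1\tsrc\texon\t3\t6\t.\t+\t.\tID=exon1",
    "chr1\tsrc\tgene\t1\t20\t.\t+\t.\tID=gene1",
    "chr1\tsrc\texon\t11\t14\t.\t+\t.\tID=exon2",
]
exon_docs = extract_features(gff, read_fasta(fasta))
```

Each entry in `exon_docs` is a plain dictionary, so the list could be inserted in bulk in the same way as the annotation records.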

Gaurav Sablok