Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/timtreis/motivesearch
A tool to search common motives in promoters after RNASeq analysis
https://github.com/timtreis/motivesearch
bioinformatics promoter-analysis
Last synced: 14 days ago
JSON representation
A tool to search common motives in promoters after RNASeq analysis
- Host: GitHub
- URL: https://github.com/timtreis/motivesearch
- Owner: timtreis
- Created: 2017-10-31T12:57:11.000Z (about 7 years ago)
- Default Branch: master
- Last Pushed: 2021-03-16T13:14:37.000Z (almost 4 years ago)
- Last Synced: 2024-12-14T08:08:47.404Z (about 2 months ago)
- Topics: bioinformatics, promoter-analysis
- Language: Python
- Homepage:
- Size: 1.12 MB
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# MotiveSearch
## Introduction
[under development]
## Workflow
1. Identify differentially expressed genes. (e.g. RNAseq)
2. Aquire genomic sequence of host organism and sequences of promoters.
3. Specify parameters in config.txt. An example is given in the "Input files" section.
4. Determine most frequent kmers in the promoters of the differentially expressed genes.
5. Determine baseline frequency of those kmers in the entire genome.
6. Cluster kmers that are found significantly more frequent in the used promoters than in the genome.## Usage
The script offers twopython script.py -m PrepareGenome -i genome.fasta
python script.py -m AnalysePromoters -i counttable.txt -t counttable
## Input files### config.txt
```
path_to_genome = ... // Full path to the genome of the relevant host organism
path_to_promoters = ... // Full path to the file with the genes promoter sequences
list_of_differentially_expressed_genes = ... // file containing a list of gene identifiers
kmer_length = ... // Length of the kmers that will be searched
top_n_kmers = ... // Amount of kmers that should be analysed
```