Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/dombennett/project-dormouse
:mouse: Mini phylogenetic analysis of dormouse tapeworm samples
https://github.com/dombennett/project-dormouse
Last synced: 9 days ago
JSON representation
:mouse: Mini phylogenetic analysis of dormouse tapeworm samples
- Host: GitHub
- URL: https://github.com/dombennett/project-dormouse
- Owner: DomBennett
- Created: 2015-04-10T13:46:00.000Z (over 9 years ago)
- Default Branch: master
- Last Pushed: 2016-01-06T14:21:12.000Z (almost 9 years ago)
- Last Synced: 2023-05-18T18:36:59.649Z (over 1 year ago)
- Language: Python
- Homepage:
- Size: 24.4 KB
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
## Phylogenetic analysis of dormouse tapeworm samples
Pipeline for determining species of tapeworm discovered in Hazel dormouse (*Muscardinus avellanarius*) using 18S and 28S PCR amplified sequences.
### Stages
1. NCBI BLAST amplified sequences against Cyclophellyidea species (txid6201) [MANUALLY]
2. NCBI BLAST amplified sequences against Caryophyllidea species (txid108240) [MANUALLY]
3. Align with MAFFT-QINISI each amplified sequence against BLAST results
4. Combine into single supermatrix
5. Use RAxML to create tree with Caryophyllidea outgroup and partition by geneThe scripts provided here perform stages 3-4. Original data not provided.
### Requirements
* Python
* Biopython
* MAFFT
* RAxML
* UNIX### Running
```{bash}
run.sh >& log.txt
```### Dir structure
```
-- run.sh
-- 1_sequences/
-- 2_alignments/
-- 3_trees/
-- python_scripts/
-- sort_names.py
-- [converts the names of each in the blast results to species names for tree]
-- combine.py
-- [combines the results of each alignment BLAST results intoa single supermatrix]
-- data/
-- samples/
-- [PCR amplified sequences from dormouse sample]
-- BLAST_results/
-- [Top 100 BLAST results for each amplified sequence constrained to the Cyclophyllidea, and then Caryophyllidea (outgroup). We used megablast with default parameters.]
-- misc/
-- all_cyclophyllidea_18s_sequences.fasta
-- [Downloaded all 18S sequences for Cyclophyllidea (tapeworms) NCBI nucleotide search term: "txid6201[Organism:exp] AND 18S NOT predicted[TI] NOT shotgun[TI] NOT scaffold[TI] NOT assembly[TI] NOT unverified[TI]"]
-- primer_pairs.fasta
-- [the primer pairs for 18S that Gaby sequenced from the Dormouse sample]```
### Run history
BLAST results were downloaded from NCBI manually from 04-05/2015run.sh was run on Imperial College London's bioinformatics server: codon.bioinformatics.ic.ac.uk (05/2015)
### Author
Dom Bennett