Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/audy/pangea3
OTU clustering pipeline based on CLC Assembly Cell
https://github.com/audy/pangea3
Last synced: about 21 hours ago
JSON representation
OTU clustering pipeline based on CLC Assembly Cell
- Host: GitHub
- URL: https://github.com/audy/pangea3
- Owner: audy
- License: other
- Created: 2011-10-04T18:22:53.000Z (almost 13 years ago)
- Default Branch: master
- Last Pushed: 2012-07-13T18:13:45.000Z (about 12 years ago)
- Last Synced: 2023-03-11T01:22:00.154Z (over 1 year ago)
- Language: Perl
- Homepage:
- Size: 1.1 MB
- Stars: 1
- Watchers: 2
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: readme.md
- License: license.lic
Awesome Lists containing this project
README
# Pang3a
Austin G. Davis-Richardson,
Chris T. Brown,
David B. Crabb,## Description
This is a simple pipeline for running reference assemblies against the Tax-Collected 16S RDP database using CLC Reference Assemble, and generating (megaclust) tables that can be easily imported into Excel.
The entire process from assembly to splitting up into Phylum..Species is automated.
## Running
Your directory structure should look like this:
/.
/..
/reads/
/database/Then download pang3a:
$ git clone [email protected]:audy/pang3a.git
This will create a `pang3a/` directory
Invoke like this:cd pang3a/
./pang3a ../reads/ ../db.fasta #shannon "run"
The reason you have to be in the pang3a directory is because you have to be
in the same directory as CLC's License file (DRM kills science).
**TIP2** - The reads filenames are the headers in the megaclustable. So to make it legible, make them short. My preferred format is `L_1_B_002.txt` for Lane 1, barcode 2.