https://github.com/orangesi/pykits
simple python script for bioinformatics
https://github.com/orangesi/pykits
bioinformatics-scripts biopython fasta fire-python pandas-python python3
Last synced: 8 months ago
JSON representation
simple python script for bioinformatics
- Host: GitHub
- URL: https://github.com/orangesi/pykits
- Owner: orangeSi
- Created: 2018-10-31T04:22:46.000Z (almost 7 years ago)
- Default Branch: master
- Last Pushed: 2019-06-27T08:55:38.000Z (over 6 years ago)
- Last Synced: 2025-01-11T10:42:11.717Z (10 months ago)
- Topics: bioinformatics-scripts, biopython, fasta, fire-python, pandas-python, python3
- Language: Python
- Size: 83 KB
- Stars: 2
- Watchers: 2
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# pykits
############################################################################################
### fastakit.py
If you upload genome to ncbi, and ncbi told your genome has some base is alignment to adapter, after you validate that and want to replace the region with N,
you can: blast adapter.fa to your genome get m6 output, cat m6|awk '{print $2","$9","$10}' > adapter.to.assembly.m6.replace.list
```
# --seq_pos_list is multi line, seprate by ,
python fastakit.py replace_base --seq_pos_list seq_pos_list --pos_count_from 1 \
--seq_file assembly.fa --output assembly.new.fa --replace_base_with N
# or this just one pos
python fastakit.py replace_base --seq_id 3 --pos_start 1 --pos_end 30 \
--pos_count_from 1 --seq_file assembly.fa --output assembly.new.fa
```
#### python package dependence:
```
biopython >= 1.72
fire >= 0.1.3
Python 3.6.1 :: Anaconda custom (64-bit)
```
############################################################################################
### fire_table.py
#### for metagenome/16s/vcf or other data process:
```
# extract column or row or both by keywords, also support extrac sample name by group name
python fire_table.py extract --data Phylum.gt1percent.kruskal.test.txt.new3 --keywords A.S.14d,C.S.7d --extract_index 0 \
--extract_column 1 --prefix test4 --header 0 --index_col 0 --sample2group_fileds 0,1 --sample2group sample2group.txt \
--sample2group_fileds "0,1" --same_outdir 1 --keep_metadata 1 --skip_not_exists 0
```
#### python package dependence:
```
fire >= 0.1.3
pandas >= 0.23.4
Python 3.6.1 :: Anaconda custom (64-bit)
```
############################################################################################
### fire_sns.py
```
# heatmap homemade for category fragments data, you can defined the block freely by --value_segments !
python fire_sns.py heatmap-homemade --data Class.gt1.kruskal.test.txt.new2 --header 0 --index-col 0 --linewidths 0.2 --outdir . --prefix Class.gt1.kruskal.test.txt.new2.heatmap --oformat pdf --order_x LP.0d,LP.L.7d,LP.L.14d,LP.S.7d,LP.S.14d,OJ.0d,OJ.L.7d,OJ.L.14d,OJ.S.7d,OJ.S.14d --xlabel-rotation 90 --ylabel-rotation 0 --legend-ncol 1 --color-segments '#FAEBD7,#FFEBCD,#F5DEB3,#8B4513' --value_segments 0.5:1,1:2,2:5,5:15
# barplot
python fire_sns.py barplot --outdir . --prefix xx --data Species.all_relative_abundance.xls --oformats "pdf,png" barplot --fig_width_scale 0.35 --fig-height 10 --legend-locus bottom --ncol-leg 0
```
#### python package dependence:
```
fire >= 0.1.3
matplotlib >= 3.0.0
palettable >= 3.1.1
pandas >= 0.23.4
seaborn >= 0.9.0
Python 3.6.1 :: Anaconda custom (64-bit)
```