{"id":19834320,"url":"https://github.com/orangesi/pykits","last_synced_at":"2025-02-28T18:20:32.580Z","repository":{"id":201241037,"uuid":"155498310","full_name":"orangeSi/pykits","owner":"orangeSi","description":"simple python script for bioinformatics","archived":false,"fork":false,"pushed_at":"2019-06-27T08:55:38.000Z","size":85,"stargazers_count":2,"open_issues_count":0,"forks_count":0,"subscribers_count":2,"default_branch":"master","last_synced_at":"2025-01-11T10:42:11.717Z","etag":null,"topics":["bioinformatics-scripts","biopython","fasta","fire-python","pandas-python","python3"],"latest_commit_sha":null,"homepage":null,"language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/orangeSi.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null}},"created_at":"2018-10-31T04:22:46.000Z","updated_at":"2022-03-14T01:49:45.000Z","dependencies_parsed_at":null,"dependency_job_id":"edcd3247-9b3e-4bf1-b34c-ee7dbf865ad1","html_url":"https://github.com/orangeSi/pykits","commit_stats":null,"previous_names":["orangesi/pykits"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/orangeSi%2Fpykits","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/orangeSi%2Fpykits/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/orangeSi%2Fpykits/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/orangeSi%2Fpykits/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/orangeSi","download_url":"https://codeload.github.com/orangeSi/pykits/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":241198980,"owners_count":19926554,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["bioinformatics-scripts","biopython","fasta","fire-python","pandas-python","python3"],"created_at":"2024-11-12T12:03:55.279Z","updated_at":"2025-02-28T18:20:32.562Z","avatar_url":"https://github.com/orangeSi.png","language":"Python","readme":"# pykits\n\n\u003cbr\u003e\n\n############################################################################################\n### fastakit.py\u003cbr\u003e\nIf you upload genome to ncbi, and ncbi told your genome has some base is alignment to adapter, after you validate that and want to replace the region with N,\nyou can: blast adapter.fa to your genome get m6 output, cat m6|awk '{print $2\",\"$9\",\"$10}' \u003e adapter.to.assembly.m6.replace.list\n```\n# --seq_pos_list is multi line, seprate by , \npython fastakit.py replace_base  --seq_pos_list seq_pos_list  --pos_count_from 1 \\\n\t--seq_file assembly.fa --output assembly.new.fa --replace_base_with N\n\n# or this just one pos\npython fastakit.py replace_base  --seq_id 3 --pos_start 1 --pos_end 30 \\\n\t--pos_count_from 1 --seq_file assembly.fa --output assembly.new.fa\n```\n#### python package dependence:\n```\nbiopython \u003e= 1.72\nfire \u003e= 0.1.3\nPython 3.6.1 :: Anaconda custom (64-bit)\n```\n\u003cbr\u003e\u003cbr\u003e\n\n############################################################################################\n### fire_table.py\u003cbr\u003e\n#### for metagenome/16s/vcf or other data process:\n```\n# extract column or row or both by keywords, also support extrac sample name by group name\npython fire_table.py extract --data Phylum.gt1percent.kruskal.test.txt.new3 --keywords A.S.14d,C.S.7d --extract_index 0 \\\n\t--extract_column 1 \t--prefix test4 --header 0 --index_col 0 --sample2group_fileds 0,1 --sample2group sample2group.txt \\\n\t--sample2group_fileds \"0,1\" --same_outdir 1 --keep_metadata 1 --skip_not_exists 0\n```\n\n#### python package dependence:\n```\nfire \u003e= 0.1.3\npandas \u003e= 0.23.4\nPython 3.6.1 :: Anaconda custom (64-bit)\n```\n\u003cbr\u003e\u003cbr\u003e\n\n############################################################################################\n### fire_sns.py\n```\n# heatmap homemade for category fragments data, you can defined the block freely by --value_segments !\npython fire_sns.py heatmap-homemade --data Class.gt1.kruskal.test.txt.new2 --header 0 --index-col 0 --linewidths 0.2 --outdir . --prefix Class.gt1.kruskal.test.txt.new2.heatmap --oformat pdf --order_x LP.0d,LP.L.7d,LP.L.14d,LP.S.7d,LP.S.14d,OJ.0d,OJ.L.7d,OJ.L.14d,OJ.S.7d,OJ.S.14d --xlabel-rotation 90 --ylabel-rotation 0 --legend-ncol 1 --color-segments '#FAEBD7,#FFEBCD,#F5DEB3,#8B4513' --value_segments 0.5:1,1:2,2:5,5:15\n\n# barplot\npython fire_sns.py  barplot --outdir . --prefix  xx --data Species.all_relative_abundance.xls --oformats \"pdf,png\" barplot --fig_width_scale 0.35 --fig-height 10 --legend-locus bottom --ncol-leg 0\n```\n#### python package dependence:\n```\nfire \u003e= 0.1.3\nmatplotlib \u003e= 3.0.0\npalettable \u003e= 3.1.1\npandas \u003e= 0.23.4\nseaborn \u003e= 0.9.0\nPython 3.6.1 :: Anaconda custom (64-bit)\n```\n\n\n","funding_links":[],"categories":[],"sub_categories":[],"project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Forangesi%2Fpykits","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Forangesi%2Fpykits","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Forangesi%2Fpykits/lists"}