{"id":18598225,"url":"https://github.com/cmdoret/acastellanii_genome_analysis","last_synced_at":"2026-02-13T22:04:19.277Z","repository":{"id":100714776,"uuid":"182310304","full_name":"cmdoret/Acastellanii_genome_analysis","owner":"cmdoret","description":"Comparative analysis of the genomes from two Acanthamoeba castellanii strains","archived":false,"fork":false,"pushed_at":"2021-10-06T19:10:16.000Z","size":91593,"stargazers_count":3,"open_issues_count":0,"forks_count":1,"subscribers_count":0,"default_branch":"master","last_synced_at":"2025-07-27T20:03:41.490Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/cmdoret.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2019-04-19T18:42:16.000Z","updated_at":"2022-02-17T03:10:35.000Z","dependencies_parsed_at":null,"dependency_job_id":"31e52192-36b5-4961-b171-1ac3e9653f55","html_url":"https://github.com/cmdoret/Acastellanii_genome_analysis","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/cmdoret/Acastellanii_genome_analysis","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/cmdoret%2FAcastellanii_genome_analysis","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/cmdoret%2FAcastellanii_genome_analysis/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/cmdoret%2FAcastellanii_genome_analysis/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repo
sitories/cmdoret%2FAcastellanii_genome_analysis/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/cmdoret","download_url":"https://codeload.github.com/cmdoret/Acastellanii_genome_analysis/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/cmdoret%2FAcastellanii_genome_analysis/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":29419612,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-02-13T06:24:03.484Z","status":"ssl_error","status_checked_at":"2026-02-13T06:23:12.830Z","response_time":78,"last_error":"SSL_connect returned=1 errno=0 peeraddr=140.82.121.5:443 state=error: unexpected eof while reading","robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":false,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-11-07T01:31:42.387Z","updated_at":"2026-02-13T22:04:19.263Z","avatar_url":"https://github.com/cmdoret.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# _Acanthamoeba castellanii_ genome analysis\n\n[![DOI](https://zenodo.org/badge/DOI/10.5281/zenodo.5541742.svg)](https://doi.org/10.5281/zenodo.5507417)\n## Description \n\nThis repository contains scripts and documentation related to the analysis and comparison of the _Acanthamoeba castellanii_ genome from strains C3 and Neff. 
Analyses include annotation statistics, BUSCO, QUAST, orthologous gene comparison with related species, a Circos plot, sequence divergence between strains, and Hi-C contact profiles at the rDNA sequences.\n\nA frozen copy of this repository as well as output data are available for download in the [associated Zenodo record](https://doi.org/10.5281/zenodo.5507417).\n\n## Installation\n\nThe pipeline is written in Snakemake and manages dependencies with conda. Most of the pipeline steps run inside self-contained conda environments, which are built automatically upon execution. Two dependencies (MCScanX and dnaglider) are not available through conda and need to be installed separately.\n\n**Dependencies:**\n\n* Python 3.7+\n    + snakemake\n    + pandas\n    + numpy\n* conda\n* dnaglider\n* MCScanX\n\nThe input data (genomes, annotations, ...) are downloaded automatically from Zenodo when executing the pipeline.\n\n## Usage\n\nThe analyses are separated into distinct workflows in the `rules` directory.\nThe whole analysis pipeline can be run using Snakemake as follows:\n\n ```snakemake --use-conda -j4```\n\n## Structure\n\nThe master script `Snakefile` calls each workflow one after the other. Each workflow contains rules with input and output files, which execute code or external scripts. Each rule is executed in its own conda environment and downloads its dependencies on the first execution. The overall workflow can be represented as a graph:\n\n![pipeline graph](doc/rulegraph.svg)\n\nThe `envs` directory contains conda environment build specifications for the different rules.\n\nGeneral parameters for the pipeline are stored in the `config.yaml` file and can be modified. The strains to analyze as well as the paths to their sequence files are defined in `samples.tsv`. All external scripts executed by rules are stored in the `scripts` folder. 
Custom Python utility libraries imported in the pipeline are stored in `src`.\n\nThe `doc` directory contains Jupyter notebooks with general analyses of the pipeline results.\n\n\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fcmdoret%2Facastellanii_genome_analysis","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fcmdoret%2Facastellanii_genome_analysis","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fcmdoret%2Facastellanii_genome_analysis/lists"}