{"id":28476015,"url":"https://github.com/vihaankulkarni29/geneaaextractor","last_synced_at":"2026-05-15T01:37:10.299Z","repository":{"id":297663910,"uuid":"997521287","full_name":"vihaankulkarni29/GeneAAExtractor","owner":"vihaankulkarni29","description":null,"archived":false,"fork":false,"pushed_at":"2025-06-08T17:11:49.000Z","size":18,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":0,"default_branch":"main","last_synced_at":"2025-08-04T08:27:04.116Z","etag":null,"topics":["amino","amino-acid-sequence","amino-acids","antimicrobial-resistance","bioinformatics-tool","biopython","extraction-data","google-colab","jupyter-notebook","python-3"],"latest_commit_sha":null,"homepage":"","language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/vihaankulkarni29.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null}},"created_at":"2025-06-06T17:09:34.000Z","updated_at":"2025-06-08T17:11:53.000Z","dependencies_parsed_at":"2025-08-04T07:28:46.222Z","dependency_job_id":"af6bd36e-947c-4c09-90f6-628ca7763d93","html_url":"https://github.com/vihaankulkarni29/GeneAAExtractor","commit_stats":null,"previous_names":["vihaankulkarni29/geneaaextractor"],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/vihaankulkarni29/GeneAAExtractor","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/vihaankulkarni29%2FGeneAAExtractor","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/vihaankulkarni29%2FGeneAAExtractor/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/vihaankulkarni29%2FGeneAAExtractor/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/vihaankulkarni29%2FGeneAAExtractor/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/vihaankulkarni29","download_url":"https://codeload.github.com/vihaankulkarni29/GeneAAExtractor/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/vihaankulkarni29%2FGeneAAExtractor/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":278903014,"owners_count":26065786,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","status":"online","status_checked_at":"2025-10-08T02:00:06.501Z","response_time":56,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["amino","amino-acid-sequence","amino-acids","antimicrobial-resistance","bioinformatics-tool","biopython","extraction-data","google-colab","jupyter-notebook","python-3"],"created_at":"2025-06-07T15:05:50.654Z","updated_at":"2025-10-08T06:45:10.294Z","avatar_url":"https://github.com/vihaankulkarni29.png","language":"Jupyter Notebook","funding_links":[],"categories":[],"sub_categories":[],"readme":"# GeneAAExtractor\n\n# 🧬 GeneAAExtractor\n\nGeneAAExtractor is a lightweight Google Colab tool for extracting amino acid sequences of specific genes directly from GFF3 and FASTA files. It's designed for microbiologists, bioinformaticians, and AMR researchers working with genome annotations and isolate analysis.\n\n## 🚀 Features\n- Extracts amino acid sequences for only the user-specified genes\n- Works with `.gff3`, `.fasta`, and `.txt` gene list inputs\n- Supports strand orientation and reverse complement logic\n- Exports individual `.faa` files for each gene in the format: `GeneName IsolateName.faa`\n- Automatically zips all extracted protein files for download\n\n## 📂 Input Files\n1. **GFF3 file** – Genome annotations\n2. **FASTA file** – Genomic sequence\n3. **TXT file** – List of gene names to extract (one per line, case-sensitive optional)\n\n## 🧪 Output\nA `.zip` archive containing one `.faa` file per gene:\nacrA SSS08.faa\nblaTEM SSS08.faa\ndfrA12 SSS08.faa\n\n\n## 🛠 How to Use\n1. Open the tool in **Google Colab**\n2. Upload your `.gff3`, `.fasta`, and `.txt` files\n3. Enter your isolate name when prompted\n4. The tool processes your genome and downloads the protein `.zip`\n\n## 👩‍🔬 Example Use Case\n```bash\nInput:\n  - ecoli_annotations.gff3\n  - ecoli_genome.fasta\n  - gene_list.txt (contains: acrA, acrB, blaTEM)\n\nOutput:\n  - acrA SSS08.faa\n  - acrB SSS08.faa\n  - blaTEM SSS08.faa\n\n📦 Dependencies\nPython 3.7+\n\nBiopython\n\n📄 License\nMIT License – free to use and adapt for academic or research purposes.\n\n✨ Acknowledgements\nDeveloped with love for wet-lab researchers looking to automate their isolate curation workflows.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fvihaankulkarni29%2Fgeneaaextractor","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fvihaankulkarni29%2Fgeneaaextractor","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fvihaankulkarni29%2Fgeneaaextractor/lists"}