An open API service indexing awesome lists of open source software.

https://github.com/bibymaths/codonanalyzer

A collection of Perl scripts for analyzing DNA and protein sequences, featuring tools for rare codon detection, GC content profiling, and hydropathy plotting, optimized for Escherichia coli datasets.
https://github.com/bibymaths/codonanalyzer

codon-usage dna-sequences perl protein

Last synced: 3 months ago
JSON representation

A collection of Perl scripts for analyzing DNA and protein sequences, featuring tools for rare codon detection, GC content profiling, and hydropathy plotting, optimized for Escherichia coli datasets.

Awesome Lists containing this project

README

          

# CodonAnalyzer

[![Snakemake](https://img.shields.io/badge/snakemake-≥5.6.0-brightgreen.svg?style=flat)](https://snakemake.readthedocs.io)

[![DOI](https://zenodo.org/badge/DOI/10.5281/zenodo.15384943.svg)](https://doi.org/10.5281/zenodo.15384943)

Originally developed using Perl in 2015, this pipeline has been modernized in 2025 with reproducibility and modularity in mind.

---

## Features

- Codon usage statistics and GC content
- Longest ORF extraction
- Protein sequence translation
- Hydropathy calculation (Kyte–Doolittle)
- Plotting via Python + Matplotlib
- Configurable via `config.yaml`
- Reproducible workflow using Snakemake

---

## Quickstart

```bash
git clone https://github.com/yourusername/codonanalyzer.git
cd codonanalyzer
```
### Setup Environment

```bash
conda install -c conda-forge -c bioconda snakemake matplotlib numpy=1.26.0
```

or using environment.yml:

```bash
conda env create -f environment.yml
```

Then, run:

```bash
snakemake --cores 1
```

---

## Documentation

See the full documentation [here](https://bibymaths.github.io/codonanalyzer).

---

## Author

**Abhinav Mishra**
Email: [mishraabhinav@gmail.com](mailto:mishraabhinav@gmail.com)

For full historical context, see [About](docs/about.md).

---

## License

BSD 3-Clause. See [LICENSE](LICENSE).