Projects in Awesome Lists tagged with sequence-analysis
A curated list of projects in awesome lists tagged with sequence-analysis .
https://github.com/lh3/seqtk
Toolkit for processing sequences in FASTA/Q formats
bioinformatics sequence-analysis
Last synced: 15 May 2025
https://github.com/biotite-dev/biotite
A comprehensive library for computational molecular biology
bioinformatics computational-biology pdb-files sequence-analysis structural-biology
Last synced: 22 Jan 2026
https://github.com/lh3/bioawk
BWK awk modified for biological data
bioinformatics sequence-analysis
Last synced: 12 Apr 2025
https://github.com/seqan/seqan3
The modern C++ library for sequence analysis. Contains version 3 of the library and API docs.
bioinformatics blast cpp-concepts cpp20 fasta fastq fm-index modern samtools seqan sequence-alignment sequence-analysis
Last synced: 15 May 2025
https://github.com/haowenz/chromap
Fast alignment and preprocessing of chromatin profiles
bioinformatics chromatin-profiles genomics sequence-analysis
Last synced: 09 May 2025
https://github.com/lh3/readfq
Fast multi-line FASTA/Q reader in several programming languages
bioinformatics sequence-analysis
Last synced: 07 May 2025
https://github.com/althonos/pyhmmer
Cython bindings and Python interface to HMMER3.
bioinformatics cython-library hidden-markov-model hmmer hmmer3 python-bindings python-library sequence-analysis
Last synced: 22 Jan 2026
https://github.com/moshi4/pymsaviz
MSA(Multiple Sequence Alignment) visualization python package for sequence analysis
bioinformatics genomics matplotlib msa multiple-sequence-alignment python sequence-alignment sequence-analysis visualization
Last synced: 07 May 2025
https://github.com/yuanx749/py-cdhit
A Python package for CD-HIT, clustering protein or nucleotide sequences.
bioinformatics package sequence-analysis tool
Last synced: 09 Aug 2025
https://github.com/csbiology/biofsharp
Open source bioinformatics and computational biology toolbox written in F#.
amino-acids biocontainers bioinformatics bioinformatics-containers biology biostatistics dataprocessing datascience docker fsharp nucleotides sequence-analysis
Last synced: 05 Apr 2025
https://github.com/Superzchen/iLearnPlus
iLearnPlus is the first machine-learning platform with both graphical- and web-based user interface that enables the construction of automated machine-learning pipelines for computational analysis and predictions using nucleic acid and protein sequences.
automated-modelling bioinformatics-tool biomedical-data-analytics deep-learning feature-selection machine-learning prediction python sequence-analysis
Last synced: 21 Jul 2025
https://github.com/chasewnelson/snpgenie
Program for estimating πN/πS, dN/dS, and other diversity measures from next-generation sequencing data
diversity-measurement dnds dnds-estimation evolution evolutionary-parameters fasta-sequences molecular-evolution natural-selection nei-gojobori next-generation-sequencing nucleotide nucleotide-diversity perl population-genetics sequence-analysis snp-report substitution-rate vcf vcf-files
Last synced: 23 Mar 2025
https://github.com/althonos/peptides.py
Physicochemical properties, indices and descriptors for amino-acid sequences.
bioinformatics protein pure-python qsar-models sequence-analysis
Last synced: 10 Apr 2025
https://github.com/ggonnella/gfapy
Gfapy: a flexible and extensible software library for handling sequence graphs in Python
assembly bioinformatics gfa graphs sequence-analysis
Last synced: 05 Apr 2025
https://github.com/biojulia/bioalignments.jl
Sequence alignment tools
bam-files bioinformatics biojulia biology dna-sequences high-throughput-sequencing sam-files sequence-alignment sequence-analysis smith-waterman-alignment
Last synced: 15 Apr 2025
https://github.com/nanxstats/protr
🧬 Toolkit for generating various numerical features of protein sequences
bioinformatics feature-engineering feature-extraction machine-learning peptides protein-sequences sequence-analysis
Last synced: 09 Sep 2025
https://github.com/althonos/lightmotif
A lightweight platform-accelerated library for biological motif scanning using position weight matrices.
bioinformatics genomics pssm rust-library sequence-analysis sequence-motif simd
Last synced: 04 Oct 2025
https://github.com/edinburgh-genome-foundry/geneblocks
:diamond_shape_with_a_dot_inside: Find common blocks and differences between DNA sequences
python sequence-analysis synthetic-biology
Last synced: 02 Jul 2025
https://github.com/soedinglab/spacepharer
SpacePHARER CRISPR Spacer Phage-Host pAiRs findER
bioinformatics crispr host-pathogen sequence-analysis
Last synced: 27 Apr 2025
https://github.com/sysbiochalmers/hgtphylodetect
HGTphyloDetect is a versatile toolbox to identify horizontal gene transfer (HGT) events combined with phylogenetic analysis.
evolutionary-biology high-throughput-pipeline horizontal-gene-transfer phylogenetics sequence-analysis systems-biology
Last synced: 29 Jul 2025
https://github.com/ylab-hi/pxblat
PxBLAT: An Efficient and Ergonomic Python Binding Library for BLAT
alignment bioinformatics blat c cpp pybind11 python sequence-analysis
Last synced: 14 Jan 2026
https://github.com/incertae-sedis/smof
Explore and analyze biological sequence data
bioinformatics biology fasta genomics sequence sequence-analysis
Last synced: 17 Dec 2025
https://maraab23.github.io/ggseqplot/
R library for visualizing (social science) sequence data using ggplot2
ggplot2 r r-package rstats sequence-analysis traminer visualization
Last synced: 06 May 2025
https://github.com/maraab23/ggseqplot
R library for visualizing (social science) sequence data using ggplot2
ggplot2 r r-package rstats sequence-analysis traminer visualization
Last synced: 15 Apr 2025
https://github.com/ebedthan/sabreur
fast, reliable and handy demultiplexing tool for fastx files
bioinformatics demultiplexing rust sequence-analysis
Last synced: 15 Sep 2025
https://github.com/althonos/pymemesuite
Cython bindings and Python interface to the MEME suite, a collection of tools for the analysis of sequence motifs.
bioinformatics cython-library genomics meme-suite sequence-analysis sequence-motif
Last synced: 21 Mar 2025
https://github.com/mbhall88/classification_benchmark
Benchmarking different ways of doing read (taxonomic) classification, with a focus on removal of contamination and MTB classification
bioinformatics contamination sequence-analysis taxonomic-classification tuberculosis
Last synced: 11 Apr 2025
https://github.com/mt1022/cubar
R Package for Codon Usage Bias Analysis. Comprehensive documentation and tutorials are available at:
bioinformatics codon-usage machine-learning r-package sequence-analysis
Last synced: 22 Oct 2025
https://github.com/sib-swiss/pftools3
A suite of tools to build and search generalized profiles
motif profile protein-domains sequence-analysis
Last synced: 14 Apr 2025
https://github.com/althonos/pyncbitk
Cython bindings and Python interface to the NCBI C++ ToolKit (including BLAST+).
bioinformatics blast cython-library python-bindings python-library sequence-analysis
Last synced: 30 Dec 2025
https://github.com/nbisweden/contigtax
Taxonomic classification of metagenomic contigs
metagenomics metatranscriptomics sequence-analysis taxonomy-assignment
Last synced: 05 Mar 2025
https://github.com/weekend37/string-kernels
Implementation of the Polynomial String Kernel and linear time String Kernel implementation. Compatible and ready for easy plug-in to Scikit-Learn's Support Vector Machines and supports multithreading.
algorithms genomics kernelsvm machine-learning multithreading sequence-analysis
Last synced: 14 Jan 2026
https://github.com/nmdp-bioinformatics/seq-ann
Python package for doing fast and accurate sequence annotation
bioinformatics gfe hla hla-typing ngs sequence-analysis
Last synced: 28 Jan 2026
https://github.com/woozoo73/adonistrack
Simple Java profiling tool
debug dynamic-analysis java logging performance profiler sequence-analysis
Last synced: 14 Jan 2026
https://github.com/intervals-mining-lab/libiada-core
The classes library for analysis of large data arrays, based on new approach in the field of informatics - formal order analysis (FOA).
bioinformatics c-sharp formal-order-analysis informatics information-theory mathematical-linguistics sequence-analysis
Last synced: 17 Dec 2025
https://github.com/ioangatop/datamining
Data Mining techniques on predicting mood using time-series analysis in mental health and predicting the click behaviour of users in a hotel ranking system.
data-mining data-science prediction ranking-algorithm sequence-analysis xgboost
Last synced: 24 Dec 2025
https://github.com/feldroop/genedex
A small and fast FM-Index for Rust
bioinformatics data-structures fmindex search sequence-analysis
Last synced: 13 Sep 2025
https://github.com/snikumbh/seqarchr
seqArchR: Identifying (promoter) sequence architectures de novo using NMF
clustering nmf nonnegative-matrix-factorization promoter-sequence-architectures r r-package scikit-learn sequence-analysis sequence-architectures unsupervised-machine-learning
Last synced: 19 Jun 2025
https://github.com/jeanmanguy/kappa-casein-evolution-paper
Code for the analysis and figures for the paper: Manguy J, Shields DC. Implications of kappa-casein evolutionary diversity for the self-assembly and aggregation of casein micelles. Royal Society Open Science. 2019 Oct 16;6(10):190939. Available from: https://royalsocietypublishing.org/doi/10.1098/rsos.190939
bioinformatics biology casein evolution milk protein r rstats sequence-analysis
Last synced: 22 Jun 2025
https://github.com/nsbuitrago/vfind
Simple variant finding from NGS data
bioinformatics ngs sequence-analysis
Last synced: 02 Feb 2026
https://github.com/mullinslab/methylation-station
An application for analyzing and visualizing DNA methylation in multiple sequence alignments
bioinformatics methylation sequence-analysis visualization
Last synced: 30 Oct 2025
https://github.com/tusharpandey003/fasta-sequence-analysis-web-app
Analysis of FASTA ,Protein,DNA sequence.
bioinformatics biotechnology data-science dna-sequence-analysis dna-sequences fasta fasta-sequence fasta-sequences nucleotide-sequence protein-sequences sequence-analysis streamlit web-application
Last synced: 13 Mar 2025
https://github.com/cdcgov/clean-genes
A rust crate that automatically cleans up a gene alignment by trimming to ORF and identifying and/or removing problematic sequences.
bioinformatics cdc-influenza-division data-cleaning data-normalisation data-normalization data-science fasta ncird ncird-id sequence-alignment sequence-analysis sequence-annotation
Last synced: 15 Apr 2025
https://github.com/xapple/seqsearch
Sequence similarity searches (e.g. BLAST) made easy
bioinformatics blast-search database-management sequence-analysis
Last synced: 27 Dec 2025
https://github.com/code-jl/dna-sequence-analyzer
A robust Python-based bioinformatics tool for comprehensive DNA sequence analysis and manipulation.
bio-tools bioinformatics biological-data computational-biology data-analysis dna-analysis dna-sequencing fasta gc-content gene-detection genetics genomics molecular-biology motif-finding nucleotide-analysis python python3 scientific-computing sequence-analysis sequence-manipulation
Last synced: 11 Mar 2025
https://github.com/rajesh26k/two-row-arrays
Recursive integer array transformation system inspired by mathematical pattern analysis.
colab frequency-analysis java mathematics pattern-detection sequence-analysis
Last synced: 21 Aug 2025
https://github.com/mvinyard/seq-toolkit
Basic functions for genomic sequence manipulation
dna-sequences genomics sequence-analysis
Last synced: 26 Jul 2025
https://github.com/leonardogemin/duohash
DuoHash is an advanced tool for the efficient calculation of forward and reverse hashes of spaced k-mers in nucleotide sequences, improving the analysis of genomic data by reducing processing time and computational resources.
algorithm-optimization bioinformatics computational-biology dna-sequencing genomics hashing sequence-analysis spaced-kmer
Last synced: 06 Dec 2025
https://github.com/berraylvc/bioseq-analyzer
DNA FASTA sequence analyzer built with Biopython
bioinformatics biopython cli fasta python sequence-analysis
Last synced: 24 Dec 2025
https://github.com/gauravcodepro/evolutioncal
A evolutionary function in ruby which given a similarity score means the number of the similar bases and the dissimilar bases and then the sequence rate and the divergence rate calculates a ratio which can tell us how much sequencing depth to be covered
evolution evolutionary-computation evolutionary-strategy ruby rubyprogramming sequence-analysis
Last synced: 22 Feb 2025
https://github.com/gauravcodepro/plant-resistance-gene-fetcher
a custom function to fetch the dna and the protein sequence from the plant resistance gene database and get the corresponding dna_sequence and the protein_sequence.
bioinformatics disease-detection diseaseresistance plantgenomics plantresistance sequence-analysis sequence-labeling
Last synced: 22 Feb 2025
https://github.com/ctskennerton/fx_tools
Very small utilities for interrogating fasta and fastq files
bioinformatics fasta fastq sequence-analysis
Last synced: 02 Apr 2025