Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Bioinformatics

Bioinformatics is an interdisciplinary field that intersects with biology, computer science, mathematics and statistics. It concerns itself with the development and use of methods and software tools for collecting and analyzing biological data.

https://github.com/biopython/biopython

Official git repository for Biopython (originally converted from CVS)

bioinformatics biopython dna genomics phylogenetics protein protein-structure python sequence-alignment

Last synced: 28 Oct 2024

https://github.com/google/deepvariant

DeepVariant is an analysis pipeline that uses a deep neural network to call genetic variants from next-generation DNA sequencing data.

bioinformatics deep-learning deep-neural-network deepvariant dna genome genomics machine-learning ngs science sequencing tensorflow

Last synced: 10 Oct 2024

https://github.com/opengene/fastp

An ultra-fast all-in-one FASTQ preprocessor (QC/adapters/trimming/filtering/splitting/merging...)

adapter bioinformatics duplication fastq filter filtering illumina merging ngs overlap polyg preprocessing qc quality quality-control sequencing splitting trimming umi

Last synced: 14 Oct 2024

https://github.com/scverse/scanpy

Single-cell analysis in Python. Scales to >1M cells.

anndata bioinformatics data-science machine-learning python scanpy scverse transcriptomics visualize-data

Last synced: 29 Oct 2024

https://github.com/lh3/minimap2

A versatile pairwise aligner for genomic and spliced nucleotide sequences

bioinformatics genomics sequence-alignment spliced-alignment

Last synced: 15 Oct 2024

https://github.com/broadinstitute/gatk

Official code repository for GATK versions 4 and up

bioinformatics dna gatk genome genomics ngs science sequencing spark

Last synced: 12 Oct 2024

https://allenai.github.io/scispacy/

A full spaCy pipeline and models for scientific/biomedical documents.

bioinformatics biomedical custom-pipes nlp scientific-documents spacy

Last synced: 04 Aug 2024

https://github.com/allenai/scispacy

A full spaCy pipeline and models for scientific/biomedical documents.

bioinformatics biomedical custom-pipes nlp scientific-documents spacy

Last synced: 14 Oct 2024

https://github.com/bioconda/bioconda-recipes

Conda recipes for the bioconda channel.

bioinformatics conda hacktoberfest package-management

Last synced: 14 Oct 2024

https://github.com/lh3/bwa

Burrows-Wheeler Aligner for short-read alignment (see minimap2 for long-read alignment)

bioinformatics fm-index genomics sequence-alignment

Last synced: 15 Oct 2024

https://github.com/lh3/seqtk

Toolkit for processing sequences in FASTA/Q formats

bioinformatics sequence-analysis

Last synced: 14 Oct 2024

https://github.com/shenwei356/seqkit

A cross-platform and ultrafast toolkit for FASTA/Q file manipulation

bioinformatics cross-platform fasta fastq golang manipulation sequence tool toolkit

Last synced: 02 Aug 2024

https://github.com/multiqc/multiqc

Aggregate results from bioinformatics analyses across many samples into a single report.

analysis bioconda bioinformatics data-visualization multiqc pypi python quality-control reporting seqera vizualisation

Last synced: 09 Oct 2024

https://github.com/crazyhottommy/getting-started-with-genomics-tools-and-resources

Unix, R and python tools for genomics and data science

bioinformatics cancer-genomics data-science

Last synced: 15 Oct 2024

https://github.com/lightaime/deep_gcns_torch

PyTorch repo for DeepGCNs (ICCV'2019 Oral, TPAMI'2021), DeeperGCN (arXiv'2020) and GNN1000 (ICML'2021): https://www.deepgcns.org

3d-point-clouds bioinformatics cheminformatics computer-vision data-mining deep-gcns deep-learning geometric-deep-learning graph-convolutional-networks graph-neural-networks pytorch science-research social-network

Last synced: 29 Oct 2024

https://github.com/scipipe/scipipe

Robust, flexible and resource-efficient pipelines using Go and the commandline

bioinformatics bioinformatics-pipeline cheminformatics dataflow fbp go golang pipeline scientific-workflows scipipe workflow workflow-engine

Last synced: 15 Oct 2024

https://github.com/plotly/react-plotly.js

A plotly.js React component from Plotly 📈

bioinformatics charting-library d3 data-visualization fintech plotly react

Last synced: 01 Nov 2024

https://github.com/shenwei356/csvtk

A cross-platform, efficient and practical CSV/TSV toolkit in Golang

bioinformatics command-line cross-platform csv golang tool toolkit tsv

Last synced: 01 Nov 2024

https://github.com/bigdatagenomics/adam

ADAM is a genomics analysis platform with specialized file formats built using Apache Avro, Apache Spark, and Apache Parquet. Apache 2 licensed.

avro big-data bioinformatics genomics java parquet python r scala spark

Last synced: 29 Oct 2024

https://github.com/broadinstitute/cromwell

Scientific workflow engine designed for simplicity & scalability. Trivially transition from one-off use cases to massive-scale production environments.

application bioinformatics cloud containers docker executor ga4gh hpc scala wdl workflow workflow-description-language workflow-execution

Last synced: 30 Oct 2024

https://github.com/kblin/ncbi-genome-download

Scripts to download genomes from the NCBI FTP servers

bioinformatics biology command-line download-genomes genbank genomics ncbi python

Last synced: 01 Nov 2024

https://github.com/hail-is/hail

Cloud-native genomic dataframes and batch computing

bioinformatics genetics genomics gwas hail python software vcf

Last synced: 02 Aug 2024

https://github.com/biocore/scikit-bio

scikit-bio: a community-driven Python library for bioinformatics, providing versatile data structures, algorithms and educational resources.

bioinformatics computational-biology

Last synced: 27 Oct 2024

https://github.com/scikit-bio/scikit-bio

scikit-bio: a community-driven Python library for bioinformatics, providing versatile data structures, algorithms and educational resources.

bioinformatics computational-biology

Last synced: 07 Aug 2024

https://github.com/shenwei356/rush

A cross-platform command-line tool for executing jobs in parallel

bioinformatics command cross-platform execute golang parallel pipeline shell windows

Last synced: 01 Aug 2024

https://github.com/google/nucleus

Python and C++ code for reading and writing genomics data.

bioinformatics dna genomics tensorflow

Last synced: 26 Sep 2024

https://github.com/samtools/htslib

C library for high-throughput sequencing data formats

bam bcf bioinformatics cram htslib ngs sam vcf

Last synced: 03 Aug 2024

https://github.com/tanghaibao/jcvi

Python library to facilitate genome assembly, annotation, and comparative genomics

allmaps assembly bioinformatics blast comparative-genomics genetic-maps genome-sequencing genomics sequence-alignments synteny variant-calling

Last synced: 12 Oct 2024

https://github.com/openwdl/wdl

Workflow Description Language - Specification and Implementations

bioinformatics cloud cromwell openwdl reproducibility reproducible-science wdl workflow

Last synced: 01 Aug 2024

https://github.com/COMBINE-lab/salmon

🐟 🍣 🍱 Highly-accurate & wicked fast transcript-level quantification from RNA-seq reads using selective alignment

10x bioinformatics c-plus-plus gene-expression quantification quasi-mapping rna-seq rna-seq-quantification rnaseq sailfish salmon scrna-seq selective-alignment single-cell single-cell-rna-seq transcriptome

Last synced: 03 Aug 2024

https://github.com/awslabs/dgl-lifesci

Python package for graph neural networks in chemistry and biology

bioinformatics cheminformatics deep-learning dgl drug-discovery geometric-deep-learning graph-neural-networks molecule

Last synced: 09 Oct 2024

https://github.com/danforthcenter/plantcv

Plant phenotyping with image analysis

bioinformatics image-analysis plant-phenotyping plantcv science

Last synced: 30 Oct 2024

https://github.com/deeptools/deepTools

Tools to process and analyze deep sequencing data.

bioinformatics chip-seq genomics ngs python rna-seq

Last synced: 02 Aug 2024

https://github.com/BenLangmead/bowtie2

A fast and sensitive gapped read aligner

bioinformatics c-plus-plus genomics read-aligners

Last synced: 02 Aug 2024

https://github.com/biotite-dev/biotite

A comprehensive library for computational molecular biology

bioinformatics computational-biology pdb-files sequence-analysis structural-biology

Last synced: 09 Aug 2024

https://github.com/lh3/bioawk

BWK awk modified for biological data

bioinformatics sequence-analysis

Last synced: 03 Aug 2024

https://github.com/voutcn/megahit

Ultra-fast and memory-efficient (meta-)genome assembler

bioinformatics data-structures genome-assembly genomics metagenomics succinct

Last synced: 03 Aug 2024

https://github.com/biod/sambamba

Tools for working with SAM/BAM data

bam bioinformatics sam

Last synced: 03 Aug 2024

https://github.com/fritzsedlazeck/sniffles

Structural variation caller using third generation sequencing

bioconda bioinformatics nanopore ngm-lr pacbio structural-variation structural-variations

Last synced: 30 Oct 2024

https://github.com/jts/nanopolish

Signal-level algorithms for MinION data

bioinformatics c-plus-plus epigenetics genome-assembly methylation science

Last synced: 02 Aug 2024

https://github.com/plotly/dash-bio

Open-source bioinformatics components for Dash

bioinformatics biojs dash

Last synced: 03 Aug 2024

https://github.com/owkin/PyDESeq2

A Python implementation of the DESeq2 pipeline for bulk RNA-seq differential expression analysis (DEA).

bioinformatics differential-expression python rna-seq transcriptomics

Last synced: 03 Aug 2024

https://github.com/plotly/dash.jl

Dash for Julia - A Julia interface to the Dash ecosystem for creating analytic web applications in Julia. No JavaScript required.

bioinformatics charting dash dashboard data-science data-visualization finance gui-framework julia modeling no-javascript no-vba plotly plotly-dash productivity react technical-computing web-app

Last synced: 12 Oct 2024

https://github.com/Martinsos/edlib

Lightweight, super fast C/C++ (& Python) library for sequence alignment using edit (Levenshtein) distance.

alignment-path bioinformatics c-plus-plus edit-distance levehnstein-distance library python sequence-alignment

Last synced: 01 Nov 2024

https://github.com/cdk/cdk

The Chemistry Development Kit

bioinformatics blueobelisk cheminformatics chemistry code4lib java

Last synced: 02 Aug 2024

https://github.com/chhylp123/hifiasm

Hifiasm: a haplotype-resolved assembler for accurate HiFi reads

bioinformatics denovo-assembly genomics hifi-read pacbio

Last synced: 02 Aug 2024

https://github.com/GMOD/jbrowse

JBrowse 1, a full-featured genome browser built with JavaScript and HTML5. For JBrowse 2, see https://github.com/GMOD/jbrowse-components.

bioinformatics biology genome-annotation genome-browser genomics protein-annotation

Last synced: 01 Nov 2024

https://github.com/mdshw5/pyfaidx

Efficient Pythonic random access to FASTA subsequences

bgzf bioinformatics dna fasta genomics indexing protein python samtools

Last synced: 02 Aug 2024

https://github.com/poisonalien/maftools

Summarize, Analyze and Visualize MAF files from TCGA or in-house studies.

bioinformatics cancer-genome-atlas cancer-genomics genomics maf-files r tcga

Last synced: 31 Oct 2024

https://github.com/pharmai/plip

Protein-Ligand Interaction Profiler - Analyze and visualize non-covalent protein-ligand interactions in PDB files according to 📝 Adasme et al. (2021), https://doi.org/10.1093/nar/gkab294

bioinformatics docker openbabel pdb plip protein-structure python-bindings scientific-computing singularity

Last synced: 08 Aug 2024

https://github.com/openbiox/awosome-bioinformatics

A curated list of resources for learning bioinformatics.

bioinformatics data-analysis next-generation-sequencing

Last synced: 02 Aug 2024

https://github.com/agitter/single-cell-pseudotime

An overview of algorithms for estimating pseudotime in single-cell RNA-seq data

bioinformatics pseudotime-algorithms rna-seq single-cell

Last synced: 14 Oct 2024

https://github.com/lh3/minigraph

Sequence-to-graph mapper and graph generator

bioinformatics genome-graph genomics pan-genome sequence-alignment

Last synced: 01 Aug 2024

https://github.com/Illumina/manta

Structural variant and indel caller for mapped sequencing data

bioinformatics indels structural-variation structural-variations

Last synced: 02 Aug 2024

https://github.com/aidenlab/juicer

A One-Click System for Analyzing Loop-Resolution Hi-C Experiments

3d-genome 3d-genome-browser bioinformatics genomics hi-c ngs

Last synced: 29 Oct 2024

https://github.com/seqan/seqan3

The modern C++ library for sequence analysis. Contains version 3 of the library and API docs.

bioinformatics blast cpp-concepts cpp17 cpp20 fasta fastq fm-index modern samtools seqan sequence-alignment sequence-analysis

Last synced: 12 Oct 2024

https://github.com/FelixKrueger/Bismark

A tool to map bisulfite converted sequence reads and determine cytosine methylation states

bioinformatics dna methylation

Last synced: 01 Nov 2024

https://github.com/arvados/arvados

An open source platform for managing and analyzing biomedical big data

arvados aws azure bigdata bioinformatics cloud cluster cwl docker gcp genomics go python ruby workflow workflow-engine

Last synced: 31 Oct 2024

https://github.com/brentp/vcfanno

Annotate a VCF with other VCFs/BEDs/tabixed files

annotation bioinformatics genomics vcf

Last synced: 01 Nov 2024

https://github.com/brentp/cyvcf2

cython + htslib == fast VCF and BCF processing

bioinformatics cython genomics htslib vcf

Last synced: 12 Oct 2024

https://github.com/bioconvert/bioconvert

Bioconvert is a collaborative project to facilitate the interconversion of life science data from one format to another.

bam2cram bam2sam bcf2vcf bigbed2bed bioinformatics conversion convertor cram2bam cram2sam embl2ena embl2fasta embl2genbank fasta2fastq genbank2embl genbank2fasta ngs phylogeny sam2bam sequence sniffer

Last synced: 03 Aug 2024

https://github.com/fritzsedlazeck/survivor

Toolset for SV simulation, comparison and filtering

bioconda bioinformatics comparison simulator structural-variations survivor vcf

Last synced: 26 Oct 2024

https://github.com/shenwei356/taxonkit

A practical and efficient NCBI Taxonomy toolkit that also supports creating NCBI-style taxdump files for custom taxonomies like GTDB/ICTV

bioinformatics cross-platform lca lineage taxdump taxid taxonkit taxonomy

Last synced: 02 Aug 2024

https://github.com/Illumina/strelka

Strelka2 germline and somatic small variant caller

bioinformatics indels snps snvs

Last synced: 02 Aug 2024

https://github.com/gao-lab/GLUE

Graph-linked unified embedding for single-cell multi-omics data integration

bioinformatics deep-learning single-cell single-cell-multiomics

Last synced: 05 Aug 2024