Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

awesome-list


https://github.com/netguru/awesome-list

Last synced: 3 days ago
JSON representation

  • Graphical

  • Deep Learning

  • Dimensionality Reduction

    • umap - dimension reduction technique that
  • Multipurpose

    • DALI - A library containing both highly optimized building blocks and an execution engine for data pre-processing in deep learning applications [docs](https://docs.nvidia.com/deeplearning/sdk/dali-developer-guide/docs/index.html)
    • gin-config - Gin provides
    • imbalanced-learn - A python package offering a number of re-sampling techniques. Compatible with scikit-learn, is part of scikit-learn-contrib projects.
    • mlxtend - A library of extension and
    • numpy - The fundamental package for
    • PyOD - Outlier detection library
    • scikit-learn - machine
    • scikit-learn-laboratory (SKLL) - CLI
    • scipy - open-source software for
    • statsmodels - statistical
    • SymPy - A computer algebra system written in pure Python, library for symbolic mathematics
    • RAPIDS - Open GPU Data Science. More
    • here
    • statsmodels - statistical
    • Vaex - Out-of-Core DataFrames for Python, visualize and explore big tabular data at a billion rows per second. [Project page](https://vaex.io/)
  • Natural Language Processing

  • Optimization

  • Probabilistic Programming

    • pyro - Deep universal probabilistic
    • pgmpy - Python Library for Probabilistic
  • Recommender systems

  • Speech Processing

  • Compilers

    • glow - Compiler for Neural Network hardware
    • numba - NumPy aware dynamic Python compiler
    • jax - Composable transformations of
  • Data Adapters

    • csvkit - A suite of utilities for
    • redash - Connect to any data source,
    • odo - Odo migrates between many formats. These include in-memory structures like list, pd.DataFrame and np.ndarray and also data outside of Python like CSV/JSON/HDF5 files, SQL databases, data on remote machines, and the Hadoop File System.
  • Data Annotation

    • doccano - Open source text annotation tool for machine learning practitioners.
    • snorkel - A system for
  • Data Gathering

    • scrapy - high-level library to write
  • Data Management

    • Quilt - Quilt versions and deploys data
  • Data Visualization

  • Feature engineering

    • featuretools - an open source python framework for automated feature engineering.
    • featuretools - an open source python framework for automated feature engineering.
  • Hardware Management

    • nvtop - a (h)top like task monitor for NVIDIA GPUs.
    • s2i - Source-to-Image (S2I) is a toolkit and workflow for building reproducible container images from source code. S2I produces ready-to-run images by injecting source code into a container image and letting the container prepare that source code for execution. By creating self-assembling builder images, you can version and control your build environments exactly like you use container images to version your runtime environments.
  • Job Management

    • luigi - Luigi is a Python module that
  • Parallelization

    • pywren - parfor on AWS Lambda
    • horovod - Distributed training framework
    • dask - library for parallel computing in Python with dynamic task scheduling: numpy computation graphs.
  • Reporting

  • Speach

    • Common-Voice - Multi language, open source database with voice samples that anyone can use to train speech-enabled applications.
  • Hybrid