Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

awesome-python-data-science

From gitlab
https://github.com/jacob98415/awesome-python-data-science

Last synced: about 15 hours ago
JSON representation

  • Visualization

    • Interactive plots

      • Altair - Declarative statistical visualization library for Python. Can easily do many data transformation within the code to create graph
    • Map

      • folium - Makes it easy to visualize data on an interactive open street map
  • Deployment

    • NLP

      • fastapi - Modern, fast (high-performance), a web framework for building APIs with Python
      • binder - Enable sharing and execute Jupyter Notebooks
      • datapane - A collection of APIs to turn scripts and notebooks into interactive reports.
  • Model Explanation

    • NLP

      • Skater - Python Library for Model Interpretation.
      • FlashLight - Visualization Tool for your NeuralNetwork.
  • Reinforcement Learning

    • NLP

      • RLlib - Scalable Reinforcement Learning.
  • Probabilistic Methods

    • NLP

      • GPflow - Gaussian processes in TensorFlow. <img height="20" src="img/tf_big2.png" alt="sklearn">
  • Optimization

    • NLP

      • OR-Tools - An open-source software suite for optimization by Google; provides a unified programming interface to a half dozen solvers: SCIP, GLPK, GLOP, CP-SAT, CPLEX, and Gurobi.
  • Time Series

    • NLP

      • dateutil - Powerful extensions to the standard datetime module
  • Natural Language Processing

    • NLP

      • spaCy - Industrial-Strength Natural Language Processing.
      • gensim - Topic Modelling for Humans.
  • Computer Vision

  • Distributed Computing

    • NLP

      • PySpark - Exposes the Spark programming model to Python. <img height="20" src="img/spark_big.png" alt="Apache Spark based">
  • Machine Learning

    • General Purpose Machine Learning

      • Shogun - Machine learning toolbox.
    • Ensemble Methods

      • ML-Ensemble - High performance ensemble learning. <img height="20" src="img/sklearn_big.png" alt="sklearn">
  • Deep Learning

  • Web Scraping

  • Data Manipulation

    • Pipelines

      • SSPipe - Python pipe (|) operator with support for DataFrames and Numpy, and Pytorch.