Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

awesome-python-tools

Awesome Python Tools
https://github.com/dr-saad-la/awesome-python-tools

Last synced: 3 days ago
JSON representation

  • Data Science

    • Data Manipulations

      • Dask
      • cuDF - accelerated DataFrame library based on Pandas, part of the RAPIDS ecosystem for faster large-scale data processing.
      • cuDF - accelerated DataFrame library based on Pandas, part of the RAPIDS ecosystem for faster large-scale data processing.
      • Bcolz - memory analytics, provides columnar and compressed data containers for faster manipulation of large datasets.
      • PySpark
      • Koalas - like API built on Apache Spark, allowing distributed DataFrame operations while maintaining a familiar syntax.
      • Bcolz - memory analytics, provides columnar and compressed data containers for faster manipulation of large datasets.
      • PySpark
      • Koalas - like API built on Apache Spark, allowing distributed DataFrame operations while maintaining a familiar syntax.
      • Mars - scale data computation using multi-dimensional arrays and DataFrames in distributed settings.
      • Mars - scale data computation using multi-dimensional arrays and DataFrames in distributed settings.
      • Awkward Array - like data and scientific research.
      • SQLAlchemy
      • PyArrow - scale data manipulation, commonly used with Parquet and Feather file formats.
      • Pandas
      • Pandas
      • PyJanitor
      • SQLite - in Python library for handling structured data stored in SQLite databases.
      • SQLite - in Python library for handling structured data stored in SQLite databases.
      • Dask
      • Vaex - of-core DataFrame library, optimized for working with datasets larger than memory.
  • Data Visualizations

    • Data Manipulations

      • Matplotlib - quality figures.
      • Seaborn
      • Plotly - ready plots. It supports a wide range of chart types, including 3D plots, geographic maps, and dashboards.
      • Bokeh - time streaming data and interactive dashboards.
      • Altair
      • ggplot - based approach to data visualization.
      • Holoviews
      • Datashader
      • hvPlot - level plotting API that simplifies the creation of interactive visualizations for Pandas, Dask, and Xarray data structures, integrating with Holoviews and Bokeh.
      • Streamlit
      • Flask-Dashboard - based visualizations on top of Flask.
      • Folium
      • Cartopy
      • Kepler.gl - scale geographic data visualizations with beautiful, interactive maps.
      • PyGraphviz
      • mplfinance
      • PyMC3
      • Manim - quality mathematical animations and presentations, widely used for educational videos.
      • Mayavi
      • PyVista
      • Vispy - performance interactive 2D/3D data visualization library that leverages the power of OpenGL.
      • Matplotlib 3D - in 3D plotting capabilities in Matplotlib, ideal for basic 3D visualizations and surface plots.
      • Dash - driven apps without requiring frontend knowledge.
      • Vaex - in fast visualization of large datasets, including scatter plots and histograms.
  • Machine Learning

    • General Machine Learning Libraries

      • Scikit-learn
      • PyCaret - source, low-code machine learning library that automates many aspects of machine learning pipelines, including preprocessing, model selection, and tuning.
    • Gradient Boosting Libraries

      • XGBoost - world applications. It is particularly effective for structured/tabular data.
      • CatBoost
    • Neural Networks and Deep Learning Integration

      • TensorFlow - source deep learning framework widely used for neural network modeling, machine learning, and artificial intelligence applications. It supports both high-level and low-level APIs for building, training, and deploying machine learning models.
      • Keras - level neural networks API, built on top of TensorFlow, that simplifies the process of building and training neural networks. Keras is user-friendly and modular, making it ideal for rapid experimentation.
      • PyTorch
    • Model Explainability and Interpretability

      • LIME - agnostic Explanations (LIME) is a library that explains the predictions of any machine learning classifier or regressor by approximating it locally with interpretable models.
    • Hyperparameter Tuning and Optimization

    • Automated Machine Learning (AutoML)

      • Auto-sklearn - learn, automating model selection, hyperparameter optimization, and data preprocessing.
      • TPOT - based AutoML library that automates the selection of models and hyperparameters.
    • Model Deployment and Monitoring

      • MLflow - source platform for managing the machine learning lifecycle. MLflow tracks experiments, packages code into reproducible runs, and manages model deployment.
      • DVC - source tool for managing datasets and machine learning models. It helps version data, manage pipelines, and reproduce results.
    • Dimensionality Reduction

      • UMAP - dimensional data.
      • t-SNE - Distributed Stochastic Neighbor Embedding is a non-linear dimensionality reduction technique useful for visualizing high-dimensional data.
      • PCA
    • Specialized Machine Learning Libraries

      • FastAI - level library built on top of PyTorch, designed to simplify the implementation of deep learning models with minimal code.
      • OpenCV
  • Text Analysis

  • Speech Recognition

  • Time Series Analysis

  • Statistical Analysis

  • Web Development

    • Specialized Libraries

      • Django - level Python web framework that encourages rapid development and clean, pragmatic design. Includes ORM, admin interface, and authentication.
      • Flask
      • FastAPI - performance) web framework for building APIs with Python, based on standard Python type hints. Known for its speed and ease of use.
      • Pyramid
  • Programming