Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/dylanhogg/awesome-python

๐Ÿ Hand-picked awesome Python libraries and frameworks, organised by category
https://github.com/dylanhogg/awesome-python

List: awesome-python

awesome awesome-list awesome-python chatgpt data data-science deep-learning jupyter machine-learning natural-language-processing nlp open-source pandas python python-library

Last synced: about 2 months ago
JSON representation

๐Ÿ Hand-picked awesome Python libraries and frameworks, organised by category

Awesome Lists containing this project

README

        

# Awesome Python

[![Awesome](https://awesome.re/badge.svg)](https://awesome.re) ![Last commit](https://img.shields.io/github/last-commit/dylanhogg/awesome-python) [![License: MIT](https://img.shields.io/badge/license-MIT-green.svg)](https://opensource.org/licenses/MIT)

Hand-picked awesome Python libraries and frameworks, organised by category ๐Ÿ

Interactive version: [www.awesomepython.org](https://www.awesomepython.org/)

Updated 20 Jul 2024

## Categories

- [Newly Created Repositories](#newly-created-repositories) - Awesome Python is regularly updated, and this category lists the most recently created GitHub repositories from all the other repositories here (10 repos)
- [Code Quality](#code-quality) - Code quality tooling: linters, formatters, pre-commit hooks, unused code removal (17 repos)
- [Crypto and Blockchain](#crypto-and-blockchain) - Cryptocurrency and blockchain libraries: trading bots, API integration, Ethereum virtual machine, solidity (13 repos)
- [Data](#data) - General data libraries: data processing, serialisation, formats, databases, SQL, connectors, web crawlers, data generation/augmentation/checks (100 repos)
- [Debugging](#debugging) - Debugging and tracing tools (9 repos)
- [Diffusion Text to Image](#diffusion-text-to-image) - Text-to-image diffusion model libraries, tools and apps for generating images from natural language (41 repos)
- [Finance](#finance) - Financial and quantitative libraries: investment research tools, market data, algorithmic trading, backtesting, financial derivatives (31 repos)
- [Game Development](#game-development) - Game development tools, engines and libraries (6 repos)
- [GIS](#gis) - Geospatial libraries: raster and vector data formats, interactive mapping and visualisation, computing frameworks for processing images, projections (29 repos)
- [Graph](#graph) - Graphs and network libraries: network analysis, graph machine learning, visualisation (6 repos)
- [GUI](#gui) - Graphical user interface libraries and toolkits (8 repos)
- [Jupyter](#jupyter) - Jupyter and JupyterLab and Notebook tools, libraries and plugins (24 repos)
- [LLMs and ChatGPT](#llms-and-chatgpt) - Large language model and GPT libraries and frameworks: auto-gpt, agents, QnA, chain-of-thought workflows, API integations. Also see the Natural Language Processing category for crossover (246 repos)
- [Math and Science](#math-and-science) - Mathematical, numerical and scientific libraries (22 repos)
- [Machine Learning - General](#machine-learning---general) - General and classical machine learning libraries. See below for other sections covering specialised ML areas (156 repos)
- [Machine Learning - Deep Learning](#machine-learning---deep-learning) - Machine learning libraries that cross over with deep learning in some way (72 repos)
- [Machine Learning - Interpretability](#machine-learning---interpretability) - Machine learning interpretability libraries. Covers explainability, prediction explainations, dashboards, understanding knowledge development in training (19 repos)
- [Machine Learning - Ops](#machine-learning---ops) - MLOps tools, frameworks and libraries: intersection of machine learning, data engineering and DevOps; deployment, health, diagnostics and governance of ML models (44 repos)
- [Machine Learning - Reinforcement](#machine-learning---reinforcement) - Machine learning libraries and toolkits that cross over with reinforcement learning in some way: agent reinforcement learning, agent environemnts, RLHF (22 repos)
- [Machine Learning - Time Series](#machine-learning---time-series) - Machine learning and classical timeseries libraries: forecasting, seasonality, anomaly detection, econometrics (19 repos)
- [Natural Language Processing](#natural-language-processing) - Natural language processing libraries and toolkits: text processing, topic modelling, tokenisers, chatbots. Also see the LLMs and ChatGPT category for crossover (82 repos)
- [Packaging](#packaging) - Python packaging, dependency management and bundling (28 repos)
- [Pandas](#pandas) - Pandas and dataframe libraries: data analysis, statistical reporting, pandas GUIs, pandas performance optimisations (24 repos)
- [Performance](#performance) - Performance, parallelisation and low level libraries (28 repos)
- [Profiling](#profiling) - Memory and CPU/GPU profiling tools and libraries (11 repos)
- [Security](#security) - Security related libraries: vulnerability discovery, SQL injection, environment auditing (14 repos)
- [Simulation](#simulation) - Simulation libraries: robotics, economic, agent-based, traffic, physics, astronomy, chemistry, quantum simulation. Also see the Maths and Science category for crossover (28 repos)
- [Study](#study) - Miscellaneous study resources: algorithms, general resources, system design, code repos for textbooks, best practices, tutorials (55 repos)
- [Template](#template) - Template tools and libraries: cookiecutter repos, generators, quick-starts (9 repos)
- [Terminal](#terminal) - Terminal and console tools and libraries: CLI tools, terminal based formatters, progress bars (15 repos)
- [Testing](#testing) - Testing libraries: unit testing, load testing, acceptance testing, code coverage, browser automation, plugins (24 repos)
- [Typing](#typing) - Typing libraries: static and run-time type checking, annotations (12 repos)
- [Utility](#utility) - General utility libraries: miscellaneous tools, linters, code formatters, version management, package tools, documentation tools (197 repos)
- [Vizualisation](#vizualisation) - Vizualisation tools and libraries. Application frameworks, 2D/3D plotting, dashboards, WebGL (33 repos)
- [Web](#web) - Web related frameworks and libraries: webapp servers, WSGI, ASGI, asyncio, HTTP, REST, user management (58 repos)

## Newly Created Repositories

Awesome Python is regularly updated, and this category lists the most recently created GitHub repositories from all the other repositories here.

1. naklecha/llama3-from-scratch โญ 11,288
llama3 implementation one matrix multiplication at a time

2. codium-ai/cover-agent โญ 4,042
CodiumAI Cover-Agent: An AI-Powered Tool for Automated Test Generation and Code Coverage Enhancement! ๐Ÿ’ป๐Ÿค–๐Ÿงช๐Ÿž
๐Ÿ”— [www.codium.ai](https://www.codium.ai/)

3. blealtan/efficient-kan โญ 3,606
An efficient pure-PyTorch implementation of Kolmogorov-Arnold Network (KAN).

4. google-research/timesfm โญ 3,171
TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting.
๐Ÿ”— [research.google/blog/a-decoder-only-foundation-model-for-time-series-forecasting](https://research.google/blog/a-decoder-only-foundation-model-for-time-series-forecasting/)

5. mistralai/mistral-finetune โญ 2,489
A light-weight codebase that enables memory-efficient and performant finetuning of Mistral's models. It is based on LoRA.

6. pydantic/logfire โญ 1,707
Uncomplicated Observability for Python and beyond! ๐Ÿชต๐Ÿ”ฅ
๐Ÿ”— [docs.pydantic.dev/logfire](https://docs.pydantic.dev/logfire/)

7. thudm/CogVLM2 โญ 1,599
GPT4V-level open-source multi-modal model based on Llama3-8B

8. likejazz/llama3.np โญ 933
llama3.np is a pure NumPy implementation for Llama 3 model.

9. eureka-research/DrEureka โญ 749
Official Repository for "DrEureka: Language Model Guided Sim-To-Real Transfer" (RSS 2024)
๐Ÿ”— [eureka-research.github.io/dr-eureka](https://eureka-research.github.io/dr-eureka/)

10. prometheus-eval/prometheus-eval โญ 690
Evaluate your LLM's response with Prometheus and GPT4 ๐Ÿ’ฏ

## Code Quality

Code quality tooling: linters, formatters, pre-commit hooks, unused code removal.

1. psf/black โญ 37,860
The uncompromising Python code formatter
๐Ÿ”— [black.readthedocs.io/en/stable](https://black.readthedocs.io/en/stable/)

2. astral-sh/ruff โญ 29,132
An extremely fast Python linter and code formatter, written in Rust.
๐Ÿ”— [docs.astral.sh/ruff](https://docs.astral.sh/ruff)

3. google/yapf โญ 13,702
A formatter for Python files

4. pre-commit/pre-commit โญ 12,444
A framework for managing and maintaining multi-language pre-commit hooks.
๐Ÿ”— [pre-commit.com](https://pre-commit.com)

5. sqlfluff/sqlfluff โญ 7,428
A modular SQL linter and auto-formatter with support for multiple dialects and templated code.
๐Ÿ”— [www.sqlfluff.com](https://www.sqlfluff.com)

6. pycqa/isort โญ 6,381
A Python utility / library to sort imports.
๐Ÿ”— [pycqa.github.io/isort](https://pycqa.github.io/isort/)

7. davidhalter/jedi โญ 5,731
Awesome autocompletion, static analysis and refactoring library for python
๐Ÿ”— [jedi.readthedocs.io](http://jedi.readthedocs.io)

8. pycqa/pylint โญ 5,196
It's not just a linter that annoys you!
๐Ÿ”— [pylint.readthedocs.io/en/latest](https://pylint.readthedocs.io/en/latest/)

9. asottile/pyupgrade โญ 3,430
A tool (and pre-commit hook) to automatically upgrade syntax for newer versions of the language.

10. pycqa/flake8 โญ 3,351
flake8 is a python tool that glues together pycodestyle, pyflakes, mccabe, and third-party plugins to check the style and quality of some python code.
๐Ÿ”— [flake8.pycqa.org](https://flake8.pycqa.org)

11. jendrikseipp/vulture โญ 3,301
Find dead Python code

12. wemake-services/wemake-python-styleguide โญ 2,466
The strictest and most opinionated python linter ever!
๐Ÿ”— [wemake-python-styleguide.rtfd.io](https://wemake-python-styleguide.rtfd.io)

13. codespell-project/codespell โญ 1,812
check code for common misspellings

14. python-lsp/python-lsp-server โญ 1,768
Fork of the python-language-server project, maintained by the Spyder IDE team and the community

15. sourcery-ai/sourcery โญ 1,504
Instant AI code reviews
๐Ÿ”— [sourcery.ai](https://sourcery.ai)

16. akaihola/darker โญ 624
Apply black reformatting to Python files only in regions changed since a given commit. For a practical usage example, see the blog post at https://dev.to/akaihola/improving-python-code-incrementally-3f7a
๐Ÿ”— [pypi.org/project/darker](https://pypi.org/project/darker/)

17. tconbeer/sqlfmt โญ 369
sqlfmt formats your dbt SQL files so you don't have to
๐Ÿ”— [sqlfmt.com](https://sqlfmt.com)

## Crypto and Blockchain

Cryptocurrency and blockchain libraries: trading bots, API integration, Ethereum virtual machine, solidity.

1. ccxt/ccxt โญ 32,044
A JavaScript / TypeScript / Python / C# / PHP cryptocurrency trading API with support for more than 100 bitcoin/altcoin exchanges
๐Ÿ”— [docs.ccxt.com](https://docs.ccxt.com)

2. freqtrade/freqtrade โญ 26,570
Free, open source crypto trading bot
๐Ÿ”— [www.freqtrade.io](https://www.freqtrade.io)

3. crytic/slither โญ 5,151
Static Analyzer for Solidity and Vyper
๐Ÿ”— [blog.trailofbits.com/2018/10/19/slither-a-solidity-static-analysis-framework](https://blog.trailofbits.com/2018/10/19/slither-a-solidity-static-analysis-framework/)

4. ethereum/web3.py โญ 4,895
A python interface for interacting with the Ethereum blockchain and ecosystem.
๐Ÿ”— [web3py.readthedocs.io](http://web3py.readthedocs.io)

5. ethereum/consensus-specs โญ 3,494
Ethereum Proof-of-Stake Consensus Specifications

6. cyberpunkmetalhead/Binance-volatility-trading-bot โญ 3,386
This is a fully functioning Binance trading bot that measures the volatility of every coin on Binance and places trades with the highest gaining coins If you like this project consider donating though the Brave browser to allow me to continuously improve the script.

7. ethereum/py-evm โญ 2,221
A Python implementation of the Ethereum Virtual Machine
๐Ÿ”— [py-evm.readthedocs.io/en/latest](https://py-evm.readthedocs.io/en/latest/)

8. bmoscon/cryptofeed โญ 2,141
Cryptocurrency Exchange Websocket Data Feed Handler

9. binance/binance-public-data โญ 1,412
Details on how to get Binance public data

10. ofek/bit โญ 1,224
Bitcoin made easy.
๐Ÿ”— [ofek.dev/bit](https://ofek.dev/bit/)

11. man-c/pycoingecko โญ 1,038
Python wrapper for the CoinGecko API

12. palkeo/panoramix โญ 783
Ethereum decompiler

13. dylanhogg/awesome-crypto โญ 66
A list of awesome crypto and blockchain projects
๐Ÿ”— [www.awesomecrypto.xyz](https://www.awesomecrypto.xyz/)

## Data

General data libraries: data processing, serialisation, formats, databases, SQL, connectors, web crawlers, data generation/augmentation/checks.

1. scrapy/scrapy โญ 51,755
Scrapy, a fast high-level web crawling & scraping framework for Python.
๐Ÿ”— [scrapy.org](https://scrapy.org)

2. apache/spark โญ 38,960
Apache Spark - A unified analytics engine for large-scale data processing
๐Ÿ”— [spark.apache.org](https://spark.apache.org/)

3. mindsdb/mindsdb โญ 25,561
The platform for building AI from enterprise data
๐Ÿ”— [mindsdb.com](https://mindsdb.com)

4. getredash/redash โญ 25,446
Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
๐Ÿ”— [redash.io](http://redash.io/)

5. jaidedai/EasyOCR โญ 22,916
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
๐Ÿ”— [www.jaided.ai](https://www.jaided.ai)

6. qdrant/qdrant โญ 18,898
Qdrant - High-performance, massive-scale Vector Database for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
๐Ÿ”— [qdrant.tech](https://qdrant.tech)

7. humansignal/label-studio โญ 17,522
Label Studio is an open source data labeling tool. It lets you label data types like audio, text, images, videos, and time series with a simple and straightforward UI and export to various model formats.
๐Ÿ”— [labelstud.io](https://labelstud.io)

8. joke2k/faker โญ 17,377
Faker is a Python package that generates fake data for you.
๐Ÿ”— [faker.readthedocs.io](https://faker.readthedocs.io)

9. binux/pyspider โญ 16,419
A Powerful Spider(Web Crawler) System in Python.
๐Ÿ”— [docs.pyspider.org](http://docs.pyspider.org/)

10. twintproject/twint โญ 15,657
An advanced Twitter scraping & OSINT tool written in Python that doesn't use Twitter's API, allowing you to scrape a user's followers, following, Tweets and more while evading most API limitations.

11. airbytehq/airbyte โญ 14,891
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
๐Ÿ”— [airbyte.com](https://airbyte.com)

12. apache/arrow โญ 13,950
Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing
๐Ÿ”— [arrow.apache.org](https://arrow.apache.org/)

13. chroma-core/chroma โญ 13,672
the AI-native open-source embedding database
๐Ÿ”— [www.trychroma.com](https://www.trychroma.com/)

14. tiangolo/sqlmodel โญ 13,630
SQL databases in Python, designed for simplicity, compatibility, and robustness.
๐Ÿ”— [sqlmodel.tiangolo.com](https://sqlmodel.tiangolo.com/)

15. redis/redis-py โญ 12,428
Redis Python client

16. coleifer/peewee โญ 10,973
a small, expressive orm -- supports postgresql, mysql, sqlite and cockroachdb
๐Ÿ”— [docs.peewee-orm.com](http://docs.peewee-orm.com/)

17. s0md3v/Photon โญ 10,697
Incredibly fast crawler designed for OSINT.

18. avaiga/taipy โญ 9,894
Turns Data and AI algorithms into production-ready web applications in no time.
๐Ÿ”— [www.taipy.io](https://www.taipy.io)

19. sqlalchemy/sqlalchemy โญ 9,186
The Database Toolkit for Python
๐Ÿ”— [www.sqlalchemy.org](https://www.sqlalchemy.org)

20. simonw/datasette โญ 9,113
An open source multi-tool for exploring and publishing data
๐Ÿ”— [datasette.io](https://datasette.io)

21. bigscience-workshop/petals โญ 8,934
๐ŸŒธ Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
๐Ÿ”— [petals.dev](https://petals.dev)

22. yzhao062/pyod โญ 8,305
A Python Library for Outlier and Anomaly Detection, Integrating Classical and Deep Learning Techniques
๐Ÿ”— [pyod.readthedocs.io](http://pyod.readthedocs.io)

23. voxel51/fiftyone โญ 7,910
The open-source tool for building high-quality datasets and computer vision models
๐Ÿ”— [fiftyone.ai](https://fiftyone.ai)

24. gristlabs/grist-core โญ 6,624
Grist is the evolution of spreadsheets.
๐Ÿ”— [www.getgrist.com](https://www.getgrist.com/)

25. alirezamika/autoscraper โญ 6,078
A Smart, Automatic, Fast and Lightweight Web Scraper for Python

26. tobymao/sqlglot โญ 6,066
Python SQL Parser and Transpiler
๐Ÿ”— [sqlglot.com](https://sqlglot.com/)

27. kaggle/kaggle-api โญ 6,041
Official Kaggle API

28. madmaze/pytesseract โญ 5,673
A Python wrapper for Google Tesseract

29. vi3k6i5/flashtext โญ 5,573
Extract Keywords from sentence or Replace keywords in sentences.

30. airbnb/knowledge-repo โญ 5,454
A next-generation curated knowledge sharing platform for data scientists and other technical professions.

31. facebookresearch/AugLy โญ 4,924
A data augmentations library for audio, image, text, and video.
๐Ÿ”— [ai.facebook.com/blog/augly-a-new-data-augmentation-library-to-help-build-more-robust-ai-models](https://ai.facebook.com/blog/augly-a-new-data-augmentation-library-to-help-build-more-robust-ai-models/)

32. ibis-project/ibis โญ 4,616
Ibis is a Python library that provides a lightweight, universal interface for data wrangling. It helps Python users explore and transform data of any size, stored anywhere.
๐Ÿ”— [ibis-project.org](https://ibis-project.org)

33. superduperdb/superduper โญ 4,564
๐Ÿ”ฎ SuperDuper: Bring AI to your database! Build, deploy and manage any AI application directly with your existing data infrastructure, without moving your data. Including streaming inference, scalable model training and vector search.
๐Ÿ”— [superduper.io](https://superduper.io)

34. jazzband/tablib โญ 4,563
Python Module for Tabular Datasets in XLS, CSV, JSON, YAML, &c.
๐Ÿ”— [tablib.readthedocs.io](https://tablib.readthedocs.io/)

35. lk-geimfari/mimesis โญ 4,350
Mimesis is a robust data generator for Python that can produce a wide range of fake data in multiple languages.
๐Ÿ”— [mimesis.name](https://mimesis.name)

36. amundsen-io/amundsen โญ 4,343
Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.
๐Ÿ”— [www.amundsen.io/amundsen](https://www.amundsen.io/amundsen/)

37. mongodb/mongo-python-driver โญ 4,090
PyMongo - the Official MongoDB Python driver
๐Ÿ”— [pymongo.readthedocs.io](https://pymongo.readthedocs.io)

38. giskard-ai/giskard โญ 3,761
๐Ÿข Open-Source Evaluation & Testing for LLMs and ML models
๐Ÿ”— [docs.giskard.ai](https://docs.giskard.ai)

39. lancedb/lancedb โญ 3,672
Developer-friendly, serverless vector database for AI applications. Easily add long-term memory to your LLM apps!
๐Ÿ”— [lancedb.github.io/lancedb](https://lancedb.github.io/lancedb/)

40. andialbrecht/sqlparse โญ 3,644
A non-validating SQL parser module for Python

41. jmcnamara/XlsxWriter โญ 3,555
A Python module for creating Excel XLSX files.
๐Ÿ”— [xlsxwriter.readthedocs.io](https://xlsxwriter.readthedocs.io)

42. deepchecks/deepchecks โญ 3,495
Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML validation needs, enabling to thoroughly test your data and models from research to production.
๐Ÿ”— [docs.deepchecks.com/stable](https://docs.deepchecks.com/stable)

43. rom1504/img2dataset โญ 3,458
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.

44. run-llama/llama-hub โญ 3,425
A library of data loaders for LLMs made by the community -- to be used with LlamaIndex and/or LangChain
๐Ÿ”— [llamahub.ai](https://llamahub.ai/)

45. praw-dev/praw โญ 3,403
PRAW, an acronym for "Python Reddit API Wrapper", is a python package that allows for simple access to Reddit's API.
๐Ÿ”— [praw.readthedocs.io](http://praw.readthedocs.io/)

46. pyeve/cerberus โญ 3,126
Lightweight, extensible data validation library for Python
๐Ÿ”— [python-cerberus.org](http://python-cerberus.org)

47. datafold/data-diff โญ 2,920
Compare tables within or across databases
๐Ÿ”— [docs.datafold.com](https://docs.datafold.com)

48. zoomeranalytics/xlwings โญ 2,906
xlwings is a Python library that makes it easy to call Python from Excel and vice versa. It works with Excel on Windows and macOS as well as with Google Sheets and Excel on the web.
๐Ÿ”— [www.xlwings.org](https://www.xlwings.org)

49. pallets/itsdangerous โญ 2,864
Safely pass trusted data to untrusted environments and back.
๐Ÿ”— [itsdangerous.palletsprojects.com](https://itsdangerous.palletsprojects.com)

50. docarray/docarray โญ 2,861
Represent, send, store and search multimodal data
๐Ÿ”— [docs.docarray.org](https://docs.docarray.org/)

51. goldsmith/Wikipedia โญ 2,855
A Pythonic wrapper for the Wikipedia API
๐Ÿ”— [wikipedia.readthedocs.org](https://wikipedia.readthedocs.org/)

52. pathwaycom/pathway โญ 2,847
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
๐Ÿ”— [pathway.com](https://pathway.com)

53. awslabs/amazon-redshift-utils โญ 2,731
Amazon Redshift Utils contains utilities, scripts and view which are useful in a Redshift environment

54. sqlalchemy/alembic โญ 2,606
A database migrations tool for SQLAlchemy.

55. kayak/pypika โญ 2,443
PyPika is a python SQL query builder that exposes the full richness of the SQL language using a syntax that reflects the resulting query. PyPika excels at all sorts of SQL queries but is especially useful for data analysis.
๐Ÿ”— [pypika.readthedocs.io/en/latest](http://pypika.readthedocs.io/en/latest/)

56. pynamodb/PynamoDB โญ 2,405
A pythonic interface to Amazon's DynamoDB
๐Ÿ”— [pynamodb.readthedocs.io](http://pynamodb.readthedocs.io)

57. rapidai/RapidOCR โญ 2,372
Awesome OCR multiple programing languages toolkits based on ONNXRuntime, OpenVION and PaddlePaddle. ๏ผˆๅฐ†PaddleOCRๆจกๅž‹ๅšไบ†่ฝฌๆข๏ผŒ้‡‡็”จONNXRuntimeๆŽจ็†๏ผŒ้€Ÿๅบฆๅพˆๅฟซ๏ผ‰
๐Ÿ”— [rapidai.github.io/rapidocrdocs](https://rapidai.github.io/RapidOCRDocs)

58. emirozer/fake2db โญ 2,272
Generate fake but valid data filled databases for test purposes using most popular patterns(AFAIK). Current support is sqlite, mysql, postgresql, mongodb, redis, couchdb.

59. sdv-dev/SDV โญ 2,223
Synthetic data generation for tabular data
๐Ÿ”— [docs.sdv.dev/sdv](https://docs.sdv.dev/sdv)

60. uqfoundation/dill โญ 2,207
serialize all of Python
๐Ÿ”— [dill.rtfd.io](http://dill.rtfd.io)

61. accenture/AmpliGraph โญ 2,121
Python library for Representation Learning on Knowledge Graphs https://docs.ampligraph.org

62. graphistry/pygraphistry โญ 2,101
PyGraphistry is a Python library to quickly load, shape, embed, and explore big graphs with the GPU-accelerated Graphistry visual graph analyzer

63. dlt-hub/dlt โญ 2,033
data load tool (dlt) is an open source Python library that makes data loading easy ๐Ÿ› ๏ธ
๐Ÿ”— [dlthub.com/docs](https://dlthub.com/docs)

64. samuelcolvin/arq โญ 2,029
Fast job queuing and RPC in python with asyncio and redis.
๐Ÿ”— [arq-docs.helpmanual.io](https://arq-docs.helpmanual.io/)

65. sfu-db/connector-x โญ 1,871
Fastest library to load data from DB to DataFrames in Rust and Python
๐Ÿ”— [sfu-db.github.io/connector-x](https://sfu-db.github.io/connector-x)

66. agronholm/sqlacodegen โญ 1,793
Automatic model code generator for SQLAlchemy

67. huggingface/datatrove โญ 1,786
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

68. uber/petastorm โญ 1,766
Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch, and PySpark and can be used from pure Python code.

69. milvus-io/bootcamp โญ 1,726
Dealing with all unstructured data, such as reverse image search, audio search, molecular search, video analysis, question and answer systems, NLP, etc.
๐Ÿ”— [milvus.io](https://milvus.io)

70. aio-libs/aiomysql โญ 1,723
aiomysql is a library for accessing a MySQL database from the asyncio
๐Ÿ”— [aiomysql.rtfd.io](https://aiomysql.rtfd.io)

71. aminalaee/sqladmin โญ 1,718
SQLAlchemy Admin for FastAPI and Starlette
๐Ÿ”— [aminalaee.dev/sqladmin](https://aminalaee.dev/sqladmin/)

72. simple-salesforce/simple-salesforce โญ 1,628
A very simple Salesforce.com REST API client for Python

73. collerek/ormar โญ 1,622
python async orm with fastapi in mind and pydantic validation
๐Ÿ”— [collerek.github.io/ormar](https://collerek.github.io/ormar/)

74. simonw/sqlite-utils โญ 1,572
Python CLI utility and library for manipulating SQLite databases
๐Ÿ”— [sqlite-utils.datasette.io](https://sqlite-utils.datasette.io)

75. eleutherai/the-pile โญ 1,438
The Pile is a large, diverse, open source language modelling data set that consists of many smaller datasets combined together.

76. sdispater/orator โญ 1,430
The Orator ORM provides a simple yet beautiful ActiveRecord implementation.
๐Ÿ”— [orator-orm.com](https://orator-orm.com)

77. mchong6/JoJoGAN โญ 1,413
Official PyTorch repo for JoJoGAN: One Shot Face Stylization

78. zarr-developers/zarr-python โญ 1,407
An implementation of chunked, compressed, N-dimensional arrays for Python.
๐Ÿ”— [zarr.readthedocs.io](http://zarr.readthedocs.io/)

79. aio-libs/aiopg โญ 1,385
aiopg is a library for accessing a PostgreSQL database from the asyncio
๐Ÿ”— [aiopg.readthedocs.io](http://aiopg.readthedocs.io)

80. ydataai/ydata-synthetic โญ 1,377
Synthetic data generators for tabular and time-series data
๐Ÿ”— [docs.synthetic.ydata.ai](https://docs.synthetic.ydata.ai)

81. google/tensorstore โญ 1,322
Library for reading and writing large multi-dimensional arrays.
๐Ÿ”— [google.github.io/tensorstore](https://google.github.io/tensorstore/)

82. scholarly-python-package/scholarly โญ 1,296
Retrieve author and publication information from Google Scholar in a friendly, Pythonic way without having to worry about CAPTCHAs!
๐Ÿ”— [scholarly.readthedocs.io](https://scholarly.readthedocs.io/)

83. eliasdabbas/advertools โญ 1,099
advertools - online marketing productivity and analysis tools
๐Ÿ”— [advertools.readthedocs.io](https://advertools.readthedocs.io)

84. pytorch/data โญ 1,086
A PyTorch repo for data loading and utilities to be shared by the PyTorch domain libraries.

85. aio-libs/aiocache โญ 1,074
Asyncio cache manager for redis, memcached and memory
๐Ÿ”— [aiocache.readthedocs.io](http://aiocache.readthedocs.io)

86. brettkromkamp/contextualise โญ 1,043
Contextualise is an effective tool particularly suited for organising information-heavy projects and activities consisting of unstructured and widely diverse data and information resources
๐Ÿ”— [contextualise.dev](https://contextualise.dev/)

87. uber/fiber โญ 1,041
Distributed Computing for AI Made Simple
๐Ÿ”— [uber.github.io/fiber](https://uber.github.io/fiber/)

88. intake/intake โญ 994
Intake is a lightweight package for finding, investigating, loading and disseminating data.
๐Ÿ”— [intake.readthedocs.io](https://intake.readthedocs.io/)

89. scikit-hep/awkward โญ 813
Manipulate JSON-like data with NumPy-like idioms.
๐Ÿ”— [awkward-array.org](https://awkward-array.org)

90. duckdb/dbt-duckdb โญ 807
dbt (http://getdbt.com) adapter for DuckDB (http://duckdb.org)

91. koaning/human-learn โญ 788
Natural Intelligence is still a pretty good idea.
๐Ÿ”— [koaning.github.io/human-learn](https://koaning.github.io/human-learn/)

92. goccy/bigquery-emulator โญ 770
BigQuery emulator provides a way to launch a BigQuery server on your local machine for testing and development.

93. macbre/sql-metadata โญ 767
Uses tokenized query returned by python-sqlparse and generates query metadata
๐Ÿ”— [pypi.python.org/pypi/sql-metadata](https://pypi.python.org/pypi/sql-metadata)

94. hyperqueryhq/whale โญ 724
๐Ÿณ The stupidly simple CLI workspace for your data warehouse.
๐Ÿ”— [rsyi.gitbook.io/whale](https://rsyi.gitbook.io/whale)

95. googleapis/python-bigquery โญ 719
Python Client for Google BigQuery

96. dgarnitz/vectorflow โญ 656
VectorFlow is a high volume vector embedding pipeline that ingests raw data, transforms it into vectors and writes it to a vector DB of your choice.
๐Ÿ”— [www.getvectorflow.com](https://www.getvectorflow.com/)

97. kagisearch/vectordb โญ 595
A minimal Python package for storing and retrieving text using chunking, embeddings, and vector search.
๐Ÿ”— [vectordb.com](https://vectordb.com)

98. igorbenav/fastcrud โญ 565
FastCRUD is a Python package for FastAPI, offering robust async CRUD operations and flexible endpoint creation utilities.

99. jina-ai/vectordb โญ 509
A Python vector database you just need - no more, no less.

100. koaning/bulk โญ 471
Bulk is a quick UI developer tool to apply some bulk labels.

## Debugging

Debugging and tracing tools.

1. cool-rr/PySnooper โญ 16,300
Never use print for debugging again

2. gruns/icecream โญ 8,692
๐Ÿฆ Never use print() to debug again.

3. shobrook/rebound โญ 4,092
Command-line tool that instantly fetches Stack Overflow results when an exception is thrown

4. inducer/pudb โญ 2,914
Full-screen console debugger for Python
๐Ÿ”— [documen.tician.de/pudb](https://documen.tician.de/pudb/)

5. gotcha/ipdb โญ 1,836
Integration of IPython pdb

6. alexmojaki/heartrate โญ 1,749
Simple real time visualisation of the execution of a Python program.

7. alexmojaki/birdseye โญ 1,643
Graphical Python debugger which lets you easily view the values of all evaluated expressions
๐Ÿ”— [birdseye.readthedocs.io](https://birdseye.readthedocs.io)

8. alexmojaki/snoop โญ 1,222
A powerful set of Python debugging tools, based on PySnooper

9. samuelcolvin/python-devtools โญ 968
Dev tools for python
๐Ÿ”— [python-devtools.helpmanual.io](https://python-devtools.helpmanual.io/)

## Diffusion Text to Image

Text-to-image diffusion model libraries, tools and apps for generating images from natural language.

1. automatic1111/stable-diffusion-webui โญ 136,239
Stable Diffusion web UI

2. compvis/stable-diffusion โญ 66,754
A latent text-to-image diffusion model
๐Ÿ”— [ommer-lab.com/research/latent-diffusion-models](https://ommer-lab.com/research/latent-diffusion-models/)

3. comfyanonymous/ComfyUI โญ 42,528
The most powerful and modular stable diffusion GUI, api and backend with a graph/nodes interface.
๐Ÿ”— [www.comfy.org](https://www.comfy.org/)

4. stability-ai/stablediffusion โญ 37,660
High-Resolution Image Synthesis with Latent Diffusion Models

5. lllyasviel/ControlNet โญ 29,126
Let us control diffusion models!

6. huggingface/diffusers โญ 24,159
๐Ÿค— Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
๐Ÿ”— [huggingface.co/docs/diffusers](https://huggingface.co/docs/diffusers)

7. invoke-ai/InvokeAI โญ 22,355
InvokeAI is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The solution offers an industry leading WebUI, supports terminal use through a CLI, and serves as the foundation for multi...
๐Ÿ”— [invoke-ai.github.io/invokeai](https://invoke-ai.github.io/InvokeAI/)

8. apple/ml-stable-diffusion โญ 16,491
Stable Diffusion with Core ML on Apple Silicon

9. borisdayma/dalle-mini โญ 14,703
DALLยทE Mini - Generate images from a text prompt
๐Ÿ”— [www.craiyon.com](https://www.craiyon.com)

10. divamgupta/diffusionbee-stable-diffusion-ui โญ 12,154
Diffusion Bee is the easiest way to run Stable Diffusion locally on your M1 Mac. Comes with a one-click installer. No dependencies or technical knowledge needed.
๐Ÿ”— [diffusionbee.com](https://diffusionbee.com)

11. compvis/latent-diffusion โญ 11,145
High-Resolution Image Synthesis with Latent Diffusion Models

12. lucidrains/DALLE2-pytorch โญ 10,974
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch

13. instantid/InstantID โญ 10,520
InstantID : Zero-shot Identity-Preserving Generation in Seconds ๐Ÿ”ฅ
๐Ÿ”— [instantid.github.io](https://instantid.github.io/)

14. facebookresearch/dinov2 โญ 8,422
PyTorch code and models for the DINOv2 self-supervised learning method.

15. ashawkey/stable-dreamfusion โญ 8,027
Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion.

16. openbmb/MiniCPM-V โญ 7,999
MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone

17. carson-katri/dream-textures โญ 7,708
Stable Diffusion built-in to Blender

18. xavierxiao/Dreambooth-Stable-Diffusion โญ 7,524
Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion

19. timothybrooks/instruct-pix2pix โญ 6,123
PyTorch implementation of InstructPix2Pix, an instruction-based image editing model, based on the original CompVis/stable_diffusion repo.

20. openai/consistency_models โญ 6,034
Official repo for consistency models.

21. idea-research/GroundingDINO โญ 5,723
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
๐Ÿ”— [arxiv.org/abs/2303.05499](https://arxiv.org/abs/2303.05499)

22. salesforce/BLIP โญ 4,487
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

23. nateraw/stable-diffusion-videos โญ 4,353
Create ๐Ÿ”ฅ videos with Stable Diffusion by exploring the latent space and morphing between text prompts

24. opengvlab/InternVL โญ 4,212
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. ๆŽฅ่ฟ‘GPT-4V่กจ็Žฐ็š„ๅฏๅ•†็”จๅผ€ๆบๅคšๆจกๆ€ๅฏน่ฏๆจกๅž‹
๐Ÿ”— [internvl.github.io](https://internvl.github.io/)

25. jina-ai/discoart โญ 3,837
๐Ÿชฉ Create Disco Diffusion artworks in one line

26. lkwq007/stablediffusion-infinity โญ 3,830
Outpainting with Stable Diffusion on an infinite canvas

27. mlc-ai/web-stable-diffusion โญ 3,503
Bringing stable diffusion models to web browsers. Everything runs inside the browser with no server support.
๐Ÿ”— [mlc.ai/web-stable-diffusion](https://mlc.ai/web-stable-diffusion)

28. openai/glide-text2im โญ 3,501
GLIDE: a diffusion-based text-conditional image synthesis model

29. openai/improved-diffusion โญ 3,029
Release for Improved Denoising Diffusion Probabilistic Models

30. saharmor/dalle-playground โญ 2,760
A playground to generate images from any text prompt using Stable Diffusion (past: using DALL-E Mini)

31. stability-ai/stability-sdk โญ 2,415
SDK for interacting with stability.ai APIs (e.g. stable diffusion inference)
๐Ÿ”— [platform.stability.ai](https://platform.stability.ai/)

32. google-research/big_vision โญ 2,040
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.

33. thudm/CogVLM2 โญ 1,599
GPT4V-level open-source multi-modal model based on Llama3-8B

34. divamgupta/stable-diffusion-tensorflow โญ 1,575
Stable Diffusion in TensorFlow / Keras

35. coyote-a/ultimate-upscale-for-automatic1111 โญ 1,567
Ultimate SD Upscale extension for AUTOMATIC1111 Stable Diffusion web UI

36. nvlabs/prismer โญ 1,291
The implementation of "Prismer: A Vision-Language Model with Multi-Task Experts".
๐Ÿ”— [shikun.io/projects/prismer](https://shikun.io/projects/prismer)

37. chenyangqiqi/FateZero โญ 1,075
[ICCV 2023 Oral] "FateZero: Fusing Attentions for Zero-shot Text-based Video Editing"
๐Ÿ”— [fate-zero-edit.github.io](http://fate-zero-edit.github.io/)

38. thereforegames/unprompted โญ 770
Templating language written for Stable Diffusion workflows. Available as an extension for the Automatic1111 WebUI.

39. open-compass/VLMEvalKit โญ 713
Open-source evaluation toolkit of large vision-language models (LVLMs), support GPT-4v, Gemini, QwenVLPlus, 50+ HF models, 20+ benchmarks
๐Ÿ”— [huggingface.co/spaces/opencompass/open_vlm_leaderboard](https://huggingface.co/spaces/opencompass/open_vlm_leaderboard)

40. sharonzhou/long_stable_diffusion โญ 680
Long-form text-to-images generation, using a pipeline of deep generative models (GPT-3 and Stable Diffusion)

41. tanelp/tiny-diffusion โญ 590
A minimal PyTorch implementation of probabilistic diffusion models for 2D datasets.

## Finance

Financial and quantitative libraries: investment research tools, market data, algorithmic trading, backtesting, financial derivatives.

1. openbb-finance/OpenBB โญ 26,729
Investment Research for Everyone, Everywhere.
๐Ÿ”— [openbb.co](https://openbb.co)

2. quantopian/zipline โญ 17,306
Zipline, a Pythonic Algorithmic Trading Library
๐Ÿ”— [www.zipline.io](https://www.zipline.io)

3. microsoft/qlib โญ 14,635
Qlib is an AI-oriented quantitative investment platform that aims to realize the potential, empower research, and create value using AI technologies in quantitative investment, from exploring ideas to implementing productions. Qlib supports diverse machine learning modeling paradigms. including supervised learning, ...
๐Ÿ”— [qlib.readthedocs.io/en/latest](https://qlib.readthedocs.io/en/latest/)

4. mementum/backtrader โญ 13,577
Python Backtesting library for trading strategies
๐Ÿ”— [www.backtrader.com](https://www.backtrader.com)

5. ai4finance-foundation/FinGPT โญ 12,688
FinGPT: Open-Source Financial Large Language Models! Revolutionize ๐Ÿ”ฅ We release the trained model on HuggingFace.
๐Ÿ”— [ai4finance.org](https://ai4finance.org)

6. ranaroussi/yfinance โญ 12,523
Download market data from Yahoo! Finance's API
๐Ÿ”— [aroussi.com/post/python-yahoo-finance](https://aroussi.com/post/python-yahoo-finance)

7. ai4finance-foundation/FinRL โญ 9,443
FinRL: Financial Reinforcement Learning. ๐Ÿ”ฅ
๐Ÿ”— [ai4finance.org](https://ai4finance.org)

8. ta-lib/ta-lib-python โญ 9,269
Python wrapper for TA-Lib (http://ta-lib.org/).
๐Ÿ”— [ta-lib.github.io/ta-lib-python](http://ta-lib.github.io/ta-lib-python)

9. quantconnect/Lean โญ 9,018
Lean Algorithmic Trading Engine by QuantConnect (Python, C#)
๐Ÿ”— [lean.io](https://lean.io)

10. goldmansachs/gs-quant โญ 6,633
Python toolkit for quantitative finance
๐Ÿ”— [developer.gs.com/discover/products/gs-quant](https://developer.gs.com/discover/products/gs-quant/)

11. quantopian/pyfolio โญ 5,540
Portfolio and risk analytics in Python
๐Ÿ”— [quantopian.github.io/pyfolio](https://quantopian.github.io/pyfolio)

12. kernc/backtesting.py โญ 5,130
:mag_right: :chart_with_upwards_trend: ๐Ÿ :moneybag: Backtest trading strategies in Python.
๐Ÿ”— [kernc.github.io/backtesting.py](https://kernc.github.io/backtesting.py/)

13. twopirllc/pandas-ta โญ 5,016
Technical Analysis Indicators - Pandas TA is an easy to use Python 3 Pandas Extension with 150+ Indicators
๐Ÿ”— [twopirllc.github.io/pandas-ta](https://twopirllc.github.io/pandas-ta/)

14. ranaroussi/quantstats โญ 4,573
Portfolio analytics for quants, written in Python

15. google/tf-quant-finance โญ 4,380
High-performance TensorFlow library for quantitative finance.

16. gbeced/pyalgotrade โญ 4,343
Python Algorithmic Trading Library
๐Ÿ”— [gbeced.github.io/pyalgotrade](http://gbeced.github.io/pyalgotrade/)

17. borisbanushev/stockpredictionai โญ 4,013
In this noteboook I will create a complete process for predicting stock price movements. Follow along and we will achieve some pretty good results. For that purpose we will use a Generative Adversarial Network (GAN) with LSTM, a type of Recurrent Neural Network, as generator, and a Convolutional Neural Networ...

18. polakowo/vectorbt โญ 3,988
Find your trading edge, using the fastest engine for backtesting, algorithmic trading, and research.
๐Ÿ”— [vectorbt.dev](https://vectorbt.dev)

19. matplotlib/mplfinance โญ 3,499
Financial Markets Data Visualization using Matplotlib
๐Ÿ”— [pypi.org/project/mplfinance](https://pypi.org/project/mplfinance/)

20. cuemacro/finmarketpy โญ 3,398
Python library for backtesting trading strategies & analyzing financial markets (formerly pythalesians)
๐Ÿ”— [www.cuemacro.com](http://www.cuemacro.com)

21. quantopian/alphalens โญ 3,187
Performance analysis of predictive (alpha) stock factors
๐Ÿ”— [quantopian.github.io/alphalens](http://quantopian.github.io/alphalens)

22. zvtvz/zvt โญ 3,080
modular quant framework.
๐Ÿ”— [zvt.readthedocs.io/en/latest](https://zvt.readthedocs.io/en/latest/)

23. robcarver17/pysystemtrade โญ 2,500
Systematic Trading in python

24. quantopian/research_public โญ 2,364
Quantitative research and educational materials
๐Ÿ”— [www.quantopian.com/lectures](https://www.quantopian.com/lectures)

25. pmorissette/bt โญ 2,114
bt - flexible backtesting for Python
๐Ÿ”— [pmorissette.github.io/bt](http://pmorissette.github.io/bt)

26. blankly-finance/blankly โญ 2,039
๐Ÿš€ ๐Ÿ’ธ Easily build, backtest and deploy your algo in just a few lines of code. Trade stocks, cryptos, and forex across exchanges w/ one package.
๐Ÿ”— [package.blankly.finance](https://package.blankly.finance)

27. domokane/FinancePy โญ 2,009
A Python Finance Library that focuses on the pricing and risk-management of Financial Derivatives, including fixed-income, equity, FX and credit derivatives.

28. pmorissette/ffn โญ 1,858
ffn - a financial function library for Python
๐Ÿ”— [pmorissette.github.io/ffn](pmorissette.github.io/ffn)

29. cuemacro/findatapy โญ 1,623
Python library to download market data via Bloomberg, Eikon, Quandl, Yahoo etc.

30. quantopian/empyrical โญ 1,252
Common financial risk and performance metrics. Used by zipline and pyfolio.
๐Ÿ”— [quantopian.github.io/empyrical](https://quantopian.github.io/empyrical)

31. idanya/algo-trader โญ 761
Trading bot with support for realtime trading, backtesting, custom strategies and much more.

## Game Development

Game development tools, engines and libraries.

1. kitao/pyxel โญ 14,159
A retro game engine for Python

2. pygame/pygame โญ 7,177
๐Ÿ๐ŸŽฎ pygame (the library) is a Free and Open Source python programming language library for making multimedia applications like games built on top of the excellent SDL library. C, Python, Native, OpenGL.
๐Ÿ”— [www.pygame.org](https://www.pygame.org)

3. panda3d/panda3d โญ 4,373
Powerful, mature open-source cross-platform game engine for Python and C++, developed by Disney and CMU
๐Ÿ”— [www.panda3d.org](https://www.panda3d.org/)

4. pokepetter/ursina โญ 2,138
A game engine powered by python and panda3d.
๐Ÿ”— [pokepetter.github.io/ursina](https://pokepetter.github.io/ursina/)

5. pyglet/pyglet โญ 1,814
pyglet is a cross-platform windowing and multimedia library for Python, for developing games and other visually rich applications.
๐Ÿ”— [pyglet.org](http://pyglet.org)

6. pythonarcade/arcade โญ 1,663
Easy to use Python library for creating 2D arcade games.
๐Ÿ”— [arcade.academy](http://arcade.academy)

## GIS

Geospatial libraries: raster and vector data formats, interactive mapping and visualisation, computing frameworks for processing images, projections.

1. domlysz/BlenderGIS โญ 7,462
Blender addons to make the bridge between Blender and geographic data

2. python-visualization/folium โญ 6,778
Python Data. Leaflet.js Maps.
๐Ÿ”— [python-visualization.github.io/folium](https://python-visualization.github.io/folium/)

3. gboeing/osmnx โญ 4,759
OSMnx is a Python package to easily download, model, analyze, and visualize street networks and other geospatial features from OpenStreetMap.
๐Ÿ”— [osmnx.readthedocs.io](https://osmnx.readthedocs.io)

4. osgeo/gdal โญ 4,664
GDAL is an open source MIT licensed translator library for raster and vector geospatial data formats.
๐Ÿ”— [gdal.org](https://gdal.org)

5. geopandas/geopandas โญ 4,363
Python tools for geographic data
๐Ÿ”— [geopandas.org](http://geopandas.org/)

6. shapely/shapely โญ 3,760
Manipulation and analysis of geometric objects
๐Ÿ”— [shapely.readthedocs.io/en/stable](https://shapely.readthedocs.io/en/stable/)

7. giswqs/geemap โญ 3,323
A Python package for interactive geospatial analysis and visualization with Google Earth Engine.
๐Ÿ”— [geemap.org](https://geemap.org)

8. holoviz/datashader โญ 3,257
Quickly and accurately render even the largest data.
๐Ÿ”— [datashader.org](http://datashader.org)

9. opengeos/leafmap โญ 3,084
A Python package for interactive mapping and geospatial analysis with minimal coding in a Jupyter environment
๐Ÿ”— [leafmap.org](https://leafmap.org)

10. opengeos/segment-geospatial โญ 2,769
A Python package for segmenting geospatial data with the Segment Anything Model (SAM)
๐Ÿ”— [samgeo.gishub.org](https://samgeo.gishub.org)

11. google/earthengine-api โญ 2,593
Python and JavaScript bindings for calling the Earth Engine API.

12. microsoft/torchgeo โญ 2,349
TorchGeo: datasets, samplers, transforms, and pre-trained models for geospatial data
๐Ÿ”— [www.osgeo.org/projects/torchgeo](https://www.osgeo.org/projects/torchgeo/)

13. rasterio/rasterio โญ 2,180
Rasterio reads and writes geospatial raster datasets
๐Ÿ”— [rasterio.readthedocs.io](https://rasterio.readthedocs.io/)

14. mcordts/cityscapesScripts โญ 2,119
README and scripts for the Cityscapes Dataset

15. azavea/raster-vision โญ 2,029
An open source library and framework for deep learning on satellite and aerial imagery.
๐Ÿ”— [docs.rastervision.io](https://docs.rastervision.io)

16. apache/sedona โญ 1,823
A cluster computing framework for processing large-scale geospatial data
๐Ÿ”— [sedona.apache.org](https://sedona.apache.org/)

17. plant99/felicette โญ 1,818
Satellite imagery for dummies.

18. gboeing/osmnx-examples โญ 1,509
Gallery of OSMnx tutorials, usage examples, and feature demonstations.
๐Ÿ”— [osmnx.readthedocs.io](https://osmnx.readthedocs.io)

19. jupyter-widgets/ipyleaflet โญ 1,469
A Jupyter - Leaflet.js bridge
๐Ÿ”— [ipyleaflet.readthedocs.io](https://ipyleaflet.readthedocs.io)

20. microsoft/GlobalMLBuildingFootprints โญ 1,331
Worldwide building footprints derived from satellite imagery

21. pysal/pysal โญ 1,294
PySAL: Python Spatial Analysis Library Meta-Package
๐Ÿ”— [pysal.org/pysal](http://pysal.org/pysal)

22. anitagraser/movingpandas โญ 1,178
Movement trajectory classes and functions built on top of GeoPandas
๐Ÿ”— [movingpandas.org](http://movingpandas.org)

23. residentmario/geoplot โญ 1,139
High-level geospatial data visualization library for Python.
๐Ÿ”— [residentmario.github.io/geoplot/index.html](https://residentmario.github.io/geoplot/index.html)

24. sentinel-hub/eo-learn โญ 1,098
Earth observation processing framework for machine learning in Python
๐Ÿ”— [eo-learn.readthedocs.io/en/latest](https://eo-learn.readthedocs.io/en/latest/)

25. opengeos/streamlit-geospatial โญ 829
A multi-page streamlit app for geospatial
๐Ÿ”— [huggingface.co/spaces/giswqs/streamlit](https://huggingface.co/spaces/giswqs/Streamlit)

26. makepath/xarray-spatial โญ 805
Raster-based Spatial Analytics for Python
๐Ÿ”— [xarray-spatial.readthedocs.io](https://xarray-spatial.readthedocs.io/)

27. osgeo/grass โญ 800
GRASS GIS - free and open-source geospatial processing engine
๐Ÿ”— [grass.osgeo.org](https://grass.osgeo.org)

28. developmentseed/titiler โญ 724
Build your own Raster dynamic map tile services
๐Ÿ”— [developmentseed.org/titiler](https://developmentseed.org/titiler/)

29. scikit-mobility/scikit-mobility โญ 701
scikit-mobility: mobility analysis in Python
๐Ÿ”— [scikit-mobility.github.io/scikit-mobility](https://scikit-mobility.github.io/scikit-mobility/)

## Graph

Graphs and network libraries: network analysis, graph machine learning, visualisation.

1. networkx/networkx โญ 14,496
Network Analysis in Python
๐Ÿ”— [networkx.org](https://networkx.org)

2. stellargraph/stellargraph โญ 2,924
StellarGraph - Machine Learning on Graphs
๐Ÿ”— [stellargraph.readthedocs.io](https://stellargraph.readthedocs.io/)

3. westhealth/pyvis โญ 949
Python package for creating and visualizing interactive network graphs.
๐Ÿ”— [pyvis.readthedocs.io/en/latest](http://pyvis.readthedocs.io/en/latest/)

4. microsoft/graspologic โญ 680
graspologic is a package for graph statistical algorithms
๐Ÿ”— [graspologic-org.github.io/graspologic](https://graspologic-org.github.io/graspologic/)

5. rampasek/GraphGPS โญ 621
Recipe for a General, Powerful, Scalable Graph Transformer

6. dylanhogg/llmgraph โญ 233
Create knowledge graphs with LLMs

## GUI

Graphical user interface libraries and toolkits.

1. pysimplegui/PySimpleGUI โญ 13,264
Python GUIs for Humans! PySimpleGUI is the top-rated Python application development environment. Launched in 2018 and actively developed, maintained, and supported in 2024. Transforms tkinter, Qt, WxPython, and Remi into a simple, intuitive, and fun experience for both hobbyists and expert users.
๐Ÿ”— [www.pysimplegui.com](https://www.PySimpleGUI.com)

2. hoffstadt/DearPyGui โญ 12,683
Dear PyGui: A fast and powerful Graphical User Interface Toolkit for Python with minimal dependencies
๐Ÿ”— [dearpygui.readthedocs.io/en/latest](https://dearpygui.readthedocs.io/en/latest/)

3. parthjadhav/Tkinter-Designer โญ 8,667
An easy and fast way to create a Python GUI ๐Ÿ

4. samuelcolvin/FastUI โญ 7,861
FastUI is a new way to build web application user interfaces defined by declarative Python code.
๐Ÿ”— [fastui-demo.onrender.com](https://fastui-demo.onrender.com)

5. r0x0r/pywebview โญ 4,514
Build GUI for your Python program with JavaScript, HTML, and CSS
๐Ÿ”— [pywebview.flowrl.com](https://pywebview.flowrl.com)

6. beeware/toga โญ 4,215
A Python native, OS native GUI toolkit.
๐Ÿ”— [toga.readthedocs.io/en/latest](https://toga.readthedocs.io/en/latest/)

7. dddomodossola/remi โญ 3,481
Python REMote Interface library. Platform independent. In about 100 Kbytes, perfect for your diet.

8. wxwidgets/Phoenix โญ 2,252
wxPython's Project Phoenix. A new implementation of wxPython, better, stronger, faster than he was before.
๐Ÿ”— [wxpython.org](http://wxpython.org/)

## Jupyter

Jupyter and JupyterLab and Notebook tools, libraries and plugins.

1. jupyterlab/jupyterlab โญ 13,972
JupyterLab computational environment.
๐Ÿ”— [jupyterlab.readthedocs.io](https://jupyterlab.readthedocs.io/)

2. jupyter/notebook โญ 11,400
Jupyter Interactive Notebook
๐Ÿ”— [jupyter-notebook.readthedocs.io](https://jupyter-notebook.readthedocs.io/)

3. mwouts/jupytext โญ 6,520
Jupyter Notebooks as Markdown Documents, Julia, Python or R scripts
๐Ÿ”— [jupytext.readthedocs.io](https://jupytext.readthedocs.io)

4. nteract/papermill โญ 5,725
๐Ÿ“š Parameterize, execute, and analyze notebooks
๐Ÿ”— [papermill.readthedocs.io/en/latest](http://papermill.readthedocs.io/en/latest/)

5. connorferster/handcalcs โญ 5,381
Python library for converting Python calculations into rendered latex.

6. voila-dashboards/voila โญ 5,330
Voilร  turns Jupyter notebooks into standalone web applications
๐Ÿ”— [voila.readthedocs.io](https://voila.readthedocs.io)

7. executablebooks/jupyter-book โญ 3,777
Create beautiful, publication-quality books and documents from computational content.
๐Ÿ”— [jupyterbook.org](http://jupyterbook.org)

8. jupyterlite/jupyterlite โญ 3,754
Wasm powered Jupyter running in the browser ๐Ÿ’ก
๐Ÿ”— [jupyterlite.rtfd.io/en/stable/try/lab](https://jupyterlite.rtfd.io/en/stable/try/lab)

9. jupyterlab/jupyterlab-desktop โญ 3,550
JupyterLab desktop application, based on Electron.

10. jupyter-widgets/ipywidgets โญ 3,102
Interactive Widgets for the Jupyter Notebook
๐Ÿ”— [ipywidgets.readthedocs.io](https://ipywidgets.readthedocs.io)

11. quantopian/qgrid โญ 3,033
An interactive grid for sorting, filtering, and editing DataFrames in Jupyter notebooks

12. jupyterlab/jupyter-ai โญ 3,017
A generative AI extension for JupyterLab
๐Ÿ”— [jupyter-ai.readthedocs.io](https://jupyter-ai.readthedocs.io/)

13. jupyter/nbdime โญ 2,630
Tools for diffing and merging of Jupyter notebooks.
๐Ÿ”— [nbdime.readthedocs.io](http://nbdime.readthedocs.io)

14. mito-ds/mito โญ 2,254
The mitosheet package, trymito.io, and other public Mito code.
๐Ÿ”— [trymito.io](https://trymito.io)

15. jupyter/nbviewer โญ 2,189
nbconvert as a web service: Render Jupyter Notebooks as static web pages
๐Ÿ”— [nbviewer.jupyter.org](https://nbviewer.jupyter.org)

16. maartenbreddels/ipyvolume โญ 1,928
3d plotting for Python in the Jupyter notebook based on IPython widgets using WebGL

17. jupyter-lsp/jupyterlab-lsp โญ 1,768
Coding assistance for JupyterLab (code navigation + hover suggestions + linters + autocompletion + rename) using Language Server Protocol
๐Ÿ”— [jupyterlab-lsp.readthedocs.io](https://jupyterlab-lsp.readthedocs.io)

18. jupyter/nbconvert โญ 1,694
Jupyter Notebook Conversion
๐Ÿ”— [nbconvert.readthedocs.io](https://nbconvert.readthedocs.io/)

19. nbqa-dev/nbQA โญ 997
Run ruff, isort, pyupgrade, mypy, pylint, flake8, and more on Jupyter Notebooks
๐Ÿ”— [nbqa.readthedocs.io/en/latest/index.html](https://nbqa.readthedocs.io/en/latest/index.html)

20. vizzuhq/ipyvizzu โญ 933
Build animated charts in Jupyter Notebook and similar environments with a simple Python syntax.
๐Ÿ”— [ipyvizzu.vizzuhq.com](https://ipyvizzu.vizzuhq.com)

21. koaning/drawdata โญ 743
Draw datasets from within Jupyter.
๐Ÿ”— [calmcode.io/labs/drawdata.html](https://calmcode.io/labs/drawdata.html)

22. aws/graph-notebook โญ 705
Library extending Jupyter notebooks to integrate with Apache TinkerPop, openCypher, and RDF SPARQL.
๐Ÿ”— [github.com/aws/graph-notebook](https://github.com/aws/graph-notebook)

23. linealabs/lineapy โญ 663
Move fast from data science prototype to pipeline. Capture, analyze, and transform messy notebooks into data pipelines with just two lines of code.
๐Ÿ”— [lineapy.org](https://lineapy.org)

24. xiaohk/stickyland โญ 506
Break the linear presentation of Jupyter Notebooks with sticky cells!
๐Ÿ”— [xiaohk.github.io/stickyland](https://xiaohk.github.io/stickyland/)

## LLMs and ChatGPT

Large language model and GPT libraries and frameworks: auto-gpt, agents, QnA, chain-of-thought workflows, API integations. Also see the Natural Language Processing category for crossover.

1. significant-gravitas/AutoGPT โญ 164,660
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
๐Ÿ”— [agpt.co](https://agpt.co)

2. hwchase17/langchain โญ 89,354
๐Ÿฆœ๐Ÿ”— Build context-aware reasoning applications
๐Ÿ”— [python.langchain.com](https://python.langchain.com)

3. nomic-ai/gpt4all โญ 67,509
GPT4All: Chat with Local LLMs on Any Device
๐Ÿ”— [gpt4all.io](https://gpt4all.io)

4. ggerganov/llama.cpp โญ 61,825
LLM inference in C/C++

5. xtekky/gpt4free โญ 59,326
The official gpt4free repository | various collection of powerful language models
๐Ÿ”— [g4f.ai](https://g4f.ai)

6. facebookresearch/llama โญ 54,268
Inference code for Llama models

7. imartinez/private-gpt โญ 53,019
Interact with your documents using the power of GPT, 100% privately, no data leaks
๐Ÿ”— [docs.privategpt.dev](https://docs.privategpt.dev)

8. gpt-engineer-org/gpt-engineer โญ 51,432
Specify what you want it to build, the AI asks for clarification, and then builds it.

9. killianlucas/open-interpreter โญ 50,936
A natural language interface for computers
๐Ÿ”— [openinterpreter.com](http://openinterpreter.com/)

10. xai-org/grok-1 โญ 49,181
This repository contains JAX example code for loading and running the Grok-1 open-weights model.

11. geekan/MetaGPT โญ 41,739
๐ŸŒŸ The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
๐Ÿ”— [deepwisdom.ai](https://deepwisdom.ai/)

12. thudm/ChatGLM-6B โญ 40,054
ChatGLM-6B: An Open Bilingual Dialogue Language Model | ๅผ€ๆบๅŒ่ฏญๅฏน่ฏ่ฏญ่จ€ๆจกๅž‹

13. oobabooga/text-generation-webui โญ 38,493
A Gradio web UI for Large Language Models.

14. hpcaitech/ColossalAI โญ 38,372
Making large AI models cheaper, faster and more accessible
๐Ÿ”— [www.colossalai.org](https://www.colossalai.org)

15. laion-ai/Open-Assistant โญ 36,852
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
๐Ÿ”— [open-assistant.io](https://open-assistant.io)

16. lm-sys/FastChat โญ 35,781
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

17. karpathy/nanoGPT โญ 34,889
The simplest, fastest repository for training/finetuning medium-sized GPTs.

18. moymix/TaskMatrix โญ 34,537
Connects ChatGPT and a series of Visual Foundation Models to enable sending and receiving images during chatting.

19. quivrhq/quivr โญ 34,227
Open-source RAG Framework for building GenAI Second Brains ๐Ÿง  Build productivity assistant (RAG) โšก๏ธ๐Ÿค– Chat with your docs (PDF, CSV, ...) & apps using Langchain, GPT 3.5 / 4 turbo, Private, Anthropic, VertexAI, Ollama, LLMs, Groq that you can share with users ! Efficient retrieval augmented generation framework
๐Ÿ”— [quivr.com](https://quivr.com)

20. jerryjliu/llama_index โญ 33,698
LlamaIndex is a data framework for your LLM applications
๐Ÿ”— [docs.llamaindex.ai](https://docs.llamaindex.ai)

21. tatsu-lab/stanford_alpaca โญ 29,171
Code and documentation to train Stanford's Alpaca models, and generate the data.
๐Ÿ”— [crfm.stanford.edu/2023/03/13/alpaca.html](https://crfm.stanford.edu/2023/03/13/alpaca.html)

22. pythagora-io/gpt-pilot โญ 29,144
The first real AI developer

23. microsoft/autogen โญ 28,540
A programming framework for agentic AI. Discord: https://aka.ms/autogen-dc. Roadmap: https://aka.ms/autogen-roadmap
๐Ÿ”— [microsoft.github.io/autogen](https://microsoft.github.io/autogen/)

24. hiyouga/LLaMA-Factory โญ 26,660
A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
๐Ÿ”— [arxiv.org/abs/2403.13372](https://arxiv.org/abs/2403.13372)

25. vision-cair/MiniGPT-4 โญ 25,163
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
๐Ÿ”— [minigpt-4.github.io](https://minigpt-4.github.io)

26. microsoft/JARVIS โญ 23,379
JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf

27. vllm-project/vllm โญ 23,132
A high-throughput and memory-efficient inference and serving engine for LLMs
๐Ÿ”— [docs.vllm.ai](https://docs.vllm.ai)

28. karpathy/llm.c โญ 22,036
LLM training in simple, pure C/CUDA. There is no need for 245MB of PyTorch or 107MB of cPython

29. openai/gpt-2 โญ 21,993
Code for the paper "Language Models are Unsupervised Multitask Learners"
๐Ÿ”— [openai.com/blog/better-language-models](https://openai.com/blog/better-language-models/)

30. logspace-ai/langflow โญ 21,498
โ›“๏ธ Langflow is a visual framework for building multi-agent and RAG applications. It's open-source, Python-powered, fully customizable, model and vector store agnostic.
๐Ÿ”— [www.langflow.org](http://www.langflow.org)

31. openai/chatgpt-retrieval-plugin โญ 20,974
The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.

32. microsoft/semantic-kernel โญ 20,584
Integrate cutting-edge LLM technology quickly and easily into your apps
๐Ÿ”— [aka.ms/semantic-kernel](https://aka.ms/semantic-kernel)

33. yoheinakajima/babyagi โญ 19,711
GPT-4 powered task-driven autonomous agent

34. karpathy/minGPT โญ 19,475
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

35. danielmiessler/fabric โญ 19,385
fabric is an open-source framework for augmenting humans using AI. It provides a modular framework for solving specific problems using a crowdsourced set of AI prompts that can be used anywhere.
๐Ÿ”— [danielmiessler.com/p/fabric-origin-story](https://danielmiessler.com/p/fabric-origin-story)

36. tloen/alpaca-lora โญ 18,418
Instruct-tune LLaMA on consumer hardware

37. rasahq/rasa โญ 18,332
๐Ÿ’ฌ Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants
๐Ÿ”— [rasa.com/docs/rasa](https://rasa.com/docs/rasa/)

38. guidance-ai/guidance โญ 18,237
A guidance language for controlling large language models.

39. haotian-liu/LLaVA โญ 18,222
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
๐Ÿ”— [llava.hliu.cc](https://llava.hliu.cc)

40. stitionai/devika โญ 17,960
Devika is an advanced AI software engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective.

41. mlc-ai/mlc-llm โญ 17,815
Universal LLM Deployment Engine with ML Compilation
๐Ÿ”— [llm.mlc.ai](https://llm.mlc.ai/)

42. joaomdmoura/crewAI โญ 17,373
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
๐Ÿ”— [crewai.com](https://crewai.com)

43. karpathy/llama2.c โญ 16,843
Inference Llama 2 in one file of pure C

44. thudm/ChatGLM2-6B โญ 15,634
ChatGLM2-6B: An Open Bilingual Chat LLM | ๅผ€ๆบๅŒ่ฏญๅฏน่ฏ่ฏญ่จ€ๆจกๅž‹

45. facebookresearch/codellama โญ 15,472
Inference code for CodeLlama models

46. huggingface/peft โญ 15,081
๐Ÿค— PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
๐Ÿ”— [huggingface.co/docs/peft](https://huggingface.co/docs/peft)

47. transformeroptimus/SuperAGI โญ 14,971
<โšก๏ธ> SuperAGI - A dev-first open source autonomous AI agent framework. Enabling developers to build, manage & run useful autonomous agents quickly and reliably.
๐Ÿ”— [superagi.com](https://superagi.com/)

48. mayooear/gpt4-pdf-chatbot-langchain โญ 14,758
GPT4 & LangChain Chatbot for large PDF docs
๐Ÿ”— [www.youtube.com/watch?v=ih9pbgvvoo4](https://www.youtube.com/watch?v=ih9PBGVVOO4)

49. deepset-ai/haystack โญ 14,680
:mag: LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conv...
๐Ÿ”— [haystack.deepset.ai](https://haystack.deepset.ai)

50. stanfordnlp/dspy โญ 14,544
DSPy: The framework for programmingโ€”not promptingโ€”foundation models
๐Ÿ”— [dspy-docs.vercel.app](https://dspy-docs.vercel.app/)

51. fauxpilot/fauxpilot โญ 14,448
FauxPilot - an open-source alternative to GitHub Copilot server

52. openai/evals โญ 14,397
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

53. idea-research/Grounded-Segment-Anything โญ 14,225
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
๐Ÿ”— [arxiv.org/abs/2401.14159](https://arxiv.org/abs/2401.14159)

54. assafelovic/gpt-researcher โญ 13,227
GPT based autonomous agent that does online comprehensive research on any given topic
๐Ÿ”— [gptr.dev](https://gptr.dev)

55. unslothai/unsloth โญ 12,771
Finetune Llama 3, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
๐Ÿ”— [unsloth.ai](https://unsloth.ai)

56. qwenlm/Qwen โญ 12,687
The official repo of Qwen (้€šไน‰ๅƒ้—ฎ) chat & pretrained large language model proposed by Alibaba Cloud.

57. dao-ailab/flash-attention โญ 12,431
Fast and memory-efficient exact attention

58. infiniflow/ragflow โญ 12,269
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
๐Ÿ”— [ragflow.io](https://ragflow.io)

59. blinkdl/RWKV-LM โญ 12,010
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.

60. openlmlab/MOSS โญ 11,888
An open-source tool-augmented conversational language model from Fudan University
๐Ÿ”— [txsun1997.github.io/blogs/moss.html](https://txsun1997.github.io/blogs/moss.html)

61. mlc-ai/web-llm โญ 11,817
High-performance In-browser LLM Inference Engine
๐Ÿ”— [webllm.mlc.ai](https://webllm.mlc.ai)

62. paddlepaddle/PaddleNLP โญ 11,753
๐Ÿ‘‘ Easy-to-use and powerful NLP and LLM library with ๐Ÿค— Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including ๐Ÿ—‚Text Classification, ๐Ÿ” Neural Search, โ“ Question Answering, โ„น๏ธ Information Extraction, ๐Ÿ“„ Document Intelligence, ๐Ÿ’Œ Sentiment Analysis etc.
๐Ÿ”— [paddlenlp.readthedocs.io](https://paddlenlp.readthedocs.io)

63. smol-ai/developer โญ 11,738
the first library to let you embed a developer agent in your own app!
๐Ÿ”— [twitter.com/smolmodels](https://twitter.com/SmolModels)

64. h2oai/h2ogpt โญ 11,021
Private chat with local GPT with document, images, video, etc. 100% private, Apache 2.0. Supports oLLaMa, Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://codellama.h2o.ai/
๐Ÿ”— [h2o.ai](http://h2o.ai)

65. shishirpatil/gorilla โญ 10,872
Enables LLMs to use tools by invoking APIs. Given a query, Gorilla comes up with the semantically and syntactically correct API.
๐Ÿ”— [gorilla.cs.berkeley.edu](https://gorilla.cs.berkeley.edu/)

66. databrickslabs/dolly โญ 10,804
Databricksโ€™ Dolly, a large language model trained on the Databricks Machine Learning Platform
๐Ÿ”— [www.databricks.com/blog/2023/03/24/hello-dolly-democratizing-magic-chatgpt-open-models.html](https://www.databricks.com/blog/2023/03/24/hello-dolly-democratizing-magic-chatgpt-open-models.html)

67. phidatahq/phidata โญ 10,651
Phidata is a toolkit for building AI Assistants using function calling.
๐Ÿ”— [docs.phidata.com](https://docs.phidata.com)

68. berriai/litellm โญ 10,561
Call all LLM APIs using the OpenAI format. Use Bedrock, Azure, OpenAI, Cohere, Anthropic, Ollama, Sagemaker, HuggingFace, Replicate (100+ LLMs)
๐Ÿ”— [docs.litellm.ai/docs](https://docs.litellm.ai/docs/)

69. facebookresearch/llama-recipes โญ 10,511
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta ...

70. microsoft/LoRA โญ 9,835
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
๐Ÿ”— [arxiv.org/abs/2106.09685](https://arxiv.org/abs/2106.09685)

71. vanna-ai/vanna โญ 9,827
๐Ÿค– Chat with your SQL database ๐Ÿ“Š. Accurate Text-to-SQL Generation via LLMs using RAG ๐Ÿ”„.
๐Ÿ”— [vanna.ai/docs](https://vanna.ai/docs/)

72. google-research/vision_transformer โญ 9,777
Vision Transformer and MLP-Mixer Architectures

73. artidoro/qlora โญ 9,722
QLoRA: Efficient Finetuning of Quantized LLMs
๐Ÿ”— [arxiv.org/abs/2305.14314](https://arxiv.org/abs/2305.14314)

74. embedchain/mem0 โญ 9,441
The memory layer for Personalized AI
๐Ÿ”— [docs.mem0.ai](https://docs.mem0.ai/)

75. nvidia/Megatron-LM โญ 9,434
Ongoing research training transformer models at scale
๐Ÿ”— [docs.nvidia.com/megatron-core/developer-guide/latest/user-guide/index.html#quick-start](https://docs.nvidia.com/megatron-core/developer-guide/latest/user-guide/index.html#quick-start)

76. blinkdl/ChatRWKV โญ 9,341
ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.

77. mistralai/mistral-inference โญ 9,253
Official inference library for Mistral models
๐Ÿ”— [mistral.ai](https://mistral.ai/)

78. lightning-ai/litgpt โญ 8,818
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
๐Ÿ”— [lightning.ai](https://lightning.ai)

79. lvwerra/trl โญ 8,799
Train transformer language models with reinforcement learning.
๐Ÿ”— [hf.co/docs/trl](http://hf.co/docs/trl)

80. microsoft/promptflow โญ 8,786
Build high-quality LLM apps - from prototyping, testing to production deployment and monitoring.
๐Ÿ”— [microsoft.github.io/promptflow](https://microsoft.github.io/promptflow/)

81. karpathy/minbpe โญ 8,758
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

82. mshumer/gpt-prompt-engineer โญ 8,242
Simply input a description of your task and some test cases, and the system will generate, test, and rank a multitude of prompts to find the ones that perform the best.

83. apple/ml-ferret โญ 8,216
Ferret: Refer and Ground Anything Anywhere at Any Granularity

84. eleutherai/gpt-neo โญ 8,175
An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.
๐Ÿ”— [www.eleuther.ai](https://www.eleuther.ai)

85. optimalscale/LMFlow โญ 8,131
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
๐Ÿ”— [optimalscale.github.io/lmflow](https://optimalscale.github.io/LMFlow/)

86. thudm/CodeGeeX โญ 7,989
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
๐Ÿ”— [codegeex.cn](https://codegeex.cn)

87. lianjiatech/BELLE โญ 7,735
BELLE: Be Everyone's Large Language model Engine๏ผˆๅผ€ๆบไธญๆ–‡ๅฏน่ฏๅคงๆจกๅž‹๏ผ‰

88. sjtu-ipads/PowerInfer โญ 7,664
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

89. thudm/GLM-130B โญ 7,646
GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)

90. plachtaa/VALL-E-X โญ 7,438
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io

91. jzhang38/TinyLlama โญ 7,360
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

92. sweepai/sweep โญ 7,318
Sweep: open-source AI-powered Software Developer for small features and bug fixes.
๐Ÿ”— [sweep.dev](https://sweep.dev)

93. openlm-research/open_llama โญ 7,284
OpenLLaMA: An Open Reproduction of LLaMA

94. abetlen/llama-cpp-python โญ 7,226
Simple Python bindings for @ggerganov's llama.cpp library.
๐Ÿ”— [llama-cpp-python.readthedocs.io](https://llama-cpp-python.readthedocs.io)

95. outlines-dev/outlines โญ 7,213
Structured Text Generation from LLMs
๐Ÿ”— [outlines-dev.github.io/outlines](https://outlines-dev.github.io/outlines/)

96. bigcode-project/starcoder โญ 7,206
Home of StarCoder: fine-tuning & inference!

97. vaibhavs10/insanely-fast-whisper โญ 7,002
An opinionated CLI to transcribe Audio files w/ Whisper on-device! Powered by ๐Ÿค— Transformers, Optimum & flash-attn

98. zilliztech/GPTCache โญ 6,917
Semantic cache for LLMs. Fully integrated with LangChain and llama_index.
๐Ÿ”— [gptcache.readthedocs.io](https://gptcache.readthedocs.io)

99. bhaskatripathi/pdfGPT โญ 6,840
PDF GPT allows you to chat with the contents of your PDF file by using GPT capabilities. The most effective open source solution to turn your pdf files in a chatbot!
๐Ÿ”— [huggingface.co/spaces/bhaskartripathi/pdfgpt_turbo](https://huggingface.co/spaces/bhaskartripathi/pdfGPT_Turbo)

100. apple/corenet โญ 6,754
CoreNet is a deep neural network toolkit that allows researchers and engineers to train standard and novel small and large-scale models for variety of tasks, including foundation models (e.g., CLIP and LLM), object classification, object detection, and semantic segmentation.

101. eleutherai/gpt-neox โญ 6,718
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
๐Ÿ”— [www.eleuther.ai](https://www.eleuther.ai/)

102. jxnl/instructor โญ 6,716
Instructor is a Python library that makes it a breeze to work with structured outputs from large language models (LLMs). Built on top of Pydantic, it provides a simple, transparent, and user-friendly API to manage validation, retries, and streaming responses.
๐Ÿ”— [python.useinstructor.com](https://python.useinstructor.com/)

103. mit-han-lab/streaming-llm โญ 6,373
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
๐Ÿ”— [arxiv.org/abs/2309.17453](https://arxiv.org/abs/2309.17453)

104. langchain-ai/opengpts โญ 6,325
An open source effort to create a similar experience to OpenAI's GPTs and Assistants API.

105. skypilot-org/skypilot โญ 6,293
SkyPilot: Run LLMs, AI, and Batch jobs on any cloud. Get maximum savings, highest GPU availability, and managed executionโ€”all with a simple interface.
๐Ÿ”— [skypilot.readthedocs.io](https://skypilot.readthedocs.io)

106. chainlit/chainlit โญ 6,216
Build Conversational AI in minutes โšก๏ธ
๐Ÿ”— [docs.chainlit.io](https://docs.chainlit.io)

107. nat/openplayground โญ 6,165
An LLM playground you can run on your laptop

108. run-llama/rags โญ 6,093
Build ChatGPT over your data, all with natural language

109. lightning-ai/lit-llama โญ 5,897
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.

110. eleutherai/lm-evaluation-harness โญ 5,879
A framework for few-shot evaluation of language models.
๐Ÿ”— [www.eleuther.ai](https://www.eleuther.ai)

111. internlm/InternLM โญ 5,840
Official release of InternLM2.5 7B base and chat models. 1M context support
๐Ÿ”— [internlm.intern-ai.org.cn](https://internlm.intern-ai.org.cn/)

112. explodinggradients/ragas โญ 5,817
Evaluation framework for your Retrieval Augmented Generation (RAG) pipelines
๐Ÿ”— [docs.ragas.io](https://docs.ragas.io)

113. skyvern-ai/skyvern โญ 5,477
Skyvern automates browser-based workflows using LLMs and computer vision. It provides a simple API endpoint to fully automate manual workflows, replacing brittle or unreliable automation solutions.
๐Ÿ”— [www.skyvern.com](https://www.skyvern.com)

114. minedojo/Voyager โญ 5,383
An Open-Ended Embodied Agent with Large Language Models
๐Ÿ”— [voyager.minedojo.org](https://voyager.minedojo.org/)

115. pytorch-labs/gpt-fast โญ 5,373
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

116. dsdanielpark/Bard-API โญ 5,351
The unofficial python package that returns response of Google Bard through cookie value.
๐Ÿ”— [pypi.org/project/bardapi](https://pypi.org/project/bardapi/)

117. microsoft/promptbase โญ 5,256
promptbase is an evolving collection of resources, best practices, and example scripts for eliciting the best performance from foundation models.

118. langchain-ai/chat-langchain โญ 4,996
Locally hosted chatbot specifically focused on question answering over the LangChain documentation
๐Ÿ”— [chat.langchain.com](https://chat.langchain.com)

119. weaviate/Verba โญ 4,987
Retrieval Augmented Generation (RAG) chatbot powered by Weaviate

120. mnotgod96/AppAgent โญ 4,655
AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.
๐Ÿ”— [appagent-official.github.io](https://appagent-official.github.io/)

121. openbmb/ToolBench โญ 4,618
[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.
๐Ÿ”— [openbmb.github.io/toolbench](https://openbmb.github.io/ToolBench/)

122. vikhyat/moondream โญ 4,558
A tiny open-source computer-vision language model designed to run efficiently on edge devices
๐Ÿ”— [moondream.ai](https://moondream.ai)

123. togethercomputer/RedPajama-Data โญ 4,461
The RedPajama-Data repository contains code for preparing large datasets for training large language models.

124. langchain-ai/langgraph โญ 4,452
LangGraph is a library for building stateful, multi-actor applications with LLMs, built on top of (and intended to be used with) LangChain.
๐Ÿ”— [langchain-ai.github.io/langgraph](https://langchain-ai.github.io/langgraph/)

125. microsoft/BioGPT โญ 4,265
Implementation of BioGPT: Generative Pre-trained Transformer for Biomedical Text Generation and Mining

126. llmware-ai/llmware โญ 4,256
Unified framework for building enterprise RAG pipelines with small, specialized models
๐Ÿ”— [llmware-ai.github.io/llmware](https://llmware-ai.github.io/llmware/)

127. microsoft/LLMLingua โญ 4,232
To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which achieves up to 20x compression with minimal performance loss.
๐Ÿ”— [llmlingua.com](https://llmlingua.com/)

128. allenai/OLMo โญ 4,225
OLMo is a repository for training and using AI2's state-of-the-art open language models. It is built by scientists, for scientists.
๐Ÿ”— [allenai.org/olmo](https://allenai.org/olmo)

129. kyegomez/tree-of-thoughts โญ 4,171
Plug in and Play Implementation of Tree of Thoughts: Deliberate Problem Solving with Large Language Models that Elevates Model Reasoning by atleast 70%
๐Ÿ”— [discord.gg/qutxnk2nmf](https://discord.gg/qUtxnK2NMf)

130. cg123/mergekit โญ 4,115
Tools for merging pretrained large language models.

131. instruction-tuning-with-gpt-4/GPT-4-LLM โญ 4,098
Instruction Tuning with GPT-4
๐Ÿ”— [instruction-tuning-with-gpt-4.github.io](https://instruction-tuning-with-gpt-4.github.io/)

132. 1rgs/jsonformer โญ 4,000
A Bulletproof Way to Generate Structured JSON from Language Models

133. yizhongw/self-instruct โญ 3,958
Aligning pretrained language models with instruction data generated by themselves.

134. mshumer/gpt-llm-trainer โญ 3,873
Input a description of your task, and the system will generate a dataset, parse it, and fine-tune a LLaMA 2 model for you

135. ravenscroftj/turbopilot โญ 3,835
Turbopilot is an open source large-language-model based code completion engine that runs locally on CPU

136. whitead/paper-qa โญ 3,794
LLM Chain for answering questions from documents with citations

137. nvidia/NeMo-Guardrails โญ 3,791
NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.

138. h2oai/h2o-llmstudio โญ 3,777
H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://h2oai.github.io/h2o-llmstudio/
๐Ÿ”— [gpt-gm.h2o.ai](https://gpt-gm.h2o.ai)

139. guardrails-ai/guardrails โญ 3,653
Open-source Python package for specifying structure and type, validating and correcting the outputs of large language models (LLMs)
๐Ÿ”— [www.guardrailsai.com/docs](https://www.guardrailsai.com/docs)

140. promptfoo/promptfoo โญ 3,634
Test your prompts, agents, and RAGs. Use LLM evals to improve your app's quality and catch problems. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and CI/CD integration.
๐Ÿ”— [www.promptfoo.dev](https://www.promptfoo.dev/)

141. mmabrouk/llm-workflow-engine โญ 3,624
Power CLI and Workflow manager for LLMs (core package)

142. luodian/Otter โญ 3,518
๐Ÿฆฆ Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
๐Ÿ”— [otter-ntu.github.io](https://otter-ntu.github.io/)

143. simonw/llm โญ 3,509
Access large language models from the command-line
๐Ÿ”— [llm.datasette.io](https://llm.datasette.io)

144. eth-sri/lmql โญ 3,503
A language for constraint-guided and efficient LLM programming.
๐Ÿ”— [lmql.ai](https://lmql.ai)

145. minimaxir/simpleaichat โญ 3,437
Python package for easily interfacing with chat apps, with robust features and minimal code complexity.

146. microsoft/LMOps โญ 3,429
General technology for enabling AI capabilities w/ LLMs and MLLMs
๐Ÿ”— [aka.ms/generalai](https://aka.ms/GeneralAI)

147. minimaxir/gpt-2-simple โญ 3,401
Python package to easily retrain OpenAI's GPT-2 text-generating model on new texts

148. pathwaycom/llm-app โญ 3,375
Dynamic RAG for enterprise. Ready to run with Docker,โšกin sync with Sharepoint, Google Drive, S3, Kafka, PostgreSQL, real-time data APIs, and more.
๐Ÿ”— [pathway.com/developers/templates](https://pathway.com/developers/templates/)

149. open-compass/opencompass โญ 3,314
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
๐Ÿ”— [opencompass.org.cn](https://opencompass.org.cn/)

150. turboderp/exllamav2 โญ 3,265
A fast inference library for running LLMs locally on modern consumer-class GPUs

151. deep-diver/LLM-As-Chatbot โญ 3,260
LLM as a Chatbot Service

152. defog-ai/sqlcoder โญ 3,134
SoTA LLM for converting natural language questions to SQL queries

153. jaymody/picoGPT โญ 3,133
An unnecessarily tiny implementation of GPT-2 in NumPy.

154. next-gpt/NExT-GPT โญ 3,084
Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model
๐Ÿ”— [next-gpt.github.io](https://next-gpt.github.io/)

155. agiresearch/AIOS โญ 3,035
AIOS, a Large Language Model (LLM) Agent operating system, embeds large language model into Operating Systems (OS) as the brain of the OS, enabling an operating system "with soul" -- an important step towards AGI.

156. truefoundry/cognita โญ 3,014
RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry
๐Ÿ”— [cognita.truefoundry.com](https://cognita.truefoundry.com)

157. iryna-kondr/scikit-llm โญ 3,002
Seamlessly integrate LLMs into scikit-learn.
๐Ÿ”— [beastbyte.ai](https://beastbyte.ai/)

158. baichuan-inc/Baichuan-13B โญ 2,976
A 13B large language model developed by Baichuan Intelligent Technology
๐Ÿ”— [huggingface.co/baichuan-inc/baichuan-13b-chat](https://huggingface.co/baichuan-inc/Baichuan-13B-Chat)

159. microsoft/torchscale โญ 2,974
Foundation Architecture for (M)LLMs
๐Ÿ”— [aka.ms/generalai](https://aka.ms/GeneralAI)

160. freedomintelligence/LLMZoo โญ 2,906
โšกLLM Zoo is a project that provides data, models, and evaluation benchmark for large language models.โšก

161. sgl-project/sglang โญ 2,863
SGLang is a structured generation language designed for large language models (LLMs). It makes your interaction with models faster and more controllable.

162. li-plus/chatglm.cpp โญ 2,840
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4

163. juncongmoo/pyllama โญ 2,802
LLaMA: Open and Efficient Foundation Language Models

164. paperswithcode/galai โญ 2,667
Model API for GALACTICA

165. alpha-vllm/LLaMA2-Accessory โญ 2,622
An Open-source Toolkit for LLM Development
๐Ÿ”— [llama2-accessory.readthedocs.io](https://llama2-accessory.readthedocs.io/)

166. hegelai/prompttools โญ 2,558
Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chroma, Weaviate, LanceDB).
๐Ÿ”— [prompttools.readthedocs.io](http://prompttools.readthedocs.io)

167. bclavie/RAGatouille โญ 2,507
Bridging the gap between state-of-the-art research and alchemical RAG pipeline practices.

168. mistralai/mistral-finetune โญ 2,489
A light-weight codebase that enables memory-efficient and performant finetuning of Mistral's models. It is based on LoRA.

169. cohere-ai/cohere-toolkit โญ 2,489
Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.

170. databricks/dbrx โญ 2,483
Code examples and resources for DBRX, a large language model developed by Databricks
๐Ÿ”— [www.databricks.com](https://www.databricks.com/)

171. pipecat-ai/pipecat โญ 2,391
Open Source framework for voice and multimodal conversational AI

172. ofa-sys/OFA โญ 2,373
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework

173. young-geng/EasyLM โญ 2,325
Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.

174. huggingface/text-embeddings-inference โญ 2,311
A blazing fast inference solution for text embeddings models
๐Ÿ”— [huggingface.co/docs/text-embeddings-inference/quick_tour](https://huggingface.co/docs/text-embeddings-inference/quick_tour)

175. civitai/sd_civitai_extension โญ 2,298
All of the Civitai models inside Automatic 1111 Stable Diffusion Web UI

176. noahshinn/reflexion โญ 2,169
[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning

177. cheshire-cat-ai/core โญ 2,167
Production ready AI agent framework
๐Ÿ”— [cheshirecat.ai](https://cheshirecat.ai)

178. openai/finetune-transformer-lm โญ 2,121
Code and model for the paper "Improving Language Understanding by Generative Pre-Training"
๐Ÿ”— [s3-us-west-2.amazonaws.com/openai-assets/research-covers/language-unsupervised/language_understanding_paper.pdf](https://s3-us-west-2.amazonaws.com/openai-assets/research-covers/language-unsupervised/language_understanding_paper.pdf)

179. intel/neural-compressor โญ 2,084
SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
๐Ÿ”— [intel.github.io/neural-compressor](https://intel.github.io/neural-compressor/)

180. tairov/llama2.mojo โญ 2,070
Inference Llama 2 in one file of pure ๐Ÿ”ฅ
๐Ÿ”— [www.modular.com/blog/community-spotlight-how-i-built-llama2-by-aydyn-tairov](https://www.modular.com/blog/community-spotlight-how-i-built-llama2-by-aydyn-tairov)

181. langroid/langroid โญ 2,020
Harness LLMs with Multi-Agent Programming
๐Ÿ”— [langroid.github.io/langroid](https://langroid.github.io/langroid/)

182. openai/image-gpt โญ 2,019
Archived. Code and models from the paper "Generative Pretraining from Pixels"

183. spcl/graph-of-thoughts โญ 1,990
Official Implementation of "Graph of Thoughts: Solving Elaborate Problems with Large Language Models"
๐Ÿ”— [arxiv.org/pdf/2308.09687.pdf](https://arxiv.org/pdf/2308.09687.pdf)

184. neulab/prompt2model โญ 1,929
prompt2model - Generate Deployable Models from Natural Language Instructions

185. predibase/lorax โญ 1,926
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
๐Ÿ”— [loraexchange.ai](https://loraexchange.ai)

186. lucidrains/toolformer-pytorch โญ 1,924
Implementation of Toolformer, Language Models That Can Use Tools, by MetaAI

187. openai/gpt-2-output-dataset โญ 1,918
Dataset of GPT-2 outputs for research in detection, biases, and more

188. truera/trulens โญ 1,872
Evaluation and Tracking for LLM Experiments
๐Ÿ”— [www.trulens.org](https://www.trulens.org/)

189. minimaxir/aitextgen โญ 1,832
A robust Python tool for text-based AI training and generation using GPT-2.
๐Ÿ”— [docs.aitextgen.io](https://docs.aitextgen.io)

190. ist-daslab/gptq โญ 1,796
Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".
๐Ÿ”— [arxiv.org/abs/2210.17323](https://arxiv.org/abs/2210.17323)

191. epfllm/meditron โญ 1,771
Meditron is a suite of open-source medical Large Language Models (LLMs).
๐Ÿ”— [huggingface.co/epfl-llm](https://huggingface.co/epfl-llm)

192. microsoft/Megatron-DeepSpeed โญ 1,751
Ongoing research training transformer language models at scale, including: BERT & GPT-2

193. openai/gpt-discord-bot โญ 1,742
Example Discord bot written in Python that uses the completions API to have conversations with the `text-davinci-003` model, and the moderations API to filter the messages.

194. akariasai/self-rag โญ 1,639
This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.
๐Ÿ”— [selfrag.github.io](https://selfrag.github.io/)

195. ray-project/llm-applications โญ 1,616
A comprehensive guide to building RAG-based LLM applications for production.

196. jina-ai/thinkgpt โญ 1,510
Agent techniques to augment your LLM and push it beyong its limits

197. casper-hansen/AutoAWQ โญ 1,457
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:
๐Ÿ”— [casper-hansen.github.io/autoawq](https://casper-hansen.github.io/AutoAWQ/)

198. cstankonrad/long_llama โญ 1,440
LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA and fine-tuned with the Focused Transformer (FoT) method.

199. farizrahman4u/loopgpt โญ 1,411
Re-implementation of Auto-GPT as a python package, written with modularity and extensibility in mind.

200. openai/simple-evals โญ 1,371
Lightweight library for evaluating language models

201. run-llama/llama-lab โญ 1,363
Llama Lab is a repo dedicated to building cutting-edge projects using LlamaIndex

202. explosion/spacy-transformers โญ 1,327
๐Ÿ›ธ Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy
๐Ÿ”— [spacy.io/usage/embeddings-transformers](https://spacy.io/usage/embeddings-transformers)

203. chatarena/chatarena โญ 1,299
ChatArena (or Chat Arena) is a Multi-Agent Language Game Environments for LLMs. The goal is to develop communication and collaboration capabilities of AIs.
๐Ÿ”— [www.chatarena.org](https://www.chatarena.org/)

204. agentops-ai/tokencost โญ 1,292
Easy token price estimates for 400+ LLMs. TokenOps.
๐Ÿ”— [agentops.ai](https://agentops.ai)

205. bigscience-workshop/Megatron-DeepSpeed โญ 1,283
Ongoing research training transformer language models at scale, including: BERT & GPT-2

206. meetkai/functionary โญ 1,234
Chat language model that can use tools and interpret the results

207. ray-project/ray-llm โญ 1,201
RayLLM - LLMs on Ray
๐Ÿ”— [aviary.anyscale.com](https://aviary.anyscale.com)

208. srush/MiniChain โญ 1,191
A tiny library for coding with large language models.
๐Ÿ”— [srush-minichain.hf.space](https://srush-minichain.hf.space/)

209. linksoul-ai/AutoAgents โญ 1,109
[IJCAI 2024] Generate different roles for GPTs to form a collaborative entity for complex tasks.
๐Ÿ”— [huggingface.co/spaces/linksoul/autoagents](https://huggingface.co/spaces/LinkSoul/AutoAgents)

210. ibm/Dromedary โญ 1,102
Dromedary: towards helpful, ethical and reliable LLMs.

211. evolvinglmms-lab/lmms-eval โญ 1,097
Accelerating the development of large multimodal models (LMMs) with lmms-eval
๐Ÿ”— [lmms-lab.github.io](https://lmms-lab.github.io/)

212. keirp/automatic_prompt_engineer โญ 1,058
Large Language Models Are Human-Level Prompt Engineers

213. hao-ai-lab/LookaheadDecoding โญ 1,047
Break the Sequential Dependency of LLM Inference Using Lookahead Decoding

214. lupantech/chameleon-llm โญ 1,039
Codes for "Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models".
๐Ÿ”— [chameleon-llm.github.io](https://chameleon-llm.github.io)

215. rlancemartin/auto-evaluator โญ 1,035
Evaluation tool for LLM QA chains
๐Ÿ”— [autoevaluator.langchain.com](https://autoevaluator.langchain.com/)

216. agenta-ai/agenta โญ 1,031
The all-in-one LLM developer platform: prompt management, evaluation, human feedback, and deployment all in one place.
๐Ÿ”— [www.agenta.ai](http://www.agenta.ai)

217. nomic-ai/pygpt4all โญ 1,027
Official supported Python bindings for llama.cpp + gpt4all
๐Ÿ”— [nomic-ai.github.io/pygpt4all](https://nomic-ai.github.io/pygpt4all/)

218. explosion/spacy-llm โญ 1,022
๐Ÿฆ™ Integrating LLMs into structured NLP pipelines
๐Ÿ”— [spacy.io/usage/large-language-models](https://spacy.io/usage/large-language-models)

219. ctlllll/LLM-ToolMaker โญ 1,011
Large Language Models as Tool Makers

220. microsoft/Llama-2-Onnx โญ 1,002
A Microsoft optimized version of the Llama 2 model, available from Meta

221. huggingface/nanotron โญ 972
Minimalistic large language model 3D-parallelism training

222. ajndkr/lanarky โญ 958
The web framework for building LLM microservices
๐Ÿ”— [lanarky.ajndkr.com](https://lanarky.ajndkr.com/)

223. likejazz/llama3.np โญ 933
llama3.np is a pure NumPy implementation for Llama 3 model.

224. pinecone-io/canopy โญ 928
Retrieval Augmented Generation (RAG) framework and context engine powered by Pinecone
๐Ÿ”— [www.pinecone.io](https://www.pinecone.io/)

225. cerebras/modelzoo โญ 870
Examples of common deep learning models that can be trained on Cerebras hardware

226. muennighoff/sgpt โญ 828
SGPT: GPT Sentence Embeddings for Semantic Search
๐Ÿ”— [arxiv.org/abs/2202.08904](https://arxiv.org/abs/2202.08904)

227. oliveirabruno01/babyagi-asi โญ 773
BabyAGI: an Autonomous and Self-Improving agent, or BASI

228. topoteretes/cognee โญ 765
Deterministic LLMs Outputs for AI Applications and AI Agents
๐Ÿ”— [www.cognee.ai](https://www.cognee.ai)

229. opengenerativeai/GenossGPT โญ 747
One API for all LLMs either Private or Public (Anthropic, Llama V2, GPT 3.5/4, Vertex, GPT4ALL, HuggingFace ...) ๐ŸŒˆ๐Ÿ‚ Replace OpenAI GPT with any LLMs in your app with one line.
๐Ÿ”— [genoss.ai](https://genoss.ai/)

230. datadreamer-dev/DataDreamer โญ 742
DataDreamer is a powerful open-source Python library for prompting, synthetic data generation, and training workflows. It is designed to be simple, extremely efficient, and research-grade.
๐Ÿ”— [datadreamer.dev](https://datadreamer.dev)

231. salesforce/xgen โญ 713
Salesforce open-source LLMs with 8k sequence length.

232. langchain-ai/langsmith-cookbook โญ 703
LangSmith is a platform for building production-grade LLM applications.
๐Ÿ”— [langsmith-cookbook.vercel.app](https://langsmith-cookbook.vercel.app)

233. prometheus-eval/prometheus-eval โญ 690
Evaluate your LLM's response with Prometheus and GPT4 ๐Ÿ’ฏ

234. opengvlab/OmniQuant โญ 626
[ICLR2024 spotlight] OmniQuant is a simple and powerful quantization technique for LLMs.

235. wandb/weave โญ 605
Weave is a toolkit for developing AI-powered applications, built by Weights & Biases.
๐Ÿ”— [wandb.me/weave](https://wandb.me/weave)

236. squeezeailab/SqueezeLLM โญ 603
[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization
๐Ÿ”— [arxiv.org/abs/2306.07629](https://arxiv.org/abs/2306.07629)

237. lupantech/ScienceQA โญ 573
Data and code for NeurIPS 2022 Paper "Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question Answering".

238. zhudotexe/kani โญ 542
kani (ใ‚ซใƒ‹) is a highly hackable microframework for chat-based language models with tool use/function calling. (NLP-OSS @ EMNLP 2023)
๐Ÿ”— [kani.readthedocs.io](https://kani.readthedocs.io)

239. hazyresearch/ama_prompting โญ 534
Ask Me Anything language model prompting

240. vahe1994/SpQR โญ 515
Quantization algorithm and the model evaluation code for SpQR method for LLM compression

241. continuum-llms/chatgpt-memory โญ 513
Allows to scale the ChatGPT API to multiple simultaneous sessions with infinite contextual and adaptive memory powered by GPT and Redis datastore.

242. hazyresearch/H3 โญ 505
Language Modeling with the H3 State Space Model

243. modal-labs/llm-finetuning โญ 490
Guide for fine-tuning Llama/Mistral/CodeLlama models and more

244. huggingface/lighteval โญ 481
LightEval is a lightweight LLM evaluation suite that Hugging Face has been using internally with the recently released LLM data processing library datatrove and LLM training library nanotron.

245. judahpaul16/gpt-home โญ 364
ChatGPT at home! Basically a better Google Nest Hub or Amazon Alexa home assistant. Built on the Raspberry Pi using the OpenAI API.
๐Ÿ”— [hub.docker.com/r/judahpaul/gpt-home](https://hub.docker.com/r/judahpaul/gpt-home)

246. stanford-oval/suql โญ 174
SUQL: Conversational Search over Structured and Unstructured Data with LLMs
๐Ÿ”— [arxiv.org/abs/2311.09818](https://arxiv.org/abs/2311.09818)

## Math and Science

Mathematical, numerical and scientific libraries.

1. numpy/numpy โญ 27,143
The fundamental package for scientific computing with Python.
๐Ÿ”— [numpy.org](https://numpy.org)

2. taichi-dev/taichi โญ 25,121
Productive, portable, and performant GPU programming in Python: Taichi Lang is an open-source, imperative, parallel programming language for high-performance numerical computation.
๐Ÿ”— [taichi-lang.org](https://taichi-lang.org)

3. scipy/scipy โญ 12,732
SciPy library main repository
๐Ÿ”— [scipy.org](https://scipy.org)

4. sympy/sympy โญ 12,615
A computer algebra system written in pure Python
๐Ÿ”— [sympy.org](https://sympy.org/)

5. google/or-tools โญ 10,807
Google Optimization Tools (a.k.a., OR-Tools) is an open-source, fast and portable software suite for solving combinatorial optimization problems.
๐Ÿ”— [developers.google.com/optimization](https://developers.google.com/optimization/)

6. z3prover/z3 โญ 9,992
Z3 is a theorem prover from Microsoft Research with a Python language binding.

7. cupy/cupy โญ 7,990
NumPy & SciPy for GPU
๐Ÿ”— [cupy.dev](https://cupy.dev)

8. google-deepmind/alphageometry โญ 3,829
Solving Olympiad Geometry without Human Demonstrations

9. mikedh/trimesh โญ 2,862
Python library for loading and using triangular meshes.
๐Ÿ”— [trimesh.org](https://trimesh.org)

10. mckinsey/causalnex โญ 2,194
A Python library that helps data scientists to infer causation rather than observing correlation.
๐Ÿ”— [causalnex.readthedocs.io](http://causalnex.readthedocs.io/)

11. pyomo/pyomo โญ 1,914
An object-oriented algebraic modeling language in Python for structured optimization problems.
๐Ÿ”— [www.pyomo.org](https://www.pyomo.org)

12. facebookresearch/theseus โญ 1,672
A library for differentiable nonlinear optimization

13. google-research/torchsde โญ 1,525
Differentiable SDE solvers with GPU support and efficient sensitivity analysis.

14. dynamicslab/pysindy โญ 1,367
A package for the sparse identification of nonlinear dynamical systems from data
๐Ÿ”— [pysindy.readthedocs.io/en/latest](https://pysindy.readthedocs.io/en/latest/)

15. geomstats/geomstats โญ 1,174
Computations and statistics on manifolds with geometric structures.
๐Ÿ”— [geomstats.ai](http://geomstats.ai)

16. cma-es/pycma โญ 1,056
pycma is a Python implementation of CMA-ES and a few related numerical optimization tools.

17. sj001/AI-Feynman โญ 610
Implementation of AI Feynman: a Physics-Inspired Method for Symbolic Regression

18. willianfuks/tfcausalimpact โญ 593
Python Causal Impact Implementation Based on Google's R Package. Built using TensorFlow Probability.

19. brandondube/prysm โญ 249
Prysm is an open-source library for physical and first-order modeling of optical systems and analysis of related data: numerical and physical optics, integrated modeling, phase retrieval, segmented systems, polynomials and fitting, sequential raytracing.
๐Ÿ”— [prysm.readthedocs.io/en/stable](https://prysm.readthedocs.io/en/stable/)

20. lean-dojo/ReProver โญ 189
Retrieval-Augmented Theorem Provers for Lean
๐Ÿ”— [leandojo.org](https://leandojo.org)

21. albahnsen/pycircular โญ 90
pycircular is a Python module for circular data analysis

22. gbillotey/Fractalshades โญ 27
Arbitrary-precision fractal explorer - Python package

## Machine Learning - General

General and classical machine learning libraries. See below for other sections covering specialised ML areas.

1. scikit-learn/scikit-learn โญ 58,933
scikit-learn: machine learning in Python
๐Ÿ”— [scikit-learn.org](https://scikit-learn.org)

2. openai/openai-cookbook โญ 57,704
Examples and guides for using the OpenAI API
๐Ÿ”— [cookbook.openai.com](https://cookbook.openai.com)

3. tencentarc/GFPGAN โญ 35,141
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.

4. suno-ai/bark โญ 33,862
๐Ÿ”Š Text-Prompted Generative Audio Model

5. google-research/google-research โญ 33,460
This repository contains code released by Google Research
๐Ÿ”— [research.google](https://research.google)

6. facebookresearch/faiss โญ 29,557
A library for efficient similarity search and clustering of dense vectors.
๐Ÿ”— [faiss.ai](https://faiss.ai)

7. google/jax โญ 29,213
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
๐Ÿ”— [jax.readthedocs.io](http://jax.readthedocs.io/)

8. open-mmlab/mmdetection โญ 28,627
OpenMMLab Detection Toolbox and Benchmark
๐Ÿ”— [mmdetection.readthedocs.io](https://mmdetection.readthedocs.io)

9. ageron/handson-ml2 โญ 27,355
A series of Jupyter notebooks that walk you through the fundamentals of Machine Learning and Deep Learning in Python using Scikit-Learn, Keras and TensorFlow 2.

10. lutzroeder/netron โญ 26,914
Visualizer for neural network, deep learning and machine learning models
๐Ÿ”— [netron.app](https://netron.app)

11. google/mediapipe โญ 26,259
Cross-platform, customizable ML solutions for live and streaming media.
๐Ÿ”— [mediapipe.dev](https://mediapipe.dev)

12. dmlc/xgboost โญ 25,820
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow
๐Ÿ”— [xgboost.readthedocs.io/en/stable](https://xgboost.readthedocs.io/en/stable/)

13. harisiqbal88/PlotNeuralNet โญ 21,592
Latex code for making neural networks diagrams

14. jina-ai/jina โญ 20,608
โ˜๏ธ Build multimodal AI applications with cloud-native stack
๐Ÿ”— [docs.jina.ai](https://docs.jina.ai)

15. roboflow/supervision โญ 17,956
We write your reusable computer vision tools. ๐Ÿ’œ
๐Ÿ”— [supervision.roboflow.com](https://supervision.roboflow.com)

16. onnx/onnx โญ 17,301
Open standard for machine learning interoperability
๐Ÿ”— [onnx.ai](https://onnx.ai/)

17. microsoft/LightGBM โญ 16,378
A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks.
๐Ÿ”— [lightgbm.readthedocs.io/en/latest](https://lightgbm.readthedocs.io/en/latest/)

18. ml-explore/mlx โญ 15,813
MLX is an array framework for machine learning on Apple silicon, brought to you by Apple machine learning research.
๐Ÿ”— [ml-explore.github.io/mlx](https://ml-explore.github.io/mlx/)

19. tensorflow/tensor2tensor โญ 15,169
Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.

20. ddbourgin/numpy-ml โญ 14,850
Machine learning, in numpy
๐Ÿ”— [numpy-ml.readthedocs.io](https://numpy-ml.readthedocs.io/)

21. aleju/imgaug โญ 14,278
Image augmentation for machine learning experiments.
๐Ÿ”— [imgaug.readthedocs.io](http://imgaug.readthedocs.io)

22. microsoft/nni โญ 13,903
An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.
๐Ÿ”— [nni.readthedocs.io](https://nni.readthedocs.io)

23. microsoft/onnxruntime โญ 13,615
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
๐Ÿ”— [onnxruntime.ai](https://onnxruntime.ai)

24. microsoft/Swin-Transformer โญ 13,353
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
๐Ÿ”— [arxiv.org/abs/2103.14030](https://arxiv.org/abs/2103.14030)

25. jindongwang/transferlearning โญ 13,103
Transfer learning / domain adaptation / domain generalization / multi-task learning etc. Papers, codes, datasets, applications, tutorials.-่ฟ็งปๅญฆไน 
๐Ÿ”— [transferlearning.xyz](http://transferlearning.xyz/)

26. deepmind/deepmind-research โญ 12,966
This repository contains implementations and illustrative code to accompany DeepMind publications

27. spotify/annoy โญ 12,940
Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk

28. neonbjb/tortoise-tts โญ 12,484
A multi-voice TTS system trained with an emphasis on quality

29. deepmind/alphafold โญ 12,139
Implementation of the inference pipeline of AlphaFold v2

30. ggerganov/ggml โญ 10,369
Tensor library for machine learning

31. facebookresearch/AnimatedDrawings โญ 10,329
Code to accompany "A Method for Animating Children's Drawings of the Human Figure"

32. optuna/optuna โญ 10,207
A hyperparameter optimization framework
๐Ÿ”— [optuna.org](https://optuna.org)

33. twitter/the-algorithm-ml โญ 9,971
Source code for Twitter's Recommendation Algorithm
๐Ÿ”— [blog.twitter.com/engineering/en_us/topics/open-source/2023/twitter-recommendation-algorithm](https://blog.twitter.com/engineering/en_us/topics/open-source/2023/twitter-recommendation-algorithm)

34. statsmodels/statsmodels โญ 9,758
Statsmodels: statistical modeling and econometrics in Python
๐Ÿ”— [www.statsmodels.org/devel](http://www.statsmodels.org/devel/)

35. epistasislab/tpot โญ 9,599
A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.
๐Ÿ”— [epistasislab.github.io/tpot](http://epistasislab.github.io/tpot/)

36. megvii-basedetection/YOLOX โญ 9,189
YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/

37. cleanlab/cleanlab โญ 9,142
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
๐Ÿ”— [cleanlab.ai](https://cleanlab.ai)

38. pycaret/pycaret โญ 8,675
An open-source, low-code machine learning library in Python
๐Ÿ”— [www.pycaret.org](https://www.pycaret.org)

39. wandb/wandb โญ 8,634
๐Ÿ”ฅ A tool for visualizing and tracking your machine learning experiments. This repo contains the CLI and Python API.
๐Ÿ”— [wandb.ai](https://wandb.ai)

40. pymc-devs/pymc โญ 8,500
Bayesian Modeling and Probabilistic Programming in Python
๐Ÿ”— [docs.pymc.io](https://docs.pymc.io/)

41. uberi/speech_recognition โญ 8,191
Speech recognition module for Python, supporting several engines and APIs, online and offline.
๐Ÿ”— [pypi.python.org/pypi/speechrecognition](https://pypi.python.org/pypi/SpeechRecognition/)

42. facebookresearch/xformers โญ 8,049
Hackable and optimized Transformers building blocks, supporting a composable construction.
๐Ÿ”— [facebookresearch.github.io/xformers](https://facebookresearch.github.io/xformers/)

43. catboost/catboost โญ 7,914
A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.
๐Ÿ”— [catboost.ai](https://catboost.ai)

44. open-mmlab/mmsegmentation โญ 7,740
OpenMMLab Semantic Segmentation Toolbox and Benchmark.
๐Ÿ”— [mmsegmentation.readthedocs.io/en/main](https://mmsegmentation.readthedocs.io/en/main/)

45. automl/auto-sklearn โญ 7,490
Automated Machine Learning with scikit-learn
๐Ÿ”— [automl.github.io/auto-sklearn](https://automl.github.io/auto-sklearn)

46. awslabs/autogluon โญ 7,449
Fast and Accurate ML in 3 Lines of Code
๐Ÿ”— [auto.gluon.ai](https://auto.gluon.ai/)

47. huggingface/accelerate โญ 7,393
๐Ÿš€ A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
๐Ÿ”— [huggingface.co/docs/accelerate](https://huggingface.co/docs/accelerate)

48. lmcinnes/umap โญ 7,232
Uniform Manifold Approximation and Projection

49. hyperopt/hyperopt โญ 7,155
Distributed Asynchronous Hyperparameter Optimization in Python
๐Ÿ”— [hyperopt.github.io/hyperopt](http://hyperopt.github.io/hyperopt)

50. featurelabs/featuretools โญ 7,140
An open source python library for automated feature engineering
๐Ÿ”— [www.featuretools.com](https://www.featuretools.com)

51. py-why/dowhy โญ 6,909
DoWhy is a Python library for causal inference that supports explicit modeling and testing of causal assumptions. DoWhy is based on a unified language for causal inference, combining causal graphical models and potential outcomes frameworks.
๐Ÿ”— [www.pywhy.org/dowhy](https://www.pywhy.org/dowhy)

52. hips/autograd โญ 6,875
Efficiently computes derivatives of numpy code.

53. scikit-learn-contrib/imbalanced-learn โญ 6,763
A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning
๐Ÿ”— [imbalanced-learn.org](https://imbalanced-learn.org)

54. open-mmlab/mmagic โญ 6,759
OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic ๐Ÿช„: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image generation, image/video restoration/enhancement, etc.
๐Ÿ”— [mmagic.readthedocs.io/en/latest](https://mmagic.readthedocs.io/en/latest/)

55. probml/pyprobml โญ 6,372
Python code for "Probabilistic Machine learning" book by Kevin Murphy

56. nicolashug/Surprise โญ 6,283
A Python scikit for building and analyzing recommender systems
๐Ÿ”— [surpriselib.com](http://surpriselib.com)

57. google/automl โญ 6,186
Google Brain AutoML

58. cleverhans-lab/cleverhans โญ 6,117
An adversarial example library for constructing attacks, building defenses, and benchmarking both

59. kevinmusgrave/pytorch-metric-learning โญ 5,876
The easiest way to use deep metric learning in your application. Modular, flexible, and extensible. Written in PyTorch.
๐Ÿ”— [kevinmusgrave.github.io/pytorch-metric-learning](https://kevinmusgrave.github.io/pytorch-metric-learning/)

60. open-mmlab/mmcv โญ 5,749
OpenMMLab Computer Vision Foundation
๐Ÿ”— [mmcv.readthedocs.io/en/latest](https://mmcv.readthedocs.io/en/latest/)

61. ml-explore/mlx-examples โญ 5,559
Examples in the MLX framework

62. project-monai/MONAI โญ 5,526
AI Toolkit for Healthcare Imaging
๐Ÿ”— [monai.io](https://monai.io/)

63. mdbloice/Augmentor โญ 5,044
Image augmentation library in Python for machine learning.
๐Ÿ”— [augmentor.readthedocs.io/en/stable](https://augmentor.readthedocs.io/en/stable)

64. uber/causalml โญ 4,895
Uplift modeling and causal inference with machine learning algorithms

65. online-ml/river โญ 4,893
๐ŸŒŠ Online machine learning in Python
๐Ÿ”— [riverml.xyz](https://riverml.xyz)

66. rasbt/mlxtend โญ 4,827
A library of extension and helper modules for Python's data analysis and machine learning libraries.
๐Ÿ”— [rasbt.github.io/mlxtend](https://rasbt.github.io/mlxtend/)

67. google-deepmind/graphcast โญ 4,401
GraphCast: Learning skillful medium-range global weather forecasting

68. lucidrains/deep-daze โญ 4,378
Simple command line tool for text to image generation using OpenAI's CLIP and Siren (Implicit neural representation network). Technique was originally created by https://twitter.com/advadnoun

69. marqo-ai/marqo โญ 4,346
Unified embedding generation and search engine. Also available on cloud - cloud.marqo.ai
๐Ÿ”— [www.marqo.ai](https://www.marqo.ai/)

70. skvark/opencv-python โญ 4,321
Automated CI toolchain to produce precompiled opencv-python, opencv-python-headless, opencv-contrib-python and opencv-contrib-python-headless packages.
๐Ÿ”— [pypi.org/project/opencv-python](https://pypi.org/project/opencv-python/)

71. sanchit-gandhi/whisper-jax โญ 4,261
JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.

72. districtdatalabs/yellowbrick โญ 4,250
Visual analysis and diagnostic tools to facilitate machine learning model selection.
๐Ÿ”— [www.scikit-yb.org](http://www.scikit-yb.org/)

73. google-gemini/cookbook โญ 4,241
A collection of guides and examples for the Gemini API, including quickstart tutorials for writing prompts.
๐Ÿ”— [ai.google.dev/gemini-api/docs](https://ai.google.dev/gemini-api/docs)

74. apple/coremltools โญ 4,239
Core ML tools contain supporting tools for Core ML model conversion, editing, and validation.
๐Ÿ”— [coremltools.readme.io](https://coremltools.readme.io)

75. nv-tlabs/GET3D โญ 4,165
Generative Model of High Quality 3D Textured Shapes Learned from Images

76. nmslib/hnswlib โญ 4,157
Header-only C++/python library for fast approximate nearest neighbors
๐Ÿ”— [github.com/nmslib/hnswlib](https://github.com/nmslib/hnswlib)

77. cmusphinx/pocketsphinx โญ 3,837
A small speech recognizer

78. microsoft/FLAML โญ 3,779
A fast library for AutoML and tuning. Join our Discord: https://discord.gg/Cppx2vSPVP.
๐Ÿ”— [microsoft.github.io/flaml](https://microsoft.github.io/FLAML/)

79. ourownstory/neural_prophet โญ 3,745
NeuralProphet: A simple forecasting package
๐Ÿ”— [neuralprophet.com](https://neuralprophet.com)

80. py-why/EconML โญ 3,668
ALICE (Automated Learning and Intelligence for Causation and Economics) is a Microsoft Research project aimed at applying Artificial Intelligence concepts to economic decision making. One of its goals is to build a toolkit that combines state-of-the-art machine learning techniques with econometrics in order to brin...
๐Ÿ”— [www.microsoft.com/en-us/research/project/alice](https://www.microsoft.com/en-us/research/project/alice/)

81. huggingface/autotrain-advanced โญ 3,649
AutoTrain Advanced: faster and easier training and deployments of state-of-the-art machine learning models
๐Ÿ”— [huggingface.co/autotrain](https://huggingface.co/autotrain)

82. thudm/CogVideo โญ 3,550
Text-to-video generation. The repo for ICLR2023 paper "CogVideo: Large-scale Pretraining for Text-to-Video Generation via Transformers"

83. huggingface/notebooks โญ 3,460
Notebooks using the Hugging Face libraries ๐Ÿค—

84. rucaibox/RecBole โญ 3,276
A unified, comprehensive and efficient recommendation library
๐Ÿ”— [recbole.io](https://recbole.io/)

85. yoheinakajima/instagraph โญ 3,248
Converts text input or URL into knowledge graph and displays

86. facebookresearch/vissl โญ 3,239
VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images.
๐Ÿ”— [vissl.ai](https://vissl.ai)

87. zjunlp/DeepKE โญ 3,230
[EMNLP 2022] An Open Toolkit for Knowledge Graph Extraction and Construction
๐Ÿ”— [deepke.zjukg.cn](http://deepke.zjukg.cn/)

88. pytorch/glow โญ 3,189
Compiler for Neural Network hardware accelerators

89. hrnet/HRNet-Semantic-Segmentation โญ 3,093
The OCR approach is rephrased as Segmentation Transformer: https://arxiv.org/abs/1909.11065. This is an official implementation of semantic segmentation for HRNet. https://arxiv.org/abs/1908.07919

90. lucidrains/musiclm-pytorch โญ 3,090
Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in Pytorch

91. mljar/mljar-supervised โญ 2,975
Python package for AutoML on Tabular Data with Feature Engineering, Hyper-Parameters Tuning, Explanations and Automatic Documentation
๐Ÿ”— [mljar.com](https://mljar.com)

92. lightly-ai/lightly โญ 2,847
A python library for self-supervised learning on images.
๐Ÿ”— [docs.lightly.ai/self-supervised-learning](https://docs.lightly.ai/self-supervised-learning/)

93. shankarpandala/lazypredict โญ 2,776
Lazy Predict help build a lot of basic models without much code and helps understand which models works better without any parameter tuning

94. scikit-optimize/scikit-optimize โญ 2,735
Sequential model-based optimization with a `scipy.optimize` interface
๐Ÿ”— [scikit-optimize.github.io](https://scikit-optimize.github.io)

95. scikit-learn-contrib/hdbscan โญ 2,731
A high performance implementation of HDBSCAN clustering.
๐Ÿ”— [hdbscan.readthedocs.io/en/latest](http://hdbscan.readthedocs.io/en/latest/)

96. huggingface/safetensors โญ 2,626
Implements a new simple format for storing tensors safely (as opposed to pickle) and that is still fast (zero-copy).
๐Ÿ”— [huggingface.co/docs/safetensors](https://huggingface.co/docs/safetensors)

97. google-research/t5x โญ 2,589
T5X is a modular, composable, research-friendly framework for high-performance, configurable, self-service training, evaluation, and inference of sequence models (starting with language) at many scales.

98. apple/ml-ane-transformers โญ 2,520
Reference implementation of the Transformer architecture optimized for Apple Neural Engine (ANE)

99. freedmand/semantra โญ 2,441
Semantra is a multipurpose tool for semantically searching documents. Query by meaning rather than just by matching text.

100. scikit-learn-contrib/category_encoders โญ 2,384
A library of sklearn compatible categorical variable encoders
๐Ÿ”— [contrib.scikit-learn.org/category_encoders](http://contrib.scikit-learn.org/category_encoders/)

101. huggingface/optimum โญ 2,326
๐Ÿš€ Accelerate training and inference of ๐Ÿค— Transformers and ๐Ÿค— Diffusers with easy to use hardware optimization tools
๐Ÿ”— [huggingface.co/docs/optimum/main](https://huggingface.co/docs/optimum/main/)

102. rom1504/clip-retrieval โญ 2,273
Easily compute clip embeddings and build a clip retrieval system with them
๐Ÿ”— [rom1504.github.io/clip-retrieval](https://rom1504.github.io/clip-retrieval/)

103. aws/sagemaker-python-sdk โญ 2,067
A library for training and deploying machine learning models on Amazon SageMaker
๐Ÿ”— [sagemaker.readthedocs.io](https://sagemaker.readthedocs.io/)

104. huggingface/evaluate โญ 1,903
๐Ÿค— Evaluate: A library for easily evaluating machine learning models and datasets.
๐Ÿ”— [huggingface.co/docs/evaluate](https://huggingface.co/docs/evaluate)

105. eric-mitchell/direct-preference-optimization โญ 1,876
Reference implementation for DPO (Direct Preference Optimization)

106. huggingface/huggingface_hub โญ 1,852
The official Python client for the Huggingface Hub.
๐Ÿ”— [huggingface.co/docs/huggingface_hub](https://huggingface.co/docs/huggingface_hub)

107. rentruewang/koila โญ 1,815
Prevent PyTorch's `CUDA error: out of memory` in just 1 line of code.
๐Ÿ”— [rentruewang.com/koila](https://rentruewang.com/koila/)

108. contextlab/hypertools โญ 1,810
A Python toolbox for gaining geometric insights into high-dimensional data
๐Ÿ”— [hypertools.readthedocs.io/en/latest](http://hypertools.readthedocs.io/en/latest/)

109. linkedin/greykite โญ 1,804
A flexible, intuitive and fast forecasting library

110. bmabey/pyLDAvis โญ 1,791
Python library for interactive topic model visualization. Port of the R LDAvis package.

111. scikit-learn-contrib/lightning โญ 1,714
Large-scale linear classification, regression and ranking in Python
๐Ÿ”— [contrib.scikit-learn.org/lightning](https://contrib.scikit-learn.org/lightning/)

112. tensorflow/addons โญ 1,686
Useful extra functionality for TensorFlow 2.x maintained by SIG-addons

113. microsoft/i-Code โญ 1,655
The ambition of the i-Code project is to build integrative and composable multimodal AI. The "i" stands for integrative multimodal learning.

114. castorini/pyserini โญ 1,565
Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.
๐Ÿ”— [pyserini.io](http://pyserini.io/)

115. googlecloudplatform/vertex-ai-samples โญ 1,538
Notebooks, code samples, sample apps, and other resources that demonstrate how to use, develop and manage machine learning and generative AI workflows using Google Cloud Vertex AI.
๐Ÿ”— [cloud.google.com/vertex-ai](https://cloud.google.com/vertex-ai)

116. visual-layer/fastdup โญ 1,534
fastdup is a powerful free tool designed to rapidly extract valuable insights from your image & video datasets. Assisting you to increase your dataset images & labels quality and reduce your data operations costs at an unparalleled scale.

117. laekov/fastmoe โญ 1,478
A fast MoE impl for PyTorch
๐Ÿ”— [fastmoe.ai](https://fastmoe.ai)

118. kubeflow/katib โญ 1,461
Automated Machine Learning on Kubernetes
๐Ÿ”— [www.kubeflow.org/docs/components/katib](https://www.kubeflow.org/docs/components/katib)

119. jina-ai/finetuner โญ 1,454
:dart: Task-oriented embedding tuning for BERT, CLIP, etc.
๐Ÿ”— [finetuner.jina.ai](https://finetuner.jina.ai)

120. microsoft/Olive โญ 1,368
Olive is an easy-to-use hardware-aware model optimization tool that composes industry-leading techniques across model compression, optimization, and compilation.
๐Ÿ”— [microsoft.github.io/olive](https://microsoft.github.io/Olive/)

121. csinva/imodels โญ 1,341
Interpretable ML package ๐Ÿ” for concise, transparent, and accurate predictive modeling (sklearn-compatible).
๐Ÿ”— [csinva.io/imodels](https://csinva.io/imodels)

122. borealisai/advertorch โญ 1,284
A Toolbox for Adversarial Robustness Research

123. microsoft/Semi-supervised-learning โญ 1,272
A Unified Semi-Supervised Learning Codebase (NeurIPS'22)
๐Ÿ”— [usb.readthedocs.io](https://usb.readthedocs.io)

124. patchy631/machine-learning โญ 1,256
Machine Learning Tutorials Repository

125. awslabs/dgl-ke โญ 1,250
High performance, easy-to-use, and scalable package for learning large-scale knowledge graph embeddings.
๐Ÿ”— [dglke.dgl.ai/doc](https://dglke.dgl.ai/doc/)

126. spotify/voyager โญ 1,227
๐Ÿ›ฐ๏ธ An approximate nearest-neighbor search library for Python and Java with a focus on ease of use, simplicity, and deployability.
๐Ÿ”— [spotify.github.io/voyager](https://spotify.github.io/voyager/)

127. koaning/scikit-lego โญ 1,216
Extra blocks for scikit-learn pipelines.
๐Ÿ”— [koaning.github.io/scikit-lego](https://koaning.github.io/scikit-lego/)

128. google/vizier โญ 1,195
Python-based research interface for blackbox and hyperparameter optimization, based on the internal Google Vizier Service.
๐Ÿ”— [oss-vizier.readthedocs.io](https://oss-vizier.readthedocs.io)

129. qdrant/fastembed โญ 1,155
Fast, Accurate, Lightweight Python library to make State of the Art Embedding
๐Ÿ”— [qdrant.github.io/fastembed](https://qdrant.github.io/fastembed/)

130. automl/TabPFN โญ 1,143
Official implementation of the TabPFN paper (https://arxiv.org/abs/2207.01848) and the tabpfn package.
๐Ÿ”— [priorlabs.ai](http://priorlabs.ai)

131. lightning-ai/lightning-thunder โญ 1,086
Make PyTorch models up to 40% faster! Thunder is a source to source compiler for PyTorch. It enables using different hardware executors at once; across one or thousands of GPUs.

132. google-research/deeplab2 โญ 992
DeepLab2 is a TensorFlow library for deep labeling, aiming to provide a unified and state-of-the-art TensorFlow codebase for dense pixel labeling tasks.

133. davidmrau/mixture-of-experts โญ 903
PyTorch Re-Implementation of "The Sparsely-Gated Mixture-of-Experts Layer" by Noam Shazeer et al. https://arxiv.org/abs/1701.06538

134. lmcinnes/pynndescent โญ 871
A Python nearest neighbor descent for approximate nearest neighbors

135. hazyresearch/safari โญ 851
Convolutions for Sequence Modeling

136. oml-team/open-metric-learning โญ 840
OML is a PyTorch-based framework to train and validate the models producing high-quality embeddings.
๐Ÿ”— [open-metric-learning.readthedocs.io/en/latest/index.html](https://open-metric-learning.readthedocs.io/en/latest/index.html)

137. opentensor/bittensor โญ 821
Internet-scale Neural Networks
๐Ÿ”— [www.bittensor.com](https://www.bittensor.com/)

138. nvidia/cuda-python โญ 816
CUDA Python Low-level Bindings
๐Ÿ”— [nvidia.github.io/cuda-python](https://nvidia.github.io/cuda-python/)

139. criteo/autofaiss โญ 777
Automatically create Faiss knn indices with the most optimal similarity search parameters.
๐Ÿ”— [criteo.github.io/autofaiss](https://criteo.github.io/autofaiss/)

140. replicate/replicate-python โญ 696
Python client for Replicate
๐Ÿ”— [replicate.com](https://replicate.com)

141. facebookresearch/balance โญ 677
The balance python package offers a simple workflow and methods for dealing with biased data samples when looking to infer from them to some target population of interest.
๐Ÿ”— [import-balance.org](https://import-balance.org)

142. awslabs/python-deequ โญ 676
Python API for Deequ, a library built on Spark for defining "unit tests for data", which measure data quality in large datasets

143. huggingface/optimum-quanto โญ 662
A pytorch quantization backend for optimum

144. nicolas-hbt/pygraft โญ 656
Configurable Generation of Synthetic Schemas and Knowledge Graphs at Your Fingertips
๐Ÿ”— [pygraft.readthedocs.io/en/latest](https://pygraft.readthedocs.io/en/latest/)

145. qdrant/quaterion โญ 630
Blazing fast framework for fine-tuning similarity learning models
๐Ÿ”— [quaterion.qdrant.tech](https://quaterion.qdrant.tech/)

146. hpcaitech/EnergonAI โญ 628
Large-scale model inference.

147. pymc-labs/pymc-marketing โญ 600
Bayesian marketing toolbox in PyMC. Media Mix (MMM), customer lifetime value (CLV), buy-till-you-die (BTYD) models and more.
๐Ÿ”— [www.pymc-marketing.io](https://www.pymc-marketing.io/)

148. googleapis/python-aiplatform โญ 588
A Python SDK for Vertex AI, a fully managed, end-to-end platform for data science and machine learning.

149. huggingface/exporters โญ 581
Export Hugging Face models to Core ML and TensorFlow Lite

150. microsoft/Focal-Transformer โญ 545
[NeurIPS 2021 Spotlight] Official code for "Focal Self-attention for Local-Global Interactions in Vision Transformers"

151. nevronai/MetisFL โญ 533
The first open Federated Learning framework implemented in C++ and Python.
๐Ÿ”— [metisfl.org](https://metisfl.org)

152. cvxgrp/pymde โญ 528
Minimum-distortion embedding with PyTorch
๐Ÿ”— [pymde.org](https://pymde.org)

153. mrdbourke/m1-machine-learning-test โญ 511
Code for testing various M1 Chip benchmarks with TensorFlow.

154. linkedin/FastTreeSHAP โญ 503
Fast SHAP value computation for interpreting tree-based models

155. lightning-ai/litdata โญ 261
Transform datasets at scale. Optimize datasets for fast AI model training.

156. dylanhogg/gptauthor โญ 53
GPTAuthor is an AI tool for writing long form, multi-chapter stories given a story prompt.

## Machine Learning - Deep Learning

Machine learning libraries that cross over with deep learning in some way.

1. tensorflow/tensorflow โญ 184,142
An Open Source Machine Learning Framework for Everyone
๐Ÿ”— [tensorflow.org](https://tensorflow.org)

2. pytorch/pytorch โญ 80,511
Tensors and Dynamic neural networks in Python with strong GPU acceleration
๐Ÿ”— [pytorch.org](https://pytorch.org)

3. openai/whisper โญ 64,741
Robust Speech Recognition via Large-Scale Weak Supervision

4. keras-team/keras โญ 61,337
Deep Learning for humans
๐Ÿ”— [keras.io](http://keras.io/)

5. deepfakes/faceswap โญ 49,916
Deepfakes Software For All
๐Ÿ”— [www.faceswap.dev](https://www.faceswap.dev)

6. iperov/DeepFaceLab โญ 46,336
DeepFaceLab is the leading software for creating deepfakes.

7. facebookresearch/segment-anything โญ 45,644
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

8. microsoft/DeepSpeed โญ 33,912
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
๐Ÿ”— [www.deepspeed.ai](https://www.deepspeed.ai/)

9. rwightman/pytorch-image-models โญ 30,815
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
๐Ÿ”— [huggingface.co/docs/timm](https://huggingface.co/docs/timm)

10. facebookresearch/detectron2 โญ 29,450
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
๐Ÿ”— [detectron2.readthedocs.io/en/latest](https://detectron2.readthedocs.io/en/latest/)

11. lightning-ai/pytorch-lightning โญ 27,545
The deep learning framework to pretrain, finetune and deploy AI models. PyTorch Lightning is just organized PyTorch - Lightning disentangles PyTorch code to decouple the science from the engineering.
๐Ÿ”— [lightning.ai](https://lightning.ai)

12. xinntao/Real-ESRGAN โญ 27,072
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.

13. facebookresearch/Detectron โญ 26,210
FAIR's research platform for object detection research, implementing popular algorithms like Mask R-CNN and RetinaNet.

14. matterport/Mask_RCNN โญ 24,414
Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow

15. openai/CLIP โญ 23,701
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

16. paddlepaddle/Paddle โญ 21,904
PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice ๏ผˆใ€Ž้ฃžๆกจใ€ๆ ธๅฟƒๆก†ๆžถ๏ผŒๆทฑๅบฆๅญฆไน &ๆœบๅ™จๅญฆไน ้ซ˜ๆ€ง่ƒฝๅ•ๆœบใ€ๅˆ†ๅธƒๅผ่ฎญ็ปƒๅ’Œ่ทจๅนณๅฐ้ƒจ็ฝฒ๏ผ‰
๐Ÿ”— [www.paddlepaddle.org](http://www.paddlepaddle.org/)

17. apache/mxnet โญ 20,730
Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more
๐Ÿ”— [mxnet.apache.org](https://mxnet.apache.org)

18. pyg-team/pytorch_geometric โญ 20,633
Graph Neural Network Library for PyTorch
๐Ÿ”— [pyg.org](https://pyg.org)

19. lucidrains/vit-pytorch โญ 18,920
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

20. sanster/IOPaint โญ 18,235
Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.
๐Ÿ”— [www.iopaint.com](https://www.iopaint.com/)

21. rasbt/deeplearning-models โญ 16,465
A collection of various deep learning architectures, models, and tips

22. danielgatis/rembg โญ 15,542
Rembg is a tool to remove images background

23. albumentations-team/albumentations โญ 13,777
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
๐Ÿ”— [albumentations.ai](https://albumentations.ai)

24. dmlc/dgl โญ 13,225
Python package built to ease deep learning on graph, on top of existing DL frameworks.
๐Ÿ”— [dgl.ai](http://dgl.ai)

25. facebookresearch/detr โญ 13,137
End-to-End Object Detection with Transformers

26. nvidia/DeepLearningExamples โญ 13,013
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.

27. kornia/kornia โญ 9,637
Geometric Computer Vision Library for Spatial AI
๐Ÿ”— [kornia.readthedocs.io](https://kornia.readthedocs.io)

28. mlfoundations/open_clip โญ 9,244
An open source implementation of OpenAI's CLIP (Contrastive Language-Image Pre-training)

29. keras-team/autokeras โญ 9,100
AutoML library for deep learning
๐Ÿ”— [autokeras.com](http://autokeras.com/)

30. facebookresearch/pytorch3d โญ 8,513
PyTorch3D is FAIR's library of reusable components for deep learning with 3D data
๐Ÿ”— [pytorch3d.org](https://pytorch3d.org/)

31. pyro-ppl/pyro โญ 8,444
Deep universal probabilistic programming with Python and PyTorch
๐Ÿ”— [pyro.ai](http://pyro.ai)

32. nvidia/apex โญ 8,193
A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch

33. arogozhnikov/einops โญ 8,177
Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)
๐Ÿ”— [einops.rocks](https://einops.rocks)

34. facebookresearch/ImageBind โญ 8,092
ImageBind One Embedding Space to Bind Them All

35. google/trax โญ 8,014
Trax โ€” Deep Learning with Clear Code and Speed

36. lucidrains/imagen-pytorch โญ 7,894
Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch

37. xpixelgroup/BasicSR โญ 6,508
Open Source Image and Video Restoration Toolbox for Super-resolution, Denoise, Deblurring, etc. Currently, it includes EDSR, RCAN, SRResNet, SRGAN, ESRGAN, EDVR, BasicVSR, SwinIR, ECBSR, etc. Also support StyleGAN2, DFDNet.
๐Ÿ”— [basicsr.readthedocs.io/en/latest](https://basicsr.readthedocs.io/en/latest/)

38. google/flax โญ 5,818
Flax is a neural network library for JAX that is designed for flexibility.
๐Ÿ”— [flax.readthedocs.io](https://flax.readthedocs.io)

39. skorch-dev/skorch โญ 5,704
A scikit-learn compatible neural network library that wraps PyTorch

40. facebookresearch/mmf โญ 5,451
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
๐Ÿ”— [mmf.sh](https://mmf.sh/)

41. mosaicml/composer โญ 5,074
Supercharge Your Model Training
๐Ÿ”— [docs.mosaicml.com](http://docs.mosaicml.com)

42. pytorch/ignite โญ 4,496
High-level library to help with training and evaluating neural networks in PyTorch flexibly and transparently.
๐Ÿ”— [pytorch-ignite.ai](https://pytorch-ignite.ai)

43. facebookincubator/AITemplate โญ 4,487
AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.

44. deci-ai/super-gradients โญ 4,461
Easily train or fine-tune SOTA computer vision models with one open source training library. The home of Yolo-NAS.
๐Ÿ”— [www.supergradients.com](https://www.supergradients.com)

45. nvidiagameworks/kaolin โญ 4,321
A PyTorch Library for Accelerating 3D Deep Learning Research

46. williamyang1991/VToonify โญ 3,509
[SIGGRAPH Asia 2022] VToonify: Controllable High-Resolution Portrait Video Style Transfer

47. facebookresearch/PyTorch-BigGraph โญ 3,361
Generate embeddings from large-scale graph-structured data.
๐Ÿ”— [torchbiggraph.readthedocs.io](https://torchbiggraph.readthedocs.io/)

48. cvg/LightGlue โญ 3,178
LightGlue: Local Feature Matching at Light Speed (ICCV 2023)

49. google-research/scenic โญ 3,152
Scenic: A Jax Library for Computer Vision Research and Beyond

50. pytorch/botorch โญ 3,016
Bayesian optimization in PyTorch
๐Ÿ”— [botorch.org](https://botorch.org/)

51. alpa-projects/alpa โญ 3,013
Training and serving large-scale neural networks with auto parallelization.
๐Ÿ”— [alpa.ai](https://alpa.ai)

52. deepmind/dm-haiku โญ 2,841
JAX-based neural network library
๐Ÿ”— [dm-haiku.readthedocs.io](https://dm-haiku.readthedocs.io)

53. explosion/thinc โญ 2,808
๐Ÿ”ฎ A refreshing functional take on deep learning, compatible with your favorite libraries
๐Ÿ”— [thinc.ai](https://thinc.ai)

54. nerdyrodent/VQGAN-CLIP โญ 2,592
Just playing with getting VQGAN+CLIP running locally, rather than having to use colab.

55. danielegrattarola/spektral โญ 2,355
Graph Neural Networks with Keras and Tensorflow 2.
๐Ÿ”— [graphneural.network](https://graphneural.network)

56. google-research/electra โญ 2,315
ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators

57. neuralmagic/sparseml โญ 2,013
Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models

58. fepegar/torchio โญ 2,002
Medical imaging toolkit for deep learning
๐Ÿ”— [www.torchio.org](http://www.torchio.org)

59. pytorch/torchrec โญ 1,804
Pytorch domain library for recommendation systems

60. tensorflow/mesh โญ 1,566
Mesh TensorFlow: Model Parallelism Made Easier

61. vt-vl-lab/FGVC โญ 1,548
[ECCV 2020] Flow-edge Guided Video Completion

62. tensorly/tensorly โญ 1,525
TensorLy: Tensor Learning in Python.
๐Ÿ”— [tensorly.org](http://tensorly.org)

63. calculatedcontent/WeightWatcher โญ 1,415
The WeightWatcher tool for predicting the accuracy of Deep Neural Networks

64. hysts/pytorch_image_classification โญ 1,333
PyTorch implementation of image classification models for CIFAR-10/CIFAR-100/MNIST/FashionMNIST/Kuzushiji-MNIST/ImageNet

65. jeshraghian/snntorch โญ 1,195
Deep and online learning with spiking neural networks in Python
๐Ÿ”— [snntorch.readthedocs.io/en/latest](https://snntorch.readthedocs.io/en/latest/)

66. xl0/lovely-tensors โญ 1,073
Tensors, for human consumption
๐Ÿ”— [xl0.github.io/lovely-tensors](https://xl0.github.io/lovely-tensors)

67. tensorflow/similarity โญ 1,004
TensorFlow Similarity is a python package focused on making similarity learning quick and easy.

68. keras-team/keras-cv โญ 980
Industry-strength Computer Vision workflows with Keras

69. deepmind/android_env โญ 974
RL research on Android devices.

70. deepmind/chex โญ 747
Chex is a library of utilities for helping to write reliable JAX code
๐Ÿ”— [chex.readthedocs.io](https://chex.readthedocs.io)

71. kakaobrain/rq-vae-transformer โญ 730
The official implementation of Autoregressive Image Generation using Residual Quantization (CVPR '22)

72. whitead/dmol-book โญ 595
Deep learning for molecules and materials book
๐Ÿ”— [dmol.pub](https://dmol.pub)

## Machine Learning - Interpretability

Machine learning interpretability libraries. Covers explainability, prediction explainations, dashboards, understanding knowledge development in training.

1. slundberg/shap โญ 22,171
A game theoretic approach to explain the output of any machine learning model.
๐Ÿ”— [shap.readthedocs.io](https://shap.readthedocs.io)

2. marcotcr/lime โญ 11,417
Lime: Explaining the predictions of any machine learning classifier

3. interpretml/interpret โญ 6,147
Fit interpretable models. Explain blackbox machine learning.
๐Ÿ”— [interpret.ml/docs](https://interpret.ml/docs)

4. pytorch/captum โญ 4,719
Model interpretability and understanding for PyTorch
๐Ÿ”— [captum.ai](https://captum.ai)

5. tensorflow/lucid โญ 4,637
A collection of infrastructure and tools for research in neural network interpretability.

6. pair-code/lit โญ 3,435
The Learning Interpretability Tool: Interactively analyze ML models to understand their behavior in an extensible and framework agnostic interface.
๐Ÿ”— [pair-code.github.io/lit](https://pair-code.github.io/lit)

7. arize-ai/phoenix โญ 3,054
AI Observability & Evaluation
๐Ÿ”— [docs.arize.com/phoenix](https://docs.arize.com/phoenix)

8. teamhg-memex/eli5 โญ 2,742
A library for debugging/inspecting machine learning classifiers and explaining their predictions
๐Ÿ”— [eli5.readthedocs.io](http://eli5.readthedocs.io)

9. maif/shapash โญ 2,689
๐Ÿ”… Shapash: User-friendly Explainability and Interpretability to Develop Reliable and Transparent Machine Learning Models
๐Ÿ”— [maif.github.io/shapash](https://maif.github.io/shapash/)

10. seldonio/alibi โญ 2,350
Algorithms for explaining machine learning models
๐Ÿ”— [docs.seldon.io/projects/alibi/en/stable](https://docs.seldon.io/projects/alibi/en/stable/)

11. oegedijk/explainerdashboard โญ 2,266
Quickly build Explainable AI dashboards that show the inner workings of so-called "blackbox" machine learning models.
๐Ÿ”— [explainerdashboard.readthedocs.io](http://explainerdashboard.readthedocs.io)

12. eleutherai/pythia โญ 2,149
Interpretability analysis and scaling laws to understand how knowledge develops and evolves during training in autoregressive transformers

13. jalammar/ecco โญ 1,943
Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the behavior of Transformer-based language models (like GPT2, BERT, RoBERTA, T5, and T0).
๐Ÿ”— [ecco.readthedocs.io](https://ecco.readthedocs.io)

14. trusted-ai/AIX360 โญ 1,568
Interpretability and explainability of data and machine learning models
๐Ÿ”— [aix360.res.ibm.com](https://aix360.res.ibm.com/)

15. google-deepmind/penzai โญ 1,558
A JAX library for writing models as legible, functional pytree data structures, along with tools for visualizing, modifying, and analyzing them. Penzai focuses on making it easy to do stuff with models after they have been trained
๐Ÿ”— [penzai.readthedocs.io](https://penzai.readthedocs.io/)

16. cdpierse/transformers-interpret โญ 1,236
Model explainability that works seamlessly with ๐Ÿค— transformers. Explain your transformers model in just 2 lines of code.

17. ethicalml/xai โญ 1,083
XAI is a Machine Learning library that is designed with AI explainability in its core. XAI contains various tools that enable for analysis and evaluation of data and models
๐Ÿ”— [ethical.institute/principles.html#commitment-3](https://ethical.institute/principles.html#commitment-3)

18. selfexplainml/PiML-Toolbox โญ 911
PiML (Python Interpretable Machine Learning) toolbox for model development & diagnostics
๐Ÿ”— [selfexplainml.github.io/piml-toolbox](https://selfexplainml.github.io/PiML-Toolbox)

19. salesforce/OmniXAI โญ 847
OmniXAI: A Library for eXplainable AI

## Machine Learning - Ops

MLOps tools, frameworks and libraries: intersection of machine learning, data engineering and DevOps; deployment, health, diagnostics and governance of ML models.

1. apache/airflow โญ 35,426
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
๐Ÿ”— [airflow.apache.org](https://airflow.apache.org/)

2. ray-project/ray โญ 32,138
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
๐Ÿ”— [ray.io](https://ray.io)

3. mlflow/mlflow โญ 17,963
Open source platform for the machine learning lifecycle
๐Ÿ”— [mlflow.org](https://mlflow.org)

4. spotify/luigi โญ 17,559
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.

5. prefecthq/prefect โญ 15,435
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
๐Ÿ”— [prefect.io](https://prefect.io)

6. horovod/horovod โญ 14,071
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
๐Ÿ”— [horovod.ai](http://horovod.ai)

7. iterative/dvc โญ 13,411
๐Ÿฆ‰ ML Experiments and Data Management with Git
๐Ÿ”— [dvc.org](https://dvc.org)

8. ludwig-ai/ludwig โญ 11,004
Low-code framework for building custom LLMs, neural networks, and other AI models
๐Ÿ”— [ludwig.ai](http://ludwig.ai)

9. dagster-io/dagster โญ 10,792
An orchestration platform for the development, production, and observation of data assets.
๐Ÿ”— [dagster.io](https://dagster.io)

10. great-expectations/great_expectations โญ 9,685
Always know what to expect from your data.
๐Ÿ”— [docs.greatexpectations.io](https://docs.greatexpectations.io/)

11. kedro-org/kedro โญ 9,508
Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable, and modular.
๐Ÿ”— [kedro.org](https://kedro.org)

12. dbt-labs/dbt-core โญ 9,342
dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
๐Ÿ”— [getdbt.com](https://getdbt.com)

13. bentoml/OpenLLM โญ 9,339
Run any open-source LLMs, such as Llama 2, Mistral, as OpenAI compatible API endpoint in the cloud.
๐Ÿ”— [bentoml.com](https://bentoml.com)

14. huggingface/text-generation-inference โญ 8,411
A Rust, Python and gRPC server for text generation inference. Used in production at HuggingFace to power Hugging Chat, the Inference API and Inference Endpoint.
๐Ÿ”— [hf.co/docs/text-generation-inference](http://hf.co/docs/text-generation-inference)

15. activeloopai/deeplake โญ 7,897
Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai
๐Ÿ”— [activeloop.ai](https://activeloop.ai)

16. netflix/metaflow โญ 7,837
๐Ÿš€ Build and manage real-life ML, AI, and data science projects with ease!
๐Ÿ”— [metaflow.org](https://metaflow.org)

17. mage-ai/mage-ai โญ 7,479
๐Ÿง™ Build, run, and manage data pipelines for integrating and transforming data.
๐Ÿ”— [www.mage.ai](https://www.mage.ai/)

18. kestra-io/kestra โญ 7,127
Infinitely scalable, event-driven, language-agnostic orchestration and scheduling platform to manage millions of workflows declaratively in code.
๐Ÿ”— [kestra.io](https://kestra.io)

19. bentoml/BentoML โญ 6,815
The easiest way to serve AI/ML models in production - Build Model Inference Service, LLM APIs, Multi-model Inference Graph/Pipelines, LLM/RAG apps, and more!
๐Ÿ”— [bentoml.com](https://bentoml.com)

20. allegroai/clearml โญ 5,457
ClearML - Auto-Magical CI/CD to streamline your AI workload. Experiment Management, Data Management, Pipeline, Orchestration, Scheduling & Serving in one MLOps/LLMOps solution
๐Ÿ”— [clear.ml/docs](https://clear.ml/docs)

21. feast-dev/feast โญ 5,389
The Open Source Feature Store for Machine Learning
๐Ÿ”— [feast.dev](https://feast.dev)

22. flyteorg/flyte โญ 5,220
Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.
๐Ÿ”— [flyte.org](https://flyte.org)

23. aimhubio/aim โญ 4,979
Aim ๐Ÿ’ซ โ€” An easy-to-use & supercharged open-source experiment tracker.
๐Ÿ”— [aimstack.io](https://aimstack.io)

24. evidentlyai/evidently โญ 4,896
Evaluate and monitor ML models from validation to production. Join our Discord: https://discord.com/invite/xZjKRaNp8b

25. langfuse/langfuse โญ 4,790
๐Ÿชข Open source LLM engineering platform: Observability, metrics, evals, prompt management, playground, datasets. Integrates with LlamaIndex, Langchain, OpenAI SDK, LiteLLM, and more. ๐ŸŠYC W23
๐Ÿ”— [langfuse.com/docs](https://langfuse.com/docs)

26. adap/flower โญ 4,495
Flower: A Friendly Federated Learning Framework
๐Ÿ”— [flower.ai](https://flower.ai)

27. orchest/orchest โญ 4,034
Build data pipelines, the easy way ๐Ÿ› ๏ธ
๐Ÿ”— [orchest.readthedocs.io/en/stable](https://orchest.readthedocs.io/en/stable/)

28. zenml-io/zenml โญ 3,830
ZenML ๐Ÿ™: Build portable, production-ready MLOps pipelines. https://zenml.io.
๐Ÿ”— [zenml.io](https://zenml.io)

29. kubeflow/pipelines โญ 3,513
Machine Learning Pipelines for Kubeflow
๐Ÿ”— [www.kubeflow.org/docs/components/pipelines](https://www.kubeflow.org/docs/components/pipelines/)

30. polyaxon/polyaxon โญ 3,513
MLOps Tools For Managing & Orchestrating The Machine Learning LifeCycle
๐Ÿ”— [polyaxon.com](https://polyaxon.com)

31. ploomber/ploomber โญ 3,442
The fastest โšก๏ธ way to build data pipelines. Develop iteratively, deploy anywhere. โ˜๏ธ
๐Ÿ”— [docs.ploomber.io](https://docs.ploomber.io)

32. internlm/lmdeploy โญ 3,395
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
๐Ÿ”— [lmdeploy.readthedocs.io/en/latest](https://lmdeploy.readthedocs.io/en/latest/)

33. internlm/xtuner โญ 3,351
An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
๐Ÿ”— [xtuner.readthedocs.io/zh-cn/latest](https://xtuner.readthedocs.io/zh-cn/latest/)

34. towhee-io/towhee โญ 3,087
Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.
๐Ÿ”— [towhee.io](https://towhee.io)

35. determined-ai/determined โญ 2,934
Determined is an open-source machine learning platform that simplifies distributed training, hyperparameter tuning, experiment tracking, and resource management. Works with PyTorch and TensorFlow.
๐Ÿ”— [determined.ai](https://determined.ai)

36. leptonai/leptonai โญ 2,582
A Pythonic framework to simplify AI service building
๐Ÿ”— [lepton.ai](https://lepton.ai/)

37. meltano/meltano โญ 1,702
Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to writing, maintaining, and scaling your own API integrations.
๐Ÿ”— [meltano.com](https://meltano.com/)

38. dagworks-inc/hamilton โญ 1,580
Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage and metadata. Runs and scales everywhere python does.
๐Ÿ”— [hamilton.dagworks.io/en/latest](https://hamilton.dagworks.io/en/latest/)

39. azure/PyRIT โญ 1,517
The Python Risk Identification Tool for generative AI (PyRIT) is an open access automation framework to empower security professionals and ML engineers to red team foundation models and their applications.

40. hi-primus/optimus โญ 1,462
:truck: Agile Data Preparation Workflows madeย easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
๐Ÿ”— [hi-optimus.com](https://hi-optimus.com)

41. kubeflow/examples โญ 1,387
A repository to host extended examples and tutorials

42. dstackai/dstack โญ 1,244
dstack is an easy-to-use and flexible container orchestrator for running AI workloads in any cloud or data center.
๐Ÿ”— [dstack.ai](https://dstack.ai)

43. dagworks-inc/burr โญ 758
Build applications that make decisions (chatbots, agents, simulations, etc...). Monitor, persist, and execute on your own infrastructure.
๐Ÿ”— [burr.dagworks.io](https://burr.dagworks.io)

44. nccr-itmo/FEDOT โญ 623
Automated modeling and machine learning framework FEDOT
๐Ÿ”— [fedot.readthedocs.io](https://fedot.readthedocs.io)

## Machine Learning - Reinforcement

Machine learning libraries and toolkits that cross over with reinforcement learning in some way: agent reinforcement learning, agent environemnts, RLHF

1. openai/gym โญ 34,270
A toolkit for developing and comparing reinforcement learning algorithms.
๐Ÿ”— [www.gymlibrary.dev](https://www.gymlibrary.dev)

2. unity-technologies/ml-agents โญ 16,672
The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement learning and imitation learning.
๐Ÿ”— [unity.com/products/machine-learning-agents](https://unity.com/products/machine-learning-agents)

3. openai/baselines โญ 15,526
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

4. google/dopamine โญ 10,423
Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.
๐Ÿ”— [github.com/google/dopamine](https://github.com/google/dopamine)

5. deepmind/pysc2 โญ 7,966
StarCraft II Learning Environment

6. lucidrains/PaLM-rlhf-pytorch โญ 7,646
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

7. thu-ml/tianshou โญ 7,625
An elegant PyTorch deep reinforcement learning library.
๐Ÿ”— [tianshou.org](https://tianshou.org)

8. tensorlayer/TensorLayer โญ 7,307
Deep Learning and Reinforcement Learning Library for Scientists and Engineers
๐Ÿ”— [tensorlayerx.com](http://tensorlayerx.com)

9. farama-foundation/Gymnasium โญ 6,325
An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)
๐Ÿ”— [gymnasium.farama.org](https://gymnasium.farama.org)

10. keras-rl/keras-rl โญ 5,498
Deep Reinforcement Learning for Keras.
๐Ÿ”— [keras-rl.readthedocs.io](http://keras-rl.readthedocs.io/)

11. deepmind/dm_control โญ 3,646
Google DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.

12. ai4finance-foundation/ElegantRL โญ 3,554
Massively Parallel Deep Reinforcement Learning. ๐Ÿ”ฅ
๐Ÿ”— [ai4finance.org](https://ai4finance.org)

13. facebookresearch/ReAgent โญ 3,542
A platform for Reasoning systems (Reinforcement Learning, Contextual Bandits, etc.)
๐Ÿ”— [reagent.ai](https://reagent.ai)

14. deepmind/acme โญ 3,434
A library of reinforcement learning components and agents

15. eureka-research/Eureka โญ 2,746
Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models" (ICLR 2024)
๐Ÿ”— [eureka-research.github.io](https://eureka-research.github.io/)

16. pettingzoo-team/PettingZoo โญ 2,468
An API standard for multi-agent reinforcement learning environments, with popular reference environments and related utilities
๐Ÿ”— [pettingzoo.farama.org](https://pettingzoo.farama.org)

17. kzl/decision-transformer โญ 2,272
Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.

18. pytorch/rl โญ 2,027
A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.
๐Ÿ”— [pytorch.org/rl](https://pytorch.org/rl)

19. anthropics/hh-rlhf โญ 1,509
Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"
๐Ÿ”— [arxiv.org/abs/2204.05862](https://arxiv.org/abs/2204.05862)

20. humancompatibleai/imitation โญ 1,218
Clean PyTorch implementations of imitation and reward learning algorithms
๐Ÿ”— [imitation.readthedocs.io](https://imitation.readthedocs.io/)

21. arise-initiative/robosuite โญ 1,179
robosuite: A Modular Simulation Framework and Benchmark for Robot Learning
๐Ÿ”— [robosuite.ai](https://robosuite.ai)

22. denys88/rl_games โญ 808
RL Games: High performance RL library

## Natural Language Processing

Natural language processing libraries and toolkits: text processing, topic modelling, tokenisers, chatbots. Also see the LLMs and ChatGPT category for crossover.

1. huggingface/transformers โญ 129,432
๐Ÿค— Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
๐Ÿ”— [huggingface.co/transformers](https://huggingface.co/transformers)

2. pytorch/fairseq โญ 29,846
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

3. explosion/spaCy โญ 29,309
๐Ÿ’ซ Industrial-strength Natural Language Processing (NLP) in Python
๐Ÿ”— [spacy.io](https://spacy.io)

4. myshell-ai/OpenVoice โญ 27,337
Instant voice cloning by MyShell.
๐Ÿ”— [research.myshell.ai/open-voice](https://research.myshell.ai/open-voice)

5. microsoft/unilm โญ 19,192
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
๐Ÿ”— [aka.ms/generalai](https://aka.ms/GeneralAI)

6. huggingface/datasets โญ 18,772
๐Ÿค— The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
๐Ÿ”— [huggingface.co/docs/datasets](https://huggingface.co/docs/datasets)

7. rare-technologies/gensim โญ 15,453
Topic Modelling for Humans
๐Ÿ”— [radimrehurek.com/gensim](https://radimrehurek.com/gensim)

8. vikparuchuri/marker โญ 14,472
Marker converts PDF, EPUB, and MOBI to markdown. It's 10x faster than nougat, more accurate on most documents, and has low hallucination risk.
๐Ÿ”— [www.datalab.to](https://www.datalab.to)

9. ukplab/sentence-transformers โญ 14,461
Multilingual Sentence & Image Embeddings with BERT
๐Ÿ”— [www.sbert.net](https://www.SBERT.net)

10. gunthercox/ChatterBot โญ 13,968
ChatterBot is a machine learning, conversational dialog engine for creating chat bots
๐Ÿ”— [chatterbot.readthedocs.io](https://chatterbot.readthedocs.io)

11. flairnlp/flair โญ 13,740
A very simple framework for state-of-the-art Natural Language Processing (NLP)
๐Ÿ”— [flairnlp.github.io/flair](https://flairnlp.github.io/flair/)

12. nltk/nltk โญ 13,258
NLTK Source
๐Ÿ”— [www.nltk.org](https://www.nltk.org)

13. jina-ai/clip-as-service โญ 12,299
๐Ÿ„ Scalable embedding, reasoning, ranking for images and sentences with CLIP
๐Ÿ”— [clip-as-service.jina.ai](https://clip-as-service.jina.ai)

14. allenai/allennlp โญ 11,714
An open-source NLP research library, built on PyTorch.
๐Ÿ”— [www.allennlp.org](http://www.allennlp.org)

15. openai/tiktoken โญ 11,172
tiktoken is a fast BPE tokeniser for use with OpenAI's models.

16. nvidia/NeMo โญ 10,958
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
๐Ÿ”— [docs.nvidia.com/nemo-framework/user-guide/latest/overview.html](https://docs.nvidia.com/nemo-framework/user-guide/latest/overview.html)

17. facebookresearch/seamless_communication โญ 10,555
Foundational Models for State-of-the-Art Speech and Text Translation

18. facebookresearch/ParlAI โญ 10,453
A framework for training and evaluating AI models on a variety of openly available dialogue datasets.
๐Ÿ”— [parl.ai](https://parl.ai)

19. m-bain/whisperX โญ 10,119
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

20. google/sentencepiece โญ 9,852
Unsupervised text tokenizer for Neural Network-based text generation.

21. doccano/doccano โญ 9,247
Open source annotation tool for machine learning practitioners.

22. vikparuchuri/surya โญ 9,232
OCR, layout analysis, reading order, line detection in 90+ languages
๐Ÿ”— [www.datalab.to](https://www.datalab.to)

23. sloria/TextBlob โญ 9,032
Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.
๐Ÿ”— [textblob.readthedocs.io](https://textblob.readthedocs.io/)

24. togethercomputer/OpenChatKit โญ 9,020
OpenChatKit provides a powerful, open-source base to create both specialized and general purpose chatbots

25. clips/pattern โญ 8,710
Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.
๐Ÿ”— [github.com/clips/pattern/wiki](https://github.com/clips/pattern/wiki)

26. facebookresearch/nougat โญ 8,488
Implementation of Nougat Neural Optical Understanding for Academic Documents
๐Ÿ”— [facebookresearch.github.io/nougat](https://facebookresearch.github.io/nougat/)

27. speechbrain/speechbrain โญ 8,275
A PyTorch-based Speech Toolkit
๐Ÿ”— [speechbrain.github.io](http://speechbrain.github.io)

28. espnet/espnet โญ 8,133
End-to-End Speech Processing Toolkit
๐Ÿ”— [espnet.github.io/espnet](https://espnet.github.io/espnet/)

29. neuml/txtai โญ 7,592
๐Ÿ’ก All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows
๐Ÿ”— [neuml.github.io/txtai](https://neuml.github.io/txtai)

30. deeppavlov/DeepPavlov โญ 6,610
An open source library for deep learning end-to-end dialog systems and chatbots.
๐Ÿ”— [deeppavlov.ai](https://deeppavlov.ai)

31. facebookresearch/metaseq โญ 6,436
A codebase for working with Open Pre-trained Transformers, originally forked from fairseq.

32. kingoflolz/mesh-transformer-jax โญ 6,247
Model parallel transformers in JAX and Haiku

33. maartengr/BERTopic โญ 5,817
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
๐Ÿ”— [maartengr.github.io/bertopic](https://maartengr.github.io/BERTopic/)

34. prefecthq/marvin โญ 5,014
โœจ Build AI interfaces that spark joy
๐Ÿ”— [askmarvin.ai](https://askmarvin.ai)

35. minimaxir/textgenrnn โญ 4,942
Easily train your own text-generating neural network of any size and complexity on any text dataset with a few lines of code.

36. aiwaves-cn/agents โญ 4,900
An Open-source Framework for Autonomous Language Agents

37. salesforce/CodeGen โญ 4,842
CodeGen is a family of open-source model for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.

38. layout-parser/layout-parser โญ 4,667
A Unified Toolkit for Deep Learning Based Document Image Analysis
๐Ÿ”— [layout-parser.github.io](https://layout-parser.github.io/)

39. facebookresearch/DrQA โญ 4,467
Reading Wikipedia to Answer Open-Domain Questions

40. makcedward/nlpaug โญ 4,354
Data augmentation for NLP
๐Ÿ”— [makcedward.github.io](https://makcedward.github.io/)

41. thilinarajapakse/simpletransformers โญ 4,043
Transformers for Information Retrieval, Text Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conversational AI
๐Ÿ”— [simpletransformers.ai](https://simpletransformers.ai/)

42. argilla-io/argilla โญ 3,657
Argilla is a collaboration platform for AI engineers and domain experts that require high-quality outputs, full data ownership, and overall efficiency.
๐Ÿ”— [argilla-io.github.io/argilla/latest](https://argilla-io.github.io/argilla/latest/)

43. maartengr/KeyBERT โญ 3,339
Minimal keyword extraction with BERT
๐Ÿ”— [maartengr.github.io/keybert](https://MaartenGr.github.io/KeyBERT/)

44. life4/textdistance โญ 3,332
๐Ÿ“ Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage.

45. jsvine/markovify โญ 3,288
A simple, extensible Markov chain generator.

46. bytedance/lightseq โญ 3,136
LightSeq: A High Performance Library for Sequence Processing and Generation

47. promptslab/Promptify โญ 3,125
Prompt Engineering | Prompt Versioning | Use GPT or other prompt based models to get structured output. Join our discord for Prompt-Engineering, LLMs and other latest research
๐Ÿ”— [discord.gg/m88xfymbk6](https://discord.gg/m88xfYMbK6)

48. errbotio/errbot โญ 3,091
Errbot is a chatbot, a daemon that connects to your favorite chat service and bring your tools and some fun into the conversation.
๐Ÿ”— [errbot.io](http://errbot.io)

49. huawei-noah/Pretrained-Language-Model โญ 2,991
Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.

50. neuralmagic/deepsparse โญ 2,944
Sparsity-aware deep learning inference runtime for CPUs
๐Ÿ”— [neuralmagic.com/deepsparse](https://neuralmagic.com/deepsparse/)

51. ddangelov/Top2Vec โญ 2,894
Top2Vec learns jointly embedded topic, document and word vectors.

52. jbesomi/texthero โญ 2,880
Text preprocessing, representation and visualization from zero to hero.
๐Ÿ”— [texthero.org](https://texthero.org)

53. huggingface/neuralcoref โญ 2,838
โœจFast Coreference Resolution in spaCy with Neural Networks
๐Ÿ”— [huggingface.co/coref](https://huggingface.co/coref/)

54. salesforce/CodeT5 โญ 2,663
Home of CodeT5: Open Code LLMs for Code Understanding and Generation
๐Ÿ”— [arxiv.org/abs/2305.07922](https://arxiv.org/abs/2305.07922)

55. bigscience-workshop/promptsource โญ 2,598
Toolkit for creating, sharing and using natural language prompts.

56. huggingface/setfit โญ 2,089
SetFit is an efficient and prompt-free framework for few-shot fine-tuning of Sentence Transformers.
๐Ÿ”— [hf.co/docs/setfit](https://hf.co/docs/setfit)

57. jamesturk/jellyfish โญ 2,024
๐Ÿชผ a python library for doing approximate and phonetic matching of strings.
๐Ÿ”— [jamesturk.github.io/jellyfish](https://jamesturk.github.io/jellyfish/)

58. alibaba/EasyNLP โญ 2,009
EasyNLP: A Comprehensive and Easy-to-use NLP Toolkit

59. thudm/P-tuning-v2 โญ 1,938
An optimized deep prompt tuning strategy comparable to fine-tuning across scales and tasks

60. marella/ctransformers โญ 1,763
Python bindings for the Transformer models implemented in C/C++ using GGML library.

61. featureform/featureform โญ 1,752
The Virtual Feature Store. Turn your existing data infrastructure into a feature store.
๐Ÿ”— [www.featureform.com](https://www.featureform.com)

62. deepset-ai/FARM โญ 1,735
๐Ÿก Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.
๐Ÿ”— [farm.deepset.ai](https://farm.deepset.ai)

63. franck-dernoncourt/NeuroNER โญ 1,687
Named-entity recognition using neural networks. Easy-to-use and state-of-the-art results.
๐Ÿ”— [neuroner.com](http://neuroner.com)

64. plasticityai/magnitude โญ 1,616
A fast, efficient universal vector embedding utility package.

65. arxiv-vanity/arxiv-vanity โญ 1,599
Renders papers from arXiv as responsive web pages so you don't have to squint at a PDF.
๐Ÿ”— [www.arxiv-vanity.com](https://www.arxiv-vanity.com)

66. google-research/language โญ 1,585
Shared repository for open-sourced projects from the Google AI Language team.
๐Ÿ”— [ai.google/research/teams/language](https://ai.google/research/teams/language/)

67. explosion/spacy-models โญ 1,560
๐Ÿ’ซ Models for the spaCy Natural Language Processing (NLP) library
๐Ÿ”— [spacy.io](https://spacy.io)

68. chrismattmann/tika-python โญ 1,458
Tika-Python is a Python binding to the Apache Tikaโ„ข REST services allowing Tika to be called natively in the Python community.

69. dmmiller612/bert-extractive-summarizer โญ 1,370
Easy to use extractive text summarization with BERT

70. gunthercox/chatterbot-corpus โญ 1,353
A multilingual dialog corpus
๐Ÿ”— [chatterbot-corpus.readthedocs.io](http://chatterbot-corpus.readthedocs.io)

71. jonasgeiping/cramming โญ 1,263
Cramming the training of a (BERT-type) language model into limited compute.

72. nomic-ai/nomic โญ 1,163
Interact, analyze and structure massive text, image, embedding, audio and video datasets
๐Ÿ”— [atlas.nomic.ai](https://atlas.nomic.ai)

73. intellabs/fastRAG โญ 1,137
Efficient Retrieval Augmentation and Generation Framework

74. abertsch72/unlimiformer โญ 1,043
Public repo for the NeurIPS 2023 paper "Unlimiformer: Long-Range Transformers with Unlimited Length Input"

75. urchade/GLiNER โญ 1,034
Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024
๐Ÿ”— [arxiv.org/abs/2311.08526](https://arxiv.org/abs/2311.08526)

76. pemistahl/lingua-py โญ 1,023
The most accurate natural language detection library for Python, suitable for short text and mixed-language text

77. openai/grade-school-math โญ 945
GSM8K, a dataset of 8.5K high quality linguistically diverse grade school math word problems

78. norskregnesentral/skweak โญ 914
skweak: A software toolkit for weak supervision applied to NLP tasks

79. explosion/spacy-streamlit โญ 782
๐Ÿ‘‘ spaCy building blocks and visualizers for Streamlit apps
๐Ÿ”— [share.streamlit.io/ines/spacy-streamlit-demo/master/app.py](https://share.streamlit.io/ines/spacy-streamlit-demo/master/app.py)

80. paddlepaddle/RocketQA โญ 755
๐Ÿš€ RocketQA, dense retrieval for information retrieval and question answering, including both Chinese and English state-of-the-art models.

81. keras-team/keras-nlp โญ 738
Modular Natural Language Processing workflows with Keras

82. explosion/spacy-stanza โญ 722
๐Ÿ’ฅ Use the latest Stanza (StanfordNLP) research models directly in spaCy

## Packaging

Python packaging, dependency management and bundling.

1. pyenv/pyenv โญ 37,899
pyenv lets you easily switch between multiple versions of Python.

2. python-poetry/poetry โญ 30,483
Python packaging and dependency management made easy
๐Ÿ”— [python-poetry.org](https://python-poetry.org)

3. pypa/pipenv โญ 24,737
A virtualenv management tool that supports a multitude of systems and nicely bridges the gaps between pip, python and virtualenv.
๐Ÿ”— [pipenv.pypa.io](https://pipenv.pypa.io)

4. astral-sh/uv โญ 15,209
An extremely fast Python package installer and resolver, written in Rust. Designed as a drop-in replacement for pip and pip-compile.
๐Ÿ”— [astral.sh](https://astral.sh/)

5. mitsuhiko/rye โญ 12,727
a Hassle-Free Python Experience
๐Ÿ”— [rye.astral.sh](https://rye.astral.sh)

6. pyinstaller/pyinstaller โญ 11,540
Freeze (package) Python programs into stand-alone executables
๐Ÿ”— [www.pyinstaller.org](http://www.pyinstaller.org)

7. pypa/pipx โญ 9,615
Install and Run Python Applications in Isolated Environments
๐Ÿ”— [pipx.pypa.io](https://pipx.pypa.io)

8. jazzband/pip-tools โญ 7,600
A set of tools to keep your pinned Python dependencies fresh (pip-compile + pip-sync)
๐Ÿ”— [pip-tools.rtfd.io](https://pip-tools.rtfd.io)

9. pdm-project/pdm โญ 7,596
A modern Python package and dependency manager supporting the latest PEP standards
๐Ÿ”— [pdm-project.org](https://pdm-project.org)

10. mamba-org/mamba โญ 6,573
The Fast Cross-Platform Package Manager: mamba is a reimplementation of the conda package manager in C++
๐Ÿ”— [mamba.readthedocs.io](https://mamba.readthedocs.io)

11. conda/conda โญ 6,253
A system-level, binary package and environment manager running on all major operating systems and platforms.
๐Ÿ”— [docs.conda.io/projects/conda](https://docs.conda.io/projects/conda/)

12. conda-forge/miniforge โญ 5,809
A conda-forge distribution.
๐Ÿ”— [conda-forge.org/miniforge](https://conda-forge.org/miniforge)

13. pypa/hatch โญ 5,725
Modern, extensible Python project management
๐Ÿ”— [hatch.pypa.io/latest](https://hatch.pypa.io/latest/)

14. indygreg/PyOxidizer โญ 5,319
A modern Python application packaging and distribution tool

15. pypa/virtualenv โญ 4,768
A tool to create isolated Python environments. Since Python 3.3, a subset of it has been integrated into the standard lib venv module.
๐Ÿ”— [virtualenv.pypa.io](https://virtualenv.pypa.io)

16. spack/spack โญ 4,112
A flexible package manager that supports multiple versions, configurations, platforms, and compilers.
๐Ÿ”— [spack.io](https://spack.io)

17. beeware/briefcase โญ 2,505
Tools to support converting a Python project into a standalone native application.
๐Ÿ”— [briefcase.readthedocs.io](https://briefcase.readthedocs.io/)

18. pantsbuild/pex โญ 2,493
A tool for generating .pex (Python EXecutable) files, lock files and venvs.
๐Ÿ”— [docs.pex-tool.org](https://docs.pex-tool.org)

19. prefix-dev/pixi โญ 2,424
pixi is a cross-platform, multi-language package manager and workflow tool built on the foundation of the conda ecosystem.
๐Ÿ”— [pixi.sh](https://pixi.sh)

20. pypa/flit โญ 2,135
Simplified packaging of Python modules
๐Ÿ”— [flit.pypa.io](https://flit.pypa.io/)

21. linkedin/shiv โญ 1,711
shiv is a command line utility for building fully self contained Python zipapps as outlined in PEP 441, but with all their dependencies included.

22. marcelotduarte/cx_Freeze โญ 1,286
cx_Freeze creates standalone executables from Python scripts, with the same performance, is cross-platform and should work on any platform that Python itself works on.
๐Ÿ”— [marcelotduarte.github.io/cx_freeze](https://marcelotduarte.github.io/cx_Freeze/)

23. ofek/pyapp โญ 1,108
Runtime installer for Python applications
๐Ÿ”— [ofek.dev/pyapp](https://ofek.dev/pyapp/)

24. pypa/gh-action-pypi-publish โญ 876
The blessed :octocat: GitHub Action, for publishing your ๐Ÿ“ฆ distribution files to PyPI: https://github.com/marketplace/actions/pypi-publish
๐Ÿ”— [packaging.python.org/guides/publishing-package-distribution-releases-using-github-actions-ci-cd-workflows](https://packaging.python.org/guides/publishing-package-distribution-releases-using-github-actions-ci-cd-workflows/)

25. py2exe/py2exe โญ 794
Create standalone Windows programs from Python code
๐Ÿ”— [www.py2exe.org](http://www.py2exe.org)

26. prefix-dev/rip โญ 635
RIP is a library that allows the resolving and installing of Python PyPI packages from Rust into a virtual environment. It's based on our experience with building Rattler and aims to provide the same experience but for PyPI instead of Conda.
๐Ÿ”— [prefix.dev](https://prefix.dev)

27. snok/install-poetry โญ 556
Github action for installing and configuring Poetry

28. python-poetry/install.python-poetry.org โญ 187
The official Poetry installation script
๐Ÿ”— [install.python-poetry.org](https://install.python-poetry.org)

## Pandas

Pandas and dataframe libraries: data analysis, statistical reporting, pandas GUIs, pandas performance optimisations.

1. pandas-dev/pandas โญ 42,744
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
๐Ÿ”— [pandas.pydata.org](https://pandas.pydata.org)

2. pola-rs/polars โญ 28,095
Dataframes powered by a multithreaded, vectorized query engine, written in Rust
๐Ÿ”— [docs.pola.rs](https://docs.pola.rs)

3. duckdb/duckdb โญ 20,952
DuckDB is an analytical in-process SQL database management system
๐Ÿ”— [www.duckdb.org](http://www.duckdb.org)

4. ydataai/ydata-profiling โญ 12,261
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
๐Ÿ”— [docs.profiling.ydata.ai](https://docs.profiling.ydata.ai)

5. gventuri/pandas-ai โญ 12,055
Chat with your database (SQL, CSV, pandas, polars, mongodb, noSQL, etc). PandasAI makes data analysis conversational using LLMs (GPT 3.5 / 4, Anthropic, VertexAI) and RAG.
๐Ÿ”— [pandas-ai.com](https://pandas-ai.com)

6. kanaries/pygwalker โญ 10,745
PyGWalker: Turn your pandas dataframe into an interactive UI for visual analysis
๐Ÿ”— [kanaries.net/pygwalker](https://kanaries.net/pygwalker)

7. rapidsai/cudf โญ 8,047
cuDF is a GPU DataFrame library for loading joining, aggregating, filtering, and otherwise manipulating data
๐Ÿ”— [docs.rapids.ai/api/cudf/stable](https://docs.rapids.ai/api/cudf/stable/)

8. aws/aws-sdk-pandas โญ 3,852
pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).
๐Ÿ”— [aws-sdk-pandas.readthedocs.io](https://aws-sdk-pandas.readthedocs.io)

9. nalepae/pandarallel โญ 3,595
A simple and efficient tool to parallelize Pandas operations on all availableย CPUs
๐Ÿ”— [nalepae.github.io/pandarallel](https://nalepae.github.io/pandarallel)

10. blaze/blaze โญ 3,178
NumPy and Pandas interface to Big Data
๐Ÿ”— [blaze.pydata.org](blaze.pydata.org)

11. unionai-oss/pandera โญ 3,156
A light-weight, flexible, and expressive statistical data testing library
๐Ÿ”— [www.union.ai/pandera](https://www.union.ai/pandera)

12. adamerose/PandasGUI โญ 3,152
A GUI for Pandas DataFrames

13. pydata/pandas-datareader โญ 2,874
Extract data from a wide range of Internet sources into a pandas DataFrame.
๐Ÿ”— [pydata.github.io/pandas-datareader/stable/index.html](https://pydata.github.io/pandas-datareader/stable/index.html)

14. scikit-learn-contrib/sklearn-pandas โญ 2,793
Pandas integration with sklearn

15. jmcarpenter2/swifter โญ 2,493
A package which efficiently applies any function to a pandas dataframe or series in the fastest available manner

16. delta-io/delta-rs โญ 2,006
A native Rust library for Delta Lake, with bindings into Python
๐Ÿ”— [delta-io.github.io/delta-rs](https://delta-io.github.io/delta-rs/)

17. fugue-project/fugue โญ 1,922
A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewrites.
๐Ÿ”— [fugue-tutorials.readthedocs.io](https://fugue-tutorials.readthedocs.io/)

18. eventual-inc/Daft โญ 1,887
Distributed DataFrame for Python designed for the cloud, powered by Rust
๐Ÿ”— [getdaft.io](https://getdaft.io)

19. pyjanitor-devs/pyjanitor โญ 1,328
Clean APIs for data cleaning. Python implementation of R package Janitor
๐Ÿ”— [pyjanitor-devs.github.io/pyjanitor](https://pyjanitor-devs.github.io/pyjanitor)

20. machow/siuba โญ 1,135
Python library for using dplyr like syntax with pandas and SQL
๐Ÿ”— [siuba.org](https://siuba.org)

21. renumics/spotlight โญ 1,054
Interactively explore unstructured datasets from your dataframe.
๐Ÿ”— [renumics.com](https://renumics.com)

22. holoviz/hvplot โญ 1,027
A high-level plotting API for pandas, dask, xarray, and networkx built on HoloViews
๐Ÿ”— [hvplot.holoviz.org](https://hvplot.holoviz.org)

23. tkrabel/bamboolib โญ 935
bamboolib - a GUI for pandas DataFrames
๐Ÿ”— [bamboolib.com](https://bamboolib.com)

24. mwouts/itables โญ 736
This packages changes how Pandas and Polars DataFrames are rendered in Jupyter Notebooks. With itables you can display your tables as interactive DataTables that you can sort, paginate, scroll or filter.
๐Ÿ”— [mwouts.github.io/itables](https://mwouts.github.io/itables/)

## Performance

Performance, parallelisation and low level libraries.

1. celery/celery โญ 24,100
Distributed Task Queue (development branch)
๐Ÿ”— [docs.celeryq.dev](https://docs.celeryq.dev)

2. google/flatbuffers โญ 22,591
FlatBuffers: Memory Efficient Serialization Library
๐Ÿ”— [flatbuffers.dev](https://flatbuffers.dev/)

3. pybind/pybind11 โญ 15,167
Seamless operability between C++11 and Python
๐Ÿ”— [pybind11.readthedocs.io](https://pybind11.readthedocs.io/)

4. exaloop/codon โญ 13,980
A high-performance, zero-overhead, extensible Python compiler using LLVM
๐Ÿ”— [docs.exaloop.io/codon](https://docs.exaloop.io/codon)

5. dask/dask โญ 12,259
Parallel computing with task scheduling
๐Ÿ”— [dask.org](https://dask.org)

6. numba/numba โญ 9,648
NumPy aware dynamic Python compiler using LLVM
๐Ÿ”— [numba.pydata.org](https://numba.pydata.org/)

7. modin-project/modin โญ 9,622
Modin: Scale your Pandas workflows by changing a single line of code
๐Ÿ”— [modin.readthedocs.io](http://modin.readthedocs.io)

8. nebuly-ai/nebuly โญ 8,366
The user analytics platform for LLMs
๐Ÿ”— [www.nebuly.com](https://www.nebuly.com/)

9. vaexio/vaex โญ 8,228
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second ๐Ÿš€
๐Ÿ”— [vaex.io](https://vaex.io)

10. mher/flower โญ 6,293
Real-time monitor and web admin for Celery distributed task queue
๐Ÿ”— [flower.readthedocs.io](https://flower.readthedocs.io)

11. python-trio/trio โญ 6,003
Trio โ€“ a friendly Python library for async concurrency and I/O
๐Ÿ”— [trio.readthedocs.io](https://trio.readthedocs.io)

12. ultrajson/ultrajson โญ 4,287
Ultra fast JSON decoder and encoder written in C with Python bindings
๐Ÿ”— [pypi.org/project/ujson](https://pypi.org/project/ujson/)

13. facebookincubator/cinder โญ 3,430
Cinder is Meta's internal performance-oriented production version of CPython.
๐Ÿ”— [trycinder.com](https://trycinder.com)

14. tlkh/asitop โญ 3,194
Perf monitoring CLI tool for Apple Silicon
๐Ÿ”— [tlkh.github.io/asitop](https://tlkh.github.io/asitop/)

15. ipython/ipyparallel โญ 2,566
IPython Parallel: Interactive Parallel Computing in Python
๐Ÿ”— [ipyparallel.readthedocs.io](https://ipyparallel.readthedocs.io/)

16. airtai/faststream โญ 2,179
FastStream is a powerful and easy-to-use Python framework for building asynchronous services interacting with event streams such as Apache Kafka, RabbitMQ, NATS and Redis.
๐Ÿ”— [faststream.airt.ai/latest](https://faststream.airt.ai/latest/)

17. intel/intel-extension-for-transformers โญ 2,069
โšก Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platformsโšก

18. h5py/h5py โญ 2,047
HDF5 for Python -- The h5py package is a Pythonic interface to the HDF5 binary data format.
๐Ÿ”— [www.h5py.org](http://www.h5py.org)

19. agronholm/anyio โญ 1,693
High level asynchronous concurrency and networking framework that works on top of either trio or asyncio

20. faster-cpython/ideas โญ 1,666
Discussion and work tracker for Faster CPython project.

21. dask/distributed โญ 1,556
A distributed task scheduler for Dask
๐Ÿ”— [distributed.dask.org](https://distributed.dask.org)

22. tiangolo/asyncer โญ 1,516
Asyncer, async and await, focused on developer experience.
๐Ÿ”— [asyncer.tiangolo.com](https://asyncer.tiangolo.com/)

23. intel/intel-extension-for-pytorch โญ 1,474
A Python package for extending the official PyTorch that can easily obtain performance on Intel platform

24. nschloe/perfplot โญ 1,328
:chart_with_upwards_trend: Performance analysis for Python snippets

25. intel/scikit-learn-intelex โญ 1,187
Intel(R) Extension for Scikit-learn is a seamless way to speed up your Scikit-learn application
๐Ÿ”— [intel.github.io/scikit-learn-intelex](https://intel.github.io/scikit-learn-intelex/)

26. markshannon/faster-cpython โญ 940
How to make CPython faster.

27. zerointensity/pointers.py โญ 914
Bringing the hell of pointers to Python.
๐Ÿ”— [pointers.zintensity.dev](https://pointers.zintensity.dev/)

28. brandtbucher/specialist โญ 618
Visualize CPython's specializing, adaptive interpreter. ๐Ÿ”ฅ

## Profiling

Memory and CPU/GPU profiling tools and libraries.

1. bloomberg/memray โญ 12,854
Memray is a memory profiler for Python
๐Ÿ”— [bloomberg.github.io/memray](https://bloomberg.github.io/memray/)

2. benfred/py-spy โญ 12,211
Sampling profiler for Python programs

3. plasma-umass/scalene โญ 11,401
Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals

4. joerick/pyinstrument โญ 6,304
๐Ÿšดย Call stack profiler for Python. Shows you why your code is slow!
๐Ÿ”— [pyinstrument.readthedocs.io](https://pyinstrument.readthedocs.io/)

5. gaogaotiantian/viztracer โญ 4,675
VizTracer is a low-overhead logging/debugging/profiling tool that can trace and visualize your python code execution.
๐Ÿ”— [viztracer.readthedocs.io](https://viztracer.readthedocs.io/)

6. pythonprofilers/memory_profiler โญ 4,277
Monitor Memory usage of Python code
๐Ÿ”— [pypi.python.org/pypi/memory_profiler](http://pypi.python.org/pypi/memory_profiler)

7. reloadware/reloadium โญ 2,754
Hot Reloading and Profiling for Python
๐Ÿ”— [reloadium.io](https://reloadium.io)

8. pyutils/line_profiler โญ 2,583
Line-by-line profiling for Python

9. jiffyclub/snakeviz โญ 2,285
An in-browser Python profile viewer
๐Ÿ”— [jiffyclub.github.io/snakeviz](https://jiffyclub.github.io/snakeviz/)

10. p403n1x87/austin โญ 1,405
Python frame stack sampler for CPython
๐Ÿ”— [pypi.org/project/austin-dist](https://pypi.org/project/austin-dist/)

11. pythonspeed/filprofiler โญ 824
A Python memory profiler for data processing and scientific computing applications
๐Ÿ”— [pythonspeed.com/products/filmemoryprofiler](https://pythonspeed.com/products/filmemoryprofiler/)

## Security

Security related libraries: vulnerability discovery, SQL injection, environment auditing.

1. swisskyrepo/PayloadsAllTheThings โญ 58,493
A list of useful payloads and bypass for Web Application Security and Pentest/CTF
๐Ÿ”— [swisskyrepo.github.io/payloadsallthethings](https://swisskyrepo.github.io/PayloadsAllTheThings/)

2. sqlmapproject/sqlmap โญ 31,373
Automatic SQL injection and database takeover tool
๐Ÿ”— [sqlmap.org](http://sqlmap.org)

3. certbot/certbot โญ 31,151
Certbot is EFF's tool to obtain certs from Let's Encrypt and (optionally) auto-enable HTTPS on your server. It can also act as a client for any other CA that uses the ACME protocol.

4. aquasecurity/trivy โญ 22,229
Find vulnerabilities, misconfigurations, secrets, SBOM in containers, Kubernetes, code repositories, clouds and more
๐Ÿ”— [aquasecurity.github.io/trivy](https://aquasecurity.github.io/trivy)

5. bridgecrewio/checkov โญ 6,752
Checkov is a static code analysis tool for infrastructure as code (IaC) and also a software composition analysis (SCA) tool for images and open source packages.
๐Ÿ”— [www.checkov.io](https://www.checkov.io/)

6. nccgroup/ScoutSuite โญ 6,347
Multi-Cloud Security Auditing Tool

7. pycqa/bandit โญ 6,165
Bandit is a tool designed to find common security issues in Python code.
๐Ÿ”— [bandit.readthedocs.io](https://bandit.readthedocs.io)

8. stamparm/maltrail โญ 5,956
Malicious traffic detection system

9. rhinosecuritylabs/pacu โญ 4,204
The AWS exploitation framework, designed for testing the security of Amazon Web Services environments.
๐Ÿ”— [rhinosecuritylabs.com/aws/pacu-open-source-aws-exploitation-framework](https://rhinosecuritylabs.com/aws/pacu-open-source-aws-exploitation-framework/)

10. dashingsoft/pyarmor โญ 3,104
A tool used to obfuscate python scripts, bind obfuscated scripts to fixed machine or expire obfuscated scripts.
๐Ÿ”— [pyarmor.dashingsoft.com](http://pyarmor.dashingsoft.com)

11. luijait/DarkGPT โญ 1,871
DarkGPT is an OSINT assistant based on GPT-4-200K (recommended use) designed to perform queries on leaked databases, thus providing an artificial intelligence assistant that can be useful in your traditional OSINT processes.

12. pyupio/safety โญ 1,662
Safety checks Python dependencies for known security vulnerabilities and suggests the proper remediations for vulnerabilities detected.
๐Ÿ”— [safetycli.com/product/safety-cli](https://safetycli.com/product/safety-cli)

13. trailofbits/pip-audit โญ 948
Audits Python environments, requirements files and dependency trees for known security vulnerabilities, and can automatically fix them
๐Ÿ”— [pypi.org/project/pip-audit](https://pypi.org/project/pip-audit/)

14. fadi002/de4py โญ 814
toolkit for python reverse engineering
๐Ÿ”— [de4py.000.pe](https://de4py.000.pe)

## Simulation

Simulation libraries: robotics, economic, agent-based, traffic, physics, astronomy, chemistry, quantum simulation. Also see the Maths and Science category for crossover.

1. atsushisakai/PythonRobotics โญ 22,339
Python sample codes for robotics algorithms.
๐Ÿ”— [atsushisakai.github.io/pythonrobotics](https://atsushisakai.github.io/PythonRobotics/)

2. bulletphysics/bullet3 โญ 12,223
Bullet Physics SDK: real-time collision detection and multi-physics simulation for VR, games, visual effects, robotics, machine learning etc.
๐Ÿ”— [bulletphysics.org](http://bulletphysics.org)

3. isl-org/Open3D โญ 10,879
Open3D: A Modern Library for 3D Data Processing
๐Ÿ”— [www.open3d.org](http://www.open3d.org)

4. qiskit/qiskit โญ 4,876
Qiskit is an open-source SDK for working with quantum computers at the level of extended quantum circuits, operators, and primitives.
๐Ÿ”— [www.ibm.com/quantum/qiskit](https://www.ibm.com/quantum/qiskit)

5. astropy/astropy โญ 4,311
Astronomy and astrophysics core library
๐Ÿ”— [www.astropy.org](https://www.astropy.org)

6. quantumlib/Cirq โญ 4,199
A python framework for creating, editing, and invoking Noisy Intermediate Scale Quantum (NISQ) circuits.

7. nvidia/warp โญ 3,865
A Python framework for high performance GPU simulation and graphics
๐Ÿ”— [nvidia.github.io/warp](https://nvidia.github.io/warp/)

8. openai/mujoco-py โญ 2,792
MuJoCo is a physics engine for detailed, efficient rigid body simulations with contacts. mujoco-py allows using MuJoCo from Python 3.

9. rdkit/rdkit โญ 2,530
The official sources for the RDKit library

10. taichi-dev/difftaichi โญ 2,434
10 differentiable physical simulators built with Taichi differentiable programming (DiffTaichi, ICLR 2020)

11. projectmesa/mesa โญ 2,308
Mesa is an open-source Python library for agent-based modeling, ideal for simulating complex systems and exploring emergent behaviors.
๐Ÿ”— [mesa.readthedocs.io](https://mesa.readthedocs.io)

12. google/brax โญ 2,152
Massively parallel rigidbody physics simulation on accelerator hardware.

13. quantecon/QuantEcon.py โญ 1,901
A community based Python library for quantitative economics
๐Ÿ”— [quantecon.org/quantecon-py](https://quantecon.org/quantecon-py/)

14. facebookresearch/habitat-lab โญ 1,833
A modular high-level library to train embodied AI agents across a variety of tasks and environments.
๐Ÿ”— [aihabitat.org](https://aihabitat.org/)

15. nvidia-omniverse/IsaacGymEnvs โญ 1,794
Example RL environments for the NVIDIA Isaac Gym high performance environments

16. microsoft/PromptCraft-Robotics โญ 1,791
Community for applying LLMs to robotics and a robot simulator with ChatGPT integration
๐Ÿ”— [aka.ms/chatgpt-robotics](https://aka.ms/ChatGPT-Robotics)

17. nvidia-omniverse/IsaacLab โญ 1,604
Unified framework for robot learning built on NVIDIA Isaac Sim
๐Ÿ”— [isaac-sim.github.io/isaaclab](https://isaac-sim.github.io/IsaacLab)

18. deepmodeling/deepmd-kit โญ 1,413
A deep learning package for many-body potential energy representation and molecular dynamics
๐Ÿ”— [docs.deepmodeling.com/projects/deepmd](https://docs.deepmodeling.com/projects/deepmd/)

19. sail-sg/envpool โญ 1,047
C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.
๐Ÿ”— [envpool.readthedocs.io](https://envpool.readthedocs.io)

20. a-r-j/graphein โญ 1,002
Protein Graph Library
๐Ÿ”— [graphein.ai](https://graphein.ai/)

21. bowang-lab/scGPT โญ 930
scGPT: Towards Building a Foundation Model for Single-Cell Multi-omics Using Generative AI
๐Ÿ”— [scgpt.readthedocs.io/en/latest](https://scgpt.readthedocs.io/en/latest/)

22. hardmaru/estool โญ 927
Evolution Strategies Tool

23. viblo/pymunk โญ 903
Pymunk is a easy-to-use pythonic 2d physics library that can be used whenever you need 2d rigid body physics from Python
๐Ÿ”— [www.pymunk.org](http://www.pymunk.org)

24. google-deepmind/materials_discovery โญ 841
Graph Networks for Materials Science (GNoME) is a project centered around scaling machine learning methods to tackle materials science.

25. facebookresearch/fairo โญ 838
A modular embodied agent architecture and platform for building embodied agents

26. google/evojax โญ 816
EvoJAX is a scalable, general purpose, hardware-accelerated neuroevolution toolkit built on the JAX library

27. nvidia-omniverse/OmniIsaacGymEnvs โญ 777
Reinforcement Learning Environments for Omniverse Isaac Gym

28. eureka-research/DrEureka โญ 749
Official Repository for "DrEureka: Language Model Guided Sim-To-Real Transfer" (RSS 2024)
๐Ÿ”— [eureka-research.github.io/dr-eureka](https://eureka-research.github.io/dr-eureka/)

## Study

Miscellaneous study resources: algorithms, general resources, system design, code repos for textbooks, best practices, tutorials.

1. thealgorithms/Python โญ 182,280
All Algorithms implemented in Python
๐Ÿ”— [the-algorithms.com](https://the-algorithms.com/)

2. microsoft/generative-ai-for-beginners โญ 56,729
18 Lessons, Get Started Building with Generative AI ๐Ÿ”— https://microsoft.github.io/generative-ai-for-beginners/
๐Ÿ”— [microsoft.github.io/generative-ai-for-beginners](https://microsoft.github.io/generative-ai-for-beginners/)

3. jakevdp/PythonDataScienceHandbook โญ 42,219
Python Data Science Handbook: full text in Jupyter Notebooks
๐Ÿ”— [jakevdp.github.io/pythondatasciencehandbook](http://jakevdp.github.io/PythonDataScienceHandbook)

4. mlabonne/llm-course โญ 34,298
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
๐Ÿ”— [mlabonne.github.io/blog](https://mlabonne.github.io/blog/)

5. realpython/python-guide โญ 28,013
Python best practices guidebook, written for humans.
๐Ÿ”— [docs.python-guide.org](https://docs.python-guide.org)

6. christoschristofidis/awesome-deep-learning โญ 23,336
A curated list of awesome Deep Learning tutorials, projects and communities.

7. rasbt/LLMs-from-scratch โญ 23,082
Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step
๐Ÿ”— [www.manning.com/books/build-a-large-language-model-from-scratch](https://www.manning.com/books/build-a-large-language-model-from-scratch)

8. d2l-ai/d2l-en โญ 22,570
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
๐Ÿ”— [d2l.ai](https://D2L.ai)

9. wesm/pydata-book โญ 21,698
Materials and IPython notebooks for "Python for Data Analysis" by Wes McKinney, published by O'Reilly Media

10. microsoft/recommenders โญ 18,478
Best Practices on Recommendation Systems
๐Ÿ”— [recommenders-team.github.io/recommenders/intro.html](https://recommenders-team.github.io/recommenders/intro.html)

11. fchollet/deep-learning-with-python-notebooks โญ 17,963
Jupyter notebooks for the code samples of the book "Deep Learning with Python"

12. hannibal046/Awesome-LLM โญ 16,195
Awesome-LLM: a curated list of Large Language Model

13. graykode/nlp-tutorial โญ 13,926
Natural Language Processing Tutorial for Deep Learning Researchers
๐Ÿ”— [www.reddit.com/r/machinelearning/comments/amfinl/project_nlptutoral_repository_who_is_studying](https://www.reddit.com/r/MachineLearning/comments/amfinl/project_nlptutoral_repository_who_is_studying/)

14. shangtongzhang/reinforcement-learning-an-introduction โญ 13,352
Python Implementation of Reinforcement Learning: An Introduction

15. naklecha/llama3-from-scratch โญ 11,288
llama3 implementation one matrix multiplication at a time

16. karpathy/nn-zero-to-hero โญ 11,063
Neural Networks: Zero to Hero

17. eugeneyan/open-llms โญ 10,640
๐Ÿ“‹ A list of open LLMs available for commercial use.

18. openai/spinningup โญ 9,837
An educational resource to help anyone learn deep reinforcement learning.
๐Ÿ”— [spinningup.openai.com](https://spinningup.openai.com/)

19. rucaibox/LLMSurvey โญ 9,626
The official GitHub page for the survey paper "A Survey of Large Language Models".
๐Ÿ”— [arxiv.org/abs/2303.18223](https://arxiv.org/abs/2303.18223)

20. karpathy/micrograd โญ 9,532
A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API

21. mrdbourke/pytorch-deep-learning โญ 9,450
Materials for the Learn PyTorch for Deep Learning: Zero to Mastery course.
๐Ÿ”— [learnpytorch.io](https://learnpytorch.io)

22. mooler0410/LLMsPracticalGuide โญ 9,045
A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)
๐Ÿ”— [arxiv.org/abs/2304.13712v2](https://arxiv.org/abs/2304.13712v2)

23. nielsrogge/Transformers-Tutorials โญ 8,593
This repository contains demos I made with the Transformers library by HuggingFace.

24. zhanymkanov/fastapi-best-practices โญ 7,867
FastAPI Best Practices and Conventions we used at our startup

25. firmai/industry-machine-learning โญ 7,143
A curated list of applied machine learning and data science notebooks and libraries across different industries (by @firmai)
๐Ÿ”— [www.linkedin.com/company/firmai](https://www.linkedin.com/company/firmai)

26. gkamradt/langchain-tutorials โญ 6,525
Overview and tutorial of the LangChain Library

27. udlbook/udlbook โญ 5,536
Understanding Deep Learning - Simon J.D. Prince

28. srush/GPU-Puzzles โญ 5,408
Teaching beginner GPU programming in a completely interactive fashion

29. neetcode-gh/leetcode โญ 5,326
Leetcode solutions for NeetCode.io

30. udacity/deep-learning-v2-pytorch โญ 5,238
Projects and exercises for the latest Deep Learning ND program https://www.udacity.com/course/deep-learning-nanodegree--nd101

31. mrdbourke/tensorflow-deep-learning โญ 5,043
All course materials for the Zero to Mastery Deep Learning with TensorFlow course.
๐Ÿ”— [dbourke.link/ztmtfcourse](https://dbourke.link/ZTMTFcourse)

32. roboflow/notebooks โญ 4,841
Examples and tutorials on using SOTA computer vision models and techniques. Learn everything from old-school ResNet, through YOLO and object-detection transformers like DETR, to the latest models like Grounding DINO and SAM.
๐Ÿ”— [roboflow.com/models](https://roboflow.com/models)

33. timofurrer/awesome-asyncio โญ 4,487
A curated list of awesome Python asyncio frameworks, libraries, software and resources

34. zotroneneis/machine_learning_basics โญ 4,246
Plain python implementations of basic machine learning algorithms

35. alirezadir/Machine-Learning-Interviews โญ 3,951
This repo is meant to serve as a guide for Machine Learning/AI technical interviews.

36. huggingface/deep-rl-class โญ 3,739
This repo contains the syllabus of the Hugging Face Deep Reinforcement Learning Course.

37. promptslab/Awesome-Prompt-Engineering โญ 3,524
This repository contains a hand-curated resources for Prompt Engineering with a focus on Generative Pre-trained Transformer (GPT), ChatGPT, PaLM etc
๐Ÿ”— [discord.gg/m88xfymbk6](https://discord.gg/m88xfYMbK6)

38. huggingface/diffusion-models-class โญ 3,400
Materials for the Hugging Face Diffusion Models Course

39. cosmicpython/book โญ 3,326
A Book about Pythonic Application Architecture Patterns for Managing Complexity. Cosmos is the Opposite of Chaos you see. O'R. wouldn't actually let us call it "Cosmic Python" tho.
๐Ÿ”— [www.cosmicpython.com](https://www.cosmicpython.com)

40. rasbt/machine-learning-book โญ 3,122
Code Repository for Machine Learning with PyTorch and Scikit-Learn
๐Ÿ”— [sebastianraschka.com/books/#machine-learning-with-pytorch-and-scikit-learn](https://sebastianraschka.com/books/#machine-learning-with-pytorch-and-scikit-learn)

41. fluentpython/example-code-2e โญ 3,087
Example code for Fluent Python, 2nd edition (O'Reilly 2022)
๐Ÿ”— [amzn.to/3j48u2j](https://amzn.to/3J48u2J)

42. amanchadha/coursera-deep-learning-specialization โญ 2,877
Notes, programming assignments and quizzes from all courses within the Coursera Deep Learning specialization offered by deeplearning.ai: (i) Neural Networks and Deep Learning; (ii) Improving Deep Neural Networks: Hyperparameter tuning, Regularization and Optimization; (iii) Structuring Machine Learning Projects; (iv...

43. mrdbourke/zero-to-mastery-ml โญ 2,752
All course materials for the Zero to Mastery Machine Learning and Data Science course.
๐Ÿ”— [dbourke.link/ztmmlcourse](https://dbourke.link/ZTMmlcourse)

44. krzjoa/awesome-python-data-science โญ 2,429
Probably the best curated list of data science software in Python.
๐Ÿ”— [krzjoa.github.io/awesome-python-data-science](https://krzjoa.github.io/awesome-python-data-science)

45. cgpotts/cs224u โญ 2,081
Code for CS224u: Natural Language Understanding

46. cerlymarco/MEDIUM_NoteBook โญ 2,050
Repository containing notebooks of my posts on Medium

47. gerdm/prml โญ 2,016
Repository of notes, code and notebooks in Python for the book Pattern Recognition and Machine Learning by Christopher Bishop

48. trananhkma/fucking-awesome-python โญ 1,977
awesome-python with :octocat: โญ and ๐Ÿด

49. chandlerbang/awesome-self-supervised-gnn โญ 1,516
Papers about pretraining and self-supervised learning on Graph Neural Networks (GNN).

50. atcold/NYU-DLSP21 โญ 1,512
NYU Deep Learning Spring 2021
๐Ÿ”— [atcold.github.io/nyu-dlsp21](https://atcold.github.io/NYU-DLSP21/)

51. huggingface/cookbook โญ 1,507
Community-driven practical examples of building AI applications and solving various tasks with AI using open-source tools and models.
๐Ÿ”— [huggingface.co/learn/cookbook](https://huggingface.co/learn/cookbook)

52. patrickloeber/MLfromscratch โญ 1,222
Machine Learning algorithm implementations from scratch.

53. davidadsp/Generative_Deep_Learning_2nd_Edition โญ 972
The official code repository for the second edition of the O'Reilly book Generative Deep Learning: Teaching Machines to Paint, Write, Compose and Play.
๐Ÿ”— [www.oreilly.com/library/view/generative-deep-learning/9781098134174](https://www.oreilly.com/library/view/generative-deep-learning/9781098134174/)

54. jackhidary/quantumcomputingbook โญ 778
Companion site for the textbook Quantum Computing: An Applied Approach

55. dylanhogg/awesome-python โญ 281
๐Ÿ Hand-picked awesome Python libraries and frameworks, organised by category
๐Ÿ”— [www.awesomepython.org](https://www.awesomepython.org)

## Template

Template tools and libraries: cookiecutter repos, generators, quick-starts.

1. tiangolo/full-stack-fastapi-template โญ 24,714
Full stack, modern web application template. Using FastAPI, React, SQLModel, PostgreSQL, Docker, GitHub Actions, automatic HTTPS and more.

2. cookiecutter/cookiecutter โญ 22,018
A cross-platform command-line utility that creates projects from cookiecutters (project templates), e.g. Python package projects, C projects.
๐Ÿ”— [pypi.org/project/cookiecutter](https://pypi.org/project/cookiecutter/)

3. drivendata/cookiecutter-data-science โญ 7,992
A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.
๐Ÿ”— [cookiecutter-data-science.drivendata.org](https://cookiecutter-data-science.drivendata.org/)

4. buuntu/fastapi-react โญ 2,145
๐Ÿš€ Cookiecutter Template for FastAPI + React Projects. Using PostgreSQL, SQLAlchemy, and Docker

5. pyscaffold/pyscaffold โญ 2,053
๐Ÿ›  Python project template generator with batteries included
๐Ÿ”— [pyscaffold.org](https://pyscaffold.org)

6. cjolowicz/cookiecutter-hypermodern-python โญ 1,762
Cookiecutter template for a Python package based on the Hypermodern Python article series.
๐Ÿ”— [cookiecutter-hypermodern-python.readthedocs.io](http://cookiecutter-hypermodern-python.readthedocs.io/)

7. tezromach/python-package-template โญ 1,085
๐Ÿš€ Your next Python package needs a bleeding-edge project structure.

8. martinheinz/python-project-blueprint โญ 952
Blueprint/Boilerplate For Python Projects

9. fmind/mlops-python-package โญ 578
Kickstart your MLOps initiative with a flexible, robust, and productive Python package.
๐Ÿ”— [fmind.github.io/mlops-python-package](https://fmind.github.io/mlops-python-package/)

## Terminal

Terminal and console tools and libraries: CLI tools, terminal based formatters, progress bars.

1. willmcgugan/rich โญ 48,285
Rich is a Python library for rich text and beautiful formatting in the terminal.
๐Ÿ”— [rich.readthedocs.io/en/latest](https://rich.readthedocs.io/en/latest/)

2. tqdm/tqdm โญ 27,987
:zap: A Fast, Extensible Progress Bar for Python and CLI
๐Ÿ”— [tqdm.github.io](https://tqdm.github.io)

3. google/python-fire โญ 26,616
Python Fire is a library for automatically generating command line interfaces (CLIs) from absolutely any Python object.

4. willmcgugan/textual โญ 24,279
The lean application framework for Python. Build sophisticated user interfaces with a simple Python API. Run your apps in the terminal and a web browser.
๐Ÿ”— [textual.textualize.io](https://textual.textualize.io/)

5. pallets/click โญ 15,322
Python composable command line interface toolkit
๐Ÿ”— [click.palletsprojects.com](https://click.palletsprojects.com)

6. tiangolo/typer โญ 14,925
Typer, build great CLIs. Easy to code. Based on Python type hints.
๐Ÿ”— [typer.tiangolo.com](https://typer.tiangolo.com/)

7. saulpw/visidata โญ 7,692
A terminal spreadsheet multitool for discovering and arranging data
๐Ÿ”— [visidata.org](http://visidata.org)

8. tconbeer/harlequin โญ 3,357
The SQL IDE for Your Terminal.
๐Ÿ”— [harlequin.sh](https://harlequin.sh)

9. manrajgrover/halo โญ 2,865
๐Ÿ’ซ Beautiful spinners for terminal, IPython and Jupyter

10. urwid/urwid โญ 2,765
Console user interface library for Python (official repo)
๐Ÿ”— [urwid.org](urwid.org)

11. textualize/trogon โญ 2,406
Easily turn your Click CLI into a powerful terminal application

12. darrenburns/elia โญ 1,633
A snappy, keyboard-centric terminal user interface for interacting with large language models. Chat with ChatGPT, Claude, Llama 3, Phi 3, Mistral, Gemma and more.

13. tmbo/questionary โญ 1,476
Python library to build pretty command line user prompts โœจEasy to use multi-select lists, confirmations, free text prompts ...

14. jazzband/prettytable โญ 1,312
Display tabular data in a visually appealing ASCII table format
๐Ÿ”— [pypi.org/project/prettytable](https://pypi.org/project/PrettyTable/)

15. 1j01/textual-paint โญ 929
:art: MS Paint in your terminal.
๐Ÿ”— [pypi.org/project/textual-paint](https://pypi.org/project/textual-paint/)

## Testing

Testing libraries: unit testing, load testing, acceptance testing, code coverage, browser automation, plugins.

1. mitmproxy/mitmproxy โญ 35,318
An interactive TLS-capable intercepting HTTP proxy for penetration testers and software developers.
๐Ÿ”— [mitmproxy.org](https://mitmproxy.org)

2. locustio/locust โญ 24,243
Write scalable load tests in plain Python ๐Ÿš—๐Ÿ’จ

3. pytest-dev/pytest โญ 11,669
The pytest framework makes it easy to write small tests, yet scales to support complex functional testing
๐Ÿ”— [pytest.org](https://pytest.org)

4. microsoft/playwright-python โญ 11,208
Python version of the Playwright testing and automation library.
๐Ÿ”— [playwright.dev/python](https://playwright.dev/python/)

5. robotframework/robotframework โญ 9,414
Generic automation framework for acceptance testing and RPA
๐Ÿ”— [robotframework.org](http://robotframework.org)

6. getmoto/moto โญ 7,503
A library that allows you to easily mock out tests based on AWS infrastructure.
๐Ÿ”— [docs.getmoto.org/en/latest](http://docs.getmoto.org/en/latest/)

7. hypothesisworks/hypothesis โญ 7,416
Hypothesis is a powerful, flexible, and easy to use library for property-based testing.
๐Ÿ”— [hypothesis.works](https://hypothesis.works)

8. newsapps/beeswithmachineguns โญ 6,424
A utility for arming (creating) many bees (micro EC2 instances) to attack (load test) targets (web applications).
๐Ÿ”— [apps.chicagotribune.com](http://apps.chicagotribune.com/)

9. seleniumbase/SeleniumBase โญ 4,505
๐Ÿ“Š Python's all-in-one framework for web crawling, scraping, testing, and reporting. Supports pytest. UC Mode provides stealth. Includes many tools.
๐Ÿ”— [seleniumbase.io](https://seleniumbase.io)

10. getsentry/responses โญ 4,088
A utility for mocking out the Python Requests library.

11. codium-ai/cover-agent โญ 4,042
CodiumAI Cover-Agent: An AI-Powered Tool for Automated Test Generation and Code Coverage Enhancement! ๐Ÿ’ป๐Ÿค–๐Ÿงช๐Ÿž
๐Ÿ”— [www.codium.ai](https://www.codium.ai/)

12. spulec/freezegun โญ 4,041
Let your Python tests travel through time

13. tox-dev/tox โญ 3,585
Command line driven CI frontend and development task automation tool.
๐Ÿ”— [tox.wiki](https://tox.wiki)

14. behave/behave โญ 3,106
BDD, Python style.
๐Ÿ”— [behave.readthedocs.io/en/latest](https://behave.readthedocs.io/en/latest/)

15. nedbat/coveragepy โญ 2,911
The code coverage tool for Python
๐Ÿ”— [coverage.readthedocs.io](https://coverage.readthedocs.io)

16. cobrateam/splinter โญ 2,700
splinter - python test framework for web applications
๐Ÿ”— [splinter.readthedocs.org/en/stable/index.html](http://splinter.readthedocs.org/en/stable/index.html)

17. kevin1024/vcrpy โญ 2,652
Automatically mock your HTTP interactions to simplify and speed up testing

18. confident-ai/deepeval โญ 2,445
The LLM Evaluation Framework
๐Ÿ”— [docs.confident-ai.com](https://docs.confident-ai.com/)

19. pytest-dev/pytest-testinfra โญ 2,343
With Testinfra you can write unit tests in Python to test actual state of your servers configured by management tools like Salt, Ansible, Puppet, Chef and so on.
๐Ÿ”— [testinfra.readthedocs.io](https://testinfra.readthedocs.io)

20. pytest-dev/pytest-mock โญ 1,790
Thin-wrapper around the mock package for easier use with pytest
๐Ÿ”— [pytest-mock.readthedocs.io/en/latest](https://pytest-mock.readthedocs.io/en/latest/)

21. pytest-dev/pytest-cov โญ 1,699
Coverage plugin for pytest.

22. pytest-dev/pytest-xdist โญ 1,408
pytest plugin for distributed testing and loop-on-failures testing modes.
๐Ÿ”— [pytest-xdist.readthedocs.io](https://pytest-xdist.readthedocs.io)

23. pytest-dev/pytest-asyncio โญ 1,367
Asyncio support for pytest
๐Ÿ”— [pytest-asyncio.readthedocs.io](https://pytest-asyncio.readthedocs.io)

24. taverntesting/tavern โญ 1,005
A command-line tool and Python library and Pytest plugin for automated testing of RESTful APIs, with a simple, concise and flexible YAML-based syntax
๐Ÿ”— [taverntesting.github.io](https://taverntesting.github.io/)

## Machine Learning - Time Series

Machine learning and classical timeseries libraries: forecasting, seasonality, anomaly detection, econometrics.

1. facebook/prophet โญ 18,069
Tool for producing high quality forecasts for time series data that has multiple seasonality with linear or non-linear growth.
๐Ÿ”— [facebook.github.io/prophet](https://facebook.github.io/prophet)

2. blue-yonder/tsfresh โญ 8,230
Automatic extraction of relevant features from time series:
๐Ÿ”— [tsfresh.readthedocs.io](http://tsfresh.readthedocs.io)

3. sktime/sktime โญ 7,606
A unified framework for machine learning with time series
๐Ÿ”— [www.sktime.net](https://www.sktime.net)

4. unit8co/darts โญ 7,603
A python library for user-friendly forecasting and anomaly detection on time series.
๐Ÿ”— [unit8co.github.io/darts](https://unit8co.github.io/darts/)

5. facebookresearch/Kats โญ 4,849
Kats, a kit to analyze time series data, a lightweight, easy-to-use, generalizable, and extendable framework to perform time series analysis, from understanding the key statistics and characteristics, detecting change points and anomalies, to forecasting future trends.

6. awslabs/gluonts โญ 4,454
Probabilistic time series modeling in Python
๐Ÿ”— [ts.gluon.ai](https://ts.gluon.ai)

7. nixtla/statsforecast โญ 3,740
Lightning โšก๏ธ fast forecasting with statistical and econometric models.
๐Ÿ”— [nixtlaverse.nixtla.io/statsforecast](https://nixtlaverse.nixtla.io/statsforecast)

8. salesforce/Merlion โญ 3,312
Merlion: A Machine Learning Framework for Time Series Intelligence

9. google-research/timesfm โญ 3,171
TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting.
๐Ÿ”— [research.google/blog/a-decoder-only-foundation-model-for-time-series-forecasting](https://research.google/blog/a-decoder-only-foundation-model-for-time-series-forecasting/)

10. tdameritrade/stumpy โญ 3,088
STUMPY is a powerful and scalable Python library for modern time series analysis
๐Ÿ”— [stumpy.readthedocs.io/en/latest](https://stumpy.readthedocs.io/en/latest/)

11. amazon-science/chronos-forecasting โญ 2,103
Chronos: Pretrained (Language) Models for Probabilistic Time Series Forecasting
๐Ÿ”— [arxiv.org/abs/2403.07815](https://arxiv.org/abs/2403.07815)

12. rjt1990/pyflux โญ 2,099
Open source time series library for Python

13. aistream-peelout/flow-forecast โญ 1,964
Deep learning PyTorch library for time series forecasting, classification, and anomaly detection (originally for flood forecasting).
๐Ÿ”— [flow-forecast.atlassian.net/wiki/spaces/ff/overview](https://flow-forecast.atlassian.net/wiki/spaces/FF/overview)

14. uber/orbit โญ 1,833
A Python package for Bayesian forecasting with object-oriented design and probabilistic models under the hood.
๐Ÿ”— [orbit-ml.readthedocs.io/en/stable](https://orbit-ml.readthedocs.io/en/stable/)

15. alkaline-ml/pmdarima โญ 1,553
A statistical library designed to fill the void in Python's time series analysis capabilities, including the equivalent of R's auto.arima function.
๐Ÿ”— [www.alkaline-ml.com/pmdarima](https://www.alkaline-ml.com/pmdarima)

16. time-series-foundation-models/lag-llama โญ 1,120
Lag-Llama: Towards Foundation Models for Probabilistic Time Series Forecasting

17. winedarksea/AutoTS โญ 1,057
Automated Time Series Forecasting

18. autoviml/Auto_TS โญ 700
Automatically build ARIMA, SARIMAX, VAR, FB Prophet and XGBoost Models on Time Series data sets with a Single Line of Code. Created by Ram Seshadri. Collaborators welcome.

19. google/temporian โญ 654
Temporian is an open-source Python library for preprocessing โšก and feature engineering ๐Ÿ›  temporal data ๐Ÿ“ˆ for machine learning applications ๐Ÿค–
๐Ÿ”— [temporian.readthedocs.io](https://temporian.readthedocs.io)

## Typing

Typing libraries: static and run-time type checking, annotations.

1. python/mypy โญ 17,920
Optional static typing for Python
๐Ÿ”— [www.mypy-lang.org](https://www.mypy-lang.org/)

2. microsoft/pyright โญ 12,699
Static Type Checker for Python

3. facebook/pyre-check โญ 6,753
Performant type-checking for python.
๐Ÿ”— [pyre-check.org](https://pyre-check.org/)

4. python-attrs/attrs โญ 5,153
Python Classes Without Boilerplate
๐Ÿ”— [www.attrs.org](https://www.attrs.org/)

5. instagram/MonkeyType โญ 4,693
A Python library that generates static type annotations by collecting runtime types

6. google/pytype โญ 4,676
A static type analyzer for Python code
๐Ÿ”— [google.github.io/pytype](https://google.github.io/pytype)

7. python/typeshed โญ 4,172
Collection of library stubs for Python, with static types

8. mtshiba/pylyzer โญ 2,161
A fast static code analyzer & language server for Python
๐Ÿ”— [mtshiba.github.io/pylyzer](http://mtshiba.github.io/pylyzer/)

9. microsoft/pylance-release โญ 1,684
Fast, feature-rich language support for Python. Documentation and issues for Pylance.

10. agronholm/typeguard โญ 1,473
Run-time type checker for Python

11. patrick-kidger/torchtyping โญ 1,358
Type annotations and dynamic checking for a tensor's shape, dtype, names, etc.

12. robertcraigie/pyright-python โญ 152
Python command line wrapper for pyright, a static type checker
๐Ÿ”— [pypi.org/project/pyright](https://pypi.org/project/pyright/)

## Utility

General utility libraries: miscellaneous tools, linters, code formatters, version management, package tools, documentation tools.

1. yt-dlp/yt-dlp โญ 77,167
A feature-rich command-line audio/video downloader
๐Ÿ”— [discord.gg/h5mncfw63r](https://discord.gg/H5MNcFW63r)

2. home-assistant/core โญ 69,988
๐Ÿก Open source home automation that puts local control and privacy first.
๐Ÿ”— [www.home-assistant.io](https://www.home-assistant.io)

3. python/cpython โญ 61,174
The Python programming language
๐Ÿ”— [www.python.org](https://www.python.org/)

4. localstack/localstack โญ 53,077
๐Ÿ’ป A fully functional local AWS cloud stack. Develop and test your cloud & Serverless apps offline
๐Ÿ”— [localstack.cloud](https://localstack.cloud)

5. faif/python-patterns โญ 39,914
A collection of design patterns/idioms in Python

6. mingrammer/diagrams โญ 35,681
:art: Diagram as Code for prototyping cloud system architectures
๐Ÿ”— [diagrams.mingrammer.com](https://diagrams.mingrammer.com)

7. ggerganov/whisper.cpp โญ 33,185
Port of OpenAI's Whisper model in C/C++

8. keon/algorithms โญ 23,798
Minimal examples of data structures and algorithms in Python

9. norvig/pytudes โญ 22,532
Python programs, usually short, of considerable difficulty, to perfect particular skills.

10. modularml/mojo โญ 22,334
The Mojo Programming Language
๐Ÿ”— [docs.modular.com/mojo](https://docs.modular.com/mojo)

11. openai/openai-python โญ 21,237
The official Python library for the OpenAI API
๐Ÿ”— [pypi.org/project/openai](https://pypi.org/project/openai/)

12. chriskiehl/Gooey โญ 20,499
Turn (almost) any Python command line program into a full GUI application with one line

13. facebookresearch/audiocraft โญ 20,243
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

14. pydantic/pydantic โญ 19,760
Data validation using Python type hints
๐Ÿ”— [docs.pydantic.dev](https://docs.pydantic.dev)

15. squidfunk/mkdocs-material โญ 19,277
Documentation that simply works
๐Ÿ”— [squidfunk.github.io/mkdocs-material](https://squidfunk.github.io/mkdocs-material/)

16. delgan/loguru โญ 18,878
Python logging made (stupidly) simple

17. micropython/micropython โญ 18,771
MicroPython - a lean and efficient Python implementation for microcontrollers and constrained systems
๐Ÿ”— [micropython.org](https://micropython.org)

18. mkdocs/mkdocs โญ 18,731
Project documentation with Markdown.
๐Ÿ”— [www.mkdocs.org](https://www.mkdocs.org)

19. rustpython/RustPython โญ 18,085
A Python Interpreter written in Rust
๐Ÿ”— [rustpython.github.io](https://rustpython.github.io)

20. kivy/kivy โญ 17,354
Open source UI framework written in Python, running on Windows, Linux, macOS, Android and iOS
๐Ÿ”— [kivy.org](https://kivy.org)

21. higherorderco/Bend โญ 16,822
A massively parallel, high-level programming language
๐Ÿ”— [higherorderco.com](https://higherorderco.com)

22. ipython/ipython โญ 16,195
Official repository for IPython itself. Other repos in the IPython organization contain things like the website, documentation builds, etc.
๐Ÿ”— [ipython.readthedocs.org](https://ipython.readthedocs.org)

23. blakeblackshear/frigate โญ 16,187
NVR with realtime local object detection for IP cameras
๐Ÿ”— [frigate.video](https://frigate.video)

24. alievk/avatarify-python โญ 16,182
Avatars for Zoom, Skype and other video-conferencing apps.

25. paul-gauthier/aider โญ 14,037
Aider is a command line tool that lets you pair program with LLMs, to edit code stored in your local git repository
๐Ÿ”— [aider.chat](https://aider.chat/)

26. zulko/moviepy โญ 12,141
Video editing with Python
๐Ÿ”— [zulko.github.io/moviepy](https://zulko.github.io/moviepy/)

27. openai/triton โญ 12,019
Development repository for the Triton language and compiler
๐Ÿ”— [triton-lang.org](https://triton-lang.org/)

28. python-pillow/Pillow โญ 11,920
The Python Imaging Library adds image processing capabilities to Python (Pillow is the friendly PIL fork)
๐Ÿ”— [python-pillow.org](https://python-pillow.org)

29. pyodide/pyodide โญ 11,743
Pyodide is a Python distribution for the browser and Node.js based on WebAssembly
๐Ÿ”— [pyodide.org/en/stable](https://pyodide.org/en/stable/)

30. dbader/schedule โญ 11,623
Python job scheduling for humans.
๐Ÿ”— [schedule.readthedocs.io](https://schedule.readthedocs.io/)

31. pyo3/pyo3 โญ 11,543
Rust bindings for the Python interpreter
๐Ÿ”— [pyo3.rs](https://pyo3.rs)

32. pytube/pytube โญ 11,436
A lightweight, dependency-free Python library (and command-line utility) for downloading YouTube Videos.
๐Ÿ”— [pytube.io](https://pytube.io)

33. nuitka/Nuitka โญ 11,432
Nuitka is a Python compiler written in Python. It's fully compatible with Python 2.6, 2.7, 3.4-3.12. You feed it your Python app, it does a lot of clever things, and spits out an executable or extension module.
๐Ÿ”— [nuitka.net](http://nuitka.net)

34. caronc/apprise โญ 11,010
Apprise - Push Notifications that work with just about every platform!
๐Ÿ”— [hub.docker.com/r/caronc/apprise](https://hub.docker.com/r/caronc/apprise)

35. ninja-build/ninja โญ 10,823
Ninja is a small build system with a focus on speed.
๐Ÿ”— [ninja-build.org](https://ninja-build.org/)

36. secdev/scapy โญ 10,342
Scapy: the Python-based interactive packet manipulation program & library.
๐Ÿ”— [scapy.net](https://scapy.net)

37. magicstack/uvloop โญ 10,170
Ultra fast asyncio event loop.

38. pallets/jinja โญ 10,111
A very fast and expressive template engine.
๐Ÿ”— [jinja.palletsprojects.com](https://jinja.palletsprojects.com)

39. asweigart/pyautogui โญ 9,948
A cross-platform GUI automation Python module for human beings. Used to programmatically control the mouse & keyboard.

40. aws/serverless-application-model โญ 9,291
The AWS Serverless Application Model (AWS SAM) transform is a AWS CloudFormation macro that transforms SAM templates into CloudFormation templates.
๐Ÿ”— [aws.amazon.com/serverless/sam](https://aws.amazon.com/serverless/sam)

41. cython/cython โญ 9,190
The most widely used Python to C compiler
๐Ÿ”— [cython.org](https://cython.org)

42. paramiko/paramiko โญ 8,947
The leading native Python SSHv2 protocol library.
๐Ÿ”— [paramiko.org](http://paramiko.org)

43. boto/boto3 โญ 8,846
AWS SDK for Python
๐Ÿ”— [aws.amazon.com/sdk-for-python](https://aws.amazon.com/sdk-for-python/)

44. arrow-py/arrow โญ 8,631
๐Ÿน Better dates & times for Python
๐Ÿ”— [arrow.readthedocs.io](https://arrow.readthedocs.io)

45. facebookresearch/hydra โญ 8,425
Hydra is a framework for elegantly configuring complex applications
๐Ÿ”— [hydra.cc](https://hydra.cc)

46. xonsh/xonsh โญ 8,195
:shell: Python-powered shell. Full-featured and cross-platform.
๐Ÿ”— [xon.sh](http://xon.sh)

47. eternnoir/pyTelegramBotAPI โญ 7,900
Python Telegram bot api.

48. py-pdf/pypdf โญ 7,779
A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files
๐Ÿ”— [pypdf.readthedocs.io/en/latest](https://pypdf.readthedocs.io/en/latest/)

49. kellyjonbrazil/jc โญ 7,704
CLI tool and python library that converts the output of popular command-line tools, file-types, and common strings to JSON, YAML, or Dictionaries. This allows piping of output to tools like jq and simplifying automation scripts.

50. googleapis/google-api-python-client โญ 7,576
๐Ÿ The official Python client library for Google's discovery based APIs.
๐Ÿ”— [googleapis.github.io/google-api-python-client/docs](https://googleapis.github.io/google-api-python-client/docs/)

51. theskumar/python-dotenv โญ 7,341
Reads key-value pairs from a .env file and can set them as environment variables. It helps in developing applications following the 12-factor principles.
๐Ÿ”— [saurabh-kumar.com/python-dotenv](https://saurabh-kumar.com/python-dotenv/)

52. jasonppy/VoiceCraft โญ 7,251
Zero-Shot Speech Editing and Text-to-Speech in the Wild

53. google/latexify_py โญ 7,105
A library to generate LaTeX expression from Python code.

54. googlecloudplatform/python-docs-samples โญ 7,104
Code samples used on cloud.google.com

55. marshmallow-code/marshmallow โญ 6,939
A lightweight library for converting complex objects to and from simple Python datatypes.
๐Ÿ”— [marshmallow.readthedocs.io](https://marshmallow.readthedocs.io/)

56. hugapi/hug โญ 6,854
Embrace the APIs of the future. Hug aims to make developing APIs as simple as possible, but no simpler.

57. pygithub/PyGithub โญ 6,816
Typed interactions with the GitHub API v3
๐Ÿ”— [pygithub.readthedocs.io](https://pygithub.readthedocs.io/)

58. pyca/cryptography โญ 6,428
cryptography is a package designed to expose cryptographic primitives and recipes to Python developers.
๐Ÿ”— [cryptography.io](https://cryptography.io)

59. openai/point-e โญ 6,416
Point cloud diffusion for 3D model synthesis

60. gorakhargosh/watchdog โญ 6,399
Python library and shell utilities to monitor filesystem events.
๐Ÿ”— [packages.python.org/watchdog](http://packages.python.org/watchdog/)

61. jd/tenacity โญ 6,320
Retrying library for Python
๐Ÿ”— [tenacity.readthedocs.io](http://tenacity.readthedocs.io)

62. icloud-photos-downloader/icloud_photos_downloader โญ 6,257
A command-line tool to download photos from iCloud

63. sphinx-doc/sphinx โญ 6,222
The Sphinx documentation generator
๐Ÿ”— [www.sphinx-doc.org](https://www.sphinx-doc.org/)

64. sdispater/pendulum โญ 6,127
Python datetimes made easy
๐Ÿ”— [pendulum.eustace.io](https://pendulum.eustace.io)

65. bndr/pipreqs โญ 5,993
pipreqs - Generate pip requirements.txt file based on imports of any project. Looking for maintainers to move this project forward.

66. agronholm/apscheduler โญ 5,965
Task scheduling library for Python

67. scikit-image/scikit-image โญ 5,961
Image processing in Python
๐Ÿ”— [scikit-image.org](https://scikit-image.org)

68. wireservice/csvkit โญ 5,907
A suite of utilities for converting to and working with CSV, the king of tabular file formats.
๐Ÿ”— [csvkit.readthedocs.io](https://csvkit.readthedocs.io)

69. ijl/orjson โญ 5,863
Fast, correct Python JSON library supporting dataclasses, datetimes, and numpy

70. timdettmers/bitsandbytes โญ 5,781
Accessible large language models via k-bit quantization for PyTorch.
๐Ÿ”— [huggingface.co/docs/bitsandbytes/main/en/index](https://huggingface.co/docs/bitsandbytes/main/en/index)

71. pdfminer/pdfminer.six โญ 5,659
Community maintained fork of pdfminer - we fathom PDF
๐Ÿ”— [pdfminersix.readthedocs.io](https://pdfminersix.readthedocs.io)

72. pytransitions/transitions โญ 5,520
A lightweight, object-oriented finite state machine implementation in Python with many extensions

73. rsalmei/alive-progress โญ 5,259
A new kind of Progress Bar, with real-time throughput, ETA, and very cool animations!

74. buildbot/buildbot โญ 5,200
Python-based continuous integration testing framework; your pull requests are more than welcome!
๐Ÿ”— [www.buildbot.net](https://www.buildbot.net)

75. prompt-toolkit/ptpython โญ 5,125
A better Python REPL

76. pycqa/pycodestyle โญ 5,005
Simple Python style checker in one Python file
๐Ÿ”— [pycodestyle.pycqa.org](https://pycodestyle.pycqa.org)

77. spotify/pedalboard โญ 4,992
๐ŸŽ› ๐Ÿ”Š A Python library for audio.
๐Ÿ”— [spotify.github.io/pedalboard](https://spotify.github.io/pedalboard)

78. jorgebastida/awslogs โญ 4,802
AWS CloudWatch logs for Humansโ„ข

79. pywinauto/pywinauto โญ 4,784
Windows GUI Automation with Python (based on text properties)
๐Ÿ”— [pywinauto.github.io](http://pywinauto.github.io/)

80. tebelorg/RPA-Python โญ 4,716
Python package for doing RPA

81. pytoolz/toolz โญ 4,583
A functional standard library for Python.
๐Ÿ”— [toolz.readthedocs.org](http://toolz.readthedocs.org/)

82. hhatto/autopep8 โญ 4,542
A tool that automatically formats Python code to conform to the PEP 8 style guide.
๐Ÿ”— [pypi.org/project/autopep8](https://pypi.org/project/autopep8/)

83. pyinvoke/invoke โญ 4,322
Pythonic task management & command execution.
๐Ÿ”— [pyinvoke.org](http://pyinvoke.org)

84. bogdanp/dramatiq โญ 4,164
A fast and reliable background task processing library for Python 3.
๐Ÿ”— [dramatiq.io](https://dramatiq.io)

85. evhub/coconut โญ 4,003
Coconut (coconut-lang.org) is a variant of Python that adds on top of Python syntax new features for simple, elegant, Pythonic functional programming.
๐Ÿ”— [coconut-lang.org](http://coconut-lang.org)

86. adafruit/circuitpython โญ 3,969
CircuitPython - a Python implementation for teaching coding with microcontrollers
๐Ÿ”— [circuitpython.org](https://circuitpython.org)

87. ashleve/lightning-hydra-template โญ 3,889
PyTorch Lightning + Hydra. A very user-friendly template for ML experimentation. โšก๐Ÿ”ฅโšก

88. miguelgrinberg/python-socketio โญ 3,866
Python Socket.IO server and client

89. joblib/joblib โญ 3,749
Computing with Python functions.
๐Ÿ”— [joblib.readthedocs.org](http://joblib.readthedocs.org)

90. rspeer/python-ftfy โญ 3,733
Fixes mojibake and other glitches in Unicode text, after the fact.
๐Ÿ”— [ftfy.readthedocs.org](http://ftfy.readthedocs.org)

91. pyinfra-dev/pyinfra โญ 3,717
pyinfra turns Python code into shell commands and runs them on your servers. Execute ad-hoc commands and write declarative operations. Target SSH servers, local machine and Docker containers. Fast and scales from one server to thousands.
๐Ÿ”— [pyinfra.com](https://pyinfra.com)

92. ets-labs/python-dependency-injector โญ 3,717
Dependency injection framework for Python
๐Ÿ”— [python-dependency-injector.ets-labs.org](https://python-dependency-injector.ets-labs.org/)

93. python-markdown/markdown โญ 3,672
A Python implementation of John Gruberโ€™s Markdown with Extension support.
๐Ÿ”— [python-markdown.github.io](https://python-markdown.github.io/)

94. zeromq/pyzmq โญ 3,619
PyZMQ: Python bindings for zeromq
๐Ÿ”— [zguide.zeromq.org/py:all](http://zguide.zeromq.org/py:all)

95. more-itertools/more-itertools โญ 3,612
More routines for operating on iterables, beyond itertools
๐Ÿ”— [more-itertools.rtfd.io](https://more-itertools.rtfd.io)

96. blealtan/efficient-kan โญ 3,606
An efficient pure-PyTorch implementation of Kolmogorov-Arnold Network (KAN).

97. pyo3/maturin โญ 3,517
Build and publish crates with pyo3, cffi and uniffi bindings as well as rust binaries as python packages
๐Ÿ”— [maturin.rs](https://maturin.rs)

98. pypi/warehouse โญ 3,514
The Python Package Index
๐Ÿ”— [pypi.org](https://pypi.org)

99. pydata/xarray โญ 3,510
N-D labeled arrays and datasets in Python
๐Ÿ”— [xarray.dev](https://xarray.dev)

100. tartley/colorama โญ 3,481
Simple cross-platform colored terminal text in Python

101. osohq/oso โญ 3,448
Oso is a batteries-included framework for building authorization in your application.
๐Ÿ”— [docs.osohq.com](https://docs.osohq.com)

102. jorisschellekens/borb โญ 3,339
borb is a library for reading, creating and manipulating PDF files in python.
๐Ÿ”— [borbpdf.com](https://borbpdf.com/)

103. suor/funcy โญ 3,324
A fancy and practical functional tools

104. pyserial/pyserial โญ 3,158
Python serial port access library

105. spotify/basic-pitch โญ 3,151
A lightweight yet powerful audio-to-MIDI converter with pitch bend detection
๐Ÿ”— [basicpitch.io](https://basicpitch.io)

106. camelot-dev/camelot โญ 2,807
A Python library to extract tabular data from PDFs
๐Ÿ”— [camelot-py.readthedocs.io](https://camelot-py.readthedocs.io)

107. legrandin/pycryptodome โญ 2,750
A self-contained cryptographic library for Python
๐Ÿ”— [www.pycryptodome.org](https://www.pycryptodome.org)

108. tox-dev/pipdeptree โญ 2,735
A command line utility to display dependency tree of the installed Python packages
๐Ÿ”— [pypi.python.org/pypi/pipdeptree](https://pypi.python.org/pypi/pipdeptree)

109. lxml/lxml โญ 2,624
The lxml XML toolkit for Python
๐Ÿ”— [lxml.de](https://lxml.de/)

110. liiight/notifiers โญ 2,623
The easy way to send notifications
๐Ÿ”— [notifiers.readthedocs.io](http://notifiers.readthedocs.io/)

111. whylabs/whylogs โญ 2,595
An open-source data logging library for machine learning models and data pipelines. ๐Ÿ“š Provides visibility into data quality & model performance over time. ๐Ÿ›ก๏ธ Supports privacy-preserving data collection, ensuring safety & robustness. ๐Ÿ“ˆ
๐Ÿ”— [whylogs.readthedocs.io](https://whylogs.readthedocs.io/)

112. pexpect/pexpect โญ 2,565
A Python module for controlling interactive programs in a pseudo-terminal
๐Ÿ”— [pexpect.readthedocs.io](http://pexpect.readthedocs.io/)

113. litl/backoff โญ 2,531
Python library providing function decorators for configurable backoff and retry

114. pyston/pyston โญ 2,505
A faster and highly-compatible implementation of the Python programming language.
๐Ÿ”— [www.pyston.org](https://www.pyston.org/)

115. scrapinghub/dateparser โญ 2,505
python parser for human readable dates

116. yaml/pyyaml โญ 2,482
Canonical source repository for PyYAML

117. dosisod/refurb โญ 2,464
A tool for refurbishing and modernizing Python codebases

118. cdgriffith/Box โญ 2,411
Python dictionaries with advanced dot notation access
๐Ÿ”— [github.com/cdgriffith/box/wiki](https://github.com/cdgriffith/Box/wiki)

119. pypa/setuptools โญ 2,390
Official project repository for the Setuptools build system
๐Ÿ”— [pypi.org/project/setuptools](https://pypi.org/project/setuptools/)

120. nschloe/tikzplotlib โญ 2,356
๐Ÿ“Š Save matplotlib figures as TikZ/PGFplots for smooth integration into LaTeX.

121. hgrecco/pint โญ 2,329
Operate and manipulate physical quantities in Python
๐Ÿ”— [pint.readthedocs.org](http://pint.readthedocs.org/)

122. dateutil/dateutil โญ 2,287
Useful extensions to the standard Python datetime features

123. grantjenks/python-diskcache โญ 2,251
Python disk-backed cache (Django-compatible). Faster than Redis and Memcached. Pure-Python.
๐Ÿ”— [www.grantjenks.com/docs/diskcache](http://www.grantjenks.com/docs/diskcache/)

124. pndurette/gTTS โญ 2,198
Python library and CLI tool to interface with Google Translate's text-to-speech API
๐Ÿ”— [gtts.readthedocs.org](http://gtts.readthedocs.org/)

125. kiminewt/pyshark โญ 2,175
Python wrapper for tshark, allowing python packet parsing using wireshark dissectors

126. ianmiell/shutit โญ 2,149
Automation framework for programmers
๐Ÿ”— [ianmiell.github.io/shutit](http://ianmiell.github.io/shutit/)

127. pyparsing/pyparsing โญ 2,147
Python library for creating PEG parsers

128. jcrist/msgspec โญ 2,099
A fast serialization and validation library, with builtin support for JSON, MessagePack, YAML, and TOML
๐Ÿ”— [jcristharif.com/msgspec](https://jcristharif.com/msgspec/)

129. libaudioflux/audioFlux โญ 2,096
A library for audio and music analysis, feature extraction.
๐Ÿ”— [audioflux.top](https://audioflux.top)

130. astanin/python-tabulate โญ 2,053
Pretty-print tabular data in Python, a library and a command-line utility. Repository migrated from bitbucket.org/astanin/python-tabulate.
๐Ÿ”— [pypi.org/project/tabulate](https://pypi.org/project/tabulate/)

131. google/gin-config โญ 2,017
Gin provides a lightweight configuration framework for Python

132. grahamdumpleton/wrapt โญ 2,012
A Python module for decorators, wrappers and monkey patching.

133. nateshmbhat/pyttsx3 โญ 2,001
Offline Text To Speech synthesis for python

134. pyfilesystem/pyfilesystem2 โญ 1,966
Python's Filesystem abstraction layer
๐Ÿ”— [www.pyfilesystem.org](https://www.pyfilesystem.org)

135. landscapeio/prospector โญ 1,922
Inspects Python source files and provides information about type and location of classes, methods etc

136. python-rope/rope โญ 1,890
a python refactoring library

137. mitmproxy/pdoc โญ 1,880
API Documentation for Python Projects
๐Ÿ”— [pdoc.dev](https://pdoc.dev)

138. julienpalard/Pipe โญ 1,878
A Python library to use infix notation in Python

139. omry/omegaconf โญ 1,876
Flexible Python configuration system. The last one you will ever need.

140. rhettbull/osxphotos โญ 1,874
Python app to work with pictures and associated metadata from Apple Photos on macOS. Also includes a package to provide programmatic access to the Photos library, pictures, and metadata.

141. numba/llvmlite โญ 1,870
A lightweight LLVM python binding for writing JIT compilers
๐Ÿ”— [llvmlite.pydata.org](https://llvmlite.pydata.org/)

142. chaostoolkit/chaostoolkit โญ 1,858
Chaos Engineering Toolkit & Orchestration for Developers
๐Ÿ”— [chaostoolkit.org](https://chaostoolkit.org)

143. carpedm20/emoji โญ 1,853
emoji terminal output for Python

144. pydoit/doit โญ 1,813
task management & automation tool
๐Ÿ”— [pydoit.org](http://pydoit.org)

145. joowani/binarytree โญ 1,810
Python Library for Studying Binary Trees
๐Ÿ”— [binarytree.readthedocs.io](http://binarytree.readthedocs.io)

146. pygments/pygments โญ 1,765
Pygments is a generic syntax highlighter written in Python
๐Ÿ”— [pygments.org](http://pygments.org/)

147. kalliope-project/kalliope โญ 1,709
Kalliope is a framework that will help you to create your own personal assistant.
๐Ÿ”— [kalliope-project.github.io](https://kalliope-project.github.io/)

148. pydantic/logfire โญ 1,707
Uncomplicated Observability for Python and beyond! ๐Ÿชต๐Ÿ”ฅ
๐Ÿ”— [docs.pydantic.dev/logfire](https://docs.pydantic.dev/logfire/)

149. konradhalas/dacite โญ 1,697
Simple creation of data classes from dictionaries.

150. open-telemetry/opentelemetry-python โญ 1,676
OpenTelemetry Python API and SDK
๐Ÿ”— [opentelemetry.io](https://opentelemetry.io)

151. samuelcolvin/watchfiles โญ 1,673
Simple, modern and fast file watching and code reload in python.
๐Ÿ”— [watchfiles.helpmanual.io](https://watchfiles.helpmanual.io)

152. home-assistant/supervisor โญ 1,672
๐Ÿก Home Assistant Supervisor
๐Ÿ”— [home-assistant.io/hassio](https://home-assistant.io/hassio/)

153. p0dalirius/Coercer โญ 1,669
A python script to automatically coerce a Windows server to authenticate on an arbitrary machine through 12 methods.
๐Ÿ”— [podalirius.net](https://podalirius.net/)

154. hbldh/bleak โญ 1,644
A cross platform Bluetooth Low Energy Client for Python using asyncio

155. mkdocstrings/mkdocstrings โญ 1,632
๐Ÿ“˜ Automatic documentation from sources, for MkDocs.
๐Ÿ”— [mkdocstrings.github.io](https://mkdocstrings.github.io/)

156. rubik/radon โญ 1,618
Various code metrics for Python code
๐Ÿ”— [radon.readthedocs.org](http://radon.readthedocs.org/)

157. facebookincubator/Bowler โญ 1,527
Safe code refactoring for modern Python.
๐Ÿ”— [pybowler.io](https://pybowler.io/)

158. nficano/python-lambda โญ 1,487
A toolkit for developing and deploying serverless Python code in AWS Lambda.

159. quodlibet/mutagen โญ 1,485
Python module for handling audio metadata
๐Ÿ”— [mutagen.readthedocs.io](https://mutagen.readthedocs.io)

160. instagram/LibCST โญ 1,472
A concrete syntax tree parser and serializer library for Python that preserves many aspects of Python's abstract syntax tree
๐Ÿ”— [libcst.readthedocs.io](https://libcst.readthedocs.io/)

161. lcompilers/lpython โญ 1,470
Python compiler
๐Ÿ”— [lpython.org](https://lpython.org/)

162. fabiocaccamo/python-benedict โญ 1,458
๐Ÿ“˜ dict subclass with keylist/keypath support, built-in I/O operations (base64, csv, html, ini, json, pickle, plist, query-string, toml, xls, xml, yaml), s3 support and many utilities.

163. aws-samples/aws-glue-samples โญ 1,410
AWS Glue code samples

164. pycqa/pyflakes โญ 1,352
A simple program which checks Python source files for errors
๐Ÿ”— [pypi.org/project/pyflakes](https://pypi.org/project/pyflakes)

165. lidatong/dataclasses-json โญ 1,342
Easily serialize Data Classes to and from JSON

166. ossf/criticality_score โญ 1,304
Gives criticality score for an open source project

167. brandon-rhodes/python-patterns โญ 1,288
Source code behind the python-patterns.guide site by Brandon Rhodes

168. aio-libs/yarl โญ 1,271
Yet another URL library
๐Ÿ”— [yarl.aio-libs.org](https://yarl.aio-libs.org)

169. jaraco/keyring โญ 1,227
The Python keyring library provides an easy way to access the system keyring service from python. It can be used in any application that needs safe password storage.

170. anthropics/anthropic-sdk-python โญ 1,189
SDK providing access to Anthropic's safety-first language model APIs

171. oracle/graalpython โญ 1,168
A Python 3 implementation built on GraalVM

172. pdoc3/pdoc โญ 1,113
๐Ÿ :arrow_right: ๐Ÿ“œ Auto-generate API documentation for Python projects
๐Ÿ”— [pdoc3.github.io/pdoc](https://pdoc3.github.io/pdoc/)

173. pyo3/rust-numpy โญ 1,064
PyO3-based Rust bindings of the NumPy C-API

174. c4urself/bump2version โญ 1,049
Version-bump your software with a single command
๐Ÿ”— [pypi.python.org/pypi/bump2version](https://pypi.python.org/pypi/bump2version)

175. metachris/logzero โญ 1,028
Robust and effective logging for Python 2 and 3.
๐Ÿ”— [logzero.readthedocs.io](https://logzero.readthedocs.io)

176. pyfpdf/fpdf2 โญ 1,013
Simple PDF generation for Python
๐Ÿ”— [py-pdf.github.io/fpdf2](https://py-pdf.github.io/fpdf2/)

177. fsspec/filesystem_spec โญ 956
A specification that python filesystems should adhere to.

178. milvus-io/pymilvus โญ 951
Python SDK for Milvus.

179. fastai/fastcore โญ 912
Python supercharged for the fastai library
๐Ÿ”— [fastcore.fast.ai](http://fastcore.fast.ai)

180. lastmile-ai/aiconfig โญ 903
AIConfig saves prompts, models and model parameters as source control friendly configs. This allows you to iterate on prompts and model parameters separately from your application code.
๐Ÿ”— [aiconfig.lastmileai.dev](https://aiconfig.lastmileai.dev)

181. alex-sherman/unsync โญ 872
Unsynchronize asyncio

182. pypy/pypy โญ 846
PyPy is a very fast and compliant implementation of the Python language.
๐Ÿ”— [pypy.org](https://pypy.org)

183. samuelcolvin/dirty-equals โญ 782
Doing dirty (but extremely useful) things with equals.
๐Ÿ”— [dirty-equals.helpmanual.io](https://dirty-equals.helpmanual.io)

184. barracuda-fsh/pyobd โญ 777
An OBD-II compliant car diagnostic tool

185. pypa/build โญ 705
A simple, correct Python build frontend
๐Ÿ”— [build.pypa.io](https://build.pypa.io)

186. qdrant/qdrant-client โญ 690
Python client for Qdrant vector search engine
๐Ÿ”— [qdrant.tech](https://qdrant.tech)

187. ariebovenberg/whenever โญ 682
โฐ Modern datetime library for Python, written in Rust
๐Ÿ”— [whenever.rtfd.io](https://whenever.rtfd.io)

188. open-telemetry/opentelemetry-python-contrib โญ 659
OpenTelemetry instrumentation for Python modules
๐Ÿ”— [opentelemetry.io](https://opentelemetry.io)

189. instagram/Fixit โญ 659
Advanced Python linting framework with auto-fixes and hierarchical configuration that makes it easy to write custom in-repo lint rules.
๐Ÿ”— [fixit.rtfd.io/en/latest](https://fixit.rtfd.io/en/latest/)

190. modal-labs/modal-examples โญ 647
Examples of programs built using Modal
๐Ÿ”— [modal.com/docs](https://modal.com/docs)

191. gefyrahq/gefyra โญ 645
Blazingly-fast ๐Ÿš€, rock-solid, local application development :arrow_right: with Kubernetes.
๐Ÿ”— [gefyra.dev](https://gefyra.dev)

192. methexis-inc/terminal-copilot โญ 565
A smart terminal assistant that helps you find the right command.

193. platformdirs/platformdirs โญ 541
A small Python module for determining appropriate platform-specific dirs, e.g. a "user data dir".
๐Ÿ”— [platformdirs.readthedocs.io](https://platformdirs.readthedocs.io)

194. argoproj-labs/hera โญ 531
Hera is an Argo Python SDK. Hera aims to make construction and submission of various Argo Project resources easy and accessible to everyone! Hera abstracts away low-level setup details while still maintaining a consistent vocabulary with Argo. โญ๏ธ Remember to star!
๐Ÿ”— [hera.rtfd.io](https://hera.rtfd.io)

195. fastai/ghapi โญ 527
A delightful and complete interface to GitHub's amazing API
๐Ÿ”— [ghapi.fast.ai](https://ghapi.fast.ai/)

196. steamship-core/steamship-langchain โญ 507
steamship-langchain

197. google/pyglove โญ 321
Manipulating Python Programs

## Vizualisation

Vizualisation tools and libraries. Application frameworks, 2D/3D plotting, dashboards, WebGL.

1. apache/superset โญ 60,738
Apache Superset is a Data Visualization and Data Exploration Platform
๐Ÿ”— [superset.apache.org](https://superset.apache.org/)

2. streamlit/streamlit โญ 33,438
Streamlit โ€” A faster way to build and share data apps.
๐Ÿ”— [streamlit.io](https://streamlit.io)

3. gradio-app/gradio โญ 31,019
Build and share delightful machine learning apps, all in Python. ๐ŸŒŸ Star to support our work!
๐Ÿ”— [www.gradio.app](http://www.gradio.app)

4. plotly/dash โญ 20,910
Data Apps & Dashboards for Python. No JavaScript Required.
๐Ÿ”— [plotly.com/dash](https://plotly.com/dash)

5. matplotlib/matplotlib โญ 19,692
matplotlib: plotting with Python
๐Ÿ”— [matplotlib.org/stable](https://matplotlib.org/stable/)

6. bokeh/bokeh โญ 19,065
Interactive Data Visualization in the browser, from Python
๐Ÿ”— [bokeh.org](https://bokeh.org)

7. plotly/plotly.py โญ 15,732
The interactive graphing library for Python โœจ This project now includes Plotly Express!
๐Ÿ”— [plotly.com/python](https://plotly.com/python/)

8. mwaskom/seaborn โญ 12,216
Statistical data visualization in Python
๐Ÿ”— [seaborn.pydata.org](https://seaborn.pydata.org)

9. visgl/deck.gl โญ 11,934
WebGL2 powered visualization framework
๐Ÿ”— [deck.gl](https://deck.gl)

10. marceloprates/prettymaps โญ 11,038
A small set of Python functions to draw pretty maps from OpenStreetMap data. Based on osmnx, matplotlib and shapely libraries.

11. altair-viz/altair โญ 9,082
Declarative statistical visualization library for Python
๐Ÿ”— [altair-viz.github.io](https://altair-viz.github.io/)

12. nvidia/TensorRT-LLM โญ 7,510
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT...
๐Ÿ”— [nvidia.github.io/tensorrt-llm](https://nvidia.github.io/TensorRT-LLM)

13. lux-org/lux โญ 5,075
Automatically visualize your pandas dataframe via a single print! ๐Ÿ“Š ๐Ÿ’ก

14. renpy/renpy โญ 4,757
The Ren'Py Visual Novel Engine
๐Ÿ”— [www.renpy.org](http://www.renpy.org/)

15. man-group/dtale โญ 4,639
Visualizer for pandas data structures
๐Ÿ”— [alphatechadmin.pythonanywhere.com](http://alphatechadmin.pythonanywhere.com)

16. holoviz/panel โญ 4,466
Panel: The powerful data exploration & web app framework for Python
๐Ÿ”— [panel.holoviz.org](https://panel.holoviz.org)

17. has2k1/plotnine โญ 3,902
A Grammar of Graphics for Python
๐Ÿ”— [plotnine.org](https://plotnine.org)

18. residentmario/missingno โญ 3,864
missingno provides a small toolset of flexible and easy-to-use missing data visualizations and utilities that allows you to get a quick visual summary of the completeness (or lack thereof) of your dataset.

19. pyqtgraph/pyqtgraph โญ 3,777
Fast data visualization and GUI tools for scientific / engineering applications
๐Ÿ”— [www.pyqtgraph.org](https://www.pyqtgraph.org)

20. vispy/vispy โญ 3,262
Main repository for Vispy
๐Ÿ”— [vispy.org](http://vispy.org)

21. ml-tooling/opyrator โญ 3,044
๐Ÿช„ Turns your machine learning code into microservices with web API, interactive GUI, and more.
๐Ÿ”— [opyrator-playground.mltooling.org](https://opyrator-playground.mltooling.org)

22. netflix/flamescope โญ 2,991
FlameScope is a visualization tool for exploring different time ranges as Flame Graphs.

23. facebookresearch/hiplot โญ 2,722
HiPlot makes understanding high dimensional data easy
๐Ÿ”— [facebookresearch.github.io/hiplot](https://facebookresearch.github.io/hiplot/)

24. holoviz/holoviews โญ 2,665
With Holoviews, your data visualizes itself.
๐Ÿ”— [holoviews.org](https://holoviews.org)

25. kozea/pygal โญ 2,619
pygal is a dynamic SVG charting library written in python.
๐Ÿ”— [www.pygal.org](https://www.pygal.org)

26. pyvista/pyvista โญ 2,485
3D plotting and mesh analysis through a streamlined interface for the Visualization Toolkit (VTK)
๐Ÿ”— [docs.pyvista.org](https://docs.pyvista.org)

27. mckinsey/vizro โญ 2,480
Vizro is a toolkit for creating modular data visualization applications.
๐Ÿ”— [vizro.readthedocs.io/en/stable](https://vizro.readthedocs.io/en/stable/)

28. marcomusy/vedo โญ 1,985
A python module for scientific analysis of 3D data based on VTK and Numpy
๐Ÿ”— [vedo.embl.es](https://vedo.embl.es)

29. datapane/datapane โญ 1,362
Build and share data reports in 100% Python
๐Ÿ”— [datapane.com](https://datapane.com)

30. facultyai/dash-bootstrap-components โญ 1,090
Bootstrap components for Plotly Dash
๐Ÿ”— [dash-bootstrap-components.opensource.faculty.ai](https://dash-bootstrap-components.opensource.faculty.ai/)

31. nomic-ai/deepscatter โญ 1,001
Zoomable, animated scatterplots in the browser that scales over a billion points

32. holoviz/holoviz โญ 821
High-level tools to simplify visualization in Python.
๐Ÿ”— [holoviz.org](https://holoviz.org/)

33. hazyresearch/meerkat โญ 818
Creative interactive views of any dataset.

## Web

Web related frameworks and libraries: webapp servers, WSGI, ASGI, asyncio, HTTP, REST, user management.

1. django/django โญ 78,066
The Web framework for perfectionists with deadlines.
๐Ÿ”— [www.djangoproject.com](https://www.djangoproject.com/)

2. tiangolo/fastapi โญ 73,629
FastAPI framework, high performance, easy to learn, fast to code, ready for production
๐Ÿ”— [fastapi.tiangolo.com](https://fastapi.tiangolo.com/)

3. pallets/flask โญ 67,093
The Python micro framework for building web applications.
๐Ÿ”— [flask.palletsprojects.com](https://flask.palletsprojects.com)

4. sherlock-project/sherlock โญ 52,780
Hunt down social media accounts by username across social networks
๐Ÿ”— [sherlockproject.xyz](https://sherlockproject.xyz)

5. psf/requests โญ 51,778
A simple, yet elegant, HTTP library.
๐Ÿ”— [requests.readthedocs.io/en/latest](https://requests.readthedocs.io/en/latest/)

6. tornadoweb/tornado โญ 21,621
Tornado is a Python web framework and asynchronous networking library, originally developed at FriendFeed.
๐Ÿ”— [www.tornadoweb.org](http://www.tornadoweb.org/)

7. reflex-dev/reflex โญ 18,075
๐Ÿ•ธ๏ธ Web apps in pure Python ๐Ÿ
๐Ÿ”— [reflex.dev](https://reflex.dev)

8. huge-success/sanic โญ 17,899
Accelerate your web app development | Build fast. Run fast.
๐Ÿ”— [sanic.dev](https://sanic.dev)

9. wagtail/wagtail โญ 17,641
A Django content management system focused on flexibility and user experience
๐Ÿ”— [wagtail.org](https://wagtail.org)

10. pyscript/pyscript โญ 17,573
A framework that allows users to create rich Python applications in the browser using HTML's interface and the power of Pyodide, WASM, and modern web technologies.
๐Ÿ”— [pyscript.net](https://pyscript.net/)

11. aio-libs/aiohttp โญ 14,810
Asynchronous HTTP client/server framework for asyncio and Python
๐Ÿ”— [docs.aiohttp.org](https://docs.aiohttp.org)

12. vincigit00/Scrapegraph-ai โญ 13,299
ScrapeGraphAI is a web scraping python library that uses LLM and direct graph logic to create scraping pipelines for websites and local documents
๐Ÿ”— [scrapegraphai.com](https://scrapegraphai.com)

13. encode/httpx โญ 12,756
A next generation HTTP client for Python. ๐Ÿฆ‹
๐Ÿ”— [www.python-httpx.org](https://www.python-httpx.org/)

14. getpelican/pelican โญ 12,404
Static site generator that supports Markdown and reST syntax. Powered by Python.
๐Ÿ”— [getpelican.com](https://getpelican.com)

15. aws/chalice โญ 10,536
Python Serverless Microframework for AWS

16. flet-dev/flet โญ 10,308
Flet enables developers to easily build realtime web, mobile and desktop apps in Python. No frontend experience required.
๐Ÿ”— [flet.dev](https://flet.dev)

17. encode/starlette โญ 9,812
The little ASGI framework that shines. ๐ŸŒŸ
๐Ÿ”— [www.starlette.io](https://www.starlette.io/)

18. benoitc/gunicorn โญ 9,647
gunicorn 'Green Unicorn' is a WSGI HTTP Server for UNIX, fast clients and sleepy applications.
๐Ÿ”— [www.gunicorn.org](http://www.gunicorn.org)

19. falconry/falcon โญ 9,447
The no-magic web data plane API and microservices framework for Python developers, with a focus on reliability, correctness, and performance at scale.
๐Ÿ”— [falcon.readthedocs.io/en/stable](https://falcon.readthedocs.io/en/stable/)

20. bottlepy/bottle โญ 8,341
bottle.py is a fast and simple micro-framework for python web-applications.
๐Ÿ”— [bottlepy.org](http://bottlepy.org/)

21. zauberzeug/nicegui โญ 8,337
Create web-based user interfaces with Python. The nice way.
๐Ÿ”— [nicegui.io](https://nicegui.io)

22. encode/uvicorn โญ 8,163
An ASGI web server, for Python. ๐Ÿฆ„
๐Ÿ”— [www.uvicorn.org](https://www.uvicorn.org/)

23. graphql-python/graphene โญ 8,031
GraphQL framework for Python
๐Ÿ”— [graphene-python.org](http://graphene-python.org/)

24. reactive-python/reactpy โญ 7,767
ReactPy is a library for building user interfaces in Python without Javascript
๐Ÿ”— [reactpy.dev](https://reactpy.dev)

25. pyeve/eve โญ 6,676
REST API framework designed for human beings
๐Ÿ”— [python-eve.org](https://python-eve.org)

26. vitalik/django-ninja โญ 6,642
๐Ÿ’จ Fast, Async-ready, Openapi, type hints based framework for building APIs
๐Ÿ”— [django-ninja.dev](https://django-ninja.dev)

27. pallets/werkzeug โญ 6,592
The comprehensive WSGI web application library.
๐Ÿ”— [werkzeug.palletsprojects.com](https://werkzeug.palletsprojects.com)

28. webpy/webpy โญ 5,882
web.py is a web framework for python that is as simple as it is powerful.
๐Ÿ”— [webpy.org](http://webpy.org)

29. starlite-api/litestar โญ 5,151
Production-ready, Light, Flexible and Extensible ASGI API framework | Effortlessly Build Performant APIs
๐Ÿ”— [litestar.dev](https://litestar.dev/)

30. stephenmcd/mezzanine โญ 4,742
CMS framework for Django
๐Ÿ”— [mezzanine.jupo.org](http://mezzanine.jupo.org)

31. nameko/nameko โญ 4,682
A microservices framework for Python that lets service developers concentrate on application logic and encourages testability.
๐Ÿ”— [www.nameko.io](https://www.nameko.io)

32. pywebio/PyWebIO โญ 4,440
Write interactive web app in script way.
๐Ÿ”— [pywebio.readthedocs.io](https://pywebio.readthedocs.io)

33. fastapi-users/fastapi-users โญ 4,331
Ready-to-use and customizable users management for FastAPI
๐Ÿ”— [fastapi-users.github.io/fastapi-users](https://fastapi-users.github.io/fastapi-users/)

34. pylons/pyramid โญ 3,923
Pyramid - A Python web framework
๐Ÿ”— [trypyramid.com](https://trypyramid.com/)

35. h2oai/wave โญ 3,903
H2O Wave is a software stack for building beautiful, low-latency, realtime, browser-based applications and dashboards entirely in Python/R without using HTML, Javascript, or CSS.
๐Ÿ”— [wave.h2o.ai](https://wave.h2o.ai)

36. strawberry-graphql/strawberry โญ 3,863
A GraphQL library for Python that leverages type annotations ๐Ÿ“
๐Ÿ”— [strawberry.rocks](https://strawberry.rocks)

37. websocket-client/websocket-client โญ 3,522
WebSocket client for Python
๐Ÿ”— [github.com/websocket-client/websocket-client](https://github.com/websocket-client/websocket-client)

38. unbit/uwsgi โญ 3,431
uWSGI application server container
๐Ÿ”— [projects.unbit.it/uwsgi](http://projects.unbit.it/uwsgi)

39. pallets/quart โญ 2,774
An async Python micro framework for building web applications.
๐Ÿ”— [quart.palletsprojects.com](https://quart.palletsprojects.com)

40. fastapi-admin/fastapi-admin โญ 2,713
A fast admin dashboard based on FastAPI and TortoiseORM with tabler ui, inspired by Django admin
๐Ÿ”— [fastapi-admin-docs.long2ice.io](https://fastapi-admin-docs.long2ice.io)

41. flipkart-incubator/Astra โญ 2,459
Automated Security Testing For REST API's

42. masoniteframework/masonite โญ 2,168
The Modern And Developer Centric Python Web Framework. Be sure to read the documentation and join the Discord channel for questions: https://discord.gg/TwKeFahmPZ
๐Ÿ”— [docs.masoniteproject.com](http://docs.masoniteproject.com)

43. dot-agent/nextpy โญ 2,164
๐Ÿค–Self-Modifying Framework from the Future ๐Ÿ”ฎ World's First AMS
๐Ÿ”— [dotagent.ai](https://dotagent.ai)

44. python-restx/flask-restx โญ 2,119
Fork of Flask-RESTPlus: Fully featured framework for fast, easy and documented API development with Flask
๐Ÿ”— [flask-restx.readthedocs.io/en/latest](https://flask-restx.readthedocs.io/en/latest/)

45. dmontagu/fastapi-utils โญ 1,835
Reusable utilities for FastAPI: a number of utilities to help reduce boilerplate and reuse common functionality across projects

46. s3rius/FastAPI-template โญ 1,818
Feature rich robust FastAPI template.

47. cherrypy/cherrypy โญ 1,813
CherryPy is a pythonic, object-oriented HTTP framework. https://cherrypy.dev
๐Ÿ”— [docs.cherrypy.dev](https://docs.cherrypy.dev)

48. neoteroi/BlackSheep โญ 1,804
Fast ASGI web framework for Python
๐Ÿ”— [www.neoteroi.dev/blacksheep](https://www.neoteroi.dev/blacksheep/)

49. indico/indico โญ 1,703
Indico - A feature-rich event management system, made @ CERN, the place where the Web was born.
๐Ÿ”— [getindico.io](https://getindico.io)

50. jordaneremieff/mangum โญ 1,658
AWS Lambda support for ASGI applications
๐Ÿ”— [mangum.io](https://mangum.io/)

51. wtforms/wtforms โญ 1,487
A flexible forms validation and rendering library for Python.
๐Ÿ”— [wtforms.readthedocs.io](https://wtforms.readthedocs.io)

52. awtkns/fastapi-crudrouter โญ 1,346
A dynamic FastAPI router that automatically creates CRUD routes for your models
๐Ÿ”— [fastapi-crudrouter.awtkns.com](https://fastapi-crudrouter.awtkns.com)

53. long2ice/fastapi-cache โญ 1,208
fastapi-cache is a tool to cache fastapi response and function result, with backends support redis and memcached.
๐Ÿ”— [github.com/long2ice/fastapi-cache](https://github.com/long2ice/fastapi-cache)

54. magicstack/httptools โญ 1,182
Fast HTTP parser

55. rstudio/py-shiny โญ 1,123
Shiny for Python
๐Ÿ”— [shiny.posit.co/py](https://shiny.posit.co/py/)

56. whitphx/stlite โญ 1,099
A port of Streamlit to WebAssembly, powered by Pyodide.
๐Ÿ”— [edit.share.stlite.net](https://edit.share.stlite.net)

57. koxudaxi/fastapi-code-generator โญ 967
This code generator creates FastAPI app from an openapi file.

58. aeternalis-ingenium/FastAPI-Backend-Template โญ 590
A backend project template with FastAPI, PostgreSQL with asynchronous SQLAlchemy 2.0, Alembic for asynchronous database migration, and Docker.

---

Interactive version: [www.awesomepython.org](https://www.awesomepython.org/), Hugging Face Dataset: [awesome-python](https://huggingface.co/datasets/dylanhogg/awesome-python)

Please raise a new issue to suggest a Python repo that you would like to see added.

1,512 hand-picked awesome Python libraries and frameworks, updated 20 Jul 2024

[![Hits](https://hits.seeyoufarm.com/api/count/incr/badge.svg?url=https%3A%2F%2Fgithub.com%2Fdylanhogg%2Fawesome-python&count_bg=%2379C83D&title_bg=%23555555&icon=&icon_color=%23E7E7E7&title=hits&edge_flat=false)](https://hits.seeyoufarm.com)