# Awesome Python

[![Awesome](https://awesome.re/badge.svg)](https://awesome.re) ![Last commit](https://img.shields.io/github/last-commit/dylanhogg/awesome-python) [![License: MIT](https://img.shields.io/badge/license-MIT-green.svg)](https://opensource.org/licenses/MIT)

Hand-picked awesome Python libraries and frameworks, organised by category 🐍

Interactive version: [www.awesomepython.org](https://www.awesomepython.org/)

Updated 21 Jan 2025

## Categories

- [Newly Created Repositories](#newly-created-repositories) - Awesome Python is regularly updated, and this category lists the most recently created GitHub repositories from all the other repositories here (10 repos)
- [Agentic AI](#agentic-ai) - Agentic AI libraries, frameworks and tools: AI agents, workflows, autonomous decision-making, goal-oriented tasks, and API integrations (41 repos)
- [Code Quality](#code-quality) - Code quality tooling: linters, formatters, pre-commit hooks, unused code removal (16 repos)
- [Crypto and Blockchain](#crypto-and-blockchain) - Cryptocurrency and blockchain libraries: trading bots, API integration, Ethereum virtual machine, solidity (13 repos)
- [Data](#data) - General data libraries: data processing, serialisation, formats, databases, SQL, connectors, web crawlers, data generation/augmentation/checks (113 repos)
- [Debugging](#debugging) - Debugging and tracing tools (10 repos)
- [Diffusion Text to Image](#diffusion-text-to-image) - Text-to-image diffusion model libraries, tools and apps for generating images from natural language (42 repos)
- [Finance](#finance) - Financial and quantitative libraries: investment research tools, market data, algorithmic trading, backtesting, financial derivatives (34 repos)
- [Game Development](#game-development) - Game development tools, engines and libraries (7 repos)
- [GIS](#gis) - Geospatial libraries: raster and vector data formats, interactive mapping and visualisation, computing frameworks for processing images, projections (28 repos)
- [Graph](#graph) - Graphs and network libraries: network analysis, graph machine learning, visualisation (6 repos)
- [GUI](#gui) - Graphical user interface libraries and toolkits (8 repos)
- [Jupyter](#jupyter) - Jupyter, JupyterLab and Notebook tools, libraries and plugins (27 repos)
- [LLMs and ChatGPT](#llms-and-chatgpt) - Large language model and GPT libraries and frameworks: auto-gpt, agents, QnA, chain-of-thought workflows, API integrations. Also see the Natural Language Processing category for crossover (300 repos)
- [Math and Science](#math-and-science) - Mathematical, numerical and scientific libraries (30 repos)
- [Machine Learning - General](#machine-learning---general) - General and classical machine learning libraries. See below for other sections covering specialised ML areas (159 repos)
- [Machine Learning - Deep Learning](#machine-learning---deep-learning) - Machine learning libraries that cross over with deep learning in some way (79 repos)
- [Machine Learning - Interpretability](#machine-learning---interpretability) - Machine learning interpretability libraries. Covers explainability, prediction explanations, dashboards, understanding knowledge development in training (22 repos)
- [Machine Learning - Ops](#machine-learning---ops) - MLOps tools, frameworks and libraries: intersection of machine learning, data engineering and DevOps; deployment, health, diagnostics and governance of ML models (43 repos)
- [Machine Learning - Reinforcement](#machine-learning---reinforcement) - Machine learning libraries and toolkits that cross over with reinforcement learning in some way: agent reinforcement learning, agent environments, RLHF (23 repos)
- [Machine Learning - Time Series](#machine-learning---time-series) - Machine learning and classical timeseries libraries: forecasting, seasonality, anomaly detection, econometrics (19 repos)
- [Natural Language Processing](#natural-language-processing) - Natural language processing libraries and toolkits: text processing, topic modelling, tokenisers, chatbots. Also see the LLMs and ChatGPT category for crossover (87 repos)
- [Packaging](#packaging) - Python packaging, dependency management and bundling (28 repos)
- [Pandas](#pandas) - Pandas and dataframe libraries: data analysis, statistical reporting, pandas GUIs, pandas performance optimisations (24 repos)
- [Performance](#performance) - Performance, parallelisation and low level libraries (28 repos)
- [Profiling](#profiling) - Memory and CPU/GPU profiling tools and libraries (11 repos)
- [Security](#security) - Security related libraries: vulnerability discovery, SQL injection, environment auditing (15 repos)
- [Simulation](#simulation) - Simulation libraries: robotics, economic, agent-based, traffic, physics, astronomy, chemistry, quantum simulation. Also see the Math and Science category for crossover (34 repos)
- [Study](#study) - Miscellaneous study resources: algorithms, general resources, system design, code repos for textbooks, best practices, tutorials (59 repos)
- [Template](#template) - Template tools and libraries: cookiecutter repos, generators, quick-starts (9 repos)
- [Terminal](#terminal) - Terminal and console tools and libraries: CLI tools, terminal based formatters, progress bars (15 repos)
- [Testing](#testing) - Testing libraries: unit testing, load testing, acceptance testing, code coverage, browser automation, plugins (24 repos)
- [Typing](#typing) - Typing libraries: static and run-time type checking, annotations (12 repos)
- [Utility](#utility) - General utility libraries: miscellaneous tools, linters, code formatters, version management, package tools, documentation tools (206 repos)
- [Visualisation](#vizualisation) - Visualisation tools and libraries. Application frameworks, 2D/3D plotting, dashboards, WebGL (36 repos)
- [Web](#web) - Web related frameworks and libraries: webapp servers, WSGI, ASGI, asyncio, HTTP, REST, user management (58 repos)

## Newly Created Repositories

Awesome Python is regularly updated, and this category lists the most recently created GitHub repositories from all the other repositories here.

1. deepseek-ai/DeepSeek-V3 ⭐ 16,791
A strong Mixture-of-Experts (MoE) language model with 671B total parameters with 37B activated for each token.

2. tencent/HunyuanVideo ⭐ 7,111
HunyuanVideo: A Systematic Framework For Large Video Generation Model
🔗 [aivideo.hunyuan.tencent.com](https://aivideo.hunyuan.tencent.com)

3. microsoft/TRELLIS ⭐ 6,109
A large 3D asset generation model. It takes in text or image prompts and generates high-quality 3D assets in various formats, such as Radiance Fields, 3D Gaussians, and meshes.
🔗 [trellis3d.github.io](https://trellis3d.github.io)

4. cyclotruc/gitingest ⭐ 4,351
Turn any Git repository into a prompt-friendly text ingest for LLMs.
🔗 [gitingest.com](https://gitingest.com/)

5. huggingface/smolagents ⭐ 3,695
🤗 smolagents: a barebones library for agents. Agents write python code to call tools and orchestrate other agents.
🔗 [huggingface.co/docs/smolagents](https://huggingface.co/docs/smolagents)

6. novasky-ai/SkyThought ⭐ 1,984
Sky-T1: Train your own O1 preview model within $450
🔗 [novasky-ai.github.io](https://novasky-ai.github.io/)

7. facebookresearch/large_concept_model ⭐ 1,471
Large Concept Models: Language modeling in a sentence representation space

8. thytu/Agentarium ⭐ 747
Framework for managing and orchestrating AI agents with ease. Agentarium provides a flexible and intuitive way to create, manage, and coordinate interactions between multiple AI agents in various environments.

9. polymathicai/the_well ⭐ 703
15TB of Physics Simulations: collection of machine learning datasets containing numerical simulations of a wide variety of spatiotemporal physical systems.
🔗 [polymathic-ai.org/the_well](https://polymathic-ai.org/the_well/)

10. neuml/annotateai ⭐ 240
Automatically annotates papers using Large Language Models (LLMs)

## Agentic AI

Agentic AI libraries, frameworks and tools: AI agents, workflows, autonomous decision-making, goal-oriented tasks, and API integrations.

1. hwchase17/langchain ⭐ 97,765
🦜🔗 Build context-aware reasoning applications
🔗 [python.langchain.com](https://python.langchain.com)

2. logspace-ai/langflow ⭐ 42,217
Langflow is a low-code app builder for RAG and multi-agent AI applications. It’s Python-based and agnostic to any model, API, or database.
🔗 [www.langflow.org](http://www.langflow.org)

3. run-llama/llama_index ⭐ 37,830
LlamaIndex is a data framework for your LLM applications
🔗 [docs.llamaindex.ai](https://docs.llamaindex.ai)

4. microsoft/autogen ⭐ 36,898
A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour
🔗 [microsoft.github.io/autogen/0.4.0.dev13](https://microsoft.github.io/autogen/0.4.0.dev13/)

5. openbmb/ChatDev ⭐ 25,934
ChatDev stands as a virtual software company that operates through various intelligent agents holding different roles, including Chief Executive Officer, Chief Product Officer etc
🔗 [arxiv.org/abs/2307.07924](https://arxiv.org/abs/2307.07924)

6. joaomdmoura/crewAI ⭐ 24,079
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
🔗 [crewai.com](https://crewai.com)

7. yoheinakajima/babyagi ⭐ 20,688
GPT-4 powered task-driven autonomous agent
🔗 [babyagi.org](https://babyagi.org/)

8. stanford-oval/storm ⭐ 18,722
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
🔗 [storm.genie.stanford.edu](http://storm.genie.stanford.edu)

9. openai/swarm ⭐ 17,487
A framework exploring ergonomic, lightweight multi-agent orchestration.

10. phidatahq/phidata ⭐ 17,478
Phidata is a toolkit for building AI Assistants using function calling.
🔗 [docs.phidata.com](https://docs.phidata.com)

11. unity-technologies/ml-agents ⭐ 17,415
The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement learning and imitation learning.
🔗 [unity.com/products/machine-learning-agents](https://unity.com/products/machine-learning-agents)

12. assafelovic/gpt-researcher ⭐ 15,562
LLM based autonomous agent that conducts local and web research on any topic and generates a comprehensive report with citations.
🔗 [gptr.dev](https://gptr.dev)

13. browser-use/browser-use ⭐ 15,421
Browser use is the easiest way to connect your AI agents with the browser.
🔗 [browser-use.com](https://browser-use.com/)

14. letta-ai/letta ⭐ 13,740
Letta (formerly MemGPT) is a framework for creating LLM services with memory.
🔗 [letta.com](https://letta.com)

15. smol-ai/developer ⭐ 11,852
the first library to let you embed a developer agent in your own app!
🔗 [twitter.com/smolmodels](https://twitter.com/SmolModels)

16. langchain-ai/langgraph ⭐ 7,817
LangGraph is a library for building stateful, multi-actor applications with LLMs, built on top of (and intended to be used with) LangChain.
🔗 [langchain-ai.github.io/langgraph](https://langchain-ai.github.io/langgraph/)

17. meta-llama/llama-stack ⭐ 5,935
Llama Stack standardizes the building blocks needed to bring genai applications to market. These blocks cover model training and fine-tuning, evaluation, and running AI agents in production

18. prefecthq/marvin ⭐ 5,402
✨ Build AI interfaces that spark joy
🔗 [askmarvin.ai](https://askmarvin.ai)

19. mnotgod96/AppAgent ⭐ 5,319
AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.
🔗 [appagent-official.github.io](https://appagent-official.github.io/)

20. nirdiamant/GenAI_Agents ⭐ 5,216
Tutorials and implementations for various Generative AI Agent techniques, from basic to advanced. It serves as a comprehensive guide for building intelligent, interactive AI systems.

21. pydantic/pydantic-ai ⭐ 4,954
PydanticAI is a Python Agent Framework designed to make it less painful to build production grade applications with Generative AI.
🔗 [ai.pydantic.dev](https://ai.pydantic.dev)

22. kyegomez/swarms ⭐ 4,169
The Enterprise-Grade Production-Ready Multi-Agent Orchestration Framework Join our Community: https://discord.gg/jM3Z6M9uMq
🔗 [docs.swarms.world](https://docs.swarms.world)

23. meta-llama/llama-stack-apps ⭐ 4,047
Agentic components of the Llama Stack APIs

24. huggingface/smolagents ⭐ 3,695
🤗 smolagents: a barebones library for agents. Agents write python code to call tools and orchestrate other agents.
🔗 [huggingface.co/docs/smolagents](https://huggingface.co/docs/smolagents)

25. crewaiinc/crewAI-examples ⭐ 3,302
A collection of examples that show how to use CrewAI framework to automate workflows.

26. langroid/langroid ⭐ 2,886
Harness LLMs with Multi-Agent Programming
🔗 [langroid.github.io/langroid](https://langroid.github.io/langroid/)

27. facebookresearch/Pearl ⭐ 2,737
A Production-ready Reinforcement Learning AI Agent Library brought by the Applied Reinforcement Learning team at Meta.

28. run-llama/llama_deploy ⭐ 1,896
Async-first framework for deploying, scaling, and productionizing agentic multi-service systems based on workflows from llama_index.
🔗 [docs.llamaindex.ai/en/stable/module_guides/llama_deploy](https://docs.llamaindex.ai/en/stable/module_guides/llama_deploy/)

29. brainblend-ai/atomic-agents ⭐ 1,727
Atomic Agents provides a set of tools and agents that can be combined to create powerful applications. It is built on top of Instructor and leverages the power of Pydantic for data and schema validation and serialization.

30. landing-ai/vision-agent ⭐ 1,676
VisionAgent is a library that helps you utilize agent frameworks to generate code to solve your vision task

31. openautocoder/Agentless ⭐ 1,325
Agentless🐱: an agentless approach to automatically solve software development problems

32. om-ai-lab/OmAgent ⭐ 1,281
OmAgent is python library for building multimodal language agents with ease. We try to keep the library simple without too much overhead like other agent framework.
🔗 [om-agent.com](https://om-agent.com)

33. link-agi/AutoAgents ⭐ 1,259
[IJCAI 2024] Generate different roles for GPTs to form a collaborative entity for complex tasks.
🔗 [huggingface.co/spaces/linksoul/autoagents](https://huggingface.co/spaces/LinkSoul/AutoAgents)

34. pyspur-dev/pyspur ⭐ 1,105
Graph-Based Editor for LLM Workflows
🔗 [pyspur.dev](https://pyspur.dev)

35. shengranhu/ADAS ⭐ 1,105
Automated Design of Agentic Systems using Meta Agent Search to show agents can invent novel and powerful agent designs

36. prefecthq/ControlFlow ⭐ 1,042
ControlFlow provides a structured, developer-focused framework for defining workflows and delegating work to LLMs, without sacrificing control or transparency
🔗 [controlflow.ai](https://controlflow.ai)

37. victordibia/autogen-ui ⭐ 823
Web UI for AutoGen (A Framework Multi-Agent LLM Applications)

38. thytu/Agentarium ⭐ 747
Framework for managing and orchestrating AI agents with ease. Agentarium provides a flexible and intuitive way to create, manage, and coordinate interactions between multiple AI agents in various environments.

39. google-deepmind/concordia ⭐ 745
Concordia is a library to facilitate construction and use of generative agent-based models to simulate interactions of agents in grounded physical, social, or digital space.

40. deedy/mac_computer_use ⭐ 692
A fork of Anthropic Computer Use that you can run on Mac computers to give Claude and other AI models autonomous access to your computer.
🔗 [x.com/deedydas/status/1849481225041559910](https://x.com/deedydas/status/1849481225041559910)

41. joshuac215/agent-service-toolkit ⭐ 674
A full toolkit for running an AI agent service built with LangGraph, FastAPI and Streamlit.
🔗 [agent-service-toolkit.streamlit.app](https://agent-service-toolkit.streamlit.app)

## Code Quality

Code quality tooling: linters, formatters, pre-commit hooks, unused code removal.

1. psf/black ⭐ 39,403
The uncompromising Python code formatter
🔗 [black.readthedocs.io/en/stable](https://black.readthedocs.io/en/stable/)

2. astral-sh/ruff ⭐ 34,331
An extremely fast Python linter and code formatter, written in Rust.
🔗 [docs.astral.sh/ruff](https://docs.astral.sh/ruff)

3. google/yapf ⭐ 13,817
A formatter for Python files

4. pre-commit/pre-commit ⭐ 13,179
A framework for managing and maintaining multi-language pre-commit hooks.
🔗 [pre-commit.com](https://pre-commit.com)

5. sqlfluff/sqlfluff ⭐ 8,413
A modular SQL linter and auto-formatter with support for multiple dialects and templated code.
🔗 [www.sqlfluff.com](https://www.sqlfluff.com)

6. pycqa/isort ⭐ 6,553
A Python utility / library to sort imports.
🔗 [pycqa.github.io/isort](https://pycqa.github.io/isort/)

7. davidhalter/jedi ⭐ 5,847
Awesome autocompletion, static analysis and refactoring library for python
🔗 [jedi.readthedocs.io](http://jedi.readthedocs.io)

8. pycqa/pylint ⭐ 5,364
It's not just a linter that annoys you!
🔗 [pylint.readthedocs.io/en/latest](https://pylint.readthedocs.io/en/latest/)

9. asottile/pyupgrade ⭐ 3,650
A tool (and pre-commit hook) to automatically upgrade syntax for newer versions of the language.

10. jendrikseipp/vulture ⭐ 3,600
Find dead Python code

11. pycqa/flake8 ⭐ 3,502
flake8 is a python tool that glues together pycodestyle, pyflakes, mccabe, and third-party plugins to check the style and quality of some python code.
🔗 [flake8.pycqa.org](https://flake8.pycqa.org)

12. wemake-services/wemake-python-styleguide ⭐ 2,633
The strictest and most opinionated python linter ever!
🔗 [wemake-python-styleguide.rtfd.io](https://wemake-python-styleguide.rtfd.io)

13. python-lsp/python-lsp-server ⭐ 2,017
Fork of the python-language-server project, maintained by the Spyder IDE team and the community

14. codespell-project/codespell ⭐ 1,974
check code for common misspellings

15. sourcery-ai/sourcery ⭐ 1,578
Instant AI code reviews
🔗 [sourcery.ai](https://sourcery.ai)

16. tconbeer/sqlfmt ⭐ 423
sqlfmt formats your dbt SQL files so you don't have to
🔗 [sqlfmt.com](https://sqlfmt.com)

## Crypto and Blockchain

Cryptocurrency and blockchain libraries: trading bots, API integration, Ethereum virtual machine, solidity.

1. freqtrade/freqtrade ⭐ 34,099
Free, open source crypto trading bot
🔗 [www.freqtrade.io](https://www.freqtrade.io)

2. ccxt/ccxt ⭐ 33,931
A JavaScript / TypeScript / Python / C# / PHP cryptocurrency trading API with support for more than 100 bitcoin/altcoin exchanges
🔗 [docs.ccxt.com](https://docs.ccxt.com)

3. crytic/slither ⭐ 5,425
Static Analyzer for Solidity and Vyper
🔗 [blog.trailofbits.com/2018/10/19/slither-a-solidity-static-analysis-framework](https://blog.trailofbits.com/2018/10/19/slither-a-solidity-static-analysis-framework/)

4. ethereum/web3.py ⭐ 5,103
A python interface for interacting with the Ethereum blockchain and ecosystem.
🔗 [web3py.readthedocs.io](http://web3py.readthedocs.io)

5. ethereum/consensus-specs ⭐ 3,610
Ethereum Proof-of-Stake Consensus Specifications

6. cyberpunkmetalhead/Binance-volatility-trading-bot ⭐ 3,438
This is a fully functioning Binance trading bot that measures the volatility of every coin on Binance and places trades with the highest gaining coins If you like this project consider donating though the Brave browser to allow me to continuously improve the script.

7. bmoscon/cryptofeed ⭐ 2,305
Cryptocurrency Exchange Websocket Data Feed Handler

8. ethereum/py-evm ⭐ 2,296
A Python implementation of the Ethereum Virtual Machine
🔗 [py-evm.readthedocs.io/en/latest](https://py-evm.readthedocs.io/en/latest/)

9. binance/binance-public-data ⭐ 1,639
Details on how to get Binance public data

10. ofek/bit ⭐ 1,258
Bitcoin made easy.
🔗 [ofek.dev/bit](https://ofek.dev/bit/)

11. man-c/pycoingecko ⭐ 1,058
Python wrapper for the CoinGecko API

12. palkeo/panoramix ⭐ 822
Ethereum decompiler

13. dylanhogg/awesome-crypto ⭐ 71
A list of awesome crypto and blockchain projects
🔗 [www.awesomecrypto.xyz](https://www.awesomecrypto.xyz/)

## Data

General data libraries: data processing, serialisation, formats, databases, SQL, connectors, web crawlers, data generation/augmentation/checks.

1. scrapy/scrapy ⭐ 53,715
Scrapy, a fast high-level web crawling & scraping framework for Python.
🔗 [scrapy.org](https://scrapy.org)

2. apache/spark ⭐ 40,295
Apache Spark - A unified analytics engine for large-scale data processing
🔗 [spark.apache.org](https://spark.apache.org/)

3. microsoft/markitdown ⭐ 32,671
A utility for converting files to Markdown, supports: PDF, PPT, Word, Excel, Images etc

4. mindsdb/mindsdb ⭐ 27,032
AGI's query engine - Platform for building AI that can learn and answer questions over federated data.
🔗 [mindsdb.com](https://mindsdb.com)

5. getredash/redash ⭐ 26,704
Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
🔗 [redash.io](http://redash.io/)

6. jaidedai/EasyOCR ⭐ 25,091
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
🔗 [www.jaided.ai](https://www.jaided.ai)

7. qdrant/qdrant ⭐ 21,242
Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
🔗 [qdrant.tech](https://qdrant.tech)

8. humansignal/label-studio ⭐ 20,092
Label Studio is an open source data labeling tool. It lets you label data types like audio, text, images, videos, and time series with a simple and straightforward UI and export to various model formats.
🔗 [labelstud.io](https://labelstud.io)

9. joke2k/faker ⭐ 17,898
Faker is a Python package that generates fake data for you.
🔗 [faker.readthedocs.io](https://faker.readthedocs.io)

10. avaiga/taipy ⭐ 17,561
Turns Data and AI algorithms into production-ready web applications in no time.
🔗 [www.taipy.io](https://www.taipy.io)

11. ds4sd/docling ⭐ 17,528
Docling parses documents and exports them to the desired format with ease and speed.
🔗 [ds4sd.github.io/docling](https://ds4sd.github.io/docling)

12. airbytehq/airbyte ⭐ 16,691
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
🔗 [airbyte.com](https://airbyte.com)

13. chroma-core/chroma ⭐ 16,574
the AI-native open-source embedding database
🔗 [www.trychroma.com](https://www.trychroma.com/)

14. binux/pyspider ⭐ 16,523
A Powerful Spider(Web Crawler) System in Python.
🔗 [docs.pyspider.org](http://docs.pyspider.org/)

15. twintproject/twint ⭐ 15,911
An advanced Twitter scraping & OSINT tool written in Python that doesn't use Twitter's API, allowing you to scrape a user's followers, following, Tweets and more while evading most API limitations.

16. tiangolo/sqlmodel ⭐ 14,949
SQL databases in Python, designed for simplicity, compatibility, and robustness.
🔗 [sqlmodel.tiangolo.com](https://sqlmodel.tiangolo.com/)

17. apache/arrow ⭐ 14,810
Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics
🔗 [arrow.apache.org](https://arrow.apache.org/)

18. redis/redis-py ⭐ 12,773
Redis Python client

19. pathwaycom/pathway ⭐ 12,339
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
🔗 [pathway.com](https://pathway.com)

20. weaviate/weaviate ⭐ 11,971
Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of a cloud-native database.
🔗 [weaviate.io/developers/weaviate](https://weaviate.io/developers/weaviate/)

21. coleifer/peewee ⭐ 11,306
a small, expressive orm -- supports postgresql, mysql, sqlite and cockroachdb
🔗 [docs.peewee-orm.com](http://docs.peewee-orm.com/)

22. s0md3v/Photon ⭐ 11,155
Incredibly fast crawler designed for OSINT.

23. sqlalchemy/sqlalchemy ⭐ 9,875
The Database Toolkit for Python
🔗 [www.sqlalchemy.org](https://www.sqlalchemy.org)

24. simonw/datasette ⭐ 9,699
An open source multi-tool for exploring and publishing data
🔗 [datasette.io](https://datasette.io)

25. bigscience-workshop/petals ⭐ 9,333
🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
🔗 [petals.dev](https://petals.dev)

26. voxel51/fiftyone ⭐ 9,039
Refine high-quality datasets and visual AI models
🔗 [fiftyone.ai](https://fiftyone.ai)

27. yzhao062/pyod ⭐ 8,722
A Python Library for Outlier and Anomaly Detection, Integrating Classical and Deep Learning Techniques
🔗 [pyod.readthedocs.io](http://pyod.readthedocs.io)

28. gristlabs/grist-core ⭐ 7,467
Grist is the evolution of spreadsheets.
🔗 [www.getgrist.com](https://www.getgrist.com)

29. tobymao/sqlglot ⭐ 6,941
Python SQL Parser and Transpiler
🔗 [sqlglot.com](https://sqlglot.com/)

30. alirezamika/autoscraper ⭐ 6,564
A Smart, Automatic, Fast and Lightweight Web Scraper for Python

31. kaggle/kaggle-api ⭐ 6,376
Official Kaggle API

32. madmaze/pytesseract ⭐ 5,949
A Python wrapper for Google Tesseract

33. vi3k6i5/flashtext ⭐ 5,605
Extract Keywords from sentence or Replace keywords in sentences.

34. airbnb/knowledge-repo ⭐ 5,495
A next-generation curated knowledge sharing platform for data scientists and other technical professions.

35. ibis-project/ibis ⭐ 5,424
Ibis is a Python library that provides a lightweight, universal interface for data wrangling. It helps Python users explore and transform data of any size, stored anywhere.
🔗 [ibis-project.org](https://ibis-project.org)

36. lancedb/lancedb ⭐ 5,203
Developer-friendly, serverless vector database for AI applications. Easily add long-term memory to your LLM apps!
🔗 [lancedb.github.io/lancedb](https://lancedb.github.io/lancedb/)

37. facebookresearch/AugLy ⭐ 4,981
A data augmentations library for audio, image, text, and video.
🔗 [ai.facebook.com/blog/augly-a-new-data-augmentation-library-to-help-build-more-robust-ai-models](https://ai.facebook.com/blog/augly-a-new-data-augmentation-library-to-help-build-more-robust-ai-models/)

38. superduperdb/superduper ⭐ 4,896
Superduper: Build end-to-end AI applications and agent workflows on your existing data infrastructure and preferred tools - without migrating your data.
🔗 [superduper.io](https://superduper.io)

39. jazzband/tablib ⭐ 4,650
Python Module for Tabular Datasets in XLS, CSV, JSON, YAML, &c.
🔗 [tablib.readthedocs.io](https://tablib.readthedocs.io/)

40. lk-geimfari/mimesis ⭐ 4,468
Mimesis is a robust data generator for Python that can produce a wide range of fake data in multiple languages.
🔗 [mimesis.name](https://mimesis.name)

41. amundsen-io/amundsen ⭐ 4,466
Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.
🔗 [www.amundsen.io/amundsen](https://www.amundsen.io/amundsen/)

42. cyclotruc/gitingest ⭐ 4,351
Turn any Git repository into a prompt-friendly text ingest for LLMs.
🔗 [gitingest.com](https://gitingest.com/)

43. giskard-ai/giskard ⭐ 4,197
🐢 Open-Source Evaluation & Testing for AI & LLM systems
🔗 [docs.giskard.ai](https://docs.giskard.ai)

44. mongodb/mongo-python-driver ⭐ 4,171
PyMongo - the Official MongoDB Python driver
🔗 [www.mongodb.com/docs/languages/python/pymongo-driver/current](https://www.mongodb.com/docs/languages/python/pymongo-driver/current/)

45. rom1504/img2dataset ⭐ 3,833
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.

46. andialbrecht/sqlparse ⭐ 3,790
A non-validating SQL parser module for Python

47. jmcnamara/XlsxWriter ⭐ 3,687
A Python module for creating Excel XLSX files.
🔗 [xlsxwriter.readthedocs.io](https://xlsxwriter.readthedocs.io)

48. deepchecks/deepchecks ⭐ 3,661
Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML validation needs, enabling to thoroughly test your data and models from research to production.
🔗 [docs.deepchecks.com/stable](https://docs.deepchecks.com/stable)

49. praw-dev/praw ⭐ 3,558
PRAW, an acronym for "Python Reddit API Wrapper", is a python package that allows for simple access to Reddit's API.
🔗 [praw.readthedocs.io](http://praw.readthedocs.io/)

50. run-llama/llama-hub ⭐ 3,463
A library of data loaders for LLMs made by the community -- to be used with LlamaIndex and/or LangChain
🔗 [llamahub.ai](https://llamahub.ai/)

51. rapidai/RapidOCR โญ 3,327
๐Ÿ“„ Awesome OCR multiple programing languages toolkits based on ONNXRuntime, OpenVINO and PaddlePaddle.
๐Ÿ”— [rapidai.github.io/rapidocrdocs](https://rapidai.github.io/RapidOCRDocs)

52. pyeve/cerberus โญ 3,182
Lightweight, extensible data validation library for Python
๐Ÿ”— [python-cerberus.org](http://python-cerberus.org)

53. zoomeranalytics/xlwings โญ 3,045
xlwings is a Python library that makes it easy to call Python from Excel and vice versa. It works with Excel on Windows and macOS as well as with Google Sheets and Excel on the web.
๐Ÿ”— [www.xlwings.org](https://www.xlwings.org)

54. docarray/docarray โญ 3,002
Represent, send, store and search multimodal data
๐Ÿ”— [docs.docarray.org](https://docs.docarray.org/)

55. sqlalchemy/alembic โญ 2,980
A database migrations tool for SQLAlchemy.

56. pallets/itsdangerous โญ 2,953
Safely pass trusted data to untrusted environments and back.
๐Ÿ”— [itsdangerous.palletsprojects.com](https://itsdangerous.palletsprojects.com)

57. datafold/data-diff โญ 2,951
Compare tables within or across databases
๐Ÿ”— [docs.datafold.com](https://docs.datafold.com)

58. goldsmith/Wikipedia โญ 2,906
A Pythonic wrapper for the Wikipedia API
๐Ÿ”— [wikipedia.readthedocs.org](https://wikipedia.readthedocs.org/)

59. dlt-hub/dlt โญ 2,902
data load tool (dlt) is an open source Python library that makes data loading easy ๐Ÿ› ๏ธ
๐Ÿ”— [dlthub.com/docs](https://dlthub.com/docs)

60. awslabs/amazon-redshift-utils โญ 2,781
Amazon Redshift Utils contains utilities, scripts and view which are useful in a Redshift environment

61. kayak/pypika โญ 2,584
PyPika is a python SQL query builder that exposes the full richness of the SQL language using a syntax that reflects the resulting query. PyPika excels at all sorts of SQL queries but is especially useful for data analysis.
๐Ÿ”— [pypika.readthedocs.io/en/latest](http://pypika.readthedocs.io/en/latest/)

62. pynamodb/PynamoDB โญ 2,469
A pythonic interface to Amazon's DynamoDB
๐Ÿ”— [pynamodb.readthedocs.io](http://pynamodb.readthedocs.io)

63. sdv-dev/SDV โญ 2,434
Synthetic data generation for tabular data
๐Ÿ”— [docs.sdv.dev/sdv](https://docs.sdv.dev/sdv)

64. mlabonne/llm-datasets โญ 2,332
High-quality datasets, tools, and concepts for LLM fine-tuning.

65. uqfoundation/dill โญ 2,295
serialize all of Python
๐Ÿ”— [dill.rtfd.io](http://dill.rtfd.io)

66. emirozer/fake2db โญ 2,285
Generate databases filled with fake but valid data for testing, using the most popular patterns (AFAIK). Currently supports SQLite, MySQL, PostgreSQL, MongoDB, Redis and CouchDB.

67. samuelcolvin/arq โญ 2,231
Fast job queuing and RPC in Python with asyncio and Redis.
๐Ÿ”— [arq-docs.helpmanual.io](https://arq-docs.helpmanual.io/)

68. pikepdf/pikepdf โญ 2,226
A Python library for reading and writing PDF, powered by QPDF
๐Ÿ”— [pikepdf.readthedocs.io](https://pikepdf.readthedocs.io/)

69. graphistry/pygraphistry โญ 2,195
PyGraphistry is a Python library to quickly load, shape, embed, and explore big graphs with the GPU-accelerated Graphistry visual graph analyzer

70. accenture/AmpliGraph โญ 2,176
Python library for Representation Learning on Knowledge Graphs https://docs.ampligraph.org

71. huggingface/datatrove โญ 2,141
DataTrove is a library to process, filter and deduplicate text data at a very large scale. It provides a set of prebuilt commonly used processing blocks with a framework to easily add custom functionality

72. sfu-db/connector-x โญ 2,057
Fastest library to load data from DB to DataFrames in Rust and Python
๐Ÿ”— [sfu-db.github.io/connector-x](https://sfu-db.github.io/connector-x)

73. aminalaee/sqladmin โญ 1,983
SQLAlchemy Admin for FastAPI and Starlette
๐Ÿ”— [aminalaee.dev/sqladmin](https://aminalaee.dev/sqladmin/)

74. milvus-io/bootcamp โญ 1,962
Dealing with all unstructured data, such as reverse image search, audio search, molecular search, video analysis, question and answer systems, NLP, etc.
๐Ÿ”— [milvus.io](https://milvus.io)

75. agronholm/sqlacodegen โญ 1,948
Automatic model code generator for SQLAlchemy

76. uber/petastorm โญ 1,809
The Petastorm library enables single-machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as TensorFlow, PyTorch, and PySpark and can be used from pure Python code.

77. aio-libs/aiomysql โญ 1,773
aiomysql is a library for accessing a MySQL database from asyncio
๐Ÿ”— [aiomysql.rtfd.io](https://aiomysql.rtfd.io)

78. simonw/sqlite-utils โญ 1,727
Python CLI utility and library for manipulating SQLite databases
๐Ÿ”— [sqlite-utils.datasette.io](https://sqlite-utils.datasette.io)

79. simple-salesforce/simple-salesforce โญ 1,718
A very simple Salesforce.com REST API client for Python

80. collerek/ormar โญ 1,690
Async Python ORM with FastAPI in mind and Pydantic validation
๐Ÿ”— [collerek.github.io/ormar](https://collerek.github.io/ormar/)

81. zarr-developers/zarr-python โญ 1,558
An implementation of chunked, compressed, N-dimensional arrays for Python.
๐Ÿ”— [zarr.readthedocs.io](https://zarr.readthedocs.io)

82. eleutherai/the-pile โญ 1,520
The Pile is a large, diverse, open source language modelling data set that consists of many smaller datasets combined together.

83. ydataai/ydata-synthetic โญ 1,473
Synthetic data generators for tabular and time-series data
๐Ÿ”— [docs.synthetic.ydata.ai](https://docs.synthetic.ydata.ai)

84. scholarly-python-package/scholarly โญ 1,466
Retrieve author and publication information from Google Scholar in a friendly, Pythonic way without having to worry about CAPTCHAs!
๐Ÿ”— [scholarly.readthedocs.io](https://scholarly.readthedocs.io/)

85. sdispater/orator โญ 1,423
The Orator ORM provides a simple yet beautiful ActiveRecord implementation.
๐Ÿ”— [orator-orm.com](https://orator-orm.com)

86. mchong6/JoJoGAN โญ 1,423
Official PyTorch repo for JoJoGAN: One Shot Face Stylization

87. google/tensorstore โญ 1,369
Library for reading and writing large multi-dimensional arrays.
๐Ÿ”— [google.github.io/tensorstore](https://google.github.io/tensorstore/)

88. quixio/quix-streams โญ 1,255
Python stream processing for Kafka
๐Ÿ”— [docs.quix.io](https://docs.quix.io)

89. aio-libs/aiocache โญ 1,191
Asyncio cache manager for redis, memcached and memory
๐Ÿ”— [aiocache.readthedocs.io](http://aiocache.readthedocs.io)

90. eliasdabbas/advertools โญ 1,168
advertools - online marketing productivity and analysis tools
๐Ÿ”— [advertools.readthedocs.io](https://advertools.readthedocs.io)

91. pytorch/data โญ 1,152
A PyTorch repo for data loading and utilities to be shared by the PyTorch domain libraries.

92. d-star-ai/dsRAG โญ 1,105
A retrieval engine for unstructured data. It is especially good at handling challenging queries over dense text, like financial reports, legal documents, and academic papers.

93. brettkromkamp/contextualise โญ 1,061
Contextualise is an effective tool particularly suited for organising information-heavy projects and activities consisting of unstructured and widely diverse data and information resources
๐Ÿ”— [contextualise.dev](https://contextualise.dev/)

94. uber/fiber โญ 1,041
Distributed Computing for AI Made Simple
๐Ÿ”— [uber.github.io/fiber](https://uber.github.io/fiber/)

95. intake/intake โญ 1,019
Intake is a lightweight package for finding, investigating, loading and disseminating data.
๐Ÿ”— [intake.readthedocs.io](https://intake.readthedocs.io/)

96. duckdb/dbt-duckdb โญ 961
dbt (http://getdbt.com) adapter for DuckDB (http://duckdb.org)

97. goccy/bigquery-emulator โญ 872
BigQuery emulator provides a way to launch a BigQuery server on your local machine for testing and development.

98. scikit-hep/awkward โญ 854
Manipulate JSON-like data with NumPy-like idioms.
๐Ÿ”— [awkward-array.org](https://awkward-array.org)

99. macbre/sql-metadata โญ 821
Uses tokenized query returned by python-sqlparse and generates query metadata
๐Ÿ”— [pypi.python.org/pypi/sql-metadata](https://pypi.python.org/pypi/sql-metadata)

100. koaning/human-learn โญ 800
Natural Intelligence is still a pretty good idea.
๐Ÿ”— [koaning.github.io/human-learn](https://koaning.github.io/human-learn/)

101. googleapis/python-bigquery โญ 749
Python Client for Google BigQuery

102. igorbenav/fastcrud โญ 735
FastCRUD is a Python package for FastAPI, offering robust async CRUD operations and flexible endpoint creation utilities.

103. hyperqueryhq/whale ⭐ 725
🐳 The stupidly simple CLI workspace for your data warehouse.
๐Ÿ”— [rsyi.gitbook.io/whale](https://rsyi.gitbook.io/whale)

104. dgarnitz/vectorflow โญ 682
VectorFlow is a high volume vector embedding pipeline that ingests raw data, transforms it into vectors and writes it to a vector DB of your choice.
๐Ÿ”— [www.getvectorflow.com](https://www.getvectorflow.com/)

105. kagisearch/vectordb โญ 661
A minimal Python package for storing and retrieving text using chunking, embeddings, and vector search.
๐Ÿ”— [vectordb.com](https://vectordb.com)

106. weaviate/recipes โญ 629
This repository shares end-to-end notebooks on how to use various Weaviate features and integrations!

107. unstructured-io/unstructured-api โญ 612
API for Open-Source Pre-Processing Tools for Unstructured Data

108. jina-ai/vectordb โญ 581
A Python vector database you just need - no more, no less.

109. koaning/bulk โญ 559
Bulk is a quick UI developer tool to apply some bulk labels.

110. apache/iceberg-python โญ 539
PyIceberg is a Python library for programmatic access to Iceberg table metadata as well as to table data in Iceberg format.
๐Ÿ”— [py.iceberg.apache.org](https://py.iceberg.apache.org/)

111. koaning/doubtlab โญ 507
Doubt your data, find bad labels.
๐Ÿ”— [koaning.github.io/doubtlab](https://koaning.github.io/doubtlab/)

112. titan-systems/titan โญ 444
Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data access. Declarative Python Resource API.

113. stackloklabs/promptwright โญ 357
Promptwright is a Python library designed for generating large synthetic datasets using LLMs

## Debugging

Debugging and tracing tools.

1. cool-rr/PySnooper โญ 16,412
Never use print for debugging again

2. gruns/icecream ⭐ 9,403
🍦 Never use print() to debug again.

3. shobrook/rebound โญ 4,119
Get Stack Overflow results in your terminal whenever an error is thrown

4. inducer/pudb โญ 3,013
Full-screen console debugger for Python
๐Ÿ”— [documen.tician.de/pudb](https://documen.tician.de/pudb/)

5. gotcha/ipdb โญ 1,876
Integration of IPython pdb

6. alexmojaki/heartrate โญ 1,772
Simple real-time visualisation of the execution of a Python program.

7. alexmojaki/birdseye โญ 1,662
Graphical Python debugger which lets you easily view the values of all evaluated expressions
๐Ÿ”— [birdseye.readthedocs.io](https://birdseye.readthedocs.io)

8. pdbpp/pdbpp โญ 1,321
pdb++, a drop-in replacement for pdb (the Python debugger)

9. alexmojaki/snoop โญ 1,305
A powerful set of Python debugging tools, based on PySnooper

10. samuelcolvin/python-devtools โญ 1,001
Dev tools for Python
๐Ÿ”— [python-devtools.helpmanual.io](https://python-devtools.helpmanual.io/)
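
Several of the tools above report which lines of a function ran and what the local variables held at each step. As a rough illustration of that idea (a minimal sketch using only the standard library's `sys.settrace`, not the implementation of any listed project; `trace_lines` and `running_total` are hypothetical names invented for this example):

```python
import sys


def trace_lines(func, *args, **kwargs):
    """Run func and record (line offset, local variables) for each executed line.

    Hypothetical helper for illustration only.
    """
    events = []
    code = func.__code__

    def tracer(frame, event, arg):
        # Only follow frames that execute the target function's own code.
        if frame.f_code is not code:
            return None
        if event == "line":
            offset = frame.f_lineno - code.co_firstlineno
            # Copy the locals so later mutations do not alter the record.
            events.append((offset, dict(frame.f_locals)))
        return tracer

    previous = sys.gettrace()
    sys.settrace(tracer)
    try:
        result = func(*args, **kwargs)
    finally:
        # Always restore whatever tracer (if any) was active before.
        sys.settrace(previous)
    return result, events


def running_total(values):
    total = 0
    for v in values:
        total += v
    return total


if __name__ == "__main__":
    result, events = trace_lines(running_total, [1, 2, 3])
    for offset, local_vars in events:
        print(f"line +{offset}: {local_vars}")
    print("result:", result)
```

Each recorded snapshot shows the locals just before the line runs, so the final entry (the `return` line) already holds the finished total.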

## Diffusion Text to Image

Text-to-image diffusion model libraries, tools and apps for generating images from natural language.

1. automatic1111/stable-diffusion-webui โญ 145,457
Stable Diffusion web UI

2. compvis/stable-diffusion โญ 69,053
A latent text-to-image diffusion model
๐Ÿ”— [ommer-lab.com/research/latent-diffusion-models](https://ommer-lab.com/research/latent-diffusion-models/)

3. comfyanonymous/ComfyUI โญ 62,658
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
๐Ÿ”— [www.comfy.org](https://www.comfy.org/)

4. stability-ai/stablediffusion โญ 39,694
High-Resolution Image Synthesis with Latent Diffusion Models

5. lllyasviel/ControlNet โญ 31,129
Let us control diffusion models!

6. huggingface/diffusers ⭐ 26,996
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
๐Ÿ”— [huggingface.co/docs/diffusers](https://huggingface.co/docs/diffusers)

7. invoke-ai/InvokeAI โญ 24,094
Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The solution offers an industry leading WebUI, and serves as the foundation for multiple commercial products.
๐Ÿ”— [invoke-ai.github.io/invokeai](https://invoke-ai.github.io/InvokeAI/)

8. apple/ml-stable-diffusion โญ 17,048
Stable Diffusion with Core ML on Apple Silicon

9. borisdayma/dalle-mini โญ 14,775
DALL·E Mini - Generate images from a text prompt
๐Ÿ”— [www.craiyon.com](https://www.craiyon.com)

10. openbmb/MiniCPM-V โญ 13,016
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone

11. divamgupta/diffusionbee-stable-diffusion-ui โญ 12,848
Diffusion Bee is the easiest way to run Stable Diffusion locally on your M1 Mac. Comes with a one-click installer. No dependencies or technical knowledge needed.
๐Ÿ”— [diffusionbee.com](https://diffusionbee.com)

12. compvis/latent-diffusion โญ 12,167
High-Resolution Image Synthesis with Latent Diffusion Models

13. instantid/InstantID โญ 11,277
InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
๐Ÿ”— [instantid.github.io](https://instantid.github.io/)

14. lucidrains/DALLE2-pytorch โญ 11,198
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in PyTorch

15. facebookresearch/dinov2 โญ 9,578
PyTorch code and models for the DINOv2 self-supervised learning method.

16. ashawkey/stable-dreamfusion โญ 8,400
Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion.

17. carson-katri/dream-textures โญ 7,888
Stable Diffusion built into Blender

18. xavierxiao/Dreambooth-Stable-Diffusion โญ 7,647
Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion

19. idea-research/GroundingDINO โญ 7,127
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
๐Ÿ”— [arxiv.org/abs/2303.05499](https://arxiv.org/abs/2303.05499)

20. opengvlab/InternVL โญ 6,669
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance.
๐Ÿ”— [internvl.readthedocs.io/en/latest](https://internvl.readthedocs.io/en/latest/)

21. timothybrooks/instruct-pix2pix โญ 6,469
PyTorch implementation of InstructPix2Pix, an instruction-based image editing model, based on the original CompVis/stable_diffusion repo.

22. openai/consistency_models โญ 6,228
Official repo for consistency models.

23. salesforce/BLIP โญ 4,943
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

24. nateraw/stable-diffusion-videos โญ 4,484
Create 🔥 videos with Stable Diffusion by exploring the latent space and morphing between text prompts

25. lkwq007/stablediffusion-infinity โญ 3,866
Outpainting with Stable Diffusion on an infinite canvas

26. jina-ai/discoart ⭐ 3,846
🪩 Create Disco Diffusion artworks in one line

27. mlc-ai/web-stable-diffusion โญ 3,625
Bringing stable diffusion models to web browsers. Everything runs inside the browser with no server support.
๐Ÿ”— [mlc.ai/web-stable-diffusion](https://mlc.ai/web-stable-diffusion)

28. openai/glide-text2im โญ 3,567
GLIDE: a diffusion-based text-conditional image synthesis model

29. openai/improved-diffusion โญ 3,376
Release for Improved Denoising Diffusion Probabilistic Models

30. saharmor/dalle-playground โญ 2,770
A playground to generate images from any text prompt using Stable Diffusion (past: using DALL-E Mini)

31. google-research/big_vision โญ 2,500
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.

32. stability-ai/stability-sdk โญ 2,427
SDK for interacting with stability.ai APIs (e.g. stable diffusion inference)
๐Ÿ”— [platform.stability.ai](https://platform.stability.ai/)

33. thudm/CogVLM2 โญ 2,192
GPT4V-level open-source multi-modal model based on Llama3-8B

34. coyote-a/ultimate-upscale-for-automatic1111 โญ 1,678
Ultimate SD Upscale extension for AUTOMATIC1111 Stable Diffusion web UI

35. open-compass/VLMEvalKit โญ 1,627
Open-source evaluation toolkit of large vision-language models (LVLMs), support 160+ VLMs, 50+ benchmarks
๐Ÿ”— [huggingface.co/spaces/opencompass/open_vlm_leaderboard](https://huggingface.co/spaces/opencompass/open_vlm_leaderboard)

36. divamgupta/stable-diffusion-tensorflow โญ 1,593
Stable Diffusion in TensorFlow / Keras

37. nvlabs/prismer โญ 1,304
The implementation of "Prismer: A Vision-Language Model with Multi-Task Experts".
๐Ÿ”— [shikun.io/projects/prismer](https://shikun.io/projects/prismer)

38. chenyangqiqi/FateZero โญ 1,126
[ICCV 2023 Oral] "FateZero: Fusing Attentions for Zero-shot Text-based Video Editing"
๐Ÿ”— [fate-zero-edit.github.io](http://fate-zero-edit.github.io/)

39. thereforegames/unprompted โญ 786
Templating language written for Stable Diffusion workflows. Available as an extension for the Automatic1111 WebUI.

40. tanelp/tiny-diffusion โญ 691
A minimal PyTorch implementation of probabilistic diffusion models for 2D datasets.

41. sharonzhou/long_stable_diffusion โญ 683
Long-form text-to-images generation, using a pipeline of deep generative models (GPT-3 and Stable Diffusion)

42. laion-ai/dalle2-laion โญ 500
Pretrained DALL-E 2 from LAION

## Finance

Financial and quantitative libraries: investment research tools, market data, algorithmic trading, backtesting, financial derivatives.

1. openbb-finance/OpenBB โญ 34,753
Investment Research for Everyone, Everywhere.
๐Ÿ”— [openbb.co](https://openbb.co)

2. quantopian/zipline โญ 17,897
Zipline, a Pythonic Algorithmic Trading Library
๐Ÿ”— [www.zipline.io](https://www.zipline.io)

3. microsoft/qlib โญ 15,919
Qlib is an AI-oriented quantitative investment platform that aims to realize the potential, empower research, and create value using AI technologies in quantitative investment, from exploring ideas to implementing productions. Qlib supports diverse machine learning modeling paradigms, including supervised learning, ...
๐Ÿ”— [qlib.readthedocs.io/en/latest](https://qlib.readthedocs.io/en/latest/)

4. mementum/backtrader โญ 15,379
Python Backtesting library for trading strategies
๐Ÿ”— [www.backtrader.com](https://www.backtrader.com)

5. ranaroussi/yfinance โญ 15,313
Download market data from Yahoo! Finance's API
๐Ÿ”— [aroussi.com/post/python-yahoo-finance](https://aroussi.com/post/python-yahoo-finance)

6. ai4finance-foundation/FinGPT โญ 14,579
FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.
๐Ÿ”— [ai4finance.org](https://ai4finance.org)

7. ai4finance-foundation/FinRL โญ 10,373
FinRL: Financial Reinforcement Learning. 🔥
๐Ÿ”— [ai4finance.org](https://ai4finance.org)

8. quantconnect/Lean โญ 10,224
Lean Algorithmic Trading Engine by QuantConnect (Python, C#)
๐Ÿ”— [lean.io](https://lean.io)

9. ta-lib/ta-lib-python โญ 9,989
Python wrapper for TA-Lib (http://ta-lib.org/).
๐Ÿ”— [ta-lib.github.io/ta-lib-python](http://ta-lib.github.io/ta-lib-python)

10. goldmansachs/gs-quant โญ 8,121
Python toolkit for quantitative finance
๐Ÿ”— [developer.gs.com/discover/products/gs-quant](https://developer.gs.com/discover/products/gs-quant/)

11. virattt/ai-hedge-fund โญ 5,794
AI-powered hedge fund. The goal of this project is to explore the use of AI to make trading decisions.

12. quantopian/pyfolio โญ 5,774
Portfolio and risk analytics in Python
๐Ÿ”— [quantopian.github.io/pyfolio](https://quantopian.github.io/pyfolio)

13. kernc/backtesting.py โญ 5,735
🔎 📈 🐍 💰 Backtest trading strategies in Python.
๐Ÿ”— [kernc.github.io/backtesting.py](https://kernc.github.io/backtesting.py/)

14. twopirllc/pandas-ta โญ 5,628
Technical Analysis Indicators - Pandas TA is an easy to use Python 3 Pandas Extension with 150+ Indicators
๐Ÿ”— [twopirllc.github.io/pandas-ta](https://twopirllc.github.io/pandas-ta/)

15. ranaroussi/quantstats โญ 5,140
Portfolio analytics for quants, written in Python

16. google/tf-quant-finance โญ 4,635
High-performance TensorFlow library for quantitative finance.

17. polakowo/vectorbt โญ 4,622
Find your trading edge, using the fastest engine for backtesting, algorithmic trading, and research.
๐Ÿ”— [vectorbt.dev](https://vectorbt.dev)

18. gbeced/pyalgotrade โญ 4,459
Python Algorithmic Trading Library
๐Ÿ”— [gbeced.github.io/pyalgotrade](http://gbeced.github.io/pyalgotrade/)

19. borisbanushev/stockpredictionai โญ 4,387
In this notebook I will create a complete process for predicting stock price movements. Follow along and we will achieve some pretty good results. For that purpose we will use a Generative Adversarial Network (GAN) with LSTM, a type of Recurrent Neural Network, as generator, and a Convolutional Neural Networ...

20. matplotlib/mplfinance โญ 3,790
Financial Markets Data Visualization using Matplotlib
๐Ÿ”— [pypi.org/project/mplfinance](https://pypi.org/project/mplfinance/)

21. cuemacro/finmarketpy โญ 3,491
Python library for backtesting trading strategies & analyzing financial markets (formerly pythalesians)
๐Ÿ”— [www.cuemacro.com](http://www.cuemacro.com)

22. quantopian/alphalens โญ 3,471
Performance analysis of predictive (alpha) stock factors
๐Ÿ”— [quantopian.github.io/alphalens](http://quantopian.github.io/alphalens)

23. zvtvz/zvt โญ 3,331
Modular quant framework.
๐Ÿ”— [zvt.readthedocs.io/en/latest](https://zvt.readthedocs.io/en/latest/)

24. robcarver17/pysystemtrade โญ 2,716
Systematic Trading in Python

25. quantopian/research_public โญ 2,476
Quantitative research and educational materials
๐Ÿ”— [www.quantopian.com/lectures](https://www.quantopian.com/lectures)

26. pmorissette/bt โญ 2,339
bt - flexible backtesting for Python
๐Ÿ”— [pmorissette.github.io/bt](http://pmorissette.github.io/bt)

27. domokane/FinancePy โญ 2,203
A Python Finance Library that focuses on the pricing and risk-management of Financial Derivatives, including fixed-income, equity, FX and credit derivatives.

28. blankly-finance/blankly ⭐ 2,192
🚀 💸 Easily build, backtest and deploy your algo in just a few lines of code. Trade stocks, cryptos, and forex across exchanges w/ one package.
๐Ÿ”— [package.blankly.finance](https://package.blankly.finance)

29. pmorissette/ffn โญ 2,078
ffn - a financial function library for Python
🔗 [pmorissette.github.io/ffn](http://pmorissette.github.io/ffn)

30. cuemacro/findatapy โญ 1,730
Python library to download market data via Bloomberg, Eikon, Quandl, Yahoo etc.

31. quantopian/empyrical โญ 1,322
Common financial risk and performance metrics. Used by zipline and pyfolio.
๐Ÿ”— [quantopian.github.io/empyrical](https://quantopian.github.io/empyrical)

32. idanya/algo-trader โญ 806
Trading bot with support for realtime trading, backtesting, custom strategies and much more.

33. gbeced/basana โญ 616
A Python async and event-driven framework for algorithmic trading, with a focus on cryptocurrencies.

34. chancefocus/PIXIU โญ 586
This repository introduces PIXIU, an open-source resource featuring the first financial large language models (LLMs), instruction tuning data, and evaluation benchmarks to holistically assess financial LLMs. Our goal is to continually push forward the open-source development of financial artificial intelligence (AI).
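
Many of the backtesting and analytics libraries above report risk and performance metrics such as period returns and maximum drawdown. As a rough illustration of what those two numbers mean (a minimal pure-Python sketch, not the API of any listed library; `simple_returns` and `max_drawdown` are hypothetical names, and the price series is made up):

```python
def simple_returns(prices):
    """Return the period-over-period simple returns of a price series."""
    if len(prices) < 2:
        raise ValueError("need at least two prices")
    return [(curr - prev) / prev for prev, curr in zip(prices, prices[1:])]


def max_drawdown(prices):
    """Return the largest peak-to-trough decline as a negative fraction."""
    if not prices:
        raise ValueError("need at least one price")
    peak = prices[0]
    worst = 0.0
    for price in prices:
        # Track the highest price seen so far.
        peak = max(peak, price)
        # Compare the current price with that running peak.
        worst = min(worst, (price - peak) / peak)
    return worst


if __name__ == "__main__":
    prices = [100.0, 120.0, 90.0, 110.0]  # illustrative data only
    print(simple_returns(prices))
    print(max_drawdown(prices))
```

Production libraries typically work on pandas Series and handle missing data, annualisation and benchmarks, which this sketch leaves out.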

## Game Development

Game development tools, engines and libraries.

1. kitao/pyxel โญ 15,631
A retro game engine for Python

2. pygame/pygame ⭐ 7,644
🐍🎮 pygame (the library) is a Free and Open Source python programming language library for making multimedia applications like games built on top of the excellent SDL library. C, Python, Native, OpenGL.
๐Ÿ”— [www.pygame.org](https://www.pygame.org)

3. microsoft/TRELLIS โญ 6,109
A large 3D asset generation model. It takes in text or image prompts and generates high-quality 3D assets in various formats, such as Radiance Fields, 3D Gaussians, and meshes.
๐Ÿ”— [trellis3d.github.io](https://trellis3d.github.io)

4. panda3d/panda3d โญ 4,601
Powerful, mature open-source cross-platform game engine for Python and C++, developed by Disney and CMU
๐Ÿ”— [www.panda3d.org](https://www.panda3d.org/)

5. pokepetter/ursina โญ 2,248
A game engine powered by Python and Panda3D.
๐Ÿ”— [pokepetter.github.io/ursina](https://pokepetter.github.io/ursina/)

6. pyglet/pyglet โญ 1,925
pyglet is a cross-platform windowing and multimedia library for Python, for developing games and other visually rich applications.
๐Ÿ”— [pyglet.org](http://pyglet.org)

7. pythonarcade/arcade โญ 1,725
Easy to use Python library for creating 2D arcade games.
๐Ÿ”— [arcade.academy](http://arcade.academy)

## GIS

Geospatial libraries: raster and vector data formats, interactive mapping and visualisation, computing frameworks for processing images, projections.

1. domlysz/BlenderGIS โญ 7,923
Blender addons to make the bridge between Blender and geographic data

2. python-visualization/folium โญ 6,988
Python Data. Leaflet.js Maps.
๐Ÿ”— [python-visualization.github.io/folium](https://python-visualization.github.io/folium/)

3. osgeo/gdal โญ 5,012
GDAL is an open source MIT licensed translator library for raster and vector geospatial data formats.
๐Ÿ”— [gdal.org](https://gdal.org)

4. gboeing/osmnx โญ 4,955
Python package to easily download, model, analyze, and visualize street networks and other geospatial features from OpenStreetMap.
๐Ÿ”— [osmnx.readthedocs.io](https://osmnx.readthedocs.io)

5. geopandas/geopandas โญ 4,588
Python tools for geographic data
๐Ÿ”— [geopandas.org](http://geopandas.org/)

6. shapely/shapely โญ 3,970
Manipulation and analysis of geometric objects
๐Ÿ”— [shapely.readthedocs.io/en/stable](https://shapely.readthedocs.io/en/stable/)

7. giswqs/geemap โญ 3,534
A Python package for interactive geospatial analysis and visualization with Google Earth Engine.
๐Ÿ”— [geemap.org](https://geemap.org)

8. holoviz/datashader โญ 3,346
Quickly and accurately render even the largest data.
๐Ÿ”— [datashader.org](http://datashader.org)

9. opengeos/leafmap โญ 3,240
A Python package for interactive mapping and geospatial analysis with minimal coding in a Jupyter environment
๐Ÿ”— [leafmap.org](https://leafmap.org)

10. microsoft/torchgeo โญ 3,147
TorchGeo: datasets, samplers, transforms, and pre-trained models for geospatial data
๐Ÿ”— [www.osgeo.org/projects/torchgeo](https://www.osgeo.org/projects/torchgeo/)

11. opengeos/segment-geospatial โญ 3,123
A Python package for segmenting geospatial data with the Segment Anything Model (SAM)
๐Ÿ”— [samgeo.gishub.org](https://samgeo.gishub.org)

12. google/earthengine-api โญ 2,732
Python and JavaScript bindings for calling the Earth Engine API.

13. rasterio/rasterio โญ 2,291
Rasterio reads and writes geospatial raster datasets
๐Ÿ”— [rasterio.readthedocs.io](https://rasterio.readthedocs.io/)

14. mcordts/cityscapesScripts โญ 2,199
README and scripts for the Cityscapes Dataset

15. azavea/raster-vision โญ 2,104
An open source library and framework for deep learning on satellite and aerial imagery.
๐Ÿ”— [docs.rastervision.io](https://docs.rastervision.io)

16. apache/sedona โญ 1,975
A cluster computing framework for processing large-scale geospatial data
๐Ÿ”— [sedona.apache.org](https://sedona.apache.org/)

17. plant99/felicette โญ 1,819
Satellite imagery for dummies.

18. gboeing/osmnx-examples โญ 1,595
Gallery of OSMnx tutorials, usage examples, and feature demonstrations.
๐Ÿ”— [osmnx.readthedocs.io](https://osmnx.readthedocs.io)

19. jupyter-widgets/ipyleaflet โญ 1,499
A Jupyter - Leaflet.js bridge
๐Ÿ”— [ipyleaflet.readthedocs.io](https://ipyleaflet.readthedocs.io)

20. microsoft/GlobalMLBuildingFootprints โญ 1,459
Worldwide building footprints derived from satellite imagery

21. pysal/pysal โญ 1,351
PySAL: Python Spatial Analysis Library Meta-Package
๐Ÿ”— [pysal.org/pysal](http://pysal.org/pysal)

22. anitagraser/movingpandas โญ 1,256
Movement trajectory classes and functions built on top of GeoPandas
๐Ÿ”— [movingpandas.org](http://movingpandas.org)

23. residentmario/geoplot โญ 1,161
High-level geospatial data visualization library for Python.
๐Ÿ”— [residentmario.github.io/geoplot/index.html](https://residentmario.github.io/geoplot/index.html)

24. sentinel-hub/eo-learn โญ 1,142
Earth observation processing framework for machine learning in Python
๐Ÿ”— [eo-learn.readthedocs.io/en/latest](https://eo-learn.readthedocs.io/en/latest/)

25. opengeos/streamlit-geospatial โญ 895
A multi-page Streamlit app for geospatial applications
๐Ÿ”— [huggingface.co/spaces/giswqs/streamlit](https://huggingface.co/spaces/giswqs/Streamlit)

26. osgeo/grass โญ 868
GRASS GIS - free and open-source geospatial processing engine
๐Ÿ”— [grass.osgeo.org](https://grass.osgeo.org)

27. makepath/xarray-spatial โญ 854
Raster-based Spatial Analytics for Python
๐Ÿ”— [xarray-spatial.readthedocs.io](https://xarray-spatial.readthedocs.io/)

28. developmentseed/titiler โญ 812
Build your own Raster dynamic map tile services
๐Ÿ”— [developmentseed.org/titiler](https://developmentseed.org/titiler/)
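
Distance between two latitude/longitude points is one of the most basic geospatial computations. As a rough illustration (a minimal standard-library sketch of the haversine great-circle formula on a spherical Earth, not the API of any listed library; `haversine_km` and `EARTH_RADIUS_KM` are hypothetical names, and the coordinates in the demo are approximate):

```python
import math

EARTH_RADIUS_KM = 6371.0088  # mean Earth radius; a spherical approximation


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometres between two points given in degrees."""
    phi1 = math.radians(lat1)
    phi2 = math.radians(lat2)
    d_phi = phi2 - phi1
    d_lambda = math.radians(lon2 - lon1)
    # Haversine of the central angle between the two points.
    a = (
        math.sin(d_phi / 2) ** 2
        + math.cos(phi1) * math.cos(phi2) * math.sin(d_lambda / 2) ** 2
    )
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))


if __name__ == "__main__":
    # Approximate coordinates for Paris and London.
    print(round(haversine_km(48.8566, 2.3522, 51.5074, -0.1278), 1), "km")
```

Libraries in this category generally use ellipsoidal models and proper coordinate reference systems, so their results will differ slightly from this spherical estimate.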

## Graph

Graphs and network libraries: network analysis, graph machine learning, visualisation.

1. networkx/networkx โญ 15,243
Network Analysis in Python
๐Ÿ”— [networkx.org](https://networkx.org)

2. stellargraph/stellargraph โญ 2,962
StellarGraph - Machine Learning on Graphs
๐Ÿ”— [stellargraph.readthedocs.io](https://stellargraph.readthedocs.io/)

3. westhealth/pyvis โญ 1,040
Python package for creating and visualizing interactive network graphs.
๐Ÿ”— [pyvis.readthedocs.io/en/latest](http://pyvis.readthedocs.io/en/latest/)

4. microsoft/graspologic โญ 844
graspologic is a package for graph statistical algorithms
๐Ÿ”— [graspologic-org.github.io/graspologic](https://graspologic-org.github.io/graspologic/)

5. rampasek/GraphGPS โญ 692
Recipe for a General, Powerful, Scalable Graph Transformer

6. dylanhogg/llmgraph โญ 353
Create knowledge graphs with LLMs

## GUI

Graphical user interface libraries and toolkits.

1. hoffstadt/DearPyGui โญ 13,551
Dear PyGui: A fast and powerful Graphical User Interface Toolkit for Python with minimal dependencies
๐Ÿ”— [dearpygui.readthedocs.io/en/latest](https://dearpygui.readthedocs.io/en/latest/)

2. pysimplegui/PySimpleGUI โญ 13,500
Python GUIs for Humans! PySimpleGUI is the top-rated Python application development environment. Launched in 2018 and actively developed, maintained, and supported in 2024. Transforms tkinter, Qt, WxPython, and Remi into a simple, intuitive, and fun experience for both hobbyists and expert users.
๐Ÿ”— [www.pysimplegui.com](https://www.PySimpleGUI.com)

3. parthjadhav/Tkinter-Designer โญ 9,439
An easy and fast way to create a Python GUI 🐍

4. samuelcolvin/FastUI โญ 8,572
FastUI is a new way to build web application user interfaces defined by declarative Python code.
๐Ÿ”— [fastui-demo.onrender.com](https://fastui-demo.onrender.com)

5. r0x0r/pywebview โญ 4,888
Build GUI for your Python program with JavaScript, HTML, and CSS
๐Ÿ”— [pywebview.flowrl.com](https://pywebview.flowrl.com)

6. beeware/toga โญ 4,441
A Python native, OS native GUI toolkit.
๐Ÿ”— [toga.readthedocs.io/en/latest](https://toga.readthedocs.io/en/latest/)

7. dddomodossola/remi โญ 3,541
Python REMote Interface library. Platform independent. In about 100 Kbytes, perfect for your diet.

8. wxwidgets/Phoenix โญ 2,356
wxPython's Project Phoenix. A new implementation of wxPython, better, stronger, faster than he was before.
๐Ÿ”— [wxpython.org](http://wxpython.org/)

## Jupyter

Jupyter, JupyterLab and Notebook tools, libraries and plugins.

1. jupyterlab/jupyterlab โญ 14,304
JupyterLab computational environment.
๐Ÿ”— [jupyterlab.readthedocs.io](https://jupyterlab.readthedocs.io/)

2. jupyter/notebook โญ 11,897
Jupyter Interactive Notebook
๐Ÿ”— [jupyter-notebook.readthedocs.io](https://jupyter-notebook.readthedocs.io/)

3. marimo-team/marimo โญ 8,990
A reactive Python notebook: run a cell or interact with a UI element, and marimo automatically runs dependent cells, keeping code and outputs consistent. marimo notebooks are stored as pure Python, executable as scripts, and deployable as apps.
๐Ÿ”— [marimo.io](https://marimo.io)

4. mwouts/jupytext โญ 6,693
Jupyter Notebooks as Markdown Documents, Julia, Python or R scripts
๐Ÿ”— [jupytext.readthedocs.io](https://jupytext.readthedocs.io)

5. nteract/papermill ⭐ 6,047
📚 Parameterize, execute, and analyze notebooks
๐Ÿ”— [papermill.readthedocs.io/en/latest](http://papermill.readthedocs.io/en/latest/)

6. connorferster/handcalcs โญ 5,694
Python library for converting Python calculations into rendered LaTeX.

7. voila-dashboards/voila โญ 5,530
Voilà turns Jupyter notebooks into standalone web applications
๐Ÿ”— [voila.readthedocs.io](https://voila.readthedocs.io)

8. jupyterlite/jupyterlite โญ 3,970
Wasm powered Jupyter running in the browser 💡
๐Ÿ”— [jupyterlite.rtfd.io/en/stable/try/lab](https://jupyterlite.rtfd.io/en/stable/try/lab)

9. executablebooks/jupyter-book โญ 3,925
Create beautiful, publication-quality books and documents from computational content.
๐Ÿ”— [jupyterbook.org](http://jupyterbook.org)

10. jupyterlab/jupyterlab-desktop โญ 3,817
JupyterLab desktop application, based on Electron.

11. jupyterlab/jupyter-ai โญ 3,318
A generative AI extension for JupyterLab
๐Ÿ”— [jupyter-ai.readthedocs.io](https://jupyter-ai.readthedocs.io/)

12. jupyter-widgets/ipywidgets โญ 3,173
Interactive Widgets for the Jupyter Notebook
๐Ÿ”— [ipywidgets.readthedocs.io](https://ipywidgets.readthedocs.io)

13. quantopian/qgrid โญ 3,054
An interactive grid for sorting, filtering, and editing DataFrames in Jupyter notebooks

14. jupyter/nbdime โญ 2,686
Tools for diffing and merging of Jupyter notebooks.
๐Ÿ”— [nbdime.readthedocs.io](http://nbdime.readthedocs.io)

15. mito-ds/mito โญ 2,318
The mitosheet package, trymito.io, and other public Mito code.
๐Ÿ”— [trymito.io](https://trymito.io)

16. jupyter/nbviewer โญ 2,218
nbconvert as a web service: Render Jupyter Notebooks as static web pages
๐Ÿ”— [nbviewer.jupyter.org](https://nbviewer.jupyter.org)

17. maartenbreddels/ipyvolume ⭐ 1,951
3D plotting for Python in the Jupyter notebook, based on IPython widgets using WebGL

18. jupyter-lsp/jupyterlab-lsp ⭐ 1,834
Coding assistance for JupyterLab (code navigation + hover suggestions + linters + autocompletion + rename) using Language Server Protocol
🔗 [jupyterlab-lsp.readthedocs.io](https://jupyterlab-lsp.readthedocs.io)

19. jupyter/nbconvert ⭐ 1,769
Jupyter Notebook Conversion
🔗 [nbconvert.readthedocs.io](https://nbconvert.readthedocs.io/)

20. 8080labs/pyforest ⭐ 1,107
With pyforest you can use all your favorite Python libraries without importing them first. If you use a package that has not been imported yet, pyforest imports the package for you and adds the code to the first Jupyter cell.
🔗 [8080labs.com](https://8080labs.com)

21. nbqa-dev/nbQA ⭐ 1,064
Run ruff, isort, pyupgrade, mypy, pylint, flake8, and more on Jupyter Notebooks
🔗 [nbqa.readthedocs.io/en/latest/index.html](https://nbqa.readthedocs.io/en/latest/index.html)

22. vizzuhq/ipyvizzu ⭐ 959
Build animated charts in Jupyter Notebook and similar environments with a simple Python syntax.
🔗 [ipyvizzu.vizzuhq.com](https://ipyvizzu.vizzuhq.com)

23. koaning/drawdata ⭐ 818
Draw datasets from within Jupyter.
🔗 [calmcode.io/labs/drawdata.html](https://calmcode.io/labs/drawdata.html)

24. aws/graph-notebook ⭐ 747
Library extending Jupyter notebooks to integrate with Apache TinkerPop, openCypher, and RDF SPARQL.
🔗 [github.com/aws/graph-notebook](https://github.com/aws/graph-notebook)

25. linealabs/lineapy ⭐ 664
Move fast from data science prototype to pipeline. Capture, analyze, and transform messy notebooks into data pipelines with just two lines of code.
🔗 [lineapy.org](https://lineapy.org)

26. xiaohk/stickyland ⭐ 538
Break the linear presentation of Jupyter Notebooks with sticky cells!
🔗 [xiaohk.github.io/stickyland](https://xiaohk.github.io/stickyland/)

27. infuseai/colab-xterm ⭐ 407
Open a terminal in Colab, including the free tier.

## LLMs and ChatGPT

Large language model and GPT libraries and frameworks: auto-gpt, agents, QnA, chain-of-thought workflows, API integrations. Also see the Natural Language Processing category for crossover.

1. significant-gravitas/AutoGPT ⭐ 170,183
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
🔗 [agpt.co](https://agpt.co)

2. nomic-ai/gpt4all ⭐ 71,567
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
🔗 [nomic.ai/gpt4all](https://nomic.ai/gpt4all)

3. ggerganov/llama.cpp ⭐ 70,305
LLM inference in C/C++

4. xtekky/gpt4free ⭐ 62,969
The official gpt4free repository | various collection of powerful language models
🔗 [t.me/g4f_channel](https://t.me/g4f_channel)

5. killianlucas/open-interpreter ⭐ 57,731
A natural language interface for computers
🔗 [openinterpreter.com](http://openinterpreter.com/)

6. facebookresearch/llama ⭐ 57,116
Inference code for Llama models

7. open-webui/open-webui ⭐ 54,971
Open WebUI is an extensible, feature-rich, and user-friendly self-hosted AI platform designed to operate entirely offline. It supports various LLM runners like Ollama and OpenAI-compatible APIs, with a built-in inference engine for RAG.
🔗 [openwebui.com](https://openwebui.com)

8. imartinez/private-gpt ⭐ 54,648
Interact with your documents using the power of GPT, 100% privately, no data leaks
🔗 [privategpt.dev](https://privategpt.dev)

9. gpt-engineer-org/gpt-engineer ⭐ 52,795
Platform to experiment with the AI Software Engineer. Terminal based. NOTE: Very different from https://gptengineer.app

10. xai-org/grok-1 ⭐ 49,758
This repository contains JAX example code for loading and running the Grok-1 open-weights model.

11. geekan/MetaGPT ⭐ 46,478
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
🔗 [deepwisdom.ai](https://deepwisdom.ai/)

12. oobabooga/text-generation-webui ⭐ 41,503
A Gradio web UI for Large Language Models with support for multiple inference backends.

13. thudm/ChatGLM-6B ⭐ 40,950
ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型

14. hpcaitech/ColossalAI ⭐ 39,002
Making large AI models cheaper, faster and more accessible
🔗 [www.colossalai.org](https://www.colossalai.org)

15. karpathy/nanoGPT ⭐ 38,337
The simplest, fastest repository for training/finetuning medium-sized GPTs.

16. lm-sys/FastChat ⭐ 37,440
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

17. hiyouga/LLaMA-Factory ⭐ 37,438
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
🔗 [arxiv.org/abs/2403.13372](https://arxiv.org/abs/2403.13372)

18. laion-ai/Open-Assistant ⭐ 37,166
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
🔗 [open-assistant.io](https://open-assistant.io)

19. quivrhq/quivr ⭐ 37,049
Opinionated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: PGVector, Faiss. Any Files. Any way you want.
🔗 [core.quivr.com](https://core.quivr.com)

20. moymix/TaskMatrix ⭐ 34,551
Connects ChatGPT and a series of Visual Foundation Models to enable sending and receiving images during chatting.

21. vllm-project/vllm ⭐ 33,290
A high-throughput and memory-efficient inference and serving engine for LLMs
🔗 [docs.vllm.ai](https://docs.vllm.ai)

22. pythagora-io/gpt-pilot ⭐ 32,161
The first real AI developer

23. tatsu-lab/stanford_alpaca ⭐ 29,715
Code and documentation to train Stanford's Alpaca models, and generate the data.
🔗 [crfm.stanford.edu/2023/03/13/alpaca.html](https://crfm.stanford.edu/2023/03/13/alpaca.html)

24. meta-llama/llama3 ⭐ 27,831
The official Meta Llama 3 GitHub site

25. infiniflow/ragflow ⭐ 27,414
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
🔗 [ragflow.io](https://ragflow.io)

26. danielmiessler/fabric ⭐ 26,483
fabric is an open-source framework for augmenting humans using AI. It provides a modular framework for solving specific problems using a crowdsourced set of AI prompts that can be used anywhere.
🔗 [danielmiessler.com/p/fabric-origin-story](https://danielmiessler.com/p/fabric-origin-story)

27. vision-cair/MiniGPT-4 ⭐ 25,529
Open-source code for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
🔗 [minigpt-4.github.io](https://minigpt-4.github.io)

28. karpathy/llm.c ⭐ 24,964
LLM training in simple, pure C/CUDA. There is no need for 245MB of PyTorch or 107MB of cPython

29. microsoft/JARVIS ⭐ 23,859
JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf

30. embedchain/mem0 ⭐ 23,776
The Memory layer for your AI apps
🔗 [mem0.ai](https://mem0.ai)

31. openai/gpt-2 ⭐ 22,746
Code for the paper "Language Models are Unsupervised Multitask Learners"
🔗 [openai.com/blog/better-language-models](https://openai.com/blog/better-language-models/)

32. microsoft/semantic-kernel ⭐ 22,578
Integrate cutting-edge LLM technology quickly and easily into your apps
🔗 [aka.ms/semantic-kernel](https://aka.ms/semantic-kernel)

33. unclecode/crawl4ai ⭐ 21,724
AI-ready web crawling tailored for LLMs, AI agents, and data pipelines. Open source, flexible, and built for real-time performance, Crawl4AI empowers developers with unmatched speed, precision, and deployment ease.
🔗 [crawl4ai.com](https://crawl4ai.com)

34. microsoft/graphrag ⭐ 21,435
A modular graph-based Retrieval-Augmented Generation (RAG) system
🔗 [microsoft.github.io/graphrag](https://microsoft.github.io/graphrag/)

35. openai/chatgpt-retrieval-plugin ⭐ 21,102
The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.

36. haotian-liu/LLaVA ⭐ 20,999
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
🔗 [llava.hliu.cc](https://llava.hliu.cc)

37. stanfordnlp/dspy ⭐ 20,809
DSPy: The framework for programming, not prompting, language models
🔗 [dspy.ai](https://dspy.ai)

38. karpathy/minGPT ⭐ 20,757
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

39. khoj-ai/khoj ⭐ 20,282
Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI
🔗 [khoj.dev](https://khoj.dev)

40. unslothai/unsloth ⭐ 20,076
Finetune Llama 3.3, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 70% less memory
🔗 [unsloth.ai](https://unsloth.ai)

41. cinnamon/kotaemon ⭐ 19,717
An open-source RAG UI for chatting with your documents. Built with both end users and developers in mind
🔗 [cinnamon.github.io/kotaemon](https://cinnamon.github.io/kotaemon/)

42. mlc-ai/mlc-llm ⭐ 19,551
Universal LLM Deployment Engine with ML Compilation
🔗 [llm.mlc.ai](https://llm.mlc.ai/)

43. guidance-ai/guidance ⭐ 19,404
A guidance language for controlling large language models.

44. rasahq/rasa ⭐ 19,160
💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants
🔗 [rasa.com/docs/rasa](https://rasa.com/docs/rasa/)

45. tloen/alpaca-lora ⭐ 18,746
Instruct-tune LLaMA on consumer hardware

46. stitionai/devika ⭐ 18,714
Devika is an advanced AI software engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective.

47. deepset-ai/haystack ⭐ 18,469
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversatio...
🔗 [haystack.deepset.ai](https://haystack.deepset.ai)

48. karpathy/llama2.c ⭐ 17,789
Inference Llama 2 in one file of pure C

49. huggingface/peft ⭐ 16,892
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
🔗 [huggingface.co/docs/peft](https://huggingface.co/docs/peft)

50. deepseek-ai/DeepSeek-V3 ⭐ 16,791
A strong Mixture-of-Experts (MoE) language model with 671B total parameters with 37B activated for each token.

51. facebookresearch/codellama ⭐ 16,133
Inference code for CodeLlama models

52. berriai/litellm ⭐ 15,934
Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]
🔗 [docs.litellm.ai/docs](https://docs.litellm.ai/docs/)

53. facebookresearch/llama-recipes ⭐ 15,827
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta L...

54. thudm/ChatGLM2-6B ⭐ 15,748
ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型

55. transformeroptimus/SuperAGI ⭐ 15,677
<⚡️> SuperAGI - A dev-first open source autonomous AI agent framework. Enabling developers to build, manage & run useful autonomous agents quickly and reliably.
🔗 [superagi.com](https://superagi.com/)

56. idea-research/Grounded-Segment-Anything ⭐ 15,531
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
🔗 [arxiv.org/abs/2401.14159](https://arxiv.org/abs/2401.14159)

57. openai/evals ⭐ 15,292
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

58. qwenlm/Qwen ⭐ 15,212
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

59. mayooear/gpt4-pdf-chatbot-langchain ⭐ 15,001
GPT4 & LangChain Chatbot for large PDF docs
🔗 [www.youtube.com/watch?v=ih9pbgvvoo4](https://www.youtube.com/watch?v=ih9PBGVVOO4)

60. dao-ailab/flash-attention ⭐ 14,952
Fast and memory-efficient exact attention

61. fauxpilot/fauxpilot ⭐ 14,639
FauxPilot - an open-source alternative to GitHub Copilot server

62. mlc-ai/web-llm ⭐ 14,168
High-performance In-browser LLM Inference Engine
🔗 [webllm.mlc.ai](https://webllm.mlc.ai)

63. blinkdl/RWKV-LM ⭐ 12,975
RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RNN and transformer - great performance, linear time, constant space (no kv-cache), fast training, infinite ctx_len, and f...

64. microsoft/BitNet ⭐ 12,563
Official inference framework for 1-bit LLMs (e.g., BitNet b1.58). It offers a suite of optimized kernels that support fast and lossless inference of 1.58-bit models

65. vanna-ai/vanna ⭐ 12,538
🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄.
🔗 [vanna.ai/docs](https://vanna.ai/docs/)

66. paddlepaddle/PaddleNLP ⭐ 12,272
👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search, ❓ Question Answering, ℹ️ Information Extraction, 📄 Document Intelligence, 💌 Sentiment Analysis etc.
🔗 [paddlenlp.readthedocs.io](https://paddlenlp.readthedocs.io)

67. openlmlab/MOSS ⭐ 12,014
An open-source tool-augmented conversational language model from Fudan University
🔗 [txsun1997.github.io/blogs/moss.html](https://txsun1997.github.io/blogs/moss.html)

68. shishirpatil/gorilla ⭐ 11,643
Enables LLMs to use tools by invoking APIs. Given a query, Gorilla comes up with the semantically and syntactically correct API.
🔗 [gorilla.cs.berkeley.edu](https://gorilla.cs.berkeley.edu/)

69. h2oai/h2ogpt ⭐ 11,574
Private chat with local GPT with document, images, video, etc. 100% private, Apache 2.0. Supports oLLaMa, Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://gpt-docs.h2o.ai/
🔗 [h2o.ai](http://h2o.ai)

70. pathwaycom/llm-app ⭐ 11,452
Ready-to-run cloud templates for RAG, AI pipelines, and enterprise search with live data. 🐳Docker-friendly.⚡Always in sync with Sharepoint, Google Drive, S3, Kafka, PostgreSQL, real-time data APIs, and more.
🔗 [pathway.com/developers/templates](https://pathway.com/developers/templates/)

71. skyvern-ai/skyvern ⭐ 11,333
Skyvern automates browser-based workflows using LLMs and computer vision. It provides a simple API endpoint to fully automate manual workflows, replacing brittle or unreliable automation solutions.
🔗 [www.skyvern.com](https://www.skyvern.com)

72. lightning-ai/litgpt ⭐ 11,078
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
🔗 [lightning.ai](https://lightning.ai)

74. microsoft/LoRA ⭐ 11,062
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
🔗 [arxiv.org/abs/2106.09685](https://arxiv.org/abs/2106.09685)

75. nvidia/Megatron-LM ⭐ 11,032
Ongoing research training transformer models at scale
🔗 [docs.nvidia.com/megatron-core/developer-guide/latest/user-guide/index.html#quick-start](https://docs.nvidia.com/megatron-core/developer-guide/latest/user-guide/index.html#quick-start)

76. databrickslabs/dolly ⭐ 10,820
Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform
🔗 [www.databricks.com/blog/2023/03/24/hello-dolly-democratizing-magic-chatgpt-open-models.html](https://www.databricks.com/blog/2023/03/24/hello-dolly-democratizing-magic-chatgpt-open-models.html)

77. google-research/vision_transformer ⭐ 10,744
Vision Transformer and MLP-Mixer Architectures

78. lvwerra/trl ⭐ 10,524
Train transformer language models with reinforcement learning.
🔗 [hf.co/docs/trl](http://hf.co/docs/trl)

79. nirdiamant/RAG_Techniques ⭐ 10,378
The most comprehensive and dynamic collection of Retrieval-Augmented Generation (RAG) tutorials available today. This repository serves as a hub for cutting-edge techniques aimed at enhancing the accuracy, efficiency, and contextual richness of RAG systems.

80. outlines-dev/outlines ⭐ 10,247
Structured Text Generation from LLMs
🔗 [dottxt-ai.github.io/outlines](https://dottxt-ai.github.io/outlines/)

81. artidoro/qlora ⭐ 10,146
QLoRA: Efficient Finetuning of Quantized LLMs
🔗 [arxiv.org/abs/2305.14314](https://arxiv.org/abs/2305.14314)

82. mistralai/mistral-inference ⭐ 9,838
Official inference library for Mistral models
🔗 [mistral.ai](https://mistral.ai/)

83. microsoft/promptflow ⭐ 9,739
Build high-quality LLM apps - from prototyping, testing to production deployment and monitoring.
🔗 [microsoft.github.io/promptflow](https://microsoft.github.io/promptflow/)

84. andrewyng/aisuite ⭐ 9,610
Simple, unified interface to multiple Generative AI providers. aisuite makes it easy for developers to use multiple LLMs through a standardized interface.

85. prompt-toolkit/python-prompt-toolkit ⭐ 9,467
Library for building powerful interactive command line applications in Python
🔗 [python-prompt-toolkit.readthedocs.io](https://python-prompt-toolkit.readthedocs.io/)

86. blinkdl/ChatRWKV ⭐ 9,455
ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.

87. mshumer/gpt-prompt-engineer ⭐ 9,420
Simply input a description of your task and some test cases, and the system will generate, test, and rank a multitude of prompts to find the ones that perform the best.

88. karpathy/minbpe ⭐ 9,306
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

89. anthropics/anthropic-cookbook ⭐ 9,088
Provides code and guides designed to help developers build with Claude, offering copy-able code snippets that you can easily integrate into your own projects.

90. jxnl/instructor ⭐ 8,799
Instructor is a Python library that makes it a breeze to work with structured outputs from large language models (LLMs). Built on top of Pydantic, it provides a simple, transparent, and user-friendly API to manage validation, retries, and streaming responses.
🔗 [python.useinstructor.com](https://python.useinstructor.com/)

91. swivid/F5-TTS ⭐ 8,717
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
🔗 [arxiv.org/abs/2410.06885](https://arxiv.org/abs/2410.06885)

92. apple/ml-ferret ⭐ 8,536
Ferret: Refer and Ground Anything Anywhere at Any Granularity

93. llmware-ai/llmware ⭐ 8,395
Unified framework for building enterprise RAG pipelines with small, specialized models
🔗 [llmware-ai.github.io/llmware](https://llmware-ai.github.io/llmware/)

94. abetlen/llama-cpp-python ⭐ 8,384
Simple Python bindings for @ggerganov's llama.cpp library.
🔗 [llama-cpp-python.readthedocs.io](https://llama-cpp-python.readthedocs.io)

95. thudm/CodeGeeX ⭐ 8,326
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
🔗 [codegeex.cn](https://codegeex.cn)

96. optimalscale/LMFlow ⭐ 8,321
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
🔗 [optimalscale.github.io/lmflow](https://optimalscale.github.io/LMFlow/)

97. eleutherai/gpt-neo ⭐ 8,254
An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.
🔗 [www.eleuther.ai](https://www.eleuther.ai)

98. axolotl-ai-cloud/axolotl ⭐ 8,236
Go ahead and axolotl questions
🔗 [axolotl-ai-cloud.github.io/axolotl](https://axolotl-ai-cloud.github.io/axolotl/)

99. jzhang38/TinyLlama ⭐ 8,085
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

100. sjtu-ipads/PowerInfer ⭐ 8,041
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

101. lianjiatech/BELLE ⭐ 8,014
BELLE: Be Everyone's Large Language model Engine (开源中文对话大模型)

102. vaibhavs10/insanely-fast-whisper ⭐ 7,902
An opinionated CLI to transcribe Audio files w/ Whisper on-device! Powered by 🤗 Transformers, Optimum & flash-attn

103. explodinggradients/ragas ⭐ 7,795
Supercharge Your LLM Application Evaluations 🚀
🔗 [docs.ragas.io](https://docs.ragas.io)

104. plachtaa/VALL-E-X ⭐ 7,752
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available at https://plachtaa.github.io/vallex/

105. thudm/GLM-130B ⭐ 7,679
GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)

106. chainlit/chainlit ⭐ 7,568
Build Conversational AI in minutes ⚡️
🔗 [docs.chainlit.io](https://docs.chainlit.io)

107. sweepai/sweep ⭐ 7,477
Sweep: open-source AI-powered Software Developer for small features and bug fixes.
🔗 [sweep.dev](https://sweep.dev)

108. openlm-research/open_llama ⭐ 7,406
OpenLLaMA: An Open Reproduction of LLaMA

109. eleutherai/lm-evaluation-harness ⭐ 7,393
A framework for few-shot evaluation of language models.
🔗 [www.eleuther.ai](https://www.eleuther.ai)

110. bigcode-project/starcoder ⭐ 7,346
Home of StarCoder: fine-tuning & inference!

111. anthropics/anthropic-quickstarts ⭐ 7,340
A collection of projects designed to help developers quickly get started with building applications using the Anthropic API. Each quickstart provides a foundation that you can easily build upon and customize for your specific needs.

112. zilliztech/GPTCache ⭐ 7,327
Semantic cache for LLMs. Fully integrated with LangChain and llama_index.
🔗 [gptcache.readthedocs.io](https://gptcache.readthedocs.io)

113. sgl-project/sglang ⭐ 7,134
SGLang is a fast serving framework for large language models and vision language models.
🔗 [sgl-project.github.io](https://sgl-project.github.io/)

114. eleutherai/gpt-neox ⭐ 7,027
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
🔗 [www.eleuther.ai](https://www.eleuther.ai/)

115. bhaskatripathi/pdfGPT ⭐ 7,024
PDF GPT allows you to chat with the contents of your PDF file by using GPT capabilities. The most effective open source solution to turn your PDF files into a chatbot!
🔗 [huggingface.co/spaces/bhaskartripathi/pdfchatter](https://huggingface.co/spaces/bhaskartripathi/pdfChatter)

116. skypilot-org/skypilot ⭐ 7,008
SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 12+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.
🔗 [docs.skypilot.co](https://docs.skypilot.co/)

117. apple/corenet ⭐ 6,994
CoreNet is a deep neural network toolkit that allows researchers and engineers to train standard and novel small and large-scale models for a variety of tasks, including foundation models (e.g., CLIP and LLM), object classification, object detection, and semantic segmentation.

118. mit-han-lab/streaming-llm ⭐ 6,735
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
🔗 [arxiv.org/abs/2309.17453](https://arxiv.org/abs/2309.17453)

119. whitead/paper-qa ⭐ 6,697
High accuracy RAG for answering questions from scientific documents with citations

120. future-house/paper-qa ⭐ 6,697
High-accuracy retrieval augmented generation (RAG) on PDFs or text files, with a focus on the scientific literature

121. internlm/InternLM ⭐ 6,625
Official release of InternLM2.5 base and chat models. 1M context support
🔗 [internlm.intern-ai.org.cn](https://internlm.intern-ai.org.cn/)

122. weaviate/Verba ⭐ 6,573
Retrieval Augmented Generation (RAG) chatbot powered by Weaviate

123. langchain-ai/opengpts ⭐ 6,526
An open source effort to create a similar experience to OpenAI's GPTs and Assistants API.

124. run-llama/rags ⭐ 6,365
RAGs is a Streamlit app that lets you create a RAG pipeline from a data source using natural language.

125. nat/openplayground ⭐ 6,302
An LLM playground you can run on your laptop

126. vikhyat/moondream ⭐ 6,270
A tiny open-source computer-vision language model designed to run efficiently on edge devices
🔗 [moondream.ai](https://moondream.ai)

127. lightning-ai/lit-llama ⭐ 6,018
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.

128. minedojo/Voyager ⭐ 5,787
An Open-Ended Embodied Agent with Large Language Models
🔗 [voyager.minedojo.org](https://voyager.minedojo.org/)

129. pytorch-labs/gpt-fast ⭐ 5,744
Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.

130. langchain-ai/chat-langchain ⭐ 5,601
Locally hosted chatbot specifically focused on question answering over the LangChain documentation
🔗 [chat.langchain.com](https://chat.langchain.com)

131. lyogavin/airllm ⭐ 5,524
AirLLM optimizes inference memory usage, allowing 70B large language models to run inference on a single 4GB GPU card without quantization, distillation and pruning. And you can run 405B Llama3.1 on 8GB vram now.

132. microsoft/promptbase ⭐ 5,485
promptbase is an evolving collection of resources, best practices, and example scripts for eliciting the best performance from foundation models.

133. simonw/llm ⭐ 5,400
Access large language models from the command-line
🔗 [llm.datasette.io](https://llm.datasette.io)

134. dsdanielpark/Bard-API ⭐ 5,309
An unofficial Python package that returns Google Bard responses using a cookie value.
🔗 [pypi.org/project/bardapi](https://pypi.org/project/bardapi/)

135. qwenlm/Qwen-VL ⭐ 5,278
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

136. promptfoo/promptfoo ⭐ 5,112
Test your prompts, agents, and RAGs. Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and CI/CD integration.
🔗 [promptfoo.dev](https://promptfoo.dev)

137. cg123/mergekit ⭐ 5,055
Tools for merging pretrained large language models.

138. arcee-ai/mergekit ⭐ 5,055
Tools for merging pretrained large language models.

139. allenai/OLMo ⭐ 4,988
OLMo is a repository for training and using AI2's state-of-the-art open language models. It is designed by scientists, for scientists.
🔗 [allenai.org/olmo](https://allenai.org/olmo)

140. modelscope/ms-swift ⭐ 4,932
Use PEFT or Full-parameter to finetune 400+ LLMs (Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, ...) or 150+ MLLMs (Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2.5, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL2, Phi3.5-Vision, GOT-OCR2, ...).
🔗 [swift.readthedocs.io/zh-cn/latest](https://swift.readthedocs.io/zh-cn/latest/)

141. openbmb/ToolBench ⭐ 4,920
[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.
🔗 [openbmb.github.io/toolbench](https://openbmb.github.io/ToolBench/)

142. microsoft/LLMLingua ⭐ 4,786
[EMNLP'23, ACL'24] To speed up LLM inference and enhance LLMs' perception of key information, LLMLingua compresses the prompt and KV-Cache, achieving up to 20x compression with minimal performance loss.
🔗 [llmlingua.com](https://llmlingua.com/)

143. togethercomputer/RedPajama-Data ⭐ 4,613
The RedPajama-Data repository contains code for preparing large datasets for training large language models.

144. 1rgs/jsonformer ⭐ 4,516
A Bulletproof Way to Generate Structured JSON from Language Models

145. open-compass/opencompass ⭐ 4,438
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.
🔗 [opencompass.org.cn](https://opencompass.org.cn/)

146. kyegomez/tree-of-thoughts ⭐ 4,409
Plug in and Play Implementation of Tree of Thoughts: Deliberate Problem Solving with Large Language Models that Elevates Model Reasoning by at least 70%
🔗 [discord.gg/qutxnk2nmf](https://discord.gg/qUtxnK2NMf)

147. microsoft/BioGPT ⭐ 4,345
Implementation of BioGPT: Generative Pre-trained Transformer for Biomedical Text Generation and Mining

148. guardrails-ai/guardrails ⭐ 4,334
Open-source Python package for specifying structure and type, validating and correcting the outputs of large language models (LLMs)
🔗 [www.guardrailsai.com/docs](https://www.guardrailsai.com/docs)

149. nvidia/NeMo-Guardrails ⭐ 4,295
NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.

150. instruction-tuning-with-gpt-4/GPT-4-LLM ⭐ 4,251
Instruction Tuning with GPT-4
🔗 [instruction-tuning-with-gpt-4.github.io](https://instruction-tuning-with-gpt-4.github.io/)

151. yizhongw/self-instruct ⭐ 4,233
Aligning pretrained language models with instruction data generated by themselves.

152. linkedin/Liger-Kernel ⭐ 4,123
Efficient Triton Kernels for LLM Training
🔗 [arxiv.org/pdf/2410.10989](https://arxiv.org/pdf/2410.10989)

153. pipecat-ai/pipecat ⭐ 4,119
Open Source framework for voice and multimodal conversational AI

154. h2oai/h2o-llmstudio ⭐ 4,092
H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://docs.h2o.ai/h2o-llmstudio/
🔗 [h2o.ai](https://h2o.ai)

155. katanaml/sparrow ⭐ 4,090
Sparrow is a solution for efficient data extraction and processing from various documents and images like invoices and receipts
🔗 [katanaml.io](https://katanaml.io)

156. mshumer/gpt-llm-trainer ⭐ 4,036
Input a description of your task, and the system will generate a dataset, parse it, and fine-tune a LLaMA 2 model for you

157. ragapp/ragapp ⭐ 3,983
The easiest way to use Agentic RAG in any enterprise

158. ravenscroftj/turbopilot ⭐ 3,821
Turbopilot is an open source large-language-model based code completion engine that runs locally on CPU

159. turboderp/exllamav2 ⭐ 3,820
A fast inference library for running LLMs locally on modern consumer-class GPUs

160. microsoft/LMOps ⭐ 3,783
General technology for enabling AI capabilities w/ LLMs and MLLMs
🔗 [aka.ms/generalai](https://aka.ms/GeneralAI)

161. eth-sri/lmql ⭐ 3,760
A language for constraint-guided and efficient LLM programming.
🔗 [lmql.ai](https://lmql.ai)

162. mmabrouk/llm-workflow-engine ⭐ 3,677
Power CLI and Workflow manager for LLMs (core package)

163. luodian/Otter ⭐ 3,574
🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
🔗 [otter-ntu.github.io](https://otter-ntu.github.io/)

164. agiresearch/AIOS ⭐ 3,545
AIOS, a Large Language Model (LLM) Agent operating system, embeds large language model into Operating Systems (OS) as the brain of the OS, enabling an operating system "with soul" -- an important step towards AGI.
🔗 [aios.foundation](https://aios.foundation)

165. minimaxir/simpleaichat ⭐ 3,496
Python package for easily interfacing with chat apps, with robust features and minimal code complexity.

166. truefoundry/cognita ⭐ 3,484
RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry
🔗 [cognita.truefoundry.com](https://cognita.truefoundry.com)

167. defog-ai/sqlcoder ⭐ 3,482
SoTA LLM for converting natural language questions to SQL queries

168. lm-sys/RouteLLM ⭐ 3,412
A framework for serving and evaluating LLM routers - save LLM costs without compromising quality

169. iryna-kondr/scikit-llm ⭐ 3,398
Seamlessly integrate LLMs into scikit-learn.
🔗 [beastbyte.ai](https://beastbyte.ai/)

170. minimaxir/gpt-2-simple โญ 3,398
Python package to easily retrain OpenAI's GPT-2 text-generating model on new texts

171. next-gpt/NExT-GPT โญ 3,377
Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model
๐Ÿ”— [next-gpt.github.io](https://next-gpt.github.io/)

172. deep-diver/LLM-As-Chatbot โญ 3,304
LLM as a Chatbot Service

173. jaymody/picoGPT โญ 3,290
An unnecessarily tiny implementation of GPT-2 in NumPy.

174. bclavie/RAGatouille โญ 3,168
Bridging the gap between state-of-the-art research and alchemical RAG pipeline practices.

175. marker-inc-korea/AutoRAG โญ 3,107
AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation
๐Ÿ”— [auto-rag.com](https://auto-rag.com/)

176. microsoft/torchscale โญ 3,038
Foundation Architecture for (M)LLMs
๐Ÿ”— [aka.ms/generalai](https://aka.ms/GeneralAI)

177. huggingface/text-embeddings-inference โญ 3,015
A blazing fast inference solution for text embeddings models
๐Ÿ”— [huggingface.co/docs/text-embeddings-inference/quick_tour](https://huggingface.co/docs/text-embeddings-inference/quick_tour)

178. baichuan-inc/Baichuan-13B โญ 2,978
A 13B large language model developed by Baichuan Intelligent Technology
๐Ÿ”— [huggingface.co/baichuan-inc/baichuan-13b-chat](https://huggingface.co/baichuan-inc/Baichuan-13B-Chat)

179. li-plus/chatglm.cpp โญ 2,962
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)

180. freedomintelligence/LLMZoo ⭐ 2,947
⚡ LLM Zoo is a project that provides data, models, and evaluation benchmarks for large language models. ⚡

181. cohere-ai/cohere-toolkit โญ 2,902
Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.

182. meta-llama/PurpleLlama โญ 2,817
Set of tools to assess and improve LLM security.

183. verazuo/jailbreak_llms ⭐ 2,813
Official repo for the ACM CCS 2024 paper "'Do Anything Now': Characterizing and Evaluating In-The-Wild Jailbreak Prompts"
🔗 [jailbreak-llms.xinyueshen.me](https://jailbreak-llms.xinyueshen.me/)

184. juncongmoo/pyllama โญ 2,806
LLaMA: Open and Efficient Foundation Language Models

185. mistralai/mistral-finetune โญ 2,799
A light-weight codebase that enables memory-efficient and performant finetuning of Mistral's models. It is based on LoRA.

186. hegelai/prompttools โญ 2,755
Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chroma, Weaviate, LanceDB).
๐Ÿ”— [prompttools.readthedocs.io](http://prompttools.readthedocs.io)

187. alpha-vllm/LLaMA2-Accessory โญ 2,745
An Open-source Toolkit for LLM Development
๐Ÿ”— [llama2-accessory.readthedocs.io](https://llama2-accessory.readthedocs.io/)

188. paperswithcode/galai โญ 2,692
Model API for GALACTICA

189. lightning-ai/LitServe โญ 2,677
Lightning-fast serving engine for any AI model of any size. Flexible. Easy. Enterprise-scale.
๐Ÿ”— [lightning.ai/docs/litserve](https://lightning.ai/docs/litserve)

190. mit-han-lab/llm-awq โญ 2,644
AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

191. cheshire-cat-ai/core โญ 2,543
AI agent microservice
๐Ÿ”— [cheshirecat.ai](https://cheshirecat.ai)

192. databricks/dbrx โญ 2,525
Code examples and resources for DBRX, a large language model developed by Databricks
๐Ÿ”— [www.databricks.com](https://www.databricks.com/)

193. noahshinn/reflexion โญ 2,522
[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning

194. nirdiamant/Prompt_Engineering โญ 2,463
A comprehensive collection of tutorials and implementations for Prompt Engineering techniques, ranging from fundamental concepts to advanced strategies.

195. sylphai-inc/AdalFlow ⭐ 2,445
Unified auto-differentiative framework for both zero-shot prompt optimization and few-shot optimization. It advances existing auto-optimization research, including TextGrad and DSPy
🔗 [adalflow.sylph.ai](http://adalflow.sylph.ai/)

196. ofa-sys/OFA โญ 2,445
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework

197. young-geng/EasyLM ⭐ 2,435
Large language models (LLMs) made easy. EasyLM is a one-stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.

198. pytorch/executorch โญ 2,369
An end-to-end solution for enabling on-device inference capabilities across mobile and edge devices including wearables, embedded devices and microcontrollers. It is part of the PyTorch Edge ecosystem and enables efficient deployment of PyTorch models to edge devices.
๐Ÿ”— [pytorch.org/executorch](https://pytorch.org/executorch/)

199. civitai/sd_civitai_extension โญ 2,347
All of the Civitai models inside Automatic 1111 Stable Diffusion Web UI

200. intel/neural-compressor โญ 2,291
SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
๐Ÿ”— [intel.github.io/neural-compressor](https://intel.github.io/neural-compressor/)

201. predibase/lorax โญ 2,282
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
๐Ÿ”— [loraexchange.ai](https://loraexchange.ai)

202. janhq/cortex.cpp โญ 2,270
Cortex is a Local AI API Platform that is used to run and customize LLMs.
๐Ÿ”— [cortex.so](https://cortex.so)

203. truera/trulens โญ 2,262
Evaluation and Tracking for LLM Experiments
๐Ÿ”— [www.trulens.org](https://www.trulens.org/)

204. evolvinglmms-lab/lmms-eval โญ 2,260
Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.
๐Ÿ”— [lmms-lab.framer.ai](https://lmms-lab.framer.ai/)

205. spcl/graph-of-thoughts โญ 2,234
Official Implementation of "Graph of Thoughts: Solving Elaborate Problems with Large Language Models"
๐Ÿ”— [arxiv.org/pdf/2308.09687.pdf](https://arxiv.org/pdf/2308.09687.pdf)

206. openai/finetune-transformer-lm โญ 2,175
Code and model for the paper "Improving Language Understanding by Generative Pre-Training"
๐Ÿ”— [s3-us-west-2.amazonaws.com/openai-assets/research-covers/language-unsupervised/language_understanding_paper.pdf](https://s3-us-west-2.amazonaws.com/openai-assets/research-covers/language-unsupervised/language_understanding_paper.pdf)

207. openai/simple-evals โญ 2,119
Lightweight library for evaluating language models

208. tairov/llama2.mojo ⭐ 2,107
Inference Llama 2 in one file of pure 🔥
🔗 [www.modular.com/blog/community-spotlight-how-i-built-llama2-by-aydyn-tairov](https://www.modular.com/blog/community-spotlight-how-i-built-llama2-by-aydyn-tairov)

209. openai/image-gpt โญ 2,051
Archived. Code and models from the paper "Generative Pretraining from Pixels"

210. azure-samples/graphrag-accelerator โญ 2,044
One-click deploy of a Knowledge Graph powered RAG (GraphRAG) in Azure
๐Ÿ”— [github.com/microsoft/graphrag](https://github.com/microsoft/graphrag)

211. ist-daslab/gptq โญ 1,987
Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".
๐Ÿ”— [arxiv.org/abs/2210.17323](https://arxiv.org/abs/2210.17323)

212. lucidrains/toolformer-pytorch โญ 1,985
Implementation of Toolformer, Language Models That Can Use Tools, by MetaAI

213. novasky-ai/SkyThought โญ 1,984
Sky-T1: Train your own O1 preview model within $450
๐Ÿ”— [novasky-ai.github.io](https://novasky-ai.github.io/)

214. neulab/prompt2model โญ 1,975
prompt2model - Generate Deployable Models from Natural Language Instructions

215. openai/gpt-2-output-dataset โญ 1,951
Dataset of GPT-2 outputs for research in detection, biases, and more

216. microsoft/Megatron-DeepSpeed โญ 1,949
Ongoing research training transformer language models at scale, including: BERT & GPT-2

217. epfllm/meditron โญ 1,926
Meditron is a suite of open-source medical Large Language Models (LLMs).
๐Ÿ”— [huggingface.co/epfl-llm](https://huggingface.co/epfl-llm)

218. akariasai/self-rag โญ 1,919
This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.
๐Ÿ”— [selfrag.github.io](https://selfrag.github.io/)

219. facebookresearch/chameleon โญ 1,897
Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
๐Ÿ”— [arxiv.org/abs/2405.09818](https://arxiv.org/abs/2405.09818)

220. casper-hansen/AutoAWQ โญ 1,866
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:
๐Ÿ”— [casper-hansen.github.io/autoawq](https://casper-hansen.github.io/AutoAWQ/)

221. agenta-ai/agenta โญ 1,847
The open-source LLMOps platform: prompt playground, prompt management, LLM evaluation, and LLM Observability all in one place.
๐Ÿ”— [www.agenta.ai](http://www.agenta.ai)

222. argilla-io/distilabel โญ 1,844
Distilabel is the framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.
๐Ÿ”— [distilabel.argilla.io](https://distilabel.argilla.io)

223. minimaxir/aitextgen โญ 1,842
A robust Python tool for text-based AI training and generation using GPT-2.
๐Ÿ”— [docs.aitextgen.io](https://docs.aitextgen.io)

224. openai/gpt-discord-bot โญ 1,793
Example Discord bot written in Python that uses the completions API to have conversations with the `text-davinci-003` model, and the moderations API to filter the messages.

225. ray-project/llm-applications โญ 1,741
A comprehensive guide to building RAG-based LLM applications for production.

226. flashinfer-ai/flashinfer ⭐ 1,739
FlashInfer is a library and kernel generator for Large Language Models that provides high-performance implementations of LLM GPU kernels such as FlashAttention, SparseAttention, PagedAttention, and Sampling
🔗 [flashinfer.ai](https://flashinfer.ai)

227. noamgat/lm-format-enforcer โญ 1,657
Enforce the output format (JSON Schema, Regex etc) of a language model

228. ruc-nlpir/FlashRAG โญ 1,576
FlashRAG is a Python toolkit for the reproduction and development of RAG research. Our toolkit includes 36 pre-processed benchmark RAG datasets and 15 state-of-the-art RAG algorithms.
๐Ÿ”— [arxiv.org/abs/2405.13576](https://arxiv.org/abs/2405.13576)

229. jina-ai/thinkgpt ⭐ 1,561
Agent techniques to augment your LLM and push it beyond its limits

230. qwenlm/Qwen-Audio ⭐ 1,541
The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.

231. huggingface/smollm โญ 1,517
Everything about the SmolLM & SmolLM2 family of models
๐Ÿ”— [huggingface.co/huggingfacetb](https://huggingface.co/HuggingFaceTB)

232. agentops-ai/tokencost โญ 1,514
Easy token price estimates for 400+ LLMs. TokenOps.
๐Ÿ”— [agentops.ai](https://agentops.ai)

233. meetkai/functionary โญ 1,485
Chat language model that can use tools and interpret the results

234. facebookresearch/large_concept_model โญ 1,471
Large Concept Models: Language modeling in a sentence representation space

235. cstankonrad/long_llama โญ 1,455
LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA and fine-tuned with the Focused Transformer (FoT) method.

236. farizrahman4u/loopgpt โญ 1,442
Re-implementation of Auto-GPT as a python package, written with modularity and extensibility in mind.

237. run-llama/llama-lab โญ 1,430
Llama Lab is a repo dedicated to building cutting-edge projects using LlamaIndex

238. roboflow/maestro ⭐ 1,423
Streamline the fine-tuning process for multimodal models: PaliGemma, Florence-2, and Qwen2-VL
🔗 [maestro.roboflow.com](https://maestro.roboflow.com)

239. chatarena/chatarena ⭐ 1,390
ChatArena (or Chat Arena) provides multi-agent language game environments for LLMs. The goal is to develop the communication and collaboration capabilities of AIs.

240. huggingface/nanotron โญ 1,370
Minimalistic large language model 3D-parallelism training

241. explosion/spacy-transformers ⭐ 1,357
🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy
🔗 [spacy.io/usage/embeddings-transformers](https://spacy.io/usage/embeddings-transformers)

242. bigscience-workshop/Megatron-DeepSpeed โญ 1,351
Ongoing research training transformer language models at scale, including: BERT & GPT-2

243. modelcontextprotocol/python-sdk โญ 1,307
The Model Context Protocol allows applications to provide context for LLMs in a standardized way, separating the concerns of providing context from the actual LLM interaction.
๐Ÿ”— [modelcontextprotocol.io](https://modelcontextprotocol.io)

244. karpathy/nano-llama31 ⭐ 1,284
This repo is to Llama 3.1 what nanoGPT is to GPT-2, i.e. it is a minimal, dependency-free implementation of the Llama 3.1 architecture

245. ray-project/ray-llm โญ 1,242
RayLLM - LLMs on Ray
๐Ÿ”— [aviary.anyscale.com](https://aviary.anyscale.com)

246. srush/MiniChain โญ 1,220
A tiny library for coding with large language models.
๐Ÿ”— [srush-minichain.hf.space](https://srush-minichain.hf.space/)

247. answerdotai/rerankers โญ 1,219
Welcome to rerankers! Our goal is to provide users with a simple API to use any reranking models.

248. facebookresearch/MobileLLM โญ 1,210
Training code of MobileLLM introduced in our work: "MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases"

249. mlfoundations/dclm โญ 1,195
DataComp for Language Models

250. keirp/automatic_prompt_engineer โญ 1,195
Large Language Models Are Human-Level Prompt Engineers

251. hao-ai-lab/LookaheadDecoding โญ 1,171
Break the Sequential Dependency of LLM Inference Using Lookahead Decoding
๐Ÿ”— [arxiv.org/abs/2402.02057](https://arxiv.org/abs/2402.02057)

252. explosion/spacy-llm ⭐ 1,164
🦙 Integrating LLMs into structured NLP pipelines
🔗 [spacy.io/usage/large-language-models](https://spacy.io/usage/large-language-models)

253. ibm/Dromedary โญ 1,131
Dromedary: towards helpful, ethical and reliable LLMs.

254. lupantech/chameleon-llm ⭐ 1,096
Code for "Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models".
🔗 [chameleon-llm.github.io](https://chameleon-llm.github.io)

255. rlancemartin/auto-evaluator โญ 1,066
Evaluation tool for LLM QA chains
๐Ÿ”— [autoevaluator.langchain.com](https://autoevaluator.langchain.com/)

256. topoteretes/cognee โญ 1,028
Reliable LLM Memory for AI Applications and AI Agents
๐Ÿ”— [www.cognee.ai](https://www.cognee.ai)

257. ctlllll/LLM-ToolMaker โญ 1,026
Large Language Models as Tool Makers

258. microsoft/Llama-2-Onnx โญ 1,025
A Microsoft optimized version of the Llama 2 model, available from Meta

259. nomic-ai/pygpt4all ⭐ 1,022
Officially supported Python bindings for llama.cpp + gpt4all
🔗 [nomic-ai.github.io/pygpt4all](https://nomic-ai.github.io/pygpt4all/)

260. cerebras/modelzoo โญ 992
Examples of common deep learning models that can be trained on Cerebras hardware

261. pinecone-io/canopy โญ 991
Retrieval Augmented Generation (RAG) framework and context engine powered by Pinecone
๐Ÿ”— [www.pinecone.io](https://www.pinecone.io/)

262. ajndkr/lanarky โญ 984
The web framework for building LLM microservices
๐Ÿ”— [lanarky.ajndkr.com](https://lanarky.ajndkr.com/)

263. likejazz/llama3.np โญ 977
llama3.np is a pure NumPy implementation for Llama 3 model.

264. huggingface/lighteval โญ 936
LightEval is a lightweight LLM evaluation suite that Hugging Face has been using internally with the recently released LLM data processing library datatrove and LLM training library nanotron.

265. huggingface/evaluation-guidebook โญ 931
Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard and designing lighteval!

266. nirdiamant/Controllable-RAG-Agent โญ 925
An advanced Retrieval-Augmented Generation (RAG) solution designed to tackle complex questions that simple semantic similarity-based retrieval cannot solve

267. huggingface/optimum-nvidia โญ 907
Optimum-NVIDIA delivers the best inference performance on the NVIDIA platform through Hugging Face. Run LLaMA 2 at 1,200 tokens/second (up to 28x faster than the framework)

268. datadreamer-dev/DataDreamer โญ 887
DataDreamer is a powerful open-source Python library for prompting, synthetic data generation, and training workflows. It is designed to be simple, extremely efficient, and research-grade.
๐Ÿ”— [datadreamer.dev](https://datadreamer.dev)

269. soulter/hugging-chat-api ⭐ 886
HuggingChat Python API 🤗

270. muennighoff/sgpt โญ 857
SGPT: GPT Sentence Embeddings for Semantic Search
๐Ÿ”— [arxiv.org/abs/2202.08904](https://arxiv.org/abs/2202.08904)

271. prometheus-eval/prometheus-eval ⭐ 834
Evaluate your LLM's response with Prometheus and GPT4 💯

272. langchain-ai/langsmith-cookbook โญ 812
LangSmith is a platform for building production-grade LLM applications.
๐Ÿ”— [langsmith-cookbook.vercel.app](https://langsmith-cookbook.vercel.app)

273. junruxiong/IncarnaMind โญ 786
Connect and chat with your multiple documents (pdf and txt) through GPT 3.5, GPT-4 Turbo, Claude and Local Open-Source LLMs
๐Ÿ”— [www.incarnamind.com](https://www.incarnamind.com)

274. oliveirabruno01/babyagi-asi โญ 784
BabyAGI: an Autonomous and Self-Improving agent, or BASI

275. nousresearch/Hermes-Function-Calling โญ 772
Code for the Hermes Pro Large Language Model to perform function calling based on the provided schema. It allows users to query the model and retrieve information related to stock prices, company fundamentals, financial statements

276. wandb/weave โญ 765
Weave is a toolkit for developing AI-powered applications, built by Weights & Biases.
๐Ÿ”— [wandb.me/weave](https://wandb.me/weave)

277. opengenerativeai/GenossGPT ⭐ 750
One API for all LLMs, either private or public (Anthropic, Llama V2, GPT 3.5/4, Vertex, GPT4ALL, HuggingFace ...) 🌈🐂 Replace OpenAI GPT with any LLM in your app with one line.
🔗 [genoss.ai](https://genoss.ai/)

278. opengvlab/OmniQuant โญ 749
[ICLR2024 spotlight] OmniQuant is a simple and powerful quantization technique for LLMs.

279. salesforce/xgen โญ 718
Salesforce open-source LLMs with 8k sequence length.

280. squeezeailab/SqueezeLLM โญ 667
[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization
๐Ÿ”— [arxiv.org/abs/2306.07629](https://arxiv.org/abs/2306.07629)

281. tag-research/TAG-Bench โญ 632
Table-Augmented Generation (TAG) is a unified and general-purpose paradigm for answering natural language questions over databases
๐Ÿ”— [arxiv.org/pdf/2408.14717](https://arxiv.org/pdf/2408.14717)

282. lupantech/ScienceQA โญ 620
Data and code for NeurIPS 2022 Paper "Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question Answering".

283. tsinghuadatabasegroup/DB-GPT โญ 592
LLM As Database Administrator
๐Ÿ”— [dbgpt.dbmind.cn](http://dbgpt.dbmind.cn/)

284. zhudotexe/kani ⭐ 563
kani (カニ) is a highly hackable microframework for chat-based language models with tool use/function calling. (NLP-OSS @ EMNLP 2023)
🔗 [kani.readthedocs.io](https://kani.readthedocs.io)

285. microsoft/VPTQ โญ 550
Extreme Low-bit Vector Post-Training Quantization for Large Language Models

286. modal-labs/llm-finetuning โญ 550
Guide for fine-tuning Llama/Mistral/CodeLlama models and more

287. mlc-ai/xgrammar โญ 543
XGrammar is an open-source library for efficient, flexible, and portable structured generation. It supports general context-free grammar to enable a broad range of structures while bringing careful system optimizations to enable fast executions.
๐Ÿ”— [xgrammar.mlc.ai](https://xgrammar.mlc.ai/)

288. hazyresearch/ama_prompting โญ 543
Ask Me Anything language model prompting

289. minishlab/model2vec โญ 542
Model2Vec is a technique to turn any sentence transformer into a really small static model, reducing model size by 15x and making the models up to 500x faster, with a small drop in performance
๐Ÿ”— [minishlab.github.io](https://minishlab.github.io/)

290. declare-lab/instruct-eval โญ 537
This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks.
๐Ÿ”— [declare-lab.github.io/instruct-eval](https://declare-lab.github.io/instruct-eval/)

291. magnivorg/prompt-layer-library ⭐ 536
🍰 PromptLayer - Maintain a log of your prompts and OpenAI API requests. Track, debug, and replay old completions.
🔗 [www.promptlayer.com](https://www.promptlayer.com)

292. vahe1994/SpQR โญ 535
Quantization algorithm and the model evaluation code for SpQR method for LLM compression

293. continuum-llms/chatgpt-memory ⭐ 523
Scales the ChatGPT API to multiple simultaneous sessions with infinite contextual and adaptive memory powered by GPT and a Redis datastore.

294. eugeneyan/obsidian-copilot ⭐ 520
🤖 A prototype assistant for writing and thinking
🔗 [eugeneyan.com/writing/obsidian-copilot](https://eugeneyan.com/writing/obsidian-copilot/)

295. hazyresearch/H3 โญ 516
Language Modeling with the H3 State Space Model

296. judahpaul16/gpt-home โญ 494
ChatGPT at home! Basically a better Google Nest Hub or Amazon Alexa home assistant. Built on the Raspberry Pi using the OpenAI API.
๐Ÿ”— [hub.docker.com/r/judahpaul/gpt-home](https://hub.docker.com/r/judahpaul/gpt-home)

297. huggingface/text-clustering โญ 483
Easily embed, cluster and semantically label text datasets

298. stanford-oval/suql โญ 233
SUQL: Conversational Search over Structured and Unstructured Data with LLMs
๐Ÿ”— [arxiv.org/abs/2311.09818](https://arxiv.org/abs/2311.09818)

299. dottxt-ai/outlines-core โญ 162
Core functionality for structured generation, formerly implemented in Outlines, with a focus on performance and portability.

300. prithivirajdamodaran/Route0x โญ 87
A production-grade query routing solution, leveraging LLMs while optimizing for cost per query

## Math and Science

Mathematical, numerical and scientific libraries.

1. numpy/numpy โญ 28,485
The fundamental package for scientific computing with Python.
๐Ÿ”— [numpy.org](https://numpy.org)

2. camdavidsonpilon/Probabilistic-Programming-and-Bayesian-Methods-for-Hackers โญ 27,079
aka "Bayesian Methods for Hackers": An introduction to Bayesian methods + probabilistic programming with a computation/understanding-first, mathematics-second point of view. All in pure Python ;)
๐Ÿ”— [camdavidsonpilon.github.io/probabilistic-programming-and-bayesian-methods-for-hackers](http://camdavidsonpilon.github.io/Probabilistic-Programming-and-Bayesian-Methods-for-Hackers/)

3. taichi-dev/taichi โญ 26,539
Productive, portable, and performant GPU programming in Python: Taichi Lang is an open-source, imperative, parallel programming language for high-performance numerical computation.
๐Ÿ”— [taichi-lang.org](https://taichi-lang.org)

4. experience-monks/math-as-code โญ 15,231
This is a reference to ease developers into mathematical notation by showing comparisons with Python code

5. scipy/scipy โญ 13,258
SciPy library main repository
๐Ÿ”— [scipy.org](https://scipy.org)

6. sympy/sympy โญ 13,179
A computer algebra system written in pure Python
๐Ÿ”— [sympy.org](https://sympy.org/)

7. google/or-tools โญ 11,459
Google Optimization Tools (a.k.a., OR-Tools) is an open-source, fast and portable software suite for solving combinatorial optimization problems.
๐Ÿ”— [developers.google.com/optimization](https://developers.google.com/optimization/)

8. z3prover/z3 โญ 10,543
Z3 is a theorem prover from Microsoft Research with a Python language binding.

9. cupy/cupy โญ 9,660
NumPy & SciPy for GPU
๐Ÿ”— [cupy.dev](https://cupy.dev)

10. google-deepmind/alphageometry โญ 4,243
Solving Olympiad Geometry without Human Demonstrations

11. pim-book/programmers-introduction-to-mathematics โญ 3,518
Code for A Programmer's Introduction to Mathematics
๐Ÿ”— [pimbook.org](https://pimbook.org)

12. mikedh/trimesh โญ 3,073
Python library for loading and using triangular meshes.
๐Ÿ”— [trimesh.org](https://trimesh.org)

13. talalalrawajfeh/mathematics-roadmap โญ 2,600
A Comprehensive Roadmap to Mathematics

14. pyro-ppl/numpyro โญ 2,359
Probabilistic programming with NumPy powered by JAX for autograd and JIT compilation to GPU/TPU/CPU.
๐Ÿ”— [num.pyro.ai](https://num.pyro.ai)

15. mckinsey/causalnex โญ 2,273
A Python library that helps data scientists to infer causation rather than observing correlation.
๐Ÿ”— [causalnex.readthedocs.io](http://causalnex.readthedocs.io/)

16. pyomo/pyomo โญ 2,078
An object-oriented algebraic modeling language in Python for structured optimization problems.
๐Ÿ”— [www.pyomo.org](https://www.pyomo.org)

17. facebookresearch/theseus โญ 1,816
A library for differentiable nonlinear optimization

18. arviz-devs/arviz โญ 1,629
Exploratory analysis of Bayesian models with Python
๐Ÿ”— [python.arviz.org](https://python.arviz.org)

19. google-research/torchsde โญ 1,595
Differentiable SDE solvers with GPU support and efficient sensitivity analysis.

20. dynamicslab/pysindy โญ 1,491
A package for the sparse identification of nonlinear dynamical systems from data
๐Ÿ”— [pysindy.readthedocs.io/en/latest](https://pysindy.readthedocs.io/en/latest/)

21. geomstats/geomstats โญ 1,281
Computations and statistics on manifolds with geometric structures.
๐Ÿ”— [geomstats.ai](http://geomstats.ai)

22. cma-es/pycma โญ 1,131
pycma is a Python implementation of CMA-ES and a few related numerical optimization tools.

23. pymc-labs/CausalPy โญ 935
A Python package for causal inference in quasi-experimental settings
๐Ÿ”— [causalpy.readthedocs.io](https://causalpy.readthedocs.io)

24. sj001/AI-Feynman โญ 651
Implementation of AI Feynman: a Physics-Inspired Method for Symbolic Regression

25. willianfuks/tfcausalimpact โญ 625
Python Causal Impact Implementation Based on Google's R Package. Built using TensorFlow Probability.

26. lean-dojo/LeanDojo โญ 596
Tool for data extraction and interacting with Lean programmatically.
๐Ÿ”— [leandojo.org](https://leandojo.org)

27. brandondube/prysm โญ 276
Prysm is an open-source library for physical and first-order modeling of optical systems and analysis of related data: numerical and physical optics, integrated modeling, phase retrieval, segmented systems, polynomials and fitting, sequential raytracing.
๐Ÿ”— [prysm.readthedocs.io/en/stable](https://prysm.readthedocs.io/en/stable/)

28. lean-dojo/ReProver โญ 239
Retrieval-Augmented Theorem Provers for Lean
๐Ÿ”— [leandojo.org](https://leandojo.org)

29. albahnsen/pycircular โญ 101
pycircular is a Python module for circular data analysis

30. gbillotey/Fractalshades โญ 28
Arbitrary-precision fractal explorer - Python package

## Machine Learning - General

General and classical machine learning libraries. See below for other sections covering specialised ML areas.

1. openai/openai-cookbook โญ 61,015
Examples and guides for using the OpenAI API
๐Ÿ”— [cookbook.openai.com](https://cookbook.openai.com)

2. scikit-learn/scikit-learn โญ 60,656
scikit-learn: machine learning in Python
๐Ÿ”— [scikit-learn.org](https://scikit-learn.org)

3. suno-ai/bark ⭐ 36,581
🔊 Text-Prompted Generative Audio Model

4. tencentarc/GFPGAN โญ 36,120
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.

5. google-research/google-research โญ 34,627
This repository contains code released by Google Research
๐Ÿ”— [research.google](https://research.google)

6. facebookresearch/faiss โญ 32,248
A library for efficient similarity search and clustering of dense vectors.
๐Ÿ”— [faiss.ai](https://faiss.ai)

7. google/jax โญ 30,921
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
๐Ÿ”— [jax.readthedocs.io](http://jax.readthedocs.io/)

8. open-mmlab/mmdetection โญ 29,979
OpenMMLab Detection Toolbox and Benchmark
๐Ÿ”— [mmdetection.readthedocs.io](https://mmdetection.readthedocs.io)

9. lutzroeder/netron โญ 29,007
Visualizer for neural network, deep learning and machine learning models
๐Ÿ”— [netron.app](https://netron.app)

10. ageron/handson-ml2 โญ 28,200
A series of Jupyter notebooks that walk you through the fundamentals of Machine Learning and Deep Learning in Python using Scikit-Learn, Keras and TensorFlow 2.

11. google/mediapipe โญ 28,160
Cross-platform, customizable ML solutions for live and streaming media.
๐Ÿ”— [ai.google.dev/edge/mediapipe](https://ai.google.dev/edge/mediapipe)

12. dmlc/xgboost โญ 26,461
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow
๐Ÿ”— [xgboost.readthedocs.io/en/stable](https://xgboost.readthedocs.io/en/stable/)

13. roboflow/supervision ⭐ 24,585
We write your reusable computer vision tools. 💜
🔗 [supervision.roboflow.com](https://supervision.roboflow.com)

14. harisiqbal88/PlotNeuralNet ⭐ 22,514
LaTeX code for drawing neural network diagrams

15. jina-ai/serve ⭐ 21,230
☁️ Build multimodal AI applications with cloud-native stack
🔗 [jina.ai/serve](https://jina.ai/serve)

16. onnx/onnx โญ 18,196
Open standard for machine learning interoperability
๐Ÿ”— [onnx.ai](https://onnx.ai/)

17. ml-explore/mlx โญ 18,189
MLX is an array framework for machine learning on Apple silicon, brought to you by Apple machine learning research.
๐Ÿ”— [ml-explore.github.io/mlx](https://ml-explore.github.io/mlx/)

18. microsoft/LightGBM โญ 16,856
A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks.
๐Ÿ”— [lightgbm.readthedocs.io/en/latest](https://lightgbm.readthedocs.io/en/latest/)

19. ddbourgin/numpy-ml โญ 15,843
Machine learning, in numpy
๐Ÿ”— [numpy-ml.readthedocs.io](https://numpy-ml.readthedocs.io/)

20. tensorflow/tensor2tensor โญ 15,710
Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.

21. microsoft/onnxruntime โญ 15,216
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
๐Ÿ”— [onnxruntime.ai](https://onnxruntime.ai)

22. aleju/imgaug โญ 14,475
Image augmentation for machine learning experiments.
๐Ÿ”— [imgaug.readthedocs.io](http://imgaug.readthedocs.io)

23. microsoft/nni ⭐ 14,095
An open source AutoML toolkit for automating the machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.
🔗 [nni.readthedocs.io](https://nni.readthedocs.io)

24. jindongwang/transferlearning ⭐ 13,621
Transfer learning / domain adaptation / domain generalization / multi-task learning etc. Papers, code, datasets, applications, tutorials (迁移学习, "transfer learning").
🔗 [transferlearning.xyz](http://transferlearning.xyz/)

25. neonbjb/tortoise-tts ⭐ 13,460
A multi-voice TTS system trained with an emphasis on quality

26. spotify/annoy ⭐ 13,393
Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk

27. deepmind/deepmind-research ⭐ 13,388
This repository contains implementations and illustrative code to accompany DeepMind publications

28. deepmind/alphafold ⭐ 13,060
Implementation of the inference pipeline of AlphaFold v2

29. facebookresearch/AnimatedDrawings ⭐ 12,083
Code to accompany "A Method for Animating Children's Drawings of the Human Figure"

30. ggerganov/ggml ⭐ 11,490
Tensor library for machine learning

31. optuna/optuna ⭐ 11,180
A hyperparameter optimization framework
🔗 [optuna.org](https://optuna.org)

32. statsmodels/statsmodels ⭐ 10,329
Statsmodels: statistical modeling and econometrics in Python
🔗 [www.statsmodels.org/devel](http://www.statsmodels.org/devel/)

33. twitter/the-algorithm-ml ⭐ 10,167
Source code for Twitter's Recommendation Algorithm
🔗 [blog.twitter.com/engineering/en_us/topics/open-source/2023/twitter-recommendation-algorithm](https://blog.twitter.com/engineering/en_us/topics/open-source/2023/twitter-recommendation-algorithm)

34. thudm/CogVideo ⭐ 10,158
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

35. google-gemini/cookbook ⭐ 10,157
A collection of guides and examples for the Gemini API, including quickstart tutorials for writing prompts.
🔗 [ai.google.dev/gemini-api/docs](https://ai.google.dev/gemini-api/docs)

36. cleanlab/cleanlab ⭐ 9,864
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
🔗 [cleanlab.ai](https://cleanlab.ai)

37. epistasislab/tpot ⭐ 9,793
A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.
🔗 [epistasislab.github.io/tpot](http://epistasislab.github.io/tpot/)

38. megvii-basedetection/YOLOX ⭐ 9,558
YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/

39. wandb/wandb ⭐ 9,342
The AI developer platform. Use Weights & Biases to train and fine-tune models, and manage models from experimentation to production.
🔗 [wandb.ai](https://wandb.ai)

40. pycaret/pycaret ⭐ 9,055
An open-source, low-code machine learning library in Python
🔗 [www.pycaret.org](https://www.pycaret.org)

41. facebookresearch/xformers ⭐ 8,866
Hackable and optimized Transformers building blocks, supporting a composable construction.
🔗 [facebookresearch.github.io/xformers](https://facebookresearch.github.io/xformers/)

42. pymc-devs/pymc ⭐ 8,814
Bayesian Modeling and Probabilistic Programming in Python
🔗 [docs.pymc.io](https://docs.pymc.io/)

43. uberi/speech_recognition ⭐ 8,517
Speech recognition module for Python, supporting several engines and APIs, online and offline.
🔗 [pypi.python.org/pypi/speechrecognition](https://pypi.python.org/pypi/SpeechRecognition/)

44. open-mmlab/mmsegmentation ⭐ 8,448
OpenMMLab Semantic Segmentation Toolbox and Benchmark.
🔗 [mmsegmentation.readthedocs.io/en/main](https://mmsegmentation.readthedocs.io/en/main/)

45. awslabs/autogluon ⭐ 8,220
Fast and Accurate ML in 3 Lines of Code
🔗 [auto.gluon.ai](https://auto.gluon.ai/)

46. catboost/catboost ⭐ 8,180
A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.
🔗 [catboost.ai](https://catboost.ai)

47. huggingface/accelerate ⭐ 8,136
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
🔗 [huggingface.co/docs/accelerate](https://huggingface.co/docs/accelerate)

48. automl/auto-sklearn ⭐ 7,675
Automated Machine Learning with scikit-learn
🔗 [automl.github.io/auto-sklearn](https://automl.github.io/auto-sklearn)

49. lmcinnes/umap ⭐ 7,552
Uniform Manifold Approximation and Projection

50. featurelabs/featuretools ⭐ 7,326
An open source python library for automated feature engineering
🔗 [www.featuretools.com](https://www.featuretools.com)

51. hyperopt/hyperopt ⭐ 7,304
Distributed Asynchronous Hyperparameter Optimization in Python
🔗 [hyperopt.github.io/hyperopt](http://hyperopt.github.io/hyperopt)

52. py-why/dowhy ⭐ 7,221
DoWhy is a Python library for causal inference that supports explicit modeling and testing of causal assumptions. DoWhy is based on a unified language for causal inference, combining causal graphical models and potential outcomes frameworks.
🔗 [www.pywhy.org/dowhy](https://www.pywhy.org/dowhy)

53. hips/autograd ⭐ 7,080
Efficiently computes derivatives of NumPy code.

54. open-mmlab/mmagic ⭐ 7,019
OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awesome model zoo, diffusion models, for text-to-image generation, image/video restoration/enhancement, etc.
🔗 [mmagic.readthedocs.io/en/latest](https://mmagic.readthedocs.io/en/latest/)

55. scikit-learn-contrib/imbalanced-learn ⭐ 6,886
A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning
🔗 [imbalanced-learn.org](https://imbalanced-learn.org)

56. probml/pyprobml ⭐ 6,616
Python code for "Probabilistic Machine learning" book by Kevin Murphy

57. ml-explore/mlx-examples ⭐ 6,492
Examples in the MLX framework

58. nicolashug/Surprise ⭐ 6,458
A Python scikit for building and analyzing recommender systems
🔗 [surpriselib.com](http://surpriselib.com)

59. google/automl ⭐ 6,293
Google Brain AutoML

60. yangchris11/samurai ⭐ 6,259
Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"
🔗 [yangchris11.github.io/samurai](https://yangchris11.github.io/samurai/)

61. cleverhans-lab/cleverhans ⭐ 6,227
An adversarial example library for constructing attacks, building defenses, and benchmarking both

62. kevinmusgrave/pytorch-metric-learning ⭐ 6,061
The easiest way to use deep metric learning in your application. Modular, flexible, and extensible. Written in PyTorch.
🔗 [kevinmusgrave.github.io/pytorch-metric-learning](https://kevinmusgrave.github.io/pytorch-metric-learning/)

63. project-monai/MONAI ⭐ 6,006
AI Toolkit for Healthcare Imaging
🔗 [monai.io](https://monai.io/)

64. open-mmlab/mmcv ⭐ 5,964
OpenMMLab Computer Vision Foundation
🔗 [mmcv.readthedocs.io/en/latest](https://mmcv.readthedocs.io/en/latest/)

65. google-deepmind/graphcast ⭐ 5,594
GraphCast: Learning skillful medium-range global weather forecasting

66. uber/causalml ⭐ 5,171
Uplift modeling and causal inference with machine learning algorithms

67. online-ml/river ⭐ 5,150
🌊 Online machine learning in Python
🔗 [riverml.xyz](https://riverml.xyz)

68. mdbloice/Augmentor ⭐ 5,090
Image augmentation library in Python for machine learning.
🔗 [augmentor.readthedocs.io/en/stable](https://augmentor.readthedocs.io/en/stable)

69. rasbt/mlxtend ⭐ 4,936
A library of extension and helper modules for Python's data analysis and machine learning libraries.
🔗 [rasbt.github.io/mlxtend](https://rasbt.github.io/mlxtend/)

70. marqo-ai/marqo ⭐ 4,709
Unified embedding generation and search engine. Also available on cloud - cloud.marqo.ai
🔗 [www.marqo.ai](https://www.marqo.ai/)

71. skvark/opencv-python ⭐ 4,639
Automated CI toolchain to produce precompiled opencv-python, opencv-python-headless, opencv-contrib-python and opencv-contrib-python-headless packages.
🔗 [pypi.org/project/opencv-python](https://pypi.org/project/opencv-python/)

72. apple/coremltools ⭐ 4,507
Core ML tools contain supporting tools for Core ML model conversion, editing, and validation.
🔗 [coremltools.readme.io](https://coremltools.readme.io)

73. sanchit-gandhi/whisper-jax ⭐ 4,495
JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.

74. nmslib/hnswlib ⭐ 4,475
Header-only C++/python library for fast approximate nearest neighbors
🔗 [github.com/nmslib/hnswlib](https://github.com/nmslib/hnswlib)

75. lucidrains/deep-daze ⭐ 4,376
Simple command line tool for text to image generation using OpenAI's CLIP and Siren (Implicit neural representation network). Technique was originally created by https://twitter.com/advadnoun

76. districtdatalabs/yellowbrick ⭐ 4,303
Visual analysis and diagnostic tools to facilitate machine learning model selection.
🔗 [www.scikit-yb.org](http://www.scikit-yb.org/)

77. nv-tlabs/GET3D ⭐ 4,272
Generative Model of High Quality 3D Textured Shapes Learned from Images

78. huggingface/autotrain-advanced ⭐ 4,180
AutoTrain Advanced: faster and easier training and deployments of state-of-the-art machine learning models
🔗 [huggingface.co/autotrain](https://huggingface.co/autotrain)

79. microsoft/FLAML ⭐ 4,006
A fast library for AutoML and tuning. Join our Discord: https://discord.gg/Cppx2vSPVP.
🔗 [microsoft.github.io/flaml](https://microsoft.github.io/FLAML/)

80. cmusphinx/pocketsphinx ⭐ 3,989
A small speech recognizer

81. ourownstory/neural_prophet ⭐ 3,931
NeuralProphet: A simple forecasting package
🔗 [neuralprophet.com](https://neuralprophet.com)

82. py-why/EconML ⭐ 3,922
ALICE (Automated Learning and Intelligence for Causation and Economics) is a Microsoft Research project aimed at applying Artificial Intelligence concepts to economic decision making. One of its goals is to build a toolkit that combines state-of-the-art machine learning techniques with econometrics in order to brin...
🔗 [www.microsoft.com/en-us/research/project/alice](https://www.microsoft.com/en-us/research/project/alice/)

83. huggingface/notebooks ⭐ 3,773
Notebooks using the Hugging Face libraries 🤗

84. zjunlp/DeepKE ⭐ 3,668
[EMNLP 2022] An Open Toolkit for Knowledge Graph Extraction and Construction
🔗 [deepke.zjukg.cn](http://deepke.zjukg.cn/)

85. huggingface/speech-to-speech ⭐ 3,661
Speech To Speech: an effort for an open-sourced and modular GPT-4o

86. rucaibox/RecBole ⭐ 3,537
A unified, comprehensive and efficient recommendation library
🔗 [recbole.io](https://recbole.io/)

87. yoheinakajima/instagraph ⭐ 3,489
Converts text input or URL into knowledge graph and displays

88. facebookresearch/vissl ⭐ 3,263
VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images.
🔗 [vissl.ai](https://vissl.ai)

89. pytorch/glow ⭐ 3,249
Compiler for Neural Network hardware accelerators

90. lightly-ai/lightly ⭐ 3,239
A python library for self-supervised learning on images.
🔗 [docs.lightly.ai/self-supervised-learning](https://docs.lightly.ai/self-supervised-learning/)

91. lucidrains/musiclm-pytorch ⭐ 3,205
Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in Pytorch

92. hrnet/HRNet-Semantic-Segmentation ⭐ 3,183
The OCR approach is rephrased as Segmentation Transformer: https://arxiv.org/abs/1909.11065. This is an official implementation of semantic segmentation for HRNet. https://arxiv.org/abs/1908.07919

93. mljar/mljar-supervised ⭐ 3,085
Python package for AutoML on Tabular Data with Feature Engineering, Hyper-Parameters Tuning, Explanations and Automatic Documentation
🔗 [mljar.com](https://mljar.com)

94. shankarpandala/lazypredict ⭐ 3,047
Lazy Predict helps build a lot of basic models without much code and helps understand which models work better without any parameter tuning

95. huggingface/safetensors ⭐ 2,983
Implements a new simple format for storing tensors safely (as opposed to pickle) and that is still fast (zero-copy).
🔗 [huggingface.co/docs/safetensors](https://huggingface.co/docs/safetensors)

96. scikit-learn-contrib/hdbscan ⭐ 2,834
A high performance implementation of HDBSCAN clustering.
🔗 [hdbscan.readthedocs.io/en/latest](http://hdbscan.readthedocs.io/en/latest/)

97. scikit-optimize/scikit-optimize ⭐ 2,751
Sequential model-based optimization with a `scipy.optimize` interface
🔗 [scikit-optimize.github.io](https://scikit-optimize.github.io)

98. google-research/t5x ⭐ 2,719
T5X is a modular, composable, research-friendly framework for high-performance, configurable, self-service training, evaluation, and inference of sequence models (starting with language) at many scales.

99. huggingface/optimum ⭐ 2,650
🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimization tools
🔗 [huggingface.co/docs/optimum/main](https://huggingface.co/docs/optimum/main/)

100. apple/ml-ane-transformers ⭐ 2,578
Reference implementation of the Transformer architecture optimized for Apple Neural Engine (ANE)

101. freedmand/semantra ⭐ 2,537
Semantra is a multipurpose tool for semantically searching documents. Query by meaning rather than just by matching text.

102. rom1504/clip-retrieval ⭐ 2,464
Easily compute clip embeddings and build a clip retrieval system with them
🔗 [rom1504.github.io/clip-retrieval](https://rom1504.github.io/clip-retrieval/)

103. scikit-learn-contrib/category_encoders ⭐ 2,416
A library of sklearn compatible categorical variable encoders
🔗 [contrib.scikit-learn.org/category_encoders](http://contrib.scikit-learn.org/category_encoders/)

104. neuraloperator/neuraloperator ⭐ 2,354
Comprehensive library for learning neural operators in PyTorch. It is the official implementation for Fourier Neural Operators and Tensorized Neural Operators.
🔗 [neuraloperator.github.io/dev/index.html](https://neuraloperator.github.io/dev/index.html)

105. eric-mitchell/direct-preference-optimization ⭐ 2,305
Reference implementation for DPO (Direct Preference Optimization)

106. huggingface/huggingface_hub ⭐ 2,208
The official Python client for the Hugging Face Hub.
🔗 [huggingface.co/docs/huggingface_hub](https://huggingface.co/docs/huggingface_hub)

107. aws/sagemaker-python-sdk ⭐ 2,119
A library for training and deploying machine learning models on Amazon SageMaker
🔗 [sagemaker.readthedocs.io](https://sagemaker.readthedocs.io/)

108. huggingface/evaluate ⭐ 2,081
🤗 Evaluate: A library for easily evaluating machine learning models and datasets.
🔗 [huggingface.co/docs/evaluate](https://huggingface.co/docs/evaluate)

109. contextlab/hypertools ⭐ 1,829
A Python toolbox for gaining geometric insights into high-dimensional data
🔗 [hypertools.readthedocs.io/en/latest](http://hypertools.readthedocs.io/en/latest/)

110. rentruewang/koila ⭐ 1,823
Prevent PyTorch's `CUDA error: out of memory` in just 1 line of code.
🔗 [koila.rentruewang.com](https://koila.rentruewang.com)

111. linkedin/greykite ⭐ 1,815
A flexible, intuitive and fast forecasting library

112. bmabey/pyLDAvis ⭐ 1,812
Python library for interactive topic model visualization. Port of the R LDAvis package.

113. scikit-learn-contrib/lightning ⭐ 1,728
Large-scale linear classification, regression and ranking in Python
🔗 [contrib.scikit-learn.org/lightning](https://contrib.scikit-learn.org/lightning/)

114. castorini/pyserini ⭐ 1,714
Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.
🔗 [pyserini.io](http://pyserini.io/)

115. tensorflow/addons ⭐ 1,691
Useful extra functionality for TensorFlow 2.x maintained by SIG-addons

116. microsoft/i-Code ⭐ 1,682
The ambition of the i-Code project is to build integrative and composable multimodal AI. The "i" stands for integrative multimodal learning.

117. microsoft/Olive ⭐ 1,681
Olive: Simplify ML Model Finetuning, Conversion, Quantization, and Optimization for CPUs, GPUs and NPUs.
🔗 [microsoft.github.io/olive](https://microsoft.github.io/Olive/)

118. visual-layer/fastdup ⭐ 1,643
fastdup is a powerful, free tool designed to rapidly generate valuable insights from image and video datasets. It helps enhance the quality of both images and labels, while significantly reducing data operation costs, all with unmatched scalability.

119. qdrant/fastembed ⭐ 1,641
Fast, Accurate, Lightweight Python library to make State of the Art Embedding
🔗 [qdrant.github.io/fastembed](https://qdrant.github.io/fastembed/)

120. laekov/fastmoe ⭐ 1,587
A fast MoE impl for PyTorch
🔗 [fastmoe.ai](https://fastmoe.ai)

121. kubeflow/katib ⭐ 1,529
Automated Machine Learning on Kubernetes
🔗 [www.kubeflow.org/docs/components/katib](https://www.kubeflow.org/docs/components/katib)

122. google/vizier ⭐ 1,517
Python-based research interface for blackbox and hyperparameter optimization, based on the internal Google Vizier Service.
🔗 [oss-vizier.readthedocs.io](https://oss-vizier.readthedocs.io)

123. jina-ai/finetuner ⭐ 1,486
🎯 Task-oriented embedding tuning for BERT, CLIP, etc.
🔗 [finetuner.jina.ai](https://finetuner.jina.ai)

124. csinva/imodels ⭐ 1,413
Interpretable ML package 🔍 for concise, transparent, and accurate predictive modeling (sklearn-compatible).
🔗 [csinva.io/imodels](https://csinva.io/imodels)

125. microsoft/Semi-supervised-learning ⭐ 1,398
A Unified Semi-Supervised Learning Codebase (NeurIPS'22)
🔗 [usb.readthedocs.io](https://usb.readthedocs.io)

126. patchy631/machine-learning ⭐ 1,395
Machine Learning Tutorials Repository

127. spotify/voyager ⭐ 1,365
🛰️ An approximate nearest-neighbor search library for Python and Java with a focus on ease of use, simplicity, and deployability.
🔗 [spotify.github.io/voyager](https://spotify.github.io/voyager/)

128. borealisai/advertorch ⭐ 1,317
A Toolbox for Adversarial Robustness Research

129. koaning/scikit-lego ⭐ 1,292
Extra blocks for scikit-learn pipelines.
🔗 [koaning.github.io/scikit-lego](https://koaning.github.io/scikit-lego/)

130. awslabs/dgl-ke ⭐ 1,282
High performance, easy-to-use, and scalable package for learning large-scale knowledge graph embeddings.
🔗 [dglke.dgl.ai/doc](https://dglke.dgl.ai/doc/)

131. priorlabs/TabPFN ⭐ 1,261
The TabPFN is a neural network that learned to do tabular data prediction. This is the original CUDA-supporting pytorch implementation.
🔗 [priorlabs.ai](http://priorlabs.ai)

132. lightning-ai/lightning-thunder ⭐ 1,258
Thunder is a source-to-source compiler for PyTorch. It makes PyTorch programs faster by combining and using different hardware executors at once

133. pytorch/FBGEMM ⭐ 1,233
FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/

134. nvidia/cuda-python ⭐ 1,034
CUDA Python: Performance meets Productivity
🔗 [nvidia.github.io/cuda-python](https://nvidia.github.io/cuda-python/)

135. davidmrau/mixture-of-experts ⭐ 1,014
PyTorch Re-Implementation of "The Sparsely-Gated Mixture-of-Experts Layer" by Noam Shazeer et al. https://arxiv.org/abs/1701.06538

136. google-research/deeplab2 ⭐ 1,007
DeepLab2 is a TensorFlow library for deep labeling, aiming to provide a unified and state-of-the-art TensorFlow codebase for dense pixel labeling tasks.

137. opentensor/bittensor ⭐ 966
Internet-scale Neural Networks
🔗 [www.bittensor.com](https://www.bittensor.com/)

138. lmcinnes/pynndescent ⭐ 908
A Python nearest neighbor descent for approximate nearest neighbors

139. oml-team/open-metric-learning ⭐ 900
OML is a PyTorch-based framework to train and validate the models producing high-quality embeddings.
🔗 [open-metric-learning.readthedocs.io/en/latest/index.html](https://open-metric-learning.readthedocs.io/en/latest/index.html)

140. hazyresearch/safari ⭐ 873
Convolutions for Sequence Modeling

141. huggingface/optimum-quanto ⭐ 859
A pytorch quantization backend for optimum

142. criteo/autofaiss ⭐ 824
Automatically create Faiss knn indices with the most optimal similarity search parameters.
🔗 [criteo.github.io/autofaiss](https://criteo.github.io/autofaiss/)

143. replicate/replicate-python ⭐ 785
Python client for Replicate
🔗 [replicate.com](https://replicate.com)

144. pymc-labs/pymc-marketing ⭐ 741
Bayesian marketing toolbox in PyMC. Media Mix (MMM), customer lifetime value (CLV), buy-till-you-die (BTYD) models and more.
🔗 [www.pymc-marketing.io](https://www.pymc-marketing.io/)

145. awslabs/python-deequ ⭐ 736
Python API for Deequ, a library built on Spark for defining "unit tests for data", which measure data quality in large datasets

146. facebookresearch/balance ⭐ 691
The balance python package offers a simple workflow and methods for dealing with biased data samples when looking to infer from them to some target population of interest.
🔗 [import-balance.org](https://import-balance.org)

147. nicolas-hbt/pygraft ⭐ 676
Configurable Generation of Synthetic Schemas and Knowledge Graphs at Your Fingertips
🔗 [pygraft.readthedocs.io/en/latest](https://pygraft.readthedocs.io/en/latest/)

148. googleapis/python-aiplatform ⭐ 672
A Python SDK for Vertex AI, a fully managed, end-to-end platform for data science and machine learning.

149. qdrant/quaterion ⭐ 645
Blazing fast framework for fine-tuning similarity learning models
🔗 [quaterion.qdrant.tech](https://quaterion.qdrant.tech/)

150. hpcaitech/EnergonAI ⭐ 629
Large-scale model inference.

151. huggingface/exporters ⭐ 629
Export Hugging Face models to Core ML and TensorFlow Lite

152. intel/intel-npu-acceleration-library ⭐ 565
The Intel NPU Acceleration Library is a Python library designed to boost the efficiency of your applications by leveraging the power of the Intel Neural Processing Unit (NPU) to perform high-speed computations on compatible hardware.

153. nomic-ai/contrastors ⭐ 559
Contrastive learning toolkit that enables researchers and engineers to train and evaluate contrastive models efficiently.

154. intellabs/bayesian-torch ⭐ 559
A library for Bayesian neural network layers and uncertainty estimation in Deep Learning extending the core of PyTorch

155. microsoft/Focal-Transformer ⭐ 548
[NeurIPS 2021 Spotlight] Official code for "Focal Self-attention for Local-Global Interactions in Vision Transformers"

156. mrdbourke/m1-machine-learning-test ⭐ 531
Code for testing various M1 Chip benchmarks with TensorFlow.

157. linkedin/FastTreeSHAP ⭐ 527
Fast SHAP value computation for interpreting tree-based models

158. nevronai/MetisFL ⭐ 527
The first open Federated Learning framework implemented in C++ and Python.
🔗 [metisfl.org](https://metisfl.org)

159. dylanhogg/gptauthor ⭐ 67
GPTAuthor is an AI tool for writing long form, multi-chapter stories given a story prompt.

## Machine Learning - Deep Learning

Machine learning libraries that cross over with deep learning in some way.

1. tensorflow/tensorflow ⭐ 187,148
An Open Source Machine Learning Framework for Everyone
🔗 [tensorflow.org](https://tensorflow.org)

2. pytorch/pytorch ⭐ 85,611
Tensors and Dynamic neural networks in Python with strong GPU acceleration
🔗 [pytorch.org](https://pytorch.org)

3. openai/whisper ⭐ 73,917
Robust Speech Recognition via Large-Scale Weak Supervision

4. keras-team/keras ⭐ 62,330
Deep Learning for humans
🔗 [keras.io](http://keras.io/)

5. deepfakes/faceswap ⭐ 52,815
Deepfakes Software For All
🔗 [www.faceswap.dev](https://www.faceswap.dev)

6. facebookresearch/segment-anything ⭐ 48,419
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

7. microsoft/DeepSpeed ⭐ 36,136
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
🔗 [www.deepspeed.ai](https://www.deepspeed.ai/)

8. rwightman/pytorch-image-models ⭐ 32,820
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
🔗 [huggingface.co/docs/timm](https://huggingface.co/docs/timm)

9. facebookresearch/detectron2 ⭐ 30,954
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
🔗 [detectron2.readthedocs.io/en/latest](https://detectron2.readthedocs.io/en/latest/)

10. xinntao/Real-ESRGAN ⭐ 29,116
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.

11. lightning-ai/pytorch-lightning ⭐ 28,750
The deep learning framework to pretrain, finetune and deploy AI models. PyTorch Lightning is just organized PyTorch - Lightning disentangles PyTorch code to decouple the science from the engineering.
🔗 [lightning.ai](https://lightning.ai)

12. google-research/tuning_playbook ⭐ 27,711
A playbook for systematically maximizing the performance of deep learning models.

13. openai/CLIP ⭐ 26,828
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

14. facebookresearch/Detectron ⭐ 26,290
FAIR's research platform for object detection research, implementing popular algorithms like Mask R-CNN and RetinaNet.

15. matterport/Mask_RCNN ⭐ 24,816
Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow

16. paddlepaddle/Paddle ⭐ 22,394
PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (core framework of PaddlePaddle, "飞桨": high-performance single-machine and distributed training and cross-platform deployment for deep learning and machine learning)
🔗 [www.paddlepaddle.org](http://www.paddlepaddle.org/)

17. pyg-team/pytorch_geometric ⭐ 21,713
Graph Neural Network Library for PyTorch
🔗 [pyg.org](https://pyg.org)

18. lucidrains/vit-pytorch ⭐ 21,339
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

19. apache/mxnet ⭐ 20,789
Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more
🔗 [mxnet.apache.org](https://mxnet.apache.org)

20. sanster/IOPaint ⭐ 20,109
Image inpainting tool powered by SOTA AI models. Remove any unwanted object, defect, or people from your pictures, or erase and replace (powered by Stable Diffusion) anything in your pictures.
🔗 [www.iopaint.com](https://www.iopaint.com/)

21. danielgatis/rembg ⭐ 17,565
Rembg is a tool to remove image backgrounds

22. rasbt/deeplearning-models ⭐ 16,824
A collection of various deep learning architectures, models, and tips

23. albumentations-team/albumentations ⭐ 14,453
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
🔗 [albumentations.ai](https://albumentations.ai)

24. microsoft/Swin-Transformer ⭐ 14,158
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
🔗 [arxiv.org/abs/2103.14030](https://arxiv.org/abs/2103.14030)

25. facebookresearch/detr ⭐ 13,838
End-to-End Object Detection with Transformers

26. nvidia/DeepLearningExamples ⭐ 13,746
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.

27. dmlc/dgl ⭐ 13,631
Python package built to ease deep learning on graphs, on top of existing DL frameworks.
🔗 [dgl.ai](http://dgl.ai)

28. mlfoundations/open_clip ⭐ 10,740
Open source implementation of OpenAI's CLIP (Contrastive Language-Image Pre-training).

29. kornia/kornia ⭐ 10,122
🐍 Geometric Computer Vision Library for Spatial AI
🔗 [kornia.readthedocs.io](https://kornia.readthedocs.io)

30. modelscope/facechain ⭐ 9,210
FaceChain is a deep-learning toolchain for generating your Digital-Twin.

31. keras-team/autokeras ⭐ 9,176
AutoML library for deep learning
🔗 [autokeras.com](http://autokeras.com/)

32. facebookresearch/pytorch3d ⭐ 8,954
PyTorch3D is FAIR's library of reusable components for deep learning with 3D data
🔗 [pytorch3d.org](https://pytorch3d.org/)

33. arogozhnikov/einops ⭐ 8,621
Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)
🔗 [einops.rocks](https://einops.rocks)

34. pyro-ppl/pyro ⭐ 8,618
Deep universal probabilistic programming with Python and PyTorch
🔗 [pyro.ai](http://pyro.ai)

35. nvidia/apex ⭐ 8,493
A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch

36. facebookresearch/ImageBind ⭐ 8,455
ImageBind One Embedding Space to Bind Them All

37. lucidrains/imagen-pytorch ⭐ 8,145
Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch

38. google/trax ⭐ 8,127
Trax: Deep Learning with Clear Code and Speed

39. tencent/HunyuanVideo ⭐ 7,111
HunyuanVideo: A Systematic Framework For Large Video Generation Model
🔗 [aivideo.hunyuan.tencent.com](https://aivideo.hunyuan.tencent.com)

40. xpixelgroup/BasicSR ⭐ 7,055
Open Source Image and Video Restoration Toolbox for Super-resolution, Denoise, Deblurring, etc. Currently, it includes EDSR, RCAN, SRResNet, SRGAN, ESRGAN, EDVR, BasicVSR, SwinIR, ECBSR, etc. Also supports StyleGAN2 and DFDNet.
🔗 [basicsr.readthedocs.io/en/latest](https://basicsr.readthedocs.io/en/latest/)

41. bytedance/monolith ⭐ 6,656
A deep learning framework for large scale recommendation modeling with collisionless embedding and real time training captures.

42. google/flax ⭐ 6,239
Flax is a neural network library for JAX that is designed for flexibility.
🔗 [flax.readthedocs.io](https://flax.readthedocs.io)

43. skorch-dev/skorch ⭐ 5,928
A scikit-learn compatible neural network library that wraps PyTorch

44. facebookresearch/mmf ⭐ 5,519
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
🔗 [mmf.sh](https://mmf.sh/)

45. mosaicml/composer ⭐ 5,224
Supercharge Your Model Training
🔗 [docs.mosaicml.com](http://docs.mosaicml.com)

46. deci-ai/super-gradients ⭐ 4,641
Easily train or fine-tune SOTA computer vision models with one open source training library. The home of Yolo-NAS.
🔗 [www.supergradients.com](https://www.supergradients.com)

47. facebookincubator/AITemplate ⭐ 4,585
AITemplate is a Python framework which renders neural networks into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.

48. nvidiagameworks/kaolin โญ 4,570
A PyTorch Library for Accelerating 3D Deep Learning Research

49. pytorch/ignite โญ 4,566
High-level library to help with training and evaluating neural networks in PyTorch flexibly and transparently.
๐Ÿ”— [pytorch-ignite.ai](https://pytorch-ignite.ai)

50. williamyang1991/VToonify โญ 3,554
[SIGGRAPH Asia 2022] VToonify: Controllable High-Resolution Portrait Video Style Transfer

51. cvg/LightGlue โญ 3,524
LightGlue: Local Feature Matching at Light Speed (ICCV 2023)

52. facebookresearch/PyTorch-BigGraph โญ 3,389
Generate embeddings from large-scale graph-structured data.
๐Ÿ”— [torchbiggraph.readthedocs.io](https://torchbiggraph.readthedocs.io/)

53. google-research/scenic โญ 3,377
Scenic: A Jax Library for Computer Vision Research and Beyond

54. pytorch/botorch โญ 3,139
Bayesian optimization in PyTorch
๐Ÿ”— [botorch.org](https://botorch.org/)

55. alpa-projects/alpa โญ 3,093
Training and serving large-scale neural networks with auto parallelization.
๐Ÿ”— [alpa.ai](https://alpa.ai)

56. deepmind/dm-haiku โญ 2,931
JAX-based neural network library
๐Ÿ”— [dm-haiku.readthedocs.io](https://dm-haiku.readthedocs.io)

57. explosion/thinc โญ 2,829
๐Ÿ”ฎ A refreshing functional take on deep learning, compatible with your favorite libraries
๐Ÿ”— [thinc.ai](https://thinc.ai)

58. nerdyrodent/VQGAN-CLIP โญ 2,630
Just playing with getting VQGAN+CLIP running locally, rather than having to use colab.

59. danielegrattarola/spektral โญ 2,373
Graph Neural Networks with Keras and Tensorflow 2.
๐Ÿ”— [graphneural.network](https://graphneural.network)

60. google-research/electra ⭐ 2,343
ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators

61. fepegar/torchio ⭐ 2,105
Medical imaging toolkit for deep learning
🔗 [torchio.org](https://torchio.org)

62. neuralmagic/sparseml ⭐ 2,089
Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models

63. pytorch/torchrec ⭐ 1,998
PyTorch domain library for recommendation systems
🔗 [pytorch.org/torchrec](https://pytorch.org/torchrec/)

64. modelscope/ClearerVoice-Studio ⭐ 1,934
An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

65. tensorflow/mesh ⭐ 1,598
Mesh TensorFlow: Model Parallelism Made Easier

66. tensorly/tensorly ⭐ 1,585
TensorLy: Tensor Learning in Python.
🔗 [tensorly.org](http://tensorly.org)

67. vt-vl-lab/FGVC ⭐ 1,555
[ECCV 2020] Flow-edge Guided Video Completion

68. calculatedcontent/WeightWatcher ⭐ 1,493
The WeightWatcher tool for predicting the accuracy of Deep Neural Networks

69. jeshraghian/snntorch ⭐ 1,408
Deep and online learning with spiking neural networks in Python
🔗 [snntorch.readthedocs.io/en/latest](https://snntorch.readthedocs.io/en/latest/)

70. hysts/pytorch_image_classification ⭐ 1,375
PyTorch implementation of image classification models for CIFAR-10/CIFAR-100/MNIST/FashionMNIST/Kuzushiji-MNIST/ImageNet

71. xl0/lovely-tensors ⭐ 1,172
Tensors, for human consumption
🔗 [xl0.github.io/lovely-tensors](https://xl0.github.io/lovely-tensors)

72. deepmind/android_env ⭐ 1,033
RL research on Android devices.

73. keras-team/keras-cv ⭐ 1,015
Industry-strength Computer Vision workflows with Keras

74. tensorflow/similarity ⭐ 1,014
TensorFlow Similarity is a python package focused on making similarity learning quick and easy.

75. kakaobrain/rq-vae-transformer ⭐ 819
The official implementation of Autoregressive Image Generation using Residual Quantization (CVPR '22)

76. deepmind/chex ⭐ 804
Chex is a library of utilities for helping to write reliable JAX code
🔗 [chex.readthedocs.io](https://chex.readthedocs.io)

77. mlfoundations/datacomp ⭐ 670
DataComp: In search of the next generation of multimodal datasets
🔗 [datacomp.ai](http://datacomp.ai/)

78. whitead/dmol-book ⭐ 629
Deep learning for molecules and materials book
🔗 [dmol.pub](https://dmol.pub)

79. allenai/reward-bench ⭐ 479
RewardBench is a benchmark designed to evaluate the capabilities and safety of reward models (including those trained with Direct Preference Optimization, DPO)
🔗 [huggingface.co/spaces/allenai/reward-bench](https://huggingface.co/spaces/allenai/reward-bench)

## Machine Learning - Interpretability

Machine learning interpretability libraries. Covers explainability, prediction explanations, dashboards, and understanding how knowledge develops during training.

1. slundberg/shap ⭐ 23,184
A game theoretic approach to explain the output of any machine learning model.
🔗 [shap.readthedocs.io](https://shap.readthedocs.io)

2. marcotcr/lime ⭐ 11,691
Lime: Explaining the predictions of any machine learning classifier

3. interpretml/interpret ⭐ 6,345
Fit interpretable models. Explain blackbox machine learning.
🔗 [interpret.ml/docs](https://interpret.ml/docs)

4. pytorch/captum ⭐ 5,017
Model interpretability and understanding for PyTorch
🔗 [captum.ai](https://captum.ai)

5. tensorflow/lucid ⭐ 4,682
A collection of infrastructure and tools for research in neural network interpretability.

6. arize-ai/phoenix ⭐ 4,399
AI Observability & Evaluation
🔗 [docs.arize.com/phoenix](https://docs.arize.com/phoenix)

7. pair-code/lit ⭐ 3,506
The Learning Interpretability Tool: Interactively analyze ML models to understand their behavior in an extensible and framework agnostic interface.
🔗 [pair-code.github.io/lit](https://pair-code.github.io/lit)

8. teamhg-memex/eli5 ⭐ 2,762
A library for debugging/inspecting machine learning classifiers and explaining their predictions
🔗 [eli5.readthedocs.io](http://eli5.readthedocs.io)

9. maif/shapash ⭐ 2,761
🔅 Shapash: User-friendly Explainability and Interpretability to Develop Reliable and Transparent Machine Learning Models
🔗 [maif.github.io/shapash](https://maif.github.io/shapash/)

10. seldonio/alibi ⭐ 2,424
Algorithms for explaining machine learning models
🔗 [docs.seldon.io/projects/alibi/en/stable](https://docs.seldon.io/projects/alibi/en/stable/)

11. oegedijk/explainerdashboard ⭐ 2,344
Quickly build Explainable AI dashboards that show the inner workings of so-called "blackbox" machine learning models.
🔗 [explainerdashboard.readthedocs.io](http://explainerdashboard.readthedocs.io)

12. eleutherai/pythia ⭐ 2,331
Interpretability analysis and scaling laws to understand how knowledge develops and evolves during training in autoregressive transformers

13. jalammar/ecco ⭐ 1,992
Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the behavior of Transformer-based language models (like GPT2, BERT, RoBERTa, T5, and T0).
🔗 [ecco.readthedocs.io](https://ecco.readthedocs.io)

14. transformerlensorg/TransformerLens ⭐ 1,716
A library for mechanistic interpretability of GPT-style language models
🔗 [transformerlensorg.github.io/transformerlens](https://transformerlensorg.github.io/TransformerLens/)

15. google-deepmind/penzai ⭐ 1,708
A JAX library for writing models as legible, functional pytree data structures, along with tools for visualizing, modifying, and analyzing them. Penzai focuses on making it easy to do stuff with models after they have been trained
🔗 [penzai.readthedocs.io](https://penzai.readthedocs.io/)

16. trusted-ai/AIX360 ⭐ 1,651
Interpretability and explainability of data and machine learning models
🔗 [aix360.res.ibm.com](https://aix360.res.ibm.com/)

17. cdpierse/transformers-interpret ⭐ 1,311
Model explainability that works seamlessly with 🤗 transformers. Explain your transformers model in just 2 lines of code.

18. selfexplainml/PiML-Toolbox ⭐ 1,227
PiML (Python Interpretable Machine Learning) toolbox for model development & diagnostics
🔗 [selfexplainml.github.io/piml-toolbox](https://selfexplainml.github.io/PiML-Toolbox)

19. ethicalml/xai ⭐ 1,143
XAI is a Machine Learning library that is designed with AI explainability at its core. XAI contains various tools that enable analysis and evaluation of data and models
🔗 [ethical.institute/principles.html#commitment-3](https://ethical.institute/principles.html#commitment-3)

20. salesforce/OmniXAI ⭐ 887
OmniXAI: A Library for eXplainable AI

21. andyzoujm/representation-engineering ⭐ 762
Representation Engineering: A Top-Down Approach to AI Transparency
🔗 [www.ai-transparency.org](https://www.ai-transparency.org/)

22. jbloomaus/SAELens ⭐ 558
Training Sparse Autoencoders on LLMs. Analyse sparse autoencoders and neural network internals.
🔗 [jbloomaus.github.io/saelens](https://jbloomaus.github.io/SAELens/)

## Machine Learning - Ops

MLOps tools, frameworks and libraries: intersection of machine learning, data engineering and DevOps; deployment, health, diagnostics and governance of ML models.

1. apache/airflow ⭐ 38,192
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
🔗 [airflow.apache.org](https://airflow.apache.org/)

2. ray-project/ray ⭐ 34,768
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
🔗 [ray.io](https://ray.io)

3. mlflow/mlflow ⭐ 19,164
Open source platform for the machine learning lifecycle
🔗 [mlflow.org](https://mlflow.org)

4. spotify/luigi ⭐ 18,004
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.

5. prefecthq/prefect ⭐ 17,944
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
🔗 [prefect.io](https://prefect.io)

6. kestra-io/kestra ⭐ 15,127
⚡ Workflow Automation Platform. Orchestrate & Schedule code in any language, run anywhere, 500+ plugins. Alternative to Zapier, Rundeck, Camunda, Airflow...
🔗 [kestra.io](https://kestra.io)

7. horovod/horovod ⭐ 14,330
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
🔗 [horovod.ai](http://horovod.ai)

8. iterative/dvc ⭐ 14,058
🦉 Data Versioning and ML Experiments
🔗 [dvc.org](https://dvc.org)

9. dagster-io/dagster ⭐ 12,208
An orchestration platform for the development, production, and observation of data assets.
🔗 [dagster.io](https://dagster.io)

10. ludwig-ai/ludwig ⭐ 11,267
Low-code framework for building custom LLMs, neural networks, and other AI models
🔗 [ludwig.ai](http://ludwig.ai)

11. bentoml/OpenLLM ⭐ 10,351
Run any open-source LLM, such as Llama or Mistral, as an OpenAI-compatible API endpoint in the cloud.
🔗 [bentoml.com](https://bentoml.com)

12. dbt-labs/dbt-core ⭐ 10,174
dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
🔗 [getdbt.com](https://getdbt.com)

13. great-expectations/great_expectations ⭐ 10,092
Always know what to expect from your data.
🔗 [docs.greatexpectations.io](https://docs.greatexpectations.io/)

14. kedro-org/kedro ⭐ 10,079
Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable, and modular.
🔗 [kedro.org](https://kedro.org)

15. huggingface/text-generation-inference ⭐ 9,563
A Rust, Python and gRPC server for text generation inference. Used in production at HuggingFace to power Hugging Chat, the Inference API and Inference Endpoint.
🔗 [hf.co/docs/text-generation-inference](http://hf.co/docs/text-generation-inference)

16. netflix/metaflow ⭐ 8,405
Open Source AI/ML Platform
🔗 [metaflow.org](https://metaflow.org)

17. activeloopai/deeplake ⭐ 8,287
Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai
🔗 [activeloop.ai](https://activeloop.ai)

18. mage-ai/mage-ai ⭐ 8,066
🧙 Build, run, and manage data pipelines for integrating and transforming data.
🔗 [www.mage.ai](https://www.mage.ai)

19. langfuse/langfuse ⭐ 7,636
🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with LlamaIndex, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23
🔗 [langfuse.com/docs](https://langfuse.com/docs)

20. bentoml/BentoML ⭐ 7,258
The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!
🔗 [bentoml.com](https://bentoml.com)

21. flyteorg/flyte ⭐ 5,890
Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.
🔗 [flyte.org](https://flyte.org)

22. allegroai/clearml ⭐ 5,770
ClearML - Auto-Magical CI/CD to streamline your AI workload. Experiment Management, Data Management, Pipeline, Orchestration, Scheduling & Serving in one MLOps/LLMOps solution
🔗 [clear.ml/docs](https://clear.ml/docs)

23. feast-dev/feast ⭐ 5,706
The Open Source Feature Store for Machine Learning
🔗 [feast.dev](https://feast.dev)

24. evidentlyai/evidently ⭐ 5,585
Evidently is an open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. From tabular data to Gen AI. 100+ metrics.
🔗 [discord.gg/xzjkranp8b](https://discord.gg/xZjKRaNp8b)

25. aimhubio/aim ⭐ 5,294
Aim 💫 - An easy-to-use & supercharged open-source experiment tracker.
🔗 [aimstack.io](https://aimstack.io)

26. adap/flower ⭐ 5,290
Flower: A Friendly Federated AI Framework
🔗 [flower.ai](https://flower.ai)

27. internlm/lmdeploy ⭐ 5,104
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
🔗 [lmdeploy.readthedocs.io/en/latest](https://lmdeploy.readthedocs.io/en/latest/)

28. zenml-io/zenml ⭐ 4,300
ZenML 🙏: The bridge between ML and Ops. https://zenml.io.
🔗 [zenml.io](https://zenml.io)

29. internlm/xtuner ⭐ 4,126
An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
🔗 [xtuner.readthedocs.io/zh-cn/latest](https://xtuner.readthedocs.io/zh-cn/latest/)

30. orchest/orchest ⭐ 4,096
Build data pipelines, the easy way 🛠️
🔗 [orchest.readthedocs.io/en/stable](https://orchest.readthedocs.io/en/stable/)

31. kubeflow/pipelines ⭐ 3,648
Machine Learning Pipelines for Kubeflow
🔗 [www.kubeflow.org/docs/components/pipelines](https://www.kubeflow.org/docs/components/pipelines/)

32. polyaxon/polyaxon ⭐ 3,588
MLOps Tools For Managing & Orchestrating The Machine Learning LifeCycle
🔗 [polyaxon.com](https://polyaxon.com)

33. ploomber/ploomber ⭐ 3,534
The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️
🔗 [docs.ploomber.io](https://docs.ploomber.io)

34. towhee-io/towhee ⭐ 3,276
Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.
🔗 [towhee.io](https://towhee.io)

35. determined-ai/determined ⭐ 3,072
Determined is an open-source machine learning platform that simplifies distributed training, hyperparameter tuning, experiment tracking, and resource management. Works with PyTorch and TensorFlow.
🔗 [determined.ai](https://determined.ai)

36. leptonai/leptonai ⭐ 2,671
A Pythonic framework to simplify AI service building
🔗 [lepton.ai](https://lepton.ai/)

37. azure/PyRIT ⭐ 2,035
The Python Risk Identification Tool for generative AI (PyRIT) is an open access automation framework to empower security professionals and ML engineers to red team foundation models and their applications.
🔗 [azure.github.io/pyrit](https://azure.github.io/PyRIT/)

38. dagworks-inc/hamilton ⭐ 1,960
Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage/tracing and metadata. Runs and scales everywhere python does.
🔗 [hamilton.dagworks.io/en/latest](https://hamilton.dagworks.io/en/latest/)

39. meltano/meltano ⭐ 1,894
Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to writing, maintaining, and scaling your own API integrations.
🔗 [meltano.com](https://meltano.com/)

40. dstackai/dstack ⭐ 1,643
dstack is a lightweight, open-source alternative to Kubernetes & Slurm, simplifying AI container orchestration with multi-cloud & on-prem support. It natively supports NVIDIA, AMD, & TPU.
🔗 [dstack.ai/docs](https://dstack.ai/docs)

41. hi-primus/optimus ⭐ 1,487
🚚 Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
🔗 [hi-optimus.com](https://hi-optimus.com)

42. kubeflow/examples ⭐ 1,423
A repository to host extended examples and tutorials

43. dagworks-inc/burr ⭐ 1,416
Build applications that make decisions (chatbots, agents, simulations, etc...). Monitor, trace, persist, and execute on your own infrastructure.
🔗 [burr.dagworks.io](https://burr.dagworks.io)

## Machine Learning - Reinforcement

Machine learning libraries and toolkits that cross over with reinforcement learning in some way: agent reinforcement learning, agent environments, RLHF.

1. openai/gym ⭐ 35,089
A toolkit for developing and comparing reinforcement learning algorithms.
🔗 [www.gymlibrary.dev](https://www.gymlibrary.dev)

2. openai/baselines ⭐ 15,938
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

3. google/dopamine ⭐ 10,601
Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.
🔗 [github.com/google/dopamine](https://github.com/google/dopamine)

4. thu-ml/tianshou ⭐ 8,129
An elegant PyTorch deep reinforcement learning library.
🔗 [tianshou.org](https://tianshou.org)

5. deepmind/pysc2 ⭐ 8,049
StarCraft II Learning Environment

6. farama-foundation/Gymnasium ⭐ 7,778
An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)
🔗 [gymnasium.farama.org](https://gymnasium.farama.org)

7. lucidrains/PaLM-rlhf-pytorch ⭐ 7,741
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

8. tensorlayer/TensorLayer ⭐ 7,338
Deep Learning and Reinforcement Learning Library for Scientists and Engineers
🔗 [tensorlayerx.com](http://tensorlayerx.com)

9. keras-rl/keras-rl ⭐ 5,532
Deep Reinforcement Learning for Keras.
🔗 [keras-rl.readthedocs.io](http://keras-rl.readthedocs.io/)

10. deepmind/dm_control ⭐ 3,875
Google DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.

11. ai4finance-foundation/ElegantRL ⭐ 3,812
Massively Parallel Deep Reinforcement Learning. 🔥
🔗 [ai4finance.org](https://ai4finance.org)

12. facebookresearch/ReAgent ⭐ 3,574
A platform for Reasoning systems (Reinforcement Learning, Contextual Bandits, etc.)
🔗 [reagent.ai](https://reagent.ai)

13. deepmind/acme ⭐ 3,560
A library of reinforcement learning components and agents

14. opendilab/DI-engine ⭐ 3,177
DI-engine is a generalized decision intelligence engine for PyTorch and JAX. It provides python-first and asynchronous-native task and middleware abstractions
🔗 [di-engine-docs.readthedocs.io](https://di-engine-docs.readthedocs.io)

15. eureka-research/Eureka ⭐ 2,866
Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models" (ICLR 2024)
🔗 [eureka-research.github.io](https://eureka-research.github.io/)

16. pettingzoo-team/PettingZoo ⭐ 2,697
An API standard for multi-agent reinforcement learning environments, with popular reference environments and related utilities
🔗 [pettingzoo.farama.org](https://pettingzoo.farama.org)

17. pytorch/rl ⭐ 2,454
A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.
🔗 [pytorch.org/rl](https://pytorch.org/rl)

18. kzl/decision-transformer ⭐ 2,448
Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.

19. anthropics/hh-rlhf ⭐ 1,657
Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"
🔗 [arxiv.org/abs/2204.05862](https://arxiv.org/abs/2204.05862)

20. arise-initiative/robosuite ⭐ 1,424
robosuite: A Modular Simulation Framework and Benchmark for Robot Learning
🔗 [robosuite.ai](https://robosuite.ai)

21. humancompatibleai/imitation ⭐ 1,362
Clean PyTorch implementations of imitation and reward learning algorithms
🔗 [imitation.readthedocs.io](https://imitation.readthedocs.io/)

22. denys88/rl_games ⭐ 975
RL Games: High performance RL library

23. google-deepmind/meltingpot ⭐ 642
A suite of test scenarios for multi-agent reinforcement learning.

## Natural Language Processing

Natural language processing libraries and toolkits: text processing, topic modelling, tokenisers, chatbots. Also see the LLMs and ChatGPT category for crossover.

1. huggingface/transformers ⭐ 137,264
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
🔗 [huggingface.co/transformers](https://huggingface.co/transformers)

2. pytorch/fairseq ⭐ 30,771
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

3. explosion/spaCy ⭐ 30,601
💫 Industrial-strength Natural Language Processing (NLP) in Python
🔗 [spacy.io](https://spacy.io)

4. myshell-ai/OpenVoice ⭐ 30,344
Instant voice cloning by MIT and MyShell.
🔗 [research.myshell.ai/open-voice](https://research.myshell.ai/open-voice)

5. microsoft/unilm ⭐ 20,533
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
🔗 [aka.ms/generalai](https://aka.ms/GeneralAI)

6. huggingface/datasets ⭐ 19,438
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
🔗 [huggingface.co/docs/datasets](https://huggingface.co/docs/datasets)

7. vikparuchuri/marker ⭐ 19,078
Marker converts PDF, EPUB, and MOBI to markdown. It's 10x faster than nougat, more accurate on most documents, and has low hallucination risk.
🔗 [www.datalab.to](https://www.datalab.to)

8. rare-technologies/gensim ⭐ 15,768
Topic Modelling for Humans
🔗 [radimrehurek.com/gensim](https://radimrehurek.com/gensim)

9. ukplab/sentence-transformers ⭐ 15,704
State-of-the-Art Text Embeddings
🔗 [www.sbert.net](https://www.sbert.net)

10. vikparuchuri/surya ⭐ 15,232
OCR, layout analysis, reading order, table recognition in 90+ languages
🔗 [www.datalab.to](https://www.datalab.to)

11. gunthercox/ChatterBot ⭐ 14,136
ChatterBot is a machine learning, conversational dialog engine for creating chat bots
🔗 [chatterbot.readthedocs.io](https://chatterbot.readthedocs.io)

12. flairnlp/flair ⭐ 14,005
A very simple framework for state-of-the-art Natural Language Processing (NLP)
🔗 [flairnlp.github.io/flair](https://flairnlp.github.io/flair/)

13. nltk/nltk ⭐ 13,742
NLTK Source
🔗 [www.nltk.org](https://www.nltk.org)

14. m-bain/whisperX ⭐ 13,191
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

15. openai/tiktoken ⭐ 12,926
tiktoken is a fast BPE tokeniser for use with OpenAI's models.

16. nvidia/NeMo ⭐ 12,633
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
🔗 [docs.nvidia.com/nemo-framework/user-guide/latest/overview.html](https://docs.nvidia.com/nemo-framework/user-guide/latest/overview.html)

17. jina-ai/clip-as-service ⭐ 12,521
🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP
🔗 [clip-as-service.jina.ai](https://clip-as-service.jina.ai)

18. allenai/allennlp ⭐ 11,776
An open-source NLP research library, built on PyTorch.
🔗 [www.allennlp.org](http://www.allennlp.org)

19. facebookresearch/seamless_communication ⭐ 11,060
Foundational Models for State-of-the-Art Speech and Text Translation

20. facebookresearch/ParlAI ⭐ 10,498
A framework for training and evaluating AI models on a variety of openly available dialogue datasets.
🔗 [parl.ai](https://parl.ai)

21. google/sentencepiece ⭐ 10,435
Unsupervised text tokenizer for Neural Network-based text generation.

22. neuml/txtai ⭐ 9,871
💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows
🔗 [neuml.github.io/txtai](https://neuml.github.io/txtai)

23. doccano/doccano ⭐ 9,679
Open source annotation tool for machine learning practitioners.

24. sloria/TextBlob ⭐ 9,206
Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.
🔗 [textblob.readthedocs.io](https://textblob.readthedocs.io/)

25. speechbrain/speechbrain ⭐ 9,151
A PyTorch-based Speech Toolkit
🔗 [speechbrain.github.io](http://speechbrain.github.io)

26. facebookresearch/nougat ⭐ 9,133
Implementation of Nougat Neural Optical Understanding for Academic Documents
🔗 [facebookresearch.github.io/nougat](https://facebookresearch.github.io/nougat/)

27. togethercomputer/OpenChatKit ⭐ 9,016
OpenChatKit provides a powerful, open-source base to create both specialized and general purpose chatbots

28. clips/pattern ⭐ 8,765
Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.
🔗 [github.com/clips/pattern/wiki](https://github.com/clips/pattern/wiki)

29. espnet/espnet ⭐ 8,658
End-to-End Speech Processing Toolkit
🔗 [espnet.github.io/espnet](https://espnet.github.io/espnet/)

30. deeppavlov/DeepPavlov ⭐ 6,759
An open source library for deep learning end-to-end dialog systems and chatbots.
🔗 [deeppavlov.ai](https://deeppavlov.ai)

31. facebookresearch/metaseq ⭐ 6,520
A codebase for working with Open Pre-trained Transformers, originally forked from fairseq.

32. kingoflolz/mesh-transformer-jax ⭐ 6,313
Model parallel transformers in JAX and Haiku

33. maartengr/BERTopic ⭐ 6,288
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
🔗 [maartengr.github.io/bertopic](https://maartengr.github.io/BERTopic/)

34. aiwaves-cn/agents ⭐ 5,386
An Open-source Framework for Data-centric, Self-evolving Autonomous Language Agents

35. layout-parser/layout-parser ⭐ 5,006
A Unified Toolkit for Deep Learning Based Document Image Analysis
🔗 [layout-parser.github.io](https://layout-parser.github.io/)

36. salesforce/CodeGen ⭐ 4,973
CodeGen is a family of open-source models for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.

37. minimaxir/textgenrnn ⭐ 4,936
Easily train your own text-generating neural network of any size and complexity on any text dataset with a few lines of code.

38. quivrhq/MegaParse ⭐ 4,871
File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.
🔗 [megaparse.com](https://megaparse.com)

39. makcedward/nlpaug ⭐ 4,487
Data augmentation for NLP
🔗 [makcedward.github.io](https://makcedward.github.io/)

40. facebookresearch/DrQA ⭐ 4,481
Reading Wikipedia to Answer Open-Domain Questions

41. argilla-io/argilla ⭐ 4,164
Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets
🔗 [docs.argilla.io](https://docs.argilla.io)

42. thilinarajapakse/simpletransformers ⭐ 4,139
Transformers for Information Retrieval, Text Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conversational AI
🔗 [simpletransformers.ai](https://simpletransformers.ai/)

43. maartengr/KeyBERT ⭐ 3,640
Minimal keyword extraction with BERT
🔗 [maartengr.github.io/keybert](https://MaartenGr.github.io/KeyBERT/)

44. life4/textdistance ⭐ 3,428
📏 Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage.

45. promptslab/Promptify ⭐ 3,358
Prompt Engineering | Prompt Versioning | Use GPT or other prompt based models to get structured output. Join our discord for Prompt-Engineering, LLMs and other latest research
🔗 [discord.gg/m88xfymbk6](https://discord.gg/m88xfYMbK6)

46. jsvine/markovify ⭐ 3,313
A simple, extensible Markov chain generator.

47. bytedance/lightseq ⭐ 3,240
LightSeq: A High Performance Library for Sequence Processing and Generation

48. errbotio/errbot ⭐ 3,151
Errbot is a chatbot, a daemon that connects to your favorite chat service and brings your tools and some fun into the conversation.
🔗 [errbot.io](http://errbot.io)

49. neuralmagic/deepsparse ⭐ 3,069
Sparsity-aware deep learning inference runtime for CPUs
🔗 [neuralmagic.com/deepsparse](https://neuralmagic.com/deepsparse/)

50. huawei-noah/Pretrained-Language-Model ⭐ 3,043
Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.

51. ddangelov/Top2Vec ⭐ 2,971
Top2Vec learns jointly embedded topic, document and word vectors.

52. jbesomi/texthero ⭐ 2,893
Text preprocessing, representation and visualization from zero to hero.
🔗 [texthero.org](https://texthero.org)

53. huggingface/neuralcoref ⭐ 2,865
✨Fast Coreference Resolution in spaCy with Neural Networks
🔗 [huggingface.co/coref](https://huggingface.co/coref/)

54. salesforce/CodeT5 ⭐ 2,846
Home of CodeT5: Open Code LLMs for Code Understanding and Generation
🔗 [arxiv.org/abs/2305.07922](https://arxiv.org/abs/2305.07922)

55. bigscience-workshop/promptsource ⭐ 2,736
Toolkit for creating, sharing and using natural language prompts.

56. huggingface/setfit ⭐ 2,283
SetFit is an efficient and prompt-free framework for few-shot fine-tuning of Sentence Transformers.
🔗 [hf.co/docs/setfit](https://hf.co/docs/setfit)

57. bhavnicksm/chonkie ⭐ 2,195
🦛 CHONK your texts with Chonkie ✨ - The no-nonsense RAG chunking library
🔗 [docs.chonkie.ai](https://docs.chonkie.ai)

58. alibaba/EasyNLP ⭐ 2,084
EasyNLP: A Comprehensive and Easy-to-use NLP Toolkit

59. jamesturk/jellyfish ⭐ 2,081
🪼 a python library for doing approximate and phonetic matching of strings.
🔗 [jamesturk.github.io/jellyfish](https://jamesturk.github.io/jellyfish/)

60. thudm/P-tuning-v2 ⭐ 1,997
An optimized deep prompt tuning strategy comparable to fine-tuning across scales and tasks

61. marella/ctransformers ⭐ 1,828
Python bindings for the Transformer models implemented in C/C++ using GGML library.

62. featureform/featureform ⭐ 1,826
The Virtual Feature Store. Turn your existing data infrastructure into a feature store.
🔗 [www.featureform.com](https://www.featureform.com)

63. deepset-ai/FARM ⭐ 1,747
🏡 Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.
🔗 [farm.deepset.ai](https://farm.deepset.ai)

64. franck-dernoncourt/NeuroNER ⭐ 1,702
Named-entity recognition using neural networks. Easy-to-use and state-of-the-art results.
🔗 [neuroner.com](http://neuroner.com)

65. explosion/spacy-models ⭐ 1,676
💫 Models for the spaCy Natural Language Processing (NLP) library
🔗 [spacy.io](https://spacy.io)

66. google-research/language ⭐ 1,637
Shared repository for open-sourced projects from the Google AI Language team.
🔗 [ai.google/research/teams/language](https://ai.google/research/teams/language/)

67. plasticityai/magnitude ⭐ 1,636
A fast, efficient universal vector embedding utility package.

68. urchade/GLiNER ⭐ 1,619
Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024
🔗 [arxiv.org/abs/2311.08526](https://arxiv.org/abs/2311.08526)

69. arxiv-vanity/arxiv-vanity ⭐ 1,613
Renders papers from arXiv as responsive web pages so you don't have to squint at a PDF.
🔗 [www.arxiv-vanity.com](https://www.arxiv-vanity.com)

70. chrismattmann/tika-python ⭐ 1,527
Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.

71. dmmiller612/bert-extractive-summarizer ⭐ 1,416
Easy to use extractive text summarization with BERT

72. intellabs/fastRAG ⭐ 1,414
Efficient Retrieval Augmentation and Generation Framework

73. nomic-ai/nomic ⭐ 1,407
Interact, analyze and structure massive text, image, embedding, audio and video datasets
🔗 [atlas.nomic.ai](https://atlas.nomic.ai)

74. gunthercox/chatterbot-corpus ⭐ 1,376
A multilingual dialog corpus
🔗 [chatterbot-corpus.readthedocs.io](http://chatterbot-corpus.readthedocs.io)

75. jonasgeiping/cramming ⭐ 1,305
Cramming the training of a (BERT-type) language model into limited compute.

76. pemistahl/lingua-py ⭐ 1,201
The most accurate natural language detection library for Python, suitable for short text and mixed-language text

77. openai/grade-school-math ⭐ 1,140
GSM8K, a dataset of 8.5K high quality linguistically diverse grade school math word problems

78. abertsch72/unlimiformer ⭐ 1,060
Public repo for the NeurIPS 2023 paper "Unlimiformer: Long-Range Transformers with Unlimited Length Input"

79. unitaryai/detoxify ⭐ 985
Toxic Comment Classification with PyTorch Lightning and Transformers
🔗 [www.unitary.ai](https://www.unitary.ai/)

80. answerdotai/ModernBERT โญ 950
Bringing BERT into modernity via both architecture changes and scaling
๐Ÿ”— [arxiv.org/abs/2412.13663](https://arxiv.org/abs/2412.13663)

81. norskregnesentral/skweak โญ 923
skweak: A software toolkit for weak supervision applied to NLP tasks

82. keras-team/keras-hub โญ 826
Pretrained model hub for Keras 3.
๐Ÿ”— [keras.io/keras_hub](https://keras.io/keras_hub/)

83. explosion/spacy-streamlit ⭐ 819
👑 spaCy building blocks and visualizers for Streamlit apps
🔗 [share.streamlit.io/ines/spacy-streamlit-demo/master/app.py](https://share.streamlit.io/ines/spacy-streamlit-demo/master/app.py)

84. paddlepaddle/RocketQA ⭐ 773
🚀 RocketQA, dense retrieval for information retrieval and question answering, including both Chinese and English state-of-the-art models.

85. nvidia/nv-ingest ⭐ 651
NVIDIA-Ingest is a scalable, performance-oriented document content and metadata extraction microservice.

86. webis-de/small-text ⭐ 601
Small-Text provides state-of-the-art Active Learning for Text Classification. Several pre-implemented Query Strategies, Initialization Strategies, and Stopping Criteria are provided, which can be easily mixed and matched to build active learning experiments or applications.
🔗 [small-text.readthedocs.io](https://small-text.readthedocs.io/)

87. babelscape/rebel ⭐ 507
REBEL is a seq2seq model that simplifies Relation Extraction (EMNLP 2021).

## Packaging

Python packaging, dependency management and bundling.

1. pyenv/pyenv ⭐ 40,157
pyenv lets you easily switch between multiple versions of Python.

2. astral-sh/uv ⭐ 34,787
An extremely fast Python package installer and resolver, written in Rust. Designed as a drop-in replacement for pip and pip-compile.
🔗 [docs.astral.sh/uv](https://docs.astral.sh/uv)

3. python-poetry/poetry ⭐ 32,160
Python packaging and dependency management made easy
🔗 [python-poetry.org](https://python-poetry.org)

4. pypa/pipenv ⭐ 24,960
A virtualenv management tool that supports a multitude of systems and nicely bridges the gaps between pip, python and virtualenv.
🔗 [pipenv.pypa.io](https://pipenv.pypa.io)

5. mitsuhiko/rye ⭐ 13,945
A Hassle-Free Python Experience
🔗 [rye.astral.sh](https://rye.astral.sh)

6. pyinstaller/pyinstaller ⭐ 12,038
Freeze (package) Python programs into stand-alone executables
🔗 [www.pyinstaller.org](http://www.pyinstaller.org)

7. pypa/pipx ⭐ 10,908
Install and Run Python Applications in Isolated Environments
🔗 [pipx.pypa.io](https://pipx.pypa.io)

8. pdm-project/pdm ⭐ 8,059
A modern Python package and dependency manager supporting the latest PEP standards
🔗 [pdm-project.org](https://pdm-project.org)

9. jazzband/pip-tools ⭐ 7,804
A set of tools to keep your pinned Python dependencies fresh (pip-compile + pip-sync)
🔗 [pip-tools.rtfd.io](https://pip-tools.rtfd.io)

10. mamba-org/mamba ⭐ 7,057
The Fast Cross-Platform Package Manager: mamba is a reimplementation of the conda package manager in C++
🔗 [mamba.readthedocs.io](https://mamba.readthedocs.io)

11. conda-forge/miniforge ⭐ 6,858
A conda-forge distribution.
🔗 [conda-forge.org/download](https://conda-forge.org/download)

12. conda/conda ⭐ 6,540
A system-level, binary package and environment manager running on all major operating systems and platforms.
🔗 [docs.conda.io/projects/conda](https://docs.conda.io/projects/conda/)

13. pypa/hatch ⭐ 6,226
Modern, extensible Python project management
🔗 [hatch.pypa.io/latest](https://hatch.pypa.io/latest/)

14. indygreg/PyOxidizer ⭐ 5,592
A modern Python application packaging and distribution tool

15. pypa/virtualenv ⭐ 4,856
A tool to create isolated Python environments. Since Python 3.3, a subset of it has been integrated into the standard lib venv module.
🔗 [virtualenv.pypa.io](https://virtualenv.pypa.io)

16. spack/spack ⭐ 4,493
A flexible package manager that supports multiple versions, configurations, platforms, and compilers.
🔗 [spack.io](https://spack.io)

17. prefix-dev/pixi ⭐ 3,635
pixi is a cross-platform, multi-language package manager and workflow tool built on the foundation of the conda ecosystem.
🔗 [pixi.sh](https://pixi.sh)

18. pantsbuild/pex ⭐ 3,394
A tool for generating .pex (Python EXecutable) files, lock files and venvs.
🔗 [docs.pex-tool.org](https://docs.pex-tool.org/)

19. beeware/briefcase ⭐ 2,719
Tools to support converting a Python project into a standalone native application.
🔗 [briefcase.readthedocs.io](https://briefcase.readthedocs.io/)

20. pypa/flit ⭐ 2,175
Simplified packaging of Python modules
🔗 [flit.pypa.io](https://flit.pypa.io/)

21. linkedin/shiv ⭐ 1,788
shiv is a command line utility for building fully self contained Python zipapps as outlined in PEP 441, but with all their dependencies included.

22. marcelotduarte/cx_Freeze ⭐ 1,379
cx_Freeze creates standalone executables from Python scripts, with the same performance, is cross-platform and should work on any platform that Python itself works on.
🔗 [marcelotduarte.github.io/cx_freeze](https://marcelotduarte.github.io/cx_Freeze/)

23. ofek/pyapp ⭐ 1,298
Runtime installer for Python applications
🔗 [ofek.dev/pyapp](https://ofek.dev/pyapp/)

24. pypa/gh-action-pypi-publish ⭐ 978
The blessed :octocat: GitHub Action, for publishing your 📦 distribution files to PyPI, the tokenless way: https://github.com/marketplace/actions/pypi-publish
🔗 [packaging.python.org/guides/publishing-package-distribution-releases-using-github-actions-ci-cd-workflows](https://packaging.python.org/guides/publishing-package-distribution-releases-using-github-actions-ci-cd-workflows/)

25. py2exe/py2exe ⭐ 890
Create standalone Windows programs from Python code
🔗 [www.py2exe.org](http://www.py2exe.org)

26. prefix-dev/rip ⭐ 654
RIP is a library that allows the resolving and installing of Python PyPI packages from Rust into a virtual environment. It's based on our experience with building Rattler and aims to provide the same experience but for PyPI instead of Conda.
🔗 [prefix.dev](https://prefix.dev)

27. snok/install-poetry ⭐ 597
GitHub Action for installing and configuring Poetry

28. python-poetry/install.python-poetry.org ⭐ 218
The official Poetry installation script
🔗 [install.python-poetry.org](https://install.python-poetry.org)

## Pandas

Pandas and dataframe libraries: data analysis, statistical reporting, pandas GUIs, pandas performance optimisations.

1. pandas-dev/pandas ⭐ 44,213
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
🔗 [pandas.pydata.org](https://pandas.pydata.org)

2. pola-rs/polars ⭐ 31,254
Dataframes powered by a multithreaded, vectorized query engine, written in Rust
🔗 [docs.pola.rs](https://docs.pola.rs)

3. duckdb/duckdb ⭐ 25,604
DuckDB is an analytical in-process SQL database management system
🔗 [www.duckdb.org](http://www.duckdb.org)

4. gventuri/pandas-ai ⭐ 13,927
Chat with your database (SQL, CSV, pandas, polars, mongodb, noSQL, etc). PandasAI makes data analysis conversational using LLMs (GPT 3.5 / 4, Anthropic, VertexAI) and RAG.
🔗 [pandas-ai.com](https://pandas-ai.com)

5. kanaries/pygwalker ⭐ 13,665
PyGWalker: Turn your pandas dataframe into an interactive UI for visual analysis
🔗 [kanaries.net/pygwalker](https://kanaries.net/pygwalker)

6. ydataai/ydata-profiling ⭐ 12,632
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
🔗 [docs.profiling.ydata.ai](https://docs.profiling.ydata.ai)

7. rapidsai/cudf ⭐ 8,576
cuDF is a GPU DataFrame library for loading, joining, aggregating, filtering, and otherwise manipulating data
🔗 [docs.rapids.ai/api/cudf/stable](https://docs.rapids.ai/api/cudf/stable/)

8. aws/aws-sdk-pandas ⭐ 3,959
pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).
🔗 [aws-sdk-pandas.readthedocs.io](https://aws-sdk-pandas.readthedocs.io)

9. nalepae/pandarallel ⭐ 3,714
A simple and efficient tool to parallelize Pandas operations on all available CPUs
🔗 [nalepae.github.io/pandarallel](https://nalepae.github.io/pandarallel)

10. unionai-oss/pandera ⭐ 3,509
A light-weight, flexible, and expressive statistical data testing library
🔗 [www.union.ai/pandera](https://www.union.ai/pandera)

11. adamerose/PandasGUI ⭐ 3,207
A GUI for Pandas DataFrames

12. blaze/blaze ⭐ 3,187
NumPy and Pandas interface to Big Data
🔗 [blaze.pydata.org](http://blaze.pydata.org)

13. pydata/pandas-datareader ⭐ 2,990
Extract data from a wide range of Internet sources into a pandas DataFrame.
🔗 [pydata.github.io/pandas-datareader/stable/index.html](https://pydata.github.io/pandas-datareader/stable/index.html)

14. scikit-learn-contrib/sklearn-pandas ⭐ 2,821
Pandas integration with sklearn

15. jmcarpenter2/swifter ⭐ 2,558
A package which efficiently applies any function to a pandas dataframe or series in the fastest available manner

16. eventual-inc/Daft ⭐ 2,457
Distributed data engine for Python/SQL designed for the cloud, powered by Rust
🔗 [getdaft.io](https://getdaft.io)

17. delta-io/delta-rs ⭐ 2,451
A native Rust library for Delta Lake, with bindings into Python
🔗 [delta-io.github.io/delta-rs](https://delta-io.github.io/delta-rs/)

18. fugue-project/fugue ⭐ 2,029
A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewrites.
🔗 [fugue-tutorials.readthedocs.io](https://fugue-tutorials.readthedocs.io/)

19. pyjanitor-devs/pyjanitor ⭐ 1,378
Clean APIs for data cleaning. Python implementation of R package Janitor
🔗 [pyjanitor-devs.github.io/pyjanitor](https://pyjanitor-devs.github.io/pyjanitor)

20. machow/siuba ⭐ 1,164
Python library for using dplyr like syntax with pandas and SQL
🔗 [siuba.org](https://siuba.org)

21. holoviz/hvplot ⭐ 1,159
A high-level plotting API for pandas, dask, xarray, and networkx built on HoloViews
🔗 [hvplot.holoviz.org](https://hvplot.holoviz.org)

22. renumics/spotlight ⭐ 1,140
Interactively explore unstructured datasets from your dataframe.
🔗 [renumics.com](https://renumics.com)

23. tkrabel/bamboolib ⭐ 941
bamboolib - a GUI for pandas DataFrames
🔗 [bamboolib.com](https://bamboolib.com)

24. mwouts/itables ⭐ 815
This package changes how Pandas and Polars DataFrames are rendered in Jupyter Notebooks. With itables you can display your tables as interactive DataTables that you can sort, paginate, scroll or filter.
🔗 [mwouts.github.io/itables](https://mwouts.github.io/itables/)

## Performance

Performance, parallelisation and low level libraries.

1. celery/celery ⭐ 25,206
Distributed Task Queue (development branch)
🔗 [docs.celeryq.dev](https://docs.celeryq.dev)

2. google/flatbuffers ⭐ 23,584
FlatBuffers: Memory Efficient Serialization Library
🔗 [flatbuffers.dev](https://flatbuffers.dev/)

3. pybind/pybind11 ⭐ 16,016
Seamless operability between C++11 and Python
🔗 [pybind11.readthedocs.io](https://pybind11.readthedocs.io/)

4. exaloop/codon ⭐ 15,247
A high-performance, zero-overhead, extensible Python compiler using LLVM
🔗 [docs.exaloop.io/codon](https://docs.exaloop.io/codon)

5. dask/dask ⭐ 12,807
Parallel computing with task scheduling
🔗 [dask.org](https://dask.org)

6. numba/numba ⭐ 10,104
NumPy aware dynamic Python compiler using LLVM
🔗 [numba.pydata.org](https://numba.pydata.org/)

7. modin-project/modin ⭐ 9,967
Modin: Scale your Pandas workflows by changing a single line of code
🔗 [modin.readthedocs.io](http://modin.readthedocs.io)

8. nebuly-ai/optimate ⭐ 8,375
A collection of libraries to optimise AI model performances
🔗 [www.nebuly.com](https://www.nebuly.com/)

9. vaexio/vaex ⭐ 8,317
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀
🔗 [vaex.io](https://vaex.io)

10. mher/flower ⭐ 6,546
Real-time monitor and web admin for Celery distributed task queue
🔗 [flower.readthedocs.io](https://flower.readthedocs.io)

11. python-trio/trio ⭐ 6,301
Trio – a friendly Python library for async concurrency and I/O
🔗 [trio.readthedocs.io](https://trio.readthedocs.io)

12. ultrajson/ultrajson ⭐ 4,354
Ultra fast JSON decoder and encoder written in C with Python bindings
🔗 [pypi.org/project/ujson](https://pypi.org/project/ujson/)

13. tlkh/asitop ⭐ 3,667
Perf monitoring CLI tool for Apple Silicon
🔗 [tlkh.github.io/asitop](https://tlkh.github.io/asitop/)

14. facebookincubator/cinder ⭐ 3,553
Cinder is Meta's internal performance-oriented production version of CPython.
🔗 [trycinder.com](https://trycinder.com)

15. airtai/faststream ⭐ 3,308
FastStream is a powerful and easy-to-use Python framework for building asynchronous services interacting with event streams such as Apache Kafka, RabbitMQ, NATS and Redis.
🔗 [faststream.airt.ai/latest](https://faststream.airt.ai/latest/)

16. ipython/ipyparallel ⭐ 2,599
IPython Parallel: Interactive Parallel Computing in Python
🔗 [ipyparallel.readthedocs.io](https://ipyparallel.readthedocs.io/)

17. intel/intel-extension-for-transformers ⭐ 2,151
⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms ⚡

18. h5py/h5py ⭐ 2,103
HDF5 for Python -- The h5py package is a Pythonic interface to the HDF5 binary data format.
🔗 [www.h5py.org](http://www.h5py.org)

19. agronholm/anyio ⭐ 1,878
High level asynchronous concurrency and networking framework that works on top of either trio or asyncio

20. tiangolo/asyncer ⭐ 1,759
Asyncer, async and await, focused on developer experience.
🔗 [asyncer.tiangolo.com](https://asyncer.tiangolo.com/)

21. faster-cpython/ideas ⭐ 1,705
Discussion and work tracker for Faster CPython project.

22. intel/intel-extension-for-pytorch ⭐ 1,678
A Python package that extends the official PyTorch to easily obtain performance gains on Intel platforms

23. dask/distributed ⭐ 1,587
A distributed task scheduler for Dask
🔗 [distributed.dask.org](https://distributed.dask.org)

24. nschloe/perfplot ⭐ 1,347
:chart_with_upwards_trend: Performance analysis for Python snippets

25. intel/scikit-learn-intelex ⭐ 1,237
Extension for Scikit-learn is a seamless way to speed up your Scikit-learn application
🔗 [intel.github.io/scikit-learn-intelex](https://intel.github.io/scikit-learn-intelex/)

26. markshannon/faster-cpython ⭐ 944
How to make CPython faster.

27. zerointensity/pointers.py ⭐ 921
Bringing the hell of pointers to Python.
🔗 [pointers.zintensity.dev](https://pointers.zintensity.dev/)

28. brandtbucher/specialist ⭐ 643
Visualize CPython's specializing, adaptive interpreter. 🔥

## Profiling

Memory and CPU/GPU profiling tools and libraries.

1. bloomberg/memray ⭐ 13,537
Memray is a memory profiler for Python
🔗 [bloomberg.github.io/memray](https://bloomberg.github.io/memray/)

2. benfred/py-spy ⭐ 13,091
Sampling profiler for Python programs

3. plasma-umass/scalene ⭐ 12,327
Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals

4. joerick/pyinstrument ⭐ 6,758
🚴 Call stack profiler for Python. Shows you why your code is slow!
🔗 [pyinstrument.readthedocs.io](https://pyinstrument.readthedocs.io/)

5. gaogaotiantian/viztracer ⭐ 5,703
A debugging and profiling tool that can trace and visualize python code execution
🔗 [viztracer.readthedocs.io](https://viztracer.readthedocs.io/)

6. pythonprofilers/memory_profiler ⭐ 4,409
Monitor Memory usage of Python code
🔗 [pypi.python.org/pypi/memory_profiler](http://pypi.python.org/pypi/memory_profiler)

7. reloadware/reloadium ⭐ 2,839
Hot Reloading and Profiling for Python

8. pyutils/line_profiler ⭐ 2,805
Line-by-line profiling for Python

9. jiffyclub/snakeviz ⭐ 2,378
An in-browser Python profile viewer
🔗 [jiffyclub.github.io/snakeviz](https://jiffyclub.github.io/snakeviz/)

10. p403n1x87/austin ⭐ 1,962
Python frame stack sampler for CPython
🔗 [pypi.org/project/austin-dist](https://pypi.org/project/austin-dist/)

11. pythonspeed/filprofiler ⭐ 849
A Python memory profiler for data processing and scientific computing applications
🔗 [pythonspeed.com/products/filmemoryprofiler](https://pythonspeed.com/products/filmemoryprofiler/)

## Security

Security related libraries: vulnerability discovery, SQL injection, environment auditing.

1. swisskyrepo/PayloadsAllTheThings ⭐ 62,342
A list of useful payloads and bypasses for Web Application Security and Pentest/CTF
🔗 [swisskyrepo.github.io/payloadsallthethings](https://swisskyrepo.github.io/PayloadsAllTheThings/)

2. sqlmapproject/sqlmap ⭐ 33,004
Automatic SQL injection and database takeover tool
🔗 [sqlmap.org](http://sqlmap.org)

3. certbot/certbot ⭐ 31,790
Certbot is EFF's tool to obtain certs from Let's Encrypt and (optionally) auto-enable HTTPS on your server. It can also act as a client for any other CA that uses the ACME protocol.

4. aquasecurity/trivy ⭐ 24,237
Find vulnerabilities, misconfigurations, secrets, SBOM in containers, Kubernetes, code repositories, clouds and more
🔗 [trivy.dev](https://trivy.dev)

5. bridgecrewio/checkov ⭐ 7,263
Checkov is a static code analysis tool for infrastructure as code (IaC) and also a software composition analysis (SCA) tool for images and open source packages.
🔗 [www.checkov.io](https://www.checkov.io/)

6. nccgroup/ScoutSuite ⭐ 6,840
Multi-Cloud Security Auditing Tool

7. stamparm/maltrail ⭐ 6,680
Malicious traffic detection system

8. pycqa/bandit ⭐ 6,603
Bandit is a tool designed to find common security issues in Python code.
🔗 [bandit.readthedocs.io](https://bandit.readthedocs.io)

9. rhinosecuritylabs/pacu ⭐ 4,456
The AWS exploitation framework, designed for testing the security of Amazon Web Services environments.
🔗 [rhinosecuritylabs.com/aws/pacu-open-source-aws-exploitation-framework](https://rhinosecuritylabs.com/aws/pacu-open-source-aws-exploitation-framework/)

10. dashingsoft/pyarmor ⭐ 3,828
A tool used to obfuscate Python scripts, bind obfuscated scripts to a fixed machine, or expire obfuscated scripts.
🔗 [pyarmor.dashingsoft.com](http://pyarmor.dashingsoft.com)

11. luijait/DarkGPT ⭐ 2,193
DarkGPT is an OSINT assistant based on GPT-4-200K (recommended use) designed to perform queries on leaked databases, thus providing an artificial intelligence assistant that can be useful in your traditional OSINT processes.

12. pyupio/safety ⭐ 1,766
Safety checks Python dependencies for known security vulnerabilities and suggests the proper remediations for vulnerabilities detected.
🔗 [safetycli.com/product/safety-cli](https://safetycli.com/product/safety-cli)

13. trailofbits/pip-audit ⭐ 994
Audits Python environments, requirements files and dependency trees for known security vulnerabilities, and can automatically fix them
🔗 [pypi.org/project/pip-audit](https://pypi.org/project/pip-audit/)

14. fadi002/de4py ⭐ 872
Toolkit for Python reverse engineering
🔗 [de4py.000.pe](https://de4py.000.pe)

15. thecyb3ralpha/BobTheSmuggler ⭐ 502
A tool that leverages HTML Smuggling Attack and allows you to create HTML files with embedded 7z/zip archives.

## Simulation

Simulation libraries: robotics, economic, agent-based, traffic, physics, astronomy, chemistry, quantum simulation. Also see the Maths and Science category for crossover.

1. atsushisakai/PythonRobotics ⭐ 23,940
Python sample codes for robotics algorithms.
🔗 [atsushisakai.github.io/pythonrobotics](https://atsushisakai.github.io/PythonRobotics/)

2. genesis-embodied-ai/Genesis ⭐ 21,940
Genesis is a physics platform, and generative data engine, designed for general purpose Robotics/Embodied AI/Physical AI applications

3. bulletphysics/bullet3 ⭐ 12,861
Bullet Physics SDK: real-time collision detection and multi-physics simulation for VR, games, visual effects, robotics, machine learning etc.
🔗 [bulletphysics.org](http://bulletphysics.org)

4. isl-org/Open3D ⭐ 11,742
Open3D: A Modern Library for 3D Data Processing
🔗 [www.open3d.org](http://www.open3d.org)

5. qiskit/qiskit ⭐ 5,483
Qiskit is an open-source SDK for working with quantum computers at the level of extended quantum circuits, operators, and primitives.
🔗 [www.ibm.com/quantum/qiskit](https://www.ibm.com/quantum/qiskit)

6. astropy/astropy ⭐ 4,529
Astronomy and astrophysics core library
🔗 [www.astropy.org](https://www.astropy.org)

7. nvidia/warp ⭐ 4,416
A Python framework for high performance GPU simulation and graphics
🔗 [nvidia.github.io/warp](https://nvidia.github.io/warp/)

8. quantumlib/Cirq ⭐ 4,382
A Python framework for creating, editing, and invoking Noisy Intermediate-Scale Quantum (NISQ) circuits.

9. chakazul/Lenia ⭐ 3,572
Lenia is a 2D cellular automaton with continuous space, time and states. It produces a huge variety of interesting mathematical life forms
🔗 [chakazul.github.io/lenia/javascript/lenia.html](https://chakazul.github.io/Lenia/JavaScript/Lenia.html)

10. openai/mujoco-py ⭐ 2,900
MuJoCo is a physics engine for detailed, efficient rigid body simulations with contacts. mujoco-py allows using MuJoCo from Python 3.

11. rdkit/rdkit ⭐ 2,736
The official sources for the RDKit library

12. projectmesa/mesa ⭐ 2,710
Mesa is an open-source Python library for agent-based modeling, ideal for simulating complex systems and exploring emergent behaviors.
🔗 [mesa.readthedocs.io](https://mesa.readthedocs.io)

13. nvidia-omniverse/IsaacLab ⭐ 2,554
Unified framework for robot learning built on NVIDIA Isaac Sim
🔗 [isaac-sim.github.io/isaaclab](https://isaac-sim.github.io/IsaacLab)

14. taichi-dev/difftaichi ⭐ 2,523
10 differentiable physical simulators built with Taichi differentiable programming (DiffTaichi, ICLR 2020)

15. google/brax ⭐ 2,444
Massively parallel rigidbody physics simulation on accelerator hardware.

16. nvidia-omniverse/IsaacGymEnvs ⭐ 2,148
Example RL environments for the NVIDIA Isaac Gym high performance environments

17. facebookresearch/habitat-lab ⭐ 2,071
A modular high-level library to train embodied AI agents across a variety of tasks and environments.
🔗 [aihabitat.org](https://aihabitat.org/)

18. quantecon/QuantEcon.py ⭐ 2,017
A community based Python library for quantitative economics
🔗 [quantecon.org/quantecon-py](https://quantecon.org/quantecon-py/)

19. microsoft/PromptCraft-Robotics ⭐ 1,934
Community for applying LLMs to robotics and a robot simulator with ChatGPT integration
🔗 [aka.ms/chatgpt-robotics](https://aka.ms/ChatGPT-Robotics)

20. eloialonso/diamond ⭐ 1,672
DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model
🔗 [diamond-wm.github.io](https://diamond-wm.github.io)

21. deepmodeling/deepmd-kit ⭐ 1,546
A deep learning package for many-body potential energy representation and molecular dynamics
🔗 [docs.deepmodeling.com/projects/deepmd](https://docs.deepmodeling.com/projects/deepmd/)

22. sail-sg/envpool ⭐ 1,114
C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.
🔗 [envpool.readthedocs.io](https://envpool.readthedocs.io)

23. bowang-lab/scGPT ⭐ 1,088
scGPT: Towards Building a Foundation Model for Single-Cell Multi-omics Using Generative AI
🔗 [scgpt.readthedocs.io/en/latest](https://scgpt.readthedocs.io/en/latest/)

24. a-r-j/graphein ⭐ 1,054
Protein Graph Library
🔗 [graphein.ai](https://graphein.ai/)

25. viblo/pymunk ⭐ 951
Pymunk is an easy-to-use pythonic 2d physics library that can be used whenever you need 2d rigid body physics from Python
🔗 [www.pymunk.org](http://www.pymunk.org)

26. google-deepmind/materials_discovery ⭐ 918
Graph Networks for Materials Science (GNoME) is a project centered around scaling machine learning methods to tackle materials science.

27. nvidia-omniverse/OmniIsaacGymEnvs ⭐ 898
Reinforcement Learning Environments for Omniverse Isaac Gym

28. altera-al/project-sid ⭐ 879
This repository contains our technical report: "Project Sid: Many-agent simulations toward AI civilization"

29. google/evojax ⭐ 860
EvoJAX is a scalable, general purpose, hardware-accelerated neuroevolution toolkit built on the JAX library

30. facebookresearch/fairo ⭐ 857
A modular embodied agent architecture and platform for building embodied agents

31. eureka-research/DrEureka ⭐ 832
Official Repository for "DrEureka: Language Model Guided Sim-To-Real Transfer" (RSS 2024)
🔗 [eureka-research.github.io/dr-eureka](https://eureka-research.github.io/dr-eureka/)

32. polymathicai/the_well ⭐ 703
15TB of Physics Simulations: collection of machine learning datasets containing numerical simulations of a wide variety of spatiotemporal physical systems.
🔗 [polymathic-ai.org/the_well](https://polymathic-ai.org/the_well/)

33. ur-whitelab/chemcrow-public ⭐ 665
Chemcrow

34. ur-whitelab/chemcrow-runs ⭐ 73
ur-whitelab/chemcrow-runs

## Study

Miscellaneous study resources: algorithms, general resources, system design, code repos for textbooks, best practices, tutorials.

1. thealgorithms/Python ⭐ 196,224
All Algorithms implemented in Python
🔗 [thealgorithms.github.io/python](https://thealgorithms.github.io/Python/)

2. microsoft/generative-ai-for-beginners ⭐ 66,165
21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
🔗 [microsoft.github.io/generative-ai-for-beginners](https://microsoft.github.io/generative-ai-for-beginners/)

3. jakevdp/PythonDataScienceHandbook ⭐ 43,566
Python Data Science Handbook: full text in Jupyter Notebooks
🔗 [jakevdp.github.io/pythondatasciencehandbook](http://jakevdp.github.io/PythonDataScienceHandbook)

4. mlabonne/llm-course ⭐ 40,728
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
🔗 [mlabonne.github.io/blog](https://mlabonne.github.io/blog/)

5. rasbt/LLMs-from-scratch ⭐ 37,403
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
🔗 [amzn.to/4fqvn0d](https://amzn.to/4fqvn0D)

6. realpython/python-guide ⭐ 28,518
Python best practices guidebook, written for humans.
🔗 [docs.python-guide.org](https://docs.python-guide.org)

7. christoschristofidis/awesome-deep-learning ⭐ 24,552
A curated list of awesome Deep Learning tutorials, projects and communities.

8. d2l-ai/d2l-en ⭐ 24,458
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
🔗 [d2l.ai](https://D2L.ai)

9. wesm/pydata-book ⭐ 22,476
Materials and IPython notebooks for "Python for Data Analysis" by Wes McKinney, published by O'Reilly Media

10. hannibal046/Awesome-LLM ⭐ 20,383
Awesome-LLM: a curated list of Large Language Models

11. microsoft/recommenders ⭐ 19,564
Best Practices on Recommendation Systems
🔗 [recommenders-team.github.io/recommenders/intro.html](https://recommenders-team.github.io/recommenders/intro.html)

12. fchollet/deep-learning-with-python-notebooks ⭐ 18,885
Jupyter notebooks for the code samples of the book "Deep Learning with Python"

13. graykode/nlp-tutorial ⭐ 14,375
Natural Language Processing Tutorial for Deep Learning Researchers
🔗 [www.reddit.com/r/machinelearning/comments/amfinl/project_nlptutoral_repository_who_is_studying](https://www.reddit.com/r/MachineLearning/comments/amfinl/project_nlptutoral_repository_who_is_studying/)

14. naklecha/llama3-from-scratch ⭐ 13,989
llama3 implementation one matrix multiplication at a time

15. shangtongzhang/reinforcement-learning-an-introduction ⭐ 13,726
Python Implementation of Reinforcement Learning: An Introduction

16. karpathy/nn-zero-to-hero ⭐ 12,618
Neural Networks: Zero to Hero

17. mrdbourke/pytorch-deep-learning ⭐ 11,662
Materials for the Learn PyTorch for Deep Learning: Zero to Mastery course.
🔗 [learnpytorch.io](https://learnpytorch.io)

18. eugeneyan/open-llms ⭐ 11,435
📋 A list of open LLMs available for commercial use.

19. karpathy/micrograd ⭐ 10,848
A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API

20. rucaibox/LLMSurvey ⭐ 10,725
The official GitHub page for the survey paper "A Survey of Large Language Models".
🔗 [arxiv.org/abs/2303.18223](https://arxiv.org/abs/2303.18223)

21. openai/spinningup ⭐ 10,338
An educational resource to help anyone learn deep reinforcement learning.
🔗 [spinningup.openai.com](https://spinningup.openai.com/)

22. srush/GPU-Puzzles ⭐ 10,278
Teaching beginner GPU programming in a completely interactive fashion

23. zhanymkanov/fastapi-best-practices ⭐ 9,859
FastAPI Best Practices and Conventions we used at our startup

24. nielsrogge/Transformers-Tutorials ⭐ 9,748
This repository contains demos I made with the Transformers library by HuggingFace.

25. mooler0410/LLMsPracticalGuide ⭐ 9,620
A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)
🔗 [arxiv.org/abs/2304.13712v2](https://arxiv.org/abs/2304.13712v2)

26. firmai/industry-machine-learning ⭐ 7,279
A curated list of applied machine learning and data science notebooks and libraries across different industries (by @firmai)
🔗 [www.sov.ai](https://www.sov.ai/)

27. gkamradt/langchain-tutorials ⭐ 6,834
Overview and tutorial of the LangChain Library

28. udlbook/udlbook ⭐ 6,832
Understanding Deep Learning - Simon J.D. Prince

29. neetcode-gh/leetcode ⭐ 5,814
Leetcode solutions for NeetCode.io

30. roboflow/notebooks ⭐ 5,745
Examples and tutorials on using SOTA computer vision models and techniques. Learn everything from old-school ResNet, through YOLO and object-detection transformers like DETR, to the latest models like Grounding DINO and SAM.
🔗 [roboflow.com/models](https://roboflow.com/models)

31. mrdbourke/tensorflow-deep-learning ⭐ 5,360
All course materials for the Zero to Mastery Deep Learning with TensorFlow course.
🔗 [dbourke.link/ztmtfcourse](https://dbourke.link/ZTMTFcourse)

32. udacity/deep-learning-v2-pytorch ⭐ 5,341
Projects and exercises for the latest Deep Learning ND program https://www.udacity.com/course/deep-learning-nanodegree--nd101

33. alirezadir/Machine-Learning-Interviews ⭐ 5,185
This repo is meant to serve as a guide for Machine Learning/AI technical interviews.

34. timofurrer/awesome-asyncio ⭐ 4,684
A curated list of awesome Python asyncio frameworks, libraries, software and resources

35. zotroneneis/machine_learning_basics ⭐ 4,327
Plain python implementations of basic machine learning algorithms

36. promptslab/Awesome-Prompt-Engineering โญ 4,062
This repository contains a hand-curated resources for Prompt Engineering with a focus on Generative Pre-trained Transformer (GPT), ChatGPT, PaLM etc
๐Ÿ”— [discord.gg/m88xfymbk6](https://discord.gg/m88xfYMbK6)

37. huggingface/deep-rl-class โญ 3,961
This repo contains the syllabus of the Hugging Face Deep Reinforcement Learning Course.

38. huggingface/smol-course โญ 3,856
a practical course on aligning language models for your specific use case. It's a handy way to get started with aligning language models, because everything runs on most local machines.

39. huggingface/diffusion-models-class โญ 3,787
Materials for the Hugging Face Diffusion Models Course

40. rasbt/machine-learning-book โญ 3,775
Code Repository for Machine Learning with PyTorch and Scikit-Learn
๐Ÿ”— [sebastianraschka.com/books/#machine-learning-with-pytorch-and-scikit-learn](https://sebastianraschka.com/books/#machine-learning-with-pytorch-and-scikit-learn)

41. cosmicpython/book โญ 3,433
A Book about Pythonic Application Architecture Patterns for Managing Complexity. Cosmos is the Opposite of Chaos you see. O'R. wouldn't actually let us call it "Cosmic Python" tho.
๐Ÿ”— [www.cosmicpython.com](https://www.cosmicpython.com)

42. fluentpython/example-code-2e โญ 3,370
Example code for Fluent Python, 2nd edition (O'Reilly 2022)
๐Ÿ”— [amzn.to/3j48u2j](https://amzn.to/3J48u2J)

43. handsonllm/Hands-On-Large-Language-Models โญ 3,355
Official code repo for the O'Reilly Book - "Hands-On Large Language Models"
๐Ÿ”— [www.llm-book.com](https://www.llm-book.com/)

44. amanchadha/coursera-deep-learning-specialization โญ 3,274
Notes, programming assignments and quizzes from all courses within the Coursera Deep Learning specialization offered by deeplearning.ai: (i) Neural Networks and Deep Learning; (ii) Improving Deep Neural Networks: Hyperparameter tuning, Regularization and Optimization; (iii) Structuring Machine Learning Projects; (iv...

45. mrdbourke/zero-to-mastery-ml โญ 3,038
All course materials for the Zero to Mastery Machine Learning and Data Science course.
๐Ÿ”— [dbourke.link/ztmmlcourse](https://dbourke.link/ZTMmlcourse)

46. krzjoa/awesome-python-data-science ⭐ 2,658
Probably the best curated list of data science software in Python.
🔗 [krzjoa.github.io/awesome-python-data-science](https://krzjoa.github.io/awesome-python-data-science)

47. gerdm/prml ⭐ 2,205
Repository of notes, code and notebooks in Python for the book Pattern Recognition and Machine Learning by Christopher Bishop

48. cgpotts/cs224u ⭐ 2,124
Code for CS224u: Natural Language Understanding

49. cerlymarco/MEDIUM_NoteBook ⭐ 2,091
Repository containing notebooks of my posts on Medium

50. trananhkma/fucking-awesome-python ⭐ 1,986
awesome-python with :octocat: ⭐ and 🍴

51. huggingface/cookbook ⭐ 1,758
Community-driven practical examples of building AI applications and solving various tasks with AI using open-source tools and models.
🔗 [huggingface.co/learn/cookbook](https://huggingface.co/learn/cookbook)

52. chandlerbang/awesome-self-supervised-gnn ⭐ 1,628
Papers about pretraining and self-supervised learning on Graph Neural Networks (GNN).

53. atcold/NYU-DLSP21 ⭐ 1,587
NYU Deep Learning Spring 2021
🔗 [atcold.github.io/nyu-dlsp21](https://atcold.github.io/NYU-DLSP21/)

54. patrickloeber/MLfromscratch ⭐ 1,354
Machine Learning algorithm implementations from scratch.

55. davidadsp/Generative_Deep_Learning_2nd_Edition ⭐ 1,168
The official code repository for the second edition of the O'Reilly book Generative Deep Learning: Teaching Machines to Paint, Write, Compose and Play.
🔗 [www.oreilly.com/library/view/generative-deep-learning/9781098134174](https://www.oreilly.com/library/view/generative-deep-learning/9781098134174/)

56. jackhidary/quantumcomputingbook ⭐ 815
Companion site for the textbook Quantum Computing: An Applied Approach

57. rasbt/LLM-workshop-2024 ⭐ 810
A 4-hour coding workshop to understand how LLMs are implemented and used

58. bayesianmodelingandcomputationinpython/BookCode_Edition1 ⭐ 508
Bayesian Modeling and Computation in Python: open-access version of the text and the code examples in the book
🔗 [www.bayesiancomputationbook.com](https://www.bayesiancomputationbook.com)

59. dylanhogg/awesome-python ⭐ 337
🐍 Hand-picked awesome Python libraries and frameworks, organised by category
🔗 [www.awesomepython.org](https://www.awesomepython.org)

## Template

Template tools and libraries: cookiecutter repos, generators, quick-starts.

1. tiangolo/full-stack-fastapi-template ⭐ 28,888
Full stack, modern web application template. Using FastAPI, React, SQLModel, PostgreSQL, Docker, GitHub Actions, automatic HTTPS and more.

2. cookiecutter/cookiecutter ⭐ 22,858
A cross-platform command-line utility that creates projects from cookiecutters (project templates), e.g. Python package projects, C projects.
🔗 [pypi.org/project/cookiecutter](https://pypi.org/project/cookiecutter/)

3. drivendata/cookiecutter-data-science ⭐ 8,458
A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.
🔗 [cookiecutter-data-science.drivendata.org](https://cookiecutter-data-science.drivendata.org/)

4. buuntu/fastapi-react ⭐ 2,277
🚀 Cookiecutter Template for FastAPI + React Projects. Using PostgreSQL, SQLAlchemy, and Docker

5. pyscaffold/pyscaffold ⭐ 2,137
🛠 Python project template generator with batteries included
🔗 [pyscaffold.org](https://pyscaffold.org)

6. cjolowicz/cookiecutter-hypermodern-python ⭐ 1,834
Cookiecutter template for a Python package based on the Hypermodern Python article series.
🔗 [cookiecutter-hypermodern-python.readthedocs.io](http://cookiecutter-hypermodern-python.readthedocs.io/)

7. tezromach/python-package-template ⭐ 1,088
🚀 Your next Python package needs a bleeding-edge project structure.

8. martinheinz/python-project-blueprint ⭐ 960
Blueprint/Boilerplate For Python Projects

9. fmind/mlops-python-package ⭐ 927
Best practices designed to support your MLOPs initiatives. You can use this package as part of your MLOps toolkit or platform e.g. Model Registry, Experiment Tracking, Realtime Inference
🔗 [fmind.github.io/mlops-python-package](https://fmind.github.io/mlops-python-package/)

## Terminal

Terminal and console tools and libraries: CLI tools, terminal based formatters, progress bars.

1. willmcgugan/rich ⭐ 50,220
Rich is a Python library for rich text and beautiful formatting in the terminal.
🔗 [rich.readthedocs.io/en/latest](https://rich.readthedocs.io/en/latest/)

2. tqdm/tqdm ⭐ 29,035
:zap: A Fast, Extensible Progress Bar for Python and CLI
🔗 [tqdm.github.io](https://tqdm.github.io)

3. google/python-fire ⭐ 27,314
Python Fire is a library for automatically generating command line interfaces (CLIs) from absolutely any Python object.

4. willmcgugan/textual ⭐ 26,563
The lean application framework for Python. Build sophisticated user interfaces with a simple Python API. Run your apps in the terminal and a web browser.
🔗 [textual.textualize.io](https://textual.textualize.io/)

5. tiangolo/typer ⭐ 16,157
Typer, build great CLIs. Easy to code. Based on Python type hints.
🔗 [typer.tiangolo.com](https://typer.tiangolo.com/)

6. pallets/click ⭐ 15,939
Python composable command line interface toolkit
🔗 [click.palletsprojects.com](https://click.palletsprojects.com)

7. saulpw/visidata ⭐ 7,987
A terminal spreadsheet multitool for discovering and arranging data
🔗 [visidata.org](http://visidata.org)

8. tconbeer/harlequin ⭐ 3,938
The SQL IDE for Your Terminal.
🔗 [harlequin.sh](https://harlequin.sh)

9. manrajgrover/halo ⭐ 2,908
💫 Beautiful spinners for terminal, IPython and Jupyter

10. urwid/urwid ⭐ 2,843
Console user interface library for Python (official repo)
🔗 [urwid.org](https://urwid.org)

11. textualize/trogon ⭐ 2,526
Easily turn your Click CLI into a powerful terminal application

12. darrenburns/elia ⭐ 1,934
A snappy, keyboard-centric terminal user interface for interacting with large language models. Chat with ChatGPT, Claude, Llama 3, Phi 3, Mistral, Gemma and more.

13. tmbo/questionary ⭐ 1,610
Python library to build pretty command line user prompts ✨ Easy to use multi-select lists, confirmations, free text prompts ...

14. jazzband/prettytable ⭐ 1,408
Display tabular data in a visually appealing ASCII table format
🔗 [pypi.org/project/prettytable](https://pypi.org/project/PrettyTable/)

15. 1j01/textual-paint ⭐ 977
:art: MS Paint in your terminal.
🔗 [pypi.org/project/textual-paint](https://pypi.org/project/textual-paint/)

## Testing

Testing libraries: unit testing, load testing, acceptance testing, code coverage, browser automation, plugins.

1. mitmproxy/mitmproxy ⭐ 37,448
An interactive TLS-capable intercepting HTTP proxy for penetration testers and software developers.
🔗 [mitmproxy.org](https://mitmproxy.org)

2. locustio/locust ⭐ 25,338
Write scalable load tests in plain Python 🚗💨
🔗 [locust.cloud](https://locust.cloud)

3. pytest-dev/pytest ⭐ 12,299
The pytest framework makes it easy to write small tests, yet scales to support complex functional testing
🔗 [pytest.org](https://pytest.org)

4. microsoft/playwright-python ⭐ 12,149
Python version of the Playwright testing and automation library.
🔗 [playwright.dev/python](https://playwright.dev/python/)

5. robotframework/robotframework ⭐ 10,114
Generic automation framework for acceptance testing and RPA
🔗 [robotframework.org](http://robotframework.org)

6. seleniumbase/SeleniumBase ⭐ 8,740
Python APIs for web automation, testing, and bypassing bot-detection.
🔗 [seleniumbase.io](https://seleniumbase.io)

7. getmoto/moto ⭐ 7,727
A library that allows you to easily mock out tests based on AWS infrastructure.
🔗 [docs.getmoto.org/en/latest](http://docs.getmoto.org/en/latest/)

8. hypothesisworks/hypothesis ⭐ 7,660
Hypothesis is a powerful, flexible, and easy to use library for property-based testing.
🔗 [hypothesis.works](https://hypothesis.works)

9. newsapps/beeswithmachineguns ⭐ 6,454
A utility for arming (creating) many bees (micro EC2 instances) to attack (load test) targets (web applications).
🔗 [apps.chicagotribune.com](http://apps.chicagotribune.com/)

10. codium-ai/qodo-cover ⭐ 4,706
Qodo-Cover: An AI-Powered Tool for Automated Test Generation and Code Coverage Enhancement! 💻🤖🧪🐞
🔗 [qodo.ai](https://qodo.ai/)

11. spulec/freezegun ⭐ 4,225
Let your Python tests travel through time

12. confident-ai/deepeval ⭐ 4,194
The LLM Evaluation Framework
🔗 [docs.confident-ai.com](https://docs.confident-ai.com/)

13. getsentry/responses ⭐ 4,189
A utility for mocking out the Python Requests library.

14. tox-dev/tox ⭐ 3,725
Command line driven CI frontend and development task automation tool.
🔗 [tox.wiki](https://tox.wiki)

15. behave/behave ⭐ 3,218
BDD, Python style.
🔗 [behave.readthedocs.io/en/latest](https://behave.readthedocs.io/en/latest/)

16. nedbat/coveragepy ⭐ 3,060
The code coverage tool for Python
🔗 [coverage.readthedocs.io](https://coverage.readthedocs.io)

17. kevin1024/vcrpy ⭐ 2,731
Automatically mock your HTTP interactions to simplify and speed up testing

18. cobrateam/splinter ⭐ 2,728
splinter - Python test framework for web applications
🔗 [splinter.readthedocs.org/en/stable/index.html](http://splinter.readthedocs.org/en/stable/index.html)

19. pytest-dev/pytest-testinfra ⭐ 2,388
With Testinfra you can write unit tests in Python to test the actual state of your servers configured by management tools like Salt, Ansible, Puppet, Chef and so on.
🔗 [testinfra.readthedocs.io](https://testinfra.readthedocs.io)

20. pytest-dev/pytest-mock ⭐ 1,874
Thin wrapper around the mock package for easier use with pytest
🔗 [pytest-mock.readthedocs.io/en/latest](https://pytest-mock.readthedocs.io/en/latest/)

21. pytest-dev/pytest-cov ⭐ 1,793
Coverage plugin for pytest.

22. pytest-dev/pytest-xdist ⭐ 1,518
pytest plugin for distributed testing and loop-on-failures testing modes.
🔗 [pytest-xdist.readthedocs.io](https://pytest-xdist.readthedocs.io)

23. pytest-dev/pytest-asyncio ⭐ 1,451
Asyncio support for pytest
🔗 [pytest-asyncio.readthedocs.io](https://pytest-asyncio.readthedocs.io)

24. taverntesting/tavern ⭐ 1,039
A command-line tool and Python library and Pytest plugin for automated testing of RESTful APIs, with a simple, concise and flexible YAML-based syntax
🔗 [taverntesting.github.io](https://taverntesting.github.io/)

## Machine Learning - Time Series

Machine learning and classical timeseries libraries: forecasting, seasonality, anomaly detection, econometrics.

1. facebook/prophet ⭐ 18,715
Tool for producing high quality forecasts for time series data that has multiple seasonality with linear or non-linear growth.
🔗 [facebook.github.io/prophet](https://facebook.github.io/prophet)

2. blue-yonder/tsfresh ⭐ 8,509
Automatic extraction of relevant features from time series
🔗 [tsfresh.readthedocs.io](http://tsfresh.readthedocs.io)

3. unit8co/darts ⭐ 8,211
A Python library for user-friendly forecasting and anomaly detection on time series.
🔗 [unit8co.github.io/darts](https://unit8co.github.io/darts/)

4. sktime/sktime ⭐ 8,063
A unified framework for machine learning with time series
🔗 [www.sktime.net](https://www.sktime.net)

5. facebookresearch/Kats ⭐ 5,496
Kats, a kit to analyze time series data, a lightweight, easy-to-use, generalizable, and extendable framework to perform time series analysis, from understanding the key statistics and characteristics, detecting change points and anomalies, to forecasting future trends.

6. awslabs/gluonts ⭐ 4,713
Probabilistic time series modeling in Python
🔗 [ts.gluon.ai](https://ts.gluon.ai)

7. nixtla/statsforecast ⭐ 4,068
Lightning ⚡️ fast forecasting with statistical and econometric models.
🔗 [nixtlaverse.nixtla.io/statsforecast](https://nixtlaverse.nixtla.io/statsforecast)

8. google-research/timesfm ⭐ 3,977
TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting.
🔗 [research.google/blog/a-decoder-only-foundation-model-for-time-series-forecasting](https://research.google/blog/a-decoder-only-foundation-model-for-time-series-forecasting/)

9. tdameritrade/stumpy ⭐ 3,712
STUMPY is a powerful and scalable Python library for modern time series analysis
🔗 [stumpy.readthedocs.io/en/latest](https://stumpy.readthedocs.io/en/latest/)

10. salesforce/Merlion ⭐ 3,493
Merlion: A Machine Learning Framework for Time Series Intelligence

11. amazon-science/chronos-forecasting ⭐ 2,790
Chronos: Pretrained Models for Probabilistic Time Series Forecasting
🔗 [arxiv.org/abs/2403.07815](https://arxiv.org/abs/2403.07815)

12. rjt1990/pyflux ⭐ 2,114
Open source time series library for Python

13. aistream-peelout/flow-forecast ⭐ 2,099
Deep learning PyTorch library for time series forecasting, classification, and anomaly detection (originally for flood forecasting).
🔗 [flow-forecast.atlassian.net/wiki/spaces/ff/overview](https://flow-forecast.atlassian.net/wiki/spaces/FF/overview)

14. uber/orbit ⭐ 1,914
A Python package for Bayesian forecasting with object-oriented design and probabilistic models under the hood.
🔗 [orbit-ml.readthedocs.io/en/stable](https://orbit-ml.readthedocs.io/en/stable/)

15. alkaline-ml/pmdarima ⭐ 1,609
A statistical library designed to fill the void in Python's time series analysis capabilities, including the equivalent of R's auto.arima function.
🔗 [www.alkaline-ml.com/pmdarima](https://www.alkaline-ml.com/pmdarima)

16. time-series-foundation-models/lag-llama ⭐ 1,300
Lag-Llama: Towards Foundation Models for Probabilistic Time Series Forecasting

17. winedarksea/AutoTS ⭐ 1,145
Automated Time Series Forecasting

18. autoviml/Auto_TS ⭐ 745
Automatically build ARIMA, SARIMAX, VAR, FB Prophet and XGBoost Models on Time Series data sets with a Single Line of Code. Created by Ram Seshadri. Collaborators welcome.

19. google/temporian ⭐ 682
Temporian is an open-source Python library for preprocessing ⚡ and feature engineering 🛠 temporal data 📈 for machine learning applications 🤖
🔗 [temporian.readthedocs.io](https://temporian.readthedocs.io)

## Typing

Typing libraries: static and run-time type checking, annotations.

1. python/mypy ⭐ 18,762
Optional static typing for Python
🔗 [www.mypy-lang.org](https://www.mypy-lang.org/)

2. microsoft/pyright ⭐ 13,652
Static Type Checker for Python

3. facebook/pyre-check ⭐ 6,890
Performant type-checking for Python.
🔗 [pyre-check.org](https://pyre-check.org/)

4. python-attrs/attrs ⭐ 5,349
Python Classes Without Boilerplate
🔗 [www.attrs.org](https://www.attrs.org/)

5. instagram/MonkeyType ⭐ 4,816
A Python library that generates static type annotations by collecting runtime types

6. google/pytype ⭐ 4,804
A static type analyzer for Python code
🔗 [google.github.io/pytype](https://google.github.io/pytype)

7. python/typeshed ⭐ 4,448
Collection of library stubs for Python, with static types

8. mtshiba/pylyzer ⭐ 2,596
A fast, feature-rich static code analyzer & language server for Python
🔗 [mtshiba.github.io/pylyzer](http://mtshiba.github.io/pylyzer/)

9. microsoft/pylance-release ⭐ 1,728
Fast, feature-rich language support for Python. Documentation and issues for Pylance.

10. agronholm/typeguard ⭐ 1,574
Run-time type checker for Python

11. patrick-kidger/torchtyping ⭐ 1,410
Type annotations and dynamic checking for a tensor's shape, dtype, names, etc.

12. robertcraigie/pyright-python ⭐ 191
Python command line wrapper for pyright, a static type checker
🔗 [pypi.org/project/pyright](https://pypi.org/project/pyright/)

## Utility

General utility libraries: miscellaneous tools, linters, code formatters, version management, package tools, documentation tools.

1. yt-dlp/yt-dlp ⭐ 95,562
A feature-rich command-line audio/video downloader
🔗 [discord.gg/h5mncfw63r](https://discord.gg/H5MNcFW63r)

2. home-assistant/core ⭐ 75,540
🏡 Open source home automation that puts local control and privacy first.
🔗 [www.home-assistant.io](https://www.home-assistant.io)

3. abi/screenshot-to-code ⭐ 66,604
Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
🔗 [screenshottocode.com](https://screenshottocode.com)

4. python/cpython ⭐ 64,629
The Python programming language
🔗 [www.python.org](https://www.python.org)

5. localstack/localstack ⭐ 56,992
💻 A fully functional local AWS cloud stack. Develop and test your cloud & Serverless apps offline
🔗 [localstack.cloud](https://localstack.cloud)

6. faif/python-patterns ⭐ 40,734
A collection of design patterns/idioms in Python

7. mingrammer/diagrams ⭐ 40,064
:art: Diagram as Code for prototyping cloud system architectures
🔗 [diagrams.mingrammer.com](https://diagrams.mingrammer.com)

8. ggerganov/whisper.cpp ⭐ 36,755
Port of OpenAI's Whisper model in C/C++

9. paul-gauthier/aider ⭐ 24,435
Aider is a command line tool that lets you pair program with LLMs, to edit code stored in your local git repository
🔗 [aider.chat](https://aider.chat/)

10. keon/algorithms ⭐ 24,209
Minimal examples of data structures and algorithms in Python

11. openai/openai-python ⭐ 23,865
The official Python library for the OpenAI API
🔗 [pypi.org/project/openai](https://pypi.org/project/openai/)

12. modularml/mojo ⭐ 23,521
The Mojo Programming Language
🔗 [docs.modular.com/mojo/manual](https://docs.modular.com/mojo/manual/)

13. norvig/pytudes ⭐ 23,235
Python programs, usually short, of considerable difficulty, to perfect particular skills.

14. pydantic/pydantic ⭐ 21,913
Data validation using Python type hints
🔗 [docs.pydantic.dev](https://docs.pydantic.dev)

15. squidfunk/mkdocs-material ⭐ 21,628
Documentation that simply works
🔗 [squidfunk.github.io/mkdocs-material](https://squidfunk.github.io/mkdocs-material/)

16. facebookresearch/audiocraft ⭐ 21,271
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

17. chriskiehl/Gooey ⭐ 20,752
Turn (almost) any Python command line program into a full GUI application with one line

18. delgan/loguru ⭐ 20,455
Python logging made (stupidly) simple

19. blakeblackshear/frigate ⭐ 20,157
NVR with realtime local object detection for IP cameras
🔗 [frigate.video](https://frigate.video)

20. mkdocs/mkdocs ⭐ 19,663
Project documentation with Markdown.
🔗 [www.mkdocs.org](https://www.mkdocs.org)

21. micropython/micropython ⭐ 19,662
MicroPython - a lean and efficient Python implementation for microcontrollers and constrained systems
🔗 [micropython.org](https://micropython.org)

22. rustpython/RustPython ⭐ 19,450
A Python Interpreter written in Rust
🔗 [rustpython.github.io](https://rustpython.github.io)

23. kivy/kivy ⭐ 17,900
Open source UI framework written in Python, running on Windows, Linux, macOS, Android and iOS
🔗 [kivy.org](https://kivy.org)

24. higherorderco/Bend ⭐ 17,882
A massively parallel, high-level programming language
🔗 [higherorderco.com](https://higherorderco.com)

25. ipython/ipython ⭐ 16,340
Official repository for IPython itself. Other repos in the IPython organization contain things like the website, documentation builds, etc.
🔗 [ipython.readthedocs.org](https://ipython.readthedocs.org)

26. alievk/avatarify-python ⭐ 16,340
Avatars for Zoom, Skype and other video-conferencing apps.

27. openai/triton ⭐ 13,914
Development repository for the Triton language and compiler
🔗 [triton-lang.org](https://triton-lang.org/)

28. zulko/moviepy ⭐ 12,848
Video editing with Python
🔗 [zulko.github.io/moviepy](https://zulko.github.io/moviepy/)

29. pyo3/pyo3 ⭐ 12,735
Rust bindings for the Python interpreter
🔗 [pyo3.rs](https://pyo3.rs)

30. pyodide/pyodide ⭐ 12,519
Pyodide is a Python distribution for the browser and Node.js based on WebAssembly
🔗 [pyodide.org/en/stable](https://pyodide.org/en/stable/)

31. pytube/pytube ⭐ 12,497
A lightweight, dependency-free Python library (and command-line utility) for downloading YouTube Videos.
🔗 [pytube.io](https://pytube.io)

32. python-pillow/Pillow ⭐ 12,449
The Python Imaging Library adds image processing capabilities to Python (Pillow is the friendly PIL fork)
🔗 [python-pillow.github.io](https://python-pillow.github.io)

33. caronc/apprise ⭐ 12,352
Apprise - Push Notifications that work with just about every platform!
🔗 [hub.docker.com/r/caronc/apprise](https://hub.docker.com/r/caronc/apprise)

34. nuitka/Nuitka ⭐ 12,330
Nuitka is a Python compiler written in Python. It's fully compatible with Python 2.6, 2.7, 3.4-3.13. You feed it your Python app, it does a lot of clever things, and spits out an executable or extension module.
🔗 [nuitka.net](http://nuitka.net)

35. dbader/schedule ⭐ 11,916
Python job scheduling for humans.
🔗 [schedule.readthedocs.io](https://schedule.readthedocs.io/)

36. ninja-build/ninja ⭐ 11,435
Ninja is a small build system with a focus on speed.
🔗 [ninja-build.org](https://ninja-build.org/)

37. secdev/scapy ⭐ 10,936
Scapy: the Python-based interactive packet manipulation program & library.
🔗 [scapy.net](https://scapy.net)

38. asweigart/pyautogui ⭐ 10,670
A cross-platform GUI automation Python module for human beings. Used to programmatically control the mouse & keyboard.

39. magicstack/uvloop ⭐ 10,551
Ultra fast asyncio event loop.

40. pallets/jinja ⭐ 10,496
A very fast and expressive template engine.
🔗 [jinja.palletsprojects.com](https://jinja.palletsprojects.com)

41. aristocratos/bpytop ⭐ 10,357
Linux/OSX/FreeBSD resource monitor

42. cython/cython ⭐ 9,676
The most widely used Python to C compiler
🔗 [cython.org](https://cython.org)

43. aws/serverless-application-model ⭐ 9,385
The AWS Serverless Application Model (AWS SAM) transform is an AWS CloudFormation macro that transforms SAM templates into CloudFormation templates.
🔗 [aws.amazon.com/serverless/sam](https://aws.amazon.com/serverless/sam)

44. paramiko/paramiko ⭐ 9,209
The leading native Python SSHv2 protocol library.
🔗 [paramiko.org](http://paramiko.org)

45. boto/boto3 ⭐ 9,129
AWS SDK for Python
🔗 [aws.amazon.com/sdk-for-python](https://aws.amazon.com/sdk-for-python/)

46. facebookresearch/hydra ⭐ 8,942
Hydra is a framework for elegantly configuring complex applications
🔗 [hydra.cc](https://hydra.cc)

47. arrow-py/arrow ⭐ 8,762
🏹 Better dates & times for Python
🔗 [arrow.readthedocs.io](https://arrow.readthedocs.io)

48. py-pdf/pypdf ⭐ 8,596
A pure-Python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files
🔗 [pypdf.readthedocs.io/en/latest](https://pypdf.readthedocs.io/en/latest/)

49. xonsh/xonsh ⭐ 8,496
:shell: Python-powered shell. Full-featured and cross-platform.
🔗 [xon.sh](http://xon.sh)

50. eternnoir/pyTelegramBotAPI ⭐ 8,206
Python Telegram Bot API.

51. jasonppy/VoiceCraft ⭐ 7,995
Zero-Shot Speech Editing and Text-to-Speech in the Wild

52. kellyjonbrazil/jc ⭐ 7,991
CLI tool and Python library that converts the output of popular command-line tools, file-types, and common strings to JSON, YAML, or Dictionaries. This allows piping of output to tools like jq and simplifying automation scripts.

53. googleapis/google-api-python-client ⭐ 7,902
🐍 The official Python client library for Google's discovery based APIs.
🔗 [googleapis.github.io/google-api-python-client/docs](https://googleapis.github.io/google-api-python-client/docs/)

54. theskumar/python-dotenv ⭐ 7,777
Reads key-value pairs from a .env file and can set them as environment variables. It helps in developing applications following the 12-factor principles.
🔗 [saurabh-kumar.com/python-dotenv](https://saurabh-kumar.com/python-dotenv/)

55. googlecloudplatform/python-docs-samples ⭐ 7,528
Code samples used on cloud.google.com

56. icloud-photos-downloader/icloud_photos_downloader ⭐ 7,384
A command-line tool to download photos from iCloud

57. google/latexify_py ⭐ 7,366
A library to generate LaTeX expression from Python code.

58. pygithub/PyGithub ⭐ 7,123
Typed interactions with the GitHub API v3
🔗 [pygithub.readthedocs.io](https://pygithub.readthedocs.io/)

59. marshmallow-code/marshmallow ⭐ 7,070
A lightweight library for converting complex objects to and from simple Python datatypes.
🔗 [marshmallow.readthedocs.io](https://marshmallow.readthedocs.io/)

60. bndr/pipreqs ⭐ 6,957
pipreqs - Generate pip requirements.txt file based on imports of any project. Looking for maintainers to move this project forward.

61. jd/tenacity ⭐ 6,911
Retrying library for Python
🔗 [tenacity.readthedocs.io](http://tenacity.readthedocs.io)

62. hugapi/hug ⭐ 6,869
Embrace the APIs of the future. Hug aims to make developing APIs as simple as possible, but no simpler.

63. pyca/cryptography ⭐ 6,783
cryptography is a package designed to expose cryptographic primitives and recipes to Python developers.
🔗 [cryptography.io](https://cryptography.io)

64. sphinx-doc/sphinx ⭐ 6,713
The Sphinx documentation generator
🔗 [www.sphinx-doc.org](https://www.sphinx-doc.org/)

65. gorakhargosh/watchdog ⭐ 6,699
Python library and shell utilities to monitor filesystem events.
🔗 [packages.python.org/watchdog](http://packages.python.org/watchdog/)

66. openai/point-e ⭐ 6,594
Point cloud diffusion for 3D model synthesis

67. timdettmers/bitsandbytes ⭐ 6,490
Accessible large language models via k-bit quantization for PyTorch.
🔗 [huggingface.co/docs/bitsandbytes/main/en/index](https://huggingface.co/docs/bitsandbytes/main/en/index)

68. ijl/orjson ⭐ 6,429
Fast, correct Python JSON library supporting dataclasses, datetimes, and numpy

69. agronholm/apscheduler ⭐ 6,416
Task scheduling library for Python

70. sdispater/pendulum ⭐ 6,319
Python datetimes made easy
🔗 [pendulum.eustace.io](https://pendulum.eustace.io)

71. scikit-image/scikit-image ⭐ 6,136
Image processing in Python
🔗 [scikit-image.org](https://scikit-image.org)

72. pdfminer/pdfminer.six ⭐ 6,103
Community maintained fork of pdfminer - we fathom PDF
🔗 [pdfminersix.readthedocs.io](https://pdfminersix.readthedocs.io)

73. wireservice/csvkit ⭐ 6,058
A suite of utilities for converting to and working with CSV, the king of tabular file formats.
🔗 [csvkit.readthedocs.io](https://csvkit.readthedocs.io)

74. pytransitions/transitions ⭐ 5,864
A lightweight, object-oriented finite state machine implementation in Python with many extensions

75. rsalmei/alive-progress ⭐ 5,637
A new kind of Progress Bar, with real-time throughput, ETA, and very cool animations!

76. spotify/pedalboard ⭐ 5,307
🎛 🔊 A Python library for audio.
🔗 [spotify.github.io/pedalboard](https://spotify.github.io/pedalboard)

77. buildbot/buildbot ⭐ 5,296
Python-based continuous integration testing framework; your pull requests are more than welcome!
🔗 [www.buildbot.net](https://www.buildbot.net)

78. traceloop/openllmetry ⭐ 5,259
Open-source observability for your LLM application, based on OpenTelemetry
🔗 [www.traceloop.com/openllmetry](https://www.traceloop.com/openllmetry)

79. prompt-toolkit/ptpython ⭐ 5,250
A better Python REPL

80. pywinauto/pywinauto ⭐ 5,084
Windows GUI Automation with Python (based on text properties)
🔗 [pywinauto.github.io](http://pywinauto.github.io/)

81. pycqa/pycodestyle ⭐ 5,055
Simple Python style checker in one Python file
🔗 [pycodestyle.pycqa.org](https://pycodestyle.pycqa.org)

82. tebelorg/RPA-Python ⭐ 5,027
Python package for doing RPA

83. jorgebastida/awslogs ⭐ 4,878
AWS CloudWatch logs for Humans™

84. pythonnet/pythonnet ⭐ 4,856
Python for .NET is a package that gives Python programmers nearly seamless integration with the .NET Common Language Runtime (CLR) and provides a powerful application scripting tool for .NET developers.
🔗 [pythonnet.github.io](http://pythonnet.github.io)

85. pytoolz/toolz ⭐ 4,741
A functional standard library for Python.
🔗 [toolz.readthedocs.org](http://toolz.readthedocs.org/)

86. hhatto/autopep8 ⭐ 4,586
A tool that automatically formats Python code to conform to the PEP 8 style guide.
🔗 [pypi.org/project/autopep8](https://pypi.org/project/autopep8/)

87. pyinvoke/invoke ⭐ 4,446
Pythonic task management & command execution.
🔗 [pyinvoke.org](http://pyinvoke.org)

88. bogdanp/dramatiq ⭐ 4,414
A fast and reliable background task processing library for Python 3.
🔗 [dramatiq.io](https://dramatiq.io)

89. ashleve/lightning-hydra-template โญ 4,378
PyTorch Lightning + Hydra. A very user-friendly template for ML experimentation. โšก๐Ÿ”ฅโšก

90. blealtan/efficient-kan โญ 4,176
An efficient pure-PyTorch implementation of Kolmogorov-Arnold Network (KAN).

91. adafruit/circuitpython โญ 4,165
CircuitPython - a Python implementation for teaching coding with microcontrollers
๐Ÿ”— [circuitpython.org](https://circuitpython.org)

92. evhub/coconut โญ 4,122
Coconut (coconut-lang.org) is a variant of Python that adds on top of Python syntax new features for simple, elegant, Pythonic functional programming.
๐Ÿ”— [coconut-lang.org](http://coconut-lang.org)

93. pyo3/maturin โญ 4,089
Build and publish crates with pyo3, cffi and uniffi bindings as well as rust binaries as python packages
๐Ÿ”— [maturin.rs](https://maturin.rs)

94. ets-labs/python-dependency-injector โญ 4,079
Dependency injection framework for Python
๐Ÿ”— [python-dependency-injector.ets-labs.org](https://python-dependency-injector.ets-labs.org/)

95. miguelgrinberg/python-socketio โญ 4,066
Python Socket.IO server and client

96. comet-ml/opik โญ 4,056
Opik is an open-source platform for evaluating, testing and monitoring LLM applications.
๐Ÿ”— [www.comet.com/docs/opik](https://www.comet.com/docs/opik/)

97. pyinfra-dev/pyinfra โญ 3,999
pyinfra turns Python code into shell commands and runs them on your servers. Execute ad-hoc commands and write declarative operations. Target SSH servers, local machine and Docker containers. Fast and scales from one server to thousands.
๐Ÿ”— [pyinfra.com](https://pyinfra.com)

98. joblib/joblib โญ 3,932
Computing with Python functions.
๐Ÿ”— [joblib.readthedocs.org](http://joblib.readthedocs.org)

99. python-markdown/markdown โญ 3,852
A Python implementation of John Gruberโ€™s Markdown with Extension support.
๐Ÿ”— [python-markdown.github.io](https://python-markdown.github.io/)

100. rspeer/python-ftfy โญ 3,836
Fixes mojibake and other glitches in Unicode text, after the fact.
๐Ÿ”— [ftfy.readthedocs.org](http://ftfy.readthedocs.org)

101. more-itertools/more-itertools ⭐ 3,774
More routines for operating on iterables, beyond itertools
🔗 [more-itertools.rtfd.io](https://more-itertools.rtfd.io)

102. zeromq/pyzmq ⭐ 3,760
PyZMQ: Python bindings for zeromq
🔗 [zguide.zeromq.org/py:all](http://zguide.zeromq.org/py:all)

103. pydata/xarray ⭐ 3,674
N-D labeled arrays and datasets in Python
🔗 [xarray.dev](https://xarray.dev)

104. hynek/structlog ⭐ 3,668
Simple, powerful, and fast logging for Python.
🔗 [www.structlog.org](https://www.structlog.org/)

105. pypi/warehouse ⭐ 3,629
The Python Package Index
🔗 [pypi.org](https://pypi.org)

106. tartley/colorama ⭐ 3,589
Simple cross-platform colored terminal text in Python

107. spotify/basic-pitch ⭐ 3,587
A lightweight yet powerful audio-to-MIDI converter with pitch bend detection
🔗 [basicpitch.io](https://basicpitch.io)

108. osohq/oso ⭐ 3,483
Deprecated: See README

109. jorisschellekens/borb ⭐ 3,428
borb is a library for reading, creating and manipulating PDF files in python.
🔗 [borbpdf.com](https://borbpdf.com/)

110. suor/funcy ⭐ 3,396
Fancy and practical functional tools

111. pyserial/pyserial ⭐ 3,282
Python serial port access library

112. camelot-dev/camelot ⭐ 3,090
A Python library to extract tabular data from PDFs
🔗 [camelot-py.readthedocs.io](https://camelot-py.readthedocs.io)

113. libaudioflux/audioFlux ⭐ 2,958
A library for audio and music analysis and feature extraction.
🔗 [audioflux.top](https://audioflux.top)

114. legrandin/pycryptodome ⭐ 2,900
A self-contained cryptographic library for Python
🔗 [www.pycryptodome.org](https://www.pycryptodome.org)

115. tox-dev/pipdeptree ⭐ 2,843
A command line utility to display dependency tree of the installed Python packages
🔗 [pypi.python.org/pypi/pipdeptree](https://pypi.python.org/pypi/pipdeptree)

116. lxml/lxml ⭐ 2,740
The lxml XML toolkit for Python
🔗 [lxml.de](https://lxml.de/)

117. liiight/notifiers ⭐ 2,690
The easy way to send notifications
🔗 [notifiers.readthedocs.io](http://notifiers.readthedocs.io/)

118. whylabs/whylogs ⭐ 2,673
An open-source data logging library for machine learning models and data pipelines. 📚 Provides visibility into data quality & model performance over time. 🛡️ Supports privacy-preserving data collection, ensuring safety & robustness. 📈
🔗 [whylogs.readthedocs.io](https://whylogs.readthedocs.io/)

119. cdgriffith/Box ⭐ 2,659
Python dictionaries with advanced dot notation access
🔗 [github.com/cdgriffith/box/wiki](https://github.com/cdgriffith/Box/wiki)

120. pexpect/pexpect ⭐ 2,642
A Python module for controlling interactive programs in a pseudo-terminal
🔗 [pexpect.readthedocs.io](http://pexpect.readthedocs.io/)

121. litl/backoff ⭐ 2,622
Python library providing function decorators for configurable backoff and retry

122. yaml/pyyaml ⭐ 2,608
Canonical source repository for PyYAML

123. scrapinghub/dateparser ⭐ 2,578
Python parser for human-readable dates

124. pypa/setuptools ⭐ 2,564
Official project repository for the Setuptools build system
🔗 [pypi.org/project/setuptools](https://pypi.org/project/setuptools/)

125. pydantic/logfire ⭐ 2,518
Uncomplicated Observability for Python and beyond! 🪵🔥
🔗 [logfire.pydantic.dev/docs](https://logfire.pydantic.dev/docs/)

126. jcrist/msgspec ⭐ 2,514
A fast serialization and validation library, with builtin support for JSON, MessagePack, YAML, and TOML
🔗 [jcristharif.com/msgspec](https://jcristharif.com/msgspec/)

127. pyston/pyston ⭐ 2,502
(No longer maintained) A faster and highly-compatible implementation of the Python programming language.
🔗 [www.pyston.org](https://www.pyston.org/)

128. dosisod/refurb ⭐ 2,492
A tool for refurbishing and modernizing Python codebases

129. nschloe/tikzplotlib ⭐ 2,449
📊 Save matplotlib figures as TikZ/PGFplots for smooth integration into LaTeX.

130. hgrecco/pint ⭐ 2,448
Operate and manipulate physical quantities in Python
🔗 [pint.readthedocs.org](http://pint.readthedocs.org/)

131. grantjenks/python-diskcache ⭐ 2,432
Python disk-backed cache (Django-compatible). Faster than Redis and Memcached. Pure-Python.
🔗 [www.grantjenks.com/docs/diskcache](http://www.grantjenks.com/docs/diskcache/)

132. dateutil/dateutil ⭐ 2,398
Useful extensions to the standard Python datetime features

133. tkem/cachetools ⭐ 2,386
Various memoizing collections and decorators, including variants of the Python Standard Library's @lru_cache function decorator

134. pndurette/gTTS ⭐ 2,364
Python library and CLI tool to interface with Google Translate's text-to-speech API
🔗 [gtts.readthedocs.org](http://gtts.readthedocs.org/)

135. kiminewt/pyshark ⭐ 2,287
Python wrapper for tshark, allowing python packet parsing using wireshark dissectors

136. rhettbull/osxphotos ⭐ 2,272
Python app to work with pictures and associated metadata from Apple Photos on macOS. Also includes a package to provide programmatic access to the Photos library, pictures, and metadata.

137. pyparsing/pyparsing ⭐ 2,249
Python library for creating PEG parsers

138. astanin/python-tabulate ⭐ 2,220
Pretty-print tabular data in Python, a library and a command-line utility. Repository migrated from bitbucket.org/astanin/python-tabulate.
🔗 [pypi.org/project/tabulate](https://pypi.org/project/tabulate/)

139. nateshmbhat/pyttsx3 ⭐ 2,192
Offline Text To Speech synthesis for python

140. ianmiell/shutit ⭐ 2,151
Automation framework for programmers
🔗 [ianmiell.github.io/shutit](http://ianmiell.github.io/shutit/)

141. grahamdumpleton/wrapt ⭐ 2,085
A Python module for decorators, wrappers and monkey patching.

142. google/gin-config ⭐ 2,072
Gin provides a lightweight configuration framework for Python

143. seperman/deepdiff ⭐ 2,063
DeepDiff: Deep Difference and search of any Python object/data. DeepHash: Hash of any object based on its contents. Delta: Use deltas to reconstruct objects by adding deltas together.
🔗 [zepworks.com](http://zepworks.com)

144. pyfilesystem/pyfilesystem2 ⭐ 2,016
Python's Filesystem abstraction layer
🔗 [www.pyfilesystem.org](https://www.pyfilesystem.org)

145. omry/omegaconf ⭐ 2,013
Flexible Python configuration system. The last one you will ever need.

146. mitmproxy/pdoc ⭐ 2,000
API Documentation for Python Projects
🔗 [pdoc.dev](https://pdoc.dev)

147. julienpalard/Pipe ⭐ 1,978
A Python library to use infix notation in Python

148. python-rope/rope ⭐ 1,976
a python refactoring library

149. numba/llvmlite ⭐ 1,972
A lightweight LLVM python binding for writing JIT compilers
🔗 [llvmlite.pydata.org](https://llvmlite.pydata.org/)

150. landscapeio/prospector ⭐ 1,966
Inspects Python source files and provides information about type and location of classes, methods etc

151. carpedm20/emoji ⭐ 1,916
emoji terminal output for Python

152. pydoit/doit ⭐ 1,901
CLI task management & automation tool
🔗 [pydoit.org](http://pydoit.org)

153. chaostoolkit/chaostoolkit ⭐ 1,896
Chaos Engineering Toolkit & Orchestration for Developers
🔗 [chaostoolkit.org](https://chaostoolkit.org)

154. hbldh/bleak ⭐ 1,892
A cross platform Bluetooth Low Energy Client for Python using asyncio

155. pygments/pygments ⭐ 1,878
Pygments is a generic syntax highlighter written in Python
🔗 [pygments.org](http://pygments.org/)

156. open-telemetry/opentelemetry-python ⭐ 1,849
OpenTelemetry Python API and SDK
🔗 [opentelemetry.io](https://opentelemetry.io)

157. samuelcolvin/watchfiles ⭐ 1,847
Simple, modern and fast file watching and code reload in Python.
🔗 [watchfiles.helpmanual.io](https://watchfiles.helpmanual.io)

158. p0dalirius/Coercer ⭐ 1,846
A python script to automatically coerce a Windows server to authenticate on an arbitrary machine through 12 methods.
🔗 [podalirius.net](https://podalirius.net/)

159. home-assistant/supervisor ⭐ 1,827
🏡 Home Assistant Supervisor
🔗 [home-assistant.io/hassio](https://home-assistant.io/hassio/)

160. joowani/binarytree ⭐ 1,810
Python Library for Studying Binary Trees
🔗 [binarytree.readthedocs.io](http://binarytree.readthedocs.io)

161. konradhalas/dacite ⭐ 1,792
Simple creation of data classes from dictionaries.

162. mkdocstrings/mkdocstrings ⭐ 1,771
📘 Automatic documentation from sources, for MkDocs.
🔗 [mkdocstrings.github.io](https://mkdocstrings.github.io/)

163. rubik/radon ⭐ 1,756
Various code metrics for Python code
🔗 [radon.readthedocs.org](http://radon.readthedocs.org/)

164. kalliope-project/kalliope ⭐ 1,721
Kalliope is a framework that will help you to create your own personal assistant.
🔗 [kalliope-project.github.io](https://kalliope-project.github.io/)

165. anthropics/anthropic-sdk-python ⭐ 1,618
SDK providing access to Anthropic's safety-first language model APIs

166. quodlibet/mutagen ⭐ 1,612
Python module for handling audio metadata
🔗 [mutagen.readthedocs.io](https://mutagen.readthedocs.io)

167. instagram/LibCST ⭐ 1,582
A concrete syntax tree parser and serializer library for Python that preserves many aspects of Python's abstract syntax tree
🔗 [libcst.readthedocs.io](https://libcst.readthedocs.io/)

168. facebookincubator/Bowler ⭐ 1,573
Safe code refactoring for modern Python.
🔗 [pybowler.io](https://pybowler.io/)

169. imageio/imageio ⭐ 1,532
Python library for reading and writing image data
🔗 [imageio.readthedocs.io](https://imageio.readthedocs.io)

170. fabiocaccamo/python-benedict ⭐ 1,517
📘 dict subclass with keylist/keypath support, built-in I/O operations (base64, csv, html, ini, json, pickle, plist, query-string, toml, xls, xml, yaml), s3 support and many utilities.

171. lcompilers/lpython ⭐ 1,510
Python compiler
🔗 [lpython.org](https://lpython.org/)

172. nficano/python-lambda ⭐ 1,503
A toolkit for developing and deploying serverless Python code in AWS Lambda.

173. aws-samples/aws-glue-samples ⭐ 1,451
AWS Glue code samples

174. lidatong/dataclasses-json ⭐ 1,395
Easily serialize Data Classes to and from JSON

175. brandon-rhodes/python-patterns ⭐ 1,353
Source code behind the python-patterns.guide site by Brandon Rhodes

176. aio-libs/yarl ⭐ 1,350
Yet another URL library
🔗 [yarl.aio-libs.org](https://yarl.aio-libs.org)

177. ossf/criticality_score ⭐ 1,347
Gives criticality score for an open source project

178. oracle/graalpython ⭐ 1,286
GraalPy – A high-performance embeddable Python 3 runtime for Java
🔗 [www.graalvm.org/python](https://www.graalvm.org/python/)

179. pypy/pypy ⭐ 1,196
PyPy is a very fast and compliant implementation of the Python language.
🔗 [pypy.org](https://pypy.org)

180. pyo3/rust-numpy ⭐ 1,161
PyO3-based Rust bindings of the NumPy C-API

181. pyfpdf/fpdf2 ⭐ 1,153
Simple PDF generation for Python
🔗 [py-pdf.github.io/fpdf2](https://py-pdf.github.io/fpdf2/)

182. pdoc3/pdoc ⭐ 1,142
🐍 ➡️ 📜 Auto-generate API documentation for Python projects
🔗 [pdoc3.github.io/pdoc](https://pdoc3.github.io/pdoc/)

183. ariebovenberg/whenever ⭐ 1,101
⏰ Modern datetime library for Python
🔗 [whenever.rtfd.io](https://whenever.rtfd.io)

184. fsspec/filesystem_spec ⭐ 1,086
A specification that python filesystems should adhere to.

185. c4urself/bump2version ⭐ 1,070
Version-bump your software with a single command
🔗 [pypi.python.org/pypi/bump2version](https://pypi.python.org/pypi/bump2version)

186. milvus-io/pymilvus ⭐ 1,063
Python SDK for Milvus.

187. metachris/logzero ⭐ 1,024
Robust and effective logging for Python 2 and 3.
🔗 [logzero.readthedocs.io](https://logzero.readthedocs.io)

188. extensityai/symbolicai ⭐ 1,001
Compositional Differentiable Programming Library - divide-and-conquer approach to break down a complex problem into smaller, more manageable problems.

189. fastai/fastcore ⭐ 996
Python supercharged for the fastai library
🔗 [fastcore.fast.ai](http://fastcore.fast.ai)

190. lastmile-ai/aiconfig ⭐ 976
AIConfig saves prompts, models and model parameters as source control friendly configs. This allows you to iterate on prompts and model parameters separately from your application code.
🔗 [aiconfig.lastmileai.dev](https://aiconfig.lastmileai.dev)

191. barracuda-fsh/pyobd ⭐ 893
An OBD-II compliant car diagnostic tool

192. juanbindez/pytubefix ⭐ 892
Python3 library for downloading YouTube Videos.
🔗 [pytubefix.readthedocs.io](https://pytubefix.readthedocs.io)

193. qdrant/qdrant-client ⭐ 830
Python client for Qdrant vector search engine
🔗 [qdrant.tech](https://qdrant.tech)

194. samuelcolvin/dirty-equals ⭐ 821
Doing dirty (but extremely useful) things with equals.
🔗 [dirty-equals.helpmanual.io](https://dirty-equals.helpmanual.io)

195. modal-labs/modal-examples ⭐ 763
Examples of programs built using Modal
🔗 [modal.com/docs](https://modal.com/docs)

196. open-telemetry/opentelemetry-python-contrib ⭐ 761
OpenTelemetry instrumentation for Python modules
🔗 [opentelemetry.io](https://opentelemetry.io)

197. pypa/build ⭐ 756
A simple, correct Python build frontend
🔗 [build.pypa.io](https://build.pypa.io)

198. gefyrahq/gefyra ⭐ 705
Blazingly-fast 🚀, rock-solid, local application development ➡️ with Kubernetes.
🔗 [gefyra.dev](https://gefyra.dev)

199. instagram/Fixit ⭐ 672
Advanced Python linting framework with auto-fixes and hierarchical configuration that makes it easy to write custom in-repo lint rules.
🔗 [fixit.rtfd.io/en/latest](https://fixit.rtfd.io/en/latest/)

200. argoproj-labs/hera ⭐ 634
Hera makes Python code easy to orchestrate on Argo Workflows through native Python integrations. It lets you construct and submit your Workflows entirely in Python. ⭐️ Remember to star!
🔗 [hera.rtfd.io](https://hera.rtfd.io)

201. fastai/ghapi ⭐ 632
A delightful and complete interface to GitHub's amazing API
🔗 [ghapi.fast.ai](https://ghapi.fast.ai/)

202. google/pyglove ⭐ 630
Manipulating Python Programs

203. platformdirs/platformdirs ⭐ 626
A small Python module for determining appropriate platform-specific dirs, e.g. a "user data dir".
🔗 [platformdirs.readthedocs.io](https://platformdirs.readthedocs.io)

204. methexis-inc/terminal-copilot ⭐ 572
A smart terminal assistant that helps you find the right command.

205. steamship-core/steamship-langchain ⭐ 511
steamship-langchain

206. neuml/annotateai ⭐ 240
Automatically annotates papers using Large Language Models (LLMs)

## Vizualisation

Visualisation tools and libraries. Application frameworks, 2D/3D plotting, dashboards, WebGL.

1. apache/superset ⭐ 63,685
Apache Superset is a Data Visualization and Data Exploration Platform
🔗 [superset.apache.org](https://superset.apache.org/)

2. streamlit/streamlit ⭐ 36,561
Streamlit - A faster way to build and share data apps.
🔗 [streamlit.io](https://streamlit.io)

3. gradio-app/gradio ⭐ 35,054
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
🔗 [www.gradio.app](http://www.gradio.app)

4. plotly/dash ⭐ 21,764
Data Apps & Dashboards for Python. No JavaScript Required.
🔗 [plotly.com/dash](https://plotly.com/dash)

5. matplotlib/matplotlib ⭐ 20,543
matplotlib: plotting with Python
🔗 [matplotlib.org/stable](https://matplotlib.org/stable/)

6. danny-avila/LibreChat ⭐ 20,277
LibreChat is a free, open source AI chat platform. This Web UI offers vast customization, supporting numerous AI providers, services, and integrations.
🔗 [librechat.ai](https://librechat.ai/)

7. bokeh/bokeh ⭐ 19,509
Interactive Data Visualization in the browser, from Python
🔗 [bokeh.org](https://bokeh.org)

8. plotly/plotly.py ⭐ 16,533
The interactive graphing library for Python ✨ This project now includes Plotly Express!
🔗 [plotly.com/python](https://plotly.com/python/)

9. mwaskom/seaborn ⭐ 12,727
Statistical data visualization in Python
🔗 [seaborn.pydata.org](https://seaborn.pydata.org)

10. visgl/deck.gl ⭐ 12,381
WebGL2 powered visualization framework
🔗 [deck.gl](https://deck.gl)

11. marceloprates/prettymaps ⭐ 11,408
A small set of Python functions to draw pretty maps from OpenStreetMap data. Based on osmnx, matplotlib and shapely libraries.

12. altair-viz/altair ⭐ 9,498
Declarative visualization library for Python
🔗 [altair-viz.github.io](https://altair-viz.github.io/)

13. nvidia/TensorRT-LLM ⭐ 9,090
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT...
🔗 [nvidia.github.io/tensorrt-llm](https://nvidia.github.io/TensorRT-LLM)

14. lux-org/lux ⭐ 5,237
Automatically visualize your pandas dataframe via a single print! 📊 💡

15. renpy/renpy ⭐ 5,160
The Ren'Py Visual Novel Engine
🔗 [www.renpy.org](http://www.renpy.org/)

16. holoviz/panel ⭐ 4,929
Panel: The powerful data exploration & web app framework for Python
🔗 [panel.holoviz.org](https://panel.holoviz.org)

17. man-group/dtale ⭐ 4,813
Visualizer for pandas data structures
🔗 [alphatechadmin.pythonanywhere.com](http://alphatechadmin.pythonanywhere.com)

18. has2k1/plotnine ⭐ 4,099
A Grammar of Graphics for Python
🔗 [plotnine.org](https://plotnine.org)

19. residentmario/missingno ⭐ 3,997
missingno provides a small toolset of flexible and easy-to-use missing data visualizations and utilities that allows you to get a quick visual summary of the completeness (or lack thereof) of your dataset.

20. pyqtgraph/pyqtgraph ⭐ 3,942
Fast data visualization and GUI tools for scientific / engineering applications
🔗 [www.pyqtgraph.org](https://www.pyqtgraph.org)

21. vispy/vispy ⭐ 3,347
Main repository for Vispy
🔗 [vispy.org](http://vispy.org)

22. ml-tooling/opyrator ⭐ 3,116
🪄 Turns your machine learning code into microservices with web API, interactive GUI, and more.
🔗 [opyrator-playground.mltooling.org](https://opyrator-playground.mltooling.org)

23. netflix/flamescope ⭐ 3,034
FlameScope is a visualization tool for exploring different time ranges as Flame Graphs.

24. pyvista/pyvista ⭐ 2,826
3D plotting and mesh analysis through a streamlined interface for the Visualization Toolkit (VTK)
🔗 [docs.pyvista.org](https://docs.pyvista.org)

25. facebookresearch/hiplot ⭐ 2,775
HiPlot makes understanding high dimensional data easy
🔗 [facebookresearch.github.io/hiplot](https://facebookresearch.github.io/hiplot/)

26. mckinsey/vizro ⭐ 2,754
Vizro is a low-code toolkit for building high-quality data visualization apps.
🔗 [vizro.readthedocs.io/en/stable](https://vizro.readthedocs.io/en/stable/)

27. holoviz/holoviews ⭐ 2,733
With Holoviews, your data visualizes itself.
🔗 [holoviews.org](https://holoviews.org)

28. kozea/pygal ⭐ 2,677
pygal is a dynamic SVG charting library written in python.
🔗 [www.pygal.org](https://www.pygal.org)

29. napari/napari ⭐ 2,250
A fast, interactive, multi-dimensional image viewer for Python. It's designed for browsing, annotating, and analyzing large multi-dimensional images.
🔗 [napari.org](https://napari.org)

30. marcomusy/vedo ⭐ 2,068
A python module for scientific analysis of 3D data based on VTK and Numpy
🔗 [vedo.embl.es](https://vedo.embl.es)

31. datapane/datapane ⭐ 1,386
Build and share data reports in 100% Python
🔗 [datapane.com](https://datapane.com)

32. facultyai/dash-bootstrap-components ⭐ 1,131
Bootstrap components for Plotly Dash
🔗 [dash-bootstrap-components.opensource.faculty.ai](https://dash-bootstrap-components.opensource.faculty.ai/)

33. nomic-ai/deepscatter ⭐ 1,070
Zoomable, animated scatterplots in the browser that scale to over a billion points

34. holoviz/holoviz ⭐ 851
High-level tools to simplify visualization in Python.
🔗 [holoviz.org](https://holoviz.org/)

35. hazyresearch/meerkat ⭐ 830
Creative interactive views of any dataset.

36. anvaka/word2vec-graph ⭐ 708
Exploring word2vec embeddings as a graph of nearest neighbors
🔗 [anvaka.github.io/pm/#/galaxy/word2vec-wiki?cx=-4651&cy=4492&cz=-1988&lx=-0.0915&ly=-0.9746&lz=-0.2030&lw=0.0237&ml=300&s=1.75&l=1&v=d50_clean_small](https://anvaka.github.io/pm/#/galaxy/word2vec-wiki?cx=-4651&cy=4492&cz=-1988&lx=-0.0915&ly=-0.9746&lz=-0.2030&lw=0.0237&ml=300&s=1.75&l=1&v=d50_clean_small)

## Web

Web-related frameworks and libraries: webapp servers, WSGI, ASGI, asyncio, HTTP, REST, user management.

1. django/django ⭐ 81,741
The Web framework for perfectionists with deadlines.
🔗 [www.djangoproject.com](https://www.djangoproject.com/)

2. tiangolo/fastapi ⭐ 79,280
FastAPI framework, high performance, easy to learn, fast to code, ready for production
🔗 [fastapi.tiangolo.com](https://fastapi.tiangolo.com/)

3. pallets/flask ⭐ 68,505
The Python micro framework for building web applications.
🔗 [flask.palletsprojects.com](https://flask.palletsprojects.com)

4. sherlock-project/sherlock ⭐ 61,588
Hunt down social media accounts by username across social networks
🔗 [sherlockproject.xyz](https://sherlockproject.xyz)

5. psf/requests ⭐ 52,328
A simple, yet elegant, HTTP library.
🔗 [requests.readthedocs.io/en/latest](https://requests.readthedocs.io/en/latest/)

6. tornadoweb/tornado ⭐ 21,803
Tornado is a Python web framework and asynchronous networking library, originally developed at FriendFeed.
🔗 [www.tornadoweb.org](http://www.tornadoweb.org/)

7. reflex-dev/reflex ⭐ 21,289
🕸️ Web apps in pure Python 🐍
🔗 [reflex.dev](https://reflex.dev)

8. wagtail/wagtail ⭐ 18,599
A Django content management system focused on flexibility and user experience
🔗 [wagtail.org](https://wagtail.org)

9. huge-success/sanic ⭐ 18,188
Accelerate your web app development | Build fast. Run fast.
🔗 [sanic.dev](https://sanic.dev)

10. pyscript/pyscript ⭐ 18,144
A framework that allows users to create rich Python applications in the browser using HTML's interface and the power of Pyodide, WASM, and modern web technologies.
🔗 [pyscript.net](https://pyscript.net/)

11. vincigit00/Scrapegraph-ai ⭐ 16,987
ScrapeGraphAI is a web scraping python library that uses LLM and direct graph logic to create scraping pipelines for websites and local documents
🔗 [scrapegraphai.com](https://scrapegraphai.com)

12. aio-libs/aiohttp ⭐ 15,298
Asynchronous HTTP client/server framework for asyncio and Python
🔗 [docs.aiohttp.org](https://docs.aiohttp.org)

13. encode/httpx ⭐ 13,520
A next generation HTTP client for Python. 🦋
🔗 [www.python-httpx.org](https://www.python-httpx.org/)

14. getpelican/pelican ⭐ 12,676
Static site generator that supports Markdown and reST syntax. Powered by Python.
🔗 [getpelican.com](https://getpelican.com)

15. flet-dev/flet ⭐ 12,031
Flet enables developers to easily build realtime web, mobile and desktop apps in Python. No frontend experience required.
🔗 [flet.dev](https://flet.dev)

16. aws/chalice ⭐ 10,723
Python Serverless Microframework for AWS

17. zauberzeug/nicegui ⭐ 10,485
Create web-based user interfaces with Python. The nice way.
🔗 [nicegui.io](https://nicegui.io)

18. encode/starlette ⭐ 10,432
The little ASGI framework that shines. 🌟
🔗 [www.starlette.io](https://www.starlette.io/)

19. benoitc/gunicorn ⭐ 9,927
gunicorn 'Green Unicorn' is a WSGI HTTP Server for UNIX, fast clients and sleepy applications.
🔗 [www.gunicorn.org](http://www.gunicorn.org)

20. falconry/falcon ⭐ 9,563
The no-magic web API and microservices framework for Python developers, with an emphasis on reliability and performance at scale.
🔗 [falcon.readthedocs.io](https://falcon.readthedocs.io)

21. encode/uvicorn ⭐ 8,746
An ASGI web server, for Python. 🦄
🔗 [www.uvicorn.org](https://www.uvicorn.org/)

22. bottlepy/bottle ⭐ 8,505
bottle.py is a fast and simple micro-framework for python web-applications.
🔗 [bottlepy.org](http://bottlepy.org/)

23. graphql-python/graphene ⭐ 8,126
GraphQL framework for Python
🔗 [graphene-python.org](http://graphene-python.org/)

24. reactive-python/reactpy ⭐ 7,913
ReactPy is a library for building user interfaces in Python without Javascript
🔗 [reactpy.dev](https://reactpy.dev)

25. vitalik/django-ninja ⭐ 7,555
💨 Fast, async-ready, OpenAPI, type-hints-based framework for building APIs
🔗 [django-ninja.dev](https://django-ninja.dev)

26. pyeve/eve ⭐ 6,711
REST API framework designed for human beings
🔗 [python-eve.org](https://python-eve.org)

27. pallets/werkzeug ⭐ 6,686
The comprehensive WSGI web application library.
🔗 [werkzeug.palletsprojects.com](https://werkzeug.palletsprojects.com)

28. webpy/webpy ⭐ 5,894
web.py is a web framework for python that is as simple as it is powerful.
🔗 [webpy.org](http://webpy.org)

29. starlite-api/litestar ⭐ 5,784
Production-ready, Light, Flexible and Extensible ASGI API framework | Effortlessly Build Performant APIs
🔗 [litestar.dev](https://litestar.dev/)

30. fastapi-users/fastapi-users ⭐ 4,795
Ready-to-use and customizable users management for FastAPI
🔗 [fastapi-users.github.io/fastapi-users](https://fastapi-users.github.io/fastapi-users/)

31. stephenmcd/mezzanine ⭐ 4,769
CMS framework for Django
🔗 [mezzanine.jupo.org](http://mezzanine.jupo.org)

32. nameko/nameko ⭐ 4,721
A microservices framework for Python that lets service developers concentrate on application logic and encourages testability.
🔗 [www.nameko.io](https://www.nameko.io)

33. pywebio/PyWebIO ⭐ 4,611
Write interactive web apps in a script-like way.
🔗 [pywebio.readthedocs.io](https://pywebio.readthedocs.io)

34. strawberry-graphql/strawberry ⭐ 4,091
A GraphQL library for Python that leverages type annotations 🍓
🔗 [strawberry.rocks](https://strawberry.rocks)

35. h2oai/wave ⭐ 4,026
H2O Wave is a software stack for building beautiful, low-latency, realtime, browser-based applications and dashboards entirely in Python/R without using HTML, Javascript, or CSS.
🔗 [wave.h2o.ai](https://wave.h2o.ai)

36. pylons/pyramid ⭐ 3,999
Pyramid - A Python web framework
🔗 [trypyramid.com](https://trypyramid.com/)

37. websocket-client/websocket-client ⭐ 3,606
WebSocket client for Python
🔗 [github.com/websocket-client/websocket-client](https://github.com/websocket-client/websocket-client)

38. unbit/uwsgi ⭐ 3,473
uWSGI application server container
🔗 [projects.unbit.it/uwsgi](http://projects.unbit.it/uwsgi)

39. pallets/quart ⭐ 3,114
An async Python micro framework for building web applications.
🔗 [quart.palletsprojects.com](https://quart.palletsprojects.com)

40. fastapi-admin/fastapi-admin ⭐ 3,007
A fast admin dashboard based on FastAPI and TortoiseORM with tabler ui, inspired by Django admin
🔗 [fastapi-admin-docs.long2ice.io](https://fastapi-admin-docs.long2ice.io)

41. flipkart-incubator/Astra ⭐ 2,525
Automated Security Testing For REST APIs

42. dot-agent/nextpy ⭐ 2,247
🤖 Self-Modifying Framework from the Future 🔮 World's First AMS
🔗 [dotagent.ai](https://dotagent.ai)

43. masoniteframework/masonite ⭐ 2,225
The Modern And Developer Centric Python Web Framework. Be sure to read the documentation and join the Discord channel for questions: https://discord.gg/TwKeFahmPZ
🔗 [docs.masoniteproject.com](http://docs.masoniteproject.com)

44. python-restx/flask-restx ⭐ 2,177
Fork of Flask-RESTPlus: Fully featured framework for fast, easy and documented API development with Flask
🔗 [flask-restx.readthedocs.io/en/latest](https://flask-restx.readthedocs.io/en/latest/)

45. s3rius/FastAPI-template ⭐ 2,073
Feature rich robust FastAPI template.

46. neoteroi/BlackSheep ⭐ 2,042
Fast ASGI web framework for Python
🔗 [www.neoteroi.dev/blacksheep](https://www.neoteroi.dev/blacksheep/)

47. dmontagu/fastapi-utils ⭐ 1,978
Reusable utilities for FastAPI: a number of utilities to help reduce boilerplate and reuse common functionality across projects
🔗 [fastapiutils.github.io/fastapi-utils](https://fastapiutils.github.io/fastapi-utils/)

48. cherrypy/cherrypy ⭐ 1,864
CherryPy is a pythonic, object-oriented HTTP framework. https://cherrypy.dev
🔗 [docs.cherrypy.dev](https://docs.cherrypy.dev)

49. indico/indico ⭐ 1,802
Indico - A feature-rich event management system, made @ CERN, the place where the Web was born.
🔗 [getindico.io](https://getindico.io)

50. jordaneremieff/mangum ⭐ 1,764
An adapter for running ASGI applications in AWS Lambda to handle Function URL, API Gateway, ALB, and Lambda@Edge events
🔗 [mangum.fastapiexpert.com](http://mangum.fastapiexpert.com/)

51. wtforms/wtforms ⭐ 1,519
A flexible forms validation and rendering library for Python.
🔗 [wtforms.readthedocs.io](https://wtforms.readthedocs.io)

52. awtkns/fastapi-crudrouter ⭐ 1,439
A dynamic FastAPI router that automatically creates CRUD routes for your models
🔗 [fastapi-crudrouter.awtkns.com](https://fastapi-crudrouter.awtkns.com)

53. long2ice/fastapi-cache ⭐ 1,416
fastapi-cache is a tool to cache FastAPI responses and function results, with backend support for Redis and Memcached.
🔗 [github.com/long2ice/fastapi-cache](https://github.com/long2ice/fastapi-cache)

54. rstudio/py-shiny ⭐ 1,347
Shiny for Python
🔗 [shiny.posit.co/py](https://shiny.posit.co/py/)

55. whitphx/stlite ⭐ 1,290
A port of Streamlit to WebAssembly, powered by Pyodide.
🔗 [edit.share.stlite.net](https://edit.share.stlite.net)

56. magicstack/httptools ⭐ 1,217
Fast HTTP parser

57. koxudaxi/fastapi-code-generator ⭐ 1,096
This code generator creates a FastAPI app from an OpenAPI file.

58. aeternalis-ingenium/FastAPI-Backend-Template ⭐ 684
A backend project template with FastAPI, PostgreSQL with asynchronous SQLAlchemy 2.0, Alembic for asynchronous database migration, and Docker.

---

Interactive version: [www.awesomepython.org](https://www.awesomepython.org/), Hugging Face Dataset: [awesome-python](https://huggingface.co/datasets/dylanhogg/awesome-python)

Please raise a new issue to suggest a Python repo that you would like to see added.

1,676 hand-picked awesome Python libraries and frameworks, updated 21 Jan 2025

[![Hits](https://hits.seeyoufarm.com/api/count/incr/badge.svg?url=https%3A%2F%2Fgithub.com%2Fdylanhogg%2Fawesome-python&count_bg=%2379C83D&title_bg=%23555555&icon=&icon_color=%23E7E7E7&title=hits&edge_flat=false)](https://hits.seeyoufarm.com)