An open API service indexing awesome lists of open source software.

awesome-opensource-ai

Curated list of the best truly open-source AI projects, models, tools, and infrastructure.
https://github.com/alvinreal/awesome-opensource-ai

Last synced: 15 days ago
JSON representation

  • ๐Ÿ“‹ Contents

    • ๐Ÿ›ก๏ธ 10. AI Safety, Alignment & Interpretability

      • Bloom - research/bloom?style=social) - Open-source agentic framework for automated behavioral evaluations of frontier AI models. Generates targeted evaluation suites to probe LLMs for specific behaviors (sycophancy, self-preservation, political bias, etc.) with quantitative elicitation rates. From Anthropic's safety research team. MIT licensed.
      • Safe-RLHF - Alignment/safe-rlhf?style=social) - Safe reinforcement learning from human feedback.
      • Alignment Handbook - handbook?style=social) - Complete recipes for full-stack alignment.
      • What-If Tool (Google PAIR) - code/what-if-tool?style=social) - Visual tool for investigating model performance and fairness. Interactive probing of ML models with no code required. Integrates with TensorBoard, Jupyter, and Colab. Apache 2.0 licensed.
      • interpret (Microsoft) - Fit interpretable models and explain blackbox machine learning with state-of-the-art explainability techniques including Explainable Boosting Machines and SHAP-based explanations.
      • TransformerLens - Gold-standard for mechanistic interpretability.
      • SAELens - Sparse autoencoders for interpretable features.
      • Captum - PyTorch's official interpretability library.
      • XAI - eXplainability toolbox for machine learning with bias evaluation and production monitoring tools.
      • EasyEdit - Easy-to-use knowledge editing framework for LLMs. Enables precise modification of model knowledge and behavior to correct hallucinations or outdated information. ACL 2024. MIT licensed.
      • AIX360 - AI/AIX360?style=social) - Comprehensive AI explainability toolkit with interpretability algorithms for data and machine learning models. Includes TED, BRCG, and ProtoNN methods for diverse explanation needs. Apache 2.0 licensed.
      • ELI5 - Memex/eli5?style=social) - Library for debugging/inspecting machine learning classifiers and explaining their predictions. Supports scikit-learn, XGBoost, LightGBM, and more with feature importance and explanation visualizations. MIT licensed.
      • Shapash - User-friendly explainability library for transparent ML models. Beautiful visualizations with explicit labels that everyone can understand. Generates web reports and integrates with SHAP/LIME. Apache 2.0 licensed.
      • AI Fairness 360 - AI/AIF360?style=social) - Comprehensive toolkit for detecting, understanding, and mitigating unwanted algorithmic bias in datasets and ML models.
      • Fairlearn - Python package to assess and improve fairness of machine learning models. Provides metrics for disparity assessment and algorithms for unfairness mitigation with scikit-learn integration. MIT licensed.
      • PyRIT (Microsoft) - Python Risk Identification Tool for generative AI. Open-source framework for security professionals to proactively identify risks in generative AI systems through automated red teaming.
      • Garak - Automated LLM vulnerability scanner.
      • Adversarial Robustness Toolbox - AI/adversarial-robustness-toolbox?style=social) - Python library for machine learning security (evasion, poisoning, extraction, inference attacks).
      • DeepTeam - ai/deepteam?style=social) - Framework to red team LLMs and LLM systems.
      • Agentic Security - Agentic LLM vulnerability scanner and AI red teaming kit with multi-step attack simulation and automated security probing. Apache 2.0 licensed.
      • Responsible AI Toolbox - ai-toolbox?style=social) - Suite of tools providing model and data exploration, assessment interfaces and libraries for understanding AI systems. Enables developers to develop and monitor AI more responsibly with better data-driven actions. MIT licensed.
      • AgentOps - AI/agentops?style=social) - Python SDK for AI agent monitoring, LLM cost tracking, benchmarking, and evaluation. Integrates with CrewAI, Agno, OpenAI Agents SDK, LangChain, Autogen, AG2, and CamelAI. MIT licensed.
      • Detoxify - Trained models and code to predict toxic comments on all 3 Jigsaw Toxic Comment Challenges. Built using PyTorch Lightning and Transformers for toxicity, severe toxicity, obscene, threat, insult, identity attack, and sexual explicit content detection. Apache 2.0 licensed.
    • ๐Ÿงฉ 11. Specialized Domains

      • Boltz - Open-source biomolecular interaction prediction models. Boltz-1 was the first fully open source model to approach AlphaFold3 accuracy; Boltz-2 adds binding affinity prediction for drug discovery. MIT licensed.
      • Protenix - High-accuracy open-source biomolecular structure prediction model from ByteDance. First fully open-source model to outperform AlphaFold3 across diverse benchmarks with Apache 2.0 licensing for both academic and commercial use.
      • OpenFold - Trainable PyTorch reproduction of AlphaFold2. Complete open-source pipeline for protein structure prediction with competitive accuracy to the original. Apache 2.0 licensed.
      • DeepChem - Democratizing deep learning for drug discovery, quantum chemistry, materials science, and biology. High-quality open-source toolchain with 50+ models and extensive tutorials. MIT licensed.
      • MONAI - MONAI/MONAI?style=social) - Medical Open Network for AI. End-to-end framework for healthcare imaging with state-of-the-art, production-ready training workflows. Apache 2.0 licensed.
      • nnU-Net - DKFZ/nnUNet?style=social) - Self-configuring deep learning method for medical image segmentation. Automatically adapts to any dataset without manual parameter tuning. Widely adopted as the standard baseline for biomedical segmentation challenges. Apache 2.0 licensed.
      • Unity ML-Agents - Technologies/ml-agents?style=social) - Toolkit for training intelligent agents in games and simulations using deep reinforcement learning. Enables NPC behavior control, automated testing, and game design evaluation. Apache 2.0 licensed.
      • OpenSpiel - deepmind/open_spiel?style=social) - Collection of environments and algorithms for research in general reinforcement learning and search/planning in games from Google DeepMind. Apache 2.0 licensed.
      • OpenBB - finance/OpenBB?style=social) - Financial data platform for analysts, quants and AI agents. Open-source investment research infrastructure with extensive data integrations. AGPL-3.0 licensed.
      • FinGPT - Foundation/FinGPT?style=social) - Open-source financial large language models. Democratizing financial AI with data-centric training pipeline and multiple model releases for trading, analysis, and robo-advising. MIT licensed.
      • FinRL - Foundation/FinRL?style=social) - Financial reinforcement learning framework for quantitative trading. Deep RL library for stock trading, portfolio allocation, and market execution with pre-built environments and benchmarks. MIT licensed.
      • Qlib - AI-oriented quantitative investment platform from Microsoft. Supports diverse ML modeling paradigms including supervised learning, market dynamics modeling, and RL. Now equipped with RD-Agent for automated R&D process. MIT licensed.
      • FinRobot - Foundation/FinRobot?style=social) - Open-source AI agent platform for financial analysis using LLMs. Multi-agent system with specialized agents for trading, analysis, and research. Apache 2.0 licensed.
      • OpenCV - World's most widely used computer vision library.
      • Ultralytics YOLO - State-of-the-art real-time object detection.
      • Detectron2 - High-performance object detection library.
      • CVAT - ai/cvat?style=social) - Industry-leading data annotation platform for computer vision. Interactive video and image annotation tool used by tens of thousands of teams for machine learning at any scale.
      • SAM 2 - Promptable image and video segmentation model with released checkpoints and training code.
      • Kornia - Differentiable computer vision library.
      • MediaPipe - ai-edge/mediapipe?style=social) - Cross-platform multimodal pipelines.
      • LeRobot (Hugging Face) - State-of-the-art machine learning framework for real-world robotics. End-to-end learning with models, datasets, and training tools for robotic manipulation tasks.
      • Stable-Baselines3 - RM/stable-baselines3?style=social) - Production-ready RL algorithms.
      • Isaac Lab - sim/IsaacLab?style=social) - GPU-accelerated robot learning framework.
      • MuJoCo - deepmind/mujoco?style=social) - General-purpose physics simulator for robotics, biomechanics, and ML research. High-fidelity contact dynamics with native Python and C++ bindings. Apache 2.0 licensed.
      • Gymnasium (ex-OpenAI Gym) - Foundation/Gymnasium?style=social) - Standard RL environment API.
      • Time Series Library (TSLib) - Series-Library?style=social) - Comprehensive benchmark for time-series models.
      • Chronos (Amazon) - science/chronos-forecasting?style=social) - Pretrained foundation models for time-series forecasting.
      • GluonTS (AWS Labs) - Probabilistic time series modeling with deep learning. Powers Amazon SageMaker forecasting with PyTorch and MXNet backends. Apache 2.0 licensed.
      • AutoTS - Automated time series forecasting with broad model selection, ensembling, anomaly detection, and holiday effects. Designed for production deployment with minimal setup.
      • ExecuTorch - PyTorch runtime and toolchain for deploying AI models on mobile, embedded, and edge devices.
      • OpenVINO - Intel's toolkit for edge deployment.
      • Apache TVM - Open Machine Learning Compiler Framework. Universal deployment to bring models into minimum deployable modules that can be embedded and run everywhere from datacenter to edge devices. Apache 2.0 licensed.
      • OpenContracts - Source-Legal/OpenContracts?style=social) - Self-hosted document annotation platform for legal AI. Semantic search, contract analysis, version control, and MCP integration for building legal knowledge bases. AGPL-3.0 licensed.
      • CARLA - simulator/carla?style=social) - Open-source simulator for autonomous driving research. High-fidelity simulation of urban environments with realistic physics, sensors, and traffic scenarios. Widely used for training and validating self-driving algorithms. MIT licensed.
      • Webots - Open-source multi-platform robot simulator providing a complete development environment for modeling, programming, and simulating robots, vehicles, and mechanical systems. Used in education, research, and industry. Apache 2.0 licensed.
      • Habitat-Sim - sim?style=social) - High-performance physics-enabled 3D simulator for embodied AI research. Supports 3D scans of indoor/outdoor spaces, CAD models, and configurable sensors. Powers Meta's embodied AI research. MIT licensed.
      • NASA Astrobee - NASA's free-flying robot software for the International Space Station. Flight software for vision-based localization, autonomous navigation, docking, and human-robot interaction. NASA Software of the Year Award Runner-Up 2020. Apache 2.0 licensed.
      • OpenPilot - Operating system for robotics. Currently upgrades driver assistance systems on 300+ supported cars. End-to-end autonomous driving stack with open-source hardware and software. MIT licensed.
      • Autoware - World's leading open-source software project for autonomous driving. Complete stack from localization and object detection to route planning and control. Used by 50+ companies globally. Apache 2.0 licensed.
      • GraphCast - deepmind/graphcast?style=social) - Deep learning weather forecasting model from Google DeepMind. State-of-the-art AI weather prediction with 10-day global forecasts matching or exceeding traditional numerical methods. Apache 2.0 licensed.
      • NVIDIA Modulus - Open-source deep learning framework for physics-informed machine learning (Physics-ML). Build, train, and fine-tune models for AI4science and engineering applications using state-of-the-art SciML methods. Apache 2.0 licensed.
      • TorchGeo - PyTorch domain library for geospatial data. Datasets, samplers, transforms, and pre-trained models for multispectral satellite imagery and remote sensing. First library with pre-trained models for Sentinel-2 bands. MIT licensed.
      • Astropy - Core library for astronomy and astrophysics in Python. Comprehensive tools for celestial coordinates, FITS I/O, cosmological calculations, and data analysis for professional astronomy. BSD-3-Clause licensed.
      • PyMC - devs/pymc?style=social) - Modern, comprehensive probabilistic programming framework in Python. Bayesian modeling with advanced MCMC sampling, variational inference, and seamless integration with ArviZ for visualization. Apache 2.0 licensed.
      • ArviZ - devs/arviz?style=social) - Exploratory analysis of Bayesian models with Python. Comprehensive visualization and diagnostics for probabilistic models, supporting PyMC, Pyro, Stan, and other PPLs. Apache 2.0 licensed.
      • Stanza - Stanford NLP Python library for 100+ human languages. State-of-the-art neural pipelines for tokenization, NER, parsing, and sentiment analysis with pre-trained models. Apache 2.0 licensed.
      • Tianshou - ml/tianshou?style=social) - An elegant PyTorch deep reinforcement learning library with clean API design and comprehensive algorithm implementations. Supports both single-agent and multi-agent RL with GPU acceleration. MIT licensed.
      • RL Baselines3 Zoo - RM/rl-baselines3-zoo?style=social) - A training framework for Stable Baselines3 reinforcement learning agents with hyperparameter optimization, pre-trained agents, and extensive benchmark environments. MIT licensed.
      • skrl - SM/skrl?style=social) - Modular reinforcement learning library implemented in PyTorch, JAX, and NVIDIA Warp with support for Gymnasium, NVIDIA Isaac Lab, MuJoCo Playground, and other environments. MIT licensed.
      • Finetrainers - Scalable and memory-optimized training of diffusion models from Hugging Face. Supports LoRA and full fine-tuning for video and image generation models. Apache 2.0 licensed.
      • torchaudio - PyTorch audio processing library. Comprehensive toolkit for audio I/O, transformations, and deep learning with support for speech recognition, TTS, and audio classification. BSD-2-Clause licensed.
      • Open3D - org/Open3D?style=social) - Modern library for 3D data processing with Python and C++ APIs. Core features include 3D data structures, processing algorithms, scene reconstruction, surface alignment, 3D visualization, and GPU acceleration. MIT licensed.
      • Point Cloud Library (PCL) - Standalone, large-scale open project for 2D/3D image and point cloud processing. Comprehensive algorithms for filtering, feature estimation, surface reconstruction, registration, model fitting, and segmentation. BSD licensed.
      • PyTorch3D - FAIR's library of reusable components for deep learning with 3D data. Provides efficient 3D operators, differentiable rendering, and mesh processing tools integrated with PyTorch. BSD licensed.
      • RTAB-Map - Real-Time Appearance-Based Mapping library for RGB-D, Stereo and LiDAR SLAM. Graph-based SLAM approach with incremental appearance-based loop closure detection for large-scale and long-term operation. BSD licensed.
      • NCNN - High-performance neural network inference framework optimized for mobile platforms. No third-party dependencies, cross-platform, and runs faster than all known open-source frameworks on mobile CPU. Powers Tencent apps including QQ, WeChat, and Pitu. BSD-3-Clause licensed.
      • MNN - Blazing-fast, lightweight inference engine battle-tested by Alibaba. Supports inference and training with industry-leading on-device performance. Powers high-performance LLMs and Edge AI with MNN-LLM runtime. Apache 2.0 licensed.
    • ๐Ÿ–ฅ๏ธ 12. User Interfaces & Self-hosted Platforms

      • OpenClaw - Local-first personal AI assistant with multi-channel integrations and full agentic task execution.
      • Open WebUI - webui/open-webui?style=social) - Most popular self-hosted ChatGPT-style interface.
      • text-generation-webui - generation-webui?style=social) - Web UI for running local LLMs with multiple backends, extensions, and model formats.
      • LobeChat - chat?style=social) - Sleek modern chat UI.
      • LibreChat - avila/LibreChat?style=social) - Feature-packed multi-LLM interface.
      • HuggingChat (self-hosted) - ui?style=social) - Official open-source codebase for HuggingChat.
      • Khoj - ai/khoj?style=social) - Self-hostable personal AI assistant for search, chat, automation, and workflows over local and web data.
      • Newelle - GNOME/Linux desktop virtual assistant with integrated file editor, global hotkeys, and profile manager.
      • NextChat - Light and fast AI assistant supporting Web, iOS, macOS, Android, Linux, and Windows. One-click deploy with multi-model support. MIT licensed.
      • big-AGI - AGI?style=social) - AI suite for power users with multi-model "Beam" chats, AI personas, voice, text-to-image, code execution, and PDF import. MIT licensed.
      • Morphic - AI-powered search engine with a generative UI. Supports multiple AI providers (OpenAI, Anthropic, Google, Ollama) and search providers (Tavily, SearXNG, Brave). Features smart search modes, widgets, and image/video search. Apache 2.0 licensed.
      • Leon - ai/leon?style=social) - Your open-source personal assistant. Built around tools, context, memory, and agentic execution. Self-hosted, privacy-focused, and extensible. MIT licensed.
      • Willow - Open source, local, and self-hosted Amazon Echo/Google Home competitive voice assistant alternative with hardware support. Apache-2.0 licensed.
      • CoPaw - ai/CoPaw?style=social) - Your Personal AI Assistant; easy to install, deploy on your own machine or on the cloud; supports multiple chat apps with easily extensible capabilities. Apache-2.0 licensed.
      • Smart2Brain - papa/obsidian-Smart2Brain?style=social) - Privacy-focused Obsidian plugin for AI-powered second brain functionality. Chat with your notes using local or remote LLMs including Ollama and OpenAI. MIT licensed.
      • Casibase - Open-source enterprise-level AI knowledge base and agent management platform. Supports multiple LLM providers, RAG, and team collaboration. Apache-2.0 licensed.
      • AnythingLLM - Labs/anything-llm?style=social) - All-in-one RAG + agents platform.
      • Flowise - Drag-and-drop LLM app builder.
      • LocalAI - Open-source AI engine running LLMs, vision, voice, image, and video models on any hardware. Self-hosted OpenAI-compatible API. MIT licensed.
      • Onyx - dot-app/onyx?style=social) - Full-featured AI platform with Chat, RAG, Agents, and Actions. 40+ document connectors and every LLM support. MIT licensed (Community Edition).
      • biniou - Self-hosted webUI for 30+ generative AI models. Generate multimedia content with AI on your own computer, even without dedicated GPU (8GB RAM minimum). Works offline once deployed. GPL-3.0 licensed.
      • Self-hosted AI Starter Kit (n8n) - io/self-hosted-ai-starter-kit?style=social) - Open-source Docker Compose template to quickly set up a local AI environment. Curated by n8n, combines self-hosted n8n with Ollama, Qdrant, and PostgreSQL for secure, self-hosted AI workflows. Apache 2.0 licensed.
      • Jan - Local-first AI app framework.
      • Cherry Studio - studio?style=social) - AI productivity studio with smart chat, autonomous agents, and 300+ assistants. Unified access to frontier LLMs. AGPL-3.0 licensed.
      • DeepChat - A smart assistant that connects powerful AI to your personal world. Built-in MCP and ACP support, multiple search engines, privacy-focused with local data storage. Apache-2.0 licensed.
      • SillyTavern - Highly customizable role-playing frontend.
      • ChatALL - shifu/ChatALL?style=social) - Concurrently chat with multiple AI bots to discover the best answers. Desktop app for comparing ChatGPT, Claude, Gemini, and 20+ LLMs side-by-side. Apache 2.0 licensed.
      • Chatbox - Powerful desktop AI client for ChatGPT, Claude, and other LLMs. Cross-platform with modern UI. GPLv3 licensed (Community Edition).
      • Maid - Artificial-Intelligence/maid?style=social) - Free and open-source Android app for interfacing with llama.cpp models locally and remote APIs (Anthropic, DeepSeek, Mistral, Ollama, OpenAI). MIT licensed.
      • Dive - Open-source MCP Host Desktop Application with dual Tauri/Electron architecture. Seamlessly integrates with any LLMs supporting function calling. MIT licensed.
      • LiveKit Agents - Framework for building realtime voice AI agents with WebRTC transport, STT-LLM-TTS pipelines, and production-grade orchestration. Used by Salesforce Agentforce and Tesla. Apache-2.0 licensed.
      • Pipecat - ai/pipecat?style=social) - Open-source framework for voice and multimodal conversational AI. Build real-time voice agents with support for speech-to-text, LLMs, text-to-speech, and live video. BSD-2-Clause licensed.
      • Agent Chat UI - ai/agent-chat-ui?style=social) - Web app for interacting with any LangGraph agent (Python & TypeScript) via a chat interface. Stream messages, handle interruptions, and view agent state. MIT licensed.
      • BionicGPT - gpt/bionic-gpt?style=social) - On-prem ChatGPT replacement for teams with assistants, RAG, access controls, auditing, and enterprise deployment features.
      • CoAI - Next-generation multi-tenant AI one-stop solution with built-in admin and billing system. Enterprise-grade unified LLM gateway supporting 200+ models and 35+ providers. Apache-2.0 licensed.
      • Plane - Open-source Jira, Linear, Monday, and ClickUp alternative. AI-powered project management platform with intelligent task triage, sprint planning, and automated workflows. AGPL-3.0 licensed.
      • RAG Web UI - web-ui/rag-web-ui?style=social) - Intelligent dialogue system based on RAG technology. Build intelligent Q&A systems on your own knowledge base with modern web interface. Apache-2.0 licensed.
      • PocketPal AI - ghorbani/pocketpal-ai?style=social) - Open-source app that brings small language models directly to your phone. Run AI 100% privately on iOS and Android with no cloud required. MIT licensed.
    • ๐Ÿงช 13. Developer Tools & Integrations

      • Continue - Open-source AI coding autopilot for VS Code & JetBrains.
      • Tabby - Self-hosted AI coding assistant.
      • Cline - Open-source IDE coding agent that can edit files, run commands, and use tools with user approval.
      • Open Interpreter - interpreter?style=social) - Lets LLMs run code locally.
      • Roo Code - Code?style=social) - Open-source editor-based coding agent with multiple modes and tool integrations.
      • llama.vim - org/llama.vim?style=social) - Local LLM-powered code completion plugin for Vim/Neovim using llama.cpp. Fast, privacy-first, no API key needed.
      • CodeCompanion.nvim - AI-powered coding assistant for Neovim. Inline code generation, chat, actions, and tool use with support for multiple LLM providers.
      • ProxyAI - Leading open-source AI copilot for JetBrains IDEs. Connect to any model in any environment with auto-apply, image chat, file references, web search, and customizable personas. Apache 2.0 licensed.
      • avante.nvim - Neovim plugin that brings Cursor-like AI IDE features to Vim. Edit code with natural language, generate code from context, and chat with AI about your codebase. Apache 2.0 licensed.
      • vim-ai - ai?style=social) - AI-powered code assistant for Vim and Neovim. Generate code, edit text, and have interactive conversations with GPT models. Supports custom roles, vision capabilities, and any OpenAI-compatible API. MIT licensed.
      • windsurf.vim - Free, ultrafast Copilot alternative for Vim and Neovim. AI-powered code completion with low latency and large context window. MIT licensed.
      • Jupyter AI - ai?style=social) - Chat and code generation inside notebooks.
      • Assistant UI - ui/assistant-ui?style=social) - React/TypeScript library for building production-grade AI chat interfaces. Drop-in components for streaming messages, tool calls, and multi-modal inputs.
      • Deep Chat - chat?style=social) - Fully customizable AI chatbot component for your website. Supports OpenAI, direct API services, and custom endpoints. MIT licensed.
      • CopilotKit - Best-in-class SDK for building full-stack agentic applications, Generative UI, and chat applications. Creators of the AG-UI Protocol adopted by Google, LangChain, AWS, and Microsoft. MIT licensed.
      • PR-Agent (Qodo) - ai/pr-agent?style=social) - AI-powered code review agent for GitHub, GitLab, BitBucket, and Azure DevOps. Automated PR analysis, improvement suggestions, and multi-platform deployment via CLI, GitHub Actions, or webhooks. AGPL-3.0 licensed.
      • LLM (Simon Willison) - CLI tool and Python library for interacting with dozens of LLMs via remote APIs or locally. Extensible plugin ecosystem, SQLite logging. Apache 2.0 licensed.
      • AIChat - All-in-one LLM CLI in Rust featuring Shell Assistant, Chat-REPL, RAG, AI Tools & Agents. Supports 20+ providers. MIT/Apache 2.0 licensed.
      • aicommits - CLI that writes your git commit messages for you with AI. Never write a commit message again. Supports multiple providers including OpenAI, Groq, xAI, Ollama, and LM Studio. MIT licensed.
      • Codex CLI - OpenAI's lightweight coding agent that runs in your terminal. Code generation, file editing, and command execution with approval. Apache 2.0 licensed.
      • Repomix - Powerful tool that packs your entire repository into a single AI-friendly file. Perfect for feeding codebases to LLMs with smart filtering and token counting. MIT licensed.
      • Instructor - Python library for extracting structured, validated data from LLMs using Pydantic models. Handles validation, retries, and error handling with 15+ provider support. MIT licensed.
      • Mirascope - Python toolkit for building LLM applications with automatic versioning, tracing, and cost tracking. The "LLM Anti-Framework" for developers who want control. MIT licensed.
      • Zed - industries/zed?style=social) - High-performance, multiplayer code editor with built-in AI features. From the creators of Atom and Tree-sitter. Native AI agentic editing with support for any LLM provider. GPL licensed.
      • Void Editor - Open-source AI-native code editor forked from VS Code. Features agentic AI editing, inline code generation, and chat interface. Designed as a Cursor alternative with full control over your data. Apache 2.0 licensed.
      • Code Server - server?style=social) - Run VS Code on any machine anywhere and access it in the browser. Self-hosted cloud IDE with full extension support. MIT licensed.
      • Gitpod - io/gitpod?style=social) - Cloud development environment platform with automated prebuilds, ephemeral workspaces, and support for any IDE. Self-hostable with open-source core. AGPL-3.0 licensed.
      • Onlook - dev/onlook?style=social) - Open-source AI-first design and React editing environment for visually building and modifying frontend applications.
      • Daytona - Secure elastic infrastructure for running AI-generated code. Self-hosted alternative to GitHub Codespaces with support for multiple IDEs, prebuilds, and any cloud provider. AGPL-3.0 licensed.
      • Aider - AI/aider?style=social) - Terminal-based AI pair programmer. Edit code in your local editor and aider implements the changes. Supports multiple LLMs, voice coding, and automatic git commits. Top scores on SWE Bench. Apache 2.0 licensed.
      • Kimi CLI - cli?style=social) - Kimi Code CLI agent from Moonshot AI. Terminal-based coding assistant with advanced context understanding and multi-file editing capabilities. Apache 2.0 licensed.
      • Refact - Open-source AI code assistant with autocomplete, chat, and refactoring. Self-hostable with support for multiple LLM providers. BSD-3-Clause licensed.
      • Qwen Code - code?style=social) - Open-source AI agent for the terminal, optimized for Qwen series models. Multi-protocol provider support including OpenAI, Anthropic, Gemini, Alibaba Cloud, OpenRouter. Features agentic workflow with Skills and SubAgents. Apache 2.0 licensed.
      • DeepCode - Transforms research papers and natural language into production-ready code. AI-powered research-to-code automation tool. MIT licensed.
      • Open Notebook - notebook?style=social) - Open-source implementation of Notebook LM with multi-modal content support (PDFs, videos, audio, web pages). Features multi-speaker podcast generation, 18+ AI provider integrations, and full-text + vector search. Self-hosted with complete data sovereignty. MIT licensed.
      • Deta Surf - Personal AI notebook for organizing files and webpages with AI-generated notes. Local-first data storage, open data formats, and open model choice including local models. Cross-platform desktop app for research and thinking workflows. Apache 2.0 licensed.
      • Quarto - dev/quarto-cli?style=social) - Open-source scientific and technical publishing system built on Pandoc. Create dynamic content with Python, R, Julia, and Observable. MIT licensed.
      • Serena - Powerful MCP toolkit for coding agents providing semantic retrieval and editing capabilities. Integrates language servers for IDE-level code understanding. MIT licensed.
      • Minuet AI - ai.nvim?style=social) - Neovim plugin offering code completion as-you-type from popular LLMs including OpenAI, Gemini, Claude, Ollama, Llama.cpp, Codestral, and more. GPL-3.0 licensed.
      • Peekaboo - macOS CLI & MCP server enabling AI agents to capture screenshots and automate UI interactions. Visual question answering through local or remote AI models. MIT licensed.
      • Ruler - Central AI agent rule registry. Manages and distributes rules for AI coding agents across projects. MIT licensed.
      • GitIngest - Replace 'hub' with 'ingest' in any GitHub URL to get a prompt-friendly extract of a codebase. Optimized for Python ecosystem and data science workflows. MIT licensed.
      • Context7 - Up-to-date code documentation for LLMs and AI code editors. Fetches latest docs and code examples directly into LLM context via MCP. Eliminates hallucinated APIs. MIT licensed.
      • Vercel AI SDK - Provider-agnostic TypeScript toolkit for building AI-powered applications and agents. Unified API for OpenAI, Anthropic, Google, and 20+ providers with first-class streaming, tool-calling, and structured output support. Apache 2.0 licensed.
      • GitHub Copilot SDK - sdk?style=social) - Multi-platform SDK for integrating GitHub Copilot Agent into apps and services. Production-tested agent runtime with planning, tool invocation, and context management. Build Copilot-style agents without writing your own orchestration. MIT licensed.
      • Fern - api/fern?style=social) - Open-source SDK generator for REST APIs. Generate type-safe API clients in TypeScript, Python, Go, Java, and more from OpenAPI specs. Powers SDKs for companies like OpenAI, Anthropic, and Cloudflare. Apache 2.0 licensed.
      • Helicone - Open-source LLM observability platform with prompt management, versioning, and experimentation. One-line integration, YC W23 company. Apache 2.0 licensed.
      • OpenPrompt - Unified framework for prompt-learning with pre-trained language models. Supports templating, verbalizing, and optimization strategies. Winner of ACL 2022 Best Demo Paper Award. Apache 2.0 licensed.
      • TextGrad - group/textgrad?style=social) - Automatic "differentiation" via text using LLMs to backpropagate textual gradients. PyTorch-like API for optimizing prompts, code, and solutions. Published in Nature. MIT licensed.
      • GEPA - ai/gepa?style=social) - Reflective prompt evolution optimizer using natural language reflection and Pareto frontier learning. Outperforms reinforcement learning for prompt optimization. Integrated with DSPY and MLflow. MIT licensed.
    • ๐Ÿ“š 14. Resources & Learning

      • Papers with Code - Definitive database linking papers to open code and datasets.
      • Hugging Face Papers - Daily-updated feed of the latest arXiv papers with open weights.
      • Open LLM Leaderboard (Hugging Face) - Real-time ranking of open models.
      • Prompt Engineering Guide (DAIR-AI) - ai/Prompt-Engineering-Guide?style=social) - Comprehensive guides, papers, lessons, and notebooks for prompt engineering, context engineering, RAG, and AI Agents. The definitive open-source resource for learning prompt engineering with 3M+ learners. MIT licensed.
      • Homemade Machine Learning (trekhleb) - machine-learning?style=social) - Python examples of popular machine learning algorithms with interactive Jupyter demos and mathematical explanations. Educational resource for understanding ML from scratch with visualizations. MIT licensed.
      • Start Machine Learning (louisfb01) - machine-learning?style=social) - A complete guide to start and improve in machine learning and AI in 2026 without any background. Curated learning path with the latest news, state-of-the-art techniques, and comprehensive resources for beginners. MIT licensed.
      • r/LocalLLaMA - Go-to subreddit for local/open-source LLM topics.
      • Hugging Face Course - Free hands-on courses using only open models.
      • ML For Beginners (Microsoft) - For-Beginners?style=social) - 12-week, 26-lesson, 52-quiz classic machine learning course for beginners. Comprehensive curriculum covering regression, classification, clustering, and NLP with practical projects.
      • LLM Course (Maxime Labonne) - course?style=social) - End-to-end course for getting into Large Language Models with roadmaps and Colab notebooks. Covers pre-training, fine-tuning, RLHF, quantization, and prompt engineering.
      • AI For Beginners (Microsoft) - For-Beginners?style=social) - 12-week, 24-lesson curriculum on Artificial Intelligence. Covers symbolic AI, neural networks, computer vision, NLP, and reinforcement learning with hands-on labs.
      • Generative AI for Beginners (Microsoft) - ai-for-beginners?style=social) - 21 lessons covering generative AI fundamentals, prompt engineering, RAG applications, fine-tuning, and LLM app deployment with practical exercises.
      • Data Science for Beginners (Microsoft) - Science-For-Beginners?style=social) - 10-week, 20-lesson curriculum on data science fundamentals. Covers data preparation, visualization, modeling, and deployment with practical projects.
      • Learn PyTorch for Deep Learning (Zero to Mastery) - deep-learning?style=social) - Comprehensive PyTorch deep learning course with hundreds of exercises and real-world projects.
      • The Incredible PyTorch - incredible-pytorch?style=social) - Curated list of PyTorch tutorials, papers, projects, and communities for deep learning researchers.
      • Deep RL Class (Hugging Face) - rl-class?style=social) - Free deep reinforcement learning course with hands-on exercises and trained agent publishing to the Hugging Face Hub.
      • Practical RL (Yandex Data School) - Comprehensive reinforcement learning course covering RL fundamentals, deep RL, policy gradients, actor-critic methods, and practical applications in the wild. The Unlicense.
      • NLP Course (Yandex Data School) - YSDA course in Natural Language Processing with 2025 materials covering text classification, language models, transformers, and modern NLP techniques. MIT licensed.
      • Large Language Model Notebooks Course - Language-Model-Notebooks-Course?style=social) - Practical hands-on course about Large Language Models and their applications. Covers Chatbots, Code Generation, OpenAI API, Hugging Face, Vector databases, LangChain, Fine Tuning, PEFT, LoRA, QLoRA. MIT licensed.
      • Transformers Tutorials (Niels Rogge) - Tutorials?style=social) - Comprehensive tutorials and demos using the Hugging Face Transformers library for NLP, vision, and multimodal tasks.
      • Made With ML (Goku Mohandas) - With-ML?style=social) - End-to-end course on building production-grade ML systems with MLOps fundamentals, from design to deployment and iteration.
      • TensorFlow Tutorials - Official guides for beginners to advanced users.
      • Hugging Face Transformers Notebooks - Run Transformers, Datasets, and more in Colab.
      • Awesome Machine Learning - machine-learning?style=social) - The definitive curated list of machine learning frameworks, libraries and software organized by language. Covers Python, C++, Java, JavaScript, and more with comprehensive coverage of the ML ecosystem. CC0-1.0 licensed.
      • Andrej Karpathy Skills - karpathy-skills?style=social) - A single CLAUDE.md file to improve Claude Code behavior, derived from Andrej Karpathy's observations on LLM coding pitfalls. Principles: Think Before Coding, Simplicity First, Surgical Changes, Goal-Driven Execution. MIT licensed.
      • AI Engineering Hub - engineering-hub?style=social) - 93+ production-ready projects with in-depth tutorials on LLMs, RAG, and real-world AI agent applications. Comprehensive resources for all skill levels from beginner to advanced. MIT licensed.
      • Complete Agentic AI Engineering Course - donner/agents?style=social) - 6-week comprehensive course on Agentic AI covering autonomous agents, multi-agent systems, and practical agent development. MIT licensed.
    • ๐Ÿงฌ 1. Core Frameworks & Libraries

      • PyTorch - Dynamic computation graphs, Pythonic API, dominant in research and production. The current standard for most frontier AI work.
      • TensorFlow - End-to-end platform with excellent production deployment, TPU support, and large-scale serving tools.
      • JAX - ml/jax?style=social) + **[Flax](https://github.com/google/flax)** ![GitHub stars](https://img.shields.io/github/stars/google/flax?style=social) - High-performance numerical computing with composable transformations (JIT, vmap, grad). Rising favorite for research and scientific ML.
      • dm-haiku - deepmind/dm-haiku?style=social) - JAX-based neural network library from Google DeepMind. Elegant functional API with state management, widely used in DeepMind's research. Apache 2.0 licensed.
      • Equinox - kidger/equinox?style=social) - Elegant easy-to-use neural networks and scientific computing in JAX. Callable PyTrees with filtered transformations, seamless interoperability with the JAX ecosystem. Apache 2.0 licensed.