awesome-genai
A curated collection of resources, tools, frameworks, and information related to Generative AI
https://github.com/onebirdrocks/awesome-genai
-
Learning Resources
-
Paper
- Attention Is All You Need - The 2017 paper that introduced the transformer, the deep neural network architecture that most modern LLMs rely on.
- TTS Papers - Collection of TTS papers. [![Forks](https://img.shields.io/github/forks/coqui-ai/TTS-papers?style=social)](https://github.com/coqui-ai/TTS-papers/network/members) [![Stars](https://img.shields.io/github/stars/coqui-ai/TTS-papers?style=social)](https://github.com/coqui-ai/TTS-papers/stargazers)
- Fun Audio LLM Paper
-
Tutorials
- Let's build GPT: from scratch, in code, spelled out.
- Create a Large Language Model from Scratch with Python – Tutorial
- HuggingFace Audio Deep Learning Course
- Generative AI for Beginners by Microsoft - [Video](https://learn.microsoft.com/en-us/shows/generative-ai-for-beginners/) [![Forks](https://img.shields.io/github/forks/microsoft/generative-ai-for-beginners?style=social)](https://github.com/microsoft/generative-ai-for-beginners/network/members) [![Stars](https://img.shields.io/github/stars/microsoft/generative-ai-for-beginners?style=social)](https://github.com/microsoft/generative-ai-for-beginners/stargazers)
- ML for Beginners
- AI for Beginners
- Mastering GitHub Copilot for AI Paired Programming
- llama3-from-scratch - llama3 implementation one matrix multiplication at a time [![Forks](https://img.shields.io/github/forks/naklecha/llama3-from-scratch?style=social)](https://github.com/naklecha/llama3-from-scratch/network/members) [![Stars](https://img.shields.io/github/stars/naklecha/llama3-from-scratch?style=social)](https://github.com/naklecha/llama3-from-scratch/stargazers)
- Build a Large Language Model from Scratch
- Simple Guide to Local LLM Fine tuning on a Mac with MLX
- Elasticsearch Labs - Notebooks & Example Apps for Search & AI Applications with Elasticsearch. [![Forks](https://img.shields.io/github/forks/elastic/elasticsearch-labs?style=social)](https://github.com/elastic/elasticsearch-labs/network/members) [![Stars](https://img.shields.io/github/stars/elastic/elasticsearch-labs?style=social)](https://github.com/elastic/elasticsearch-labs/stargazers)
-
Blogs
-
-
Models
-
Open Models
- Llama 1-7|13|33|65B
- OPT-1.3|6.7|13|30|66B
- Gemma2-9|27B
- Gemma-2|7B
- RecurrentGemma-2B
- T5
- Phi1-1.3B
- Phi2-2.7B
- Phi3-3.8|7|14B
- OpenELM-1.1|3B
- StableLM-v2-1.6|12B
- DBRX-132B-MoE
- StableCode-3B
- MPT-7B
- Qwen-1.8|7|14|72B
- Qwen1.5-1.8|4|7|14|32|72|110B
- CodeQwen-7B
- Qwen-VL-7B
- Qwen2-0.5|1.5|7|57-MOE|72B
- StableLM-3B
-
-
Benchmark
-
Open Models
- llm-colosseum - Benchmark LLMs by fighting in Street Fighter 3! The new way to evaluate the quality of an LLM. [![Forks](https://img.shields.io/github/forks/OpenGenerativeAI/llm-colosseum?style=social)](https://github.com/OpenGenerativeAI/llm-colosseum/network/members) [![Stars](https://img.shields.io/github/stars/OpenGenerativeAI/llm-colosseum?style=social)](https://github.com/OpenGenerativeAI/llm-colosseum/stargazers)
-
-
Tools & Frameworks
-
ML
- PyTorch - Tensors and dynamic neural networks in Python with strong GPU acceleration. [![Forks](https://img.shields.io/github/forks/pytorch/pytorch?style=social)](https://github.com/pytorch/pytorch/network/members) [![Stars](https://img.shields.io/github/stars/pytorch/pytorch?style=social)](https://github.com/pytorch/pytorch/stargazers)
- TensorFlow - An Open Source Machine Learning Framework for Everyone. [![Forks](https://img.shields.io/github/forks/tensorflow/tensorflow?style=social)](https://github.com/tensorflow/tensorflow/network/members) [![Stars](https://img.shields.io/github/stars/tensorflow/tensorflow?style=social)](https://github.com/tensorflow/tensorflow/stargazers)
- MLX - An array framework for Apple silicon. [![Forks](https://img.shields.io/github/forks/ml-explore/mlx?style=social)](https://github.com/ml-explore/mlx/network/members) [![Stars](https://img.shields.io/github/stars/ml-explore/mlx?style=social)](https://github.com/ml-explore/mlx/stargazers)
-
Development Frameworks
- AutoGen - A programming framework for agentic AI. [![Forks](https://img.shields.io/github/forks/microsoft/autogen?style=social)](https://github.com/microsoft/autogen/network/members) [![Stars](https://img.shields.io/github/stars/microsoft/autogen?style=social)](https://github.com/microsoft/autogen/stargazers)
- Auto-GPT - AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters. [![Forks](https://img.shields.io/github/forks/Significant-Gravitas/AutoGPT?style=social)](https://github.com/Significant-Gravitas/AutoGPT/network/members) [![Stars](https://img.shields.io/github/stars/Significant-Gravitas/AutoGPT?style=social)](https://github.com/Significant-Gravitas/AutoGPT/stargazers)
- AgentGPT - Assemble, configure, and deploy autonomous AI Agents in your browser. [![Forks](https://img.shields.io/github/forks/reworkd/AgentGPT?style=social)](https://github.com/reworkd/AgentGPT/network/members) [![Stars](https://img.shields.io/github/stars/reworkd/AgentGPT?style=social)](https://github.com/reworkd/AgentGPT/stargazers)
- LangChain - Build context-aware reasoning applications. [![Forks](https://img.shields.io/github/forks/langchain-ai/langchain?style=social)](https://github.com/langchain-ai/langchain/network/members) [![Stars](https://img.shields.io/github/stars/langchain-ai/langchain?style=social)](https://github.com/langchain-ai/langchain/stargazers)
- LlamaIndex - LlamaIndex is a data framework for your LLM applications. [![Forks](https://img.shields.io/github/forks/run-llama/llama_index?style=social)](https://github.com/run-llama/llama_index/network/members) [![Stars](https://img.shields.io/github/stars/run-llama/llama_index?style=social)](https://github.com/run-llama/llama_index/stargazers)
- Flowise - Drag & drop UI to build your customized LLM flow. [![Forks](https://img.shields.io/github/forks/FlowiseAI/Flowise?style=social)](https://github.com/FlowiseAI/Flowise/network/members) [![Stars](https://img.shields.io/github/stars/FlowiseAI/Flowise?style=social)](https://github.com/FlowiseAI/Flowise/stargazers)
- dify - Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production. [![Forks](https://img.shields.io/github/forks/langgenius/dify?style=social)](https://github.com/langgenius/dify/network/members) [![Stars](https://img.shields.io/github/stars/langgenius/dify?style=social)](https://github.com/langgenius/dify/stargazers)
- DB-GPT - AI Native Data App Development framework with AWEL (Agentic Workflow Expression Language) and Agents. [![Forks](https://img.shields.io/github/forks/eosphoros-ai/DB-GPT?style=social)](https://github.com/eosphoros-ai/DB-GPT/network/members) [![Stars](https://img.shields.io/github/stars/eosphoros-ai/DB-GPT?style=social)](https://github.com/eosphoros-ai/DB-GPT/stargazers)
- AutoDev - The AI-powered coding wizard with multilingual support 🌐, auto code generation 🏗️, and a helpful bug-slaying assistant! Customizable prompts and a magic Auto Dev/Testing/Document/Agent feature included! [![Forks](https://img.shields.io/github/forks/unit-mesh/auto-dev?style=social)](https://github.com/unit-mesh/auto-dev/network/members) [![Stars](https://img.shields.io/github/stars/unit-mesh/auto-dev?style=social)](https://github.com/unit-mesh/auto-dev/stargazers)
- AgentKit - Starter-kit to build constrained agents with Nextjs, FastAPI and Langchain. [![Forks](https://img.shields.io/github/forks/BCG-X-Official/agentkit?style=social)](https://github.com/BCG-X-Official/agentkit/network/members) [![Stars](https://img.shields.io/github/stars/BCG-X-Official/agentkit?style=social)](https://github.com/BCG-X-Official/agentkit/stargazers)
- GraphRAG - A modular graph-based Retrieval-Augmented Generation (RAG) system. [![Forks](https://img.shields.io/github/forks/microsoft/graphrag?style=social)](https://github.com/microsoft/graphrag/network/members) [![Stars](https://img.shields.io/github/stars/microsoft/graphrag?style=social)](https://github.com/microsoft/graphrag/stargazers)
-
Open-source projects
- Whisper - Robust Speech Recognition via Large-Scale Weak Supervision. [![Forks](https://img.shields.io/github/forks/openai/whisper?style=social)](https://github.com/openai/whisper/network/members) [![Stars](https://img.shields.io/github/stars/openai/whisper?style=social)](https://github.com/openai/whisper/stargazers)
- Whisper Streaming - Whisper realtime streaming for long speech-to-text transcription and translation. [![Forks](https://img.shields.io/github/forks/ufal/whisper_streaming?style=social)](https://github.com/ufal/whisper_streaming/network/members) [![Stars](https://img.shields.io/github/stars/ufal/whisper_streaming?style=social)](https://github.com/ufal/whisper_streaming/stargazers)
- ChatTTS - A generative speech model for daily dialogue. [![Forks](https://img.shields.io/github/forks/2noise/ChatTTS?style=social)](https://github.com/2noise/ChatTTS/network/members) [![Stars](https://img.shields.io/github/stars/2noise/ChatTTS?style=social)](https://github.com/2noise/ChatTTS/stargazers)
- Coqui TTS - A deep learning toolkit for Text-to-Speech, battle-tested in research and production. [![Forks](https://img.shields.io/github/forks/coqui-ai/TTS?style=social)](https://github.com/coqui-ai/TTS/network/members) [![Stars](https://img.shields.io/github/stars/coqui-ai/TTS?style=social)](https://github.com/coqui-ai/TTS/stargazers)
- Faster Whisper - Faster Whisper transcription with CTranslate2. [![Forks](https://img.shields.io/github/forks/SYSTRAN/faster-whisper?style=social)](https://github.com/SYSTRAN/faster-whisper/network/members) [![Stars](https://img.shields.io/github/stars/SYSTRAN/faster-whisper?style=social)](https://github.com/SYSTRAN/faster-whisper/stargazers)
- OpenVoice - A versatile instant voice cloning approach that requires only a short audio clip from the reference speaker to replicate their voice and generate speech in multiple languages. [![Forks](https://img.shields.io/github/forks/myshell-ai/OpenVoice?style=social)](https://github.com/myshell-ai/OpenVoice/network/members) [![Stars](https://img.shields.io/github/stars/myshell-ai/OpenVoice?style=social)](https://github.com/myshell-ai/OpenVoice/stargazers)
- Coqui STT Models - Open models for Coqui STT. [![Forks](https://img.shields.io/github/forks/coqui-ai/STT-models?style=social)](https://github.com/coqui-ai/STT-models/network/members) [![Stars](https://img.shields.io/github/stars/coqui-ai/STT-models?style=social)](https://github.com/coqui-ai/STT-models/stargazers)
- RealtimeTTS - https://github.com/KoljaB/RealtimeTTS. [![Forks](https://img.shields.io/github/forks/KoljaB/RealtimeTTS?style=social)](https://github.com/KoljaB/RealtimeTTS/network/members) [![Stars](https://img.shields.io/github/stars/KoljaB/RealtimeTTS?style=social)](https://github.com/KoljaB/RealtimeTTS/stargazers)
- MockingBird - Clone a voice in 5 seconds to generate arbitrary speech in real-time. [![Forks](https://img.shields.io/github/forks/babysor/MockingBird?style=social)](https://github.com/babysor/MockingBird/network/members) [![Stars](https://img.shields.io/github/stars/babysor/MockingBird?style=social)](https://github.com/babysor/MockingBird/stargazers)
- GPT-SoVITS - As little as 1 minute of voice data can be used to train a good TTS model (few-shot voice cloning).
- EmotiVoice - https://github.com/netease-youdao/EmotiVoice. [![Forks](https://img.shields.io/github/forks/netease-youdao/EmotiVoice?style=social)](https://github.com/netease-youdao/EmotiVoice/network/members) [![Stars](https://img.shields.io/github/stars/netease-youdao/EmotiVoice?style=social)](https://github.com/netease-youdao/EmotiVoice/stargazers)
- NeMo - A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech). [![Forks](https://img.shields.io/github/forks/NVIDIA/NeMo?style=social)](https://github.com/NVIDIA/NeMo/network/members) [![Stars](https://img.shields.io/github/stars/NVIDIA/NeMo?style=social)](https://github.com/NVIDIA/NeMo/stargazers)
- Vits - Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech. [![Forks](https://img.shields.io/github/forks/jaywalnut310/vits?style=social)](https://github.com/jaywalnut310/vits/network/members) [![Stars](https://img.shields.io/github/stars/jaywalnut310/vits?style=social)](https://github.com/jaywalnut310/vits/stargazers)
- tacotron - A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial). [![Forks](https://img.shields.io/github/forks/keithito/tacotron?style=social)](https://github.com/keithito/tacotron/network/members) [![Stars](https://img.shields.io/github/stars/keithito/tacotron?style=social)](https://github.com/keithito/tacotron/stargazers)
- tacotron2 - PyTorch implementation with faster-than-realtime inference. [![Forks](https://img.shields.io/github/forks/NVIDIA/tacotron2?style=social)](https://github.com/NVIDIA/tacotron2/network/members) [![Stars](https://img.shields.io/github/stars/NVIDIA/tacotron2?style=social)](https://github.com/NVIDIA/tacotron2/stargazers)
- FastSpeech - An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech". [![Forks](https://img.shields.io/github/forks/ming024/FastSpeech2?style=social)](https://github.com/ming024/FastSpeech2/network/members) [![Stars](https://img.shields.io/github/stars/ming024/FastSpeech2?style=social)](https://github.com/ming024/FastSpeech2/stargazers)
- VALL-E-X - An open source implementation of Microsoft's VALL-E X zero-shot TTS model. [![Forks](https://img.shields.io/github/forks/Plachtaa/VALL-E-X?style=social)](https://github.com/Plachtaa/VALL-E-X/network/members) [![Stars](https://img.shields.io/github/stars/Plachtaa/VALL-E-X?style=social)](https://github.com/Plachtaa/VALL-E-X/stargazers)
- SenseVoice - Multilingual Voice Understanding Model. [![Forks](https://img.shields.io/github/forks/FunAudioLLM/SenseVoice?style=social)](https://github.com/FunAudioLLM/SenseVoice/network/members) [![Stars](https://img.shields.io/github/stars/FunAudioLLM/SenseVoice?style=social)](https://github.com/FunAudioLLM/SenseVoice/stargazers)
-
Clothing (Virtual Try-On)
- IDM-VTON - IDM-VTON: Improving Diffusion Models for Authentic Virtual Try-on in the Wild. [![Forks](https://img.shields.io/github/forks/yisol/IDM-VTON?style=social)](https://github.com/yisol/IDM-VTON/network/members) [![Stars](https://img.shields.io/github/stars/yisol/IDM-VTON?style=social)](https://github.com/yisol/IDM-VTON/stargazers)
- MagicClothing - Official implementation of Magic Clothing: Controllable Garment-Driven Image Synthesis. [![Forks](https://img.shields.io/github/forks/ShineChen1024/MagicClothing?style=social)](https://github.com/ShineChen1024/MagicClothing/network/members) [![Stars](https://img.shields.io/github/stars/ShineChen1024/MagicClothing?style=social)](https://github.com/ShineChen1024/MagicClothing/stargazers)
- StableVITON - [CVPR2024] StableVITON: Learning Semantic Correspondence with Latent Diffusion Model for Virtual Try-On. [![Forks](https://img.shields.io/github/forks/rlawjdghek/StableVITON?style=social)](https://github.com/rlawjdghek/StableVITON/network/members) [![Stars](https://img.shields.io/github/stars/rlawjdghek/StableVITON?style=social)](https://github.com/rlawjdghek/StableVITON/stargazers)
- HR Viton - Official PyTorch implementation for the paper High-Resolution Virtual Try-On with Misalignment and Occlusion-Handled Conditions (ECCV 2022). [![Forks](https://img.shields.io/github/forks/sangyun884/HR-VITON?style=social)](https://github.com/sangyun884/HR-VITON/network/members) [![Stars](https://img.shields.io/github/stars/sangyun884/HR-VITON?style=social)](https://github.com/sangyun884/HR-VITON/stargazers)
- Dressing in order - (ICCV'21) Official code of "Dressing in Order: Recurrent Person Image Generation for Pose Transfer, Virtual Try-on and Outfit Editing" by Aiyu Cui, Daniel McKee and Svetlana Lazebnik [![Forks](https://img.shields.io/github/forks/cuiaiyu/dressing-in-order?style=social)](https://github.com/cuiaiyu/dressing-in-order/network/members) [![Stars](https://img.shields.io/github/stars/cuiaiyu/dressing-in-order?style=social)](https://github.com/cuiaiyu/dressing-in-order/stargazers)
- Dress Code - Dress Code: High-Resolution Multi-Category Virtual Try-On. ECCV 2022. [![Forks](https://img.shields.io/github/forks/aimagelab/dress-code?style=social)](https://github.com/aimagelab/dress-code/network/members) [![Stars](https://img.shields.io/github/stars/aimagelab/dress-code?style=social)](https://github.com/aimagelab/dress-code/stargazers)
-
Agent
- BabyAGI - Python script that acts as an AI-powered task manager. [![Forks](https://img.shields.io/github/forks/yoheinakajima/babyagi?style=social)](https://github.com/yoheinakajima/babyagi/network/members) [![Stars](https://img.shields.io/github/stars/yoheinakajima/babyagi?style=social)](https://github.com/yoheinakajima/babyagi/stargazers)
- SWE-agent - SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.47% of bugs in the SWE-bench evaluation set and takes just 1 minute to run. [![Forks](https://img.shields.io/github/forks/princeton-nlp/SWE-agent?style=social)](https://github.com/princeton-nlp/SWE-agent/network/members) [![Stars](https://img.shields.io/github/stars/princeton-nlp/SWE-agent?style=social)](https://github.com/princeton-nlp/SWE-agent/stargazers)
-
Text2SQL
- SQLCoder - SoTA LLM for converting natural language questions to SQL queries. [![Forks](https://img.shields.io/github/forks/defog-ai/sqlcoder?style=social)](https://github.com/defog-ai/sqlcoder/network/members) [![Stars](https://img.shields.io/github/stars/defog-ai/sqlcoder?style=social)](https://github.com/defog-ai/sqlcoder/stargazers)
- Vanna - Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG. [![Forks](https://img.shields.io/github/forks/vanna-ai/vanna?style=social)](https://github.com/vanna-ai/vanna/network/members) [![Stars](https://img.shields.io/github/stars/vanna-ai/vanna?style=social)](https://github.com/vanna-ai/vanna/stargazers)
- SQLChat - Chat-based SQL Client and Editor for the next decade. [![Forks](https://img.shields.io/github/forks/sqlchat/sqlchat?style=social)](https://github.com/sqlchat/sqlchat/network/members) [![Stars](https://img.shields.io/github/stars/sqlchat/sqlchat?style=social)](https://github.com/sqlchat/sqlchat/stargazers)
- Dataherald - Interact with your SQL database, Natural Language to SQL using LLMs. [![Forks](https://img.shields.io/github/forks/Dataherald/dataherald?style=social)](https://github.com/Dataherald/dataherald/network/members) [![Stars](https://img.shields.io/github/stars/Dataherald/dataherald?style=social)](https://github.com/Dataherald/dataherald/stargazers)
- WrenAI - Wren AI makes your database RAG-ready. Implement Text-to-SQL more accurately and securely. [![Forks](https://img.shields.io/github/forks/Canner/WrenAI?style=social)](https://github.com/Canner/WrenAI/network/members) [![Stars](https://img.shields.io/github/stars/Canner/WrenAI?style=social)](https://github.com/Canner/WrenAI/stargazers)
-
Virtual Human
- MuseV - MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising by Tencent [![Forks](https://img.shields.io/github/forks/TMElyralab/MuseV?style=social)](https://github.com/TMElyralab/MuseV/network/members) [![Stars](https://img.shields.io/github/stars/TMElyralab/MuseV?style=social)](https://github.com/TMElyralab/MuseV/stargazers)
-
Deep Fake
- SadTalker - SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation. [![Forks](https://img.shields.io/github/forks/OpenTalker/SadTalker?style=social)](https://github.com/OpenTalker/SadTalker/network/members) [![Stars](https://img.shields.io/github/stars/OpenTalker/SadTalker?style=social)](https://github.com/OpenTalker/SadTalker/stargazers)
- Facefusion - Next generation face swapper and enhancer. [![Forks](https://img.shields.io/github/forks/facefusion/facefusion?style=social)](https://github.com/facefusion/facefusion/network/members) [![Stars](https://img.shields.io/github/stars/facefusion/facefusion?style=social)](https://github.com/facefusion/facefusion/stargazers)
- Ghost - A new one shot face swap approach for image and video domains. [![Forks](https://img.shields.io/github/forks/ai-forever/ghost?style=social)](https://github.com/ai-forever/ghost/network/members) [![Stars](https://img.shields.io/github/stars/ai-forever/ghost?style=social)](https://github.com/ai-forever/ghost/stargazers)
- Fay - Fay is an open-source digital human framework integrating language models and digital characters. It offers retail, assistant, and agent versions for diverse applications like virtual shopping guides, broadcasters, assistants, waiters, teachers, and voice or text-based mobile assistants. [![Forks](https://img.shields.io/github/forks/xszyou/Fay?style=social)](https://github.com/xszyou/Fay/network/members) [![Stars](https://img.shields.io/github/stars/xszyou/Fay?style=social)](https://github.com/xszyou/Fay/stargazers)
- VideoReTalking - [SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild. [![Forks](https://img.shields.io/github/forks/OpenTalker/video-retalking?style=social)](https://github.com/OpenTalker/video-retalking/network/members) [![Stars](https://img.shields.io/github/stars/OpenTalker/video-retalking?style=social)](https://github.com/OpenTalker/video-retalking/stargazers)
-
Keywords
- python (30)
- ai (28)
- llm (24)
- deep-learning (24)
- tts (22)
- gpt (22)
- openai (20)
- text-to-speech (18)
- rag (16)
- pytorch (14)
- chatgpt (14)
- speech-synthesis (12)
- agent (12)
- langchain (10)
- machine-learning (10)
- gpt-4 (10)
- genai (10)
- speech (8)
- artificial-intelligence (8)
- text-to-sql (8)
- sql (8)
- virtual-try-on (6)
- database (6)
- speech-to-text (6)
- voice-clone (6)
- nextjs (6)
- computer-vision (6)
- large-language-models (6)
- chatbot (6)
- genaistack (4)
- langchain-python (4)
- tacotron (4)
- voice-cloning (4)
- llamaindex (4)
- neural-network (4)
- speech-recognition (4)
- virtual-tryon (4)
- chat (4)
- agents (4)
- llm-agent (4)
- tensorflow (4)
- llmops (4)
- asr (4)
- deepfake (4)
- face-swap (4)
- llms (4)
- faceswap (4)
- deep-fake (4)
- generative-ai (4)
- aigc (4)