https://github.com/amrzv/awesome-colab-notebooks

Collection of google colaboratory notebooks for fast and easy experiments
https://github.com/amrzv/awesome-colab-notebooks
List: awesome-colab-notebooks
cnn colab-notebooks deep-learning deep-neural-networks generative-adversarial-network google-colab google-colab-notebook google-colab-notebooks google-colab-tutorial google-colaboratory google-colabs jupyter-notebooks machine-learning pytorch tensorflow tensorflow-tutorials
Last synced: about 1 month ago
JSON representation
Collection of google colaboratory notebooks for fast and easy experiments
Host: GitHub
URL: https://github.com/amrzv/awesome-colab-notebooks
Owner: amrzv
License: mit
Created: 2020-12-27T11:47:18.000Z (over 4 years ago)
Default Branch: main
Last Pushed: 2025-04-06T19:59:59.000Z (3 months ago)
Last Synced: 2025-04-11T17:10:40.667Z (2 months ago)
Topics: cnn, colab-notebooks, deep-learning, deep-neural-networks, generative-adversarial-network, google-colab, google-colab-notebook, google-colab-notebooks, google-colab-tutorial, google-colaboratory, google-colabs, jupyter-notebooks, machine-learning, pytorch, tensorflow, tensorflow-tutorials
Language: Python
Homepage:
Size: 1.84 MB
Stars: 1,441
Watchers: 47
Forks: 261
Open Issues: 1
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project

ultimate-awesome - awesome-colab-notebooks - Collection of google colaboratory notebooks for fast and easy experiments. (Other Lists / Julia Lists)
README

        [![Hits](https://hits.seeyoufarm.com/api/count/incr/badge.svg?url=https://github.com/amrzv/awesome-colab-notebooks)](https://hits.seeyoufarm.com)

![awesome-colab-notebooks](https://count.getloli.com/get/@awesome-colab-notebooks?theme=rule34)

[![Word Cloud](images/cloud.svg)](images/cloud.svg)

The page might not be rendered properly. Please open [README.md](https://github.com/amrzv/awesome-colab-notebooks/blob/main/README.md) file directly

# Awesome colab notebooks collection for ML experiments

## Trending

| repositories | papers | packages |

|---|---|---|

| 


agent-starter-pack	[![](https://img.shields.io/github/stars/GoogleCloudPlatform/agent-starter-pack?style=social)](https://github.com/GoogleCloudPlatform/agent-starter-pack)
 Qwen2.5-Omni	[![](https://img.shields.io/github/stars/QwenLM/Qwen2.5-Omni?style=social)](https://github.com/QwenLM/Qwen2.5-Omni)
 verl	[![](https://img.shields.io/github/stars/volcengine/verl?style=social)](https://github.com/volcengine/verl)
 opik	[![](https://img.shields.io/github/stars/comet-ml/opik?style=social)](https://github.com/comet-ml/opik)
 courses	[![](https://img.shields.io/github/stars/anthropics/courses?style=social)](https://github.com/anthropics/courses)
 crawl4ai	[![](https://img.shields.io/github/stars/unclecode/crawl4ai?style=social)](https://github.com/unclecode/crawl4ai)
 TabPFN	[![](https://img.shields.io/github/stars/automl/TabPFN?style=social)](https://github.com/automl/TabPFN)
 langgraph	[![](https://img.shields.io/github/stars/langchain-ai/langgraph?style=social)](https://github.com/langchain-ai/langgraph)
 litellm	[![](https://img.shields.io/github/stars/BerriAI/litellm?style=social)](https://github.com/BerriAI/litellm)
 DL_intro	[![](https://img.shields.io/github/stars/aifoundation-edu/DL_intro?style=social)](https://github.com/aifoundation-edu/DL_intro)
 langfun	[![](https://img.shields.io/github/stars/google/langfun?style=social)](https://github.com/google/langfun)
 SAELens	[![](https://img.shields.io/github/stars/jbloomAus/SAELens?style=social)](https://github.com/jbloomAus/SAELens)
 ARENA_3.0	[![](https://img.shields.io/github/stars/callummcdougall/ARENA_3.0?style=social)](https://github.com/callummcdougall/ARENA_3.0)
 TransformerLens	[![](https://img.shields.io/github/stars/TransformerLensOrg/TransformerLens?style=social)](https://github.com/TransformerLensOrg/TransformerLens)
 InvSR	[![](https://img.shields.io/github/stars/zsyOAOA/InvSR?style=social)](https://github.com/zsyOAOA/InvSR)
 tapnet	[![](https://img.shields.io/github/stars/google-deepmind/tapnet?style=social)](https://github.com/google-deepmind/tapnet)
 unsloth	[![](https://img.shields.io/github/stars/unslothai/unsloth?style=social)](https://github.com/unslothai/unsloth)
 trl	[![](https://img.shields.io/github/stars/huggingface/trl?style=social)](https://github.com/huggingface/trl)
 YuE	[![](https://img.shields.io/github/stars/multimodal-art-projection/YuE?style=social)](https://github.com/multimodal-art-projection/YuE)
 STAR	[![](https://img.shields.io/github/stars/NJU-PCALab/STAR?style=social)](https://github.com/NJU-PCALab/STAR)
 Show-1	[![](https://img.shields.io/github/stars/showlab/Show-1?style=social)](https://github.com/showlab/Show-1)
 MARS5-TTS	[![](https://img.shields.io/github/stars/Camb-ai/MARS5-TTS?style=social)](https://github.com/Camb-ai/MARS5-TTS)
 lm-evaluation-harness	[![](https://img.shields.io/github/stars/EleutherAI/lm-evaluation-harness?style=social)](https://github.com/EleutherAI/lm-evaluation-harness)
 cleanrl	[![](https://img.shields.io/github/stars/vwxyzjn/cleanrl?style=social)](https://github.com/vwxyzjn/cleanrl)
 DiffSynth-Studio	[![](https://img.shields.io/github/stars/Artiprocher/DiffSynth-Studio?style=social)](https://github.com/Artiprocher/DiffSynth-Studio)
 ComfyUI	[![](https://img.shields.io/github/stars/comfyanonymous/ComfyUI?style=social)](https://github.com/comfyanonymous/ComfyUI)
 rlds	[![](https://img.shields.io/github/stars/google-research/rlds?style=social)](https://github.com/google-research/rlds)
 llama-recipes	[![](https://img.shields.io/github/stars/meta-llama/llama-recipes?style=social)](https://github.com/meta-llama/llama-recipes)
 mujoco	[![](https://img.shields.io/github/stars/deepmind/mujoco?style=social)](https://github.com/deepmind/mujoco)
 PGMax	[![](https://img.shields.io/github/stars/deepmind/PGMax?style=social)](https://github.com/deepmind/PGMax)
 dinov2	[![](https://img.shields.io/github/stars/facebookresearch/dinov2?style=social)](https://github.com/facebookresearch/dinov2)
 instructor	[![](https://img.shields.io/github/stars/jxnl/instructor?style=social)](https://github.com/jxnl/instructor)
 autogen	[![](https://img.shields.io/github/stars/microsoft/autogen?style=social)](https://github.com/microsoft/autogen)
 ultralytics	[![](https://img.shields.io/github/stars/ultralytics/ultralytics?style=social)](https://github.com/ultralytics/ultralytics)
 segment-anything-2	[![](https://img.shields.io/github/stars/facebookresearch/segment-anything-2?style=social)](https://github.com/facebookresearch/segment-anything-2)
 xland-minigrid	[![](https://img.shields.io/github/stars/dunnolab/xland-minigrid?style=social)](https://github.com/dunnolab/xland-minigrid)
 ModernBERT	[![](https://img.shields.io/github/stars/AnswerDotAI/ModernBERT?style=social)](https://github.com/AnswerDotAI/ModernBERT)
 rl-baselines3-zoo	[![](https://img.shields.io/github/stars/DLR-RM/rl-baselines3-zoo?style=social)](https://github.com/DLR-RM/rl-baselines3-zoo)
 myosuite	[![](https://img.shields.io/github/stars/facebookresearch/myosuite?style=social)](https://github.com/facebookresearch/myosuite)
 GroundingDINO	[![](https://img.shields.io/github/stars/IDEA-Research/GroundingDINO?style=social)](https://github.com/IDEA-Research/GroundingDINO)
 ollama	[![](https://img.shields.io/github/stars/ollama/ollama?style=social)](https://github.com/ollama/ollama)
 dsp-theory	[![](https://img.shields.io/github/stars/hukenovs/dsp-theory?style=social)](https://github.com/hukenovs/dsp-theory)
 Retrieval-based-Voice-Conversion-WebUI	[![](https://img.shields.io/github/stars/RVC-Project/Retrieval-based-Voice-Conversion-WebUI?style=social)](https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI)
 NeMo	[![](https://img.shields.io/github/stars/NVIDIA/NeMo?style=social)](https://github.com/NVIDIA/NeMo)
 evidently	[![](https://img.shields.io/github/stars/evidentlyai/evidently?style=social)](https://github.com/evidentlyai/evidently)
 xla	[![](https://img.shields.io/github/stars/openxla/xla?style=social)](https://github.com/openxla/xla)
 Qwen	[![](https://img.shields.io/github/stars/QwenLM/Qwen?style=social)](https://github.com/QwenLM/Qwen)
 stable-baselines3	[![](https://img.shields.io/github/stars/DLR-RM/stable-baselines3?style=social)](https://github.com/DLR-RM/stable-baselines3)
 torchgeo	[![](https://img.shields.io/github/stars/microsoft/torchgeo?style=social)](https://github.com/microsoft/torchgeo)
 datachain	[![](https://img.shields.io/github/stars/iterative/datachain?style=social)](https://github.com/iterative/datachain)

 | 
 | 

opik	[![](https://img.shields.io/pypi/dw/opik?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/opik/)
 presidio-structured	[![](https://img.shields.io/pypi/dw/presidio-structured?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/presidio-structured/)
 Crawl4AI	[![](https://img.shields.io/pypi/dw/Crawl4AI?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/Crawl4AI/)
 presidio-anonymizer	[![](https://img.shields.io/pypi/dw/presidio-anonymizer?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/presidio-anonymizer/)
 presidio-analyzer	[![](https://img.shields.io/pypi/dw/presidio-analyzer?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/presidio-analyzer/)
 loralib	[![](https://img.shields.io/pypi/dw/loralib?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/loralib/)
 langchain	[![](https://img.shields.io/pypi/dw/langchain?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/langchain/)
 reformer-pytorch	[![](https://img.shields.io/pypi/dw/reformer-pytorch?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/reformer-pytorch/)
 xmanager	[![](https://img.shields.io/pypi/dw/xmanager?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/xmanager/)
 giskard	[![](https://img.shields.io/pypi/dw/giskard?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/giskard/)
 litellm	[![](https://img.shields.io/pypi/dw/litellm?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/litellm/)
 agent-starter-pack	[![](https://img.shields.io/pypi/dw/agent-starter-pack?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/agent-starter-pack/)
 imagededup	[![](https://img.shields.io/pypi/dw/imagededup?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/imagededup/)
 bert-score	[![](https://img.shields.io/pypi/dw/bert-score?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/bert-score/)
 unsloth	[![](https://img.shields.io/pypi/dw/unsloth?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/unsloth/)
 langgraph	[![](https://img.shields.io/pypi/dw/langgraph?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/langgraph/)
 ollama	[![](https://img.shields.io/pypi/dw/ollama?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/ollama/)
 google-generativeai	[![](https://img.shields.io/pypi/dw/google-generativeai?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/google-generativeai/)
 mistral-inference	[![](https://img.shields.io/pypi/dw/mistral-inference?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/mistral-inference/)
 sglang	[![](https://img.shields.io/pypi/dw/sglang?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/sglang/)
 folium	[![](https://img.shields.io/pypi/dw/folium?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/folium/)
 transformer-lens	[![](https://img.shields.io/pypi/dw/transformer-lens?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/transformer-lens/)
 google-cloud-aiplatform	[![](https://img.shields.io/pypi/dw/google-cloud-aiplatform?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/google-cloud-aiplatform/)
 mmsegmentation	[![](https://img.shields.io/pypi/dw/mmsegmentation?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/mmsegmentation/)
 llama-index	[![](https://img.shields.io/pypi/dw/llama-index?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/llama-index/)
 supervision	[![](https://img.shields.io/pypi/dw/supervision?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/supervision/)
 autofaiss	[![](https://img.shields.io/pypi/dw/autofaiss?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/autofaiss/)
 mmdet	[![](https://img.shields.io/pypi/dw/mmdet?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/mmdet/)
 mmpose	[![](https://img.shields.io/pypi/dw/mmpose?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/mmpose/)
 panda-gym	[![](https://img.shields.io/pypi/dw/panda-gym?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/panda-gym/)
 neural-tangents	[![](https://img.shields.io/pypi/dw/neural-tangents?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/neural-tangents/)
 sae-lens	[![](https://img.shields.io/pypi/dw/sae-lens?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/sae-lens/)
 rl-games	[![](https://img.shields.io/pypi/dw/rl-games?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/rl-games/)
 dopamine-rl	[![](https://img.shields.io/pypi/dw/dopamine-rl?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/dopamine-rl/)
 gin-config	[![](https://img.shields.io/pypi/dw/gin-config?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/gin-config/)
 google-cloud-discoveryengine	[![](https://img.shields.io/pypi/dw/google-cloud-discoveryengine?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/google-cloud-discoveryengine/)
 pybullet	[![](https://img.shields.io/pypi/dw/pybullet?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/pybullet/)
 tensorflow-datasets	[![](https://img.shields.io/pypi/dw/tensorflow-datasets?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/tensorflow-datasets/)
 tfx	[![](https://img.shields.io/pypi/dw/tfx?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/tfx/)
 tensorflow-federated	[![](https://img.shields.io/pypi/dw/tensorflow-federated?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/tensorflow-federated/)
 tensorflow-data-validation	[![](https://img.shields.io/pypi/dw/tensorflow-data-validation?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/tensorflow-data-validation/)
 earthengine-api	[![](https://img.shields.io/pypi/dw/earthengine-api?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/earthengine-api/)
 catboost	[![](https://img.shields.io/pypi/dw/catboost?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/catboost/)
 dm-reverb	[![](https://img.shields.io/pypi/dw/dm-reverb?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/dm-reverb/)
 tensorflow-gnn	[![](https://img.shields.io/pypi/dw/tensorflow-gnn?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/tensorflow-gnn/)
 guidance	[![](https://img.shields.io/pypi/dw/guidance?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/guidance/)
 scikit-video	[![](https://img.shields.io/pypi/dw/scikit-video?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/scikit-video/)
 tensor2tensor	[![](https://img.shields.io/pypi/dw/tensor2tensor?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/tensor2tensor/)
 rudolph	[![](https://img.shields.io/pypi/dw/rudolph?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/rudolph/)
 img2dataset	[![](https://img.shields.io/pypi/dw/img2dataset?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/img2dataset/)

 |

## Courses

COURSES

| name | description | authors | links | colaboratory | update |

|------|-------------|:--------|:------|:------------:|:------:|

| LLM Engineering Essentials course | 12-week course, created by experts from academia and industry, is designed specifically for developers and engineers | 


[Stanislav Fedotov](https://github.com/st-fedotov)
 [Alexey Bukhtiyarov](https://github.com/leshanbog)
 [Nikita Pavlichenko](https://www.pavlichenko.info/)
others[Sergei Petrov](https://www.linkedin.com/in/sergei-petrov-12589210b/)
 [Sergei Skvortsov](https://github.com/southfreebird)
 [Alex Umnov](https://github.com/AlexUmnov)

 | [![](https://img.shields.io/github/stars/Nebius-Academy/LLM-Engineering-Essentials?style=social)](https://github.com/Nebius-Academy/LLM-Engineering-Essentials) 

[](https://x.com/karpathy/status/1886192184808149383)

[website](https://academy.nebius.com/llm-engineering-essentials)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/Nebius-Academy/LLM-Engineering-Essentials/blob/main/topic1/1.1_intro_to_llm_apis.ipynb) | 08.05.2025 |

| Deep Learning School course (ML + CV) |  | [Nina Konovalova](https://github.com/Nina-Konovalova) | [![](https://img.shields.io/github/stars/DeepLearningSchool/part_1_ml_cv?style=social)](https://github.com/DeepLearningSchool/part_1_ml_cv) 

[](https://www.kaggle.com/datasets/abrambeyer/openintro-possum)

[website](https://dls.samcs.ru/)

[](https://www.youtube.com/playlist?list=PL0Ks75aof3TiHbkJ95vxNlQefujrj1N2w), [](https://www.youtube.com/playlist?list=PL5-da3qGB5ICCsgW1MxlZ0Hq8LL5U3u9y)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/DeepLearningSchool/part_1_ml_cv/blob/main/week_01_ml_intro/Practice/part_1_data_analysis.ipynb) | 14.02.2025 |

| Introduction to Deep Learning course |  | [Tatiana Gaintseva](https://atmyre.github.io/) | [![](https://img.shields.io/github/stars/aifoundation-edu/DL_intro?style=social)](https://github.com/aifoundation-edu/DL_intro) 

[](https://medium.com/@zxr.nju/what-is-the-kernel-trick-why-is-it-important-98a98db0961d), [](https://medium.com/analytics-vidhya/how-relu-works-f317a947bdc6)

[](https://playground.tensorflow.org/)

[](https://en.wikipedia.org/wiki/Kernel_method)

[](https://www.youtube.com/playlist?list=PL0Ks75aof3ThuLLtLIVl_KPUDDQlTDyJI), [](https://youtu.be/oa3IVkENrdM), [](https://www.youtube.com/live/qvauAflpnyo), [](https://youtu.be/Q7vT0--5VII)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/aifoundation-edu/DL_intro/blob/main/01_Intro_to_Neural_Networks/01_NN_intro.ipynb) | 24.01.2025 |

| ARENA | Provide talented individuals with the skills, tools, and environment necessary for upskilling in ML engineering, for the purpose of contributing directly to AI alignment in technical roles | [Callum McDougall](https://www.perfectlynormal.co.uk/) | [![](https://img.shields.io/github/stars/callummcdougall/ARENA_3.0?style=social)](https://github.com/callummcdougall/ARENA_3.0) 

[](https://arxiv.org/abs/2211.00593)

[](https://join.slack.com/t/arena-uk/shared_invite/zt-2noug8mpy-TRYbCnc3pzj7ITNrZIjKww)

[website](https://arena-resources.notion.site/)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1vuQOB2Gd7OcfzH2y9djXm9OdZA_DcxYz) | 30.12.2024 |

| The Autodiff Cookbook | You'll go through a whole bunch of neat autodiff ideas that you can cherry pick for your own work, starting with the basics | 

[Alex Wiltschko](https://github.com/alexbw)
 [Matthew Johnson](http://people.csail.mit.edu/mattjj/)

 | [![](https://img.shields.io/github/stars/google/jax?style=social)](https://github.com/google/jax/issues/446#issuecomment-467105048) 

[](https://arxiv.org/abs/1406.2572), [](https://arxiv.org/abs/1706.04454), [](https://arxiv.org/abs/1802.03451), [](https://arxiv.org/abs/1811.07062)

[book](https://mitpress.mit.edu/sites/default/files/titles/content/sicm_edition_2/book.html), [book](https://mitpress.mit.edu/books/functional-differential-geometry)

[](https://github.com/google/jax#auto-vectorization-with-vmap), [](https://github.com/hips/autograd)

[tutorial](http://videolectures.net/deeplearning2017_johnson_automatic_differentiation/)

[](https://en.wikipedia.org/wiki/Truncated_Newton_method), [](https://en.wikipedia.org/wiki/Pullback_(differential_geometry)), [](https://en.wikipedia.org/wiki/Holomorphic_function), [](https://en.wikipedia.org/wiki/Cauchy%E2%80%93Riemann_equations)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/google/jax/blob/main/docs/notebooks/autodiff_cookbook.ipynb) | 20.09.2024 |

| Machine Learning Simplified | A Gentle Introduction to Supervised Learning | [Andrew Wolf](https://5x12.ai/) | [![](https://img.shields.io/github/stars/5x12/themlsbook?style=social)](https://github.com/5x12/themlsbook) 

[](https://medium.com/geekculture/i-found-a-great-machine-learning-book-deed11db2688)

[](https://www.reddit.com/r/Python/comments/t8st9l/i_wrote_a_book_on_machine_learning_w_python_code/), [](https://www.reddit.com/r/learnmachinelearning/comments/snxlly/machine_learning_simplified_book/)

[website](https://www.themlsbook.com/)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/5x12/themlsbook/blob/master/chapter2/knn.ipynb) | 29.08.2024 |

| Anthropic courses | Anthropic's educational courses | [Anthropic](https://www.anthropic.com/) | [![](https://img.shields.io/github/stars/anthropics/courses?style=social)](https://github.com/anthropics/courses) 

[](https://docs.anthropic.com/en/docs/resources/courses)

[](https://www.reddit.com/r/ClaudeAI/comments/1f7czsx/anthropics_official_educational_courses_on_prompt/)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/anthropics/courses/blob/master/anthropic_api_fundamentals/01_getting_started.ipynb) | 22.08.2024 |

| mlcourse.ai | Open Machine Learning Course | [Yury Kashnitsky](https://yorko.github.io/) | [![](https://img.shields.io/github/stars/Yorko/mlcourse.ai?style=social)](https://github.com/Yorko/mlcourse.ai) 

[blog post](https://habr.com/company/ods/blog/344044/)

[](https://www.kaggle.com/kashnitsky/mlcourse)

[](https://medium.com/open-machine-learning-course)

[project](https://mlcourse.ai/book/index.html)

[](https://opendatascience.slack.com/archives/C91N8TL83/p1567408586359500)

[](https://www.youtube.com/playlist?list=PLVlY_7IJCMJeRfZ68eVfEcu-UcN9BbwiX)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/Yorko/mlcourse.ai/blob/main/jupyter_english/topic01_pandas_data_analysis/topic1_pandas_data_analysis.ipynb) | 19.08.2024 |

| Deep RL Course | The Hugging Face Deep Reinforcement Learning Course | 

[Thomas Simonini](https://www.simoninithomas.com/)
 [Omar Sanseviero](https://osanseviero.github.io/hackerllama/)
 [Sayak Paul](https://sayak.dev/)

 | [![](https://img.shields.io/github/stars/huggingface/deep-rl-class?style=social)](https://github.com/huggingface/deep-rl-class) 

[](https://github.com/alex-petrenko/sample-factory)

[](https://huggingface.co/deep-rl-course/unit0/introduction), [](https://huggingface.co/spaces/huggingface-projects/Deep-Reinforcement-Learning-Leaderboard)

[](https://pytorch.org/tutorials/beginner/deep_learning_60min_blitz.html)

[syllabus](https://simoninithomas.github.io/deep-rl-course)

[](https://youtu.be/2GwBez0D20A), [](https://youtu.be/CsuIANBnSq8), [](https://youtu.be/AQKAOXJa6qg)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/huggingface/deep-rl-class/blob/main/notebooks/unit1/unit1.ipynb) | 24.06.2024 |

| Generative AI for Beginners - A Course | A 12 Lesson course teaching everything you need to know to start building Generative AI applications | [microsoft](https://www.microsoft.com/) | [![](https://img.shields.io/github/stars/microsoft/xr-development-for-beginners?style=social)](https://github.com/microsoft/xr-development-for-beginners) 

[](https://aka.ms/genai-discord)

[](https://github.com/microsoft/Web-Dev-For-Beginners)

[project](https://microsoft.github.io/generative-ai-for-beginners/)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/microsoft/generative-ai-for-beginners/blob/main/06-text-generation-apps/notebook-azure-openai.ipynb) | 22.02.2024 |

| DSP theory | Theory of digital signal processing: signals, filtration (IIR, FIR, CIC, MAF), transforms (FFT, DFT, Hilbert, Z-transform) etc | 

[Alexander Kapitanov](https://github.com/hukenovs)
 [Vladimir Fadeev](https://github.com/kirlf)
 [Karina Kvanchiani](https://github.com/karinakvanchiani)
others[Elizaveta Petrova](https://github.com/kleinsbotle)
 [Andrei Makhliarchuk](https://github.com/anotherhelloworld)

 | [![](https://img.shields.io/github/stars/hukenovs/dsp-theory?style=social)](https://github.com/hukenovs/dsp-theory) [blog post](https://habr.com/ru/articles/460445/) | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/hukenovs/dsp-theory/blob/master/src/dsp_theory_1_signals.ipynb) | 18.10.2022 |

| Machine learning course | This course is broad and shallow, but author will provide additional links so that you can deepen your understanding of the ML method you need | [Тимчишин Віталій](https://github.com/fbeilstein) | [![](https://img.shields.io/github/stars/fbeilstein/machine_learning?style=social)](https://github.com/fbeilstein/machine_learning) 

[blog post](https://vas3k.com/blog/machine_learning/)

[](https://www.youtube.com/playlist?list=PLkDeTjsoxDVgnb2lIYo9-1l4XYhrIyS6A), [](https://youtu.be/-RdOwhmqP5s), [](https://youtu.be/R13BD8qKeTg), [](https://youtu.be/ZkjP5RJLQF4), [](https://youtu.be/J4Wdy0Wc_xQ), [](https://youtu.be/mBcLRGuAFUk), [](https://youtu.be/YIGtalP1mv0), [](https://youtu.be/Yz5pySyEtsU), [](https://youtu.be/x5zLaWT5KPs), [](https://youtu.be/yBwpo-L80Mc), [](https://www.youtube.com/playlist?list=PL3FW7Lu3i5JvHM8ljYj-zLfQRF3EO8sYv)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/fbeilstein/machine_learning/blob/master/lecture_01_introduction.ipynb) | 02.09.2021 |

| NYU-DLSP20 | This course concerns the latest techniques in deep learning and representation learning, focusing on supervised and unsupervised deep learning, embedding methods, metric learning, convolutional and recurrent nets, with applications to computer vision, natural language understanding, and speech recognition | 

[Yann LeCun](https://yann.lecun.com/)
 [Alfredo Canziani](https://atcold.github.io/)

 | [![](https://img.shields.io/github/stars/Atcold/NYU-DLSP20?style=social)](https://github.com/Atcold/NYU-DLSP20) 

[](https://discord.gg/CthuqsX8Pb)

[](https://github.com/Atcold/NYU-DLSP21), [](https://github.com/Atcold/NYU-DLFL22)

[](https://www.reddit.com/r/NYU_DeepLearning/)

[website](https://atcold.github.io/NYU-DLSP20/)

[](https://www.youtube.com/playlist?list=PLLHTzKZzVU9eaEyErdV26ikyolxOsz6mq)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/Atcold/NYU-DLSP20/blob/master/00-logic_neuron_programming.ipynb) | 30.10.2019 |

## Research

RESEARCH

| name | description | authors | links | colaboratory | update |

|------|-------------|:--------|:------|:------------:|:------:|

| Qwen2.5-Omni | End-to-end multimodal model designed to perceive diverse modalities, including text, images, audio, and video, while simultaneously generating text and natural speech responses in a streaming manner | 


[Jin Xu](https://jxu-thu.github.io/)
 [Zhifang Guo](https://scholar.google.com/citations?user=LHzzlAgAAAAJ)
 [Jinzheng He](https://scholar.google.com/citations?user=tsQVykcAAAAJ)
others[Ting He](https://scholar.google.com/citations?user=Yxat_NMAAAAJ)
 [Shuai Bai](https://github.com/ShuaiBai623)
 [Keqin Chen](https://github.com/OliverChen0602)
 [Jialin Wang](https://github.com/JialinWangPKU)
 [Yang Fan](https://github.com/fyabc)
 [Peng Wang](https://scholar.google.com/citations?user=7fjqA0YAAAAJ)
 [Bin Zhang](https://zhangchbin.github.io/)
 [Xiong Wang](https://github.com/wangxiongts)
 [Yunfei Chu](https://scholar.google.com/citations?user=41QhCyYAAAAJ)
 [Junyang Lin](https://justinlin610.github.io/)

 | [![](https://img.shields.io/github/stars/QwenLM/Qwen2.5-Omni?style=social)](https://github.com/QwenLM/Qwen2.5-Omni) 

[API](https://help.aliyun.com/zh/model-studio/user-guide/qwen-omni)

[Chat](https://chat.qwen.ai/)

[](https://arxiv.org/abs/2503.20215)

[blog post](https://qwenlm.github.io/blog/qwen2.5-omni/)

[](https://discord.gg/CV4E9rpNSD)

[](https://hub.docker.com/r/qwenllm/qwen-omni)

[](https://github.com/Dao-AILab/flash-attention)

[](https://huggingface.co/collections/Qwen/qwen25-omni-67de1e5f0f9464dc6314b36e), [](https://huggingface.co/spaces/Qwen/Qwen2.5-Omni-7B-Demo)

[](https://medium.com/data-science-in-your-pocket/qwen2-5-omni-7b-input-anything-output-text-and-audio-llm-87c050716172)

[](https://youtu.be/yKcANdkRuNI), [](https://youtu.be/-JY1wcRRXMA), [](https://youtu.be/Vez5TyC5YTE), [](https://youtu.be/5BqFvIm0joU)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/QwenLM/Qwen2.5-Omni/blob/main/cookbooks/omni_chatting_for_math.ipynb) | 29.04.2025 |

| SigLIP 2 | Family of new multilingual vision-language encoders that build on the success of the original SigLIP | 

[Michael Tschannen](https://mitscha.github.io/)
 [Alexey Gritsenko](https://github.com/AlexeyG)
 [Xiao Wang](https://scholar.google.com/citations?user=ukyXqzMAAAAJ)
others[Muhammad Ferjad Naeem](https://ferjad.github.io/)
 [Ibrahim Alabdulmohsin](https://ibomohsin.github.io/)
 [Nikhil Parthasarathy](https://scholar.google.com/citations?user=X9mO4ckAAAAJ)
 [Talfan Evans](https://talfanevans.co.uk/)
 [Lucas Beyer](http://lucasb.eyer.be)
 [Ye Xia](https://scholar.google.com/citations?user=QQhJ1pAAAAAJ)
 [Basil Mustafa](https://www.basilmustafa.com/)
 [Olivier Hénaff](https://www.olivierhenaff.com/)
 [Jeremiah Harmsen](https://research.google/people/jeremiahharmsen)
 [Andreas Steiner](https://scholar.google.com/citations?user=vIZeAu4AAAAJ)
 [Xiaohua Zhai](https://sites.google.com/view/xzhai)

 | [![](https://img.shields.io/github/stars/google-research/big_vision?style=social)](https://github.com/google-research/big_vision/blob/main/big_vision/models/vit.py) 

[](https://arxiv.org/abs/2502.14786), [](https://arxiv.org/abs/2303.15343)

[](https://github.com/google-research/big_vision/blob/main/big_vision/models/proj/image_text/two_towers.py), [](https://github.com/google-research/big_vision/blob/main/big_vision/models/proj/image_text/naflex_vit.py), [](https://github.com/google-research/big_vision/blob/main/big_vision/pp/proj/image_text/ops_naflex.py)

[](https://huggingface.co/collections/google/siglip2-67b5dcef38c175486e240107)

[](https://artgor.medium.com/paper-review-siglip-2-multilingual-vision-language-encoders-with-improved-semantic-understanding-b7b578002adc), [](https://ritvik19.medium.com/papers-explained-320-siglip-2-dba08ff09559)

[](https://youtu.be/r8y7gXIpb4A)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/google-research/big_vision/blob/main/big_vision/configs/proj/image_text/SigLIP2_demo.ipynb) | 17.03.2025 |

| STAR | Spatial Temporal Augmentation with T2V models for Real-world video super-resolution, a novel approach that leverages T2V models for real-world video super-resolution, achieving realistic spatial details and robust temporal consistency | 

[Rui Xie](https://github.com/CSRuiXie)
 [Yinhong Liu](https://github.com/yhliu04)
 [Penghao Zhou](https://scholar.google.com/citations?user=yWq1Fd4AAAAJ)
others[Chen Zhao](https://scholar.google.com/citations?user=Uhp3JKgAAAAJ)
 [Jun Zhou](https://scholar.google.com/citations?user=w03CHFwAAAAJ)
 [Kai Zhang](https://cszn.github.io/)
 [Zhenyu Zhang](https://jessezhang92.github.io/)
 [Jian Yang](https://scholar.google.com/citations?user=6CIDtZQAAAAJ)
 [Zhenheng Yang](https://scholar.google.com/citations?user=Ds5wwRoAAAAJ)
 [Ying Tai](https://tyshiwo.github.io/index.html)

 | [![](https://img.shields.io/github/stars/NJU-PCALab/STAR?style=social)](https://github.com/NJU-PCALab/STAR) 

[](https://arxiv.org/abs/2501.02976)

[](https://github.com/ali-vilab/VGen), [](https://github.com/Vchitect/VEnhancer), [](https://github.com/NJU-PCALab/OpenVid-1M), [](https://github.com/hpcaitech/Open-Sora/tree/main/tools/caption#pllava-captioning)

[](https://huggingface.co/spaces/SherryX/STAR)

[](https://artgor.medium.com/paper-review-star-spatial-temporal-augmentation-with-text-to-video-models-for-real-world-video-ff0ddcc6352f)

[project](https://nju-pcalab.github.io/projects/STAR)

[](https://youtu.be/hx0zrql-SrU)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1K8A1U_BNpAteRhhW9A8pAYs6LWjItQs_) | 22.01.2025 |

| InvSR | Image super-resolution technique based on diffusion inversion, aiming at harnessing the rich image priors encapsulated in large pre-trained diffusion models to improve SR performance | 

[Zongsheng Yue](https://zsyoaoa.github.io/)
 [Kang Liao](https://kangliao929.github.io/)
 [Chen Change Loy](https://www.mmlab-ntu.com/person/ccloy/)

 | [![](https://img.shields.io/github/stars/zsyOAOA/InvSR?style=social)](https://github.com/zsyOAOA/InvSR) 

[](https://arxiv.org/abs/2412.09013)

[](https://github.com/csjcai/RealSR)

[](https://huggingface.co/spaces/OAOA/InvSR), [](https://huggingface.co/stabilityai/sd-turbo)

[](https://youtu.be/mCd7ODSmols)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1hjgCFnAU4oUUhh9VRfTwsFN1AiIjdcSR) | 21.01.2025 |

| ModernBERT | Bringing modern model optimizations to encoder-only models and representing a major Pareto improvement over older encoders | 

[Benjamin Warner](https://benjaminwarner.dev/)
 [Antoine Chaffin](https://antoine.chaffin.fr/)
 [Benjamin Clavié](https://ben.clavie.eu/)
others[Orion Weller](https://orionweller.github.io/)
 [Oskar Hallström](https://github.com/ohallstrom)
 [Said Taghadouini](https://github.com/staghado)
 [Alexis Gallagher](https://alexisgallagher.com/)
 [Raja Biswas](https://github.com/rbiswasfc)
 [Faisal Ladhak](https://github.com/fladhak)
 [Tom Aarsen](https://www.tomaarsen.com/home)
 [Nathan Cooper](https://nathancooper.io/)
 [Griffin Adams](https://github.com/griff4692)
 [Jeremy Howard](https://jeremy.fast.ai/)
 [Iacopo Poli](https://github.com/iacolippo)

 | [![](https://img.shields.io/github/stars/AnswerDotAI/ModernBERT?style=social)](https://github.com/AnswerDotAI/ModernBERT) 

[](https://arxiv.org/abs/2412.13663)

[](https://github.com/mosaicml/examples/tree/main/examples/benchmarks/bert), [](https://github.com/lightonai/pylate)

[](https://huggingface.co/blog/modernbert)

[](https://medium.com/@sahin.samia/modernbert-the-next-generation-of-encoder-models-a-guide-to-using-and-fine-tuning-for-nlp-tasks-995a50d5232f)

[](https://youtu.be/Z1Dl3juwtSU), [](https://youtu.be/-kXUWeNcfUw), [](https://www.youtube.com/live/ZWo6Q8580sA), [](https://youtu.be/92HKsDHD9XI)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/AnswerDotAI/ModernBERT/blob/master/examples/finetune_modernbert_on_glue.ipynb) | 22.12.2024 |

| TAPIR | Tracking Any Point with per-frame Initialization and temporal Refinement | 

[Carl Doersch](http://www.carldoersch.com/)
 [Yi Yang](https://yangyi02.github.io/)
 [Mel Vecerik](https://scholar.google.com/citations?user=Jvi_XPAAAAAJ)
others[Dilara Gokay](https://scholar.google.com/citations?user=cnbENAEAAAAJ)
 [Ankush Gupta](https://ankushgupta.org/)
 [Yusuf Aytar](https://people.csail.mit.edu/yusuf/)
 [Joao Carreira](https://scholar.google.com/citations?user=IUZ-7_cAAAAJ)
 [Andrew Zisserman](https://www.robots.ox.ac.uk/~az/)

 | [![](https://img.shields.io/github/stars/google-deepmind/tapnet?style=social)](https://github.com/google-deepmind/tapnet) 

[](https://arxiv.org/abs/2306.08637), [](https://arxiv.org/abs/2308.15975)

[blog post](https://deepmind-tapir.github.io/), [blog post](https://deepmind-tapir.github.io/blogpost.html)

[](https://www.deepmind.com/open-source/kinetics)

[](https://github.com/google-research/kubric/tree/main/challenges/point_tracking)

[](https://medium.com/@jumabek4044/what-is-tapir-tracking-any-point-with-per-frame-initialization-and-temporal-refinement-and-how-it-bdad9946dc53)

[](https://proceedings.neurips.cc/paper_files/paper/2022/hash/58168e8a92994655d6da3939e7cc0918-Abstract-Datasets_and_Benchmarks.html)

[](https://youtu.be/2HSHofqoJ9M), [](https://youtu.be/I1DQJH3v7Nk)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/deepmind/tapnet/blob/master/colabs/causal_tapir_demo.ipynb) | 30.11.2024 |

| PuLID | Pure and Lightning ID customization, a tuning-free ID customization method for text-to-image generation | 

[Zinan Guo](https://github.com/guozinan126)
 [Yanze Wu](https://tothebeginning.github.io/)
 [Zhuowei Chen](https://scholar.google.com/citations?user=ow1jGJkAAAAJ)
others[Lang Chen](https://scholar.google.com/citations?user=h5xex20AAAAJ)
 [Qian He](https://scholar.google.com/citations?user=9rWWCgUAAAAJ)

 | [![](https://img.shields.io/github/stars/ToTheBeginning/PuLID?style=social)](https://github.com/ToTheBeginning/PuLID) 

[](https://arxiv.org/abs/2404.16022)

[](https://github.com/cubiq/PuLID_ComfyUI), [](https://github.com/ZHO-ZHO-ZHO/ComfyUI-PuLID-ZHO), [](https://github.com/Mikubill/sd-webui-controlnet/pull/2838)

[](https://www.reddit.com/r/comfyui/comments/1cnv269/pulid_pure_and_lightning_id_customization_via/)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/camenduru/PuLID-jupyter/blob/main/PuLID_jupyter.ipynb) | 09.11.2024 |

| CoTracker | Architecture that jointly tracks multiple points throughout an entire video | 

[Nikita Karaev](https://nikitakaraevv.github.io/)
 [Ignacio Rocco](https://www.irocco.info/)
 [Benjamin Graham](https://ai.meta.com/people/benjamin-graham/)
others[Natalia Neverova](https://nneverova.github.io/)
 [Andrea Vedaldi](https://www.robots.ox.ac.uk/~vedaldi/)
 [Christian Rupprecht](https://chrirupp.github.io/)

 | [![](https://img.shields.io/github/stars/facebookresearch/co-tracker?style=social)](https://github.com/facebookresearch/co-tracker) 

[](https://arxiv.org/abs/2307.07635), [](https://arxiv.org/abs/2303.11898)

[](https://github.com/benjiebob/BADJA)

[project](https://co-tracker.github.io/)

[](https://youtu.be/w5QVc7BVGPA)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/facebookresearch/co-tracker/blob/main/notebooks/demo.ipynb) | 16.10.2024 |

| Segment Anything 2 | Foundation model towards solving promptable visual segmentation in images and videos | 

[Nikhila Ravi](https://nikhilaravi.com/)
 [Valentin Gabeur](https://gabeur.github.io/)
 [Yuan-Ting Hu](https://scholar.google.com/citations?user=E8DVVYQAAAAJ)
others[Ronghang Hu](https://ronghanghu.com/)
 [Chaitanya Ryali](https://scholar.google.com/citations?user=4LWx24UAAAAJ)
 [Tengyu Ma](https://scholar.google.com/citations?user=VeTSl0wAAAAJ)
 [Haitham Khedr](https://hkhedr.com/)
 [Roman Rädle](https://scholar.google.de/citations?user=Tpt57v0AAAAJ)
 [Chloé Rolland](https://scholar.google.com/citations?user=n-SnMhoAAAAJ)
 [Laura Gustafson](https://scholar.google.com/citations?user=c8IpF9gAAAAJ)
 [Eric Mintun](https://ericmintun.github.io/)
 [Junting Pan](https://junting.github.io/)
 [Kalyan Vasudev](lwala](https://scholar.google.co.in/citations?user=m34oaWEAAAAJ)
 [Nicolas Carion](https://www.nicolascarion.com/)
 [Chao-Yuan](u](https://chaoyuan.org/)
 [Ross Girshick](https://www.rossgirshick.info/)
 [Piotr Dollár](https://pdollar.github.io/)
 [Christoph Feichtenhofer](https://feichtenhofer.github.io/)

 | [![](https://img.shields.io/github/stars/facebookresearch/segment-anything-2?style=social)](https://github.com/facebookresearch/segment-anything-2) 

[](https://arxiv.org/abs/2408.00714)

[demo](https://sam2.metademolab.com/)

[](https://github.com/zsef123/Connected_components_PyTorch)

[](https://huggingface.co/models?search=facebook/sam2)

[](https://ai.meta.com/research/publications/sam-2-segment-anything-in-images-and-videos/), [](https://ai.meta.com/datasets/segment-anything-video), [](https://ai.meta.com/blog/segment-anything-2)

[project](https://ai.meta.com/sam2/)

[](https://x.com/AIatMeta/status/1818055906179105010)

[](https://www.youtube.com/watch?v=w-cmMcMZoZ4&t=2325s), [](https://youtu.be/O8QdvZbRDp4), [](https://www.youtube.com/live/Dv003fTyO-Y), [](https://youtu.be/IW7jFq3vQbw)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/facebookresearch/segment-anything-2/blob/main/notebooks/image_predictor_example.ipynb) | 01.10.2024 |

| Deep Painterly Harmonization | Algorithm produces significantly better results than photo compositing or global stylization techniques and that it enables creative painterly edits that would be otherwise difficult to achieve | 

[Fujun Luan](https://luanfujun.github.io/)
 [Sylvain Paris](http://people.csail.mit.edu/sparis/)
 [Eli Shechtman](https://research.adobe.com/person/eli-shechtman/)
 [Kavita Bala](https://www.cs.cornell.edu/~kb/)

 | [![](https://img.shields.io/github/stars/luanfujun/deep-painterly-harmonization?style=social)](https://github.com/luanfujun/deep-painterly-harmonization) 

[](https://arxiv.org/abs/1804.03189), [](https://arxiv.org/abs/1701.08893)

[](https://github.com/jcjohnson/neural-style), [](https://github.com/torch/torch7), [](https://github.com/szagoruyko/loadcaffe)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/gist/eyaler/5303782669fb43510d398bd346c6e3e6/deep-painterly-harmonization.ipynb) | 23.09.2024 |

| audio2photoreal | Framework for generating full-bodied photorealistic avatars that gesture according to the conversational dynamics of a dyadic interaction | 

[Evonne Ng](https://people.eecs.berkeley.edu/~evonne_ng/)
 [Javier Romero](https://scholar.google.com/citations?user=Wx62iOsAAAAJ)
 [Timur Bagautdinov](https://scholar.google.ch/citations?user=oLi7xJ0AAAAJ)
others[Shaojie Bai](https://jerrybai1995.github.io/)
 [Trevor Darrell](https://people.eecs.berkeley.edu/~trevor/)
 [Angjoo Kanazawa](https://people.eecs.berkeley.edu/~kanazawa/)
 [Alexander Richard](https://alexanderrichard.github.io/)

 | [![](https://img.shields.io/github/stars/facebookresearch/audio2photoreal?style=social)](https://github.com/facebookresearch/audio2photoreal) 

[](https://arxiv.org/abs/2401.01885)

[](https://github.com/facebookresearch/ca_body)

[project](https://people.eecs.berkeley.edu/~evonne_ng/projects/audio2photoreal/)

[](https://youtu.be/Y0GMaMtUynQ)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1A6EwKM3PeX7dcKV66zxQWuP-v_dKlX_0) | 13.09.2024 |

| Fast Segment Anything | CNN Segment Anything Model trained using only 2% of the SA-1B dataset published by SAM authors | 

[Xu Zhao](https://scholar.google.com/citations?user=F0cYEyAAAAAJ)
 [Wenchao Ding](https://github.com/berry-ding)
 [Yongqi An](https://github.com/an-yongqi)
others[Yinglong Du](https://github.com/YinglongDu)
 [Tao Yu](https://github.com/tianjinren)
 [Min Li](https://github.com/limin2021)
 [Ming Tang](https://www.researchgate.net/profile/Ming-Tang-2)
 [Jinqiao Wang](https://scholar.google.com/citations?user=7_BkyxEAAAAJ)

 | [![](https://img.shields.io/github/stars/CASIA-IVA-Lab/FastSAM?style=social)](https://github.com/CASIA-IVA-Lab/FastSAM) 

[](https://arxiv.org/abs/2306.12156), [](https://arxiv.org/abs/2112.10003)

[](https://github.com/ChuRuaNh0/FastSam_Awsome_TensorRT)

[](https://medium.com/@mahimairaja/so-what-exactly-is-fastsam-the-ultimate-guide-ddae21d3b486)

[](https://youtu.be/yHNPyqazYYU), [](https://youtu.be/SslzS0AsiAw), [](https://www.youtube.com/live/qvqkjP1wCDE)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1oX14f6IneGGw612WgVlAiy91UHwFAvr9) | 10.09.2024 |

| Neuralangelo | Framework for high-fidelity 3D surface reconstruction from RGB video captures | 

[Zhaoshuo Li](https://mli0603.github.io/)
 [Thomas Müller](https://tom94.net/)
 [Alex Evans](https://scholar.google.com/citations?user=ToqGImkAAAAJ)
others[Russell Taylor](https://www.cs.jhu.edu/~rht/)
 [Mathias Unberath](https://mathiasunberath.github.io/)
 [Ming-Yu Liu](https://mingyuliu.net/)
 [Chen-Hsuan Lin](https://chenhsuanlin.bitbucket.io/)

 | [![](https://img.shields.io/github/stars/NVlabs/neuralangelo?style=social)](https://github.com/NVlabs/neuralangelo) 

[](https://arxiv.org/abs/2306.03092)

[blog post](https://blogs.nvidia.com/blog/2023/06/01/neuralangelo-ai-research-3d-reconstruction/)

[](https://github.com/mli0603/BlenderNeuralangelo)

[project](https://research.nvidia.com/labs/dir/neuralangelo/)

[](https://youtu.be/PQMNCXR-WF8), [](https://youtu.be/Qpdw3SW54kI), [](https://youtu.be/lC2uPDfaTcE)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1i16s8W_OV0Hd3-PIuo64JKDwwdOesgXQ) | 02.09.2024 |

| YOLOv10 | Aim to further advance the performance-efficiency boundary of YOLOs from both the post-processing and model architecture | 

[Ao Wang](https://github.com/jameslahm)
 [Hui Chen](https://huichen24.github.io/)
 [Kai Chen](https://scholar.google.com/citations?user=bZQX708AAAAJ)
others[Zijia Lin](https://sites.google.com/site/linzijia72)
 [Jungong Han](https://jungonghan.github.io/)
 [Guiguang Ding](https://scholar.google.com/citations?user=B7F3yt4AAAAJ)

 | [![](https://img.shields.io/github/stars/THU-MIG/yolov10?style=social)](https://github.com/THU-MIG/yolov10) 

[](https://arxiv.org/abs/2405.14458)

[blog post](https://learnopencv.com/yolov10/)

[demo](https://openbayes.com/console/public/tutorials/im29uYrnIoz)

[](https://github.com/rlggyp/YOLOv10-OpenVINO-CPP-Inference), [](https://github.com/Seeed-Projects/jetson-examples/blob/main/reComputer/scripts/yolov10/README.md), [](https://github.com/kaylorchen/rk3588-yolo-demo), [](https://github.com/openvinotoolkit/openvino_notebooks/blob/latest/notebooks/yolov10-optimization/yolov10-optimization.ipynb), [](https://github.com/sujanshresstha/YOLOv10_DeepSORT), [](https://github.com/CVHub520/X-AnyLabeling), [](https://github.com/DanielSarmiento04/yolov10cpp), [](https://github.com/lyuwenyu/RT-DETR)

[](https://huggingface.co/collections/jameslahm/yolov10-665b0d90b0b5bb85129460c2), [](https://huggingface.co/spaces/jameslahm/YOLOv10), [](https://huggingface.co/spaces/kadirnar/Yolov10), [](https://huggingface.co/spaces/Xenova/yolov10-web)

[](https://medium.com/@batuhansenerr/yolov10-custom-object-detection-bd7298ddbfd3), [](https://medium.com/@sunidhi.ashtekar/yolov10-revolutionizing-real-time-object-detection-72ef04ad441a)

[](https://www.reddit.com/r/GPTFutureScience/comments/1d34rj1/yolov10_the_future_of_realtime_object_detection/)

[](https://youtu.be/29tnSxhB3CY), [](https://youtu.be/2ZFJbeJXXDM), [](https://youtu.be/wM6nO75keOQ)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/roboflow-ai/notebooks/blob/main/notebooks/train-yolov10-object-detection-on-custom-dataset.ipynb) | 20.08.2024 |

| SpecVQGAN | Taming the visually guided sound generation by shrinking a training dataset to a set of representative vectors | 

[Vladimir Iashin](https://iashin.ai/)
 [Esa Rahtu](https://esa.rahtu.fi/)

 | [![](https://img.shields.io/github/stars/v-iashin/SpecVQGAN?style=social)](https://github.com/v-iashin/SpecVQGAN) 

[](http://arxiv.org/abs/2110.08791), [](https://arxiv.org/abs/2012.09841), [](https://arxiv.org/abs/1711.00937), [](https://arxiv.org/abs/2008.00820), [](https://arxiv.org/abs/1712.01393), [](https://arxiv.org/abs/1512.08512)

[](https://github.com/PeihaoChen/regnet), [](https://github.com/toshas/torch-fidelity), [](https://github.com/descriptinc/melgan-neurips), [](https://github.com/google/lyra)

[project](https://iashin.ai/SpecVQGAN)

[](https://en.wikipedia.org/wiki/Foley_(filmmaking)), [](https://en.wikipedia.org/wiki/Row-_and_column-major_order), [](https://en.wikipedia.org/wiki/Kullback%E2%80%93Leibler_divergence)

[](https://www.youtube.com/watch?v=Bucb3nAa398)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1pxTIMweAKApJZ3ZFqyBee3HtMqFpnwQ0) | 12.07.2024 |

| LivePortrait | Video-driven portrait animation framework with a focus on better generalization, controllability, and efficiency for practical usage | 

[Jianzhu Guo](https://guojianzhu.com/)
 [Dingyun Zhang](https://github.com/DingyunZhang)
 [Xiaoqiang Liu](https://github.com/Liu-lxq)
others[Zhizhou Zhong](https://scholar.google.com/citations?user=t88nyvsAAAAJ)
 [Yuan Zhang](https://scholar.google.com/citations?user=_8k1ubAAAAAJ)
 [Pengfei Wan](https://scholar.google.com/citations?user=P6MraaYAAAAJ)
 [Di Zhang](https://openreview.net/profile?id=~Di_ZHANG3)

 | [![](https://img.shields.io/github/stars/KwaiVGI/LivePortrait?style=social)](https://github.com/KwaiVGI/LivePortrait) 

[](https://arxiv.org/abs/2407.03168)

[](https://github.com/kijai/ComfyUI-LivePortraitKJ), [](https://github.com/shadowcz007/comfyui-liveportrait), [](https://github.com/zhanglonghao1992/One-Shot_Free-View_Neural_Talking_Head_Synthesis), [](https://github.com/NVlabs/SPADE), [](https://github.com/deepinsight/insightface)

[](https://huggingface.co/spaces/KwaiVGI/LivePortrait)

[project](https://liveportrait.github.io/)

[](https://www.reddit.com/r/StableDiffusion/comments/1dvepjx/liveportrait_efficient_portrait_animation_with/)

[](https://youtu.be/uyjSTAOY7yI), [](https://youtu.be/8-IcDDmiUMM), [](https://youtu.be/aFcS31OWMjE), [](https://youtu.be/bRHf2oQwgG4), [](https://youtu.be/FPtpNrmuwXk), [](https://youtu.be/wG7oPp01COg)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/camenduru/LivePortrait-jupyter/blob/main/LivePortrait_jupyter.ipynb) | 10.07.2024 |

| StoryDiffusion | Way of self-attention calculation, termed Consistent Self-Attention, that significantly boosts the consistency between the generated images and augments prevalent pretrained diffusion-based text-to-image models in a zero-shot manner | 

[Yupeng Zhou](https://mmcheng.net/zyp/)
 [Daquan Zhou](https://github.com/zhoudaquan)
 [Ming-Ming Cheng](https://mmcheng.net/cmm/)
others[Jiashi Feng](https://sites.google.com/site/jshfeng/?pli=1)
 [Qibin Hou](https://houqb.github.io/)

 | [![](https://img.shields.io/github/stars/HVision-NKU/StoryDiffusion?style=social)](https://github.com/HVision-NKU/StoryDiffusion) 

[](https://arxiv.org/abs/2405.01434)

[](https://youtu.be/GeNyP4VY9rE?si=qW1jcW_GbKutmKQv)

[project](https://storydiffusion.github.io/)

[](https://www.reddit.com/r/StoryDiffusion/)

[](https://youtu.be/jZWRENqCl6I), [](https://youtu.be/GeNyP4VY9rE)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/HVision-NKU/StoryDiffusion/blob/main/Comic_Generation.ipynb) | 04.05.2024 |

| VoiceCraft | token infilling neural codec language model, that achieves state-of-the-art performance on both speech editing and zero-shot text-to-speech on audiobooks, internet videos, and podcasts | 

[Puyuan Peng](https://jasonppy.github.io/)
 [Po-Yao Huang](https://berniebear.github.io/)
 [Shang-Wen Li](https://swdanielli.github.io/)
others[Abdelrahman Mohamed](https://www.cs.toronto.edu/~asamir/)
 [David Harwath](https://www.cs.utexas.edu/~harwath/)

 | [![](https://img.shields.io/github/stars/jasonppy/VoiceCraft?style=social)](https://github.com/jasonppy/VoiceCraft) 

[](https://arxiv.org/abs/2403.16973)

[](https://github.com/lifeiteng/vall-e)

[](https://huggingface.co/pyp1/VoiceCraft)

[project](https://jasonppy.github.io/VoiceCraft_web/)

[](https://www.reddit.com/r/LocalLLaMA/comments/1bmxfk3/voicecraft_zeroshot_speech_editing_and/)

[](https://youtu.be/eikybOi8iwU), [](https://youtu.be/PJ2qSjycLcw), [](https://youtu.be/JxRrHpq-hys)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/jasonppy/VoiceCraft/blob/master/voicecraft-gradio-colab.ipynb) | 21.04.2024 |

| ZeST | Method for zero-shot material transfer to an object in the input image given a material exemplar image | 

[Ta-Ying Cheng](https://ttchengab.github.io/)
 [Prafull Sharma](https://prafullsharma.net/)
 [Andrew Markham](https://www.cs.ox.ac.uk/people/andrew.markham/)
others[Niki Trigoni](https://www.cs.ox.ac.uk/people/niki.trigoni/)
 [Varun Jampani](https://varunjampani.github.io/)

 | [![](https://img.shields.io/github/stars/ttchengab/zest_code?style=social)](https://github.com/ttchengab/zest_code) 

[](https://arxiv.org/abs/2404.06425)

[](https://github.com/kealiu/ComfyUI-ZeroShot-MTrans)

[](https://huggingface.co/h94/IP-Adapter), [](https://github.com/intel-isl/DPT/releases/download/1_0/dpt_hybrid-midas-501f0c75.pt)

[](https://xthemadgenius.medium.com/zest-unlocks-material-magic-in-single-image-transfers-05f7ff7ee483)

[project](https://ttchengab.github.io/zest/)

[](https://www.reddit.com/r/learnmachinelearning/comments/1c0wpjd/zest_zeroshot_material_transfer_from_a_single/)

[](https://youtu.be/atG1VvgeG_g)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/camenduru/zest-jupyter/blob/main/zest_jupyter.ipynb) | 16.04.2024 |

| InstantMesh | Feed-forward framework for instant 3D mesh generation from a single image, featuring state-of-the-art generation quality and significant training scalability | 

[Jiale Xu](https://github.com/bluestyle97)
 [Weihao Cheng](https://www.cheng.website/)
 [Yiming Gao](https://scholar.google.com/citations?user=uRCc-McAAAAJ)
others[Xintao Wang](https://xinntao.github.io/)
 [Shenghua Gao](https://scholar.google.com/citations?user=fe-1v0MAAAAJ)
 [Ying Shan](https://scholar.google.com/citations?user=4oXBp9UAAAAJ)

 | [![](https://img.shields.io/github/stars/TencentARC/InstantMesh?style=social)](https://github.com/TencentARC/InstantMesh) 

[](https://arxiv.org/abs/2404.07191)

[](https://github.com/danielgatis/rembg), [](https://github.com/3DTopia/OpenLRM), [](https://github.com/nv-tlabs/FlexiCubes)

[](https://huggingface.co/TencentARC/InstantMesh)

[](https://www.reddit.com/r/StableDiffusion/comments/1c5hs3e/instantmesh_efficient_3d_mesh_generation_from_a/)

[](https://youtu.be/BvngSJOStvQ)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/camenduru/InstantMesh-jupyter/blob/main/InstantMesh_jupyter.ipynb) | 16.04.2024 |

| Würstchen | Architecture for text-to-image synthesis that combines competitive performance with unprecedented cost-effectiveness for large-scale text-to-image diffusion models | 

[Pablo Pernias](https://github.com/pabloppp)
 [Dominic Rampas](https://github.com/dome272)
 [Mats Richter](https://scholar.google.com/citations?user=xtlV5SAAAAAJ)
others[Christopher Pal](https://www.polymtl.ca/expertises/pal-christopher-j)
 [Marc Aubreville](https://lme.tf.fau.de/person/aubreville/)

 | [![](https://img.shields.io/github/stars/dome272/wuerstchen?style=social)](https://github.com/dome272/wuerstchen) 

[](https://arxiv.org/abs/2306.00637)

[](https://huggingface.co/blog/wuerstchen)

[](https://www.reddit.com/r/StableDiffusion/comments/16hsklt/w%C3%BCrstchen_is_here_a_game_changing_fastest/)

[](https://youtu.be/ogJsCPqgFMk)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/dome272/Wuerstchen/blob/main/w%C3%BCrstchen-stage-C.ipynb) | 06.04.2024 |

| AudioSep | Foundation model for open-domain audio source separation with natural language queries | 

[Xubo Liu](https://liuxubo717.github.io/)
 [Qiuqiang Kong](https://qiuqiangkong.github.io/)
 [Yan Zhao](https://cliffzhao.github.io/)
others[Haohe Liu](https://haoheliu.github.io/)
 [Yi Yuan](https://www.surrey.ac.uk/people/yi-yuan)
 [Yuzhuo Liu](https://github.com/redrabbit94)
 [Rui Xia](https://scholar.google.co.uk/citations?user=26oErxwAAAAJ)
 [Yuxuan Wang](https://scholar.google.com/citations?user=3RaOfJkAAAAJ)
 [Mark Plumbley](https://www.surrey.ac.uk/people/mark-plumbley)
 [Wenwu Wang](http://personal.ee.surrey.ac.uk/Personal/W.Wang/)

 | [![](https://img.shields.io/github/stars/Audio-AGI/AudioSep?style=social)](https://github.com/Audio-AGI/AudioSep) 

[](https://arxiv.org/abs/2308.05037)

[project](https://audio-agi.github.io/Separate-Anything-You-Describe/)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/Audio-AGI/AudioSep/blob/main/AudioSep_Colab.ipynb) | 15.03.2024 |

| AQLM | Extreme Compression of Large Language Models via Additive Quantization | 

[Vage Egiazarian](https://github.com/Vahe1994)
 [Andrei Panferov](https://blog.panferov.org/)
 [Denis Kuznedelev](https://github.com/Godofnothing)
others[Elias Frantar](https://efrantar.github.io/)
 [Artem Babenko](https://scholar.google.com/citations?user=2Kv3JP0AAAAJ)
 [Dan Alistarh](https://github.com/dalistarh)

 | [![](https://img.shields.io/github/stars/Vahe1994/AQLM?style=social)](https://github.com/Vahe1994/AQLM) 

[](https://arxiv.org/abs/2401.06118)

[](https://huggingface.co/docs/datasets/main/en/cache#cache-directory), [](https://huggingface.co/datasets/togethercomputer/RedPajama-Data-1T-Sample), [](https://huggingface.co/datasets/Vahe1994/AQLM)

[](https://www.reddit.com/r/LearningMachines/comments/1atvrnl/240106118_extreme_compression_of_large_language/)

[](https://youtu.be/Qx8PNk4OkUA), [](https://youtu.be/hAHBKAXO-88)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/Vahe1994/AQLM/blob/main/notebooks/colab_example.ipynb) | 08.03.2024 |

| YOLOv9 | Learning What You Want to Learn Using Programmable Gradient Information | 

[Chien-Yao Wang](https://scholar.google.com/citations?user=DkQh4M4AAAAJ)
 [I-Hau Yeh](https://ieeexplore.ieee.org/author/37088448531)
 [Hong-Yuan Mark Liao](https://homepage.iis.sinica.edu.tw/pages/liao/index_zh.html)

 | [![](https://img.shields.io/github/stars/WongKinYiu/yolov9?style=social)](https://github.com/WongKinYiu/yolov9) 

[](https://arxiv.org/abs/2402.13616), [](https://arxiv.org/abs/2309.16921)

[blog post](https://learnopencv.com/yolov9-advancing-the-yolo-legacy/)

[](https://github.com/WongKinYiu/yolor), [](https://github.com/VDIGPKU/DynamicDet), [](https://github.com/DingXiaoH/RepVGG)

[](https://huggingface.co/spaces/kadirnar/Yolov9), [](https://huggingface.co/merve/yolov9)

[](https://medium.com/@Mert.A/how-to-use-yolov9-for-object-detection-93598ad88d7d)

[](https://youtu.be/XHT2c8jT3Bc), [](https://youtu.be/3iLJ6YWPg28), [](https://youtu.be/dccf_sJF0Gg)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/roboflow-ai/notebooks/blob/main/notebooks/train-yolov9-object-detection-on-custom-dataset.ipynb) | 05.03.2024 |

| Multi-LoRA Composition | LoRA Switch and LoRA Composite, approaches that aim to surpass traditional techniques in terms of accuracy and image quality, especially in complex compositions | 

[Ming Zhong](https://maszhongming.github.io/)
 [Yelong Shen](https://scholar.google.com/citations?user=S6OFEFEAAAAJ)
 [Shuohang Wang](https://www.microsoft.com/en-us/research/people/shuowa/)
others[Yadong Lu](https://adamlu123.github.io/)
 [Yizhu Jiao](https://yzjiao.github.io/)
 [Siru Ouyang](https://ozyyshr.github.io/)
 [Donghan Yu](https://plusross.github.io/)
 [Jiawei Han](https://hanj.cs.illinois.edu/)
 [Weizhu Chen](https://www.microsoft.com/en-us/research/people/wzchen/)

 | [![](https://img.shields.io/github/stars/maszhongming/Multi-LoRA-Composition?style=social)](https://github.com/maszhongming/Multi-LoRA-Composition) 

[](https://arxiv.org/abs/2402.16843)

[](https://medium.com/@letscodeai/multi-lora-composition-for-image-generation-f2706528c590)

[](https://www.reddit.com/r/ninjasaid13/comments/1b13q8s/multilora_composition_for_image_generation/)

[](https://x.com/MingZhong_/status/1762347881812443575?s=20)

[website](https://maszhongming.github.io/Multi-LoRA-Composition/)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1eSTj6qGOtSY5NaazwwN3meXOzEZxgaZq) | 03.03.2024 |

| AMARETTO | Multiscale and multimodal inference of regulatory networks to identify cell circuits and their drivers shared and distinct within and across biological systems of human disease | 

[Nathalie Pochet](http://portals.broadinstitute.org/pochetlab/)
 [Olivier Gevaert](https://profiles.stanford.edu/olivier-gevaert)
 [Mohsen Nabian](https://github.com/monabiyan)
others[Jayendra Shinde](https://jayendrashinde91.github.io/)
 [Celine Everaert](http://www.crig.ugent.be/en/node/510)
 [Thorin Tabor](http://thorin.tabcreations.com/)

 | [![](https://img.shields.io/github/stars/gevaertlab/AMARETTO?style=social)](https://github.com/gevaertlab/AMARETTO) 

[bioconductor](https://bioconductor.org/packages/release/bioc/html/AMARETTO.html)

[project](http://portals.broadinstitute.org/pochetlab/amaretto.html)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1JfnRoNgTVX_7VEGAAmjGjwP_yX2tdDxs) | 28.02.2024 |

| ViT | Vision Transformer and MLP-Mixer Architectures | 

[Alexey Dosovitskiy](https://scholar.google.com/citations?user=FXNJRDoAAAAJ)
 [Lucas Beyer](http://lucasb.eyer.be)
 [Alexander Kolesnikov](https://github.com/akolesnikoff)
others[Dirk Weissenborn](https://github.com/dirkweissenborn)
 [Xiaohua Zhai](https://github.com/xiaohuazhai)
 [Thomas Unterthiner](https://github.com/untom)
 [Mostafa Dehghani](https://www.mostafadehghani.com/)
 [Matthias Minderer](https://matthias.minderer.net/)
 [Georg Heigold](https://scholar.google.com/citations?user=WwqlChAAAAAJ)
 [Sylvain Gelly](https://scholar.google.com/citations?user=m7LvuTkAAAAJ)
 [Jakob Uszkoreit](https://scholar.google.com/citations?user=mOG0bwsAAAAJ)
 [Neil Houlsby](https://neilhoulsby.github.io/)

 | [![](https://img.shields.io/github/stars/google-research/vision_transformer?style=social)](https://github.com/google-research/vision_transformer) 

[](https://arxiv.org/abs/2010.11929), [](https://arxiv.org/abs/2105.01601), [](https://arxiv.org/abs/2105.01601), [](https://arxiv.org/abs/2106.10270), [](https://arxiv.org/abs/2106.01548), [](https://arxiv.org/abs/2111.07991), [](https://arxiv.org/abs/2203.08065)

[blog post](https://blog.research.google/2022/04/locked-image-tuning-adding-language.html)

[](https://github.com/huggingface/pytorch-image-models), [](https://github.com/google/flaxformer)

[](https://www.kaggle.com/models)

[](https://medium.com/@weiwen21/an-image-is-worth-16x16-words-transformers-for-image-recognition-at-scale-957f88e53726)

[](https://youtu.be/TrdevFK_am4), [](https://youtu.be/HZ4j_U3FC94), [](https://youtu.be/7K4Z8RqjWIk), [](https://youtu.be/oDtcobGQ7xU?si=C2EgZTESzhTXFSq6), [](https://youtu.be/v6xj_DG-UEo)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/google-research/vision_transformer/blob/main/vit_jax.ipynb) | 06.02.2024 |

| Qwen | Comprehensive language model series that encompasses distinct models with varying parameter counts | [qwenlm](https://qwenlm.github.io/) | [![](https://img.shields.io/github/stars/QwenLM/Qwen?style=social)](https://github.com/QwenLM/Qwen) 

[](https://arxiv.org/abs/2309.16609), [](https://arxiv.org/abs/2106.09685), [](https://arxiv.org/abs/2305.14314), [](https://arxiv.org/abs/2307.11088)

[](https://discord.gg/CV4E9rpNSD)

[](https://hub.docker.com/r/qwenllm/qwen)

[](https://github.com/QwenLM/Qwen2.5), [](https://github.com/QwenLM/Qwen-Agent), [](https://github.com/QwenLM/qwen.cpp), [](https://github.com/Dao-AILab/flash-attention), [](https://github.com/AutoGPTQ/AutoGPTQ), [](https://github.com/microsoft/DeepSpeed)

[](https://huggingface.co/Qwen)

[](https://pytorch.org/docs/stable/elastic/run.html)

[](https://youtu.be/y6Wh4SpRoao), [](https://youtu.be/bJmx_fAOW78), [](https://youtu.be/ALArhCnz8rY)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/QwenLM/Qwen/blob/master/recipes/quickstart/qwen.ipynb) | 30.01.2024 |

| VALL-E X | Cross-lingual neural codec language model for cross-lingual speech synthesis | 

[Ziqiang Zhang](https://github.com/onisac-K)
 [Long Zhou](https://long-zhou.github.io/)
 [Chengyi Wang](https://cywang97.github.io/)
others[Sanyuan Chen](https://sanyuan-chen.github.io/)
 [Yu Wu](https://www.microsoft.com/en-us/research/people/yuwu1/)
 [Shujie Liu](https://www.microsoft.com/en-us/research/people/shujliu/)
 [Zhuo Chen](https://www.microsoft.com/en-us/research/people/zhuc/)
 [Yanqing Liu](https://scholar.google.com/citations?user=dIJFz4UAAAAJ)
 [Huaming Wang](https://scholar.google.com/citations?user=aJDLg5IAAAAJ)
 [Jinyu Li](https://www.microsoft.com/en-us/research/people/jinyli/)
 [Lei He](https://scholar.google.com/citations?user=EKl9yY8AAAAJ)
 [Sheng Zhao](https://scholar.google.com/citations?user=689bIIwAAAAJ)
 [Furu Wei](https://www.microsoft.com/en-us/research/people/fuwei/)

 | [![](https://img.shields.io/github/stars/Plachtaa/VALL-E-X?style=social)](https://github.com/Plachtaa/VALL-E-X) 

[](https://arxiv.org/abs/2303.03926), [](https://arxiv.org/abs/2301.02111), [](https://arxiv.org/abs/2209.03143)

[demo](https://plachtaa.github.io/)

[](https://discord.gg/qCBRmAnTxg)

[](https://github.com/lifeiteng/vall-e)

[](https://huggingface.co/Plachta/VALL-E-X)

[](https://medium.com/syncedreview/speak-a-foreign-language-in-your-own-voice-1dafa42f78d9)

[project](https://www.microsoft.com/en-us/research/project/vall-e-x)

[](https://youtu.be/7qgfoVFQmvk)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1yyD_sz531QntLKowMHo-XxorsFBCfKul) | 19.01.2024 |

| PhotoMaker | Efficient personalized text-to-image generation method, which mainly encodes an arbitrary number of input ID images into a stack ID embedding for preserving ID information | 

[Zhen Li](https://paper99.github.io/)
 [Mingdeng Cao](https://github.com/ljzycmd)
 [Xintao Wang](https://xinntao.github.io/)
others[Zhongang Qi](https://scholar.google.com/citations?user=zJvrrusAAAAJ)
 [Ming-Ming Cheng](https://mmcheng.net/cmm/)
 [Ying Shan](https://scholar.google.com/citations?user=4oXBp9UAAAAJ)

 | [![](https://img.shields.io/github/stars/TencentARC/PhotoMaker?style=social)](https://github.com/TencentARC/PhotoMaker) 

[](https://arxiv.org/abs/2312.04461)

[](https://github.com/bmaltais/PhotoMaker), [](https://github.com/sdbds/PhotoMaker-for-windows), [](https://github.com/ZHO-ZHO-ZHO/ComfyUI-PhotoMaker), [](https://github.com/mit-han-lab/fastcomposer), [](https://github.com/TencentARC/T2I-Adapter), [](https://github.com/tencent-ailab/IP-Adapter)

[](https://huggingface.co/TencentARC/PhotoMaker)

[](https://medium.com/@christopheverdier/photomaker-the-art-of-ai-consistent-characters-generation-cf2cd037bc3e)

[project](https://photo-maker.github.io/)

[](https://www.reddit.com/r/StableDiffusion/comments/197bfj9/tencentarc_releases_photomaker/)

[](https://youtu.be/NWIdzTEk5O4), [](https://youtu.be/ZTck128jfFY)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/TencentARC/PhotoMaker/blob/main/photomaker_demo.ipynb) | 18.01.2024 |

| DDColor | End-to-end method with dual decoders for image colorization | 

[Xiaoyang Kang](https://piddnad.github.io/xiaoyangkang)
 [Tao Yang](https://cg.cs.tsinghua.edu.cn/people/~tyang/)
 [Wenqi Ouyang](https://vicky0522.github.io/Wenqi-Ouyang/)
others[Peiran Ren](https://scholar.google.com/citations?user=x5dEuxsAAAAJ)
 [Lingzhi Li](https://lingzhili.com/)
 [Xuansong Xie](https://github.com/xungie)

 | [![](https://img.shields.io/github/stars/piddnad/DDColor?style=social)](https://github.com/piddnad/DDColor) 

[](https://arxiv.org/abs/2212.11613)

[](https://github.com/jixiaozhong/ColorFormer), [](https://github.com/KIMGEONUNG/BigColor)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/camenduru/DDColor-colab/blob/main/DDColor_colab.ipynb) | 15.01.2024 |

| PASD | Pixel-aware stable diffusion network to achieve robust Real-ISR as well as personalized stylization | 

[Tao Yang](https://cg.cs.tsinghua.edu.cn/people/~tyang)
 [Peiran Ren](http://renpr.org/)
 [Xuansong Xie](https://github.com/xungie)
 [Lei Zhang](https://www4.comp.polyu.edu.hk/~cslzhang)

 | [![](https://img.shields.io/github/stars/yangxy/PASD?style=social)](https://github.com/yangxy/PASD) 

[](https://arxiv.org/abs/2308.14469)

[](https://github.com/pkuliyi2015/multidiffusion-upscaler-for-automatic1111)

[](https://huggingface.co/runwayml/stable-diffusion-v1-5), [](https://huggingface.co/nitrosocke/mo-di-diffusion)

[](https://www.reddit.com/r/StableDiffusion/comments/18qxe5q/pixelaware_stable_diffusion_for_realistic_image/)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1lZ_-rSGcmreLCiRniVT973x6JLjFiC-b) | 12.01.2024 |

| HandRefiner | Refining Malformed Hands in Generated Images by Diffusion-based Conditional Inpainting | 

[Wenquan Lu](https://github.com/wenquanlu)
 [Yufei Xu](https://scholar.google.com/citations?user=hlYWxX8AAAAJ)
 [Jing Zhang](https://scholar.google.com/citations?user=9jH5v74AAAAJ)
others[Chaoyue Wang](https://wang-chaoyue.github.io/)
 [Dacheng Tao](https://scholar.google.com/citations?user=RwlJNLcAAAAJ)

 | [![](https://img.shields.io/github/stars/wenquanlu/HandRefiner?style=social)](https://github.com/wenquanlu/HandRefiner) 

[](https://arxiv.org/abs/2311.17957)

[](https://github.com/Fannovel16/comfyui_controlnet_aux), [](https://github.com/Mikubill/sd-webui-controlnet), [](https://github.com/microsoft/MeshGraphormer)

[](https://www.reddit.com/r/StableDiffusion/comments/1881z4v/handrefiner_refining_malformed_hands_in_generated/)

[](https://youtu.be/Tt-Fyn1RA6c)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/camenduru/HandRefiner-colab/blob/main/HandRefiner_colab.ipynb) | 08.01.2024 |

| LLaVA | Large Language and Vision Assistant, an end-to-end trained large multimodal model that connects a vision encoder and LLM for general-purpose visual and language understanding | 

[Haotian Liu](https://hliu.cc/)
 [Chunyuan Li](https://chunyuan.li/)
 [Qingyang Wu](https://qywu.github.io/)
others[Yong Jae Lee](https://pages.cs.wisc.edu/~yongjaelee/)
 [Yuheng Li](https://yuheng-li.github.io/)

 | [![](https://img.shields.io/github/stars/haotian-liu/LLaVA?style=social)](https://github.com/haotian-liu/LLaVA) 

[](https://arxiv.org/abs/2304.08485), [](https://arxiv.org/abs/2310.03744), [](https://arxiv.org/abs/2306.00890), [](https://arxiv.org/abs/2309.09958), [](https://arxiv.org/abs/2306.14895)

[demo](https://llava.hliu.cc/)

[](https://github.com/ggerganov/llama.cpp/pull/3436), [](https://github.com/microsoft/LLaVA-Med), [](https://github.com/UX-Decoder/Segment-Everything-Everywhere-All-At-Once), [](https://github.com/Luodian/Otter), [](https://github.com/Instruction-Tuning-with-GPT-4/GPT-4-LLM)

[](https://huggingface.co/datasets/liuhaotian/LLaVA-Pretrain), [](https://huggingface.co/liuhaotian/LLaVA-Pretrained-Projectors)

[](https://xthemadgenius.medium.com/how-to-use-llava-large-language-and-vision-assistant-732c666b5ed0)

[project](https://llava-vl.github.io/)

[](https://youtu.be/mkI7EPD1vp8), [](https://youtu.be/kx1VpI6JzsY), [](https://youtu.be/RxBSmbdJ1I8), [](https://youtu.be/mdYycY4lsuE), [](https://youtu.be/t7I46dxfmWs), [](https://youtu.be/KRAQkJC-XJU)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/camenduru/LLaVA-colab/blob/main/LLaVA_13b_4bit_vanilla_colab.ipynb) | 22.12.2023 |

| SMPLer-X | Scaling up EHPS towards the first generalist foundation model, with up to ViT-Huge as the backbone and training with up to 4.5M instances from diverse data sources | 

[Zhongang Cai](https://caizhongang.github.io/)
 [Wanqi Yin](https://scholar.google.com/citations?user=zlIJwBEAAAAJ)
 [Ailing Zeng](https://ailingzeng.site/)
others[Chen Wei](https://github.com/Wei-Chen-hub)
 [Qingping Sun](https://github.com/ttxskk)
 [Yanjun Wang](https://github.com/WYJSJTU)
 [Hui En Pang](https://pangyyyyy.github.io/)
 [Haiyi Mei](https://haiyi-mei.com/)
 [Mingyuan Zhang](https://mingyuan-zhang.github.io/)
 [Lei Zhang](https://www.leizhang.org/)
 [Chen Change Loy](https://www.mmlab-ntu.com/person/ccloy/)
 [Lei Yang](https://scholar.google.com/citations?user=jZH2IPYAAAAJ)
 [Ziwei Liu](https://liuziwei7.github.io/)

 | [![](https://img.shields.io/github/stars/caizhongang/SMPLer-X?style=social)](https://github.com/caizhongang/SMPLer-X) 

[](https://arxiv.org/abs/2309.17448)

[](https://github.com/open-mmlab/mmhuman3d/blob/main/docs/human_data.md), [](https://github.com/mks0601/Hand4Whole_RELEASE), [](https://github.com/IDEA-Research/OSX)

[](https://neurips.cc/virtual/2023/poster/73473)

[project](https://caizhongang.com/projects/SMPLer-X/)

[](https://www.reddit.com/r/machinelearningnews/comments/176c5z7/this_ai_research_proposes_smplerx_a_generalist/)

[](https://youtu.be/DepTqbPpVzY), [](https://youtu.be/aFTGFInUnM4)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/camenduru/SMPLer-X-colab/blob/main/SMPLer_X_colab.ipynb) | 18.12.2023 |

| DeepCache | Training-free paradigm that accelerates diffusion models from the perspective of model architecture | 

[Xinyin Ma](https://horseee.github.io/)
 [Gongfan Fang](https://fangggf.github.io/)
 [Xinchao Wang](https://sites.google.com/site/sitexinchaowang/)

 | [![](https://img.shields.io/github/stars/horseee/DeepCache?style=social)](https://github.com/horseee/DeepCache) 

[](https://arxiv.org/abs/2312.00858)

[](https://huggingface.co/docs/diffusers/v0.24.0/en/api/pipelines/stable_diffusion/text2img#diffusers.StableDiffusionPipeline)

[project](https://horseee.github.io/Diffusion_DeepCache/)

[](https://www.reddit.com/r/StableDiffusion/comments/18b40hh/deepcache_accelerating_diffusion_models_for_free/)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/camenduru/DeepCache-colab/blob/main/DeepCache_colab.ipynb) | 18.12.2023 |

| MagicAnimate | Diffusion-based framework that aims at enhancing temporal consistency, preserving reference image faithfully, and improving animation fidelity | 

[Zhongcong Xu](https://scholar.google.com/citations?user=-4iADzMAAAAJ)
 [Jianfeng Zhang](http://jeff95.me/)
 [Jun Hao Liew](https://scholar.google.com/citations?user=8gm-CYYAAAAJ)
others[Hanshu Yan](https://hanshuyan.github.io/)
 [Jiawei Liu](https://jia-wei-liu.github.io/)
 [Chenxu Zhang](https://zhangchenxu528.github.io/)
 [Jiashi Feng](https://sites.google.com/site/jshfeng/home)
 [Mike Shou](https://sites.google.com/view/showlab)

 | [![](https://img.shields.io/github/stars/magic-research/magic-animate?style=social)](https://github.com/magic-research/magic-animate) 

[](https://arxiv.org/abs/2311.16498)

[](https://huggingface.co/zcxu-eric/MagicAnimate), [](https://huggingface.co/runwayml/stable-diffusion-v1-5), [](https://huggingface.co/stabilityai/sd-vae-ft-mse)

[](https://medium.com/@AIWorldBlog/revolutionizing-image-animation-with-magicanimate-technology-78cc94151915)

[project](https://showlab.github.io/magicanimate/)

[website](https://www.magicanimate.org/)

[](https://youtu.be/td27SyA9M80), [](https://youtu.be/1pATjLFvNtY), [](https://youtu.be/HeXknItbMM8)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/camenduru/MagicAnimate-colab/blob/main/MagicAnimate_colab.ipynb) | 18.12.2023 |

| DiffBIR | Towards Blind Image Restoration with Generative Diffusion Prior | 

[Xinqi Lin](https://github.com/0x3f3f3f3fun)
 [Jingwen He](https://github.com/hejingwenhejingwen)
 [Ziyan Chen](https://github.com/ziyannchen)
others[Zhaoyang Lyu](https://zhaoyanglyu.github.io/)
 [Ben Fei](https://scholar.google.com/citations?user=skQROj8AAAAJ)
 [Bo Dai](http://daibo.info/)
 [Wanli Ouyang](https://wlouyang.github.io/)
 [Yu Qiao](https://mmlab.siat.ac.cn/yuqiao)
 [Chao Dong](http://xpixel.group/2010/01/20/chaodong.html)

 | [![](https://img.shields.io/github/stars/XPixelGroup/DiffBIR?style=social)](https://github.com/XPixelGroup/DiffBIR) 

[](https://arxiv.org/abs/2308.15070)

[](https://github.com/albarji/mixture-of-diffusers)

[](https://huggingface.co/stabilityai/stable-diffusion-2-1-base)

[project](https://0x3f3f3f3fun.github.io/projects/diffbir/)

[](https://youtu.be/rGnrpxWjBOg), [](https://youtu.be/MIRiJGuGqsg)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/camenduru/DiffBIR-colab/blob/main/DiffBIR_colab.ipynb) | 18.12.2023 |

| Segment and Track Anything | Framewoork that allows users to precisely and effectively segment and track any object in a video | 

[Yangming Cheng](https://github.com/yamy-cheng)
 [Liulei Li](https://github.com/lingorX)
 [Yuanyou Xu](https://github.com/yoxu515)
others[Xiaodi Li](https://github.com/LiNO3Dy)
 [Zongxin Yang](https://z-x-yang.github.io/)
 [Wenguan Wang](https://sites.google.com/view/wenguanwang)
 [Yi Yang](https://scholar.google.com/citations?user=RMSuNFwAAAAJ)

 | [![](https://img.shields.io/github/stars/z-x-yang/Segment-and-Track-Anything?style=social)](https://github.com/z-x-yang/Segment-and-Track-Anything) 

[](https://arxiv.org/abs/2305.06558), [](https://arxiv.org/abs/2304.02643), [](https://proceedings.neurips.cc/paper_files/paper/2022/hash/eb890c36af87e4ca82e8ef7bcba6a284-Abstract-Conference.html)

[](https://github.com/yoxu515/aot-benchmark)

[](https://huggingface.co/ShilongLiu/GroundingDINO/resolve/main/groundingdino_swint_ogc.pth)

[](https://proceedings.neurips.cc/paper_files/paper/2022/hash/eb890c36af87e4ca82e8ef7bcba6a284-Abstract-Conference.html), [](https://proceedings.neurips.cc/paper/2021/hash/147702db07145348245dc5a2f2fe5683-Abstract.html)

[](https://youtu.be/DF0iFSsX8KY), [](https://youtu.be/UJvKPng9_DA), [](https://youtu.be/m1oFavjIaCM), [](https://youtu.be/UPhtpf1k6HA), [](https://youtu.be/Xyd54AngvV8), [](https://youtu.be/eZrdna8JkoQ), [](https://youtu.be/hPjw28Ul4cw), [](https://youtu.be/l7hXM1a3nEA), [](https://youtu.be/nXfq17X6ohk), [](https://youtu.be/5oieHqFIJPc), [](https://youtu.be/cK5MPFdJdSY), [](https://youtu.be/UFtwFaOfx2I)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1R10N70AJaslzADFqb-a5OihYkllWEVxB) | 08.12.2023 |

| AudioLDM | Text-to-audio system that is built on a latent space to learn the continuous audio representations from contrastive language-audio pretraining latents | 

[Haohe Liu](https://haoheliu.github.io/)
 [Zehua Chen](https://github.com/zehuachenImperial)
 [Yi Yuan](https://www.surrey.ac.uk/people/yi-yuan)
others[Xinhao Mei](https://xinhaomei.github.io/)
 [Xubo Liu](https://liuxubo717.github.io/)
 [Danilo Mandic](https://www.imperial.ac.uk/people/d.mandic)
 [Wenwu Wang](http://personal.ee.surrey.ac.uk/Personal/W.Wang/)
 [Mark Plumbley](https://www.surrey.ac.uk/people/mark-plumbley)

 | [![](https://img.shields.io/github/stars/haoheliu/AudioLDM?style=social)](https://github.com/haoheliu/AudioLDM) 

[](https://arxiv.org/abs/2301.12503)

[](https://github.com/LAION-AI/CLAP), [](https://github.com/CompVis/stable-diffusion), [](https://github.com/toshas/torch-fidelity)

[project](https://audioldm.github.io/)

[](https://youtu.be/_0VTltNYhao)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/olaviinha/NeuralTextToAudio/blob/main/AudioLDM_pub.ipynb) | 02.12.2023 |

| TabPFN | Neural network that learned to do tabular data prediction | 

[Noah Hollmann](https://github.com/noahho)
 [Samuel Müller](https://scholar.google.com/citations?user=pevYEjAAAAAJ)
 [Katharina Eggensperger](https://github.com/KEggensperger)
 [Frank Hutter](https://ml.informatik.uni-freiburg.de/profile/hutter/)

 | [![](https://img.shields.io/github/stars/automl/TabPFN?style=social)](https://github.com/automl/TabPFN) 

[](https://arxiv.org/abs/2207.01848), [](https://arxiv.org/abs/2106.11189), [](https://arxiv.org/abs/2106.01342), [](https://arxiv.org/abs/2106.03253), [](https://arxiv.org/abs/2106.11189), [](https://arxiv.org/abs/2112.10510)

[blog post](https://www.automl.org/tabpfn-a-transformer-that-solves-small-tabular-classification-problems-in-a-second/)

[](https://twitter.com/tunguz/status/1578730907711655937)

[](https://youtu.be/BGTO5N5-ack)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/194mCs6SEPEW6C0rcP7xWzcEtt1RBc8jJ) | 29.11.2023 |

| Concept Sliders | Plug-and-play low rank adaptors applied on top of pretrained models | 

[Rohit Gandikota](https://rohitgandikota.github.io/)
 [Joanna Materzyńska](https://joaanna.github.io/)
 [Tingrui Zhou](https://www.p1at.dev/)
others[Antonio Torralba](https://groups.csail.mit.edu/vision/torralbalab/)
 [David Bau](https://baulab.info/)

 | [![](https://img.shields.io/github/stars/rohitgandikota/sliders?style=social)](https://github.com/rohitgandikota/sliders) 

[](https://arxiv.org/abs/2311.12092), [](https://arxiv.org/abs/2207.12598)

[](https://medium.com/@furkangozukara/concept-sliders-lora-adaptors-for-precise-control-in-diffusion-models-b7f6b36fabee)

[](https://proceedings.neurips.cc/paper/2020/hash/49856ed476ad01fcff881d57e161d73f-Abstract.html)

[project](https://sliders.baulab.info/)

[](https://www.reddit.com/r/StableDiffusion/comments/180zon7/concept_sliders_lora_adaptors_for_precise_control/)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/rohitgandikota/sliders/blob/main/demo_concept_sliders.ipynb) | 26.11.2023 |

| Qwen-VL | Set of large-scale vision-language models designed to perceive and understand both text and images | 

[Jinze Bai](https://github.com/jinze1994)
 [Shuai Bai](https://github.com/ShuaiBai623)
 [Shusheng Yang](https://shushengyang.com/)
others[Shijie Wang](https://scholar.google.com/citations?user=DuAqyTwAAAAJ)
 [Sinan Tan](https://www.tinytangent.com/)
 [Peng Wang](https://scholar.google.com/citations?user=7fjqA0YAAAAJ)
 [Junyang Lin](https://justinlin610.github.io/)
 [Chang Zhou](https://scholar.google.com/citations?user=QeSoG3sAAAAJ)
 [Jingren Zhou](http://www.cs.columbia.edu/~jrzhou/)

 | [![](https://img.shields.io/github/stars/QwenLM/Qwen-VL?style=social)](https://github.com/QwenLM/Qwen-VL) 

[](https://arxiv.org/abs/2308.12966), [](https://arxiv.org/abs/2106.09685), [](https://arxiv.org/abs/2305.14314)

[demo](https://modelscope.cn/studios/qwen/Qwen-VL-Chat-Demo/summary)

[](https://discord.gg/z3GAxXZ9Ce)

[](https://github.com/BradyFU/Awesome-Multimodal-Large-Language-Models/tree/Evaluation), [](https://github.com/OFA-Sys/TouchStone), [](https://github.com/PanQiWei/AutoGPTQ)

[](https://huggingface.co/spaces/AILab-CVC/SEED-Bench_Leaderboard), [](https://huggingface.co/Qwen/Qwen-VL)

[](https://youtu.be/ElrSJDg23Po), [](https://youtu.be/E3MS8GfGWj4), [](https://youtu.be/ju09YaO7BGA)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/camenduru/Qwen-VL-Chat-colab/blob/main/Qwen_VL_Chat_colab.ipynb) | 24.11.2023 |

| PixArt-Σ | Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation | 

[Junsong Chen](https://lawrence-cj.github.io/)
 [Chongjian Ge](https://chongjiange.github.io/)
 [Enze Xie](https://xieenze.github.io/)
others[Yue Wu](https://yuewuhkust.github.io/)
 [Lewei Yao](https://scholar.google.com/citations?user=hqDyTg8AAAAJ)
 [Xiaozhe Ren](https://scholar.google.com/citations?user=3t2j87YAAAAJ)
 [Zhongdao Wang](https://zhongdao.github.io/)
 [Ping Luo](http://luoping.me/)
 [Huchuan Lu](https://scholar.google.com/citations?user=D3nE0agAAAAJ)
 [Zhenguo Li](https://scholar.google.com/citations?user=XboZC1AAAAAJ)

 | [![](https://img.shields.io/github/stars/PixArt-alpha/PixArt-sigma?style=social)](https://github.com/PixArt-alpha/PixArt-sigma) 

[](https://arxiv.org/abs/2403.04692), [](https://arxiv.org/abs/2310.00426), [](https://arxiv.org/abs/2401.05252)

[](https://discord.gg/rde6eaE5Ta)

[](https://huggingface.co/spaces/PixArt-alpha/PixArt-alpha), [](https://huggingface.co/spaces/PixArt-alpha/PixArt-LCM)

[project](https://pixart-alpha.github.io/PixArt-sigma-project/)

[](https://www.reddit.com/r/PixArtSigma/)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1jZ5UZXk7tcpTfVwnX33dDuefNMcnW9ME) | 07.11.2023 |

| Zero123++ | Image-conditioned diffusion model for generating 3D-consistent multi-view images from a single input view | 

[Ruoxi Shi](https://rshi.top/)
 [Hansheng Chen](https://lakonik.github.io/)
 [Zhuoyang Zhang](https://github.com/zhuoyang20)
others[Minghua Liu](https://cseweb.ucsd.edu/~mil070/)
 [Chao Xu](https://chaoxu.xyz/)
 [Xinyue Wei](https://sarahweiii.github.io/)
 [Linghao Chen](https://ootts.github.io/)
 [Chong Zeng](https://www.chong-zeng.com/)
 [Hao Su](https://cseweb.ucsd.edu/~haosu/)

 | [![](https://img.shields.io/github/stars/SUDO-AI-3D/zero123plus?style=social)](https://github.com/SUDO-AI-3D/zero123plus) 

[](https://arxiv.org/abs/2310.15110)

[](https://github.com/One-2-3-45/One-2-3-45), [](https://github.com/cvlab-columbia/zero123)

[](https://huggingface.co/spaces/sudo-ai/zero123plus-demo-space), [](https://huggingface.co/spaces/ysharma/Zero123PlusDemo)

[](https://xthemadgenius.medium.com/zero123-your-guide-to-single-view-to-multi-view-3d-image-transformation-b4346b0e6615)

[](https://www.reddit.com/r/StableDiffusion/comments/17f4c6p/zero123_a_single_image_to_consistent_multiview/)

[](https://youtu.be/V9AR-81pAgk)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1_5ECnTOosRuAsm2tUp0zvBG0DppL-F3V) | 26.10.2023 |

| Show-1 | Hybrid model, dubbed as Show-1, which marries pixel-based and latent-based VDMs for text-to-video generation | 

[David Junhao Zhang](https://junhaozhang98.github.io/)
 [Jay Zhangjie Wu](https://zhangjiewu.github.io/)
 [Jiawei Liu](https://jia-wei-liu.github.io/)
others[Rui Zhao](https://ruizhaocv.github.io/)
 [Lingmin Ran](https://siacorplab.nus.edu.sg/people/ran-lingmin/)
 [Yuchao Gu](https://ycgu.site/)
 [Difei Gao](https://scholar.google.com/citations?user=No9OsocAAAAJ)
 [Mike Zheng Shou](https://sites.google.com/view/showlab/home)

 | [![](https://img.shields.io/github/stars/showlab/Show-1?style=social)](https://github.com/showlab/Show-1) 

[](https://arxiv.org/abs/2309.15818)

[](https://huggingface.co/showlab/show-1-base), [](https://huggingface.co/showlab/show-1-interpolation), [](https://huggingface.co/showlab/show-1-sr1), [](https://huggingface.co/showlab/show-1-sr2), [](https://huggingface.co/damo-vilab/modelscope-damo-text-to-video-synthesis), [](https://huggingface.co/cerspense/zeroscope_v2_576w)

[project](https://showlab.github.io/Show-1/)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/camenduru/Show-1-colab/blob/main/Show_1_steps_colab.ipynb) | 15.10.2023 |

| DA-CLIP | Degradation-aware vision-language model to better transfer pretrained vision-language models to low-level vision tasks as a universal framework for image restoration | 

[Ziwei Luo](https://algolzw.github.io/)
 [Fredrik Gustafsson](http://www.fregu856.com/)
 [Zheng Zhao](https://zz.zabemon.com/)
others[Jens Sjölund](https://github.com/jsjol)
 [Thomas Schön](https://user.it.uu.se/~thosc112/index.html)

 | [![](https://img.shields.io/github/stars/Algolzw/daclip-uir?style=social)](https://github.com/Algolzw/daclip-uir) 

[](https://arxiv.org/abs/2310.01018)

[](https://github.com/Algolzw/image-restoration-sde)

[](https://huggingface.co/weblzw/daclip-uir-ViT-B-32-irsde)

[project](https://algolzw.github.io/daclip-uir/)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/camenduru/daclip-uir-colab/blob/main/daclip_uir_gradio_colab.ipynb) | 11.10.2023 |

| Musika | Music generation system that can be trained on hundreds of hours of music using a single consumer GPU, and that allows for much faster than real-time generation of music of arbitrary length on a consumer CPU | 

[Marco Pasini](https://github.com/marcoppasini)
 [Jan Schlüter](https://www.ofai.at/~jan.schlueter/)

 | [![](https://img.shields.io/github/stars/marcoppasini/musika?style=social)](https://github.com/marcoppasini/musika) 

[](https://arxiv.org/abs/2208.08706), [](https://arxiv.org/abs/2005.08526)

[data](https://magenta.tensorflow.org/datasets/maestro)

[](https://github.com/hendriks73/tempo-cnn), [](https://github.com/CPJKU/madmom)

[](https://huggingface.co/spaces/marcop/musika)

[project](https://marcoppasini.github.io/musika)

[](https://youtu.be/QBl8y2Z_i7Y), [](https://youtu.be/0l7OSM-bFvc)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1PowSw3doBURwLE-OTCiWkO8HVbS5paRb) | 09.10.2023 |

| YOLOv6 | Single-stage object detection framework dedicated to industrial applications | 

[Kaiheng Weng](https://github.com/khwengXU)
 [Meng Cheng](https://github.com/MTChengMeng)
 [Yiduo Li](https://github.com/yili123123)
others[Xiangxiang Chu](https://scholar.google.com/citations?&user=jn21pUsAAAAJ)
 [Xiaolin Wei](https://scholar.google.com/citations?user=s5b7lU4AAAAJ)

 | [![](https://img.shields.io/github/stars/meituan/YOLOv6?style=social)](https://github.com/meituan/YOLOv6) 

[](https://arxiv.org/abs/2209.02976), [](https://arxiv.org/abs/2301.05586)

[blog post](https://learnopencv.com/yolov6-object-detection/)

[data](https://cocodataset.org/#download)

[](https://yolov6-docs.readthedocs.io/zh_CN/latest/)

[](https://github.com/FeiGeChuanShu/ncnn-android-yolov6), [](https://github.com/DefTruth/lite.ai.toolkit/blob/main/lite/ort/cv/yolov6.cpp), [](https://github.com/Linaom1214/TensorRT-For-YOLO-Series), [](https://github.com/zhiqwang/yolov5-rt-stack/tree/main/deployment/tensorrt-yolov6)

[](https://youtu.be/3OpwcGU7VvE), [](https://youtu.be/GJ0lVOE3a7c), [](https://youtu.be/3hqkbqJ5ag8), [](https://youtu.be/fFCWrMFH2UY)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/meituan/YOLOv6/blob/master/turtorial.ipynb) | 08.10.2023 |

| DreamGaussian | Algorithm to convert 3D Gaussians into textured meshes and apply a fine-tuning stage to refine the details | 

[Jiaxiang Tang](https://me.kiui.moe/)
 [Jiawei Ren](https://jiawei-ren.github.io/)
 [Hang Zhou](https://hangz-nju-cuhk.github.io/)
others[Ziwei Liu](https://liuziwei7.github.io/)
 [Gang Zeng](http://www.cis.pku.edu.cn/info/1177/1378.htm)

 | [![](https://img.shields.io/github/stars/dreamgaussian/dreamgaussian?style=social)](https://github.com/dreamgaussian/dreamgaussian) 

[](https://arxiv.org/abs/2309.16653)

[](https://github.com/graphdeco-inria/diff-gaussian-rasterization), [](https://github.com/NVlabs/nvdiffrast), [](https://github.com/hoffstadt/DearPyGui)

[project](https://dreamgaussian.github.io/)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1sLpYmmLS209-e5eHgcuqdryFRRO6ZhFS) | 04.10.2023 |

| DINOv2 | Produce high-performance visual features that can be directly employed with classifiers as simple as linear layers on a variety of computer vision tasks; these visual features are robust and perform well across domains without any requirement for fine-tuning | 

[Maxime Oquab](https://scholar.google.com/citations?user=5vteYV8AAAAJ)
 [Timothée Darcet](https://github.com/TimDarcet)
 [Théo Moutakanni](https://github.com/TheoMoutakanni)
others[Huy Vo](https://huyvvo.github.io/)
 [Marc Szafraniec](https://github.com/MarcSzafraniec/)
 [Vasil Khalidov](https://scholar.google.com/citations?user=tjazz3AAAAAJ)
 [Pierre Fernandez](https://pierrefdz.github.io/)
 [Daniel Haziza](https://scholar.google.com/citations?user=2eSKdFMAAAAJ)
 [Francisco Massa](https://github.com/fmassa)
 [Alaaeldin El-Nouby](https://aelnouby.github.io/)
 [Mahmoud Assran](http://www.midoassran.ca/)
 [Nicolas Ballas](https://scholar.google.com/citations?user=euUV4iUAAAAJ)
 [Wojciech Galuba](https://scholar.google.com/citations?user=jyaTX64AAAAJ)
 [Russell Howes](http://www.russellhowes.net/)
 [Po-Yao Huang](https://berniebear.github.io/)
 [Shang-Wen Li](https://swdanielli.github.io/)
 [Ishan Misra](http://imisra.github.io/)
 [Michael Rabbat](https://scholar.google.com/citations?user=cMPKe9UAAAAJ)
 [Vasu Sharma](https://vasusharma.github.io/)
 [Gabriel Synnaeve](https://syhw.github.io/)
 [Hu Xu](https://howardhsu.github.io/)
 [Hervé Jegou](https://github.com/jegou)
 [Julien Mairal](http://thoth.inrialpes.fr/people/mairal/)
 [Patrick Labatut](https://github.com/patricklabatut)
 [Armand Joulin](https://scholar.google.com/citations?user=kRJkDakAAAAJ)
 [Piotr Bojanowski](https://github.com/piotr-bojanowski)

 | [![](https://img.shields.io/github/stars/facebookresearch/dinov2?style=social)](https://github.com/facebookresearch/dinov2) 

[](https://arxiv.org/abs/2304.07193)

[blog post](https://ai.facebook.com/blog/dino-v2-computer-vision-self-supervised-learning/)

[demo](https://dinov2.metademolab.com/)

[](https://huggingface.co/docs/transformers/main/model_doc/dinov2)

[](https://purnasaigudikandula.medium.com/dinov2-image-classification-visualization-and-paper-review-745bee52c826), [](https://towardsdatascience.com/meta-ais-another-revolutionary-large-scale-model-dinov2-for-image-feature-extraction-1114b287eadd)

[](https://youtu.be/csEgtSh7jV4), [](https://www.youtube.com/live/KSZiJ4k28b4), [](https://youtu.be/RZEkdOc3szU)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/facebookresearch/dinov2/blob/main/notebooks/semantic_segmentation.ipynb) | 31.08.2023 |

| StyleGAN 3 | Alias-Free Generative Adversarial Networks | 

[Tero Karras](https://research.nvidia.com/person/tero-karras)
 [Miika Aittala](https://research.nvidia.com/person/Miika-Aittala)
 [Samuli Laine](https://research.nvidia.com/person/Samuli-Laine)
others[Erik Härkönen](https://github.com/harskish)
 [Janne Hellsten](https://research.nvidia.com/person/Janne-Hellsten)
 [Jaakko Lehtinen](https://users.aalto.fi/~lehtinj7/)
 [Timo Aila](https://research.nvidia.com/person/timo-aila)

 | [![](https://img.shields.io/github/stars/NVlabs/stylegan3?style=social)](https://github.com/NVlabs/stylegan3) 

[](https://arxiv.org/abs/2106.12423), [](https://arxiv.org/abs/1706.08500), [](https://arxiv.org/abs/1801.01401), [](https://arxiv.org/abs/1904.06991), [](https://arxiv.org/abs/1812.04948), [](https://arxiv.org/abs/1606.03498)

[](https://github.com/NVlabs/stylegan3-detector), [](https://github.com/NVlabs/ffhq-dataset), [](https://github.com/NVlabs/metfaces-dataset), [](https://github.com/NVlabs/stylegan2-ada-pytorch), [](https://github.com/NVlabs/stylegan2-ada)

[](https://proceedings.neurips.cc/paper/2021/hash/076ccd93ad68be51f23707988e934906-Abstract.html)

[project](https://nvlabs.github.io/stylegan3)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1BXNHZBai-pXtP-ncliouXo_kUiG1Pq7M) | 13.08.2023 |

| Big GAN | Large Scale GAN Training for High Fidelity Natural Image Synthesis | 

[Andrew Brock](https://github.com/ajbrock)
 [Jeff Donahue](https://jeffdonahue.com/)
 [Karen Simonyan](https://scholar.google.com/citations?user=L7lMQkQAAAAJ)

 | [](https://arxiv.org/abs/1809.11096) | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/tensorflow/hub/blob/master/examples/colab/biggan_generation_with_tf_hub.ipynb) | 03.08.2023 |

| CutLER | Simple approach for training unsupervised object detection and segmentation models | 

[Xudong Wang](https://people.eecs.berkeley.edu/~xdwang/)
 [Rohit Girdhar](https://rohitgirdhar.github.io/)
 [Stella Yu](https://www1.icsi.berkeley.edu/~stellayu/)
 [Ishan Misra](https://imisra.github.io/)

 | [![](https://img.shields.io/github/stars/facebookresearch/CutLER?style=social)](https://github.com/facebookresearch/CutLER) 

[](https://arxiv.org/abs/2301.11320), [](https://arxiv.org/abs/1706.02677)

[](https://detectron2.readthedocs.io/en/latest/tutorials/datasets.html)

[project](http://people.eecs.berkeley.edu/~xdwang/projects/CutLER/)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1NgEyFHvOfuA2MZZnfNPWg1w5gSr3HOBb) | 24.07.2023 |

| Recognize Anything & Tag2Text | Vision language pre-training framework, which introduces image tagging into vision-language models to guide the learning of visual-linguistic features | 

[Xinyu Huang](https://xinyu1205.github.io/)
 [Youcai Zhang](https://github.com/Coler1994)
 [Jinyu Ma](https://github.com/majinyu666)
others[Zhaoyang Li](https://github.com/ZhaoyangLi-nju)
 [Yanchun Xie](https://scholar.google.com/citations?user=T0xk9-wAAAAJ)
 [Yuzhuo Qin](https://scholar.google.com/citations?user=5ZG65AkAAAAJ)
 [Tong Luo](https://ieeexplore.ieee.org/author/37089387319)
 [Yaqian Li](https://openreview.net/profile?id=~Yaqian_Li1)
 [Yandong Guo](http://www.lsl.zone/)
 [Yandong Guo](https://scholar.google.com/citations?user=fWDoWsQAAAAJ)
 [Lei Zhang](https://www.leizhang.org/)

 | [![](https://img.shields.io/github/stars/xinyu1205/recognize-anything?style=social)](https://github.com/xinyu1205/recognize-anything) 

[](https://arxiv.org/abs/2306.03514), [](https://arxiv.org/abs/2303.05657)

[](https://github.com/OpenGVLab/Ask-Anything), [](https://github.com/positive666/Prompt-Can-Anything)

[](https://artgor.medium.com/paper-review-recognize-anything-a-strong-image-tagging-model-9e5e1c6dd0af)

[project](https://recognize-anything.github.io/), [project](https://recognize-anything.github.io/)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/mhd-medfa/recognize-anything/blob/main/recognize_anything_demo.ipynb) | 09.07.2023 |

| MobileSAM | Towards Lightweight SAM for Mobile Applications | 

[Chaoning Zhang](https://github.com/ChaoningZhang)
 [Dongshen Han](https://github.com/dongshenhan)
 [Yu Qiao](https://github.com/qiaoyu1002)
others[Jung Uk Kim](https://visualai.khu.ac.kr/)
 [Sung-Ho Bae](https://scholar.google.com/citations?user=EULut5oAAAAJ)
 [Seungkyu Lee](https://scholar.google.com/citations?user=3Pf6C6cAAAAJ)
 [Choong Seon Hong](https://scholar.google.com/citations?user=oKANWloAAAAJ)

 | [![](https://img.shields.io/github/stars/ChaoningZhang/MobileSAM?style=social)](https://github.com/ChaoningZhang/MobileSAM) 

[](https://arxiv.org/abs/2306.14289)

[](https://github.com/jolibrain/joliGEN), [](https://github.com/akbartus/MobileSAM-in-the-Browser), [](https://github.com/qiaoyu1002/Inpaint-Anything), [](https://github.com/qiaoyu1002/Personalize-SAM), [](https://github.com/Jumpat/SegmentAnythingin3D), [](https://github.com/vietanhdev/anylabeling), [](https://github.com/wangsssky/SonarSAM), [](https://github.com/continue-revolution/sd-webui-segment-anything)

[](https://twitter.com/_akhaliq/status/1674410573075718145)

[](https://youtu.be/eTEfq_kWabQ)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/ChaoningZhang/MobileSAM/blob/master/notebooks/predictor_example.ipynb) | 30.06.2023 |

| Grounding DINO | Marrying DINO with Grounded Pre-Training for Open-Set Object Detection | 

[Shilong Liu](https://github.com/SlongLiu)
 [Zhaoyang Zeng](https://scholar.google.com/citations?user=U_cvvUwAAAAJ)
 [Tianhe Ren](https://rentainhe.github.io/)
others[Feng Li](https://scholar.google.com/citations?user=ybRe9GcAAAAJ)
 [Hao Zhang](https://scholar.google.com/citations?user=B8hPxMQAAAAJ)
 [Jie Yang](https://yangjie-cv.github.io/)
 [Chunyuan Li](https://scholar.google.com/citations?user=Zd7WmXUAAAAJ)
 [Jianwei Yang](https://jwyang.github.io/)
 [Hang Su](https://www.suhangss.me/)
 [Jun Zhu](https://scholar.google.com/citations?user=axsP38wAAAAJ)
 [Lei Zhang](https://www.leizhang.org/)

 | [![](https://img.shields.io/github/stars/IDEA-Research/GroundingDINO?style=social)](https://github.com/IDEA-Research/GroundingDINO) 

[](https://arxiv.org/abs/2303.05499)

[](https://github.com/IDEA-Research/DINO), [](https://github.com/UX-Decoder/Semantic-SAM), [](https://github.com/OptimalScale/DetGPT), [](https://github.com/IDEA-Research/OpenSeeD), [](https://github.com/UX-Decoder/Segment-Everything-Everywhere-All-At-Once), [](https://github.com/microsoft/X-Decoder/tree/xgpt), [](https://github.com/IDEA-Research/detrex)

[](https://paperswithcode.com/sota/zero-shot-object-detection-on-mscoco?p=grounding-dino-marrying-dino-with-grounded), [](https://paperswithcode.com/sota/zero-shot-object-detection-on-odinw?p=grounding-dino-marrying-dino-with-grounded), [](https://paperswithcode.com/sota/object-detection-on-coco-minival?p=grounding-dino-marrying-dino-with-grounded), [](https://paperswithcode.com/sota/object-detection-on-coco?p=grounding-dino-marrying-dino-with-grounded)

[](https://youtu.be/wxWDt5UiwY8), [](https://youtu.be/cMa77r3YrDk), [](https://youtu.be/C4NqaRBz_Kw), [](https://youtu.be/oEQYStnF2l8)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/roboflow-ai/notebooks/blob/main/notebooks/zero-shot-object-detection-with-grounding-dino.ipynb) | 28.06.2023 |

| T5X | Modular, composable, research-friendly framework for high-performance, configurable, self-service training, evaluation, and inference of sequence models at many scales | 

[Adam Roberts](https://github.com/adarob)
 [Hyung Won Chung](https://github.com/hwchung27)
 [Anselm Levskaya](https://anselmlevskaya.com/)
others[Gaurav Mishra](https://research.google/people/GauravMishra/)
 [James Bradbury](https://github.com/jekbradbury)
 [Daniel Andor](https://github.com/andorardo)
 [Sharan Narang](https://github.com/sharannarang)
 [Brian Lester](https://blester125.com/)
 [Colin Gaffney](https://github.com/cpgaffney1)
 [Afroz Mohiuddin](https://github.com/afrozenator)
 [Curtis Hawthorne](https://github.com/cghawthorne)
 [Aitor Lewkowycz](https://scholar.google.com/citations?user=Yum1ah0AAAAJ)
 [Alex Salcianu](https://scholar.google.com/citations?user=HSrT1wsAAAAJ)
 [Marc van Zee](https://github.com/marcvanzee)
 [Jacob Austin](https://jacobaustin123.github.io/)
 [Sebastian Goodman](https://github.com/0x0539)
 [Livio Baldini Soares](https://liviosoares.github.io/)
 [Haitang Hu](https://hthu.github.io/)
 [Sasha Tsvyashchenko](https://endl.ch/)
 [Aakanksha Chowdhery](http://www.achowdhery.com/)
 [Jasmijn Bastings](https://jasmijn.ninja/)
 [Jannis Bulian](http://bulian.org/)
 [Xavier Garcia](https://scholar.google.com/citations?user=Y2Hio6MAAAAJ)
 [Jianmo Ni](https://nijianmo.github.io/)
 [Kathleen Kenealy](https://scholar.google.com/citations?&user=HgRBC5gAAAAJ)
 [Jonathan Clark](http://www.cs.cmu.edu/~jhclark/)
 [Dan Garrette](http://www.dhgarrette.com/)
 [James Lee-Thorp](https://scholar.google.com/citations?user=qsPv098AAAAJ)
 [Colin Raffel](https://colinraffel.com/)
 [Noam Shazeer](https://scholar.google.com/citations?user=wsGvgA8AAAAJ)
 [Marvin Ritter](https://scholar.google.com/citations?user=arcf5FgAAAAJ)
 [Maarten Bosma](https://scholar.google.com/citations?user=wkeFQPgAAAAJ)
 [Alexandre Passos](https://www.ic.unicamp.br/~tachard/)
 [Jeremy Maitin-Shepard](https://research.google/people/JeremyMaitinShepard/)
 [Noah Fiedel](https://scholar.google.com/citations?user=XWpV9DsAAAAJ)
 [Brennan Saeta](https://github.com/saeta)
 [Ryan Sepassi](https://ryansepassi.com/)
 [Alexander Spiridonov](https://research.google/people/AlexanderSpiridonov/)
 [Joshua Newlan](https://github.com/joshnewlan)
 [Andrea Gesmundo](https://github.com/agesmundo)

 | [![](https://img.shields.io/github/stars/google-research/t5x?style=social)](https://github.com/google-research/t5x) 

[](https://arxiv.org/abs/2203.17189), [](https://arxiv.org/abs/1910.10683)

[](https://t5x.readthedocs.io/en/latest/)

[](https://github.com/tensorflow/mesh), [](https://github.com/tensorflow/serving)

[](https://www.tensorflow.org/datasets/catalog/wmt_t2t_translate), [](https://www.tensorflow.org/guide/data), [](https://www.tensorflow.org/tensorboard)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/google-research/t5x/blob/main/t5x/notebooks/introduction.ipynb) | 27.06.2023 |

| Gen-L-Video | Extending off-the-shelf short video diffusion models for generating and editing videos comprising hundreds of frames with diverse semantic segments without introducing additional training, all while preserving content consistency | 

[Fu-Yun Wang](https://g-u-n.github.io/)
 [Wenshuo Chen](https://github.com/winshot-thu)
 [Guanglu Song](https://songguanglu.github.io/)
others[Han-Jia Ye](https://github.com/Han-Jia)
 [Yu Liu](https://liuyu.us/)
 [Hongsheng Li](https://www.ee.cuhk.edu.hk/~hsli/)

 | [![](https://img.shields.io/github/stars/G-U-N/Gen-L-Video?style=social)](https://github.com/G-U-N/Gen-L-Video) 

[](https://arxiv.org/abs/2305.18264)

[](https://github.com/lllyasviel/ControlNet), [](https://github.com/TencentARC/T2I-Adapter), [](https://github.com/sail-sg/EditAnything)

[project](https://g-u-n.github.io/projects/gen-long-video/index.html)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/G-U-N/Gen-L-Video/blob/master/notebooks/glv_colab.ipynb) | 04.06.2023 |

| First Order Motion Model for Image Animation | Transferring facial movements from video to image | [Aliaksandr Siarohin](https://aliaksandrsiarohin.github.io/aliaksandr-siarohin-website/) | [![](https://img.shields.io/github/stars/AliaksandrSiarohin/first-order-model?style=social)](https://github.com/AliaksandrSiarohin/first-order-model) 

[](https://papers.nips.cc/paper/2019/hash/31c0b36aef265d9221af80872ceb62f9-Abstract.html)

[project](https://aliaksandrsiarohin.github.io/first-order-model-website/)

[](https://www.youtube.com/watch?v=u-0cQ-grXBQ)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/AliaksandrSiarohin/first-order-model/blob/master/demo.ipynb) | 04.06.2023 |

| MMS | The Massively Multilingual Speech project expands speech technology from about 100 languages to over 1000 by building a single multilingual speech recognition model supporting over 1100 languages, language identification models able to identify over 4000 languages, pretrained models supporting over 1400 languages, and text-to-speech models for over 1100 languages | 

[Vineel Pratap](https://github.com/vineelpratap)
 [Andros Tjandra](https://github.com/androstj)
 [Bowen Shi](https://scholar.google.com/citations?user=xqyoorYAAAAJ)
others[Paden Tomasello](https://scholar.google.com/citations?user=sBtWMGYAAAAJ)
 [Arun Babu](https://scholar.google.com/citations?user=oJfoTakAAAAJ)
 [Sayani Kundu](https://www.linkedin.com/in/sayani-kundu)
 [Ali Elkahky](https://scholar.google.com/citations?user=KB3S8RoAAAAJ)
 [Zhaoheng Ni](https://scholar.google.com/citations?user=SYFMSNsAAAAJ)
 [Apoorv Vyas](https://apoorv2904.github.io/)
 [Maryam Fazel-Zarandi](https://www.maryamfazel.com/)
 [Alexei Baevski](https://github.com/alexeib)
 [Yossi Adi](https://www.cs.huji.ac.il/~adiyoss/)
 [Xiaohui Zhang](https://github.com/xiaohui-zhang)
 [Wei-Ning Hsu](https://wnhsu.github.io/)
 [Alexis Conneau](https://github.com/aconneau)
 [Michael Auli](https://github.com/michaelauli)

 | [![](https://img.shields.io/github/stars/facebookresearch/fairseq?style=social)](https://github.com/facebookresearch/fairseq/tree/main/examples/mms) 

[](https://arxiv.org/abs/2305.13516)

[](https://huggingface.co/docs/transformers/main/en/model_doc/mms), [](https://huggingface.co/facebook/mms-cclms/), [](https://huggingface.co/blog/mms_adapters)

[](https://ai.facebook.com/blog/multilingual-model-speech-recognition/)

[](https://youtu.be/GEzxHxWys2s), [](https://youtu.be/g06agCmxS7I)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/facebookresearch/fairseq/blob/main/examples/mms/asr/tutorial/MMS_ASR_Inference_Colab.ipynb) | 26.05.2023 |

| FAB | Flow AIS Bootstrap uses AIS to generate samples in regions where the flow is a poor approximation of the target, facilitating the discovery of new modes | 

[Laurence Midgley](https://lollcat.github.io/laurence-midgley/)
 [Vincent Stimper](https://is.mpg.de/person/vstimper)
 [Gregor N. C. Simm](https://www.gncs.me/)
others[Bernhard Schölkopf](https://scholar.google.com/citations?user=DZ-fHPgAAAAJ)
 [José Miguel Hernández-Lobato](https://jmhl.org/)

 | [![](https://img.shields.io/github/stars/lollcat/fab-torch?style=social)](https://github.com/lollcat/fab-torch) 

[](https://arxiv.org/abs/2208.01893)

[](https://github.com/lollcat/fab-jax-old), [](https://github.com/deepmind/annealed_flow_transport)

[](https://youtu.be/xQQXvOWu9nE)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/lollcat/fab-torch/blob/master/experiments/gmm/fab_gmm.ipynb) | 29.04.2023 |

| CodeFormer | Transformer-based prediction network to model global composition and context of the low-quality faces for code prediction, enabling the discovery of natural faces that closely approximate the target faces even when the inputs are severely degraded | 

[Shangchen Zhou](https://shangchenzhou.com/)
 [Kelvin Chan](https://ckkelvinchan.github.io/)
 [Chongyi Li](https://li-chongyi.github.io/)
 [Chen Change Loy](https://www.mmlab-ntu.com/person/ccloy/)

 | [![](https://img.shields.io/github/stars/sczhou/CodeFormer?style=social)](https://github.com/sczhou/CodeFormer) 

[](https://arxiv.org/abs/2206.11253)

[](https://github.com/samb-t/unleashing-transformers), [](https://github.com/deepcam-cn/yolov5-face), [](https://github.com/xinntao/facexlib)

[](https://proceedings.neurips.cc/paper_files/paper/2022/hash/c573258c38d0a3919d8c1364053c45df-Abstract-Conference.html)

[project](https://shangchenzhou.com/projects/CodeFormer/)

[](https://youtu.be/d3VDpkXlueI), [](https://youtu.be/PtwWu-FugbA), [](https://youtu.be/ORtYP8NW4T0), [](https://youtu.be/xc5lKOKBCcg)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1m52PNveE4PBhYrecj34cnpEeiHcC5LTb) | 21.04.2023 |

| Segment Anything | The Segment Anything Model produces high quality object masks from input prompts such as points or boxes, and it can be used to generate masks for all objects in an image | 

[Alexander Kirillov](https://alexander-kirillov.github.io/)
 [Eric Mintun](https://ericmintun.github.io/)
 [Nikhila Ravi](https://nikhilaravi.com/)
others[Hanzi Mao](https://hanzimao.me/)
 [Chloé Rolland](https://scholar.google.com/citations?user=n-SnMhoAAAAJ)
 [Laura Gustafson](https://scholar.google.com/citations?user=c8IpF9gAAAAJ)
 [Tete Xiao](https://tetexiao.com/)
 [Spencer Whitehead](https://www.spencerwhitehead.com/)
 [Alex Berg](http://acberg.com/)
 [Wan-Yen Lo](https://github.com/wanyenlo)
 [Piotr Dollár](https://pdollar.github.io/)
 [Ross Girshick](https://www.rossgirshick.info/)

 | [![](https://img.shields.io/github/stars/facebookresearch/segment-anything?style=social)](https://github.com/facebookresearch/segment-anything) 

[](https://arxiv.org/abs/2304.02643)

[data](https://ai.facebook.com/datasets/segment-anything/)

[](https://ai.facebook.com/research/publications/segment-anything/), [](https://ai.facebook.com/blog/segment-anything-foundation-model-image-segmentation/)

[website](https://segment-anything.com/)

[](https://youtu.be/2O_vecl28OA), [](https://youtu.be/fVeW9a6wItM), [](https://youtu.be/FjYE0tKWOiY)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/facebookresearch/segment-anything/blob/main/notebooks/predictor_example.ipynb) | 10.04.2023 |

| EVA3D | High-quality unconditional 3D human generative model that only requires 2D image collections for training | 

[Fangzhou Hong](https://hongfz16.github.io/)
 [Zhaoxi Chen](https://frozenburning.github.io/)
 [Yushi Lan](https://github.com/NIRVANALAN)
others[Liang Pan](https://github.com/paul007pl)
 [Ziwei Liu](https://liuziwei7.github.io/)

 | [![](https://img.shields.io/github/stars/hongfz16/EVA3D?style=social)](https://github.com/hongfz16/EVA3D) 

[](https://arxiv.org/abs/2210.04888)

[project](https://hongfz16.github.io/projects/EVA3D.html)

[](https://youtu.be/JNV0FJ0aDWM), [](https://youtu.be/M-kyvzTQrBI)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/hongfz16/EVA3D/blob/main/notebook/EVA3D_Demo.ipynb) | 06.04.2023 |

| Stable Dreamfusion | Using a pretrained 2D text-to-image diffusion model to perform text-to-3D synthesis | 

[Jiaxiang Tang](https://me.kiui.moe/)
 [Ben Poole](https://cs.stanford.edu/~poole/)
 [Ajay Jain](https://ajayj.com/)
others[Jon Barron](https://jonbarron.info/)
 [Ben Mildenhall](https://bmild.github.io/)

 | [![](https://img.shields.io/github/stars/ashawkey/stable-dreamfusion?style=social)](https://github.com/ashawkey/stable-dreamfusion) 

[](https://arxiv.org/abs/2209.14988)

[](https://github.com/ashawkey/torch-ngp), [](https://github.com/hoffstadt/DearPyGui)

[](https://huggingface.co/runwayml/stable-diffusion-v1-5)

[project](https://dreamfusion3d.github.io/)

[](https://pytorch.org/docs/stable/cpp_extension.html#torch.utils.cpp_extension.load)

[](https://youtu.be/uM5NPodZZ1U?t=219), [](https://youtu.be/zWD5ZR5GtJM), [](https://youtu.be/L3G0dx1Q0R8), [](https://youtu.be/dIgDbBTztUM)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1MXT3yfOFvO0ooKEfiUUvTKwUkrrlCHpF) | 04.04.2023 |

| Visual ChatGPT | Connects ChatGPT and a series of Visual Foundation Models to enable sending and receiving images during chatting | 

[Chenfei Wu](https://github.com/chenfei-wu)
 [Shengming Yin](https://github.com/shengming-yin)
 [Weizhen Qi](https://github.com/WeizhenQ)
others[Xiaodong Wang](https://wang-xiaodong1899.github.io/)
 [Zecheng Tang](https://github.com/CODINNLG)
 [Nan Duan](https://nanduan.github.io/)

 | [![](https://img.shields.io/github/stars/microsoft/visual-chatgpt?style=social)](https://github.com/microsoft/visual-chatgpt) 

[](https://arxiv.org/abs/2303.04671)

[](https://github.com/hwchase17/langchain), [](https://github.com/lllyasviel/ControlNet), [](https://github.com/timothybrooks/instruct-pix2pix), [](https://github.com/timojl/clipseg)

[](https://youtu.be/0UfXlFUwLms), [](https://youtu.be/7YEiEyfPF5U)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/11BtP3h-w0dZjA-X8JsS9_eo8OeGYvxXB) | 15.03.2023 |

| GPEN | GAN Prior Embedded Network for Blind Face Restoration in the Wild | 

[Tao Yang](https://cg.cs.tsinghua.edu.cn/people/~tyang/)
 [Peiran Ren](https://scholar.google.com/citations?&user=x5dEuxsAAAAJ)
 [Xuansong Xie](https://scholar.google.com/citations?user=M0Ei1zkAAAAJ)
 [Lei Zhang](http://www4.comp.polyu.edu.hk/~cslzhang/)

 | [![](https://img.shields.io/github/stars/yangxy/GPEN?style=social)](https://github.com/yangxy/GPEN) 

[](https://arxiv.org/abs/2105.06070)

[demo](https://vision.aliyun.com/experience/detail?spm=a211p3.14020179.J_7524944390.17.66cd4850wVDkUQ&tagName=facebody&children=EnhanceFace)

[](https://github.com/biubug6/Pytorch_Retinaface), [](https://github.com/rosinality/stylegan2-pytorch)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/yangxy/GPEN/blob/main/GPEN.ipynb) | 15.02.2023 |

| Disco Diffusion | A frankensteinian amalgamation of notebooks, models and techniques for the generation of AI Art and Animations | 

[Max Ingham](https://github.com/somnai-dreams)
 [Adam Letts](https://linktr.ee/gandamu)
 [Daniel Russell](https://github.com/russelldc)
 [Chigozie Nri](https://github.com/chigozienri)

 | [![](https://img.shields.io/github/stars/alembics/disco-diffusion?style=social)](https://github.com/alembics/disco-diffusion) 

[](https://github.com/openai/guided-diffusion)

[](https://youtu.be/_DtWfh9oS54), [](https://youtu.be/gWxmtdZL8FE), [](https://youtu.be/yVJB6oD0_gM)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/alembics/disco-diffusion/blob/main/Disco_Diffusion.ipynb) | 11.02.2023 |

| GrooVAE | Some applications of machine learning for generating and manipulating beats and drum performances | 

[Jon Gillick](https://www.jongillick.com/)
 [Adam Roberts](https://github.com/adarob)
 [Jesse Engel](https://github.com/jesseengel)

 | [![](https://img.shields.io/github/stars/magenta/magenta?style=social)](https://github.com/magenta/magenta/tree/main/magenta/models/music_vae) 

[](https://arxiv.org/abs/1905.06118)

[blog post](https://g.co/magenta/groovae)

[data](https://g.co/magenta/groove-datasets)

[web app](https://groove-drums.glitch.me/)

[](https://www.youtube.com/watch?v=x2YLmXzovDo)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/tensorflow/magenta-demos/blob/master/colab-notebooks/GrooVAE.ipynb) | 02.02.2023 |

| Multitrack MusicVAE | The models in this notebook are capable of encoding and decoding single measures of up to 8 tracks, optionally conditioned on an underlying chord | 

[Ian Simon](https://github.com/iansimon)
 [Adam Roberts](https://github.com/adarob)
 [Colin Raffel](https://colinraffel.com//)
others[Jesse Engel](https://github.com/jesseengel)
 [Curtis Hawthorne](https://github.com/cghawthorne)
 [Douglas Eck](https://github.com/douglaseck)

 | 

[](https://arxiv.org/abs/1806.00195)

[blog post](http://g.co/magenta/multitrack)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/magenta/magenta-demos/blob/master/colab-notebooks/Multitrack_MusicVAE.ipynb) | 02.02.2023 |

| MusicVAE | A Hierarchical Latent Vector Model for Learning Long-Term Structure in Music | 

[Adam Roberts](https://github.com/adarob)
 [Jesse Engel](https://github.com/jesseengel)
 [Colin Raffel](https://colinraffel.com//)
others[Curtis Hawthorne](https://github.com/cghawthorne)
 [Douglas Eck](https://github.com/douglaseck)

 | 

[](https://arxiv.org/abs/1803.05428)

[blog post](https://g.co/magenta/music-vae)

[project](https://magenta.tensorflow.org/music-vae)

[](https://www.youtube.com/playlist?list=PLBUMAYA6kvGU8Cgqh709o5SUvo-zHGTxr)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/magenta/magenta-demos/blob/master/colab-notebooks/MusicVAE.ipynb) | 02.02.2023 |

| LORA | Low-Rank Adaptation, which freezes the pre-trained model weights and injects trainable rank decomposition matrices into each layer of the Transformer architecture, greatly reducing the number of trainable parameters for downstream tasks | 

[Edward Hu](https://edwardjhu.com/)
 [Yelong Shen](https://scholar.google.com/citations?user=S6OFEFEAAAAJ)
 [Phillip Wallis](https://scholar.google.com/citations?user=8IqHSXYAAAAJ)
others[Zeyuan Allen-Zhu](http://zeyuan.allen-zhu.com/)
 [Yuanzhi Li](https://scholar.google.com/citations?user=aHtfItQAAAAJ)
 [Shean Wang](https://www.linkedin.com/in/shean-wang-18a20841)
 [Lu Wang](https://scholar.google.com/citations?user=JWJ_SNcAAAAJ)
 [Weizhu Chen](https://www.microsoft.com/en-us/research/people/wzchen/)

 | [![](https://img.shields.io/github/stars/cloneofsimo/lora?style=social)](https://github.com/cloneofsimo/lora) 

[](https://arxiv.org/abs/2106.09685), [](https://arxiv.org/abs/2208.12242), [](https://arxiv.org/abs/2208.01618), [](https://arxiv.org/abs/2106.05744), [](https://arxiv.org/abs/2106.04647)

[](https://github.com/microsoft/LoRA)

[](https://huggingface.co/spaces/lora-library/LoRA-DreamBooth-Training-UI), [](https://huggingface.co/blog/lora)

[](https://medium.com/@shelikohan/low-rank-adapter-lora-explained-0d3677395639), [](https://towardsdatascience.com/understanding-lora-low-rank-adaptation-for-finetuning-large-models-936bce1a07c6/)

[](https://pypi.org/project/loralib/)

[](https://www.reddit.com/r/LocalLLaMA/comments/1diw5fb/understanding_lora_a_5minute_visual_guide_to/), [](https://www.reddit.com/r/LocalLLaMA/comments/17z91wk/practical_tips_for_finetuning_llms_using_lora/)

[](https://youtu.be/dA-NhCtrrVE), [](https://youtu.be/t509sv5MT0w), [](https://youtu.be/KEv-F5UkhxU), [](https://youtu.be/DhRoTONcyZE), [](https://youtu.be/PXWYUTMt-AU), [](https://youtu.be/X4VvO3G6_vw)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1iSFDpRBKEWr2HLlz243rbym3J2X95kcy) | 30.01.2023 |

| Fourier Feature Networks | Fourier Features Let Networks Learn High Frequency Functions in Low Dimensional Domains | 

[Matthew Tancik](https://www.matthewtancik.com/)
 [Pratul Srinivasan](https://pratulsrinivasan.github.io/)
 [Ben Mildenhall](https://bmild.github.io/)
others[Sara Fridovich-Keil](https://people.eecs.berkeley.edu/~sfk/)
 [Nithin Raghavan](https://cseweb.ucsd.edu//~n2raghavan/)
 [Utkarsh Singhal](https://scholar.google.com/citations?user=lvA86MYAAAAJ)
 [Ravi Ramamoorthi](https://cseweb.ucsd.edu//~ravir/)
 [Jon Barron](https://jonbarron.info/)
 [Ren Ng](https://www2.eecs.berkeley.edu/Faculty/Homepages/yirenng.html)

 | [![](https://img.shields.io/github/stars/tancik/fourier-feature-networks?style=social)](https://github.com/tancik/fourier-feature-networks) 

[](https://arxiv.org/abs/1806.07572)

[](https://proceedings.neurips.cc/paper/2020/hash/55053683268957697aa39fba6f231c68-Abstract.html), [](https://papers.nips.cc/paper/2007/hash/013a006f03dbc5392effeb8f18fda755-Abstract.html)

[project](https://bmild.github.io/fourfeat/)

[](https://youtu.be/nVA6K6Sn2S4)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/tancik/fourier-feature-networks/blob/master/Demo.ipynb) | 17.01.2023 |

| Demucs | Hybrid Spectrogram and Waveform Source Separation | [Alexandre Défossez](https://ai.honu.io/) | [![](https://img.shields.io/github/stars/facebookresearch/demucs?style=social)](https://github.com/facebookresearch/demucs) 

[](https://arxiv.org/abs/2111.03600), [](https://arxiv.org/abs/2010.01733), [](https://arxiv.org/abs/2109.05418), [](https://arxiv.org/abs/1805.02410)

[](https://github.com/adefossez/mdx21_demucs), [](https://github.com/CarlGao4/Demucs-Gui), [](https://github.com/kuielab/mdx-net-submission), [](https://github.com/f90/Wave-U-Net)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1dC9nVxk3V_VPjUADsnFu8EiT-xnU1tGH) | 21.11.2022 |

| MotionDiffuse | The first diffusion model-based text-driven motion generation framework, which demonstrates several desired properties over existing methods | 

[Mingyuan Zhang](https://mingyuan-zhang.github.io/)
 [Zhongang Cai](https://caizhongang.github.io/)
 [Liang Pan](https://github.com/paul007pl)
others[Fangzhou Hong](https://hongfz16.github.io/)
 [Xinying Guo](https://gxyes.github.io/)
 [Lei Yang](https://scholar.google.com/citations?user=jZH2IPYAAAAJ)
 [Ziwei Liu](https://liuziwei7.github.io/)

 | [![](https://img.shields.io/github/stars/mingyuan-zhang/MotionDiffuse?style=social)](https://github.com/mingyuan-zhang/MotionDiffuse) 

[](https://arxiv.org/abs/2208.15001)

[](https://huggingface.co/spaces/mingyuan/MotionDiffuse)

[project](https://mingyuan-zhang.github.io/projects/MotionDiffuse.html)

[](https://youtu.be/U5PTnw490SA)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1Dp6VsZp2ozKuu9ccMmsDjyij_vXfCYb3) | 13.10.2022 |

| PyMAF | Pyramidal Mesh Alignment Feedback loop in regression network for well-aligned body mesh recovery and extend it for the recovery of expressive full-body models | 

[Hongwen Zhang](https://hongwenzhang.github.io/)
 [Yating Tian](https://github.com/tinatiansjz)
 [Yuxiang Zhang](https://zhangyux15.github.io/)
others[Mengcheng Li](https://github.com/Dw1010)
 [Liang An](https://anl13.github.io/)
 [Zhenan Sun](http://www.cbsr.ia.ac.cn/users/znsun/)
 [Yebin Liu](https://www.liuyebin.com/)

 | [![](https://img.shields.io/github/stars/HongwenZhang/PyMAF?style=social)](https://github.com/HongwenZhang/PyMAF) 

[](https://arxiv.org/abs/2207.06400), [](https://arxiv.org/abs/2103.16507)

[](https://github.com/facebookresearch/eft), [](https://github.com/HongwenZhang/DaNet-DensePose2SMPL), [](https://github.com/facebookresearch/DensePose), [](https://github.com/Microsoft/human-pose-estimation.pytorch)

[project](https://www.liuyebin.com/pymaf-x/)

[](https://youtu.be/yqEmznSKjYI), [](https://youtu.be/ylOB0wCeV34)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/11RXLsH9BdoSCwY6G-IX7KgqDxVoImu6K) | 06.10.2022 |

| Functa | From data to functa: Your data point is a function and you can treat it like one | 

[Emilien Dupont](https://emiliendupont.github.io/)
 [Hyunjik Kim](https://hyunjik11.github.io/)
 [Ali Eslami](http://arkitus.com/)
others[Danilo Rezende](https://danilorezende.com/about/)
 [Dan Rosenbaum](https://danrsm.github.io/)

 | [![](https://img.shields.io/github/stars/deepmind/functa?style=social)](https://github.com/deepmind/functa) 

[](https://arxiv.org/abs/2201.12204)

[](https://github.com/sxyu/pixel-nerf), [](https://github.com/deepmind/jaxline)

[](https://www.tensorflow.org/datasets/catalog/celeb_a_hq)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/deepmind/functa/blob/main/modulation_visualization_colab.ipynb) | 24.09.2022 |

| Whisper | Automatic speech recognition system trained on 680,000 hours of multilingual and multitask supervised data collected from the web | 

[Alec Radford](http://newmu.github.io/)
 [Jong Wook Kim](https://jongwook.kim/)
 [Tao Xu](https://github.com/bayesian)
others[Greg Brockman](https://gregbrockman.com/)
 [Christine McLeavey](http://christinemcleavey.com/)
 [Ilya Sutskever](http://www.cs.toronto.edu/~ilya/)

 | [![](https://img.shields.io/github/stars/openai/whisper?style=social)](https://github.com/openai/whisper) 

[](https://arxiv.org/abs/2212.04356)

[blog post](https://openai.com/research/whisper)

[](https://github.com/kkroening/ffmpeg-python)

[](https://youtu.be/OCBZtgQGt1I), [](https://youtu.be/8SQV-B83tPU), [](https://youtu.be/nE5iVtwKerA)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/openai/whisper/blob/master/notebooks/LibriSpeech.ipynb) | 21.09.2022 |

| DeOldify (video) | Colorize your own videos! | [Jason Antic](https://github.com/jantic) | [![](https://img.shields.io/github/stars/jantic/DeOldify?style=social)](https://github.com/jantic/DeOldify) 

[](https://arxiv.org/abs/1805.08318), [](https://arxiv.org/abs/1706.08500)

[](https://medium.com/element-ai-research-lab/stabilizing-neural-style-transfer-for-video-62675e203e42)

[model](https://data.deepai.org/deoldify/ColorizeVideo_gen.pth)

[](https://www.reddit.com/r/Nickelodeons/), [](https://www.reddit.com/r/silentmoviegifs/)

[](https://twitter.com/DeOldify)

[website](https://deoldify.ai/)

[](http://www.youtube.com/watch?v=l3UXXid04Ys), [](http://www.youtube.com/watch?v=EXn-n2iqEjI)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/jantic/DeOldify/blob/master/VideoColorizerColab.ipynb) | 19.09.2022 |

| DeOldify (photo) | Colorize your own photos! | 

[Jason Antic](https://github.com/jantic)
 [Matt Robinson](https://github.com/mc-robinson)
 [María Benavente](https://github.com/mariabg)

 | [![](https://img.shields.io/github/stars/jantic/DeOldify?style=social)](https://github.com/jantic/DeOldify) 

[](https://arxiv.org/abs/1805.08318), [](https://arxiv.org/abs/1706.08500)

[model](https://data.deepai.org/deoldify/ColorizeArtistic_gen.pth)

[](https://www.reddit.com/r/TheWayWeWere/)

[](https://twitter.com/DeOldify)

[website](https://deoldify.ai/)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/jantic/DeOldify/blob/master/ImageColorizerColab.ipynb) | 19.09.2022 |

| Decision Transformers | An architecture that casts the problem of RL as conditional sequence modeling | 

[Lili Chen](http://www.lilichen.me/)
 [Kevin Lu](https://kzl.github.io/)
 [Aravind Rajeswaran](https://aravindr93.github.io/)
others[Kimin Lee](https://sites.google.com/view/kiminlee)
 [Aditya Grover](https://aditya-grover.github.io/)
 [Michael Laskin](https://www.mishalaskin.com/)
 [Pieter Abbeel](http://people.eecs.berkeley.edu/~pabbeel/)
 [Aravind Srinivas](https://github.com/aravindsrinivas)
 [Igor Mordatch](https://scholar.google.com/citations?user=Vzr1RukAAAAJ)

 | [![](https://img.shields.io/github/stars/kzl/decision-transformer?style=social)](https://github.com/kzl/decision-transformer) 

[](https://arxiv.org/abs/2106.01345)

[](https://huggingface.co/models?other=gym-continous-control), [](https://huggingface.co/edbeeching/decision-transformer-gym-hopper-expert), [](https://huggingface.co/docs/transformers/model_doc/decision_transformer)

[project](https://sites.google.com/berkeley.edu/decision-transformer)

[](https://en.wikipedia.org/wiki/Autoregressive_model)

[](https://youtu.be/k08N5a0gG0A), [](https://youtu.be/-buULmf7dec), [](https://youtu.be/83QN9S-0I84), [](https://youtu.be/w4Bw8WYL8Ps)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1K3UuajwoPY1MzRKNkONNRS3gS5DxZ-qF) | 06.09.2022 |

| textual-inversion | An Image is Worth One Word: Personalizing Text-to-Image Generation using Textual Inversion | 

[Rinon Gal](https://rinongal.github.io/)
 [Yuval Alaluf](https://yuval-alaluf.github.io/)
 [Yuval Atzmon](https://research.nvidia.com/person/yuval-atzmon)
others[Or Patashnik](https://orpatashnik.github.io/)
 [Amit Bermano](https://www.cs.tau.ac.il/~amberman/)
 [Gal Chechik](https://research.nvidia.com/person/gal-chechik)
 [Daniel Cohen-Or](https://danielcohenor.com/)

 | [![](https://img.shields.io/github/stars/rinongal/textual_inversion?style=social)](https://github.com/rinongal/textual_inversion) 

[](https://arxiv.org/abs/2208.01618)

[project](https://textual-inversion.github.io/)

[](https://youtu.be/f3oXa7_SYek), [](https://youtu.be/opD_H9bED9Y)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/rinongal/textual_inversion/blob/master/scripts/latent_imagenet_diffusion.ipynb) | 21.08.2022 |

| Make-A-Scene | Scene-Based Text-to-Image Generation with Human Priors | 

[Oran Gafni](https://github.com/ogafni)
 [Adam Polyak](https://scholar.google.com/citations?user=CP62OTMAAAAJ)
 [Oron Ashual](https://scholar.google.com/citations?user=CUA9JCkAAAAJ)
others[Shelly Sheynin](https://github.com/shellysheynin)
 [Devi Parikh](https://faculty.cc.gatech.edu/~parikh/)
 [Yaniv Taigman](https://ytaigman.github.io/)

 | [![](https://img.shields.io/github/stars/CasualGANPapers/Make-A-Scene?style=social)](https://github.com/CasualGANPapers/Make-A-Scene) 

[](https://arxiv.org/abs/2203.13131)

[](https://youtu.be/ZM06MjPdoxw)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1SPyQ-epTsAOAu8BEohUokN4-b5RM_TnE) | 12.08.2022 |

| YOLOv7 | Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors | 

[Chien-Yao Wang](https://scholar.google.com/citations?user=DkQh4M4AAAAJ)
 [Alexey Bochkovskiy](http://www.alexeyab.com/)
 [Mark Liao](https://www.iis.sinica.edu.tw/pages/liao/)

 | [![](https://img.shields.io/github/stars/WongKinYiu/yolov7?style=social)](https://github.com/WongKinYiu/yolov7) 

[](https://arxiv.org/abs/2207.02696)

[data](http://images.cocodataset.org/annotations/annotations_trainval2017.zip), [data](http://images.cocodataset.org/zips/train2017.zip), [data](http://images.cocodataset.org/zips/val2017.zip), [data](https://github.com/WongKinYiu/yolov7/releases/download/v0.1/coco2017labels-segments.zip)

[](https://github.com/WongKinYiu/yolor), [](https://github.com/WongKinYiu/PyTorch_YOLOv4), [](https://github.com/WongKinYiu/ScaledYOLOv4), [](https://github.com/Megvii-BaseDetection/YOLOX), [](https://github.com/DingXiaoH/RepVGG), [](https://github.com/JUGGHM/OREPA_CVPR2022), [](https://github.com/TexasInstruments/edgeai-yolov5/tree/yolo-pose)

[](https://paperswithcode.com/sota/real-time-object-detection-on-coco?p=yolov7-trainable-bag-of-freebies-sets-new)

[](https://www.youtube.com/playlist?list=PL_Nji0JOuXg2QMohGK7wfzgJ-MavzXRHW), [](https://youtu.be/-QWxJ0j9EY8)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/WongKinYiu/yolov7/blob/main/tools/compare_YOLOv7_vs_YOLOv5m6_half.ipynb) | 09.08.2022 |

| OPT | Open Pre-trained Transformers is a family of NLP models trained on billions of tokens of text obtained from the internet | 

[Susan Zhang](https://github.com/suchenzang)
 [Stephen Roller](https://stephenroller.com/)
 [Naman Goyal](https://github.com/ngoyal2707)
others[Mikel Artetxe](https://github.com/artetxem)
 [Moya Chen](https://moyachen.com/)
 [Christopher Dewan](https://github.com/m3rlin45)
 [Mona Diab](https://scholar.google.com/citations?user=-y6SIhQAAAAJ)
 [Xi Victoria Lin](http://victorialin.net/)
 [Todor Mihaylov](https://github.com/tbmihailov)
 [Myle Ott](https://myleott.com/)
 [Sam Shleifer](https://github.com/sshleifer)
 [Kurt Shuster](https://github.com/klshuster)
 [Daniel Simig](https://scholar.google.com/citations?user=TtWU9fsAAAAJ)
 [Punit Singh Koura](https://github.com/punitkoura)
 [Anjali Sridhar](https://www.linkedin.com/in/anjalisridhar/)
 [Tianlu Wang](https://tianlu-wang.github.io/)
 [Luke Zettlemoyer](https://www.cs.washington.edu/people/faculty/lsz/)

 | [![](https://img.shields.io/github/stars/facebookresearch/metaseq?style=social)](https://github.com/facebookresearch/metaseq/tree/main/projects/OPT) 

[](https://arxiv.org/abs/2205.01068), [](https://arxiv.org/abs/1906.02243), [](https://arxiv.org/abs/2104.10350), [](https://arxiv.org/abs/2201.11990)

[blog post](https://ai.facebook.com/blog/democratizing-access-to-large-scale-language-models-with-opt-175b/)

[](https://github.com/NVIDIA/Megatron-LM)

[](https://youtu.be/Ejg0OunCi9U)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/14wnxMvD9zsiBQo2FtTpxn6w2cpXCcb-7) | 29.06.2022 |

| Customizing a Transformer Encoder | We will learn how to customize the encoder to employ new network architectures | [Chen Chen](https://github.com/chenGitHuber) | [![](https://img.shields.io/github/stars/tensorflow/models?style=social)](https://github.com/tensorflow/models/tree/master/official/nlp/modeling) 

[](https://arxiv.org/abs/1706.03762)

[](https://github.com/tensorflow/models/blob/master/official/nlp/modeling/networks/encoder_scaffold.py)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/tensorflow/models/blob/master/official/colab/nlp/customize_encoder.ipynb) | 22.06.2022 |

| SimCTG | Contrastive training objective to calibrate the model's representation space, and a decoding method -- contrastive search -- to encourage diversity while maintaining coherence in the generated text | 

[Yixuan Su](https://yxuansu.github.io/)
 [Tian Lan](https://github.com/gmftbyGMFTBY)
 [Yan Wang](https://libertywing.github.io/yanwang.github.io/)
others[Dani Yogatama](https://dyogatama.github.io/)
 [Lingpeng Kong](https://ikekonglp.github.io/)
 [Nigel Collier](https://github.com/nhcollier)

 | [![](https://img.shields.io/github/stars/yxuansu/SimCTG?style=social)](https://github.com/yxuansu/SimCTG) 

[](https://arxiv.org/abs/2202.06417), [](https://arxiv.org/abs/2210.14140)

[](https://github.com/yxuansu/Contrastive_Search_versus_Contrastive_Decoding), [](https://github.com/yxuansu/Contrastive_Search_Is_What_You_Need)

[](https://huggingface.co/blog/introducing-csearch), [](https://huggingface.co/spaces/joaogante/contrastive_search_generation), [](https://huggingface.co/docs/transformers/model_doc/opt)

[](https://proceedings.neurips.cc/paper_files/paper/2022/hash/871cae8f599cb8bbfcb0f58fe1af95ad-Abstract-Conference.html)

[](https://pypi.org/project/simctg/)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1ImvR-ldHf9rJyFzOCMJ_zjAGK8n1iTW7) | 04.06.2022 |

| T0 | Multitask Prompted Training Enables Zero-Shot Task Generalization | 

[Victor Sanh](https://github.com/VictorSanh)
 [Albert Webson](https://representation.ai/)
 [Colin Raffel](https://colinraffel.com//)
others[Stephen Bach](http://cs.brown.edu/people/sbach/)
 [Lintang Sutawika](https://github.com/lintangsutawika)
 [Zaid Alyafeai](https://github.com/zaidalyafeai)
 [Antoine Chaffin](https://antoine.chaffin.fr/)
 [Arnaud Stiegler](https://github.com/arnaudstiegler)
 [Teven Scao](https://scholar.google.com/citations?user=ik0_vxsAAAAJ)
 [Arun Raja](https://www.arunraja.dev/)
 [Manan Dey](https://github.com/manandey)
 [M Saiful Bari](https://sbmaruf.github.io/)
 [Canwen Xu](https://www.canwenxu.net/)
 [Urmish Thakker](https://github.com/Urmish)
 [Shanya Sharma](https://shanyas10.github.io/)
 [Eliza Szczechla](https://elsanns.github.io/)
 [Taewoon Kim](https://tae898.github.io/)
 [Gunjan Chhablani](https://gchhablani.github.io/)
 [Nihal Nayak](https://nihalnayak.github.io/)
 [Debajyoti Datta](http://debajyotidatta.github.io/)
 [Jonathan Chang](https://github.com/cccntu/)
 [Mike Tian-Jian Jiang](https://github.com/tianjianjiang)
 [Matteo Manica](https://github.com/drugilsberg)
 [Sheng Shen](https://sincerass.github.io/)
 [Zheng Xin Yong](https://yongzx.github.io/)
 [Harshit Pandey](https://scholar.google.com/citations?user=BPIs78gAAAAJ)
 [Rachel Bawden](https://rbawden.github.io/)
 [Trishala Neeraj](https://github.com/trishalaneeraj)
 [Jos Rozen](https://scholar.google.com/citations?user=OxEDKogAAAAJ)
 [Abheesht Sharma](https://github.com/abheesht-sharma)
 [Andrea Santilli](https://teelinsan.github.io/)
 [Thibault Fevry](http://thibaultfevry.com/)
 [Jason Alan Fries](https://web.stanford.edu/~jfries/)
 [Ryan Teehan](https://github.com/rteehas)
 [Stella Biderman](https://www.stellabiderman.com/)
 [Leo Gao](https://github.com/leogao2)
 [Tali Bers](https://github.com/tbers-coursera)
 [Thomas Wolf](https://thomwolf.io/)
 [Alexander Rush](https://scholar.google.com/citations?user=LIjnUGgAAAAJ)

 | [![](https://img.shields.io/github/stars/bigscience-workshop/promptsource?style=social)](https://github.com/bigscience-workshop/promptsource) 

[](https://arxiv.org/abs/2110.08207)

[](https://youtu.be/iJ0IVZgGjTM), [](https://youtu.be/YToXXfrIu6w)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1xx7SgdLaAu23YFBirXmaQViDr8caowX_) | 29.05.2022 |

| Text2Mesh | Text-Driven Neural Stylization for Meshes | 

[Oscar Michel](https://ojmichel.github.io/)
 [Roi Bar-On](https://github.com/roibaron)
 [Richard Liu](https://github.com/factoryofthesun)
others[Sagie Benaim](https://sagiebenaim.github.io/)
 [Rana Hanocka](http://people.cs.uchicago.edu/~ranahanocka/)

 | [![](https://img.shields.io/github/stars/threedle/text2mesh?style=social)](https://github.com/threedle/text2mesh) 

[CLIP](https://openai.com/blog/clip/)

[](https://arxiv.org/abs/2112.03221)

[](https://www.kaggle.com/code/neverix/text2mesh/notebook)

[project](https://threedle.github.io/text2mesh/)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/threedle/text2mesh/blob/master/colab_demo.ipynb) | 14.05.2022 |

| T5 | Text-To-Text Transfer Transformer | 

[Colin Raffel](https://colinraffel.com/)
 [Noam Shazeer](https://scholar.google.com/citations?user=wsGvgA8AAAAJ)
 [Adam Roberts](https://github.com/adarob)
others[Katherine Lee](https://github.com/katelee168)
 [Sharan Narang](https://github.com/sharannarang)
 [Michael Matena](https://scholar.google.com/citations?user=rN_9vroAAAAJ)
 [Yanqi Zhou](https://zhouyanqi.github.io)
 [Wei Li](https://research.google/people/106528/)
 [Peter J. Liu](https://scholar.google.com/citations?user=1EPxhywAAAAJ)

 | [![](https://img.shields.io/github/stars/google-research/text-to-text-transfer-transformer?style=social)](https://github.com/google-research/text-to-text-transfer-transformer) 

[](https://arxiv.org/abs/1910.10683)

[](https://github.com/tensorflow/mesh/tree/master/mesh_tensorflow/transformer)

[](https://www.tensorflow.org/datasets)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/google-research/text-to-text-transfer-transformer/blob/main/notebooks/t5-trivia.ipynb) | 11.05.2022 |

| XLS-R | Self-supervised Cross-lingual Speech Representation Learning at Scale | 

[Arun Babu](https://github.com/arbabu123)
 [Changhan Wang](https://www.changhan.me/)
 [Andros Tjandra](https://github.com/androstj)
others[Kushal Lakhotia](https://about.me/hikushalhere)
 [Qiantong Xu](https://github.com/xuqiantong)
 [Naman Goyal](https://github.com/ngoyal2707)
 [Kritika Singh](https://scholar.google.com/citations?user=Ltk3SykAAAAJ)
 [Patrick von Platen](https://github.com/patrickvonplaten)
 [Yatharth Saraf](https://scholar.google.com/citations?user=KJTtNJwAAAAJ)
 [Juan Pino](https://scholar.google.com/citations?user=weU_-4IAAAAJ)
 [Alexei Baevski](https://github.com/alexeib)
 [Alexis Conneau](https://github.com/aconneau)
 [Michael Auli](https://github.com/michaelauli)

 | [![](https://img.shields.io/github/stars/facebookresearch/fairseq?style=social)](https://github.com/facebookresearch/fairseq/blob/main/examples/wav2vec/xlsr/README.md) 

[](https://arxiv.org/abs/2111.09296)

[blog post](https://huggingface.co/blog/fine-tune-xlsr-wav2vec2)

[](https://github.com/facebookresearch/fairscale)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/patrickvonplaten/notebooks/blob/master/Fine_Tune_XLS_R_on_Common_Voice.ipynb) | 10.05.2022 |

| MAGIC | Training-free framework, iMAge-Guided text generatIon with CLIP, for plugging in visual controls in the generation process and enabling LMs to perform multimodal tasks in a zero-shot manner | 

[Yixuan Su](https://yxuansu.github.io/)
 [Tian Lan](https://github.com/gmftbyGMFTBY)
 [Yahui Liu](https://yhlleo.github.io/)
others[Fangyu Liu](https://fangyuliu.me/about)
 [Dani Yogatama](https://dyogatama.github.io/)
 [Yan Wang](https://libertywing.github.io/yanwang.github.io/)
 [Lingpeng Kong](https://www.cs.cmu.edu/~lingpenk/)
 [Nigel Collier](https://sites.google.com/site/nhcollier/)

 | [![](https://img.shields.io/github/stars/yxuansu/magic?style=social)](https://github.com/yxuansu/magic) [](https://arxiv.org/abs/2205.02655) | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1NDVkKpanbsaUwecHoRp_2kIpMztOFW25) | 02.05.2022 |

| DiffCSE | Unsupervised contrastive learning framework for learning sentence embeddings | 

[Yung-Sung Chuang](https://people.csail.mit.edu/yungsung/)
 [Rumen Dangovski](http://super-ms.mit.edu/rumen.html)
 [Hongyin Luo](https://luohongyin.github.io/)
others[Yang Zhang](https://mitibmwatsonailab.mit.edu/people/yang-zhang/)
 [Shiyu Chang](https://code-terminator.github.io/)
 [Marin Soljačić](http://www.mit.edu/~soljacic/marin.html)
 [Shang-Wen Li](https://swdanielli.github.io/)
 [Scott Wen-tau Yih](https://scottyih.org/)
 [Yoon Kim](https://people.csail.mit.edu/yoonkim/)
 [James Glass](http://groups.csail.mit.edu/sls/people/glass.shtml)

 | [![](https://img.shields.io/github/stars/voidism/diffcse?style=social)](https://github.com/voidism/diffcse) 

[](https://arxiv.org/abs/2204.10298), [](https://arxiv.org/abs/2104.08821), [](https://arxiv.org/abs/2111.00899)

[](https://github.com/princeton-nlp/SimCSE)

[](https://huggingface.co/voidism)

[](https://twitter.com/YungSungChuang/status/1517518077902000129)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/voidism/DiffCSE/blob/master/diffcse_evaluation.ipynb) | 24.04.2022 |

| ViDT+ | An Extendable, Efficient and Effective Transformer-based Object Detector | 

[Hwanjun Song](https://songhwanjun.github.io/)
 [Deqing Sun](https://deqings.github.io/)
 [Sanghyuk Chun](https://sanghyukchun.github.io/home/)
others[Varun Jampani](https://varunjampani.github.io/)
 [Dongyoon Han](https://sites.google.com/site/dyhan0920/)
 [Byeongho Heo](https://sites.google.com/view/byeongho-heo/home)
 [Wonjae Kim](https://wonjae.kim/)
 [Ming-Hsuan Yang](http://faculty.ucmerced.edu/mhyang/)

 | [![](https://img.shields.io/github/stars/naver-ai/vidt?style=social)](https://github.com/naver-ai/vidt/tree/vidt-plus) 

[](https://arxiv.org/abs/2204.07962), [](https://arxiv.org/abs/2110.03921)

[](https://github.com/fundamentalvision/Deformable-DETR), [](https://github.com/EherSenaw/ViDT_colab)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/EherSenaw/ViDT_colab/blob/main/vidt_colab.ipynb) | 20.04.2022 |

| GP-UNIT | Novel framework, Generative Prior-guided UNsupervised Image-to-image Translation, to improve the overall quality and applicability of the translation algorithm | 

[Shuai Yang](https://williamyang1991.github.io/)
 [Liming Jiang](https://liming-jiang.com/)
 [Ziwei Liu](https://liuziwei7.github.io/)
 [Chen Change Loy](https://www.mmlab-ntu.com/person/ccloy/)

 | [![](https://img.shields.io/github/stars/williamyang1991/GP-UNIT?style=social)](https://github.com/williamyang1991/GP-UNIT) 

[ImageNet](https://image-net.org/download.php)

[](https://arxiv.org/abs/2204.03641)

[](https://github.com/clovaai/stargan-v2#datasets-and-pre-trained-networks), [](https://github.com/switchablenorms/CelebAMask-HQ), [](https://github.com/NVlabs/metfaces-dataset), [](https://github.com/TreB1eN/InsightFace_Pytorch), [](https://github.com/NVlabs/SPADE), [](https://github.com/nvlabs/imaginaire), [](https://doi.org/10.1109/CVPR52688.2022.01779)

[project](https://www.mmlab-ntu.com/project/gpunit/)

[](https://youtu.be/dDApWs_oDrM)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/williamyang1991/GP-UNIT/blob/main/notebooks/inference_playground.ipynb) | 02.04.2022 |

| CLIPasso | Semantically-Aware Object Sketching | 

[Yael Vinker](https://yaelvi116.wixsite.com/mysite)
 [Ehsan Pajouheshgar](https://pajouheshgar.github.io/)
 [Jessica Y. Bo](https://jessica-bo.github.io/)
others[Roman Bachmann](https://roman-bachmann.github.io/)
 [Amit Bermano](https://www.cs.tau.ac.il/~amberman/)
 [Daniel Cohen-Or](https://danielcohenor.com/)
 [Amir Zamir](https://vilab.epfl.ch/zamir/)
 [Ariel Shamir](https://faculty.runi.ac.il/arik/site/index.asp)

 | [![](https://img.shields.io/github/stars/yael-vinker/CLIPasso?style=social)](https://github.com/yael-vinker/CLIPasso) 

[](https://arxiv.org/abs/2202.05822), [](https://arxiv.org/abs/2106.14843)

[demo](https://replicate.com/yael-vinker/clipasso)

[](https://github.com/BachiLi/diffvg)

[project](https://clipasso.github.io/clipasso/)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/yael-vinker/CLIPasso/blob/main/CLIPasso.ipynb) | 21.03.2022 |

| Disentangled Lifespan Face Synthesis | LFS model is proposed to disentangle the key face characteristics including shape, texture and identity so that the unique shape and texture age transformations can be modeled effectively | 

[Sen He](https://senhe.github.io/)
 [Wentong Liao](https://www.tnt.uni-hannover.de/en/staff/liao/)
 [Michael Yang](https://sites.google.com/site/michaelyingyang/)
others[Yi-Zhe Song](http://personal.ee.surrey.ac.uk/Personal/Y.Song/)
 [Bodo Rosenhahn](https://scholar.google.com/citations?user=qq3TxtcAAAAJ)
 [Tao Xiang](http://personal.ee.surrey.ac.uk/Personal/T.Xiang/index.html)

 | [![](https://img.shields.io/github/stars/SenHe/DLFS?style=social)](https://github.com/SenHe/DLFS) 

[](https://arxiv.org/abs/2108.02874)

[project](https://senhe.github.io/projects/iccv_2021_lifespan_face/)

[](https://www.youtube.com/watch?v=uklX03ns0m0)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1fgVAoxCSaqPkj0rUK4RmBh7GTQRqLNpE) | 22.02.2022 |

| ClipCap | CLIP Prefix for Image Captioning | 

[Ron Mokady](https://rmokady.github.io/)
 [Amir Hertz](https://github.com/amirhertz)
 [Amit Bermano](https://www.cs.tau.ac.il/~amberman/)

 | [![](https://img.shields.io/github/stars/rmokady/CLIP_prefix_caption?style=social)](https://github.com/rmokady/CLIP_prefix_caption) 

[](https://arxiv.org/abs/2111.09734)

[data](https://cocodataset.org/)

[](https://huggingface.co/spaces/akhaliq/CLIP_prefix_captioning)

[](https://medium.com/@uppalamukesh/clipcap-clip-prefix-for-image-captioning-3970c73573bc)

[](https://youtu.be/VQDrmuccWDo)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/rmokady/CLIP_prefix_caption/blob/main/notebooks/clip_prefix_captioning_inference.ipynb#scrollTo=glBzYsgIwhwF) | 15.02.2022 |

| Pose with Style | Detail-Preserving Pose-Guided Image Synthesis with Conditional StyleGAN | 

[Badour AlBahar](https://badouralbahar.github.io/)
 [Jingwan Lu](https://research.adobe.com/person/jingwan-lu/)
 [Jimei Yang](https://github.com/jimeiyang)
others[Zhixin Shu](https://zhixinshu.github.io/)
 [Eli Shechtman](https://research.adobe.com/person/eli-shechtman/)
 [Jia-Bin Huang](https://jbhuang0604.github.io/)

 | [![](https://img.shields.io/github/stars/BadourAlBahar/pose-with-style?style=social)](https://github.com/BadourAlBahar/pose-with-style) 

[](https://arxiv.org/abs/2109.06166)

[](https://github.com/rosinality/stylegan2-pytorch)

[project](https://pose-with-style.github.io/)

[](https://youtu.be/d_ETeAVLilw)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/tg-bomze/collection-of-notebooks/blob/master/HomeStylist.ipynb) | 19.01.2022 |

| diffsort | Differentiable Sorting Networks | 

[Felix Petersen](http://petersen.ai/)
 [Christian Borgelt](https://borgelt.net/)
 [Hilde Kuehne](https://hildekuehne.github.io/)
 [Oliver Deussen](https://www.cgmi.uni-konstanz.de/personen/prof-dr-oliver-deussen/)

 | [![](https://img.shields.io/github/stars/Felix-Petersen/diffsort?style=social)](https://github.com/Felix-Petersen/diffsort) 

[](https://arxiv.org/abs/2105.04019), [](https://arxiv.org/abs/2203.09630)

[](https://youtu.be/Rl-sFaE1z4M)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1q0TZFFYB9FlOJYWKt0_7ZaXQT190anhm) | 17.01.2022 |

| GLIDE | Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models | 

[Alex Nichol](https://aqnichol.com/)
 [Prafulla Dhariwal](https://github.com/prafullasd)
 [Aditya Ramesh](http://adityaramesh.com/)
others[Pranav Shyam](https://github.com/pranv)
 [Pamela Mishkin](https://manlikemishap.github.io/)
 [Bob McGrew](https://github.com/bmcgrew)
 [Ilya Sutskever](http://www.cs.utoronto.ca/~ilya/)
 [Mark Chen](https://scholar.google.com/citations?user=5fU-QMwAAAAJ)

 | [![](https://img.shields.io/github/stars/openai/glide-text2im?style=social)](https://github.com/openai/glide-text2im) 

[](https://arxiv.org/abs/2112.10741)

[](https://youtu.be/ItKi3h7IY2o)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/openai/glide-text2im/blob/master/notebooks/inpaint.ipynb) | 22.12.2021 |

| SOAT | StyleGAN of All Trades: Image Manipulation with Only Pretrained StyleGAN | 

[Min Jin Chong](https://mchong6.github.io/)
 [Hsin-Ying Lee](http://hsinyinglee.com/)
 [David Forsyth](http://luthuli.cs.uiuc.edu/~daf/)

 | [![](https://img.shields.io/github/stars/mchong6/SOAT?style=social)](https://github.com/mchong6/SOAT) 

[](https://arxiv.org/abs/2111.01619)

[](https://github.com/justinpinkney/toonify), [](https://github.com/rosinality/stylegan2-pytorch)

[](https://huggingface.co/spaces/akhaliq/SOAT)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/mchong6/SOAT/blob/master/infinity.ipynb) | 13.11.2021 |

| Arnheim | Generative Art Using Neural Visual Grammars and Dual Encoders | 

[Chrisantha Fernando](https://www.chrisantha.co.uk/)
 [Ali Eslami](http://arkitus.com/)
 [Jean-Baptiste Alayrac](https://www.jbalayrac.com/)
others[Piotr Mirowski](https://piotrmirowski.com/)
 [Dylan Banarse](https://www.2ne1.com/)
 [Simon Osindero](https://scholar.google.com/citations?user=Jq8ZS5kAAAAJ)

 | [![](https://img.shields.io/github/stars/deepmind/arnheim?style=social)](https://github.com/deepmind/arnheim) 

[](https://arxiv.org/abs/2105.00162), [](https://arxiv.org/abs/2106.14843), [](https://arxiv.org/abs/1801.07729), [](https://arxiv.org/abs/1606.02580), [](https://arxiv.org/abs/1609.09106)

[](https://github.com/openai/dall-e)

[](https://en.wikipedia.org/wiki/Compositional_pattern-producing_network)

[](https://www.youtube.com/watch?v=U7guaMdeF4g), [](https://www.youtube.com/watch?v=zh0goLbS-l0), [](https://www.youtube.com/watch?v=SYJGNt7yu6M), [](https://www.youtube.com/watch?v=MxkYKa0x5AU)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/deepmind/arnheim/blob/master/arnheim_2.ipynb) | 11.11.2021 |

| GPT-2 | Retrain an advanced text generating neural network on any text dataset using gpt-2-simple! | [Max Woolf](https://minimaxir.com/) | [![](https://img.shields.io/github/stars/openai/gpt-2?style=social)](https://github.com/openai/gpt-2) 

[blog post](https://minimaxir.com/2019/09/howto-gpt2/), [blog post](https://openai.com/research/better-language-models)

[](https://github.com/minimaxir/gpt-2-simple)

[](https://www.reddit.com/r/MachineLearning/comments/aqlzde/r_openai_better_language_models_and_their/)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1VLG8e7YSEwypxU-noRNhsv5dW4NfTGce) | 18.10.2021 |

| ConvMixer | An extremely simple model that is similar in spirit to the ViT and the even-more-basic MLP-Mixer in that it operates directly on patches as input, separates the mixing of spatial and channel dimensions, and maintains equal size and resolution throughout the network | 

[Asher Trockman](http://ashertrockman.com/)
 [Zico Kolter](http://zicokolter.com/)

 | [![](https://img.shields.io/github/stars/locuslab/convmixer?style=social)](https://github.com/locuslab/convmixer) 

[](https://arxiv.org/abs/2201.09792)

[](https://github.com/locuslab/convmixer-cifar10), [](https://github.com/rwightman/pytorch-image-models)

[](https://medium.com/codex/an-overview-on-convmixer-patches-are-all-you-need-8502a8d87011)

[](https://youtu.be/Gl0s0GDqN3c?t=990)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/locuslab/convmixer/blob/main/pytorch-image-models/notebooks/EffResNetComparison.ipynb) | 06.10.2021 |

| IC-GAN | Instance-Conditioned GAN | 

[Arantxa Casanova](https://github.com/ArantxaCasanova)
 [Marlène Careil](https://www.linkedin.com/in/marl%C3%A8ne-careil-901804155)
 [Jakob Verbeek](http://thoth.inrialpes.fr/~verbeek/)
others[Michał Drożdżal](https://scholar.google.com/citations?user=XK_ktwQAAAAJ)
 [Adriana Romero-Soriano](https://sites.google.com/site/adriromsor)

 | [![](https://img.shields.io/github/stars/facebookresearch/ic_gan?style=social)](https://github.com/facebookresearch/ic_gan) 

[](https://arxiv.org/abs/2109.05070)

[blog post](https://ai.facebook.com/blog/instance-conditioned-gans/)

[](https://github.com/facebookresearch/faiss), [](https://github.com/ajbrock/BigGAN-PyTorch), [](https://github.com/NVlabs/stylegan2-ada-pytorch), [](https://github.com/bioinf-jku/TTUR), [](https://github.com/mit-han-lab/data-efficient-gans)

[](https://proceedings.neurips.cc/paper/2021/hash/e7ac288b0f2d41445904d071ba37aaff-Abstract.html)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/facebookresearch/ic_gan/blob/master/inference/icgan_colab.ipynb) | 01.10.2021 |

| VITS | Parallel end-to-end TTS method that generates more natural sounding audio than current two-stage models | 

[Jaehyeon Kim](https://jaywalnut310.github.io/)
 [Jungil Kong](https://github.com/jik876)
 [Juhee Son](https://juheeuu.github.io/)

 | [![](https://img.shields.io/github/stars/jaywalnut310/vits?style=social)](https://github.com/jaywalnut310/vits) 

[](https://arxiv.org/abs/2106.06103)

[demo](https://jaywalnut310.github.io/vits-demo/)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1CO61pZizDj7en71NQG_aqqKdGaA_SaBf) | 23.08.2021 |

| CogView | Mastering Text-to-Image Generation via Transformers | 

[Ming Ding](https://scholar.google.com/citations?user=Va50YzkAAAAJ)
 [Zhuoyi Yang](https://scholar.google.com/citations?user=tgAt-gEAAAAJ)
 [Wenyi Hong](https://github.com/wenyihong)
others[Wendi Zheng](https://github.com/minkowski0125)
 [Chang Zhou](https://scholar.google.com/citations?user=QeSoG3sAAAAJ)
 [Junyang Lin](https://justinlin610.github.io/)
 [Xu Zou](http://xuzou.cn/)
 [Zhou Shao](https://www.researchgate.net/profile/Shao_Zhou4)
 [Hongxia Yang](https://sites.google.com/site/hystatistics/home)
 [Jie Tang](https://keg.cs.tsinghua.edu.cn/jietang/)

 | [![](https://img.shields.io/github/stars/THUDM/CogView?style=social)](https://github.com/THUDM/CogView) 

[](https://arxiv.org/abs/2105.13290)

[demo](https://thudm.github.io/CogView/index.html)

[](https://github.com/NVIDIA/apex), [](https://github.com/Sleepychord/cogdata)

[](https://towardsdatascience.com/cogview-image-generation-and-language-modelling-at-scale-8d358a0686d2)

[](https://proceedings.neurips.cc/paper/2021/hash/a4d92e2cd541fca87e4620aba658316d-Abstract.html)

[](https://www.reddit.com/r/MachineLearning/comments/nmxsd8/r_cogview_mastering_texttoimage_generation_via/)

[](https://youtu.be/Cw1r8ACIj8U)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1Bi2TnSUp2vNiSUhamsNuC4HqkZ2J4WwZ) | 21.06.2021 |

| GANs N' Roses | Stable, Controllable, Diverse Image to Image Translation | 

[Min Jin Chong](https://mchong6.github.io/)
 [David Forsyth](http://luthuli.cs.uiuc.edu/~daf/)

 | [![](https://img.shields.io/github/stars/mchong6/GANsNRoses?style=social)](https://github.com/mchong6/GANsNRoses) 

[](https://arxiv.org/abs/2106.06561), [](https://arxiv.org/abs/2007.06600)

[](https://github.com/rosinality/stylegan2-pytorch), [](https://github.com/znxlwm/UGATIT-pytorch)

[](https://youtu.be/VNg0NyCGl_4)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/mchong6/GANsNRoses/blob/master/inference_colab.ipynb) | 19.06.2021 |

| Score SDE | Score-Based Generative Modeling through Stochastic Differential Equations | 

[Yang Song](https://yang-song.net/)
 [Jascha Sohl-Dickstein](http://www.sohldickstein.com/)
 [Diederik Kingma](http://dpkingma.com/)
others[Abhishek Kumar](https://abhishek.umiacs.io/)
 [Stefano Ermon](https://cs.stanford.edu/~ermon/)
 [Ben Poole](https://cs.stanford.edu/~poole/)

 | [![](https://img.shields.io/github/stars/yang-song/score_sde?style=social)](https://github.com/yang-song/score_sde) 

[](https://arxiv.org/abs/2011.13456), [](https://arxiv.org/abs/1907.05600), [](https://arxiv.org/abs/2006.09011), [](https://arxiv.org/abs/2006.11239)

[](https://github.com/yang-song/score_sde_pytorch), [](https://github.com/google/ml_collections)

[](https://youtu.be/L9ZegT87QK8)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/yang-song/score_sde/blob/main/Score_SDE_demo.ipynb) | 18.03.2021 |

| Talking Head Anime from a Single Image | The network takes as input an image of an anime character's face and a desired pose, and it outputs another image of the same character in the given pose | [Pramook Khungurn](https://pkhungurn.github.io/) | [![](https://img.shields.io/github/stars/pkhungurn/talking-head-anime-demo?style=social)](https://github.com/pkhungurn/talking-head-anime-demo) 

[](https://github.com/lincolnhard/head-pose-estimation)

[project](https://pkhungurn.github.io/talking-head-anime/)

[](https://en.wikipedia.org/wiki/Virtual_YouTuber), [](https://en.wikipedia.org/wiki/MikuMikuDance)

[](https://youtu.be/kMQCERkTdO0), [](https://youtu.be/T1Gp-RxFZwU), [](https://youtu.be/FioRJ6x_RbI)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/pkhungurn/talking-head-anime-demo/blob/master/tha_colab.ipynb) | 23.02.2021 |

| NFNet | An adaptive gradient clipping technique, a significantly improved class of Normalizer-Free ResNets | 

[Andrew Brock](https://github.com/ajbrock)
 [Soham De](https://sohamde.github.io/)
 [Samuel L. Smith](https://scholar.google.co.uk/citations?user=fyEqU5oAAAAJ)
 [Karen Simonyan](https://scholar.google.com/citations?user=L7lMQkQAAAAJ)

 | [![](https://img.shields.io/github/stars/deepmind/deepmind-research?style=social)](https://github.com/deepmind/deepmind-research/tree/master/nfnets) 

[](https://arxiv.org/abs/2102.06171), [](https://arxiv.org/abs/2101.08692)

[](https://github.com/deepmind/jaxline)

[](https://youtu.be/rNkHjZtH0RQ), [](https://www.youtube.com/live/qyy2WhRRSI4?feature=share)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/deepmind/deepmind-research/blob/master/nfnets/nfnet_demo_colab.ipynb) | 17.02.2021 |

| CLIP | A neural network which efficiently learns visual concepts from natural language supervision | 

[Jong Wook Kim](https://jongwook.kim/)
 [Alec Radford](http://newmu.github.io/)
 [Ilya Sutskever](http://www.cs.utoronto.ca/~ilya/)

 | [![](https://img.shields.io/github/stars/openai/CLIP?style=social)](https://github.com/openai/CLIP) 

[](https://arxiv.org/abs/2103.00020)

[data](https://www.cs.toronto.edu/~kriz/cifar.html)

[paper](https://cdn.openai.com/papers/Learning_Transferable_Visual_Models_From_Natural_Language_Supervision.pdf)

[project](https://openai.com/blog/clip/)

[slides](https://icml.cc/media/icml-2021/Slides/9193.pdf)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/openai/clip/blob/master/Interacting_with_CLIP.ipynb) | 29.01.2021 |

| Adversarial Patch | A method to create universal, robust, targeted adversarial image patches in the real world | [Tom Brown](https://github.com/nottombrown) | [](https://arxiv.org/abs/1712.09665) | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/cleverhans-lab/cleverhans/blob/master/examples/adversarial_patch/AdversarialPatch.ipynb) | 27.01.2021 |

| MusicXML Documentation | The goal of this notebook is to explore one of the magenta libraries for music | 

[Prakruti Joshi](https://github.com/prakruti-joshi)
 [Falak Shah](https://falaktheoptimist.github.io/)
 [Twisha Naik](https://github.com/twisha96)

 | 

[magenta](https://magenta.tensorflow.org/)

[music theory](http://musictheoryblog.blogspot.com/2008/02/learn-music-theory.html)

[musicXML](https://www.musicxml.com/for-developers/)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/magenta/magenta-demos/blob/master/colab-notebooks/MusicXML_Document_Structure_Documentation.ipynb) | 08.01.2021 |

| Neural Magic Eye | Learning to See and Understand the Scene Behind an Autostereogram | 

[Zhengxia Zou](http://www-personal.umich.edu/~zzhengxi/)
 [Tianyang Shi](https://www.shitianyang.tech/)
 [Yi Yuan](https://yiyuan1991.github.io/)
 [Zhenwei Shi](http://levir.buaa.edu.cn/)

 | [![](https://img.shields.io/github/stars/jiupinjia/neural-magic-eye?style=social)](https://github.com/jiupinjia/neural-magic-eye) 

[](https://arxiv.org/abs/2012.15692)

[project](https://jiupinjia.github.io/neuralmagiceye/)

[](https://www.youtube.com/watch?v=Fkh7DEblqJ8)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1f59dFLJ748i2TleE54RkbUZSMo9Hyx7l) | 01.01.2021 |

| SIREN | Implicit Neural Representations with Periodic Activation Functions | 

[Vincent Sitzmann](https://vsitzmann.github.io/)
 [Julien Martel](http://web.stanford.edu/~jnmartel/)

 | [![](https://img.shields.io/github/stars/vsitzmann/siren?style=social)](https://github.com/vsitzmann/siren) 

[](https://arxiv.org/abs/2006.09661)

[data](https://drive.google.com/drive/folders/1_iq__37-hw7FJOEUK1tX7mdp8SKB368K)

[](https://proceedings.neurips.cc/paper/2020/hash/53c04118df112c13a8c34b38343b9c10-Abstract.html)

[project](https://vsitzmann.github.io/siren/)

[](https://www.youtube.com/watch?v=Q2fLWGBeaiI)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/vsitzmann/siren/blob/master/explore_siren.ipynb) | 25.06.2020 |

| Onsets and Frames | Onsets and Frames is an automatic music transcription framework with piano and drums models | 

[Curtis Hawthorne](https://github.com/cghawthorne)
 [Erich Elsen](https://github.com/ekelsen)

 | [![](https://img.shields.io/github/stars/magenta/magenta?style=social)](https://github.com/magenta/magenta/tree/main/magenta/models/onsets_frames_transcription) 

[](https://arxiv.org/abs/1710.11153), [](https://arxiv.org/abs/1810.12247), [](https://arxiv.org/abs/2004.00188)

[blog post](http://g.co/magenta/onsets-frames)

[data](https://g.co/magenta/maestro-wave2midi2wave), [data](https://magenta.tensorflow.org/datasets/e-gmd)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/notebooks/magenta/onsets_frames_transcription/onsets_frames_transcription.ipynb) | 02.04.2020 |

| FBA Matting | Low-cost modification to alpha matting networks to also predict the foreground and background colours | 

[Marco Forte](https://github.com/MarcoForte)
 [François Pitié](https://francois.pitie.net/)

 | [![](https://img.shields.io/github/stars/MarcoForte/FBA_Matting?style=social)](https://github.com/MarcoForte/FBA_Matting) 

[](https://arxiv.org/abs/2003.07711)

[](https://github.com/MarcoForte/closed-form-matting)

[](https://huggingface.co/spaces/leonelhs/FBA-Matting)

[](https://paperswithcode.com/sota/image-matting-on-composition-1k?p=f-b-alpha-matting)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1Ut2szLBTxPejGHt_GYUkua21yUVWseOE) | 19.03.2020 |

| BERT score | An automatic evaluation metric for text generation | [Tianyi Zhang](https://tiiiger.github.io/) | [![](https://img.shields.io/github/stars/Tiiiger/bert_score?style=social)](https://github.com/Tiiiger/bert_score) 

[](https://arxiv.org/abs/1904.09675)

[](https://pypi.org/project/bert-score/)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/Tiiiger/bert_score/blob/master/example/Demo.ipynb) | 05.03.2020 |

| ProxylessNAS | Directly learn the architectures for large-scale target tasks and target hardware platforms | 

[Han Cai](https://han-cai.github.io/)
 [Ligeng Zhu](https://lzhu.me/)
 [Song Han](https://hanlab.mit.edu/songhan)

 | [![](https://img.shields.io/github/stars/mit-han-lab/proxylessnas?style=social)](https://github.com/mit-han-lab/proxylessnas) 

[](https://arxiv.org/abs/1812.00332), [](https://arxiv.org/abs/1802.03494), [](https://arxiv.org/abs/1811.08886)

[](https://sh-tsang.medium.com/paper-proxylessnas-direct-neural-architecture-search-on-target-task-image-classification-73c35ebd8aed)

[](https://pytorch.org/hub/pytorch_vision_proxylessnas/)

[](https://www.reddit.com/r/MachineLearning/comments/a3a1xy/r_proxylessnas_direct_neural_architecture_search/)

[](https://youtu.be/X0aZmppnO1s), [](https://youtu.be/6AeCIJH9eGI)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/pytorch/pytorch.github.io/blob/master/assets/hub/pytorch_vision_proxylessnas.ipynb) | 29.10.2019 |

| Generating Piano Music with Transformer | This Colab notebook lets you play with pretrained Transformer models for piano music generation, based on the Music Transformer | 

[Ian Simon](https://github.com/iansimon)
 [Anna Huang](https://github.com/czhuang)
 [Jesse Engel](https://github.com/jesseengel)
 [Curtis Hawthorne](https://github.com/cghawthorne)

 | 

[](https://arxiv.org/abs/1706.03762), [](https://arxiv.org/abs/1809.04281)

[blog post](http://g.co/magenta/music-transformer)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/notebooks/magenta/piano_transformer/piano_transformer.ipynb) | 16.09.2019 |

| GANSynth | This notebook is a demo GANSynth, which generates audio with Generative Adversarial Networks | [Jesse Engel](https://github.com/jesseengel) | [![](https://img.shields.io/github/stars/magenta/magenta?style=social)](https://github.com/magenta/magenta/tree/main/magenta/models/gansynth) 

[](https://arxiv.org/abs/1902.08710), [](https://arxiv.org/abs/1809.11096)

[project](https://storage.googleapis.com/magentadata/papers/gansynth/index.html)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/notebooks/magenta/gansynth/gansynth_demo.ipynb) | 25.02.2019 |

| Latent Constraints | Conditional Generation from Unconditional Generative Models | 

[Jesse Engel](https://github.com/jesseengel)
 [Matthew Hoffman](http://matthewdhoffman.com/)
 [Adam Roberts](https://github.com/adarob)

 | 

[](https://arxiv.org/abs/1711.05772)

[data](http://mmlab.ie.cuhk.edu.hk/projects/CelebA.html)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/notebooks/latent_constraints/latentconstraints.ipynb) | 27.11.2017 |

| Performance RNN | This notebook shows you how to generate new performed compositions from a trained model | 

[Ian Simon](https://github.com/iansimon)
 [Sageev Oore](https://github.com/osageev)
 [Curtis Hawthorne](https://github.com/cghawthorne)

 | [![](https://img.shields.io/github/stars/magenta/magenta?style=social)](https://github.com/magenta/magenta/tree/master/magenta/models/performance_rnn) 

[blog post](https://magenta.tensorflow.org/performance-rnn)

[data](http://www.piano-e-competition.com/)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/notebooks/magenta/performance_rnn/performance_rnn.ipynb) | 11.07.2017 |

| NSynth | This colab notebook has everything you need to upload your own sounds and use NSynth models to reconstruct and interpolate between them | 

[Jesse Engel](https://github.com/jesseengel)
 [Cinjon Resnick](https://github.com/cinjon)
 [Adam Roberts](https://github.com/adarob)
others[Sander Dieleman](https://benanne.github.io/)
 [Karen Simonyan](https://scholar.google.com/citations?user=L7lMQkQAAAAJ)
 [Mohammad Norouzi](https://norouzi.github.io/)
 [Douglas Eck](https://github.com/douglaseck)

 | [![](https://img.shields.io/github/stars/tensorflow/magenta?style=social)](https://github.com/tensorflow/magenta/tree/master/magenta/models/nsynth) 

[](https://arxiv.org/abs/1704.01279)

[blog post](https://magenta.tensorflow.org/nsynth)

[data](https://magenta.tensorflow.org/datasets/nsynth)

[tutorial](https://magenta.tensorflow.org/nsynth-fastgen)

[](https://www.youtube.com/watch?v=AaALLWQmCdI), [](https://www.youtube.com/watch?v=BOoSy-Pg8is)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/notebooks/magenta/nsynth/nsynth.ipynb) | 06.04.2017 |

## Tutorials

TUTORIALS

| name | description | authors | links | colaboratory | update |

|------|-------------|:--------|:------|:------------:|:------:|

| YOLOv8 | State-of-the-art model that builds upon the success of previous YOLO versions and introduces new features and improvements to further boost performance and flexibility | [Glenn Jocher](https://github.com/glenn-jocher) | [![](https://img.shields.io/github/stars/ultralytics/ultralytics?style=social)](https://github.com/ultralytics/ultralytics) 


[COCO](http://cocodataset.org/)

[ImageNet](https://www.image-net.org/)

[blog post](https://habr.com/ru/articles/710016/)

[](https://ultralytics.com/discord)

[](https://hub.docker.com/r/ultralytics/ultralytics)

[](https://docs.ultralytics.com/)

[](https://www.kaggle.com/ultralytics/yolov8)

[](https://twitter.com/ultralytics)

[](https://youtube.com/ultralytics), [](https://youtu.be/m9fH9OWn8YM), [](https://youtu.be/wuZtUMEiKWY), [](https://youtu.be/gRAyOPjQ9_s), [](https://youtu.be/fhzCwJkDONE), [](https://youtu.be/IHbJcOex6dk)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/ultralytics/ultralytics/blob/main/examples/tutorial.ipynb) | 01.05.2025 |

| dm_control | DeepMind Infrastructure for Physics-Based Simulation | 

[Saran Tunyasuvunakool](https://github.com/saran-t)
 [Alistair Muldal](https://github.com/alimuldal)
 [Yotam Doron](http://www.yotamdoron.com/)
others[Siqi Liu](http://siqi.fr/)
 [Steven Bohez](https://github.com/sbohez)
 [Josh Merel](https://sites.google.com/site/jsmerel/)
 [Tom Erez](https://github.com/erez-tom)
 [Timothy Lillicrap](https://contrastiveconvergence.net/~timothylillicrap/index.php)
 [Nicolas Heess](https://scholar.google.com/citations?user=79k7bGEAAAAJ)
 [Yuval Tassa](https://github.com/yuvaltassa)

 | [![](https://img.shields.io/github/stars/deepmind/dm_control?style=social)](https://github.com/deepmind/dm_control) 

[](https://arxiv.org/abs/2006.12983), [](https://arxiv.org/abs/1801.00690), [](https://arxiv.org/abs/1902.07151), [](https://arxiv.org/abs/1707.02286), [](https://arxiv.org/abs/1802.09564), [](https://arxiv.org/abs/1802.10567)

[blog post](https://www.deepmind.com/publications/dm-control-software-and-tasks-for-continuous-control)

[](https://en.wikipedia.org/wiki/Tippe_top)

[](https://youtu.be/CMjoiU482Jk), [](https://youtu.be/rAai4QzcYbs), [](https://youtu.be/WhaRsrlaXLk)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/deepmind/dm_control/blob/master/tutorial.ipynb) | 30.04.2025 |

| MuJoCo | A general purpose physics engine that aims to facilitate research and development in robotics, biomechanics, graphics and animation, machine learning, and other areas which demand fast and accurate simulation of articulated structures interacting with their environment | 

[Emo Todorov](https://homes.cs.washington.edu/~todorov/)
 [Tom Erez](https://github.com/erez-tom)
 [Yuval Tassa](https://github.com/yuvaltassa)

 | [![](https://img.shields.io/github/stars/deepmind/mujoco?style=social)](https://github.com/deepmind/mujoco) 

[](https://arxiv.org/abs/2006.12983)

[](https://www.deepmind.com/blog/opening-up-a-physics-simulator-for-robotics), [](https://www.deepmind.com/blog/open-sourcing-mujoco)

[](https://mujoco.readthedocs.io/en/latest/overview.html)

[website](https://mujoco.org/)

[](https://en.wikipedia.org/wiki/Tippe_top), [](https://en.wikipedia.org/wiki/Chaos_theory), [](https://en.wikipedia.org/wiki/3D_projection#Mathematical_formula)

[](https://youtu.be/0ORsj_E17B0), [](https://youtu.be/yHZVVfsJ8mc), [](https://youtu.be/eyzzsGJ1iic)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/deepmind/dm_control/blob/master/dm_control/mujoco/tutorial.ipynb) | 30.04.2025 |

| Optimum | Extension of Transformers and Diffusers, providing a set of optimization tools enabling maximum efficiency to train and run models on targeted hardware, while keeping things easy to use | [Hugging Face](https://huggingface.co/) | [![](https://img.shields.io/github/stars/huggingface/optimum?style=social)](https://github.com/huggingface/optimum) 

[](https://github.com/openvinotoolkit/nncf)

[](https://huggingface.co/docs/optimum/index), [](https://huggingface.co/docs/transformers/main_classes/trainer)

[](https://youtu.be/UJnfePM0Ur8), [](https://www.youtube.com/live/b1Gk9q9empA), [](https://youtu.be/_AKFDOnrZz8)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/huggingface/notebooks/blob/main/examples/question_answering_ort.ipynb) | 29.04.2025 |

| Ray | Unified framework for scaling AI and Python applications | 

[Philipp Moritz](https://github.com/pcmoritz)
 [Robert Nishihara](https://github.com/robertnishihara)
 [Stephanie Wang](https://stephanie-wang.github.io/)
others[Alexey Tumanov](https://faculty.cc.gatech.edu/~atumanov/)
 [Richard Liaw](https://github.com/richardliaw)
 [Eric Liang](https://github.com/ericl)
 [Melih Elibol](https://research.nvidia.com/person/melih-elibol)
 [Zongheng Yang](https://zongheng.me/)
 [William Paul](https://github.com/Wapaul1)
 [Michael Jordan](https://people.eecs.berkeley.edu/~jordan/)
 [Ion Stoica](https://people.eecs.berkeley.edu/~istoica/)

 | [![](https://img.shields.io/github/stars/ray-project/ray?style=social)](https://github.com/ray-project/ray) 

[](https://arxiv.org/abs/1712.05889), [](https://arxiv.org/abs/2203.05072), [](https://arxiv.org/abs/1712.09381), [](https://arxiv.org/abs/1807.05118), [](https://arxiv.org/abs/1703.03924)

[](https://docs.ray.io/en/latest/index.html)

[website](https://www.ray.io/)

[](https://youtu.be/LmROEotKhJA), [](https://youtu.be/uzt-CwohQC8), [](https://youtu.be/XME90SGL6Vs)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/ray-project/ray/blob/master/doc/source/tune/examples/optuna_example.ipynb) | 29.04.2025 |

| verl | HybridFlow, which combines single-controller and multi-controller paradigms in a hybrid manner to enable flexible representation and efficient execution of the RLHF dataflow | 

[Guangming Sheng](https://github.com/PeterSH6)
 [Chi Zhang](https://github.com/vermouth1992)
 [Zilingfeng Ye](https://www.researchgate.net/profile/Ye-Zilingfeng)
others[Xibin Wu](https://github.com/wuxibin89)
 [Wang Zhang](https://zw0610.github.io/)
 [Ru Zhang](https://github.com/zhangru)
 [Yanghua Peng](https://sites.google.com/view/yanghuapeng/)
 [Haibin Lin](https://sites.google.com/view/haibinlin/)
 [Chuan Wu](https://i.cs.hku.hk/~cwu/)

 | [![](https://img.shields.io/github/stars/volcengine/verl?style=social)](https://github.com/volcengine/verl) 

[](https://arxiv.org/abs/2409.19256), [](https://arxiv.org/abs/2503.24289), [](https://arxiv.org/abs/2410.21236), [](https://arxiv.org/abs/2410.09302)

[](https://verl.readthedocs.io/en/latest/)

[](https://github.com/eric-haibin-lin/verl-community/blob/main/slides/verl-lmsys-meetup.pdf), [](https://github.com/hkust-nlp/simpleRL-reason), [](https://github.com/hiyouga/EasyR1), [](https://github.com/Unakar/Logic-RL), [](https://github.com/PeterGriffinJin/Search-R1), [](https://github.com/RAGEN-AI/RAGENv), [](https://github.com/PRIME-RL/PRIME), [](https://github.com/agentica-project/deepscaler)

[](https://join.slack.com/t/verlgroup/shared_invite/zt-2w5p9o4c3-yy0x2Q56s_VlGLsJ93A6vA)

[](https://twitter.com/verl_project)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/volcengine/verl/blob/master/examples/ppo_trainer/verl_getting_started.ipynb) | 29.04.2025 |

| LangGraph | Library for building stateful, multi-actor applications with LLMs, used to create agent and multi-agent workflows | [William FH](https://github.com/hinthornw) | [![](https://img.shields.io/github/stars/langchain-ai/langgraph?style=social)](https://github.com/langchain-ai/langgraph) 

[blog post](https://www.langchain.com/langgraph)

[](https://langchain-ai.github.io/langgraph/)

[](https://github.com/langchain-ai/langgraphjs)

[](https://towardsdatascience.com/from-basics-to-advanced-exploring-langgraph-e8c1cf4db787?gi=eb24d42206bf), [](https://medium.com/cyberark-engineering/building-production-ready-ai-agents-with-langgraph-a-real-life-use-case-7bda34c7f4e4)

[](https://pypi.org/project/langgraph/)

[website](https://www.langchain.com/langgraph)

[](https://www.youtube.com/playlist?list=PLfaIDFEXuae16n2TWUkKq5PgJ0w6Pkwtg), [](https://youtu.be/1bUy-1hGZpI), [](https://youtu.be/PqS1kib7RTw), [](https://youtu.be/PNr3f7QyQU4), [](https://youtu.be/qaWOwbFw3cs)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/langchain-ai/langgraph/blob/main/docs/docs/tutorials/introduction.ipynb) | 25.04.2025 |

| Brax | A differentiable physics engine that simulates environments made up of rigid bodies, joints, and actuators | 

[Daniel Freeman](https://github.com/cdfreeman-google)
 [Erik Frey](https://fawx.com/)
 [Anton Raichuk](https://scholar.google.com/citations?user=fquIpvgAAAAJ)
others[Sertan Girgin](https://sites.google.com/site/girgint/home)
 [Igor Mordatch](https://scholar.google.com/citations?user=Vzr1RukAAAAJ)
 [Olivier Bachem](http://olivierbachem.ch/)

 | [![](https://img.shields.io/github/stars/google/brax?style=social)](https://github.com/google/brax) 

[](https://arxiv.org/abs/2106.13281)

[](https://neurips.cc/Conferences/2021/CallForDatasetsBenchmarks)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/google/brax/blob/main/notebooks/basics.ipynb) | 22.04.2025 |

| Opik | From RAG chatbots to code assistants to complex agentic pipelines and beyond, build LLM systems that run better, faster, and cheaper with tracing, evaluations, and dashboards | [Jacques Verré](https://github.com/jverre) | [![](https://img.shields.io/github/stars/comet-ml/opik?style=social)](https://github.com/comet-ml/opik) 

[](https://www.comet.com/docs/opik/self-host/local_deployment/)

[](https://www.comet.com/docs/opik/)

[](https://pypi.org/project/opik/)

[](https://www.reddit.com/r/Python/comments/1fq33rw/opik_open_source_llm_evaluation_framework/), [](https://www.reddit.com/r/programming/comments/1fj13qx/opik_opensource_llm_evaluation_framework/), [](https://www.reddit.com/r/machinelearningnews/comments/1fj99eu/comet_launches_opik_a_comprehensive_opensource/)

[](https://chat.comet.com/)

[](https://x.com/Cometml)

[website](https://www.comet.com/site/products/opik/)

[](https://youtu.be/B4oboG62lyA), [](https://youtu.be/9JN-JaaOpa8)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/comet-ml/opik/blob/main/apps/opik-documentation/documentation/docs/cookbook/quickstart_notebook.ipynb) | 21.04.2025 |

| YOLOv3 | You Only Look Once | [Glenn Jocher](https://github.com/glenn-jocher) | [![](https://img.shields.io/github/stars/ultralytics/yolov3?style=social)](https://github.com/ultralytics/yolov3) 

[data](http://cocodataset.org/#upload)

[](https://www.kaggle.com/ultralytics/yolov3), [](https://www.kaggle.com/ultralytics/coco128)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/ultralytics/yolov3/blob/master/tutorial.ipynb) | 18.04.2025 |

| YOLOv5 | You Only Look Once | [Glenn Jocher](https://github.com/glenn-jocher) | [![](https://img.shields.io/github/stars/ultralytics/yolov5?style=social)](https://github.com/ultralytics/yolov5) 

[data](http://cocodataset.org/#upload)

[](https://www.kaggle.com/ultralytics/yolov5), [](https://www.kaggle.com/ultralytics/coco128)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/ultralytics/yolov5/blob/master/tutorial.ipynb) | 18.04.2025 |

| Giskard | Open-source library to detect hallucinations and security issues to turn them into test suites that you can automatically execute | 

[Jean-Marie John-Mathews](https://github.com/jmsquare)
 [Alex Combessie](https://alex.combessie.com/)

 | [![](https://img.shields.io/github/stars/Giskard-AI/giskard?style=social)](https://github.com/Giskard-AI/giskard) 

[](https://gisk.ar/discord)

[](https://docs.giskard.ai/en/stable/)

[](https://medium.com/@rohithreddy66666/mastering-rag-evaluation-a-deep-dive-into-giskard-ai-for-pdf-chatbots-c6d1163330aa)

[](https://pypi.org/project/giskard/)

[](https://www.youtube.com/@giskard_ai), [](https://youtu.be/rkjFFx_nXhU), [](https://youtu.be/KeY6qPAvyq0), [](https://youtu.be/6wSNPrt7izQ)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/giskard-ai/giskard/blob/main/docs/getting_started/quickstart/quickstart_llm.ipynb) | 16.04.2025 |

| AutoGen | Framework that enables development of LLM applications using multiple agents that can converse with each other to solve tasks | [microsoft](https://github.com/microsoft) | [![](https://img.shields.io/github/stars/microsoft/autogen?style=social)](https://github.com/microsoft/autogen) 

[blog post](https://www.microsoft.com/en-us/research/blog/autogen-enabling-next-generation-large-language-model-applications/)

[](https://discord.gg/pAbnFJrkgZ)

[](https://medium.com/@multiplatform.ai/microsoft-autogen-transforming-ai-frameworks-for-enhanced-problem-solving-video-ac2655e7cdf)

[project](https://microsoft.github.io/autogen/)

[](https://youtu.be/zdcCD--IieY), [](https://youtu.be/dCCr52uT0W8), [](https://youtu.be/JMpgsx74XDI)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/microsoft/autogen/blob/main/python/packages/autogen-core/docs/src/user-guide/core-user-guide/quickstart.ipynb) | 15.04.2025 |

| Evidently | An open-source framework to evaluate, test and monitor ML models in production | 

[Elena Samuylova](https://github.com/elenasamuylova)
 [Emeli Dral](https://github.com/emeli-dral)
 [Olga Filippova](https://github.com/0lgaF)

 | [![](https://img.shields.io/github/stars/evidentlyai/evidently?style=social)](https://github.com/evidentlyai/evidently) 

[](https://docs.evidentlyai.com/)

[](https://github.com/0lgaF/my_tab_with_evidently)

[website](https://evidentlyai.com/)

[](https://www.youtube.com/c/EvidentlyAI), [](https://youtu.be/L4Pv6ExBQPM)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/evidentlyai/evidently/blob/main/examples/sample_notebooks/getting_started_tutorial.ipynb) | 08.04.2025 |

| Llama 4 | Open-weight natively multimodal models with unprecedented context length support and our first built using a MoE architecture | [meta](https://www.llama.com/) | [![](https://img.shields.io/github/stars/meta-llama/llama-cookbook?style=social)](https://github.com/meta-llama/llama-cookbook) 

[](https://www.llama.com/docs/overview)

[](https://github.blog/changelog/2025-04-14-the-llama-4-herd-is-now-generally-available-in-github-models/), [](https://github.com/vllm-project/vllm)

[](https://huggingface.co/meta-llama)

[](https://medium.com/@ignacio.de.gregorio.noblejas/llama-4-is-out-meta-the-us-are-in-big-trouble-0d8c2a157fef)

[](https://ai.meta.com/blog/llama-4-multimodal-intelligence/)

[](https://www.reddit.com/r/LocalLLaMA/comments/1jsl37d/im_incredibly_disappointed_with_llama4/)

[](https://youtu.be/j8tssDT73Yc), [](https://youtu.be/r8d62HsnQA0), [](https://youtu.be/90LS4ze4BjQ), [](https://youtu.be/TIZmjmsBh20), [](https://youtu.be/jwE6_ujYcPw), [](https://youtu.be/Vv3n1Jggc10)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/meta-llama/llama-cookbook/blob/main/getting-started/build_with_llama_4.ipynb) | 07.04.2025 |

| Agent Starter Pack | Collection of production-ready Generative AI Agent templates built for Google Cloud | [Kristopher Overholt](https://github.com/koverholt) | [![](https://img.shields.io/github/stars/GoogleCloudPlatform/agent-starter-pack?style=social)](https://github.com/GoogleCloudPlatform/agent-starter-pack) 

[](https://medium.com/google-cloud/agentic-rag-with-bigquery-dataframes-and-agent-starter-pack-743553adf997), [](https://medium.com/google-cloud/genai-app-starter-pack-now-with-rag-pattern-vertex-ai-search-fb81bf61bcd5)

[](https://pypi.org/project/agent-starter-pack/)

[](https://www.reddit.com/r/googlecloud/comments/1j49uz7/agent_starter_pack_build_deploy_genai_agents_on/)

[](https://youtu.be/jHt-ZVD660g), [](https://youtu.be/kwRG7cnqSu0), [](https://www.youtube.com/watch?v=yIRIT_EtALs&t=235s), [](https://www.youtube.com/playlist?list=PLIivdWyY5sqLRCzKJyixrIDPQKwU6XHpn)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/GoogleCloudPlatform/generative-ai/blob/main/gemini/agent-engine/intro_agent_engine.ipynb) | 31.03.2025 |

| Vertex AI | Search brings together the power of deep information retrieval, state-of-the-art natural language processing, and the latest in LLM processing to understand user intent and return the most relevant results for the user | [Megha Agarwal](https://github.com/agarwal22megha) | [![](https://img.shields.io/github/stars/googleapis/python-aiplatform?style=social)](https://github.com/googleapis/python-aiplatform) 

[](https://arxiv.org/abs/2407.16833)

[blog post](https://blog.google/products/search/improving-search-next-20-years/v)

[](https://deepmind.google/technologies/gemini/pro/)

[](https://cloud.google.com/generative-ai-app-builder/docs/enterprise-search-introduction)

[](https://pypi.org/project/google-cloud-aiplatform/), [](https://pypi.org/project/google-cloud-discoveryengine/)

[](https://www.youtube.com/playlist?list=PLIivdWyY5sqJ1YuMdGjRwJ3fFYZ_vWQ62), [](https://www.youtube.com/playlist?list=PLIivdWyY5sqJAyUJbbsc8ZyGLNT4isnuB), [](https://youtu.be/HD_xreaLKb4), [](https://youtu.be/-3Olw-C4FN4), [](https://youtu.be/XVO3zsHdvio)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/GoogleCloudPlatform/generative-ai/blob/main/search/vertexai-search-options/vertexai_search_options.ipynb) | 31.03.2025 |

| xFormers | Toolbox to Accelerate Research on Transformers | 

[Benjamin Lefaudeux](https://github.com/blefaudeux)
 [Francisco Massa](https://github.com/fmassa)
 [Diana Liskovich](https://www.linkedin.com/in/dianaliskovich)
others[Wenhan Xiong](https://xwhan.github.io/)
 [Vittorio Caggiano](https://vittorio-caggiano.github.io/)
 [Sean Naren](https://github.com/SeanNaren)
 [Min Xu](https://github.com/min-xu-ai)
 [Jieru Hu](https://github.com/jieru-hu)
 [Marta Tintore](https://github.com/MartaTintore)
 [Susan Zhang](https://suchenzang.github.io/)
 [Patrick Labatut](https://github.com/patricklabatut)
 [Daniel Haziza](https://scholar.google.com/citations?user=2eSKdFMAAAAJ)

 | [![](https://img.shields.io/github/stars/facebookresearch/xformers?style=social)](https://github.com/facebookresearch/xformers) 

[](https://facebookresearch.github.io/xformers/)

[](https://github.com/google-research/sputnik), [](https://github.com/hgyhungry/ge-spmm), [](https://github.com/openai/triton), [](https://github.com/RobinBruegger/RevTorch), [](https://github.com/mlpen/Nystromformer), [](https://github.com/facebookresearch/fairscale), [](https://github.com/huggingface/pytorch-image-models), [](https://github.com/Dao-AILab/flash-attention)

[](https://youtu.be/NJyZCdxnGe4)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/facebookresearch/xformers/blob/main/docs/source/xformers_mingpt.ipynb) | 25.03.2025 |

| SkyThought | Train your own O1 preview model within $450 | [Sumanth Hegde](https://sumanthrh.com/) | [![](https://img.shields.io/github/stars/NovaSky-AI/SkyThought?style=social)](https://github.com/NovaSky-AI/SkyThought) 

[abs](https://arxiv.org/abs/2412.09413)

[](https://arxiv.org/abs/2502.14382)

[](https://discord.gg/RBAjeWSA)

[](https://huggingface.co/NovaSky-AI)

[](https://pypi.org/project/skythought/)

[](https://x.com/NovaSkyAI)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/NovaSky-AI/SkyThought/blob/master/examples/scoring.ipynb) | 20.03.2025 |

| Sentence Transformers | Multilingual Sentence, Paragraph, and Image Embeddings using BERT & Co | 

[Nils Reimers](https://www.nils-reimers.de/)
 [Iryna Gurevych](https://www.informatik.tu-darmstadt.de/ukp/ukp_home/head_ukp/index.en.jsp)

 | [![](https://img.shields.io/github/stars/UKPLab/sentence-transformers?style=social)](https://github.com/UKPLab/sentence-transformers) 

[](https://arxiv.org/abs/1908.10084), [](https://arxiv.org/abs/2004.09813), [](https://arxiv.org/abs/2010.08240)

[](https://www.sbert.net/)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/UKPLab/sentence-transformers/blob/master/examples/applications/retrieve_rerank/retrieve_rerank_simple_wikipedia.ipynb) | 07.03.2025 |

| Crawl4AI | LLM Friendly Web Crawler & Scrapper | [UncleCode](https://github.com/unclecode) | [![](https://img.shields.io/github/stars/unclecode/crawl4ai?style=social)](https://github.com/unclecode/crawl4ai) 

[](https://crawl4ai.com/mkdocs/)

[](https://medium.com/@pankaj_pandey/crawl4ai-your-ultimate-asynchronous-web-crawling-companion-%EF%B8%8F-66a21cf57c0a)

[](https://pypi.org/project/Crawl4AI/)

[](https://twitter.com/unclecode)

[](https://youtu.be/Ex3EpKxlMO0), [](https://youtu.be/KAvuVUh0XU8), [](https://youtu.be/lpOb1bQO7aM), [](https://youtu.be/81KIBvg0bsQ)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/unclecode/crawl4ai/blob/main/docs/examples/quickstart.ipynb) | 28.02.2025 |

| LangChain | Framework for developing applications powered by large language models | [Bagatur](https://github.com/baskaryan) | [![](https://img.shields.io/github/stars/langchain-ai/langchain?style=social)](https://github.com/langchain-ai/langchain) 

[](https://python.langchain.com/docs/introduction/)

[](https://github.com/langchain-ai/langchainjs), [](https://github.com/langchain-ai/langchain-extract), [](https://github.com/langchain-ai/chat-langchain), [](https://github.com/langchain-ai/weblangchain)

[](https://medium.com/@neelmakvana168/what-is-lang-chain-in-llm-e55e021da2b3), [](https://medium.com/@bijit211987/llm-powered-applications-building-with-langchain-cad4032d733c), [](https://medium.com/munchy-bytes/exploring-langchain-ff13fff63340)

[](https://pypi.org/project/langchain/)

[](https://twitter.com/langchainai)

[](https://en.wikipedia.org/wiki/LangChain)

[](https://www.youtube.com/playlist?list=PLqZXAkvF1bPNQER9mLmDbntNfSpzdDIU5), [](https://youtu.be/1bUy-1hGZpI), [](https://youtu.be/9AXP7tCI9PI), [](https://youtu.be/aywZrzNaKjs), [](https://youtu.be/dXxQ0LR-3Hg), [](https://youtu.be/sVcwVQRHIc8), [](https://youtu.be/MlK6SIjcjE8), [](https://youtu.be/TLf90ipMzfE), [](https://www.youtube.com/playlist?list=PLZoTAELRMXVORE4VF7WQ_fAl0L1Gljtar)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/langchain-ai/langchain/blob/master/docs/docs/tutorials/qa_chat_history.ipynb) | 25.02.2025 |

| LM Evaluation Harness | Framework for few-shot evaluation of language models. | [Lintang Sutawika](https://lintang.sutawika.com/) | [![](https://img.shields.io/github/stars/EleutherAI/lm-evaluation-harness?style=social)](https://github.com/EleutherAI/lm-evaluation-harness) 

[](https://arxiv.org/abs/2005.14165)

[](https://discord.gg/eleutherai)

[](https://github.com/AutoGPTQ/AutoGPTQ), [](https://github.com/EleutherAI/gpt-neox), [](https://github.com/microsoft/Megatron-DeepSpeed), [](https://github.com/vllm-project/vllm)

[project](https://www.eleuther.ai/projects/large-language-model-evaluation)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/EleutherAI/lm-evaluation-harness/blob/main/examples/lm-eval-overview.ipynb) | 21.02.2025 |

| Multimodal Maestro | Gives you more control over large multimodal models to get the outputs you want | [Piotr Skalski](https://github.com/SkalskiP) | [![](https://img.shields.io/github/stars/roboflow/multimodal-maestro?style=social)](https://github.com/roboflow/multimodal-maestro) 

[](https://arxiv.org/abs/2310.11441), [](https://arxiv.org/abs/2309.17421)

[blog post](https://blog.roboflow.com/multimodal-maestro-advanced-lmm-prompting/)

[](https://discord.gg/GbfgXGJ8Bk)

[](https://pypi.org/project/maestro/)

[](https://www.reddit.com/r/computervision/comments/186o2b2/multimodal_maestro_prompt_tools_for_use_with_lmms/)

[website](https://maestro.roboflow.com/)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/roboflow/maestro/blob/main/cookbooks/maestro_paligemma_2_json_extraction.ipynb) | 18.02.2025 |

| YuE | Groundbreaking series of open-source foundation models designed for music generation, specifically for transforming lyrics into full songs | [Mozer](https://github.com/Mozer) | [![](https://img.shields.io/github/stars/multimodal-art-projection/YuE?style=social)](https://github.com/multimodal-art-projection/YuE) 

[](https://discord.gg/ssAyWMnMzu)

[](https://github.com/Mozer/YuE-extend), [](https://github.com/sgsdxzy/YuE-exllamav2), [](https://github.com/deepbeepmeep/YuEGP), [](https://github.com/sdbds/YuE-for-windows), [](https://github.com/WrongProtocol/YuE-exllamav2-UI), [](https://github.com/alisson-anjos/YuE-Interface), [](https://github.com/Anjok07/ultimatevocalremovergui)

[](https://huggingface.co/collections/m-a-p/yue-6797d55e22990ae89b90a3d6)

[project](https://map-yue.github.io/)

[](https://www.reddit.com/r/LocalLLaMA/comments/1ibzmef/new_bomb_dropped_from_asian_researchers_yue_open/)

[](https://x.com/abrakjamson/status/1885932885406093538), [](https://x.com/cocktailpeanut/status/1886456240156348674)

[](https://youtu.be/RSMNH9GitbA), [](https://youtu.be/zw46L4pDFWo)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/Mozer/YuE-extend/blob/main/colab/Yue_extend_with_exllama.ipynb) | 16.02.2025 |

| Datasets | A Community Library for Natural Language Processing | 

[Quentin Lhoest](https://github.com/lhoestq)
 [Albert Villanova](https://albertvillanova.github.io/)
 [Yacine Jernite](https://yjernite.github.io/)
others[Abhishek Thakur](https://github.com/abhishekkrthakur)
 [Patrick von Platen](https://github.com/patrickvonplaten)
 [Suraj Patil](https://github.com/patil-suraj)
 [Julien Chaumond](https://github.com/julien-c)
 [Mariama Dramé](https://scholar.google.com/citations?user=0pwfXH0AAAAJ)
 [Julien Plu](https://jplu.github.io/)
 [Lewis Tunstall](https://lewtun.github.io/blog/)
 [Joe Davison](https://joeddav.github.io/)
 [Mario Šaško](https://github.com/mariosasko)
 [Gunjan Chhablani](https://gchhablani.github.io/)
 [Bhavitvya Malik](https://github.com/bhavitvyamalik)
 [Simon Brandeis](https://github.com/SBrandeis)
 [Teven Le Scao](https://github.com/TevenLeScao)
 [Victor Sanh](https://github.com/VictorSanh)
 [Canwen Xu](https://www.canwenxu.net/)
 [Nicolas Patry](https://github.com/Narsil)
 [Angelina McMillan-Major](https://github.com/mcmillanmajora)
 [Philipp Schmid](https://www.philschmid.de/)
 [Sylvain Gugger](https://github.com/sgugger)
 [Clément Delangue](https://scholar.google.com/citations?user=bRMboT8AAAAJ)
 [Théo Matussière](https://theo.matussie.re/)
 [Lysandre Debut](http://lysand.re/)
 [Stas Bekman](https://stasosphere.com/machine-learning/)
 [Pierric Cistac](https://github.com/Pierrci)
 [Thibault Goehringer](https://github.com/beurkinger)
 [Victor Mustar](https://github.com/gary149)
 [François Lagunas](https://github.com/madlag)
 [Alexander Rush](https://rush-nlp.com/)
 [Thomas Wolf](https://thomwolf.io/)

 | [![](https://img.shields.io/github/stars/huggingface/datasets?style=social)](https://github.com/huggingface/datasets) 

[](https://arxiv.org/abs/2109.02846)

[](https://huggingface.co/docs/datasets)

[](https://huggingface.co/datasets)

[](https://www.kaggle.com/code/nbroad/intro-to-hugging-face-datasets)

[](https://youtu.be/uaIJ96syPnM)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/huggingface/notebooks/blob/main/datasets_doc/en/quickstart.ipynb) | 13.02.2025 |

| moondream | Tiny vision language model that kicks ass and runs anywhere | [Vik Korrapati](https://github.com/vikhyat) | [![](https://img.shields.io/github/stars/vikhyat/moondream?style=social)](https://github.com/vikhyat/moondream) 

[](https://discord.com/invite/tRUdpjDQfH)

[](https://github.com/kijai/ComfyUI-moondream)

[](https://huggingface.co/vikhyatk/moondream2), [](https://huggingface.co/datasets/google/docci), [](https://huggingface.co/vikhyatk/moondream1)

[](https://medium.com/@indradumnabanerjee/getting-started-with-vision-language-model-moondream-783c264a02b9)

[website](https://moondream.ai/)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/vikhyat/moondream/blob/main/notebooks/Finetuning.ipynb) | 06.02.2025 |

| VC | Client software for performing real-time voice conversion using various Voice Conversion AI | [w-okada](https://github.com/w-okada) | [![](https://img.shields.io/github/stars/w-okada/voice-changer?style=social)](https://github.com/w-okada/voice-changer) 

[](https://github.com/yxlllc/DDSP-SVC)

[](https://huggingface.co/wok000/vcclient000)

[](https://youtu.be/POo_Cg0eFMU), [](https://youtu.be/fba9Zhsukqw), [](https://youtu.be/s_GirFEGvaA), [](https://youtu.be/Q7bbEC4aeKM), [](https://youtu.be/_JXbvSTGPoo), [](https://youtu.be/pHhjg2JwdPI), [](https://youtu.be/We5oYpCR3WQ), [](https://youtu.be/aVfoC1EHlVs), [](https://youtu.be/YF1lBaqeyt8)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/hinabl/voice-changer-colab/blob/master/Hina_Modified_Realtime_Voice_Changer_on_Colab.ipynb) | 30.01.2025 |

| Building Your Own Federated Learning Algorithm | We discuss how to implement federated learning algorithms without deferring to the tff.learning API | [Zachary Charles](https://zachcharles.com/) | 

[](https://arxiv.org/abs/1907.08610)

[blog post](https://ai.googleblog.com/2020/05/federated-analytics-collaborative-data.html)

[](https://paperswithcode.com/task/federated-learning)

[](https://pypi.org/project/tensorflow-federated/)

[](https://www.tensorflow.org/federated/api_docs/python/tff/learning/Model)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/tensorflow/federated/blob/master/docs/tutorials/building_your_own_federated_learning_algorithm.ipynb) | 29.01.2025 |

| Federated Learning for Image Classification | We use the classic MNIST training example to introduce the Federated Learning API layer of TFF, tff.learning - a set of higher-level interfaces that can be used to perform common types of federated learning tasks, such as federated training, against user-supplied models implemented in TensorFlow | [Krzysztof Ostrowski](https://github.com/krzys-ostrowski) | 

[](https://arxiv.org/abs/1602.05629)

[data](https://www.nist.gov/srd/nist-special-database-19)

[](https://medium.com/tensorflow/standardizing-on-keras-guidance-on-high-level-apis-in-tensorflow-2-0-bad2b04c819a)

[](https://paperswithcode.com/task/federated-learning), [](https://paperswithcode.com/task/image-classification)

[](https://pypi.org/project/tensorflow-federated/)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/tensorflow/federated/blob/master/docs/tutorials/federated_learning_for_image_classification.ipynb) | 29.01.2025 |

| Federated Learning for Text Generation | We start with a RNN that generates ASCII characters, and refine it via federated learning | [Krzysztof Ostrowski](https://github.com/krzys-ostrowski) | 

[](https://arxiv.org/abs/1812.01097), [](https://arxiv.org/abs/1602.05629)

[data](http://www.ibiblio.org/pub/docs/books/gutenberg/9/98/98.txt), [data](http://www.ibiblio.org/pub/docs/books/gutenberg/4/46/46.txt)

[](https://pypi.org/project/tensorflow-federated/)

[](https://www.tensorflow.org/hub)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/tensorflow/federated/blob/master/docs/tutorials/federated_learning_for_text_generation.ipynb) | 29.01.2025 |

| Custom Federated Algorithms, Part 1: Introduction to the Federated Core | This tutorial is the first part of a two-part series that demonstrates how to implement custom types of federated algorithms in TensorFlow Federated using the Federated Core - a set of lower-level interfaces that serve as a foundation upon which we have implemented the Federated Learning layer | [Krzysztof Ostrowski](https://github.com/krzys-ostrowski) | 

[](https://arxiv.org/abs/1602.05629)

[](https://paperswithcode.com/task/federated-learning)

[](https://pypi.org/project/tensorflow-federated/)

[](https://www.tensorflow.org/federated/federated_core), [](https://www.tensorflow.org/federated/federated_learning)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/tensorflow/federated/blob/master/docs/tutorials/custom_federated_algorithms_1.ipynb) | 29.01.2025 |

| Custom Federated Algorithms, Part 2: Implementing Federated Averaging | This tutorial is the second part of a two-part series that demonstrates how to implement custom types of federated algorithms in TFF using the Federated Core, which serves as a foundation for the Federated Learning layer | [Krzysztof Ostrowski](https://github.com/krzys-ostrowski) | [![](https://img.shields.io/github/stars/tensorflow/federated?style=social)](https://github.com/tensorflow/federated/blob/master/tensorflow_federated/python/learning/federated_averaging.py) 

[](https://paperswithcode.com/task/federated-learning)

[](https://pypi.org/project/tensorflow-federated/)

[](https://www.tensorflow.org/federated/federated_core), [](https://www.tensorflow.org/federated/federated_learning)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/tensorflow/federated/blob/master/docs/tutorials/custom_federated_algorithms_2.ipynb) | 29.01.2025 |

| CodeGemma | How to load, fine-tune and deploy CodeGemma model on SQL by utilising Hugging Face | [Carlo Fisicaro](https://github.com/carlofisicaro) | 

[](https://arxiv.org/abs/2406.11409)

[](https://ai.google.dev/gemma/docs/codegemma)

[](https://huggingface.co/google/codegemma-7b)

[](https://www.kaggle.com/models/google/codegemma)

[](https://www.reddit.com/r/LocalLLaMA/comments/1bzr24e/codegemma_an_official_google_release_for_code/)

[](https://youtu.be/diqk-LscvBs)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/google-gemini/gemma-cookbook/blob/main/CodeGemma/CodeGemma_finetuned_on_SQL_with_HF.ipynb) | 22.01.2025 |

| Invariant Agent Stack | Framework-less approach that currently consists of three key projects, each of which can be used independently or in combination to build, test, and secure AI agents | [Fei Xie](https://github.com/feixieliz) | [![](https://img.shields.io/github/stars/invariantlabs-ai/invariant?style=social)](https://github.com/invariantlabs-ai/invariant) 

[](https://arxiv.org/abs/2405.15793), [](https://arxiv.org/abs/2402.16893)

[blog post](https://invariantlabs.ai/blog/icml2024-agents-formal-security), [blog post](https://embracethered.com/blog/posts/2023/google-bard-data-exfiltration/)

[](https://discord.gg/dZuZfhKnJ4)

[](https://explorer.invariantlabs.ai/docs/)

[explorer](https://explorer.invariantlabs.ai/)

[](https://github.com/invariantlabs-ai/explorer)

[playground](https://playground.invariantlabs.ai/)

[](https://pypi.org/project/invariant-ai/)

[](https://x.com/invariantlabsai)

[](https://en.wikipedia.org/wiki/Datalog)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/invariantlabs-ai/invariant/blob/main/invariant/analyzer/examples/sql_injection/sql_injection_query_pipeline.ipynb) | 20.01.2025 |

| Ollama | Get up and running with large language models | [Michael Yang](https://github.com/mxyng) | [![](https://img.shields.io/github/stars/ollama/ollama?style=social)](https://github.com/ollama/ollama) 

[](https://hub.docker.com/r/ollama/ollama)

[](https://github.com/ollama/ollama-python), [](https://github.com/ollama/ollama-js), [](https://github.com/ggerganov/llama.cpp)

[](https://pypi.org/project/ollama/)

[](https://x.com/ollama)

[website](https://ollama.com/)

[](https://youtu.be/rIRkxZSn-A8), [](https://youtu.be/1xdneyn6zjw), [](https://youtu.be/cTxENLLX1ho), [](https://youtu.be/ztBJqzBU5kc), [](https://youtu.be/Ox8hhpgrUi0), [](https://youtu.be/lhQ8ixnYO2Y), [](https://youtu.be/pxhkDaKzBaY)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/ollama/ollama/blob/master/examples/jupyter-notebook/ollama.ipynb) | 13.01.2025 |

| NotebookLlama | Open Source version of NotebookLM | [Sanyam Bhutani](https://github.com/init27) | [![](https://img.shields.io/github/stars/meta-llama/llama-recipes?style=social)](https://github.com/meta-llama/llama-recipes/tree/main/recipes/quickstart/NotebookLlama) 

[](https://medium.com/ai-disruption/meta-launches-open-source-version-notebookllama-rivals-googles-popular-notebooklm-9a41edd99c24)

[meidum](https://medium.com/ai-artistry/notebook-llama-an-open-source-guide-to-building-a-pdf-to-podcast-workflow-e8fceec888a9)

[](https://www.reddit.com/r/OpenSourceeAI/comments/1gdsmax/meta_ai_silently_releases_notebookllama_an_open/)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/meta-llama/llama-cookbook/blob/main/end-to-end-use-cases/NotebookLlama/Step-1%20PDF-Pre-Processing-Logic.ipynb) | 09.01.2025 |

| Anomalib | Deep learning library that aims to collect state-of-the-art anomaly detection algorithms for benchmarking on both public and private datasets | 

[Samet Akcay](https://github.com/samet-akcay)
 [Dick Ameln](https://github.com/djdameln)
 [Ashwin Vaidya](https://ashwinvaidya.com/)
others[Barath Lakshmanan](https://github.com/blakshma)
 [Nilesh Ahuja](https://github.com/nahuja-intel)
 [Utku Genc](https://github.com/ugenc-intel)

 | [![](https://img.shields.io/github/stars/openvinotoolkit/anomalib?style=social)](https://github.com/openvinotoolkit/anomalib) 

[](https://arxiv.org/abs/2011.08785)

[data](https://www.mvtec.com/company/research/datasets/mvtec-ad)

[](https://openvinotoolkit.github.io/anomalib/)

[](https://github.com/rwightman/pytorch-image-models), [](https://github.com/vnk8071/anomaly-detection-in-industry-manufacturing/tree/master/anomalib_contribute)

[](https://towardsdatascience.com/getting-started-with-pytorch-image-models-timm-a-practitioners-guide-4e77b4bf9055)

[](https://paperswithcode.com/lib/timm)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/openvinotoolkit/anomalib/blob/main/notebooks/000_getting_started/001_getting_started.ipynb) | 08.01.2025 |

| Hello, many worlds | This tutorial shows how a classical neural network can learn to correct qubit calibration errors | [Michael Broughton](https://github.com/MichaelBroughton) | 

[](https://www.tensorflow.org/quantum/api_docs/python/tfq/layers), [](https://www.tensorflow.org/quantum/api_docs/python/tfq/get_expectation_op), [](https://www.tensorflow.org/guide/keras/functional)

[](https://en.wikipedia.org/wiki/Pauli_matrices)

[](https://youtu.be/-o9AhIz1uvo)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/tensorflow/quantum/blob/master/docs/tutorials/hello_many_worlds.ipynb) | 04.01.2025 |

| LightAutoML | Allows you create machine learning models using just a few lines of code, or build your own custom pipeline using ready blocks | 

[Alexander Ryzhkov](https://github.com/alexmryzhkov)
 [Anton Vakhrushev](https://www.kaggle.com/btbpanda)
 [Dmitry Simakov](https://github.com/DESimakov)

 | [![](https://img.shields.io/github/stars/sb-ai-lab/LightAutoML?style=social)](https://github.com/sb-ai-lab/LightAutoML) 

[](https://arxiv.org/abs/2109.01528)

[](https://lightautoml.readthedocs.io/en/latest/)

[](https://github.com/Rishat-skoltech/LightAutoML_GPU), [](https://github.com/sb-ai-lab/SLAMA)

[](https://www.kaggle.com/alexryzhkov/n3-tps-april-21-lightautoml-starter), [](https://www.kaggle.com/alexryzhkov/lightautoml-titanic-love), [](https://www.kaggle.com/alexryzhkov/lightautoml-extreme-short-titanic-solution), [](https://www.kaggle.com/alexryzhkov/lightautoml-houseprices-love), [](https://www.kaggle.com/simakov/lama-whitebox-preset-example), [](https://www.kaggle.com/simakov/lama-custom-automl-pipeline-example), [](https://www.kaggle.com/code/mikhailkuz/lightautoml-nn-happiness)

[](https://alexmryzhkov.medium.com/lightautoml-preset-usage-tutorial-2cce7da6f936)

[](https://pypi.org/project/lightautoml)

[website](https://developers.sber.ru/portal/products/lightautoml)

[](https://www.youtube.com/live/4pbO673B9Oo), [](https://youtu.be/ci8uqgWFJGg), [](https://youtu.be/TYu1UG-E9e8), [](https://www.youtube.com/playlist?list=PLJU_M19giWaEXcQtWWhpOKJf_luMc12B2), [](https://youtu.be/hr8GbPOHaEE)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/AILab-MLTools/LightAutoML/blob/master/examples/tutorials/Tutorial_1_basics.ipynb) | 22.12.2024 |

| TorchGeo | PyTorch domain library that provides datasets, transforms, samplers, and pre-trained models specific to geospatial data | 

[Adam Stewart](https://github.com/adamjstewart)
 [Caleb Robinson](https://calebrob.com/)
 [Isaac Corley](https://github.com/isaaccorley)
others[Anthony Ortiz](https://github.com/anthonymlortiz)
 [Juan Lavista Ferres](https://www.microsoft.com/en-us/research/people/jlavista/)
 [Arindam Banerjee](https://arindam.cs.illinois.edu/)

 | [![](https://img.shields.io/github/stars/microsoft/torchgeo?style=social)](https://github.com/microsoft/torchgeo) 

[NDBI](https://www.linkedin.com/pulse/ndvi-ndbi-ndwi-calculation-using-landsat-7-8-tek-bahadur-kshetri/)

[NDVI](https://gisgeography.com/ndvi-normalized-difference-vegetation-index/)

[NDWI](https://custom-scripts.sentinel-hub.com/custom-scripts/sentinel-2/ndwi/)

[](https://arxiv.org/abs/2111.08872)

[data](https://docs.sentinel-hub.com/api/latest/data/sentinel-2-l2a/), [data](https://www.cogeo.org/)

[](https://github.com/davemlz/awesome-spectral-indices)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/microsoft/torchgeo/blob/main/docs/tutorials/indices.ipynb) | 19.12.2024 |

| Langfun | PyGlove powered library that aims to make language models fun to work with | [Daiyi Peng](https://github.com/daiyip) | [![](https://img.shields.io/github/stars/google/langfun?style=social)](https://github.com/google/langfun) 

[](https://discord.gg/U6wPN9R68k)

[](https://github.com/pallets/jinja)

[](https://pypi.org/project/langfun/)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/google/langfun/blob/main/docs/notebooks/langfun101.ipynb) | 16.12.2024 |

| ComfyUI | Powerful and modular stable diffusion GUI and backend | [comfyanonymous](https://github.com/comfyanonymous) | [![](https://img.shields.io/github/stars/comfyanonymous/ComfyUI?style=social)](https://github.com/comfyanonymous/ComfyUI) 

[examples](https://comfyanonymous.github.io/ComfyUI_examples/)

[](https://github.com/madebyollin/taesd)

[pytorch](https://developer.apple.com/metal/pytorch/)

[](https://www.reddit.com/r/StableDiffusion/comments/10lzgze/i_figured_out_a_way_to_apply_different_prompts_to/)

[](https://youtu.be/vUTV85D51yk), [](https://youtu.be/gySLXbe7WZQ), [](https://youtu.be/ovjeVGmy6ZM)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/comfyanonymous/ComfyUI/blob/master/notebooks/comfyui_colab.ipynb) | 13.12.2024 |

| TransformerLens | Library for doing mechanistic interpretability of GPT-2 Style language models | 

[Neel Nanda](https://www.neelnanda.io/about)
 [Joseph Bloom](https://github.com/jbloomAus)

 | [![](https://img.shields.io/github/stars/TransformerLensOrg/TransformerLens?style=social)](https://github.com/TransformerLensOrg/TransformerLens) 

[](https://arxiv.org/abs/2302.03025), [](https://arxiv.org/abs/2303.08112)

[](https://transformerlensorg.github.io/TransformerLens/)

[](https://github.com/jbloomAus/DecisionTransformerInterpretability)

[](https://medium.com/@fgkffbvkhg/transformerlens-understanding-the-model-e339be551299)

[](https://pypi.org/project/transformer-lens/)

[](https://join.slack.com/t/opensourcemechanistic/shared_invite/zt-1qosyh8g3-9bF3gamhLNJiqCL_QqLFrA)

[](https://www.youtube.com/channel/UCBMJ0D-omcRay8dh4QT0doQ), [](https://youtu.be/oL67e-uEgWI)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/TransformerLensOrg/TransformerLens/blob/main/demos/Main_Demo.ipynb) | 09.12.2024 |

| Supervision | Reusable computer vision tools | [Piotr Skalski](https://github.com/SkalskiP) | [![](https://img.shields.io/github/stars/roboflow/supervision?style=social)](https://github.com/roboflow/supervision) 

[](https://discord.gg/GbfgXGJ8Bk)

[](https://github.com/roboflow/inference), [](https://docs.roboflow.com/)

[](https://github.com/roboflow/notebooks)

[](https://huggingface.co/spaces/Roboflow/Annotators)

[](https://www.kaggle.com/code/leoroboflow/inferring-on-a-dataset-with-a-roboflow-model)

[](https://pypi.org/project/supervision/)

[website](https://supervision.roboflow.com/)

[](https://youtu.be/uWP6UjDeZvY), [](https://youtu.be/4Q3ut7vqD5o), [](https://youtube.com/roboflow)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/roboflow/supervision/blob/main/demo.ipynb) | 05.12.2024 |

| XManager | Framework for managing machine learning experiment | [Andrew Chen](https://github.com/andrewluchen) | [![](https://img.shields.io/github/stars/google-deepmind/xmanager?style=social)](https://github.com/google-deepmind/xmanager) 

[](https://pypi.org/project/xmanager/)

[slides](https://storage.googleapis.com/gresearch/xmanager/deepmind_xmanager_slides.pdf)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/google-deepmind/xmanager/blob/main/colab_codelab.ipynb) | 05.12.2024 |

| Flax | Neural network library and ecosystem for JAX designed for flexibility | 

[Jonathan Heek](https://github.com/jheek)
 [Anselm Levskaya](https://anselmlevskaya.com/)
 [Avital Oliver](https://github.com/avital)
others[Marvin Ritter](https://github.com/Marvin182)
 [Bertrand Rondepierre](https://github.com/BertrandRdp)
 [Andreas Steiner](https://github.com/andsteing)
 [Marc van Zee](https://research.google/people/marc-van-zee/)

 | [![](https://img.shields.io/github/stars/google/flax?style=social)](https://github.com/google/flax) 

[](https://flax.readthedocs.io/)

[](https://github.com/huggingface/transformers/tree/main/examples/flax)

[](https://medium.com/syncedreview/google-introduces-flax-a-neural-network-library-for-jax-84bdc6f8f160)

[](https://www.reddit.com/r/MachineLearning/comments/erpdf7/p_flax_a_neural_network_library_for_jax_designed/)

[](https://youtu.be/e8StU6WQCqw), [](https://youtu.be/HOlQzrn84A4), [](https://youtu.be/5eUSmJvK8WA)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/google/flax/blob/main/docs/quick_start.ipynb) | 05.12.2024 |

| Haiku | A library built on top of JAX designed to provide simple, composable abstractions for machine learning research | 

[Tom Hennigan](https://github.com/tomhennigan)
 [Trevor Cai](https://github.com/trevorcai)
 [Tamara Norman](https://github.com/tamaranorman)
 [Igor Babuschkin](https://www.babushk.in/)

 | [![](https://img.shields.io/github/stars/deepmind/dm-haiku?style=social)](https://github.com/deepmind/dm-haiku) 

[](https://dm-haiku.readthedocs.io/en/latest/)

[website](https://www.haiku-os.org/)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/deepmind/dm-haiku/blob/main/examples/haiku_lstms.ipynb) | 05.12.2024 |

| SGLang | Fast serving framework for large language models and vision language models | 

[Lianmin Zheng](https://lmzheng.net/)
 [Liangsheng Yin](https://www.lsyin.me/)
 [Zhiqiang Xie](https://zhiqiangxie.com/)
others[Chuyue Sun](https://web.stanford.edu/~chuyues/)
 [Jeff Huang](https://engineering.tamu.edu/cse/profiles/huang-jeff.html)
 [Hao Yu](https://comaniac.github.io/)
 [Shiyi Cao](https://shiyicao.com/)
 [Christos Kozyrakis](https://web.stanford.edu/~kozyraki/)
 [Ion Stoica](https://people.eecs.berkeley.edu/~istoica/)
 [Joseph Gonzalez](https://people.eecs.berkeley.edu/~jegonzal/)
 [Clark Barrett](https://theory.stanford.edu/~barrett/)
 [Ying Sheng](https://zhyncs.com/)

 | [![](https://img.shields.io/github/stars/sgl-project/sglang?style=social)](https://github.com/sgl-project/sglang) 

[](https://arxiv.org/abs/2312.07104)

[blog post](https://lmsys.org/blog/2024-01-17-sglang/)

[](https://docs.sglang.ai/)

[](https://github.com/flashinfer-ai/flashinfer), [](https://github.com/dottxt-ai/outlines), [](https://github.com/eth-sri/lmql), [](https://github.com/vllm-project/vllm)

[](https://proceedings.neurips.cc/paper_files/paper/2024/hash/724be4472168f31ba1c9ac630f15dec8-Abstract-Conference.html)

[project](https://sky.cs.berkeley.edu/project/sglang/)

[](https://pytorch.org/blog/sglang-joins-pytorch/)

[](https://pypi.org/project/sglang/)

[](https://slack.sglang.ai/)

[](https://x.com/lmsysorg/status/1887262321636221412)

[](https://www.youtube.com/playlist?list=PLIOOI9FTWEi84bDmxzsXMatlEAQwNRX_l), [](https://youtu.be/XQylGyG7yp8), [](https://youtu.be/rwnIdla82pc), [](https://youtu.be/Ny4xxErgFgQ), [](https://youtu.be/AqMVjuY4peQ)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/sgl-project/sglang/blob/master/examples/frontend_language/usage/rag_using_parea/trace_and_evaluate_rag_using_parea.ipynb) | 05.12.2024 |

| SAE Lens | Training Sparse Autoencoders on Language Models | 

[Joseph Bloom](https://github.com/jbloomAus)
 [Curt Tigges](https://curttigges.com/)
 [David Chanin](https://chanind.github.io/)

 | [![](https://img.shields.io/github/stars/jbloomAus/SAELens?style=social)](https://github.com/jbloomAus/SAELens) 

[](https://jbloomaus.github.io/SAELens/)

[](https://pypi.org/project/sae-lens/)

[](https://join.slack.com/t/opensourcemechanistic/shared_invite/zt-2k0id7mv8-CsIgPLmmHd03RPJmLUcapw)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/jbloomAus/SAELens/blob/main/tutorials/tutorial_2_0.ipynb) | 03.12.2024 |

| Feast | An open source feature store for machine learning | 

[Willem Pienaar](https://github.com/woop)
 [Danny Chiao](https://github.com/adchia)
 [Achal Shah](http://achals.com/)
others[Terence Lim](https://terryyylim.github.io/portfolio/)
 [Ches Martin](https://github.com/ches)
 [Judah Rand](https://github.com/judahrand)
 [Matt Delacour](https://github.com/MattDelac)
 [Miguel Trejo Marrufo](https://github.com/TremaMiguel)
 [Francisco Javier Arceo](https://franciscojavierarceo.github.io/)

 | [![](https://img.shields.io/github/stars/feast-dev/feast?style=social)](https://github.com/feast-dev/feast) 

[](https://docs.feast.dev/)

[](https://github.com/baineng/feast-hive), [](https://github.com/Shopify/feast-trino), [](https://github.com/Azure/feast-azure), [](https://github.com/amundsen-io/amundsen/blob/main/databuilder/databuilder/extractor/feast_extractor.py)

[website](https://feast.dev/)

[](https://youtu.be/DaNv-Wf1MBA), [](https://youtu.be/p2cuq4eJ2BY)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/feast-dev/feast/blob/master/examples/quickstart/quickstart.ipynb) | 22.11.2024 |

| FiftyOne | Open-source tool for building high-quality datasets and computer vision models | 

[Brian Moore](https://github.com/brimoor)
 [Jason Corso](https://web.eecs.umich.edu/~jjcorso/)

 | [![](https://img.shields.io/github/stars/voxel51/fiftyone?style=social)](https://github.com/voxel51/fiftyone) 

[blog post](https://voxel51.com/blog/)

[](https://docs.voxel51.com/)

[](https://github.com/voxel51/fiftyone-examples)

[](https://medium.com/voxel51), [](https://towardsdatascience.com/open-source-tools-for-fast-computer-vision-model-building-b39755aab490)

[](https://slack.voxel51.com/)

[](https://twitter.com/voxel51)

[website](https://voxel51.com/fiftyone/)

[](https://www.youtube.com/playlist?list=PLuREAXoPgT0SJLKsgFzKxffMApbXp90Gi)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/voxel51/fiftyone-examples/blob/master/examples/quickstart.ipynb) | 21.11.2024 |

| CatBoost | High-performance open source library for gradient boosting on decision trees | 

[Anna Veronika Dorogush](https://github.com/annaveronika)
 [Vasily Ershov](https://linkedin.com/in/vasily-ershov-04768199)
 [Andrey Gulin](https://www.linkedin.com/in/andreygulin)
others[Liudmila Prokhorenkova](https://github.com/ostroumova-la)
 [Gleb Gusev](https://scholar.google.com/citations?user=RWX4sYcAAAAJ)
 [Aleksandr Vorobev](https://scholar.google.com/citations?user=WiCXGGIAAAAJ)

 | [![](https://img.shields.io/github/stars/catboost/catboost?style=social)](https://github.com/catboost/catboost) 

[](https://arxiv.org/abs/1810.11363), [](https://arxiv.org/abs/1706.09516)

[](https://catboost.ai/en/docs/)

[](https://medium.com/@mohan-gupta/catboost-algorithm-2156129d740d)

[](https://papers.nips.cc/paper_files/paper/2018/hash/14491b756b3a51daac41c24863285549-Abstract.html)

[](https://pypi.org/project/catboost/)

[](https://twitter.com/CatBoostML)

[website](https://catboost.ai/)

[](https://en.wikipedia.org/wiki/CatBoost)

[](https://youtu.be/8o0e-r0B5xQ), [](https://youtu.be/usdEWSDisS0), [](https://youtu.be/KXOTSkPL2X4), [](https://youtu.be/UYDwhuyWYSo), [](https://youtu.be/xl1fwCza9C8), [](https://youtu.be/Q_xa4RvnDcY), [](https://youtu.be/ySla2kczbeM), [](https://youtu.be/47-mAVms-b8), [](https://youtu.be/nrGt5VKZpzc)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/catboost/tutorials/blob/master/python_tutorial.ipynb) | 18.11.2024 |

| Llama 3.1 | First openly available model that rivals the top AI models when it comes to state-of-the-art capabilities in general knowledge, steerability, math, tool use, and multilingual translation | [unsloth](https://unsloth.ai/) | [![](https://img.shields.io/github/stars/unslothai/unsloth?style=social)](https://github.com/unslothai/unsloth) 

[blog post](https://unsloth.ai/blog/llama3-1)

[](https://discord.gg/unsloth)

[](https://huggingface.co/meta-llama)

[](https://www.kaggle.com/danielhanchen/kaggle-llama-3-1-8b-unsloth-notebook)

[](https://ai.meta.com/blog/meta-llama-3-1/), [](https://llama.meta.com/), [](https://about.fb.com/news/2024/07/open-source-ai-is-the-path-forward/)

[](https://pypi.org/project/unsloth/)

[](https://twitter.com/unslothai)

[](https://youtu.be/QyRWqJehK7I), [](https://youtu.be/1xdneyn6zjw), [](https://youtu.be/p5O-_AiKD_Q), [](https://youtu.be/4rk9fHIOGTU)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1Ys44kVvmeZtnICzWz0xgpRnrIOjZAuxp) | 17.11.2024 |

| Mistral Small | Enterprise-grade small model | [unsloth](https://unsloth.ai/) | [![](https://img.shields.io/github/stars/unslothai/unsloth?style=social)](https://github.com/unslothai/unsloth) 

[](https://discord.gg/unsloth)

[](https://pypi.org/project/unsloth/)

[](https://www.reddit.com/r/LocalLLaMA/comments/1fj4unz/mistralaimistralsmallinstruct2409_new_22b_from/)

[website](https://mistral.ai/)

[](https://youtu.be/damcEQdlpqY)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1oCEHcED15DzL8xXGU1VTx5ZfOJM8WY01) | 17.11.2024 |

| DPO Zephyr | Starting from a dataset of outputs ranked by a teacher model, we apply distilled direct preference optimization to learn a chat model with significantly improved intent alignment | 

[Lewis Tunstall](https://lewtun.github.io/blog/)
 [Edward Beeching](https://edbeeching.github.io/)
 [Nathan Lambert](https://www.natolambert.com/)
others[Nazneen Rajani](https://www.nazneenrajani.com/)
 [Kashif Rasul](https://github.com/kashif)
 [Younes Belkada](https://github.com/younesbelkada)
 [Shengyi Huang](https://costa.sh/)
 [Leandro Werra](https://github.com/lvwerra)
 [Clémentine Fourrier](https://github.com/clefourrier)
 [Nathan Habib](https://github.com/NathanHB)
 [Nathan Sarrazin](https://nsarrazin.com/)
 [Omar Sanseviero](https://osanseviero.github.io/hackerllama/)
 [Alexander Rush](https://scholar.google.com/citations?user=LIjnUGgAAAAJ)
 [Thomas Wolf](https://thomwolf.io/)

 | [![](https://img.shields.io/github/stars/xfactlab/orpo?style=social)](https://github.com/xfactlab/orpo) 

[](https://arxiv.org/abs/2310.16944)

[](https://discord.gg/unsloth)

[](https://github.com/unslothai/unsloth)

[](https://huggingface.co/blog/alvarobartt/notus-7b-v1), [](https://huggingface.co/alignment-handbook/zephyr-7b-dpo-full)

[](https://artgor.medium.com/paper-review-zephyr-direct-distillation-of-lm-alignment-035191df982e), [](https://isamu-website.medium.com/understanding-zephyr-12c5b9d3b822)

[](https://pypi.org/project/unsloth/)

[](https://www.reddit.com/r/MachineLearning/comments/17lmxk7/r_zephyr_direct_distillation_of_lm_alignment/)

[](https://x.com/Thom_Wolf/status/1717821614467739796)

[](https://youtu.be/wr2bHf41VE4), [](https://youtu.be/Js59UF_HUAs), [](https://youtu.be/cuObPxCOBCw), [](https://youtu.be/r2xpSzwetp0)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/15vttTpzzVXv_tJwEk-hIcQ0S9FcEWvwP) | 17.11.2024 |

| ORPO | Get up and running with large language models | 

[Jiwoo Hong](https://jiwooya1000.github.io/)
 [Noah Lee](https://nlee-208.github.io/)
 [James Thorne](https://jamesthorne.com/)

 | [![](https://img.shields.io/github/stars/xfactlab/orpo?style=social)](https://github.com/xfactlab/orpo) 

[](https://arxiv.org/abs/2403.07691)

[](https://discord.gg/unsloth)

[](https://github.com/unslothai/unsloth)

[](https://huggingface.co/datasets/reciperesearch/dolphin-sft-v0.1-preference), [](https://huggingface.co/docs/trl/main/en/orpo_trainer)

[](https://medium.com/@AriaLeeNotAriel/numbynum-orpo-monolithic-optimization-without-reference-model-hong-et-al-2024-reviewed-262d0778e08c), [](https://medium.com/@zergtant/optimizing-language-model-preferences-without-a-reference-model-introducing-the-orpo-method-1144b3e7aec3)

[](https://pypi.org/project/unsloth/)

[](https://www.reddit.com/r/LLMResearch/comments/1bh8iq5/orpo_monolithic_preference_optimization_without/)

[](https://youtu.be/52kMBrAI_IM), [](https://youtu.be/6kkJGkPZP88), [](https://youtu.be/8MEPCPdKUH8)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/11t4njE3c4Lxl-07OD8lJSMKkfyJml3Tn) | 17.11.2024 |

| Phi-3.5 | 3.8 billion parameter language model trained on 3.3 trillion tokens, whose overall performance, as measured by both academic benchmarks and internal testing, rivals that of models such as Mixtral 8x7B and GPT-3.5, despite being small enough to be deployed on a phone | [unsloth](https://unsloth.ai/) | [![](https://img.shields.io/github/stars/unslothai/unsloth?style=social)](https://github.com/unslothai/unsloth) 

[](https://arxiv.org/abs/2404.14219)

[blog post](https://azure.microsoft.com/en-us/blog/introducing-phi-3-redefining-whats-possible-with-slms/)

[](https://discord.gg/unsloth)

[](https://huggingface.co/collections/microsoft/phi-3-6626e15e9585a200d2d761e3)

[](https://medium.com/@mysocial81/phi-3-5-microsofts-efficient-multilingual-and-secure-open-source-slms-5ed7d36738aa)

[](https://pypi.org/project/unsloth/)

[](https://www.reddit.com/r/mlscaling/comments/1cberec/phi3_technical_report_a_highly_capable_language/), [](https://www.reddit.com/r/LocalLLaMA/comments/1ey5i22/phi35_is_very_safe_microsoft_really_outdid/)

[](https://twitter.com/unslothai)

[website](https://azure.microsoft.com/en-us/products/phi-3)

[](https://youtu.be/Enp70Kkjb8k)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1lN6hPQveB_mHSnTOYifygFcrO8C1bxq4) | 17.11.2024 |

| Simple audio recognition | This tutorial will show you how to build a basic speech recognition network that recognizes ten different words | [Google](https://www.tensorflow.org/) | 

[coursera](https://www.coursera.org/lecture/audio-signal-processing/stft-2-tjEQe)

[](https://paperswithcode.com/task/speech-recognition)

[](https://www.tensorflow.org/datasets/catalog/speech_commands), [](https://www.tensorflow.org/tutorials/audio/simple_audio)

[tf.js](https://codelabs.developers.google.com/codelabs/tensorflowjs-audio-codelab/index.html)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/tensorflow/docs/blob/master/site/en/tutorials/audio/simple_audio.ipynb) | 15.11.2024 |

| High-performance simulations with TFF | This tutorial will describe how to setup high-performance simulations with TFF in a variety of common scenarios | [Krzysztof Ostrowski](https://github.com/krzys-ostrowski) | 

[](https://paperswithcode.com/task/federated-learning)

[](https://pypi.org/project/tensorflow-federated/)

[](https://www.tensorflow.org/federated/tutorials/simulations)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/tensorflow/federated/blob/master/docs/tutorials/simulations.ipynb) | 01.11.2024 |

| Autodistill | Uses big, slower foundation models to train small, faster supervised models | [autodistill](https://github.com/autodistill) | [![](https://img.shields.io/github/stars/autodistill/autodistill?style=social)](https://github.com/autodistill/autodistill) 

[blog post](https://blog.roboflow.com/autodistill/)

[](https://docs.autodistill.com/)

[](https://github.com/autodistill/autodistill-grounded-sam), [](https://github.com/autodistill/autodistill-yolov8), [](https://github.com/autodistill/autodistill-yolonas), [](https://github.com/autodistill/autodistill-yolov5), [](https://github.com/autodistill/autodistill-detr), [](https://github.com/autodistill/autodistill-detic), [](https://github.com/autodistill/autodistill-grounding-dino), [](https://github.com/autodistill/autodistill-owl-vit), [](https://github.com/autodistill/autodistill-sam-clip), [](https://github.com/autodistill/autodistill-llava), [](https://github.com/autodistill/autodistill-kosmos-2), [](https://github.com/autodistill/autodistill-owlv2), [](https://github.com/autodistill/autodistill-roboflow-universe), [](https://github.com/autodistill/autodistill-azure-vision), [](https://github.com/autodistill/autodistill-rekognition), [](https://github.com/autodistill/autodistill-gcp-vision), [](https://github.com/roboflow/inference)

[](https://youtu.be/gKTYMfwPo4M), [](https://youtu.be/M_QZ_Q0zT0k), [](https://youtube.com/roboflow)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/roboflow-ai/notebooks/blob/main/notebooks/how-to-auto-train-yolov8-model-with-autodistill.ipynb) | 01.11.2024 |

| Swarm | Educational framework exploring ergonomic, lightweight multi-agent orchestration | 

[Ilan Bigio](https://ilanbigio.com/)
 [James Hills](https://github.com/jhills20)
 [Shyamal Anadkat](https://shyamal.me/)
others[Charu Jaiswal](https://github.com/charuj)
 [Colin Jarvis](https://github.com/colin-openai)
 [Katia Guzman](https://github.com/katia-openai)

 | [![](https://img.shields.io/github/stars/openai/swarm?style=social)](https://github.com/openai/swarm) 

[](https://medium.com/@michael_79773/exploring-openais-swarm-an-experimental-framework-for-multi-agent-systems-5ba09964ca18), [](https://ai.plainenglish.io/openai-releases-swarm-what-is-it-b61ecb88d67e)

[](https://www.reddit.com/r/LocalLLaMA/comments/1g56itb/openai_swarm_the_agentic_framework_should_you_care/)

[](https://youtu.be/Cw0ME8OZ0xI), [](https://youtu.be/q7_5eCmu0MY), [](https://youtu.be/LBih635lzps), [](https://youtu.be/npAljHBeKPc)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/NirDiamant/GenAI_Agents/blob/main/all_agents_tutorials/blog_writer_swarm.ipynb) | 15.10.2024 |

| TRL | Set of tools to train transformer language models with Reinforcement Learning, from the Supervised Fine-tuning step, Reward Modeling step to the Proximal Policy Optimization step | 

[Leandro Werra](https://github.com/lvwerra)
 [Younes Belkada](https://github.com/younesbelkada)
 [Lewis Tunstall](https://lewtun.github.io/blog/)
others[Edward Beeching](https://edbeeching.github.io/)
 [Tristan Thrush](http://www.tristanthrush.com/)
 [Nathan Lambert](https://www.natolambert.com/)

 | [![](https://img.shields.io/github/stars/huggingface/trl?style=social)](https://github.com/huggingface/trl) 

[](https://arxiv.org/abs/1909.08593)

[](http://hf.co/docs/trl)

[](https://github.com/openai/lm-human-preferences)

[](https://youtu.be/xQ5nc1CF7iQ), [](https://youtu.be/67SO20dszNA)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/huggingface/trl/blob/master/examples/notebooks/best_of_n.ipynb) | 24.09.2024 |

| TFX | End-to-end platform for deploying production ML pipelines | 

[Denis Baylor](https://github.com/lamberta)
 [Eric Breck](https://www.linkedin.com/in/eric-breck-6312606)
 [Heng-Tze Cheng](http://users.ece.cmu.edu/~hengtzec/MySite/index.html)
others[Noah Fiedel](https://scholar.google.com/citations?user=XWpV9DsAAAAJ)
 [Chuan Yu Foo](https://dl.acm.org/profile/99659193746)
 [Zakaria Haque](https://www.linkedin.com/in/zakariahaque)
 [Salem Haykal](https://www.linkedin.com/in/salem-haykal-aba16031)
 [Mustafa Ispir](https://ch.linkedin.com/in/mustafa-ispir-66a7b31)
 [Vihan Jain](https://scholar.google.co.in/citations?user=_IK8IzgAAAAJ)
 [Levent Koc](https://pages.cs.wisc.edu/~koc/)
 [Chiu Yuen Koo](https://www.iacr.org/cryptodb/data/author.php?authorkey=1710)
 [Lukasz Lew](https://scholar.google.com/citations?user=q65lmCAAAAAJ)
 [Clemens Mewald](https://www.linkedin.com/in/clemensmewald)
 [Akshay Naresh](https://github.com/akshaym)
 [Neoklis Polyzotis](https://scholar.google.com/citations?user=vfIriy4AAAAJ)
 [Sukriti Ramesh](https://github.com/sukritiramesh)
 [Sudip Roy](https://scholar.google.com/citations?user=jRRrZZcAAAAJ)
 [Steven Euijong Whang](https://sites.google.com/view/whanglab/professor)
 [Martin Wicke](https://research.google/people/martinwicke/)
 [Jarek Wilkiewicz](http://jarekwilkiewicz.com/blog/about/)
 [Xin Zhang](https://dl.acm.org/profile/99659193152)
 [Martin Zinkevich](https://martin.zinkevich.org/)

 | [![](https://img.shields.io/github/stars/tensorflow/tfx?style=social)](https://github.com/tensorflow/tfx) 

[KDD](https://dl.acm.org/doi/10.1145/3097983.3098021)

[coursera](https://www.coursera.org/learn/introduction-to-machine-learning-in-production)

[](https://tensorflow.github.io/tfx/)

[](https://github.com/google/ml-metadata), [](https://github.com/tensorflow/community/blob/master/governance/TF-RFCs.md)

[](https://medium.com/@robertf99/mlops-with-tensorflow-extended-tfx-and-tensorflow-decision-forest-tf-df-part-1-bfa2f61580dc)

[](https://pypi.org/project/tfx/)

[](https://www.tensorflow.org/tfx), [](https://blog.tensorflow.org/2019/06/tensorflow-extended-tfx-real-world_26.html)

[](https://www.youtube.com/playlist?list=PLQY2H8rRoyvxR15n04JiW0ezF5HQRs_8F), [](https://youtu.be/17l3VR2MIeg)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/tensorflow/tfx/blob/master/docs/tutorials/tfx/components.ipynb) | 18.09.2024 |

| TFDV | TensorFlow Data Validation is a library for exploring and validating machine learning data | 

[Eric Breck](https://www.linkedin.com/in/eric-breck-6312606)
 [Neoklis Polyzotis](https://scholar.google.com/citations?user=vfIriy4AAAAJ)
 [Sudip Roy](https://scholar.google.com/citations?user=jRRrZZcAAAAJ)
others[Steven Euijong Whang](https://sites.google.com/view/whanglab/professor)
 [Martin Zinkevich](https://martin.zinkevich.org/)

 | [![](https://img.shields.io/github/stars/tensorflow/data-validation?style=social)](https://github.com/tensorflow/data-validation) 

[MLSys](https://proceedings.mlsys.org/paper_files/paper/2019/hash/928f1160e52192e3e0017fb63ab65391-Abstract.html)

[](https://medium.com/tensorflow/introducing-tensorflow-data-validation-data-understanding-validation-and-monitoring-at-scale-d38e3952c2f0)

[](https://pypi.org/project/tensorflow-data-validation/)

[](https://www.tensorflow.org/tfx/data_validation/get_started)

[](https://youtu.be/eGIG_qHgQ08), [](https://youtu.be/aMDSmNyvuJ0), [](https://youtu.be/twOlbCM_Cns)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/tensorflow/tfx/blob/master/docs/tutorials/data_validation/tfdv_basic.ipynb) | 18.09.2024 |

| PEFT | Parameter-Efficient Fine-Tuning methods enable efficient adaptation of pre-trained language models to various downstream applications without fine-tuning all the model's parameters | 

[Sourab Mangrulkar](https://github.com/pacman100)
 [Sylvain Gugger](https://github.com/sgugger)
 [Lysandre Debut](http://lysand.re/)
others[Younes Belkada](https://github.com/younesbelkada)
 [Sayak Paul](https://sayak.dev/)

 | [![](https://img.shields.io/github/stars/huggingface/peft?style=social)](https://github.com/huggingface/peft) 

[blog post](https://www.philschmid.de/fine-tune-flan-t5-peft)

[](https://huggingface.co/docs/peft)

[](https://github.com/microsoft/DeepSpeed/issues/3002)

[](https://huggingface.co/datasets/ought/raft/viewer/twitter_complaints), [](https://huggingface.co/bigscience/T0_3B), [](https://huggingface.co/bigscience/mt0-xxl), [](https://huggingface.co/facebook/opt-6.7b), [](https://huggingface.co/roberta-large), [](https://huggingface.co/datasets/glue/viewer/mrpc)

[](https://youtu.be/YVU5wAA6Txo), [](https://youtu.be/Us5ZFp16PaU), [](https://youtu.be/YKCtbIJC3kQ)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/huggingface/peft/blob/master/examples/int8_training/Finetune_flan_t5_large_bnb_peft.ipynb) | 13.09.2024 |

| SAA+ | Framework, Segment Any Anomaly +, for zero-shot anomaly segmentation with hybrid prompt regularization to improve the adaptability of modern foundation models | 

[Yunkang Cao](https://caoyunkang.github.io/)
 [Xiaohao Xu](https://scholar.google.com/citations?user=3Ifn2DoAAAAJ)
 [Chen Sun](https://www.researchgate.net/profile/Chen-Sun-58)
others[Yuqi Cheng](https://scholar.google.com/citations?user=02BC-WgAAAAJ)
 [Zongwei Du](https://github.com/duzongwei)
 [Liang Gao](https://scholar.google.com/citations?user=NqIi8_8AAAAJ)
 [Weiming Shen](https://scholar.google.com/citations?user=FuSHsx4AAAAJ)

 | [![](https://img.shields.io/github/stars/caoyunkang/Segment-Any-Anomaly?style=social)](https://github.com/caoyunkang/Segment-Any-Anomaly) 

[](https://arxiv.org/abs/2305.10724)

[](https://github.com/abin24/Magnetic-tile-defect-datasets.), [](https://github.com/caoyunkang/WinClip)

[](https://huggingface.co/spaces/Caoyunkang/Segment-Any-Anomaly)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/12Sh0j92YYmTa0oIuSEWWpPBCpIwCSVhz) | 13.09.2024 |

| TensorRT | SDK for high-performance deep learning inference, includes a deep learning inference optimizer and runtime that delivers low latency and high throughput for inference applications | [nvidia](https://developer.nvidia.com/) | [![](https://img.shields.io/github/stars/NVIDIA/TensorRT?style=social)](https://github.com/NVIDIA/TensorRT) 

[blog post](https://developer.nvidia.com/blog/speeding-up-deep-learning-inference-using-tensorrt-updated/)

[](https://docs.nvidia.com/deeplearning/tensorrt/)

[forum](https://forums.developer.nvidia.com/c/ai-data-science/deep-learning/tensorrt)

[website](https://developer.nvidia.com/tensorrt)

[](https://youtu.be/TU5BMU6iYZ0), [](https://youtu.be/6rZNLaS775w), [](https://youtu.be/G_KhUFCUSsY), [](https://youtu.be/7kJ-jph9gCw)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/NVIDIA/TensorRT/blob/main/quickstart/IntroNotebooks/0.%20Running%20This%20Guide.ipynb) | 12.09.2024 |

| guidance | Enables you to control modern language models more effectively and efficiently than traditional prompting or chaining | [Scott Lundberg](https://scottlundberg.com/) | [![](https://img.shields.io/github/stars/guidance-ai/guidance?style=social)](https://github.com/guidance-ai/guidance) 

[](https://guidance.readthedocs.io/en/latest/)

[](https://pypi.org/project/guidance/)

[website](https://www.microsoft.com/en-us/research/project/guidance-control-lm-output/)

[](https://youtu.be/qXMNPVVlCMs), [](https://youtu.be/TQ_8tFM2e40), [](https://youtu.be/4Wz61w5zbCk)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/guidance-ai/guidance/blob/main/notebooks/tutorials/intro_to_guidance.ipynb) | 09.09.2024 |

| Arena-Hard-Auto | Automated pipeline that leverages LLMs to curate high-quality, open-ended prompts from large, crowd-sourced datasets, enabling continuous benchmark updates without human in the loop | 

[Tianle Li](https://github.com/CodingWithTim)
 [Wei-Lin Chiang](https://infwinston.github.io/)
 [Evan Frick](https://efrick2002.github.io/)
others[Lisa Dunlap](https://www.lisabdunlap.com/)
 [Tianhao Wu](https://thwu1.github.io/tianhaowu/)
 [Banghua Zhu](https://people.eecs.berkeley.edu/~banghua/)
 [Joseph Gonzalez](https://people.eecs.berkeley.edu/~jegonzal/)
 [Ion Stoica](https://people.eecs.berkeley.edu/~istoica/)

 | [![](https://img.shields.io/github/stars/lmarena/arena-hard-auto?style=social)](https://github.com/lmarena/arena-hard-auto) 

[](https://arxiv.org/abs/2406.11939)

[blog post](https://lmsys.org/blog/2024-05-17-category-hard/), [blog post](https://lmsys.org/blog/2024-08-28-style-control/)

[](https://huggingface.co/spaces/lmarena-ai/arena-hard-viewer), [](https://huggingface.co/datasets/lmarena-ai/arena-hard-auto)

[](https://x.com/lmarena_ai)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1ar6XLWREN_dXEh404WNOxroFVUe_4njp) | 09.09.2024 |

| DataChain | AI-dataframe to enrich, transform and analyze data from cloud storages for ML training and LLM apps | [Daniel K](https://github.com/volkfox) | [![](https://img.shields.io/github/stars/iterative/datachain?style=social)](https://github.com/iterative/datachain) 

[](https://dvc.org/chat)

[](https://datachain.dvc.ai/)

[](https://pypi.org/project/datachain/)

[](https://twitter.com/DVCorg)

[](https://youtu.be/qoqhllB3gN8), [](https://www.youtube.com/live/JT5AwGz5QMI)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/iterative/datachain-examples/blob/main/multimodal/clip_fine_tuning.ipynb) | 09.09.2024 |

| TFF for Federated Learning Research: Model and Update Compression | We use the EMNIST dataset to demonstrate how to enable lossy compression algorithms to reduce communication cost in the Federated Averaging algorithm | [Weikang Song](https://github.com/swkpku) | 

[](https://arxiv.org/abs/1602.05629)

[](https://paperswithcode.com/task/federated-learning)

[](https://pypi.org/project/tensorflow-federated/)

[tensor encoding](http://jakubkonecny.com/files/tensor_encoding.pdf)

[](https://www.tensorflow.org/federated/api_docs/python/tff/simulation/datasets/emnist), [](https://www.tensorflow.org/federated/api_docs/python/tff/learning/build_federated_averaging_process), [](https://www.tensorflow.org/federated/tutorials/tff_for_federated_learning_research_compression)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/tensorflow/federated/blob/master/docs/tutorials/tff_for_federated_learning_research_compression.ipynb) | 05.09.2024 |

| LlamaIndex | Data framework for your LLM application | [Jerry Liu](https://github.com/jerryjliu) | [![](https://img.shields.io/github/stars/run-llama/llama_index?style=social)](https://github.com/run-llama/llama_index) 

[](https://discord.gg/dGcwcsnxhU)

[](https://docs.llamaindex.ai/en/stable/)

[](https://github.com/run-llama/LlamaIndexTS), [](https://github.com/run-llama/llama-lab)

[](https://llama.meta.com/docs/integration-guides/llamaindex/)

[](https://pypi.org/project/llama-index/)

[](https://twitter.com/llama_index)

[website](https://www.llamaindex.ai/)

[](https://www.youtube.com/@LlamaIndex), [](https://youtu.be/TRjq7t2Ms5I), [](https://youtu.be/pApPGFwbigI), [](https://youtu.be/zeAyuLc_f3Q), [](https://youtu.be/hH4WkgILUD4), [](https://youtu.be/v6g8eo86T8A), [](https://youtu.be/FQBou-YgxyE), [](https://youtu.be/bQw92baScME), [](https://youtu.be/cNMYeW2mpBs)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/run-llama/llama_index/blob/main/docs/docs/examples/cookbooks/oreilly_course_cookbooks/Module-2/Components_Of_LlamaIndex.ipynb) | 05.09.2024 |

| Deforum Stable Diffusion | Open source project is designed to be free to use and easy to modify for custom needs and pipelines | 

[EnzymeZoo](https://linktr.ee/enzymezoo)
 [Артем Храпов](https://github.com/kabachuha)
 [Forest Star Walz](https://github.com/reallybigname)
 [pharmapsychotic](https://github.com/pharmapsychotic)

 | [![](https://img.shields.io/github/stars/deforum-art/deforum-stable-diffusion?style=social)](https://github.com/deforum-art/deforum-stable-diffusion) 

[](https://discord.gg/deforum)

[](https://docs.google.com/document/d/1RrQv7FntzOuLg4ohjRZPVL7iptIyBhwwbcEYEW2OfcI)

[project](https://deforum.github.io/)

[](https://youtu.be/w_sxuDMt_V0), [](https://youtu.be/bicPayZDI60), [](https://youtu.be/dqkQo2alZvU)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/deforum-art/deforum-stable-diffusion/blob/main/Deforum_Stable_Diffusion.ipynb) | 30.08.2024 |

| Nerfstudio | API that allows for a simplified end-to-end process of creating, training, and testing NeRFs | 

[Matthew Tancik](https://github.com/tancik)
 [Ethan Weber](https://ethanweber.me/)
 [Evonne Ng](http://people.eecs.berkeley.edu/~evonne_ng/)
others[Ruilong Li](http://www.liruilong.cn/)
 [Brent Yi](https://github.com/brentyi)
 [Justin Kerr](https://kerrj.github.io/)
 [Terrance Wang](https://github.com/terrancewang)
 [Alexander Kristoffersen](https://akristoffersen.com/)
 [Jake Austin](https://github.com/jake-austin)
 [Kamyar Salahi](https://github.com/TheQuantumFractal)
 [Abhik Ahuja](https://abhikahuja.com/)
 [David McAllister](https://github.com/mcallisterdavid)
 [Angjoo Kanazawa](https://github.com/akanazawa)

 | [![](https://img.shields.io/github/stars/nerfstudio-project/nerfstudio?style=social)](https://github.com/nerfstudio-project/nerfstudio) 

[Viewer](https://viewer.nerf.studio/)

[](https://arxiv.org/abs/2302.04264)

[](https://discord.gg/uMbNqcraFc)

[](https://docs.nerf.studio/en/latest/)

[](https://github.com/NVlabs/tiny-cuda-nn)

[](https://twitter.com/nerfstudioteam)

[](https://youtu.be/XwKq7qDQCQk), [](https://youtu.be/nSFsugarWzk), [](https://youtu.be/h5EWiRRxYEQ), [](https://youtu.be/8cv9G7izdPY)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/nerfstudio-project/nerfstudio/blob/main/colab/demo.ipynb) | 19.08.2024 |

| highway-env | A collection of environments for autonomous driving and tactical decision-making tasks | [Edouard Leurent](https://edouardleurent.com/) | [![](https://img.shields.io/github/stars/eleurent/highway-env?style=social)](https://github.com/eleurent/highway-env) 

[](https://arxiv.org/abs/2102.03483), [](https://arxiv.org/abs/2105.05701), [](https://arxiv.org/abs/2101.07140)

[](https://highway-env.readthedocs.io/en/latest/)

[](https://github.com/eleurent/rl-agents), [](https://github.com/eleurent/finite-mdp), [](https://github.com/openai/baselines/tree/master/baselines/her)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/eleurent/highway-env/blob/master/scripts/parking_model_based.ipynb) | 09.08.2024 |

| GNN | Production-tested library for building GNNs at large scale | 

[Oleksandr Ferludin](https://github.com/aferludin)
 [Arno Eigenwillig](https://github.com/arnoegw)
 [Martin Blais](https://github.com/blais)
others[Dustin Zelle](https://github.com/dzelle)
 [Jan Pfeifer](https://github.com/janpfeifer)
 [Alvaro Sanchez-Gonzalez](https://github.com/alvarosg)
 [Wai Lok Sibon Li](https://scholar.google.com/citations?user=qX9aUx8AAAAJ)
 [Sami Abu-El-Haija](https://samihaija.github.io/)
 [Peter Battaglia](https://scholar.google.com/citations?user=nQ7Ij30AAAAJ)
 [Neslihan Bulut](https://scholar.google.com/citations?user=k_cadGsAAAAJ)
 [Jonathan Halcrow](https://scholar.google.com/citations?user=2zZucy4AAAAJ)
 [Filipe Miguel Gonçalves de Almeida](https://github.com/fmgda)
 [Pedro Gonnet](https://research.google/people/pedro-gonnet/)
 [Liangze Jiang](https://liangzejiang.github.io/)
 [Parth Kothari](https://thedebugger811.github.io/)
 [Silvio Lattanzi](https://sites.google.com/site/silviolattanzi/)
 [André Linhares](https://scholar.google.com/citations?user=YYRnhTkAAAAJ)
 [Brandon Mayer](https://github.com/brandonmayer-zz)
 [Vahab Mirrokni](https://people.csail.mit.edu/mirrokni/Welcome.html)
 [John Palowitch](http://ml.johnpalowitch.com/)
 [Mihir Paradkar](https://www.linkedin.com/in/mihir-paradkar-22b88579)
 [Jennifer She](https://scholar.google.com/citations?user=Gjf_sd0AAAAJ)
 [Anton Tsitsulin](https://tsitsul.in/)
 [Kevin Villela](https://www.linkedin.com/in/kevin-villela-612a6443)
 [Lisa Wang](https://scholar.google.com/citations?user=5KmYPkIAAAAJ)
 [Bryan Perozzi](http://www.perozzi.net/)

 | [![](https://img.shields.io/github/stars/tensorflow/gnn?style=social)](https://github.com/tensorflow/gnn) 

[](https://arxiv.org/abs/2207.03522)

[](https://www.kaggle.com/code/fidels/introduction-to-tf-gnn)

[](https://medium.com/@techtes.com/getting-started-with-tf-gnn-with-python-26d8e341db05)

[](https://pypi.org/project/tensorflow-gnn/)

[](https://blog.tensorflow.org/2024/02/graph-neural-networks-in-tensorflow.html), [](https://blog.tensorflow.org/2021/11/introducing-tensorflow-gnn.html)

[](https://www.youtube.com/playlist?list=PL2PZTwLd0HMJC1fU_NkwwpRkcjoGqAECX), [](https://youtu.be/JqWROPYeqjA), [](https://youtu.be/YdGN-J322y4), [](https://youtu.be/VDzrvhgyxsU), [](https://www.youtube.com/live/e6WHg1l7AMs), [](https://youtu.be/a75Q6dtg1_s)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/tensorflow/gnn/blob/master/examples/notebooks/graph_network_shortest_path.ipynb) | 09.08.2024 |

| Image classification | This tutorial shows how to classify images of flowers | [Billy Lamberta](https://github.com/lamberta) | 

[](https://paperswithcode.com/task/image-classification)

[](https://www.tensorflow.org/tutorials/images/classification), [](https://www.tensorflow.org/api_docs/python/tf/keras/Sequential), [](https://www.tensorflow.org/api_docs/python/tf/keras/preprocessing/image_dataset_from_directory)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/tensorflow/docs/blob/master/site/en/tutorials/images/classification.ipynb) | 24.07.2024 |

| Kor | Half-baked prototype that "helps" you extract structured data from text using LLMs | [Eugene Yurtsev](https://eyurtsev.github.io/) | [![](https://img.shields.io/github/stars/eyurtsev/kor?style=social)](https://github.com/eyurtsev/kor) 

[](https://discord.com/channels/1038097195422978059/1170024642245832774)

[](https://eyurtsev.github.io/kor/)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/eyurtsev/kor/blob/main/docs/source/guidelines.ipynb) | 20.07.2024 |

| Mistral Inference | Minimal code to run Mistral models | [mistral](https://mistral.ai/) | [![](https://img.shields.io/github/stars/mistralai/mistral-inference?style=social)](https://github.com/mistralai/mistral-inference) 

[blog post](https://mistral.ai/news/announcing-mistral-7b/)

[](https://discord.com/invite/mistralai)

[](https://docs.mistral.ai/)

[](https://medium.com/@parikshitsaikia1619/mistral-mastery-fine-tuning-fast-inference-guide-62e163198b06)

[](https://pypi.org/project/mistral-inference/)

[](https://youtu.be/mYRqvB1_gRk)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/mistralai/mistral-inference/blob/main/tutorials/getting_started.ipynb) | 16.07.2024 |

| XLand-MiniGrid | Suite of tools and grid-world environments for meta-reinforcement learning research | 

[Alexander Nikulin](https://howuhh.github.io/)
 [Vladislav Kurenkov](https://github.com/vkurenkov)
 [Ilya Zisman](https://zis.mn/)
others[Artem Agarkov](https://github.com/agarkovv)
 [Viacheslav Sinii](https://github.com/ummagumm-a)
 [Sergey Kolesnikov](https://scitator.com/)

 | [![](https://img.shields.io/github/stars/dunnolab/xland-minigrid?style=social)](https://github.com/dunnolab/xland-minigrid) 

[](https://arxiv.org/abs/2312.12044)

[](https://deepmind.google/discover/blog/generally-capable-agents-emerge-from-open-ended-play/), [](https://sites.google.com/view/adaptive-agent/)

[](https://github.com/dunno-lab/xland-minigrid-datasets), [](https://github.com/Farama-Foundation/MiniGrid), [](https://github.com/MichaelTMatthews/Craftax)

[](https://huggingface.co/datasets/Howuhh/xland_minigrid)

[](https://neurips.cc/virtual/2023/77592)

[](https://pypi.org/project/xminigrid/)

[](https://twitter.com/vladkurenkov/status/1731709425524543550)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/corl-team/xland-minigrid/blob/main/examples/walkthrough.ipynb) | 12.07.2024 |

| PyTorch3D  | Library for deep learning with 3D data | 

[Nikhila Ravi](https://nikhilaravi.com/)
 [Jeremy Reizenstein](https://github.com/bottler)
 [David Novotny](https://d-novotny.github.io/)
others[Taylor Gordon](https://scholar.google.com/citations?user=CNOoeQ0AAAAJ)
 [Wan-Yen Lo](https://github.com/wanyenlo)
 [Justin Johnson](https://web.eecs.umich.edu/~justincj/)
 [Georgia Gkioxari](https://gkioxari.github.io/)

 | [![](https://img.shields.io/github/stars/facebookresearch/pytorch3d?style=social)](https://github.com/facebookresearch/pytorch3d) 

[](https://arxiv.org/abs/2007.08501), [](https://arxiv.org/abs/1906.02739)

[blog post](https://ai.meta.com/blog/implicitron-a-new-modular-extensible-framework-for-neural-implicit-representations-in-pytorch3d/), [blog post](https://ai.meta.com/blog/-introducing-pytorch3d-an-open-source-library-for-3d-deep-learning/)

[](https://pytorch3d.readthedocs.org/)

[](https://www.kaggle.com/code/sohonjit/rendering-with-pytorch3d)

[](https://towardsdatascience.com/glimpse-into-pytorch3d-an-open-source-3d-deep-learning-library-291a4beba30f), [](https://medium.com/@phamtdong0406/crafting-realistic-renderings-with-pytorch3d-947a38194f0a), [](https://towardsdatascience.com/how-to-render-3d-files-using-pytorch3d-ef9de72483f8)

[website](https://pytorch3d.org/)

[](https://youtu.be/0JEb7knenps), [](https://youtu.be/Pph1r-x9nyY), [](https://youtu.be/eCDBA_SbxCE), [](https://youtu.be/MOBAJb5nJRI), [](https://youtu.be/g50RiDnfIfY), [](https://youtu.be/hgBk9WlF-XA), [](https://youtu.be/Sb9gCCnSAUg), [](https://youtu.be/ZLqJ33Ey-MU)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/facebookresearch/pytorch3d/blob/master/docs/tutorials/implicitron_config_system.ipynb) | 11.07.2024 |

| Stable Diffusion Videos | Create videos with Stable Diffusion by exploring the latent space and morphing between text prompts | [Nathan Raw](https://github.com/nateraw) | [![](https://img.shields.io/github/stars/nateraw/stable-diffusion-videos?style=social)](https://github.com/nateraw/stable-diffusion-videos) [](https://gist.github.com/karpathy/00103b0037c5aaea32fe1da1af553355), [](https://gist.github.com/nateraw/c989468b74c616ebbc6474aa8cdd9e53) | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/nateraw/stable-diffusion-videos/blob/main/stable_diffusion_videos.ipynb) | 11.07.2024 |

| Presidio | Context aware, pluggable and customizable PII de-identification service for text and images | [Omri Mendels](https://github.com/omri374) | [![](https://img.shields.io/github/stars/microsoft/presidio?style=social)](https://github.com/microsoft/presidio) 

[](https://hub.docker.com/r/microsoft/presidio-analyzer), [](https://hub.docker.com/r/microsoft/presidio-anonymizer), [](https://hub.docker.com/r/microsoft/presidio-image-redactor)

[](https://microsoft.github.io/presidio/)

[](https://aka.ms/presidio-demo)

[](https://medium.com/@olegolego1997/text-anonymization-with-presidio-and-faker-be251f36d5bf), [](https://medium.com/@parasmadan.in/understanding-the-importance-of-microsoft-presidio-in-large-language-models-llms-12728b0f9c1c)

[](https://pypi.org/project/presidio-analyzer/), [](https://pypi.org/project/presidio-anonymizer/), [](https://pypi.org/project/presidio-image-redactor/), [](https://pypi.org/project/presidio-structured/)

[](https://youtu.be/oUAYZPY-tw8), [](https://youtu.be/7NrzPuICLtg), [](https://youtu.be/jewJIn5Ecfk), [](https://youtu.be/4GuK9sPUvus), [](https://youtu.be/qVWcidbrOHs)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/microsoft/presidio/blob/main/docs/samples/python/customizing_presidio_analyzer.ipynb) | 10.07.2024 |

| Transfer learning and fine-tuning | You will learn how to classify images of cats and dogs by using transfer learning from a pre-trained network | [François Chollet](https://fchollet.com/) | 

[](https://paperswithcode.com/task/transfer-learning)

[](https://www.tensorflow.org/tutorials/images/transfer_learning)

[](https://en.wikipedia.org/wiki/Transfer_learning)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/tensorflow/docs/blob/master/site/en/tutorials/images/transfer_learning.ipynb) | 26.06.2024 |

| MARS5 | Speech model for insane prosody | [Matthew Baas](https://rf5.github.io/) | [![](https://img.shields.io/github/stars/Camb-ai/MARS5-TTS?style=social)](https://github.com/Camb-ai/MARS5-TTS) 

[demo](https://6b1a3a8e53ae.ngrok.app/)

[](https://discord.gg/FFQNCSKSXX)

[](https://hub.docker.com/r/cambai/mars5ttsimage)

[](https://docs.camb.ai/)

[](https://github.com/RF5/transfusion-asr), [](https://github.com/ehoogeboom/multinomial_diffusion), [](https://github.com/karpathy/minbpe)

[](https://huggingface.co/CAMB-AI/MARS5-TTS)

[](https://youtu.be/bmJSLPYrKtE)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/Camb-ai/mars5-tts/blob/master/mars5_demo.ipynb) | 25.06.2024 |

| ToonCrafter | Can interpolate two cartoon images by leveraging the pre-trained image-to-video diffusion priors | 

[Jinbo Xing](https://doubiiu.github.io/)
 [Hanyuan Liu](https://github.com/hyliu)
 [Menghan Xia](https://menghanxia.github.io/)
others[Yong Zhang](https://yzhang2016.github.io/)
 [Xintao Wang](https://xinntao.github.io/)
 [Ying Shan](https://scholar.google.com/citations?user=4oXBp9UAAAAJ)
 [Tien-Tsin Wong](https://ttwong12.github.io/myself.html)

 | [![](https://img.shields.io/github/stars/ToonCrafter/ToonCrafter?style=social)](https://github.com/ToonCrafter/ToonCrafter) 

[](https://arxiv.org/abs/2405.17933v1)

[project](https://doubiiu.github.io/projects/ToonCrafter/)

[](https://www.reddit.com/r/StableDiffusion/comments/1d470rv/tooncrafter_generative_cartoon_interpolation/)

[](https://youtu.be/u3F35do93_8), [](https://youtu.be/E89R5_hQ5bQ), [](https://youtu.be/kK-A9jOaO1U), [](https://youtu.be/ricylysRayw), [](https://youtu.be/hc5nF6rGa68), [](https://youtu.be/mEn3CYU7s_A)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/gist/0smboy/baef995b8f5974f19ac114ec20ac37d5/tooncrafter.ipynb) | 20.06.2024 |

| DiffSynth | Restructured architectures including Text Encoder, UNet, VAE, among others, maintaining compatibility with models from the open-source community while enhancing computational performance | [Artiprocher](https://github.com/Artiprocher) | [![](https://img.shields.io/github/stars/Artiprocher/DiffSynth-Studio?style=social)](https://github.com/Artiprocher/DiffSynth-Studio) 

[](https://arxiv.org/abs/2401.16224)

[](https://huggingface.co/Helsinki-NLP/opus-mt-en-zh), [](https://huggingface.co/alibaba-pai/pai-bloom-1b1-text2prompt-sd)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/Artiprocher/DiffSynth-Studio/blob/main/examples/Diffutoon.ipynb) | 06.06.2024 |

| Transformer | This tutorial trains a Transformer model to translate Portuguese to English | 

[Ashish Vaswani](https://en.wikipedia.org/wiki/Ashish_Vaswani)
 [Noam Shazeer](https://en.wikipedia.org/wiki/Noam_Shazeer)
 [Niki Parmar](https://scholar.google.com/citations?user=q2YXPSgAAAAJ)
others[Jakob Uszkoreit](http://jakob.uszkoreit.net/)
 [Llion Jones](https://scholar.google.com/citations?user=_3_P5VwAAAAJ)
 [Aidan Gomez](https://aidangomez.ca/)
 [Łukasz Kaiser](https://scholar.google.com/citations?user=JWmiQR0AAAAJ)
 [Illia Polosukhin](https://scholar.google.com/citations?user=3SyxFIAAAAAJ)

 | [![](https://img.shields.io/github/stars/neulab/word-embeddings-for-nmt?style=social)](https://github.com/neulab/word-embeddings-for-nmt) 

[](https://arxiv.org/abs/1706.03762), [](https://arxiv.org/abs/1903.03878)

[link](https://deepmind.com/blog/article/alphastar-mastering-real-time-strategy-game-starcraft-ii)

[](https://papers.nips.cc/paper/7181-attention-is-all-you-need)

[](https://www.tensorflow.org/text/tutorials/transformer)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/tensorflow/text/blob/master/docs/tutorials/transformer.ipynb) | 31.05.2024 |

| NeMo | A conversational AI toolkit built for researchers working on automatic speech recognition, natural language processing, and text-to-speech synthesis | 

[Oleksii Kuchaiev](http://kuchaev.com/)
 [Jason Li](https://scholar.google.com/citations?user=V28bxDwAAAAJ)
 [Chip Huyen](https://huyenchip.com/)
others[Oleksii Hrinchuk](https://github.com/AlexGrinch)
 [Ryan Leary](https://github.com/ryanleary)
 [Boris Ginsburg](https://github.com/borisgin)
 [Samuel Kriman](https://github.com/sam1373)
 [Stanislav Beliaev](https://github.com/stasbel)
 [Vitaly Lavrukhin](https://github.com/vsl9)
 [Jack Cook](https://jackcook.com/)

 | [![](https://img.shields.io/github/stars/NVIDIA/NeMo?style=social)](https://github.com/NVIDIA/NeMo) 

[project](https://docs.nvidia.com/deeplearning/nemo/)

[](https://youtu.be/wBgpMf_KQVw)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/NVIDIA/NeMo/blob/master/tutorials/00_NeMo_Primer.ipynb) | 25.05.2024 |

| SentencePiece | An unsupervised text tokenizer and detokenizer mainly for Neural Network-based text generation systems where the vocabulary size is predetermined prior to the neural model training | 

[Taku Kudo](http://chasen.org/~taku/)
 [John Richardson](https://scholar.google.com/citations?user=PEvmYfgAAAAJ)

 | [![](https://img.shields.io/github/stars/google/sentencepiece?style=social)](https://github.com/google/sentencepiece) 

[](https://arxiv.org/abs/1808.06226), [](https://arxiv.org/abs/1508.07909), [](https://arxiv.org/abs/1804.10959), [](https://arxiv.org/abs/1910.13267), [](https://arxiv.org/abs/1609.08144)

[](https://github.com/moses-smt/mosesdecoder/blob/master/scripts/tokenizer/tokenizer.perl), [](https://github.com/rsennrich/subword-nmt), [](https://github.com/gperftools/gperftools), [](https://github.com/Microsoft/vcpkg)

[](https://jacky2wong.medium.com/understanding-sentencepiece-under-standing-sentence-piece-ac8da59f6b08)

[](https://youtu.be/U51ranzJBpY)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/google/sentencepiece/blob/master/python/sentencepiece_python_module_example.ipynb) | 21.05.2024 |

| Llama3 from scratch | Llama3 from scratch, one tensor and matrix multiplication at a time | [Nishant Aklecha](https://www.naklecha.com/) | [![](https://img.shields.io/github/stars/naklecha/llama3-from-scratch?style=social)](https://github.com/naklecha/llama3-from-scratch) 

[](https://github.com/karpathy/minbpe)

[](https://twitter.com/naklecha), [](https://twitter.com/aaaaaaaaaaorg)

[](https://youtu.be/o29P0Kpobz0?t=530)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/naklecha/llama3-from-scratch/blob/main/llama3-from-scratch.ipynb) | 19.05.2024 |

| IC-Light | Manipulate the illumination of images | 

[Lvmin Zhang](https://github.com/lllyasviel)
 [Anyi Rao](https://anyirao.com/)
 [Maneesh Agrawala](https://graphics.stanford.edu/~maneesh/)

 | [![](https://img.shields.io/github/stars/lllyasviel/IC-Light?style=social)](https://github.com/lllyasviel/IC-Light) 

[](https://arxiv.org/abs/2312.06886), [](https://arxiv.org/abs/2402.18848)

[](https://youtu.be/U_ZIkFb9P8w), [](https://youtu.be/3EsJrdXGnpo), [](https://youtu.be/BuSsw8Nv1N4)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/camenduru/IC-Light-jupyter/blob/main/IC_Light_jupyter.ipynb) | 09.05.2024 |

| Neural style transfer | This tutorial uses deep learning to compose one image in the style of another image | 

[Leon Gatys](https://scholar.google.com/citations?user=ADMVEmsAAAAJ)
 [Alexander Ecker](https://eckerlab.org/)
 [Matthias Bethge](https://bethgelab.org/)

 | [](https://arxiv.org/abs/1508.06576) | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/tensorflow/docs/blob/master/site/en/tutorials/generative/style_transfer.ipynb) | 06.05.2024 |

| Autoencoders | This tutorial introduces autoencoders with three examples: the basics, image denoising, and anomaly detection | [Billy Lamberta](https://github.com/lamberta) | 

[blog post](https://blog.keras.io/building-autoencoders-in-keras.html)

[book](https://www.deeplearningbook.org/contents/autoencoders.html)

[data](http://www.timeseriesclassification.com/description.php?Dataset=ECG5000)

[examples](https://anomagram.fastforwardlabs.com/#/)

[](https://paperswithcode.com/method/autoencoder)

[](https://www.tensorflow.org/tutorials/generative/autoencoder)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/tensorflow/docs/blob/master/site/en/tutorials/generative/autoencoder.ipynb) | 15.04.2024 |

| MagicTime | Metamorphic time-lapse video generation model, which learns real-world physics knowledge from time-lapse videos and implements metamorphic generation | 

[Shenghai Yuan](https://shyuanbest.github.io/)
 [Jinfa Huang](https://infaaa.github.io/)
 [Yujun Shi](https://yujun-shi.github.io/)
others[Yongqi Xu](https://cheliosoops.github.io/YongqiXu.io/)
 [Ruijie Zhu](https://ruijie-zhu.github.io/)
 [Bin Lin](https://github.com/LinB203)
 [Xinhua Cheng](https://cxh0519.github.io/)
 [Li Yuan](https://yuanli2333.github.io/)
 [Jiebo Luo](https://www.cs.rochester.edu/u/jluo/)

 | [![](https://img.shields.io/github/stars/PKU-YuanGroup/MagicTime?style=social)](https://github.com/PKU-YuanGroup/MagicTime) 

[](https://arxiv.org/abs/2404.05014), [](https://arxiv.org/abs/2406.18522)

[](https://github.com/PKU-YuanGroup/ChronoMagic-Bench), [](https://github.com/kijai/ComfyUI-MagicTimeWrapper), [](https://github.com/xuduo35/MakeLongVideo), [](https://github.com/Vchitect/LaVie), [](https://github.com/Vchitect/Latte)

[](https://huggingface.co/spaces/BestWishYsh/MagicTime?logs=build), [](https://huggingface.co/datasets/BestWishYsh/ChronoMagic), [](https://huggingface.co/cerspense/zeroscope_v2_576w)

[project](https://pku-yuangroup.github.io/MagicTime/)

[](https://www.reddit.com/r/StableDiffusion/comments/1c1rv7q/magictime_demo_timelapse_video_generation_models/)

[](https://x.com/_akhaliq/status/1777538468043792473), [](https://twitter.com/vhjf36495872/status/1777525817087553827?s=61&t=r2HzCsU2AnJKbR8yKSprKw)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/camenduru/MagicTime-jupyter/blob/main/MagicTime_jupyter.ipynb) | 14.04.2024 |

| SAGE | Methodology for generative spelling correction, which was tested on English and Russian languages and potentially can be extended to any language with minor changes | 

[Nikita Martynov](https://github.com/meduzick)
 [Mark Baushenko](https://github.com/e0xextazy)
 [Anastasia Kozlova](https://github.com/anastasia-kozlova)
others[Katerina Kolomeytseva](https://www.linkedin.com/in/katerina-kolomeytseva-394a7a21a)
 [Aleksandr Abramov](https://github.com/Ab1992ao)
 [Alena Fenogenova](https://github.com/Alenush)

 | [![](https://img.shields.io/github/stars/ai-forever/sage?style=social)](https://github.com/ai-forever/sage) 

[](https://arxiv.org/abs/2308.09435)

[](https://github.com/ai-forever/augmentex)

[](https://huggingface.co/ai-forever/RuM2M100-1.2B), [](https://huggingface.co/ai-forever/FRED-T5-large-spell), [](https://huggingface.co/ai-forever/RuM2M100-418M), [](https://huggingface.co/ai-forever/T5-large-spell), [](https://huggingface.co/datasets/ai-forever/spellcheck_benchmark)

[](https://en.wikipedia.org/wiki/Levenshtein_distance)

[](https://youtu.be/yFfkV0Qjuu0)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/ai-forever/sage/blob/main/notebooks/text_correction_demo.ipynb) | 11.04.2024 |

| Open-Sora Plan | Simple and efficient design along with remarkable performance in text-to-video generation | [YUAN Lab at PKU](https://github.com/PKU-YuanGroup) | [![](https://img.shields.io/github/stars/PKU-YuanGroup/Open-Sora-Plan?style=social)](https://github.com/PKU-YuanGroup/Open-Sora-Plan) 

[](https://arxiv.org/abs/2306.15595)

[](https://discord.gg/YtsBNg7n)

[](https://github.com/PKU-YuanGroup/Open-Sora-Dataset), [](https://github.com/Vchitect/Latte), [](https://github.com/whlzy/FiT)

[](https://huggingface.co/spaces/LanguageBind/Open-Sora-Plan-v1.1.0), [](https://huggingface.co/datasets/LanguageBind/Open-Sora-Plan-v1.0.0)

[](https://youtu.be/cRUz3c7hRs4), [](https://youtu.be/mYnRwR0RyvE)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/camenduru/Open-Sora-Plan-jupyter/blob/main/Open_Sora_Plan_jupyter.ipynb) | 07.04.2024 |

| AniPortrait | Framework for generating high-quality animation driven by audio and a reference portrait image | 

[Zejun Yang](https://github.com/Zejun-Yang)
 [Zhisheng Wang](https://scholar.google.com/citations?user=XrK2HNcAAAAJ)

 | [![](https://img.shields.io/github/stars/Zejun-Yang/AniPortrait?style=social)](https://github.com/Zejun-Yang/AniPortrait) 

[](https://arxiv.org/abs/2403.17694)

[](https://github.com/CelebV-HQ/CelebV-HQ), [](https://github.com/HumanAIGC/EMO), [](https://github.com/MooreThreads/Moore-AnimateAnyone), [](https://github.com/magic-research/magic-animate), [](https://github.com/guoqincode/Open-AnimateAnyone)

[](https://huggingface.co/ZJYang/AniPortrait), [](https://huggingface.co/runwayml/stable-diffusion-v1-5), [](https://huggingface.co/stabilityai/sd-vae-ft-mse), [](https://huggingface.co/lambdalabs/sd-image-variations-diffusers/tree/main/image_encoder), [](https://huggingface.co/facebook/wav2vec2-base-960h)

[](https://www.reddit.com/r/StableDiffusion/comments/1bp7rnj/aniportrait_audiodriven_synthesis_of/)

[](https://youtu.be/wdRhYLQFQH8), [](https://youtu.be/T-B6xJRG6fQ)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/camenduru/AniPortrait-jupyter/blob/main/AniPortrait_vid2vid_jupyter.ipynb) | 27.03.2024 |

| OpenVINO | Open-source toolkit for optimizing and deploying AI inference | [intel](https://www.intel.com/content/www/us/en/developer/topic-technology/open/overview.html) | [![](https://img.shields.io/github/stars/openvinotoolkit/openvino?style=social)](https://github.com/openvinotoolkit/openvino) 

[blog post](https://blog.openvino.ai/)

[](https://discord.gg/7pVRxUwdWG)

[](https://docs.openvino.ai/)

[forum](https://software.intel.com/en-us/forums/computer-vision)

[](https://github.com/openvinotoolkit/open_model_zoo), [](https://github.com/Tencent/TNN), [](https://github.com/openvinotoolkit/openvino_contrib), [](https://github.com/openvinotoolkit/training_extensions), [](https://github.com/openvinotoolkit/model_server), [](https://github.com/opencv/cvat), [](https://github.com/openvinotoolkit/datumaro)

[](https://huggingface.co/OpenVINO)

[](https://medium.com/@openvino), [](https://medium.com/openvino-toolkit)

[](https://en.wikipedia.org/wiki/OpenVINO)

[](https://www.youtube.com/playlist?list=PLg-UKERBljNxdIQir1wrirZJ50yTp4eHv), [](https://youtu.be/Je8n8M0OwxQ), [](https://youtu.be/Ru51DELfc-Q), [](https://youtu.be/5X0RmlH6JI4), [](https://youtu.be/hhVRSLbpI5Q), [](https://youtu.be/JH8fsEAIaXo), [](https://www.youtube.com/playlist?list=PLWw98q-Xe7iH06qxEW5a22SBsSNsGnYjZ)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/openvinotoolkit/openvino_notebooks/blob/main/notebooks/001-hello-world/001-hello-world.ipynb) | 25.03.2024 |

| Google Generative AI | Documentation for Google's Gen AI site - including the Gemini API and Gemma | [Google](https://ai.google.dev/) | [![](https://img.shields.io/github/stars/google/generative-ai-docs?style=social)](https://github.com/google/generative-ai-docs) 

[](https://ai.google.dev/gemini-api/docs), [](https://ai.google.dev/gemma/docs)

[](https://www.kaggle.com/datasets/bhavikjikadara/google-generative-ai-documentation)

[](https://pypi.org/project/google-generativeai/)

[website](https://ai.google.dev/)

[](https://www.youtube.com/playlist?list=PLIivdWyY5sqLRCzKJyixrIDPQKwU6XHpn), [](https://www.youtube.com/playlist?list=PLBgogxgQVM9sl-KnKywVEhkb3QtLHU4OK), [](https://youtu.be/G2fqAlgmoPo)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/google/generative-ai-docs/blob/main/site/en/tutorials/quickstart_colab.ipynb) | 22.03.2024 |

| Gazelle | Joint Speech Language Model | [Chris Hua](https://github.com/stillmatic) | [![](https://img.shields.io/github/stars/tincans-ai/gazelle?style=social)](https://github.com/tincans-ai/gazelle) 

[blog post](https://tincans.ai/slm)

[demo](https://demo.tincans.ai/)

[](https://discord.gg/qyC5h3FSzU)

[](https://www.reddit.com/r/LocalLLaMA/comments/1cr84gb/joint_speechlanguage_model_respond_directly_to/)

[](https://huggingface.co/tincans-ai/gazelle-v0.1), [](https://huggingface.co/tincans-ai/gazelle-v0.2), [](https://huggingface.co/tincans-ai/gazelle-v0.2-dpo), [](https://huggingface.co/facebook/wav2vec2-base-960h), [](https://huggingface.co/meta-llama/Llama-2-7b-chat)

[](https://en.wikipedia.org/wiki/Spike_/(software_development))

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/tincans-ai/gazelle/blob/main/examples/infer_quantized.ipynb) | 20.03.2024 |

| Intel® Extension for Transformers | Transformer-based Toolkit to Accelerate GenAI/LLM Everywhere | [intel](https://www.intel.com/content/www/us/en/developer/topic-technology/open/overview.html) | [![](https://img.shields.io/github/stars/intel/intel-extension-for-transformers?style=social)](https://github.com/intel/intel-extension-for-transformers) 

[](https://arxiv.org/abs/2309.17453), [](https://arxiv.org/abs/2311.00502), [](https://arxiv.org/abs/2211.07715), [](https://arxiv.org/abs/2210.17114), [](https://arxiv.org/abs/2111.05754)

[](https://discord.gg/Wxk3J3ZJkU)

[](https://intel.github.io/intel-extension-for-transformers/latest/docs/Welcome.html)

[](https://github.com/ggerganov/ggml), [](https://github.com/ggerganov/llama.cpp), [](https://github.com/TimDettmers/bitsandbytes), [](https://github.com/IntelLabs/fastRAG), [](https://github.com/IST-DASLab/gptq), [](https://github.com/mit-han-lab/streaming-llm)

[](https://huggingface.co/blog/assisted-generation), [](https://huggingface.co/Intel/neural-chat-7b-v3-1), [](https://huggingface.co/blog/Andyrasika/neural-chat-intel)

[](https://medium.com/@NeuralCompressor/creating-your-own-llms-on-your-laptop-a08cc4f7c91b), [](https://medium.com/@NeuralCompressor/the-practice-of-supervised-finetuning-and-direct-preference-optimization-on-habana-gaudi2-a1197d8a3cd3), [](https://medium.com/@NeuralCompressor/llm-performance-of-intel-extension-for-transformers-f7d061556176), [](https://medium.com/@NeuralCompressor/high-performance-low-bit-layer-wise-weight-only-quantization-on-a-laptop-712580899396), [](https://medium.com/intel-analytics-software/reduce-large-language-model-carbon-footprint-with-intel-neural-compressor-and-intel-extension-for-dfadec3af76a)

[](https://youtu.be/bWhZ1u_1rlc), [](https://www.youtube.com/watch?v=RbKRELWP9y8&t=2954s), [](https://youtu.be/7_urstS-noU), [](https://youtu.be/bWhZ1u_1rlc), [](https://youtu.be/KWT6yKfu4n0)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/intel/intel-extension-for-transformers/blob/main/docs/tutorials/pytorch/text-classification/SetFit_model_compression_AGNews.ipynb) | 19.03.2024 |

| Instructor | Library that makes it a breeze to work with structured outputs from large language models | [Jason Liu](https://jxnl.co/) | [![](https://img.shields.io/github/stars/jxnl/instructor?style=social)](https://github.com/jxnl/instructor) 

[](https://discord.gg/CV8sPM5k5Y)

[](https://python.useinstructor.com/)

[](https://twitter.com/jxnlco)

[](https://youtu.be/rDP44EVpHTA), [](https://youtu.be/dq1Sjb8IGow), [](https://youtu.be/higlHgYDc5E)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1iBkrEh2G5U8yh8RmI8EkWxjLq6zIIuVm) | 13.03.2024 |

| MetaVoice | 1.2B parameter base model trained on 100K hours of speech for TTS | [MetaVoice](https://themetavoice.xyz/) | [![](https://img.shields.io/github/stars/metavoiceio/metavoice-src?style=social)](https://github.com/metavoiceio/metavoice-src) 

[demo](https://ttsdemo.themetavoice.xyz/)

[](https://huggingface.co/metavoiceio)

[](https://twitter.com/MetaVoiceAI)

[](https://youtu.be/Y_k3bHPcPTo), [](https://youtu.be/gVKbf31hrYs)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1UmjE1mzfG4td0rCjJEaAWGQXpn_GuwwY) | 26.02.2024 |

| OmegaConf | Hierarchical configuration system, with support for merging configurations from multiple sources providing a consistent API regardless of how the configuration was created | [Omry Yadan](https://github.com/omry) | [![](https://img.shields.io/github/stars/omry/omegaconf?style=social)](https://github.com/omry/omegaconf) 

[](https://omegaconf.readthedocs.io/)

[](https://majianglin2003.medium.com/python-omegaconf-a33be1b748ab)

[slides](https://docs.google.com/presentation/d/e/2PACX-1vT_UIV7hCnquIbLUm4NnkUpXvPEh33IKiUEvPRF850WKA8opOlZOszjKdZ3tPmf8u7hGNP6HpqS-NT5/pub)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/omry/omegaconf/blob/master/docs/notebook/Tutorial.ipynb) | 15.02.2024 |

| Optuna | An automatic hyperparameter optimization software framework, particularly designed for machine learning | 

[Takuya Akiba](https://iwiwi.github.io/)
 [Shotaro Sano](https://github.com/g-votte)
 [Toshihiko Yanase](https://github.com/toshihikoyanase)
others[Takeru Ohta](https://github.com/sile)
 [Masanori Koyama](https://scholar.google.com/citations?user=oY1gA10AAAAJ)

 | [![](https://img.shields.io/github/stars/optuna/optuna?style=social)](https://github.com/optuna/optuna) 

[](https://arxiv.org/abs/1907.10902)

[](https://hub.docker.com/r/optuna/optuna)

[](https://optuna.readthedocs.io/en/stable/)

[](https://github.com/optuna/optuna-dashboard)

[website](https://optuna.org/)

[](https://youtu.be/J_aymk4YXhg), [](https://youtu.be/tcrcLRopTX0), [](https://youtu.be/-UeC4MR3PHM), [](https://youtu.be/oC8zFYcfYXU)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/optuna/optuna-examples/blob/main/quickstart.ipynb) | 15.02.2024 |

| Data augmentation | This tutorial demonstrates data augmentation: a technique to increase the diversity of your training set by applying random transformations such as image rotation | [Billy Lamberta](https://github.com/lamberta) | 

[](https://paperswithcode.com/task/data-augmentation)

[](https://www.tensorflow.org/datasets/catalog/tf_flowers), [](https://www.tensorflow.org/tutorials/images/data_augmentation)

[](https://en.wikipedia.org/wiki/Data_augmentation)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/tensorflow/docs/blob/master/site/en/tutorials/images/data_augmentation.ipynb) | 14.02.2024 |

| Stable Cascade | Text to image model introduces an interesting three-stage approach, setting new benchmarks for quality, flexibility, fine-tuning, and efficiency with a focus on further eliminating hardware barriers | [Stability AI](https://stability.ai/research) | [![](https://img.shields.io/github/stars/Stability-AI/StableCascade?style=social)](https://github.com/Stability-AI/StableCascade) 

[](https://arxiv.org/abs/2306.00637)

[blog post](https://stability.ai/news/introducing-stable-cascade)

[](https://discord.gg/stablediffusion)

[](https://huggingface.co/stabilityai/stable-cascade), [](https://huggingface.co/datasets/nateraw/parti-prompts)

[](https://medium.com/intelligent-art/stable-cascade-a-super-easy-local-installation-guide-ce0cbd06d800), [](https://medium.com/@yushantripleseven/stable-cascade-training-inference-a52e12ecc5fa)

[](https://twitter.com/stabilityai)

[](https://youtu.be/Ybu6qTbEsewc), [](https://youtu.be/JuX-uukwdkI), [](https://youtu.be/YMxXtaiVHks), [](https://youtu.be/UgM-z2q3Xe0), [](https://youtu.be/W6YLIyA3Kco), [](https://youtu.be/X1rLWFRagIw)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/mkshing/notebooks/blob/main/stable_cascade.ipynb) | 14.02.2024 |

| CleanVision | Automatically detects potential issues in image datasets like images that are: blurry, under/over-exposed, (near) duplicates, etc | [Sanjana](https://github.com/sanjanag) | [![](https://img.shields.io/github/stars/cleanlab/cleanvision?style=social)](https://github.com/cleanlab/cleanvision) 

[blog post](https://cleanlab.ai/blog/cleanvision/)

[](https://cleanvision.readthedocs.io/)

[](https://github.com/cleanlab/cleanvision-examples)

[](https://pypi.org/project/cleanvision/)

[](https://cleanlab.ai/slack)

[](https://twitter.com/CleanlabAI)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/cleanlab/cleanvision/blob/main/docs/source/tutorials/tutorial.ipynb) | 13.02.2024 |

| DynamiCrafter | Animating Open-domain Images with Video Diffusion Priors | 

[Jinbo Xing](https://doubiiu.github.io/)
 [Menghan Xia](https://menghanxia.github.io/)
 [Yong Zhang](https://yzhang2016.github.io/)
others[Haoxin Chen](https://scutpaul.github.io/)
 [Wangbo Yu](https://github.com/GooDrYu)
 [Hanyuan Liu](https://github.com/hyliu)
 [Xintao Wang](https://xinntao.github.io/)
 [Tien-Tsin Wong](https://ttwong12.github.io/myself.html)
 [Ying Shan](https://scholar.google.com/citations?user=4oXBp9UAAAAJ)

 | [![](https://img.shields.io/github/stars/Doubiiu/DynamiCrafter?style=social)](https://github.com/Doubiiu/DynamiCrafter) 

[](https://arxiv.org/abs/2310.12190)

[](https://github.com/chaojie/ComfyUI-DynamiCrafter), [](https://github.com/AILab-CVC/VideoCrafter), [](https://github.com/YingqingHe/ScaleCrafter), [](https://github.com/AILab-CVC/TaleCrafter), [](https://github.com/AILab-CVC/FreeNoise)

[](https://huggingface.co/Doubiiu/DynamiCrafter_1024)

[project](https://doubiiu.github.io/projects/DynamiCrafter/)

[](https://www.reddit.com/r/StableDiffusion/comments/1aj7gcw/dynamicrafter_gets_updated/)

[](https://x.com/noguchis/status/1754488826016432341?s=20)

[](https://youtu.be/0NfmIsNAg-g), [](https://youtu.be/PtW7hjCawbo)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/camenduru/DynamiCrafter-colab/blob/main/DynamiCrafter_colab_576_1024.ipynb) | 12.02.2024 |

| XLA | Accelerated Linear Algebra is an open-source machine learning compiler for GPUs, CPUs, and ML accelerators | [George Karpenkov](https://metaworld.me/) | [![](https://img.shields.io/github/stars/openxla/xla?style=social)](https://github.com/openxla/xla) 

[](https://medium.com/@muhammedashraf2661/demystifying-xla-unlocking-the-power-of-accelerated-linear-algebra-9b62f8180dbd), [](https://runaker.medium.com/one-code-to-rule-them-all-simplifying-ai-development-with-hardware-agnostic-abstraction-layers-a61448bb6d22)

[](https://pytorch.org/xla)

[](https://www.tensorflow.org/xla)

[](https://en.wikipedia.org/wiki/Accelerated_Linear_Algebra)

[](https://www.youtube.com/playlist?list=PLlFotmaRrOzs23kqlSF-r8v1dJHz5GxZs), [](https://www.youtube.com/playlist?list=PLlFotmaRrOzu8TQsTahDo_Cn7QdntFlUL), [](https://www.youtube.com/playlist?list=PLlFotmaRrOzt8xOwckcXL7vObZmr8PK1y), [](https://youtu.be/QNSxFXJ-xMM)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/openxla/xla/blob/main/docs/tf2xla/tutorials/jit_compile.ipynb) | 02.02.2024 |

| Composer | PyTorch library that enables you to train neural networks faster, at lower cost, and to higher accuracy | [The Mosaic ML Team](https://www.mosaicml.com/team) | [![](https://img.shields.io/github/stars/mosaicml/composer?style=social)](https://github.com/mosaicml/composer) 

[app](https://app.mosaicml.com/)

[](https://arxiv.org/abs/2202.05924), [](https://arxiv.org/abs/2002.04688)

[blog post](https://www.mosaicml.com/blog/5-best-practices-for-efficient-model-training)

[](http://docs.mosaicml.com/)

[](https://pypi.org/project/composer/)

[](https://join.slack.com/t/mosaicml-community/shared_invite/zt-w0tiddn9-WGTlRpfjcO9J5jyrMub1dg)

[](https://twitter.com/mosaicml)

[website](https://www.mosaicml.com/composer)

[](https://en.wikipedia.org/wiki/Amdahl's_law)

[](https://www.youtube.com/@mosaicml6047/videos), [](https://youtu.be/n-1WV5QdIDc), [](https://youtu.be/Xi_5wq2MpOw)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/mosaicml/composer/blob/dev/examples/getting_started.ipynb) | 01.02.2024 |

| Transformer Engine | Library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point precision on Hopper, Ada, and Blackwell GPUs, to provide better performance with lower memory utilization in both training and inference | 

[Przemyslaw Tredak](https://github.com/ptrendx)
 [Kirthi Shankar Sivamani](https://ksivaman.github.io/)

 | [![](https://img.shields.io/github/stars/NVIDIA/TransformerEngine?style=social)](https://github.com/NVIDIA/TransformerEngine) 

[](https://arxiv.org/abs/1909.08053), [](https://arxiv.org/abs/2205.05198), [](https://arxiv.org/abs/2209.05433)

[blog post](https://developer.nvidia.com/blog/new-nvidia-nemo-framework-features-and-nvidia-h200-supercharge-llm-training-performance-and-versatility/), [blog post](https://www.databricks.com/blog/turbocharged-training-optimizing-databricks-mosaic-ai-stack-fp8)

[](https://docs.nvidia.com/deeplearning/transformer-engine/)

[](https://github.com/aws/deep-learning-containers/pull/3315), [](https://github.com/Dao-AILab/flash-attention/issues/358), [](https://github.com/NVIDIA/JAX-Toolbox/tree/main/rosetta/rosetta/projects), [](https://github.com/Lightning-AI/pytorch-lightning/issues/17172), [](https://github.com/NVIDIA/Megatron-LM), [](https://github.com/NVIDIA/NeMo-Framework-Launcher), [](https://github.com/stanford-crfm/levanter), [](https://github.com/EleutherAI/gpt-neox), [](https://github.com/huggingface/nanotron), [](https://github.com/hpcaitech/ColossalAI)

[](https://towardsdatascience.com/accelerating-pytorch-training-workloads-with-fp8-5a5123aec7d7/)

[](https://papers.nips.cc/paper_files/paper/2017/hash/3f5ee243547dee91fbd053c1c4a845aa-Abstract.html)

[](https://pypi.org/project/transformer-engine/)

[video](https://www.nvidia.com/en-us/on-demand/session/gtc24-s62457/)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/NVIDIA/TransformerEngine/blob/main/docs/examples/quickstart.ipynb) | 19.01.2024 |

| Integrated gradients | This tutorial demonstrates how to implement Integrated Gradients, an Explainable AI technique | 

[Mukund Sundararajan](https://scholar.google.com/citations?user=q39nzokAAAAJ)
 [Ankur Taly](https://theory.stanford.edu/~ataly/)
 [Qiqi Yan](https://scholar.google.com/citations?user=Wn8xr_gAAAAJ)

 | [![](https://img.shields.io/github/stars/ankurtaly/Integrated-Gradients?style=social)](https://github.com/ankurtaly/Integrated-Gradients) 

[](https://arxiv.org/abs/1703.01365)

[](https://github.com/GoogleCloudPlatform/training-data-analyst/tree/master/blogs/integrated_gradients)

[](https://medium.com/codex/explainable-ai-integrated-gradients-for-deep-neural-network-predictions-eb4f96248afb), [](https://towardsdatascience.com/understanding-deep-learning-models-with-integrated-gradients-24ddce643dbf)

[](https://www.tensorflow.org/tutorials/interpretability/integrated_gradients)

[visualizing](https://distill.pub/2020/attribution-baselines/)

[](https://en.wikipedia.org/wiki/Explainable_artificial_intelligence), [](https://en.wikipedia.org/wiki/Linear_interpolation), [](https://en.wikipedia.org/wiki/Riemann_sum)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/tensorflow/docs/blob/master/site/en/tutorials/interpretability/integrated_gradients.ipynb) | 17.01.2024 |

| MAGNeT | Masked generative sequence modeling method that operates directly over several streams of audio tokens | 

[Alon Ziv](https://www.cs.huji.ac.il/w~alonzi/)
 [Itai Gat](https://itaigat.com/)
 [Gaël Le Lan](https://github.com/gl3lan)
others[Tal Remez](https://talremez.github.io/)
 [Felix Kreuk](https://felixkreuk.github.io/)
 [Alexandre Défossez](https://ai.honu.io/)
 [Jade Copet](https://scholar.google.com/citations?&user=GRMLwjAAAAAJ)
 [Gabriel Synnaeve](https://syhw.github.io/)
 [Yossi Adi](https://www.cs.huji.ac.il/~adiyoss/)

 | [![](https://img.shields.io/github/stars/facebookresearch/audiocraft?style=social)](https://github.com/facebookresearch/audiocraft/blob/main/docs/MAGNET.md) 

[](https://arxiv.org/abs/2401.04577), [](https://arxiv.org/abs/2305.09636), [](https://arxiv.org/abs/2307.04686)

[](https://github.com/FurkanGozukara/Stable-Diffusion/blob/main/Tutorials/AI-Music-Generation-Audiocraft-Tutorial.md#more-info-about-top-k-top-p-temperature-and-classifier-free-guidance-from-chatgpt)

[](https://huggingface.co/facebook/magnet-medium-10secs), [](https://huggingface.co/facebook/magnet-medium-30secs), [](https://huggingface.co/facebook/audio-magnet-medium)

[](https://generativeai.pub/metas-ai-magnet-the-next-big-thing-in-text-to-audio-technology-7d524d9459ef)

[project](https://pages.cs.huji.ac.il/adiyoss-lab/MAGNeT/)

[](https://www.reddit.com/r/ArtificialInteligence/comments/19808gf/magnet_masked_audio_generation_using_a_single/)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/camenduru/MAGNeT-colab/blob/main/MAGNET_colab.ipynb) | 16.01.2024 |

| AutoFaiss | Automatically create Faiss knn indices with the most optimal similarity search parameters | [Romain Beaumont](https://github.com/rom1504) | [![](https://img.shields.io/github/stars/criteo/autofaiss?style=social)](https://github.com/criteo/autofaiss) 

[](https://criteo.github.io/autofaiss/)

[](https://github.com/facebookresearch/faiss)

[](https://medium.com/criteo-engineering/introducing-autofaiss-an-automatic-k-nearest-neighbor-indexing-library-at-scale-c90842005a11)

[](https://pypi.python.org/pypi/autofaiss)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/criteo/autofaiss/blob/master/docs/notebooks/autofaiss_getting_started.ipynb) | 12.01.2024 |

| Retrieval based Voice Conversion WebUI | An easy-to-use Voice Conversion framework based on VITS | [源文雨](https://github.com/fumiama) | [![](https://img.shields.io/github/stars/RVC-Project/Retrieval-based-Voice-Conversion-WebUI?style=social)](https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI) 

[](https://discord.gg/HcsmBBGyVk)

[](https://github.com/auspicious3000/contentvec), [](https://github.com/jik876/hifi-gan), [](https://github.com/FFmpeg/FFmpeg), [](https://github.com/Anjok07/ultimatevocalremovergui), [](https://github.com/openvpi/audio-slicer), [](https://github.com/Dream-High/RMVPE)

[](https://huggingface.co/lj1995/VoiceConversionWebUI)

[](https://medium.com/@ja.harr91/decoding-the-sound-of-virality-a-deep-dive-into-adversarial-ai-for-voice-conversion-tasks-on-m1-d60d32cfb2d4)

[](https://youtu.be/-JcvdDErkAU), [](https://youtu.be/9TroP5mR3CM), [](https://youtu.be/Y8IxVVQBEpc), [](https://youtu.be/qZ12-Vm2ryc), [](https://youtu.be/5i_Pyw0gH-M)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/RVC-Project/Retrieval-based-Voice-Conversion-WebUI/blob/main/Retrieval_based_Voice_Conversion_WebUI.ipynb) | 11.01.2024 |

| Big Vision | This codebase is designed for training large-scale vision models using Cloud TPU VMs or GPU machines | 

[Lucas Beyer](http://lucasb.eyer.be/)
 [Xiaohua Zhai](https://github.com/xiaohuazhai)
 [Alexander Kolesnikov](https://github.com/akolesnikoff)

 | [![](https://img.shields.io/github/stars/google-research/big_vision?style=social)](https://github.com/google-research/big_vision) 

[](https://arxiv.org/abs/2010.11929), [](https://arxiv.org/abs/2106.04560), [](https://arxiv.org/abs/2105.01601), [](https://arxiv.org/abs/2205.01580), [](https://arxiv.org/abs/2212.08013), [](https://arxiv.org/abs/2305.13035), [](https://arxiv.org/abs/2303.17376), [](https://arxiv.org/abs/2306.07915), [](https://arxiv.org/abs/2305.16999), [](https://arxiv.org/abs/2302.08242), [](https://arxiv.org/abs/2006.07159)

[](https://www.tensorflow.org/guide/data), [](https://www.tensorflow.org/datasets)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/google-research/big_vision/blob/main/big_vision/configs/proj/image_text/lit.ipynb) | 03.01.2024 |

| Open Interpreter | An open-source, locally running implementation of OpenAI's Code Interpreter | [Killian Lucas](https://github.com/KillianLucas) | [![](https://img.shields.io/github/stars/KillianLucas/open-interpreter?style=social)](https://github.com/KillianLucas/open-interpreter) 

[](https://discord.gg/6p3fD6rBVm)

[](https://docs.openinterpreter.com/)

[website](https://openinterpreter.com/)

[](https://youtu.be/SqnXUHwIa3c), [](https://youtu.be/s-f4lCETxu0), [](https://youtu.be/J-H2un1Adr0), [](https://youtu.be/jaijpff58vw), [](https://youtu.be/7KFbG_3dKKs), [](https://youtu.be/4OhuFjPyZNQ), [](https://youtu.be/01tQLn_RRcE), [](https://youtu.be/uyfoHQVgeY0)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1WKmRXZgsErej2xUriKzxrEAXdxMSgWbb) | 03.01.2024 |

| Seamless Communication | Family of AI models that enable more natural and authentic communication across languages | 

[Loïc Barrault](https://loicbarrault.github.io/)
 [Yu-An Chung](https://iamyuanchung.github.io/)
 [Mariano Coria](https://www.linkedin.com/in/marianocoria)
others[David Dale](https://daviddale.ru/)
 [Ning Dong](https://scholar.google.com/citations?user=gg1hvjoAAAAJ)
 [Mark Duppenthaler](https://github.com/mduppes)
 [Paul-Ambroise Duquenne](https://scholar.google.com/citations?user=Uah8IcAAAAAJ)
 [Hady Elsahar](https://www.hadyelsahar.io/)
 [Min-Jae Hwang](https://mjhwang93.github.io/)
 [Hirofumi Inaguma](https://hirofumi0810.github.io/)
 [Ilia Kulikov](https://github.com/uralik)
 [Pengwei Li](https://scholar.google.com/citations?user=hQB3YsYAAAAJ)
 [Daniel Licht](https://github.com/Lichtphyz)
 [Jean Maillard](https://scholar.google.com/citations?user=_ewOoK0AAAAJ)
 [Ruslan Mavlyutov](https://github.com/mavlyutovr)
 [Kaushik Ram Sadagopan](https://github.com/kauterry)
 [Abinesh Ramakrishnan](https://github.com/ibanesh)
 [Tuan Tran](https://antoine-tran.github.io/)
 [Guillaume Wenzek](https://github.com/gwenzek)
 [Yilin Yang](https://yilinyang7.github.io/)
 [Ethan Ye](https://github.com/yeyinthtoon)
 [Ivan Evtimov](https://ivanevtimov.eu/)
 [Pierre Fernandez](https://pierrefdz.github.io/)
 [Robin San Roman](https://scholar.google.com/citations?user=AJ3ir84AAAAJ)
 [Bokai Yu](https://scholar.google.com/citations?user=7jNmPwUAAAAJ)
 [Pierre Andrews](https://github.com/Mortimerp9)
 [Can Balioglu](http://canbalioglu.com/)
 [Peng-Jen Chen](https://scholar.google.com/citations?user=rOXs9VMAAAAJ)
 [Marta Costa-jussà](https://costa-jussa.com/)
 [Maha Elbayad](http://elbayadm.github.io/)
 [Hongyu Gong](https://github.com/hygong-fb)
 [Francisco Guzmán](https://guzmanhe.github.io/)
 [Kevin Heffernan](https://github.com/heffernankevin)
 [Somya Jain](https://scholar.google.com/citations?user=AmBxU3kAAAAJ)
 [Justine Kao](https://scholar.google.com/citations?user=Y9BLeTAAAAAJ)
 [Ann Lee](https://www.stat.cmu.edu/~annlee/)
 [Xutai Ma](https://github.com/xutaima)
 [Benjamin Peloquin](https://scholar.google.com/citations?user=5GNAjB8AAAAJ)
 [Juan Pino](https://scholar.google.com/citations?user=weU_-4IAAAAJ)
 [Sravya Popuri](https://scholar.google.com/citations?user=MtmqG3UAAAAJ)
 [Holger Schwenk](https://github.com/hoschwenk)
 [Anna Sun](https://github.com/annasun28)
 [Paden Tomasello](https://scholar.google.com/citations?user=sBtWMGYAAAAJ)
 [Changhan Wang](https://www.changhan.me/)
 [Skyler Wang](https://www.skylerwang.com/)
 [Mary Williamson](https://scholar.google.com/citations?user=Ys4xB-QAAAAJ)

 | [![](https://img.shields.io/github/stars/facebookresearch/seamless_communication?style=social)](https://github.com/facebookresearch/seamless_communication) 

[](https://arxiv.org/abs/2312.05187)

[blog post](https://ai.meta.com/research/seamless-communication/)

[](https://github.com/libsndfile/libsndfile), [](https://github.com/facebookresearch/fairseq2), [](https://github.com/facebookresearch/SimulEval), [](https://github.com/facebookresearch/stopes), [](https://github.com/facebookresearch/SONAR)

[](https://huggingface.co/facebook/seamless-m4t-v2-large), [](https://huggingface.co/facebook/seamless-expressive), [](https://huggingface.co/facebook/seamless-streaming)

[](https://ngwaifoong92.medium.com/beginners-guide-to-seamlessm4t-81efad6e8ca6)

[](https://www.youtube.com/watch?v=0padjtkHXTE), [](https://youtu.be/rNN7qsoCKBo), [](https://youtu.be/RKEFZ44YOcc)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/facebookresearch/seamless_communication/blob/main/Seamless_Tutorial.ipynb) | 14.12.2023 |

| CleanRL | Deep Reinforcement Learning library that provides high-quality single-file implementation with research-friendly features | 

[Shengyi Huang](https://costa.sh/)
 [Rousslan Dossa](https://dosssman.github.io/)
 [Chang Ye](https://github.com/yooceii)
others[Jeff Braga](https://github.com/bragajj)
 [Dipam Chakraborty](https://github.com/dipamc)
 [Kinal Mehta](https://kinalmehta.github.io/)
 [João Araújo](https://github.com/joaogui1)

 | [![](https://img.shields.io/github/stars/vwxyzjn/cleanrl?style=social)](https://github.com/vwxyzjn/cleanrl) 

[](https://arxiv.org/abs/1707.06347), [](https://arxiv.org/abs/1707.06887), [](https://arxiv.org/abs/1812.05905), [](https://arxiv.org/abs/1509.02971), [](https://arxiv.org/abs/1802.09477), [](https://arxiv.org/abs/2009.04416), [](https://arxiv.org/abs/1810.12894)

[](https://docs.cleanrl.dev/)

[](https://github.com/tinkoff-ai/CORL), [](https://github.com/Farama-Foundation/Gymnasium), [](https://github.com/openai/baselines), [](https://github.com/ikostrikov/jaxrl)

[](https://huggingface.co/cleanrl)

[paper](https://www.jmlr.org/papers/v23/21-1342.html)

[](https://www.youtube.com/channel/UCDdC6BIFRI0jvcwuhi3aI6w), [](https://youtu.be/dm4HdGujpPs)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/vwxyzjn/cleanrl/blob/master/docs/get-started/CleanRL_Huggingface_Integration_Demo.ipynb) | 28.11.2023 |

| Vocos | Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis | [Hubert Siuzdak](https://github.com/hubertsiuzdak) | [![](https://img.shields.io/github/stars/gemelo-ai/vocos?style=social)](https://github.com/gemelo-ai/vocos) 

[](https://arxiv.org/abs/2306.00814)

[project](https://gemelo-ai.github.io/vocos/)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/charactr-platform/vocos/blob/main/notebooks/Bark%2BVocos.ipynb) | 21.11.2023 |

| X—LLM | Easy LLM Finetuning using the most advanced methods | [Boris Zubarev](https://github.com/BobaZooba) | [![](https://img.shields.io/github/stars/BobaZooba/xllm?style=social)](https://github.com/BobaZooba/xllm) 

[](https://arxiv.org/abs/2305.18290)

[](https://discord.gg/5znbxBgwZP)

[](https://github.com/BobaZooba/xllm-demo), [](https://github.com/BobaZooba/wgpt), [](https://github.com/BobaZooba/shurale)

[](https://huggingface.co/TachyHealth), [](https://huggingface.co/BobaZooba/Shurale7b-v1)

[](https://pypi.org/project/xllm/)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1zsNmJFns1PKZy5VE5p5nsQL-mZF7SwHf) | 15.11.2023 |

| Distil-Whisper | Maintains the robustness of the Whisper model to difficult acoustic conditions, while being less prone to hallucination errors on long-form audio | 

[Sanchit Gandhi](https://github.com/sanchit-gandhi)
 [Patrick von Platen](https://github.com/patrickvonplaten)
 [Alexander Rush](https://scholar.google.com/citations?&user=LIjnUGgAAAAJ)

 | [![](https://img.shields.io/github/stars/huggingface/distil-whisper?style=social)](https://github.com/huggingface/distil-whisper) 

[](https://arxiv.org/abs/2311.00430), [](https://arxiv.org/abs/2211.17192)

[](https://github.com/huggingface/safetensors), [](https://github.com/Dao-AILab/flash-attention)

[](https://huggingface.co/collections/distil-whisper/training-datasets-6538d05c69721489d1db1e49), [](https://huggingface.co/docs/transformers/model_doc/auto#transformers.AutoModelForSpeechSeq2Seq), [](https://huggingface.co/docs/transformers/model_doc/auto#transformers.AutoProcessor), [](https://huggingface.co/docs/transformers/main_classes/pipelines#transformers.AutomaticSpeechRecognitionPipeline), [](https://huggingface.co/docs/transformers/v4.34.1/en/model_doc/whisper#transformers.WhisperForConditionalGeneration.forward.example), [](https://huggingface.co/docs/transformers/main/en/main_classes/text_generation#transformers.GenerationMixin.generate.assistant_model), [](https://huggingface.co/docs/transformers/main/en/perf_infer_gpu_one#flashattention-2), [](https://huggingface.co/docs/transformers/main/en/perf_infer_gpu_one#bettertransformer)

[](https://medium.com/prompt-engineering/transcribing-audio-with-python-and-distil-whisper-9b4fec3d53bf)

[](https://www.reddit.com/r/MachineLearning/comments/17vqtcb/p_distilwhisper_a_distilled_variant_of_whisper/)

[](https://youtu.be/46Q6fbdUCbg), [](https://youtu.be/SZtHEKyvuug), [](https://www.youtube.com/live/kI1pA1CADxM)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/sanchit-gandhi/notebooks/blob/main/Distil_Whisper_Benchmark.ipynb) | 08.11.2023 |

| AnimateDiff | Practical framework to animate most of the existing personalized text-to-image models once and for all, saving efforts in model-specific tuning | 

[Yuwei Guo](https://github.com/guoyww)
 [Ceyuan Yang](https://github.com/limbo0000)
 [Anyi Rao](https://anyirao.com/)
others[Yaohui Wang](https://wyhsirius.github.io/)
 [Yu Qiao](https://mmlab.siat.ac.cn/yuqiao/)
 [Dahua Lin](http://dahua.site/)
 [Bo Dai](https://daibo.info/)

 | [![](https://img.shields.io/github/stars/guoyww/animatediff?style=social)](https://github.com/guoyww/animatediff/) 

[](https://arxiv.org/abs/2307.04725)

[](https://github.com/continue-revolution/sd-webui-animatediff), [](https://github.com/talesofai/AnimateDiff), [](https://youtu.be/-wki7IrQ_sU)

[project](https://animatediff.github.io/)

[](https://youtu.be/rdnOhM8L8nE), [](https://youtu.be/LcHAZaJjA5k), [](https://www.youtube.com/live/66JgpI3a650?feature=share)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/camenduru/AnimateDiff-colab/blob/main/AnimateDiff_colab.ipynb) | 30.10.2023 |

| Intel® Neural Compressor | Aims to provide popular model compression techniques such as quantization, pruning (sparsity), distillation, and neural architecture search on mainstream frameworks such as TensorFlow, PyTorch, ONNX Runtime, and MXNet, as well as Intel extensions such as Intel Extension for TensorFlow and Intel Extension for PyTorch | [intel](https://www.intel.com/content/www/us/en/developer/topic-technology/open/overview.html) | [![](https://img.shields.io/github/stars/intel/neural-compressor?style=social)](https://github.com/intel/neural-compressor) 

[](https://arxiv.org/abs/2309.14592), [](https://arxiv.org/abs/2309.05516), [](https://arxiv.org/abs/2211.07715)

[](https://discord.com/invite/Wxk3J3ZJkU)

[](https://github.com/intel/neural-compressor)

[](https://github.com/intel/intel-extension-for-tensorflow), [](https://github.com/intel/intel-extension-for-pytorch), [](https://github.com/Lightning-AI/pytorch-lightning/blob/master/docs/source-pytorch/advanced/post_training_quantization.rst)

[](https://medium.com/pytorch/pytorch-inference-acceleration-with-intel-neural-compressor-842ef4210d7d), [](https://medium.com/intel-analytics-software/efficient-text-classification-with-intel-neural-compressor-4853296deeac)

[](https://neurips.cc/virtual/2022/59433)

[](https://pytorch.org/tutorials/recipes/intel_neural_compressor_for_pytorch.html)

[](https://youtu.be/SswQbIHUrvQ), [](https://youtu.be/5xHKe4wWLes), [](https://youtu.be/H7Gg-EmGpAI), [](https://youtu.be/ie3w_j0Ntsk), [](https://youtu.be/m2LokuUdeVg), [](https://youtu.be/38wrDHEQZuM)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/intel/neural-compressor/blob/master/examples/notebook/onnxruntime/Quick_Started_Notebook_of_INC_for_ONNXRuntime.ipynb) | 27.10.2023 |

| Bark | Transformer-based text-to-audio model | [suno](https://www.suno.ai/) | [![](https://img.shields.io/github/stars/suno-ai/bark?style=social)](https://github.com/suno-ai/bark) 

[](https://arxiv.org/abs/2209.03143), [](https://arxiv.org/abs/2301.02111)

[](https://discord.gg/J2B2vsjKuE)

[examples](https://suno-ai.notion.site/Bark-Examples-5edae8b02a604b54a42244ba45ebc2e2)

[](https://github.com/facebookresearch/encodec), [](https://github.com/karpathy/nanoGPT)

[](https://huggingface.co/docs/huggingface_hub/package_reference/environment_variables#hfhome)

[](https://twitter.com/OnusFM)

[](https://youtu.be/84LzaXAo6vE), [](https://youtu.be/rU5Do9yHbwM), [](https://youtu.be/w41-MUfxIWo), [](https://youtu.be/_m-MxEpHUQY)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1dWWkZzvu7L9Bunq9zvD-W02RFUXoW-Pd) | 25.10.2023 |

| Mistral Transformer | The most powerful language model for its size to date | 

[Albert Jiang](https://albertqjiang.github.io/)
 [Alexandre Sablayrolles](https://github.com/alexandresablayrolles)
 [Arthur Mensch](https://github.com/arthurmensch)
others[Chris Bamford](https://griddly.ai/)
 [Devendra Chaplot](https://devendrachaplot.github.io/)
 [Diego Casas](https://github.com/diegolascasas)
 [Florian Bressand](https://www.linkedin.com/in/florianbressand)
 [Gianna Lengyel](https://www.linkedin.com/in/gianna-maria-lengyel)
 [Guillaume Lample](https://github.com/glample)
 [Lucile Saulnier](https://scholar.google.com/citations?user=Baj_9IsAAAAJ)
 [Lélio Renard Lavaud](https://github.com/lerela)
 [Marie-Anne Lachaux](https://scholar.google.com/citations?user=dSEMIJ8AAAAJ)
 [Pierre Stock](https://github.com/pierrestock)
 [Teven Scao](https://scholar.google.com/citations?user=ik0_vxsAAAAJ)
 [Thibaut Lavril](https://scholar.google.com/citations?user=9nPunCEAAAAJ)
 [Thomas Wang](https://github.com/thomasw21)
 [Timothée Lacroix](https://scholar.google.com/citations?&user=tZGS6dIAAAAJ)
 [William Sayed](https://www.linkedin.com/in/william-el-sayed-48672312a)

 | [![](https://img.shields.io/github/stars/mistralai/mistral-src?style=social)](https://github.com/mistralai/mistral-src) 

[](https://arxiv.org/abs/2310.06825), [](https://arxiv.org/abs/1904.10509), [](https://arxiv.org/abs/2004.05150), [](https://arxiv.org/abs/2306.05685)

[blog post](https://mistral.ai/news/announcing-mistral-7b/)

[](https://discord.com/invite/mistralai)

[](https://docs.mistral.ai/)

[](https://github.com/vllm-project/vllm), [](https://github.com/ggerganov/ggml), [](https://github.com/Dao-AILab/flash-attention), [](https://github.com/skypilot-org/skypilot)

[](https://huggingface.co/mistralai)

[](https://towardsdatascience.com/mistral-7b-recipes-for-fine-tuning-and-quantization-on-your-computer-631401583f77)

[](https://youtu.be/g7kVVBlCGo0), [](https://youtu.be/ASpageg8nPw), [](https://youtu.be/OMIuP6lQXe4), [](https://youtu.be/jnPZApwtE4I), [](https://youtu.be/3SdopNwQJ-c)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/camenduru/Mistral-colab/blob/main/Mistral_colab.ipynb) | 09.10.2023 |

| Fooocus | Image generating software | [Lvmin Zhang](https://lllyasviel.github.io/Style2PaintsResearch/lvmin) | [![](https://img.shields.io/github/stars/lllyasviel/Fooocus?style=social)](https://github.com/lllyasviel/Fooocus) 

[](https://arxiv.org/abs/2210.00939)

[](https://youtu.be/8krykSwOz3E), [](https://youtu.be/558W8rfnP-Q), [](https://youtu.be/TJkrzuPdmvE), [](https://youtu.be/NfNwmKM3sxc)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/lllyasviel/Fooocus/blob/main/colab.ipynb) | 03.10.2023 |

| Actor-Critic | This tutorial demonstrates how to implement the Actor-Critic method using TensorFlow to train an agent on the Open AI Gym CartPole-V0 environment | 

[Vijay Konda](https://scholar.google.com/citations?user=bi-WXQIAAAAJ)
 [John Tsitsiklis](https://web.mit.edu/jnt/www/home.html)

 | 

[gym](https://gym.openai.com/)

[](https://papers.nips.cc/paper/1786-actor-critic-algorithms), [](https://papers.nips.cc/paper/1713-policy-gradient-methods-for-reinforcement-learning-with-function-approximation.pdf)

[](https://www.tensorflow.org/tutorials/reinforcement_learning/actor_critic)

[](https://en.wikipedia.org/wiki/Temporal_difference_learning)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/tensorflow/docs/blob/master/site/en/tutorials/reinforcement_learning/actor_critic.ipynb) | 28.09.2023 |

| LiteLLM | Call all LLM APIs using the OpenAI format [Bedrock, Huggingface, VertexAI, TogetherAI, Azure, OpenAI, Groq etc.] | 

[Ishaan Jaff](https://github.com/ishaan-jaff)
 [Krish Dholakia](http://www.krishdholakia.com/)

 | [![](https://img.shields.io/github/stars/BerriAI/litellm?style=social)](https://github.com/BerriAI/litellm) 

[](https://discord.gg/wuPM9dRgDw)

[](https://docs.litellm.ai/docs/)

[](https://robert-mcdermott.medium.com/centralizing-multi-vendor-llm-services-with-litellm-9874563f3062), [](https://medium.com/@manthapavankumar11/building-robust-llm-applications-for-production-grade-scale-using-litellm-449290bd6e45)

[](https://pypi.org/project/litellm/)

[website](https://www.litellm.ai/)

[](https://youtu.be/nQCOTzS5oU0), [](https://youtu.be/sfcM-bfFyP4), [](https://youtu.be/_x3F5ZHxVJI), [](https://youtu.be/04t0X9BSVFs), [](https://youtu.be/29_ipKNI8I0)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/BerriAI/litellm/blob/main/cookbook/liteLLM_Getting_Started.ipynb) | 23.09.2023 |

| MMagic | AIGC toolbox for professional AI researchers and machine learning engineers to explore image and video processing, editing and generation | [OpenMMLab](https://openmmlab.com/) | [![](https://img.shields.io/github/stars/open-mmlab/mmagic?style=social)](https://github.com/open-mmlab/mmagic) 

[](https://discord.gg/raweFPmdzG)

[](https://mmagic.readthedocs.io/en/latest/)

[](https://github.com/open-mmlab/mmgeneration), [](https://github.com/open-mmlab/mmengine/blob/main/mmengine/model/wrappers/seperate_distributed.py), [](https://github.com/open-mmlab/mmcv), [](https://github.com/open-mmlab/mim)

[](https://openmmlab.medium.com/)

[](https://twitter.com/OpenMMLab)

[](https://www.youtube.com/openmmlab)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/open-mmlab/mmagic/blob/main/demo/mmagic_inference_tutorial.ipynb) | 11.09.2023 |

| SeqIO | Library for processing sequential data to be fed into downstream sequence models | 

[Adam Roberts](https://github.com/adarob)
 [Hyung Won Chung](https://github.com/hwchung27)
 [Anselm Levskaya](https://anselmlevskaya.com/)
others[Gaurav Mishra](https://github.com/gauravmishra)
 [James Bradbury](https://github.com/jekbradbury)
 [Daniel Andor](https://github.com/andorardo)
 [Sharan Narang](https://github.com/sharannarang)
 [Brian Lester](https://blester125.com/)
 [Colin Gaffney](https://github.com/cpgaffney1)
 [Afroz Mohiuddin](https://github.com/afrozenator)
 [Curtis Hawthorne](https://github.com/cghawthorne)
 [Aitor Lewkowycz](https://scholar.google.com/citations?user=Yum1ah0AAAAJ)
 [Alex Salcianu](https://scholar.google.com/citations?user=HSrT1wsAAAAJ)
 [Marc van Zee](https://github.com/marcvanzee)
 [Jacob Austin](https://jacobaustin123.github.io/)
 [Sebastian Goodman](https://github.com/0x0539)
 [Livio Baldini Soares](https://liviosoares.github.io/)
 [Haitang Hu](https://hthu.github.io/)
 [Sasha Tsvyashchenko](https://endl.ch/)
 [Aakanksha Chowdhery](http://www.achowdhery.com/)
 [Jasmijn Bastings](https://jasmijn.ninja/)
 [Jannis Bulian](http://bulian.org/)
 [Xavier Garcia](https://scholar.google.com/citations?user=Y2Hio6MAAAAJ)
 [Jianmo Ni](https://nijianmo.github.io/)
 [Andrew Chen](https://github.com/andrewluchen)
 [Kathleen Kenealy](https://github.com/kkenealy)
 [Jonathan Clark](http://www.cs.cmu.edu/~jhclark/)
 [Stephan Lee](https://github.com/stephanwlee)
 [Dan Garrette](https://www.dhgarrette.com/)
 [James Lee-Thorp](https://scholar.google.com/citations?user=qsPv098AAAAJ)
 [Colin Raffel](https://www.colinraffel.com/)
 [Noam Shazeer](https://github.com/nshazeer)
 [Marvin Ritter](https://github.com/Marvin182)
 [Maarten Bosma](https://scholar.google.com/citations?user=wkeFQPgAAAAJ)
 [Alexandre Passos](https://www.ic.unicamp.br/~tachard/)
 [Jeremy Maitin-Shepard](https://research.google/people/JeremyMaitinShepard/)
 [Noah Fiedel](https://scholar.google.com/citations?user=XWpV9DsAAAAJ)
 [Mark Omernick](https://github.com/markomernick)
 [Brennan Saeta](https://github.com/saeta)
 [Ryan Sepassi](https://ryansepassi.com/)
 [Alexander Spiridonov](https://research.google/people/AlexanderSpiridonov/)
 [Joshua Newlan](https://github.com/joshnewlan)
 [Andrea Gesmundo](https://github.com/agesmundo)

 | [![](https://img.shields.io/github/stars/google/seqio?style=social)](https://github.com/google/seqio) 

[](https://arxiv.org/abs/2203.17189), [](https://arxiv.org/abs/1910.10683), [](https://arxiv.org/abs/1810.04805), [](https://arxiv.org/abs/2002.08910)

[](https://seqio.readthedocs.io/en/latest/)

[](https://www.tensorflow.org/api_docs/python/tf/data/Dataset), [](https://www.tensorflow.org/datasets), [](https://www.tensorflow.org/tutorials/load_data/tfrecord), [](https://www.tensorflow.org/guide/function#autograph_transformations), [](https://www.tensorflow.org/api_docs/python/tf/py_function), [](https://www.tensorflow.org/guide/random_numbers#stateless_rngs)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/google/seqio/blob/main/seqio/notebooks/Evaluation.ipynb) | 08.09.2023 |

| MMAction2 | An open-source toolbox for video understanding based on PyTorch | [MMAction2 Contributors](https://openmmlab.com/aboutus) | [![](https://img.shields.io/github/stars/open-mmlab/mmaction2?style=social)](https://github.com/open-mmlab/mmaction2) 

[](https://arxiv.org/abs/2106.13230), [](https://arxiv.org/abs/2107.10161), [](https://arxiv.org/abs/2103.17263), [](https://arxiv.org/abs/2104.13586), [](https://arxiv.org/abs/2102.05095), [](https://arxiv.org/abs/2003.13042)

[data](https://sdolivia.github.io/FineGym/), [data](http://www.svcl.ucsd.edu/projects/resound/dataset.html), [data](https://research.google.com/ava/index.html), [data](https://www.deepmind.com/open-source/kinetics)

[](https://mmaction2.readthedocs.io/)

[](https://github.com/open-mmlab/mmcv), [](https://github.com/SwinTransformer/Video-Swin-Transformer), [](https://github.com/Cogito2012/DEAR), [](https://github.com/xvjiarui/VFS), [](https://github.com/holistic-video-understanding/HVU-Dataset)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/open-mmlab/mmaction2/blob/master/demo/mmaction2_tutorial.ipynb) | 06.09.2023 |

| Home Robot | Low-level API for controlling various home robots | [Chris Paxton](https://cpaxton.github.io/) | [![](https://img.shields.io/github/stars/facebookresearch/home-robot?style=social)](https://github.com/facebookresearch/home-robot) [](https://github.com/cpaxton/contact_graspnet/tree/cpaxton/devel), [](https://github.com/facebookresearch/fairo), [](https://github.com/hello-robot/stretch_body), [](https://github.com/hello-robot/stretch_firmware), [](https://github.com/hello-robot/stretch_ros), [](https://github.com/hello-robot/stretch_ros2), [](https://github.com/hello-robot/stretch_web_interface), [](https://github.com/RoboStack/ros-noetic), [](https://github.com/codekansas/stretch-robot) | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/facebookresearch/home-robot/blob/master/src/home_robot_sim/notebooks/velocity_control_sim.ipynb) | 30.08.2023 |

| Neural Tangents | Library designed to enable research into infinite-width neural networks | 

[Roman Novak](https://github.com/romanngg)
 [Lechao Xiao](https://sites.google.com/site/lechaoxiao/)
 [Jiri Hron](https://sites.google.com/view/jirihron)
others[Jaehoon Lee](https://jaehlee.github.io/)
 [Alexander Alemi](https://www.alexalemi.com/)
 [Jascha Sohl-Dickstein](https://sohldickstein.com/)
 [Samuel Schoenholz](https://scholar.google.com/citations?user=mk-zQBsAAAAJ)

 | [![](https://img.shields.io/github/stars/google/neural-tangents?style=social)](https://github.com/google/neural-tangents) 

[ICLR](https://iclr.cc/virtual_2020/poster_SklD9yrFPS.html)

[](https://arxiv.org/abs/1912.02803), [](https://arxiv.org/abs/1605.07146), [](https://arxiv.org/abs/1902.06720), [](https://arxiv.org/abs/1806.07572), [](https://arxiv.org/abs/2001.07301), [](https://arxiv.org/2003.02237)

[](https://neural-tangents.readthedocs.io/en/latest/)

[](https://towardsdatascience.com/infinitely-wide-neural-networks-neural-tangents-explained-d6c6d896fcbf)

[](https://pypi.org/project/neural-tangents/)

[](https://en.wikipedia.org/wiki/Neural_network_Gaussian_process), [](https://en.wikipedia.org/wiki/Neural_tangent_kernel)

[](https://youtu.be/VUX2bsrYag8)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/google/neural-tangents/blob/main/notebooks/neural_tangents_cookbook.ipynb) | 29.08.2023 |

| Stable Diffusion 2 | New stable diffusion model at 768x768 resolution. Same number of parameters in the U-Net as 1.5, but uses OpenCLIP-ViT/H as the text encoder and is trained from scratch | 

[Robin Rombach](https://github.com/rromb)
 [Andreas Blattmann](https://github.com/ablattmann)
 [Dominik Lorenz](https://github.com/qp-qp)
others[Patrick Esser](https://github.com/pesser)
 [Björn Ommer](https://ommer-lab.com/people/ommer/)
 [qunash](https://github.com/qunash)

 | [![](https://img.shields.io/github/stars/Stability-AI/stablediffusion?style=social)](https://github.com/Stability-AI/stablediffusion) 

[](https://arxiv.org/abs/2112.10752), [](https://arxiv.org/abs/2202.00512), [](https://arxiv.org/abs/2010.02502), [](https://arxiv.org/abs/2108.01073), [](https://arxiv.org/abs/2202.09778), [](https://arxiv.org/abs/2206.00927)

[](https://github.com/qunash/stable-diffusion-2-gui), [](https://github.com/isl-org/MiDaS), [](https://github.com/lucidrains/denoising-diffusion-pytorch), [](https://github.com/runwayml/stable-diffusion/blob/main/scripts/inpaint_st.py), [](https://github.com/crowsonkb/k-diffusion)

[](https://huggingface.co/stabilityai/stable-diffusion-2-1), [](https://huggingface.co/stabilityai/stable-diffusion-2-1-base), [](https://huggingface.co/stabilityai/stable-diffusion-2-depth), [](https://huggingface.co/stabilityai/stable-diffusion-2-inpainting)

[](https://youtu.be/HytucGhwTRs)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/qunash/stable-diffusion-2-gui/blob/main/stable_diffusion_2_0.ipynb) | 26.08.2023 |

| DALL·E Mini | Generate images from a text prompt | 

[Boris Dayma](https://github.com/borisdayma)
 [Suraj Patil](https://github.com/patil-suraj)
 [Pedro Cuenca](https://github.com/pcuenca)
others[Khalid Saifullah](https://khalidsaifullaah.github.io/)
 [Tanishq Abraham](https://github.com/tmabraham)
 [Phúc H. Lê Khắc](https://lkhphuc.com/)
 [Luke Melas](https://lukemelas.github.io/)
 [Ritobrata Ghosh](https://ghosh-r.github.io/)

 | [![](https://img.shields.io/github/stars/borisdayma/dalle-mini?style=social)](https://github.com/borisdayma/dalle-mini) 

[](https://arxiv.org/abs/2102.08981), [](https://arxiv.org/abs/2012.09841), [](https://arxiv.org/abs/1910.13461), [](https://arxiv.org/abs/2103.00020), [](https://arxiv.org/abs/2012.09841), [](https://arxiv.org/abs/1807.04015)

[blog post](https://wandb.ai/dalle-mini/dalle-mini/reports/DALL-E-mini--Vmlldzo4NjIxODA)

[data](https://aclanthology.org/P18-1238/)

[](https://github.com/huggingface/transformers/tree/master/examples/research_projects/jax-projects), [](https://github.com/openai/CLIP/blob/main/data/yfcc100m.md)

[](https://huggingface.co/spaces/flax-community/dalle-mini)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/borisdayma/dalle-mini/blob/main/tools/inference/inference_pipeline.ipynb) | 22.08.2023 |

| Kandinsky 2.1 | As text and image encoder it uses CLIP model and diffusion image prior between latent spaces of CLIP modalities | 

[Arseniy Shakhmatov](https://github.com/cene555)
 [Anton Razzhigaev](https://github.com/razzant)
 [Aleksandr Nikolich](https://github.com/AlexWortega)
others[Vladimir Arkhipkin](https://github.com/oriBetelgeuse)
 [Igor Pavlov](https://github.com/boomb0om)
 [Andrey Kuznetsov](https://github.com/kuznetsoffandrey)
 [Denis Dimitrov](https://github.com/denndimitrov)

 | [![](https://img.shields.io/github/stars/ai-forever/Kandinsky-2?style=social)](https://github.com/ai-forever/Kandinsky-2) 

[blog post](https://habr.com/ru/companies/sberbank/articles/725282/)

[demo](https://editor.fusionbrain.ai/)

[](https://huggingface.co/sberbank-ai/Kandinsky_2.1)

[](https://youtu.be/LZvp4SWcCao), [](https://youtu.be/IoPhRE37XSU), [](https://youtu.be/dYt9xJ7dnpU), [](https://youtu.be/rN2J5TL2RZ0)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1xSbu-b-EwYd6GdaFPRVgvXBX_mciZ41e) | 07.08.2023 |

| Stable Diffusion web UI | A web interface for Stable Diffusion, implemented using Gradio library | [AUTOMATIC1111](https://github.com/AUTOMATIC1111) | [![](https://img.shields.io/github/stars/AUTOMATIC1111/stable-diffusion-webui?style=social)](https://github.com/AUTOMATIC1111/stable-diffusion-webui) 

[](https://arxiv.org/abs/2211.06679)

[](https://github.com/yfszzx/stable-diffusion-webui-images-browser), [](https://github.com/pharmapsychotic/clip-interrogator), [](https://github.com/openvinotoolkit/stable-diffusion-webui/wiki/Installation-on-Intel-Silicon), [](https://github.com/wangshuai09/stable-diffusion-webui/wiki/Install-and-run-on-Ascend-NPUs), [](https://github.com/Stability-AI/sd3-ref), [](https://github.com/crowsonkb/k-diffusion), [](https://github.com/chaiNNer-org/spandrel), [](https://github.com/AminRezaei0x443/memory-efficient-attention), [](https://github.com/isl-org/MiDaS), [](https://github.com/invoke-ai/InvokeAI), [](https://github.com/jquesnelle/txt2imghd), [](https://github.com/parlance-zz/g-diffuser-bot), [](https://github.com/rinongal/textual_inversion), [](https://github.com/KichangKim/DeepDanbooru), [](https://github.com/tfernd/HyperTile), [](https://github.com/Newbeeer/diffusion_restart_sampling), [](https://github.com/madebyollin/taesdv), [](https://github.com/wl-zhao/UniPCv), [](https://github.com/timothybrooks/instruct-pix2pix)

[](https://huggingface.co/segmind/SSD-1B)

[](https://medium.com/@david.wimmer_86959/difoosion-a-simple-web-interface-for-stable-diffusion-models-9c67c55753b8), [](https://medium.com/@sathish851rsh/stable-diffusion-web-ui-7100c4b489a3), [](https://medium.com/codex/how-to-install-stable-diffusion-for-free-image-generation-c3c7ad67994d)

[](https://www.reddit.com/r/StableDiffusion/comments/x28a76/stable_diffusion_web_ui/)

[](https://youtu.be/onmqbI5XPH8), [](https://youtu.be/kqXpAKVQDNU), [](https://www.youtube.com/playlist?list=PLXS4AwfYDUi5sbsxZmDQWxOQTml9Uqyd2)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/camenduru/stable-diffusion-webui-colab/blob/main/stable/stable_diffusion_webui_colab.ipynb) | 05.08.2023 |

| SoftVC VITS | Singing Voice Conversion | [svc develop team](https://github.com/svc-develop-team) | [![](https://img.shields.io/github/stars/svc-develop-team/so-vits-svc?style=social)](https://github.com/svc-develop-team/so-vits-svc) 

[](https://github.com/NaruseMioShirakana/MoeVoiceStudio), [](https://github.com/openvpi/DiffSinger/tree/refactor/modules/nsf_hifigan), [](https://github.com/auspicious3000/contentvec), [](https://github.com/yxlllc/DDSP-SVC), [](https://github.com/flutydeer/audio-slicer), [](https://github.com/openvpi/audio-slicer)

[](https://huggingface.co/NaruseMioShirakana/MoeSS-SUBModel/tree/main)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/svc-develop-team/so-vits-svc/blob/4.1-Stable/sovits4_for_colab.ipynb) | 31.07.2023 |

| threestudio | Unified framework for 3D content creation from text prompts, single images, and few-shot images, by lifting 2D text-to-image generation models | 

[Yuan-Chen Guo](https://github.com/bennyguo)
 [Ying-Tian Liu](https://github.com/thuliu-yt16)
 [Ruizhi Shao](https://github.com/DSaurus)
others[Christian Laforte](https://github.com/claforte)
 [Vikram Voleti](https://github.com/voletiv)
 [Guan Luo](https://github.com/logan0601)
 [Chia-Hao Chen](https://scholar.google.com/citations?user=X0zirvMAAAAJ)
 [Zi-Xin Zou](https://github.com/zouzx)
 [Chen Wang](https://cwchenwang.github.io/)
 [Yanpei Cao](https://yanpei.me/)
 [Song-Hai Zhang](https://scholar.google.com/citations?user=AWtV-EQAAAAJ)

 | [![](https://img.shields.io/github/stars/threestudio-project/threestudio?style=social)](https://github.com/threestudio-project/threestudio) 

[](https://arxiv.org/abs/2303.15413), [](https://arxiv.org/abs/2305.16213), [](https://arxiv.org/abs/2211.10440)

[](https://discord.gg/ejer2MAB8N)

[](https://github.com/DSaurus/Tensor4D), [](https://github.com/eladrich/latent-nerf), [](https://github.com/Gorilla-Lab-SCUT/Fantasia3D), [](https://github.com/cvlab-columbia/zero123), [](https://github.com/guochengqian/Magic123), [](https://github.com/ayaanzhaque/instruct-nerf2nerf), [](https://github.com/KAIR-BAIR/nerfacc), [](https://github.com/Lightning-AI/lightning), [](https://github.com/ashawkey/fantasia3d.unofficial)

[](https://huggingface.co/DeepFloyd/IF-I-XL-v1.0), [](https://huggingface.co/docs/huggingface_hub/v0.14.1/guides/download#download-an-entire-repository)

[](https://www.reddit.com/r/StableDiffusion/comments/1635cb0/threestudio_a_unified_framework_for_3d_content/)

[](https://youtu.be/gT8Xvx5b6IE)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/threestudio-project/threestudio/blob/main/threestudio.ipynb) | 28.07.2023 |

| Image captioning | Given an image our goal is to generate a caption | 

[Kelvin Xu](https://kelvinxu.github.io/)
 [Jimmy Ba](https://jimmylba.github.io/)
 [Ryan Kiros](https://github.com/ryankiros)
others[Kyunghyun Cho](https://kyunghyuncho.me/)
 [Aaron Courville](https://mila.quebec/en/directory/aaron-courville)
 [Ruslan Salakhutdinov](https://www.cs.cmu.edu/~rsalakhu/)
 [Richard Zemel](https://www.cs.columbia.edu/~zemel/)
 [Yoshua Bengio](https://yoshuabengio.org/)

 | 

[](https://arxiv.org/abs/1502.03044)

[data](https://cocodataset.org/#home)

[](https://medium.com/@labbikarmacharya/paper-review-show-attend-and-tell-neural-image-caption-generation-with-visual-attention-03928d8fe17b)

[](https://www.tensorflow.org/text/tutorials/image_captioning)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/tensorflow/docs/blob/master/site/en/tutorials/text/image_captioning.ipynb) | 25.07.2023 |

| Word2Vec | Word2Vec is not a singular algorithm, rather, it is a family of model architectures and optimizations that can be used to learn word embeddings from large datasets | 

[Tomas Mikolov](https://scholar.google.com/citations?user=oBu8kMMAAAAJ)
 [Kai Chen](https://scholar.google.com/citations?user=TKvd_Z4AAAAJ)
 [Greg Corrado](https://research.google/people/gregcorrado/)
 [Jeffrey Dean](https://research.google/people/jeff/)

 | 

[](https://arxiv.org/abs/1301.3781)

[link](https://web.stanford.edu/class/cs224n/readings/cs224n-2019-notes01-wordvecs1.pdf)

[](https://medium.com/@harshvivek14/summary-of-efficient-estimation-of-word-representations-in-vector-space-8f96d38f4b13), [](https://medium.com/@towardsautonomy/word2vec-part1-5436c846c3fa)

[](https://papers.nips.cc/paper_files/paper/2013/hash/9aa42b31882ec039965f3c4923ce901b-Abstract.html)

[projector](http://projector.tensorflow.org/)

[](https://paperswithcode.com/method/cbow-word2vec), [](https://paperswithcode.com/method/skip-gram-word2vec)

[](https://en.wikipedia.org/wiki/Word2vec), [](https://en.wikipedia.org/wiki/Zipf%27s_law)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/tensorflow/docs/blob/master/site/en/tutorials/text/word2vec.ipynb) | 25.07.2023 |

| Word embeddings | This tutorial contains an introduction to word embeddings | [Billy Lamberta](https://github.com/lamberta) | 

[data](http://ai.stanford.edu/~amaas/data/sentiment/)

[projector](http://projector.tensorflow.org/)

[](https://www.tensorflow.org/text/guide/word_embeddings)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/tensorflow/text/blob/master/docs/tutorials/word_embeddings.ipynb) | 25.07.2023 |

| Tortoise | A multi-voice TTS system trained with an emphasis on quality | [James Betker](https://nonint.com/) | [![](https://img.shields.io/github/stars/neonbjb/tortoise-tts?style=social)](https://github.com/neonbjb/tortoise-tts) 

[](https://arxiv.org/abs/2102.12092), [](https://arxiv.org/abs/2102.09672), [](https://arxiv.org/abs/2106.07889)

[examples](https://nonint.com/static/tortoise_v2_examples.html)

[](https://github.com/neonbjb/DL-Art-School)

[](https://huggingface.co/patrickvonplaten), [](https://huggingface.co/spaces/osanseviero/tortoisse-tts)

[](https://youtu.be/J3-jfS29RF4)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/neonbjb/tortoise-tts/blob/main/tortoise_tts.ipynb) | 15.07.2023 |

| Petals | Run 100B+ language models at home, BitTorrent-style | [BigScience](https://bigscience.huggingface.co/) | [![](https://img.shields.io/github/stars/bigscience-workshop/petals?style=social)](https://github.com/bigscience-workshop/petals) 

[](https://arxiv.org/abs/2209.01188), [](https://arxiv.org/abs/2108.07258)

[](https://github.com/borzunov/chat.petals.ml), [](https://github.com/timDettmers/bitsandbytes)

[](https://huggingface.co/bigscience/bloom)

[project](https://petals.ml/)

[](https://en.wikipedia.org/wiki/BitTorrent)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1Ervk6HPNS6AYVr3xVdQnY5a-TjjmLCdQ) | 05.07.2023 |

| Epistemic Neural Networks | A library for neural networks that know what they don't know | 

[Ian Osband](http://iosband.github.io/)
 [Zheng Wen](http://zheng-wen.com/)
 [Seyed Mohammad Asghari](https://github.com/mohammadasghari)
others[Vikranth Dwaracherla](https://github.com/dvikranth)
 [Morteza Ibrahimi](https://github.com/mibrahimi)
 [Xiuyuan Lu](https://scholar.google.com/citations?user=SPL_2lIAAAAJ)
 [Benjamin Van Roy](https://web.stanford.edu/~bvr/)

 | [![](https://img.shields.io/github/stars/deepmind/enn?style=social)](https://github.com/deepmind/enn) 

[](https://arxiv.org/abs/2107.08924)

[](https://medium.com/syncedreview/deepminds-epistemic-neural-networks-open-new-avenues-for-uncertainty-modelling-in-large-and-fa83ab00aba3)

[](https://youtu.be/j8an0dKcX4A)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/deepmind/enn/blob/master/enn/colabs/enn_demo.ipynb) | 27.06.2023 |

| DeepFloyd IF | State-of-the-art open-source text-to-image model with a high degree of photorealism and language understanding | 

[Alex Shonenkov](https://linktr.ee/shonenkovAI)
 [Misha Konstantinov](https://github.com/zeroshot-ai)
 [Daria Bakshandaeva](https://github.com/Gugutse)
others[Christoph Schuhmann](http://christoph-schuhmann.de/)
 [Ksenia Ivanova](https://github.com/ivksu)
 [Nadiia Klokova](https://github.com/vauimpuls)

 | [![](https://img.shields.io/github/stars/deep-floyd/IF?style=social)](https://github.com/deep-floyd/IF) 

[](https://arxiv.org/abs/2205.11487)

[](https://discord.gg/umz62Mgr)

[](https://huggingface.co/DeepFloyd), [](https://huggingface.co/docs/diffusers/optimization/fp16#model-offloading-for-fast-inference-and-memory-savings), [](https://huggingface.co/docs/diffusers/api/pipelines/if#optimizing-for-speed), [](https://huggingface.co/docs/diffusers/api/pipelines/if#optimizing-for-memory), [](https://huggingface.co/blog/if), [](https://huggingface.co/docs/diffusers/main/en/api/pipelines/if)

[](https://www.kaggle.com/code/shonenkov/deepfloyd-if-4-3b-generator-of-pictures)

[](https://twitter.com/deepfloydai)

[website](https://deepfloyd.ai/deepfloyd-if)

[](https://youtu.be/4Zkipll5Rjc), [](https://youtu.be/tq5ZXZWwTPA), [](https://youtu.be/rLtfd1TvYJk)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/huggingface/notebooks/blob/main/diffusers/deepfloyd_if_free_tier_google_colab.ipynb) | 26.06.2023 |

| normflows | PyTorch implementation of discrete normalizing flows | 

[Vincent Stimper](https://is.mpg.de/person/vstimper)
 [David Liu](https://davindicode.github.io/)
 [Andrew Campbell](https://github.com/andrew-cr)
others[Vincent Berenz](http://vincentberenz.is.tuebingen.mpg.de/)
 [Lukas Ryll](https://github.com/lukasryll)
 [Bernhard Schölkopf](https://scholar.google.com/citations?user=DZ-fHPgAAAAJ)
 [José Miguel Hernández-Lobato](https://jmhl.org/)

 | [![](https://img.shields.io/github/stars/VincentStimper/normalizing-flows?style=social)](https://github.com/VincentStimper/normalizing-flows) 

[](https://arxiv.org/abs/2302.12014)

[](https://vincentstimper.github.io/normalizing-flows/)

[](https://github.com/VincentStimper/resampled-base-flows), [](https://github.com/VincentStimper/hmc-hyperparameter-tuning)

[](https://en.wikipedia.org/wiki/Von_Mises_distribution)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/VincentStimper/normalizing-flows/blob/master/examples/paper_example_nsf_colab.ipynb) | 26.06.2023 |

| MMPose | Toolbox for pose estimation based on PyTorch | [OpenMMLab](https://openmmlab.com/) | [![](https://img.shields.io/github/stars/open-mmlab/mmpose?style=social)](https://github.com/open-mmlab/mmpose) 

[](https://discord.com/channels/1037617289144569886/1072798105428299817)

[](https://mmpose.readthedocs.io/en/latest/)

[](https://openmmlab.medium.com/)

[](https://pypi.org/project/mmpose/)

[](https://twitter.com/OpenMMLab)

[](https://www.youtube.com/openmmlab), [](https://youtu.be/nFcZ2H1Ix3w)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/open-mmlab/mmpose/blob/master/demo/MMPose_Tutorial.ipynb) | 19.06.2023 |

| MyoSuite | A collection of musculoskeletal environments and tasks simulated with the MuJoCo physics engine and wrapped in the OpenAI gym API to enable the application of Machine Learning to bio-mechanic control problems | 

[Vittorio Caggiano](https://github.com/Vittorio-Caggiano)
 [Huawei Wang](https://huaweiwang.github.io/)
 [Guillaume Durandau](https://people.utwente.nl/g.v.durandau)
others[Massimo Sartori](https://people.utwente.nl/m.sartori)
 [Vikash Kumar](https://vikashplus.github.io/)

 | [![](https://img.shields.io/github/stars/facebookresearch/myosuite?style=social)](https://github.com/facebookresearch/myosuite) 

[](https://arxiv.org/abs/2205.13600)

[](https://myosuite.readthedocs.io/en/latest/)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1U6vo6Q_rPhDaq6oUMV7EAZRm6s0fD1wn) | 16.06.2023 |

| Audiocraft | PyTorch library for deep learning research on audio generation | 

[Jade Copet](https://scholar.google.com/citations?&user=GRMLwjAAAAAJ)
 [Felix Kreuk](https://felixkreuk.github.io/)
 [Itai Gat](https://itaigat.com/)
others[Tal Remez](https://talremez.github.io/)
 [David Kant](https://www.linkedin.com/in/david-kant-339a3b1b7)
 [Gabriel Synnaeve](https://syhw.github.io/)
 [Yossi Adi](https://www.cs.huji.ac.il/~adiyoss/)
 [Alexandre Défossez](https://ai.honu.io/)

 | [![](https://img.shields.io/github/stars/facebookresearch/audiocraft?style=social)](https://github.com/facebookresearch/audiocraft) 

[](https://arxiv.org/abs/2306.05284), [](https://arxiv.org/abs/2301.11325)

[](https://github.com/facebookresearch/encodec), [](https://github.com/camenduru/MusicGen-colab)

[](https://huggingface.co/facebook/musicgen-large)

[project](https://ai.honu.io/papers/musicgen/)

[](https://youtu.be/v-YpvPkhdO4), [](https://www.youtube.com/watch?v=EGfxuTy9Eeo), [](https://youtu.be/la2fGS0dW98), [](https://youtu.be/v-YpvPkhdO4)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1fxGqfg96RBUvGxZ1XXN07s3DthrKUl4-) | 11.06.2023 |

| Detectron2 | FAIR's next-generation platform for object detection and segmentation | [Yuxin Wu](http://ppwwyyxx.com/) | [![](https://img.shields.io/github/stars/facebookresearch/detectron2?style=social)](https://github.com/facebookresearch/detectron2) 

[blog post](https://ai.facebook.com/blog/-detectron2-a-pytorch-based-modular-object-detection-library-/)

[](https://detectron2.readthedocs.io/en/latest/)

[](https://github.com/matterport/Mask_RCNN/tree/master/samples/balloon)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/16jcaJoc6bCFAQ96jDe2HwtXj7BMD_-m5) | 26.05.2023 |

| Reverb | Efficient and easy-to-use data storage and transport system designed for machine learning research | 

[Albin Cassirer](https://github.com/acassirer)
 [Gabriel Barth-Maron](https://github.com/fastturtle)
 [Eugene Brevdo](https://ebrevdo.github.io/)
others[Sabela Ramos](https://github.com/sabelaraga)
 [Toby Boyd](https://github.com/tfboyd)
 [Thibault Sottiaux](https://github.com/thso)

 | [![](https://img.shields.io/github/stars/google-deepmind/reverb?style=social)](https://github.com/google-deepmind/reverb) 

[](https://arxiv.org/abs/2102.04736), [](https://arxiv.org/abs/1801.01290), [](https://arxiv.org/abs/1509.02971), [](https://arxiv.org/abs/1707.01495), [](https://arxiv.org/abs/1511.05952), [](https://arxiv.org/abs/1804.08617), [](https://arxiv.org/abs/1802.01561), [](https://arxiv.org/abs/1707.06347)

[](https://pypi.org/project/dm-reverb/)

[](https://www.reddit.com/r/reinforcementlearning/comments/lhnrkd/reverb_a_framework_for_experience_replay/)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/deepmind/reverb/blob/master/examples/demo.ipynb) | 23.05.2023 |

| MMDetection | Open source object detection toolbox based on PyTorch | [OpenMMLab](https://openmmlab.com/) | [![](https://img.shields.io/github/stars/open-mmlab/mmdetection?style=social)](https://github.com/open-mmlab/mmdetection) 

[](https://arxiv.org/abs/1906.07155), [](https://arxiv.org/abs/2401.02361), [](https://arxiv.org/abs/2212.07784)

[](https://discord.com/channels/1037617289144569886/1046608014234370059)

[](https://mmdetection.readthedocs.io/en/latest/)

[](https://github.com/tusen-ai/simpledet), [](https://github.com/open-mmlab/mmcv), [](https://github.com/open-mmlab/mmengine)

[](https://openmmlab.medium.com/)

[](https://paperswithcode.com/sota/real-time-instance-segmentation-on-mscoco?p=rtmdet-an-empirical-study-of-designing-real), [](https://paperswithcode.com/sota/object-detection-in-aerial-images-on-dota-1?p=rtmdet-an-empirical-study-of-designing-real), [](https://paperswithcode.com/sota/object-detection-in-aerial-images-on-hrsc2016?p=rtmdet-an-empirical-study-of-designing-real)

[](https://pypi.org/project/mmdet)

[](https://twitter.com/OpenMMLab)

[](https://www.youtube.com/openmmlab), [](https://youtu.be/5kgWyo6Sg4E), [](https://youtu.be/4SuwN4xSM3Q), [](https://www.youtube.com/live/SWB2pTY3UDM), [](https://youtu.be/AEIDB6Dd6bM), [](https://youtu.be/7c2JKPMVPm0)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/open-mmlab/mmdetection/blob/main/demo/MMDet_Tutorial.ipynb) | 17.05.2023 |

| ChatRWKV | Like ChatGPT but powered by RWKV (100% RNN) language model, which is the only RNN that can match transformers in quality and scaling, while being faster and saves VRAM | 

[Bo Peng](https://github.com/BlinkDL)
 [Eric Alcaide](https://hypnopump.github.io/)
 [Quentin Anthony](https://quentin-anthony.github.io/)
others[Alon Albalak](https://alon-albalak.github.io/)
 [Samuel Arcadinho](https://github.com/SSamDav)
 [Matteo Grella](http://www.matteogrella.com/)
 [Kranthi Kiran](https://kranthigv.github.io/)
 [Haowen Hou](https://github.com/howard-hou)
 [Przemyslaw Kazienko](https://kazienko.eu/en)
 [Jan Kocon](https://github.com/KoconJan)
 [Bartlomiej Koptyra](https://github.com/bkoptyra)
 [Ipsit Mantri](https://ipsitmantri.github.io/)
 [Ferdinand Mom](https://3outeille.github.io/)
 [Xiangru Tang](https://github.com/tangxiangru)
 [Johan Wind](https://johanwind.github.io/)
 [Stanisław Woźniak](https://www.researchgate.net/profile/Stanislaw-Wozniak-3)
 [Qihang Zhao](https://www.researchgate.net/profile/Qihang-Zhao-2)
 [Peng Zhou](https://pengzhou.sites.ucsc.edu/)
 [Jian Zhu](https://lingjzhu.github.io/)
 [Rui-Jie Zhu](https://scholar.google.com/citations?user=08ITzJsAAAAJ)

 | [![](https://img.shields.io/github/stars/BlinkDL/ChatRWKV?style=social)](https://github.com/BlinkDL/ChatRWKV) 

[](https://arxiv.org/abs/2305.13048)

[](https://discord.gg/bDSBUMeFpc)

[](https://github.com/saharNooby/rwkv.cpp), [](https://github.com/harrisonvanderbyl/rwkv-cpp-cuda), [](https://github.com/Blealtan/RWKV-LM-LoRA), [](https://github.com/josStorer/RWKV-Runner)

[](https://huggingface.co/BlinkDL)

[](https://www.reddit.com/r/MachineLearning/comments/1135aew/r_rwkv4_14b_release_and_chatrwkv_a_surprisingly/)

[](https://twitter.com/BlinkDL_AI)

[website](https://www.rwkv.com/)

[](https://youtu.be/UeAD1qWNb1U)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/resloved/RWKV-notebooks/blob/master/RWKV_ChatRWKV.ipynb) | 08.05.2023 |

| PyGlove | General-purpose library for Python object manipulation | 

[Daiyi Peng](https://github.com/daiyip)
 [Xuanyi Dong](https://xuanyidong.com/)
 [Esteban Real](https://www.estebanreal.com/)
others[Mingxing Tan](https://scholar.google.com/citations?user=6POeyBoAAAAJ)
 [Yifeng Lu](https://github.com/yifenglou)
 [Gabriel Bender](https://scholar.google.com/citations?user=-kPFcUUAAAAJ)
 [Hanxiao Liu](https://quark0.github.io/)
 [Adam Kraft](https://adamwkraft.github.io/)
 [Chen Liang](https://crazydonkey200.github.io/)
 [Quoc Le](https://cs.stanford.edu/~quocle/)

 | [![](https://img.shields.io/github/stars/google/pyglove?style=social)](https://github.com/google/pyglove) 

[](https://arxiv.org/abs/2101.08809)

[](https://pyglove.readthedocs.io/en/latest/)

[](https://synced.medium.com/google-brain-introduces-symbolic-programming-pyglove-library-to-reformulate-automl-bde1d60cb8f6)

[](https://proceedings.neurips.cc/paper/2020/hash/012a91467f210472fab4e11359bbfef6-Abstract.html)

[](https://pypi.org/project/pyglove/)

[](https://www.reddit.com/r/Python/comments/lc9r1d/researchers_from_google_brain_introduces_symbolic/)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/google/pyglove/blob/main/docs/notebooks/ml/efficiently_exchange_ml_ideas_as_code.ipynb) | 06.05.2023 |

| Python Data Science Handbook | Jupyter notebook version of the Python Data Science Handbook by Jake VanderPlas | [Jake Vanderplas](http://vanderplas.com/) | [![](https://img.shields.io/github/stars/jakevdp/PythonDataScienceHandbook?style=social)](https://github.com/jakevdp/PythonDataScienceHandbook) [project](https://jakevdp.github.io/PythonDataScienceHandbook/) | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/jakevdp/PythonDataScienceHandbook/blob/master/notebooks/Index.ipynb) | 06.05.2023 |

| PGMax | General factor graphs for discrete probabilistic graphical models, and hardware-accelerated differentiable loopy belief propagation in JAX | 

[Guangyao Zhou](https://stanniszhou.github.io/)
 [Nishanth Kumar](http://nishanthjkumar.com/)
 [Antoine Dedieu](https://github.com/antoine-dedieu)
others[Miguel Lázaro-Gredilla](https://www.tsc.uc3m.es/~miguel/)
 [Shrinu Kushagra](https://cs.uwaterloo.ca/~skushagr/)
 [Dileep George](https://dileeplearning.github.io/)

 | [![](https://img.shields.io/github/stars/deepmind/PGMax?style=social)](https://github.com/deepmind/PGMax) 

[](https://arxiv.org/abs/2202.04110)

[](https://en.wikipedia.org/wiki/Belief_propagation)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/deepmind/PGMax/blob/main/examples/rcn.ipynb) | 05.05.2023 |

| StableLM | Stability AI Language Models | [Stability AI](https://stability.ai/research) | [![](https://img.shields.io/github/stars/Stability-AI/StableLM?style=social)](https://github.com/Stability-AI/StableLM) 

[blog post](https://stability.ai/blog/stability-ai-launches-the-first-of-its-stablelm-suite-of-language-models)

[](https://github.com/facebookresearch/llama), [](https://github.com/tatsu-lab/stanford_alpaca), [](https://github.com/nomic-ai/gpt4all), [](https://github.com/databrickslabs/dolly), [](https://github.com/anthropics/hh-rlhf), [](https://github.com/ggerganov/llama.cpp)

[](https://huggingface.co/lmsys/vicuna-13b-delta-v0), [](https://huggingface.co/datasets/RyokoAI/ShareGPT52K), [](https://huggingface.co/stabilityai)

[](https://youtu.be/dypPSs4t77g), [](https://youtu.be/nWf1StvtoRw), [](https://youtu.be/Hg-s2RTaTFE), [](https://youtu.be/qXtJjoEfTnA)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/Stability-AI/StableLM/blob/main/notebooks/stablelm-alpha.ipynb) | 27.04.2023 |

| TTS | A library for advanced Text-to-Speech generation, built on the latest research, was designed to achieve the best trade-off among ease-of-training, speed and quality | 

[Eren Gölge](https://github.com/erogol)
 [Aya-AlJafari](https://github.com/Aya-AlJafari)
 [Edresson Casanova](https://github.com/Edresson)
others[Josh Meyer](http://jrmeyer.github.io/)
 [Kelly Davis](https://github.com/kdavis-coqui)
 [Reuben Morais](https://github.com/reuben)

 | [![](https://img.shields.io/github/stars/coqui-ai/TTS?style=social)](https://github.com/coqui-ai/TTS) 

[blog post](https://coqui.ai/blog/tts/solving-attention-problems-of-tts-models-with-double-decoder-consistency)

[](https://tts.readthedocs.io/en/latest/)

[](https://github.com/coqui-ai/TTS-papers)

[samples](https://erogol.github.io/ddc-samples/)

[website](https://coqui.ai/)

[](https://youtu.be/ADnBCz0Wd1U), [](https://youtu.be/Yglxf2WbkLU), [](https://youtu.be/alpI-DnVlO0)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/coqui-ai/TTS/blob/dev/notebooks/Tutorial_2_train_your_first_TTS_model.ipynb) | 26.04.2023 |

| OpenCLIP | An open source implementation of CLIP | 

[Ross Wightman](https://rwightman.com/)
 [Cade Gordon](https://cadegordon.io/)
 [Vaishaal Shankar](http://vaishaal.com/)

 | [![](https://img.shields.io/github/stars/mlfoundations/open_clip?style=social)](https://github.com/mlfoundations/open_clip) 

[](https://arxiv.org/abs/2109.01903), [](https://arxiv.org/abs/2103.00020), [](https://arxiv.org/abs/2111.02114), [](https://arxiv.org/abs/2107.04649), [](https://arxiv.org/abs/1902.10811), [](https://arxiv.org/abs/2107.04649)

[data](https://ai.google.com/research/ConceptualCaptions/download), [data](https://laion.ai/blog/laion-5b/), [data](https://laion.ai/blog/laion-400-open-dataset/)

[](https://github.com/mlfoundations/wise-ft), [](https://github.com/webdataset/webdataset), [](https://github.com/webdataset/tarp), [](https://github.com/google-research-datasets/conceptual-12m)

[](https://huggingface.co/datasets/laion/laion2B-en), [](https://huggingface.co/laion/CLIP-ViT-B-32-laion2B-s34B-b79K), [](https://huggingface.co/laion/CLIP-ViT-L-14-laion2B-s32B-b82K), [](https://huggingface.co/laion/CLIP-ViT-H-14-laion2B-s32B-b79K), [](https://huggingface.co/laion/CLIP-ViT-g-14-laion2B-s12B-b42K)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/mlfoundations/open_clip/blob/master/docs/Interacting_with_open_clip.ipynb) | 16.04.2023 |

| Stable Baselines3 | Set of reliable implementations of reinforcement learning algorithms in PyTorch | 

[Antonin Raffin](https://araffin.github.io/)
 [Ashley Hill](https://hill-a.me/)
 [Adam Gleave](https://www.gleave.me/)
others[Anssi Kanervisto](https://github.com/Miffyli)
 [Maximilian Ernestus](https://github.com/ernestum)
 [Noah Dormann](https://github.com/ndormann)

 | [![](https://img.shields.io/github/stars/DLR-RM/stable-baselines3?style=social)](https://github.com/DLR-RM/stable-baselines3) 

[](https://stable-baselines3.readthedocs)

[](https://github.com/Stable-Baselines-Team/stable-baselines3-contrib), [](https://github.com/hill-a/stable-baselines), [](https://github.com/openai/gym/wiki/Environments)

[paper](https://jmlr.org/papers/v22/20-1364.html)

[](https://www.reddit.com/r/reinforcementlearning/)

[](https://www.youtube.com/playlist?list=PLQVvvaa0QuDf0O2DWwLZBfJeYY-JOeZB1)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/Stable-Baselines-Team/rl-colab-notebooks/blob/sb3/stable_baselines_getting_started.ipynb) | 14.04.2023 |

| RL Baselines3 Zoo | Training Framework for Stable Baselines3 Reinforcement Learning Agents | [Antonin Raffin](https://araffin.github.io/) | [![](https://img.shields.io/github/stars/DLR-RM/rl-baselines3-zoo?style=social)](https://github.com/DLR-RM/rl-baselines3-zoo) 

[](https://arxiv.org/abs/2005.05719)

[](https://stable-baselines3.readthedocs.io/en/master/)

[](https://github.com/DLR-RM/rl-baselines3-zoo), [](https://github.com/openai/roboschool), [](https://github.com/Farama-Foundation/Minigrid)

[](https://huggingface.co/sb3)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/Stable-Baselines-Team/rl-colab-notebooks/blob/sb3/rl-baselines-zoo.ipynb) | 14.04.2023 |

| Grounded-SAM | Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything | [IDEA-Research](https://www.idea.edu.cn/) | [![](https://img.shields.io/github/stars/IDEA-Research/Grounded-Segment-Anything?style=social)](https://github.com/IDEA-Research/Grounded-Segment-Anything) 

[](https://arxiv.org/abs/2304.02643), [](https://arxiv.org/abs/2303.05499)

[](https://github.com/MasterBin-IIAU/UNINEXT), [](https://github.com/IDEA-Research/OSX), [](https://github.com/dvlab-research/VoxelNeXt), [](https://github.com/UX-Decoder/Semantic-SAM), [](https://github.com/UX-Decoder/Segment-Everything-Everywhere-All-At-Once), [](https://github.com/IDEA-Research/OpenSeeD), [](https://github.com/Computer-Vision-in-the-Wild/CVinW_Readings), [](https://github.com/sail-sg/EditAnything), [](https://github.com/feizc/IEA), [](https://github.com/Li-Qingyun/sam-mmrotate), [](https://github.com/VainF/Awesome-Anything), [](https://github.com/RockeyCoss/Prompt-Segment-Anything)

[](https://youtu.be/oEQYStnF2l8), [](https://youtu.be/gKTYMfwPo4M), [](https://youtu.be/0Fpb8TBH0nM), [](https://youtu.be/GuEDDBWrN24)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/betogaona7/Grounded-Segment-Anything/blob/main/grounded_sam_colab_demo.ipynb) | 12.04.2023 |

| TFDS | Collection of ready-to-use datasets for use with TensorFlow, Jax, and other Machine Learning frameworks | [Ryan Sepassi](https://ryansepassi.com/) | [![](https://img.shields.io/github/stars/tensorflow/datasets?style=social)](https://github.com/tensorflow/datasets) 

[](https://towardsdatascience.com/youre-importing-data-wrong-c171f52eea00)

[](https://pypi.org/project/tensorflow-datasets/)

[](https://www.tensorflow.org/datasets)

[](https://youtu.be/YrMy-BAqk8k), [](https://youtu.be/6th3rahsw9Y), [](https://youtu.be/3HYy0SPd7TE), [](https://youtu.be/MvcK-MaXbHk)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/tensorflow/datasets/blob/master/docs/overview.ipynb) | 11.04.2023 |

| MMSegmentation | Open source semantic segmentation toolbox based on PyTorch | [OpenMMLab](https://openmmlab.com/) | [![](https://img.shields.io/github/stars/open-mmlab/mmsegmentation?style=social)](https://github.com/open-mmlab/mmsegmentation) 

[](https://discord.gg/raweFPmdzG)

[](https://mmsegmentation.readthedocs.io/en/main/)

[](https://openmmlab.medium.com/), [](https://mducducd33.medium.com/sematic-segmentation-using-mmsegmentation-bcf58fb22e42)

[](https://pypi.org/project/mmsegmentation)

[](https://twitter.com/OpenMMLab)

[](https://www.youtube.com/openmmlab)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/open-mmlab/mmsegmentation/blob/main/demo/MMSegmentation_Tutorial.ipynb) | 31.03.2023 |

| LAVIS | Python deep learning library for LAnguage-and-VISion intelligence research and applications | 

[Dongxu Li](https://github.com/dxli94)
 [Junnan Li](https://github.com/LiJunnan1992)
 [Hung Le](https://sites.google.com/view/henryle2018/home)
others[Guangsen Wang](https://github.com/guangsen-wang)
 [Silvio Savarese](https://scholar.google.com/citations?user=ImpbxLsAAAAJ)
 [Steven Hoi](https://sites.google.com/view/stevenhoi)

 | [![](https://img.shields.io/github/stars/salesforce/LAVIS?style=social)](https://github.com/salesforce/LAVIS) 

[](https://arxiv.org/abs/2209.09019), [](https://arxiv.org/abs/2305.06500), [](https://arxiv.org/abs/2301.12597), [](https://arxiv.org/abs/2212.10846), [](https://arxiv.org/abs/2210.08773)

[blog post](https://blog.salesforceairesearch.com/lavis-language-vision-library/)

[](https://opensource.salesforce.com/LAVIS//latest/index.html)

[](https://en.wikipedia.org/wiki/Merlion)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/salesforce/LAVIS/blob/main/projects/img2llm-vqa/img2llm_vqa.ipynb) | 24.03.2023 |

| pymdp | Package for simulating Active Inference agents in Markov Decision Process environments | 

[Conor Heins](https://github.com/conorheins)
 [Alec Tschantz](https://github.com/alec-tschantz)
 [Beren Millidge](https://www.beren.io/)
others[Brennan Klein](https://github.com/jkbren)
 [Arun Niranjan](https://github.com/Arun-Niranjan)
 [Daphne Demekas](https://github.com/daphnedemekas)

 | [![](https://img.shields.io/github/stars/infer-actively/pymdp?style=social)](https://github.com/infer-actively/pymdp) 

[](https://arxiv.org/abs/2201.03904)

[](https://pymdp-rtd.readthedocs.io/en/stable/)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/infer-actively/pymdp/blob/master/docs/notebooks/active_inference_from_scratch.ipynb) | 19.03.2023 |

| MMSelfSup | Open-source framework for visual pre-training | 

[Xiaohang Zhan](https://xiaohangzhan.github.io/)
 [Jiahao Xie](https://jiahao000.github.io/)
 [Enze Xie](https://xieenze.github.io/)
others[Xiangxiang Chu](https://github.com/cxxgtxy)
 [Zijian He](https://github.com/scnuhealthy)

 | [![](https://img.shields.io/github/stars/open-mmlab/mmselfsup?style=social)](https://github.com/open-mmlab/mmselfsup) 

[](https://discord.com/channels/1037617289144569886/1072798105428299817)

[](https://mmselfsup.readthedocs.io/en/latest/)

[](https://github.com/open-mmlab/mmengine)

[](https://openmmlab.medium.com/rebirth-openselfsup-is-upgraded-to-mmselfsup-67546f2d9e6)

[](https://pypi.org/project/mmselfsup/)

[](https://twitter.com/OpenMMLab)

[](https://www.youtube.com/openmmlab)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/open-mmlab/mmselfsup/blob/master/demo/mmselfsup_colab_tutorial.ipynb) | 14.03.2023 |

| Tzer | Coverage-Guided Tensor Compiler Fuzzing with Joint IR-Pass Mutation | 

[Jiawei Liu](https://jiawei-site.github.io/)
 [Yuxiang Wei](https://yuxiang.cs.illinois.edu/)
 [Sen Yang](https://github.com/syang-ng)
others[Yinlin Deng](https://dengyinlin.github.io/)
 [Lingming Zhang](http://lingming.cs.illinois.edu/)

 | [![](https://img.shields.io/github/stars/ise-uiuc/tzer?style=social)](https://github.com/ise-uiuc/tzer) 

[](https://arxiv.org/abs/2202.09947)

[](https://hub.docker.com/repository/docker/tzerbot/oopsla)

[](https://tzer.readthedocs.io/en/latest/index.html)

[](https://github.com/ganler/memcov)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/ise-uiuc/tzer/blob/main/bug-report.ipynb) | 09.03.2023 |

| ArtLine | A Deep Learning based project for creating line art portraits | [Vijish Madhavan](https://github.com/vijishmadhavan) | [![](https://img.shields.io/github/stars/vijishmadhavan/ArtLine?style=social)](https://github.com/vijishmadhavan/ArtLine) 

[](https://arxiv.org/abs/1805.08318), [](https://arxiv.org/abs/1710.10196), [](https://arxiv.org/abs/1707.02921), [](https://arxiv.org/abs/1603.08155)

[data](https://cg.cs.tsinghua.edu.cn/people/~Yongjin/APDrawingDB.zip)

[](https://github.com/yiranran/APDrawingGAN), [](https://github.com/jantic/DeOldify)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/vijishmadhavan/ArtLine/blob/main/ControlNet_%2BArtLine_.ipynb) | 03.03.2023 |

| AmpliGraph | A suite of neural machine learning models for relational Learning, a branch of machine learning that deals with supervised learning on knowledge graphs | 

[Luca Costabello](https://luca.costabello.info/)
 [Adrianna Janik](https://github.com/adrijanik)
 [Chan Le Van](https://github.com/chanlevan)
others[Nicholas McCarthy](https://github.com/NicholasMcCarthy)
 [Rory McGrath](http://www.rorymcgrath.ie/)
 [Sumit Pai](https://github.com/sumitpai)

 | [![](https://img.shields.io/github/stars/Accenture/AmpliGraph?style=social)](https://github.com/Accenture/AmpliGraph) 

[](http://arxiv.org/abs/1702.05563), [](http://arxiv.org/abs/1705.10744), [](https://arxiv.org/abs/2105.08683), [](http://arxiv.org/abs/1612.03975), [](https://arxiv.org/abs/1912.10000), [](https://arxiv.org/abs/1412.6575)

[](https://docs.ampligraph.org)

[](https://papers.nips.cc/paper/2013/hash/1cecc7a77928ca8133fa24680a88d2f9-Abstract.html), [](https://papers.nips.cc/paper/2013/hash/b337e84de8752b27eda3a12363109e80-Abstract.html)

[](https://youtu.be/gX_KHaU8ChI)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/Accenture/AmpliGraph/blob/main/docs/tutorials/AmpliGraphBasicsTutorial.ipynb) | 23.02.2023 |

| NMT with attention | This notebook trains a seq2seq model for Spanish to English translation | 

[Minh-Thang Luong](https://nlp.stanford.edu/~lmthang/)
 [Hieu Pham](https://huyhieupham.github.io/)
 [Christopher Manning](https://nlp.stanford.edu/~manning/)

 | 

[](https://arxiv.org/abs/1508.04025), [](https://arxiv.org/abs/1409.0473)

[data](http://www.manythings.org/anki/)

[](https://www.tensorflow.org/text/tutorials/nmt_with_attention)

[](https://en.wikipedia.org/wiki/Neural_machine_translation)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/tensorflow/text/blob/master/docs/tutorials/nmt_with_attention.ipynb) | 15.02.2023 |

| GLUE using BERT on TPU | This tutorial contains complete end-to-end code to train models on a TPU | [Anirudh Dubey](https://github.com/anirudh161) | 

[GLUE](https://gluebenchmark.com/)

[](https://arxiv.org/abs/1810.04805)

[](https://www.tensorflow.org/guide/tpu), [](https://www.tensorflow.org/text/tutorials/bert_glue)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/tensorflow/text/blob/master/docs/tutorials/bert_glue.ipynb) | 15.02.2023 |

| TensorBoard | Suite of web applications for inspecting and understanding your TensorFlow runs and graphs | [Yuan Tang](https://terrytangyuan.github.io/) | [![](https://img.shields.io/github/stars/tensorflow/tensorboard?style=social)](https://github.com/tensorflow/tensorboard) 

[](https://github.com/tensorflow/tensorflow/blob/master/tensorflow/python/training/supervisor.py)

[](https://www.tensorflow.org/tensorboard/get_started), [](https://www.tensorflow.org/api_docs/python/tf/summary), [](https://www.tensorflow.org/api_docs/python/tf/linalg/matmul), [](https://www.tensorflow.org/api_docs/python/tf/nn/relu), [](https://www.tensorflow.org/api_docs/python/tf/compat/v1/train/summary_iterator)

[website](https://tensorboard.dev/)

[](https://en.wikipedia.org/wiki/Reservoir_sampling)

[](https://youtu.be/eBbEDRsCmv4), [](https://youtu.be/BqgTU7_cBnk), [](https://youtu.be/qEQ-_EId-D0), [](https://youtu.be/3bownM3L5zM)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/tensorflow/tensorboard/blob/master/docs/scalars_and_keras.ipynb) | 10.02.2023 |

| High-performance Simulation with Kubernetes | This tutorial will describe how to set up high-performance simulation using a TFF runtime running on Kubernetes | [Jason Roselander](https://github.com/roselander) | 

[GKE](https://cloud.google.com/kubernetes-engine/)

[](https://paperswithcode.com/task/federated-learning)

[](https://pypi.org/project/tensorflow-federated/)

[shell](https://cloud.google.com/shell/)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/tensorflow/federated/blob/master/docs/tutorials/high_performance_simulation_with_kubernetes.ipynb) | 31.01.2023 |

| Compel | Text prompt weighting and blending library for transformers-type text embedding systems | [Damian Stewart](http://damianstewart.com/) | [![](https://img.shields.io/github/stars/damian0815/compel?style=social)](https://github.com/damian0815/compel) 

[](https://github.com/invoke-ai/InvokeAI/issues/2832)

[](https://huggingface.co/cactusfriend/nightmare-invokeai-prompts)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/damian0815/compel/blob/main/compel-demo.ipynb) | 26.01.2023 |

| DALL·E Flow | An interactive workflow for generating high-definition images from text prompt | 

[Han Xiao](https://hanxiao.io/)
 [Delgermurun Purevkhuu](https://delgermurun.com/)
 [Alex Cureton-Griffiths](http://blog.alexcg.net/)

 | [![](https://img.shields.io/github/stars/jina-ai/dalle-flow?style=social)](https://github.com/jina-ai/dalle-flow) 

[](https://github.com/Jack000/glid-3-xl), [](https://github.com/jina-ai/docarray)

[](https://huggingface.co/CompVis/stable-diffusion-v-1-4-original)

[](https://www.youtube.com/playlist?list=PL3UBBWOUVhFYRUa_gpYYKBqEAkO4sxmne), [](https://www.youtube.com/c/jina-ai)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/jina-ai/dalle-flow/blob/main/client.ipynb) | 26.01.2023 |

| Diffusers | Provides pretrained diffusion models across multiple modalities, such as vision and audio, and serves as a modular toolbox for inference and training of diffusion models | [Hugging Face](https://huggingface.co/) | [![](https://img.shields.io/github/stars/huggingface/diffusers?style=social)](https://github.com/huggingface/diffusers) 

[](https://arxiv.org/abs/2006.11239), [](https://arxiv.org/abs/2006.11239), [](https://arxiv.org/abs/2010.02502), [](https://arxiv.org/abs/2202.09778), [](https://arxiv.org/abs/2204.13902)

[](https://github.com/hojonathanho/diffusion), [](https://github.com/pesser/pytorch_diffusion), [](https://github.com/ermongroup/ddim), [](https://github.com/heejkoo/Awesome-Diffusion-Models)

[](https://huggingface.co/spaces/CompVis/text2img-latent-diffusion), [](https://huggingface.co/spaces/CompVis/celeba-latent-diffusion), [](https://huggingface.co/spaces/fusing/celeba-diffusion), [](https://huggingface.co/spaces/huggingface/diffuse-the-rest), [](https://huggingface.co/spaces/Shuang59/Composable-Diffusion)

[](https://towardsdatascience.com/hugging-face-just-released-the-diffusers-library-846f32845e65)

[](https://youtu.be/UzkdOg7wWmI)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/huggingface/notebooks/blob/main/diffusers/diffusers_intro.ipynb) | 17.01.2023 |

| Sample Factory | One of the fastest RL libraries focused on very efficient synchronous and asynchronous implementations of policy gradients | 

[Aleksei Petrenko](https://alex-petrenko.github.io/)
 [Zhehui Huang](https://zhehui-huang.github.io/)
 [Tushar Kumar](https://github.com/tushartk)
others[Gaurav Sukhatme](http://robotics.usc.edu/~gaurav/)
 [Vladlen Koltun](http://vladlen.info/)

 | [![](https://img.shields.io/github/stars/alex-petrenko/sample-factory?style=social)](https://github.com/alex-petrenko/sample-factory) 

[ICML](http://proceedings.mlr.press/v119/petrenko20a.html)

[](https://arxiv.org/abs/2006.11751)

[](https://www.samplefactory.dev/)

[](https://github.com/alex-petrenko/faster-fifo)

[](https://youtu.be/lLG17LKKSZc)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/alex-petrenko/sample-factory/blob/master/sf_examples/notebooks/samplefactory_hub_example.ipynb) | 17.01.2023 |

| Open-Assistant | Chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so | 

[Andreas Köpf](https://github.com/andreaskoepf)
 [Yannic Kilcher](https://github.com/yk)
 [Huu Nguyen](https://github.com/ontocord)
others[Christoph Schuhmann](http://christoph-schuhmann.de/)
 [Keith Stevens](https://fozziethebeat.github.io/)
 [Abdullah Barhoum](https://github.com/AbdBarho)
 [Nguyen Minh Duc](https://github.com/notmd)
 [Oliver Stanley](https://olliestanley.github.io/)
 [James Melvin Ebenezer](https://github.com/melvinebenezer)

 | [![](https://img.shields.io/github/stars/LAION-AI/Open-Assistant?style=social)](https://github.com/LAION-AI/Open-Assistant) 

[](https://arxiv.org/abs/2203.02155)

[](https://projects.laion.ai/Open-Assistant/)

[](https://huggingface.co/OpenAssistant)

[](https://generativeai.pub/open-assistant-a-free-and-open-source-alternative-to-chatgpt-67d15229813)

[website](https://open-assistant.io/)

[](https://youtu.be/64Izfm24FKA), [](https://youtu.be/ddG2fM9i4Kk), [](https://youtu.be/FQIHLFLrTw0)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/LAION-AI/Open-Assistant/blob/main/notebooks/data-augmentation/stackexchange-builder/stackexchange-builder.ipynb) | 14.01.2023 |

| panda-gym | Set of robotic environments based on PyBullet physics engine and gymnasium | 

[Quentin Gallouédec](https://gallouedec.com/)
 [Nicolas Cazin](https://github.com/NicolasCAZIN)
 [Emmanuel Dellandréa](http://perso.ec-lyon.fr/emmanuel.dellandrea/)
 [Liming Chen](https://sites.google.com/view/limingchen/accueil)

 | [![](https://img.shields.io/github/stars/qgallouedec/panda-gym?style=social)](https://github.com/qgallouedec/panda-gym) 

[](https://arxiv.org/abs/2106.13687)

[](https://panda-gym.readthedocs.io/en/latest/)

[](https://pypi.org/project/panda-gym/)

[](https://youtu.be/BgvpoSP45hA)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/qgallouedec/panda-gym/blob/master/examples/PickAndPlace.ipynb) | 02.01.2023 |

| BANMo | Given multiple casual videos capturing a deformable object, BANMo reconstructs an animatable 3D model, including an implicit canonical 3D shape, appearance, skinning weights, and time-varying articulations, without pre-defined shape templates or registered cameras | 

[Gengshan Yang](https://gengshan-y.github.io/)
 [Minh Vo](https://minhpvo.github.io/)
 [Natalia Neverova](https://nneverova.github.io/)
others[Deva Ramanan](http://www.cs.cmu.edu/~deva/)
 [Andrea Vedaldi](https://www.robots.ox.ac.uk/~vedaldi/)
 [Hanbyul Joo](https://jhugestar.github.io/)

 | [![](https://img.shields.io/github/stars/facebookresearch/banmo?style=social)](https://github.com/facebookresearch/banmo) 

[](https://arxiv.org/abs/2112.12761)

[](https://github.com/kwea123/nerf_pl), [](https://github.com/gengshan-y/rigidmask), [](https://github.com/ShichenLiu/SoftRas), [](https://github.com/ThibaultGROUEIX/ChamferDistancePytorch)

[project](https://banmo-www.github.io/)

[](https://youtu.be/1NUa-yvFGA0), [](https://youtu.be/jDTy-liFoCQ)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1dQJn1vsuz0DkyRZbOA1SulkVQ0V1kMUP) | 30.12.2022 |

| tensor_parallel | Run large PyTorch models on multiple GPUs in one line of code with potentially linear speedup | [Andrei Panferov](https://blog.panferov.org/) | [![](https://img.shields.io/github/stars/BlackSamorez/tensor_parallel?style=social)](https://github.com/BlackSamorez/tensor_parallel) 

[](https://github.com/microsoft/DeepSpeed), [](https://github.com/facebookresearch/fairscale), [](https://github.com/NVIDIA/Megatron-LM), [](https://github.com/tunib-ai/parallelformers), [](https://github.com/alpa-projects/alpa)

[](https://huggingface.co/docs/transformers/model_doc/gpt2)

[](https://www.kaggle.com/code/blacksamorez/tensor-parallel-int4-llm/), [](https://www.kaggle.com/code/muellerzr/multi-gpu-and-accelerate)

[](https://pypi.org/project/tensor-parallel/)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/BlackSamorez/tensor_parallel/blob/master/examples/training_flan-t5-xl.ipynb) | 29.12.2022 |

| TPU | Reference models and tools for Cloud TPUs | [Hongkun Yu](https://github.com/saberkun) | [![](https://img.shields.io/github/stars/tensorflow/tpu?style=social)](https://github.com/tensorflow/tpu) 

[website](https://cloud.google.com/tpu/)

[](https://en.wikipedia.org/wiki/Tensor_Processing_Unit)

[](https://youtu.be/W7A-9MYvPwI), [](https://youtu.be/MXxN4fv01c8), [](https://youtu.be/FsxthdQ_sL4), [](https://youtu.be/zEOtG-ChmZE), [](https://youtu.be/kBjYK3K3P6M), [](https://youtu.be/8j1MWZGNoXM), [](https://youtu.be/hszd5UqnfLk)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/tensorflow/tpu/blob/master/tools/colab/keras_mnist_tpu.ipynb) | 20.12.2022 |

| rliable | Library for reliable evaluation, even with a handful of runs, on reinforcement learning and machine learnings benchmarks | 

[Rishabh Agarwal](https://agarwl.github.io/)
 [Max Schwarzer](https://scholar.google.com/citations?user=YmWRSvgAAAAJ)
 [Pablo Castro](https://psc-g.github.io/)
others[Aaron Courville](https://mila.quebec/en/directory/aaron-courville)
 [Marc Bellemare](http://www.marcgbellemare.info/)

 | [![](https://img.shields.io/github/stars/google-research/rliable?style=social)](https://github.com/google-research/rliable) 

[](https://psc-g.github.io/)

[blog post](https://research.google/blog/rliable-towards-reliable-evaluation-reporting-in-reinforcement-learning/), [blog post](https://araffin.github.io/post/rliable/)

[](https://proceedings.neurips.cc/paper/2021/hash/f514cec81cb148559cf475e7426eed5e-Abstract.html)

[podcast](https://podcasts.apple.com/dk/podcast/deep-reinforcement-learning-at-the-edge-of/id1116303051?i=1000551066163)

[poster](https://agarwl.github.io/rliable/pdfs/Precipice_poster.pdf)

[project](https://agarwl.github.io/rliable/)

[slides](https://agarwl.github.io/rliable/assets/slides_mlc.pdf)

[](https://x.com/agarwl_/status/1432800830621687817)

[](https://youtu.be/XSY9JwqD-bw), [](https://youtu.be/gO33pSls-jI), [](https://youtu.be/HDyK3oNN2i0), [](https://youtu.be/mqcnHYwWzD8), [](https://youtu.be/E00gxHrHzZ4), [](https://youtu.be/M3OzJDAjz3o)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1a0pSD-1tWhMmeJeeoyZM1A-HCW3yf1xR) | 19.12.2022 |

| TF-Agents | A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning | 

[Sergio Guadarrama](https://github.com/sguada)
 [Anoop Korattikara](https://github.com/kbanoop)
 [Oscar Ramirez](https://github.com/oars)
others[Pablo Castro](https://psc-g.github.io/)
 [Ethan Holly](https://github.com/eholly-g)
 [Sam Fishman](http://sam.fish/)
 [Ke Wang](https://scholar.google.com/citations?user=QRYX59sAAAAJ)
 [Ekaterina Gonina](https://github.com/egonina)
 [Neal Wu](https://twitter.com/WuNeal)
 [Efi Kokiopoulou](https://github.com/efiko)
 [Luciano Sbaiz](https://scholar.google.com/citations?user=fKBmhcUAAAAJ)
 [Jamie Smith](https://scholar.google.com/citations?user=jk17mo8AAAAJ)
 [Gábor Bartók](https://github.com/bartokg)
 [Jesse Berent](https://www.linkedin.com/in/jesse-berent-a1b6875)
 [Chris Harris](https://www.linkedin.com/in/charris)
 [Vincent Vanhoucke](https://vincent.vanhoucke.com/)
 [Eugene Brevdo](https://ebrevdo.github.io/)

 | [![](https://img.shields.io/github/stars/tensorflow/agents?style=social)](https://github.com/tensorflow/agents) 

[](https://www.tensorflow.org/agents/api_docs/python/tf_agents)

[](https://towardsdatascience.com/introduction-to-tf-agents-a-library-for-reinforcement-learning-in-tensorflow-68ab9add6ad6), [](https://medium.com/analytics-vidhya/tf-agents-a-flexible-reinforcement-learning-library-for-tensorflow-5f125420f64b)

[](https://www.tensorflow.org/agents)

[](https://youtu.be/2nKD6zFQ8xI), [](https://youtu.be/-TTziY7EmUA), [](https://youtu.be/52DTXidSVWc), [](https://youtu.be/U7g7-Jzj9qo), [](https://youtu.be/tAOApRQAgpc), [](https://youtu.be/X4eruXqNbDc), [](https://youtu.be/g0yDlAbi6Pc), [](https://youtu.be/VmZI_YkfPBM), [](https://youtu.be/7QFSziiAnxI)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/tensorflow/agents/blob/master/docs/tutorials/0_intro_rl.ipynb) | 15.12.2022 |

| PyG | Library built upon PyTorch to easily write and train Graph Neural Networks for a wide range of applications related to structured data | 

[Matthias Fey](https://rusty1s.github.io/#/)
 [Jan Eric Lenssen](https://github.com/janericlenssen)

 | [![](https://img.shields.io/github/stars/pyg-team/pytorch_geometric?style=social)](https://github.com/pyg-team/pytorch_geometric) 

[](https://arxiv.org/abs/1903.02428), [](https://arxiv.org/abs/1801.07829), [](https://arxiv.org/abs/1609.02907), [](https://arxiv.org/abs/2003.03123), [](https://arxiv.org/abs/1905.05178), [](https://arxiv.org/abs/1706.08566), [](https://arxiv.org/abs/1907.10903), [](https://arxiv.org/abs/1905.07953)

[](https://pytorch-geometric.readthedocs.io/en/latest/)

[](https://github.com/snap-stanford/ogb/tree/master/examples), [](https://github.com/pyg-team/pyg-lib), [](https://github.com/rusty1s/pytorch_scatter), [](https://github.com/rusty1s/pytorch_sparse), [](https://github.com/rusty1s/pytorch_cluster), [](https://github.com/AntonioLonga/PytorchGeometricTutorial)

[](https://papers.nips.cc/paper/2018/hash/e77dbaf6759253c7c6d0efc5690369c7-Abstract.html), [](https://papers.nips.cc/paper/2017/hash/5dd9db5e033da9c6fb5ba83c7a7ebea9-Abstract.html), [](https://nips.cc/virtual/2020/public/poster_3fe230348e9a12c13120749e3f9fa4cd.html)

[](https://pytorch.org/tutorials/beginner/basics/optimization_tutorial.html#full-implementation)

[](https://www.youtube.com/playlist?list=PLGMXrbDNfqTzqxB1IGgimuhtfAhGd8lHF), [](https://www.youtube.com/playlist?list=PLGMXrbDNfqTwPxitLVHEbT9Pd6-oR_cud), [](https://youtu.be/-UjytpbqX4A)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1h3-vJGRVloF5zStxL5I0rSy4ZUPNsjy8) | 08.12.2022 |

| ruGPT3 | Example of inference of RuGPT3XL | [Anton Emelyanov](https://github.com/king-menin) | [![](https://img.shields.io/github/stars/ai-forever/ru-gpts?style=social)](https://github.com/ai-forever/ru-gpts) 

[cristofari](https://sbercloud.ru/ru/christofari)

[](https://github.com/microsoft/DeepSpeedExamples/tree/master/Megatron-LM)

[](https://huggingface.co/transformers/main_classes/model.html#transformers.generation_utils.GenerationMixin.generate)

[sparse attention](https://www.deepspeed.ai/tutorials/sparse-attention/)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/ai-forever/ru-gpts/blob/master/examples/ruGPT3XL_generation.ipynb) | 07.12.2022 |

| Mubert | Prompt-based music generation via Mubert API | [Ilya Belikov](https://github.com/ferluht) | [![](https://img.shields.io/github/stars/MubertAI/Mubert-Text-to-Music?style=social)](https://github.com/MubertAI/Mubert-Text-to-Music) 

[](https://mubert2.docs.apiary.io/)

[project](https://mubert.com/)

[](https://youtu.be/YJu0iXn-T_U), [](https://youtu.be/5UsaxJsFvAI), [](https://youtu.be/B0kkIpWifG4)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/ferluht/Mubert-Text-to-Music/blob/main/Mubert_Text_to_Music.ipynb) | 18.10.2022 |

| RuDOLPH | A fast and light text-image-text transformer designed for a quick and easy fine-tuning setup for the solution of various tasks: from generating images by text description and image classification to visual question answering and more | 

[Alex Shonenkov](https://github.com/shonenkov)
 [Misha Konstantinov](https://github.com/zeroshot-ai)

 | [![](https://img.shields.io/github/stars/ai-forever/ru-dolph?style=social)](https://github.com/ai-forever/ru-dolph) 

[](https://arxiv.org/abs/2005.14165), [](https://arxiv.org/abs/2102.12092), [](https://arxiv.org/abs/2103.00020)

[](https://pypi.org/project/rudolph/)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/ai-forever/ru-dolph/blob/master/jupyters/RUDOLPH_tune_i2t_pl.ipynb) | 06.10.2022 |

| Batch RL | Offline RL using the DQN replay dataset comprising the entire replay experience of a DQN agent on 60 Atari 2600 games | 

[Rishabh Agarwal](https://agarwl.github.io/)
 [Dale Schuurmans](https://webdocs.cs.ualberta.ca/~dale/)
 [Mohammad Norouzi](https://norouzi.github.io/)

 | [![](https://img.shields.io/github/stars/google-research/batch_rl?style=social)](https://github.com/google-research/batch_rl) 

[DQN](https://www.nature.com/articles/nature14236?wm=book_wap_0005)

[](https://arxiv.org/abs/1907.04543), [](https://arxiv.org/abs/1709.06009)

[blog post](https://ai.googleblog.com/2020/04/an-optimistic-perspective-on-offline.html)

[data](https://console.cloud.google.com/storage/browser/atari-replay-datasets), [data](https://research.google/resources/datasets/dqn-replay/)

[](https://github.com/openai/atari-py/tree/0.2.5/atari_py/atari_roms), [](https://github.com/mgbellemare/Arcade-Learning-Environment), [](https://github.com/mila-iqia/SGI/blob/master/src/offline_dataset.py), [](https://github.com/kzl/decision-transformer/tree/master/atari)

[project](https://offline-rl.github.io/)

[slides](https://docs.google.com/presentation/d/1ROltXr6FIeYKrnGl0tKHGWI0pL4Zo8CnvAK2-cdpQyY)

[talk](https://slideslive.com/38928373/an-optimistic-perspective-on-offline-deep-reinforcement-learning)

[](https://www.tensorflow.org/install/install_linux)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1ktlNni_vwFpFtCgUez-RHW0OdGc2U_Wv) | 04.10.2022 |

| RL Games | High performance RL library | 

[Denys Makoviichuk](https://github.com/Denys88)
 [Viktor Makoviychuk](https://github.com/ViktorM)

 | [![](https://img.shields.io/github/stars/Denys88/rl_games?style=social)](https://github.com/Denys88/rl_games) 

[](https://discord.gg/hnYRq7DsQh)

[](https://github.com/isaac-sim/IsaacGymEnvs), [](https://github.com/NVlabs/cule), [](https://github.com/NVlabs/tiny-cuda-nn)

[](https://pypi.org/project/rl-games/)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/Denys88/rl_games/blob/master/notebooks/brax_training.ipynb) | 27.09.2022 |

| ACME | A library of reinforcement learning components and agents | 

[Matt Hoffman](https://www.mwhoffman.com/)
 [Bobak Shahriari](https://github.com/bshahr)
 [John Aslanides](https://www.aslanides.io/)
others[Gabriel Barth-Maron](https://github.com/fastturtle)
 [Feryal Behbahani](https://feryal.github.io/)
 [Tamara Norman](https://github.com/tamaranorman)
 [Abbas Abdolmaleki](https://scholar.google.com/citations?user=cCYTVWQAAAAJ)
 [Albin Cassirer](https://github.com/acassirer)
 [Fan Yang](https://github.com/ddmbr)
 [Kate Baumli](https://github.com/katebaumli)
 [Sarah Henderson](https://www.linkedin.com/in/sarah-henderson-agilecoach/)
 [Alex Novikov](https://scholar.google.ru/citations?user=jMUkLqwAAAAJ)
 [Sergio Gómez Colmenarejo](https://scholar.google.ru/citations?user=0Dkf68EAAAAJ)
 [Serkan Cabi](https://scholar.google.ru/citations?&user=l-HhJaUAAAAJ)
 [Caglar Gulcehre](https://www.caglarg.com/)
 [Tom Le Paine](http://tomlepaine.github.io/)
 [Andrew Cowie](https://scholar.google.ru/citations?&user=aTvi5mUAAAAJ)
 [Ziyu Wang](https://ziyuw.github.io/)
 [Bilal Piot](https://scholar.google.ru/citations?&user=fqxNUREAAAAJ)
 [Nando de Freitas](https://github.com/nandodf)

 | [![](https://img.shields.io/github/stars/deepmind/acme?style=social)](https://github.com/deepmind/acme) 

[](https://arxiv.org/abs/2006.00979)

[blog post](https://www.deepmind.com/publications/acme-a-new-framework-for-distributed-reinforcement-learning)

[](https://dm-acme.readthedocs.io/en/latest/)

[](https://github.com/deepmind/dm_env)

[](https://youtu.be/NUwDr42bPOw), [](https://youtu.be/J1XCWjuyRaI), [](https://youtu.be/pFMuQWpHI5k)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/deepmind/acme/blob/master/examples/tutorial.ipynb) | 26.09.2022 |

| RWKV | Reinventing RNNs for the Transformer Era | 

[Bo Peng](https://github.com/BlinkDL)
 [Eric Alcaide](https://hypnopump.github.io/)
 [Quentin Anthony](https://quentin-anthony.github.io/)
others[Alon Albalak](https://alon-albalak.github.io/)
 [Samuel Arcadinho](https://github.com/SSamDav)
 [Matteo Grella](http://www.matteogrella.com/)
 [Kranthi Kiran](https://kranthigv.github.io/)
 [Haowen Hou](https://github.com/howard-hou)
 [Przemyslaw Kazienko](https://kazienko.eu/en)
 [Jan Kocon](https://github.com/KoconJan)
 [Bartlomiej Koptyra](https://github.com/bkoptyra)
 [Ipsit Mantri](https://ipsitmantri.github.io/)
 [Ferdinand Mom](https://3outeille.github.io/)
 [Xiangru Tang](https://github.com/tangxiangru)
 [Johan Wind](https://johanwind.github.io/)
 [Stanisław Woźniak](https://www.researchgate.net/profile/Stanislaw-Wozniak-3)
 [Qihang Zhao](https://www.researchgate.net/profile/Qihang-Zhao-2)
 [Peng Zhou](https://pengzhou.sites.ucsc.edu/)
 [Jian Zhu](https://lingjzhu.github.io/)
 [Rui-Jie Zhu](https://scholar.google.com/citations?user=08ITzJsAAAAJ)

 | [![](https://img.shields.io/github/stars/BlinkDL/RWKV-LM?style=social)](https://github.com/BlinkDL/RWKV-LM) 

[](https://arxiv.org/abs/2305.13048), [](https://arxiv.org/abs/2105.14103), [](https://arxiv.org/abs/2002.05202)

[data](https://dldata-public.s3.us-east-2.amazonaws.com/simplebooks.zip)

[demo](https://josephrocca.github.io/rwkv-v4-web/demo/)

[](https://discord.gg/bDSBUMeFpc)

[](https://github.com/saharNooby/rwkv.cpp), [](https://github.com/cgisky1980/ai00_rwkv_server), [](https://github.com/harrisonvanderbyl/rwkv-cpp-cuda), [](https://github.com/Blealtan/RWKV-LM-LoRA), [](https://github.com/TheRamU/Fay/blob/main/README_EN.md), [](https://github.com/ridgerchu/SpikeGPT), [](https://github.com/BlinkDL/RWKV-v2-RNN-Pile/tree/main/RWKV-v3), [](https://github.com/BlinkDL/SmallInitEmb), [](https://github.com/BlinkDL/RWKV-CUDA), [](https://github.com/BlinkDL/minGPT-tuned)

[](https://huggingface.co/BlinkDL), [](https://huggingface.co/BlinkDL/clip-guided-binary-autoencoder)

[](https://www.reddit.com/r/MachineLearning/comments/umq908/r_rwkvv2rnn_a_parallelizable_rnn_with/)

[](https://twitter.com/BlinkDL_AI), [](https://twitter.com/HochreiterSepp/status/1524270961314484227)

[website](https://www.rwkv.com/)

[](https://youtu.be/x8pW19wKfXQ), [](https://youtu.be/B3Qa2rRsaXo), [](https://youtu.be/w-xydM6C6Qc)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1F7tZoPZaWJf1fsCmZ5tjw6sYHiFOYVWM) | 21.09.2022 |

| NetKet | Open-source project delivering cutting-edge methods for the study of many-body quantum systems with artificial neural networks and machine learning techniques | 

[Filippo Vicentini](https://filippovicentini.com/)
 [Damian Hofmann](https://github.com/femtobit)
 [Attila Szabó](https://github.com/attila-i-szabo)
others[Dian Wu](https://github.com/wdphy16)
 [Christopher Roth](https://github.com/chrisrothUT)
 [Clemens Giuliani](https://github.com/inailuig)
 [Gabriel Pescia](https://github.com/gpescia)
 [Jannes Nys](https://github.com/jwnys)
 [Vladimir Vargas-Calderón](https://github.com/VolodyaCO)
 [Nikita Astrakhantsev](https://github.com/nikita-astronaut)
 [Giuseppe Carleo](https://github.com/gcarleo)
 [Kenny Choo](https://github.com/kchoo1118)
 [James Smith](https://jamesetsmith.github.io/)
 [Tom Westerhout](https://github.com/twesterhout)
 [Fabien Alet](https://github.com/fabienalet)
 [Emily Davis](https://github.com/emilyjd)
 [Stavros Efthymiou](https://github.com/stavros11)
 [Ivan Glasser](https://www.researchgate.net/profile/Ivan-Glasser)
 [Sheng-Hsuan Lin](https://shhslin.github.io/)
 [Marta Mauri](https://github.com/martamau)
 [Mazzola Guglielmo](https://www.ics.uzh.ch/en/research/research-groups/Guglielmo-Mazzola0.html)
 [Christian Mendl](http://christian.mendl.net/)
 [Evert Nieuwenburg](https://evert.info/)
 [Ossian O'Reilly](https://github.com/ooreilly)
 [Hugo Théveniaut](https://github.com/theveniaut)
 [Giacomo Torlai](https://github.com/GTorlai)
 [Alexander Wietek](https://awietek.github.io/)

 | [![](https://img.shields.io/github/stars/netket/netket?style=social)](https://github.com/netket/netket) 

[](https://arxiv.org/abs/2112.10526)

[](https://netket.readthedocs.io/en/latest/index.html)

[](https://github.com/mpi4jax/mpi4jax), [](https://github.com/cloudhan/jax-windows-builder)

[website](https://www.netket.org/)

[](https://youtu.be/Ryz-o71tuy8)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/PhilipVinc/Lectures/blob/main/2202_NetKet/01_intro.ipynb) | 15.09.2022 |

| Stable Diffusion | A latent text-to-image diffusion model | 

[Robin Rombach](https://github.com/rromb)
 [Andreas Blattmann](https://github.com/ablattmann)
 [Dominik Lorenz](https://github.com/qp-qp)
others[Patrick Esser](https://github.com/pesser)
 [Björn Ommer](https://ommer-lab.com/people/ommer/)

 | [![](https://img.shields.io/github/stars/CompVis/stable-diffusion?style=social)](https://github.com/CompVis/stable-diffusion) 

[](https://arxiv.org/abs/2205.11487), [](https://arxiv.org/abs/2207.12598), [](https://arxiv.org/abs/2202.09778), [](https://arxiv.org/abs/2108.01073)

[](https://arxiv.org/abs/2112.10752), [](https://github.com/christophschuhmann/improved-aesthetic-predictor), [](https://github.com/ShieldMnt/invisible-watermark), [](https://github.com/openai/guided-diffusion), [](https://github.com/lucidrains/denoising-diffusion-pytorch), [](https://github.com/lucidrains/x-transformers)

[](https://huggingface.co/CompVis), [](https://huggingface.co/datasets/laion/laion2B-en), [](https://huggingface.co/datasets/laion/laion-high-resolution)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/CompVis/stable-diffusion/blob/main/scripts/latent_imagenet_diffusion.ipynb) | 10.08.2022 |

| Deep-MAC | Welcome to the Novel class segmentation demo | [Vighnesh Birodkar](http://vighneshbirodkar.github.io/) | 

[](https://arxiv.org/abs/2104.00613)

[](https://paperswithcode.com/method/deep-mac)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/tensorflow/models/blob/master/research/object_detection/colab_tutorials/deepmac_colab.ipynb) | 10.08.2022 |

| NL-Augmenter | A collaborative effort intended to add transformations of datasets dealing with natural language | 

[Aadesh Gupta](https://github.com/aadesh11)
 [Timothy Sum Hon Mun](https://github.com/timothy22000)
 [Aditya Srivatsa](https://github.com/kvadityasrivatsa)
others[Xudong Shen](https://github.com/XudongOliverShen)
 [Juan Diego Rodriguez](https://github.com/juand-r)
 [Ashish Shrivastava](https://github.com/ashish3586)
 [Nagender Aneja](https://researchid.co/naneja)
 [Zijie Wang](https://zijie.wang/)
 [Yiwen Shi](https://github.com/Yiwen-Shi)
 [Afnan Mir](https://github.com/afnanmmir)
 [William Soto](https://github.com/sotwi)
 [Chandan Singh](https://csinva.io/)
 [Claude Roux](https://github.com/ClaudeRoux)
 [Abinaya Mahendiran](https://github.com/AbinayaM02)
 [Anna Shvets](https://github.com/asnota)
 [Kaustubh Dhole](https://github.com/kaustubhdhole)
 [Bryan Wilie](https://github.com/bryanwilie)
 [Jamie Simon](https://james-simon.github.io/)
 [Mukund Varma](https://github.com/MukundVarmaT)
 [Sang Han](https://github.com/jjangsangy)
 [Denis Kleyko](https://github.com/denkle)
 [Samuel Cahyawijaya](https://github.com/SamuelCahyawijaya)
 [Filip Cornell](https://github.com/Filco306)
 [Tanay Dixit](https://tanay2001.github.io/)
 [Connor Boyle](https://github.com/boyleconnor)
 [Genta Indra Winata](https://gentawinata.com/)
 [Seungjae Ryan Lee](https://github.com/seungjaeryanlee)
 [Marcin Namysl](https://github.com/mnamysl)
 [Roman Sitelew](https://github.com/RomanPlusPlus)
 [Zhenhao Li](https://zhenhaoli.net/)
 [Fiona Tan](https://tanfiona.github.io/)

 | [![](https://img.shields.io/github/stars/GEM-benchmark/NL-Augmenter?style=social)](https://github.com/GEM-benchmark/NL-Augmenter) 

[](https://arxiv.org/abs/2112.02721)

[website](https://gem-benchmark.com/nl_augmenter)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/GEM-benchmark/NL-Augmenter/blob/main/notebooks/Write_a_sample_transformation.ipynb) | 06.08.2022 |

| Accelerate | A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision | [Hugging Face](https://huggingface.co/) | [![](https://img.shields.io/github/stars/huggingface/accelerate?style=social)](https://github.com/huggingface/accelerate) [](https://huggingface.co/docs/accelerate/index) | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/huggingface/notebooks/blob/master/examples/accelerate/simple_nlp_example.ipynb) | 27.07.2022 |

| YOLOv5 on Custom Objects | This notebook shows training on your own custom objects | [Jacob Solawetz](https://blog.roboflow.com/author/jacob/) | 

[blog post](https://blog.roboflow.com/how-to-train-yolov5-on-a-custom-dataset/)

[data](https://public.roboflow.ai/object-detection/bccd)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1gDZ2xcTOgR39tGGs-EZ6i3RTs16wmzZQ) | 20.07.2022 |

| MindsEye | Graphical user interface built to run multimodal ai art models for free from a Google Colab, without needing edit a single line of code or know any programming | 

[multimodal.art](https://multimodal.art/)
 [João Paulo Apolinário Passos](http://www.apolinariopassos.com.br/portfolio/)

 | [![](https://img.shields.io/github/stars/multimodalart/mindseye?style=social)](https://github.com/multimodalart/mindseye) 

[](https://github.com/openai/guided-diffusion)

[project](https://multimodal.art/mindseye)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1cg0LZ5OfN9LAIB37Xq49as0fSJxcKtC5) | 06.07.2022 |

| BIG-bench | A collaborative benchmark intended to probe large language models and extrapolate their future capabilities | 

[Jaehoon Lee](https://jaehlee.github.io/)
 [Jascha Sohl-Dickstein](http://www.sohldickstein.com/)
 [Vinay Ramasesh](https://ramasesh.github.io/)
others[Sajant Anand](https://github.com/sajantanand)
 [Alicia Parrish](https://aliciaparrish.com/)
 [Ethan Dyer](https://github.com/ethansdyer)
 [Liam Dugan](http://liamdugan.com/)
 [Dieuwke Hupkes](https://github.com/dieuwkehupkes)
 [Daniel Freeman](https://github.com/cdfreeman-google)
 [Guy Gur-Ari](https://github.com/guygurari)
 [Aitor Lewkowycz](https://github.com/lewkowycz)

 | [![](https://img.shields.io/github/stars/google/BIG-bench?style=social)](https://github.com/google/BIG-bench) 

[API](https://google.github.io/BIG-bench/docs/html/bigbench/index.html)

[](https://arxiv.org/abs/2206.04615)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/google/BIG-bench/blob/master/notebooks/colab_examples.ipynb) | 28.06.2022 |

| HuggingArtists | Choose your favorite Artist and train a language model to write new lyrics based on their unique voice | [Aleksey Korshuk](https://github.com/AlekseyKorshuk) | [![](https://img.shields.io/github/stars/AlekseyKorshuk/huggingartists?style=social)](https://github.com/AlekseyKorshuk/huggingartists) [](https://huggingface.co/spaces/AlekseyKorshuk/huggingartists), [](https://huggingface.co/huggingartists) | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/AlekseyKorshuk/huggingartists/blob/master/huggingartists-demo.ipynb) | 25.06.2022 |

| Introduction to the TensorFlow Models NLP library | You will learn how to build transformer-based models for common NLP tasks including pretraining, span labelling and classification using the building blocks from NLP modeling library | [Chen Chen](https://github.com/chenGitHuber) | [![](https://img.shields.io/github/stars/tensorflow/models?style=social)](https://github.com/tensorflow/models/tree/master/official/nlp/modeling) [](https://arxiv.org/abs/1810.04805) | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/tensorflow/models/blob/master/official/colab/nlp/nlp_modeling_library_intro.ipynb) | 22.06.2022 |

| Cirq | A python framework for creating, editing, and invoking Noisy Intermediate Scale Quantum circuits | 

[Balint Pato](https://refactorium.com/)
 [Matthew Harrigan](https://mpharrigan.com/)
 [Animesh Sinha](https://github.com/AnimeshSinha1309)
others[Matthew Neeley](https://github.com/maffoo)
 [Dave Bacon](https://dabacon.org/)
 [Matteo Pompili](https://github.com/matpompili)
 [Michael Broughton](https://github.com/MichaelBroughton)

 | [![](https://img.shields.io/github/stars/quantumlib/Cirq?style=social)](https://github.com/quantumlib/Cirq) 

[](https://en.wikipedia.org/wiki/Quantum_logic_gate#Hadamard_gate)

[](https://youtu.be/16ZfkPRVf2w)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/quantumlib/Cirq/blob/master/docs/tutorials/basics.ipynb) | 21.06.2022 |

| CLIP-as-service | A low-latency high-scalability service for embedding images and text | [Han Xiao](https://hanxiao.io/) | [![](https://img.shields.io/github/stars/jina-ai/clip-as-service?style=social)](https://github.com/jina-ai/clip-as-service) 

[data](https://sites.google.com/view/totally-looks-like-dataset)

[](https://github.com/jina-ai/docarray)

[website](https://clip-as-service.jina.ai/)

[](https://www.youtube.com/playlist?list=PL3UBBWOUVhFYRUa_gpYYKBqEAkO4sxmne), [](https://www.youtube.com/c/jina-ai)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/jina-ai/clip-as-service/blob/main/docs/hosting/cas-on-colab.ipynb) | 19.06.2022 |

| Jina | MLOps framework that empowers anyone to build cross-modal and multi-modal applications on the cloud | [Han Xiao](https://hanxiao.io/) | [![](https://img.shields.io/github/stars/jina-ai/jina?style=social)](https://github.com/jina-ai/jina) 

[data](https://sites.google.com/view/totally-looks-like-dataset)

[](https://docs.jina.ai/)

[](https://github.com/jina-ai/example-grafana-prometheus/blob/main/grafana-dashboards/flow.json)

[hub](https://hub.jina.ai/)

[](https://www.youtube.com/playlist?list=PL3UBBWOUVhFYRUa_gpYYKBqEAkO4sxmne), [](https://www.youtube.com/c/jina-ai)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/jina-ai/jina/blob/master/docs/Using_Jina_on_Colab.ipynb) | 11.06.2022 |

| Aesthetics Predictor | A linear estimator on top of clip to predict the aesthetic quality of pictures | [LAION AI](https://laion.ai/) | [![](https://img.shields.io/github/stars/LAION-AI/aesthetic-predictor?style=social)](https://github.com/LAION-AI/aesthetic-predictor) 

[blog post](https://laion.ai/blog/laion-aesthetics/)

[](https://github.com/rom1504/embedding-reader/blob/main/examples/aesthetic_inference.py)

[](https://www.kaggle.com/discussions/general/464229)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/LAION-AI/aesthetic-predictor/blob/main/asthetics_predictor.ipynb) | 04.06.2022 |

| Flashlight | Fast, flexible machine learning library written entirely in C++ | 

[Jacob Kahn](https://jacobkahn.me/)
 [Vineel Pratap](https://github.com/vineelpratap)
 [Tatiana Likhomanenko](https://github.com/tlikhomanenko)
others[Qiantong Xu](https://github.com/xuqiantong)
 [Awni Hannun](https://awnihannun.com/)
 [Jeff Cai](https://ieeexplore.ieee.org/author/37086866180)
 [Paden Tomasello](https://github.com/padentomasello)
 [Ann Lee](https://scholar.google.com/citations?user=Am6PakYAAAAJ)
 [Edouard Grave](https://github.com/EdouardGrave)
 [Gilad Avidov](https://github.com/avidov)
 [Benoit Steiner](http://bsteiner.info/)
 [Vitaliy Liptchinsky](https://scholar.google.com/citations?user=zl4dA-gAAAAJ)
 [Gabriel Synnaeve](https://syhw.github.io/)
 [Ronan Collobert](https://ronan.collobert.com/)

 | [![](https://img.shields.io/github/stars/flashlight/flashlight?style=social)](https://github.com/flashlight/flashlight) 

[](https://arxiv.org/abs/2201.12465)

[](https://hub.docker.com/r/flml/flashlight/tags?page=1&ordering=last_updated&name=cuda-latest)

[](https://fl.readthedocs.io/en/latest/)

[](https://github.com/arrayfire/arrayfire), [](https://github.com/microsoft/vcpkg), [](https://github.com/arrayfire/arrayfire-ml/), [](https://github.com/nvidia/cub), [](https://github.com/USCiLab/cereal), [](https://github.com/nothings/stb), [](https://github.com/facebookincubator/gloo), [](https://github.com/oneapi-src/oneDNN), [](https://github.com/google/glog), [](https://github.com/gflags/gflags), [](https://github.com/flashlight/text)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/flashlight/flashlight/blob/master/flashlight/app/asr/tutorial/notebooks/FinetuneCTC.ipynb) | 01.06.2022 |

| RL Unplugged | Suite of benchmarks for offline reinforcement learning | 

[Caglar Gulcehre](https://www.caglarg.com/)
 [Ziyu Wang](https://ziyuw.github.io/)
 [Alexander Novikov](https://scholar.google.com/citations?user=jMUkLqwAAAAJ)
others[Tom Le Paine](http://tomlepaine.github.io/)
 [Sergio Gómez Colmenarejo](https://scholar.google.com/citations?user=0Dkf68EAAAAJ)
 [Konrad Żołna](https://github.com/kondiz)
 [Rishabh Agarwal](https://agarwl.github.io/)
 [Josh Merel](https://sites.google.com/site/jsmerel/)
 [Daniel Mankowitz](https://danielmankowitz.wixsite.com/danielm)
 [Cosmin Paduraru](https://scholar.google.com/citations?user=oz4Ca9AAAAAJ)
 [Gabriel Dulac-Arnold](http://gabe.squirrelsoup.net/)
 [Jerry Li](https://github.com/jerryli27)
 [Mohammad Norouzi](https://norouzi.github.io/)
 [Matt Hoffman](https://www.mwhoffman.com/)
 [Ofir Nachum](https://scholar.google.com/citations?user=C-ZlBWMAAAAJ)
 [George Tucker](https://sites.google.com/view/gjt)
 [Nicolas Heess](https://scholar.google.com/citations?user=79k7bGEAAAAJ)
 [Nando de Freitas](https://github.com/nandodf)

 | [![](https://img.shields.io/github/stars/deepmind/deepmind-research?style=social)](https://github.com/deepmind/deepmind-research/tree/master/rl_unplugged) 

[](https://arxiv.org/abs/2006.13888), [](https://arxiv.org/abs/1907.04543), [](https://arxiv.org/abs/1709.06009), [](https://arxiv.org/abs/1811.09656), [](https://arxiv.org/abs/1811.11711), [](https://arxiv.org/abs/1909.12238), [](https://arxiv.org/abs/1911.09451), [](https://arxiv.org/abs/1801.00690), [](https://arxiv.org/abs/2003.11881), [](https://arxiv.org/abs/2103.09575)

[data](https://console.cloud.google.com/storage/browser/rl_unplugged)

[](https://github.com/deepmind/lab), [](https://github.com/google-research/realworldrl_suite#installation)

[](https://youtu.be/n8yNYzbUMJ0)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/deepmind/deepmind_research/blob/master/rl_unplugged/dmlab_r2d2.ipynb) | 26.05.2022 |

| Text generation with RNN | This tutorial demonstrates how to generate text using a character-based RNN | [Anirudh Dubey](https://github.com/anirudh161) | 

[link](http://karpathy.github.io/2015/05/21/rnn-effectiveness/)

[](https://paperswithcode.com/task/text-generation)

[](https://www.tensorflow.org/text/tutorials/text_generation)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/tensorflow/text/blob/master/docs/tutorials/text_generation.ipynb) | 03.05.2022 |

| CLIPDraw | Synthesize drawings to match a text prompt | 

[Kevin Frans](https://www.kvfrans.com/)
 [Lisa Soros](https://scholar.google.com/citations?user=iUkpvMUAAAAJ)
 [Olaf Witkowski](https://olafwitkowski.com/)

 | [![](https://img.shields.io/github/stars/kvfrans/clipdraw?style=social)](https://github.com/kvfrans/clipdraw) 

[](https://arxiv.org/abs/2106.14843), [](https://arxiv.org/abs/1508.06576), [](https://arxiv.org/abs/2105.00162)

[blog post](https://kvfrans.com/clipdraw-exploring-text-to-drawing-synthesis/)

[](https://github.com/BachiLi/diffvg/blob/master/apps/painterly_rendering.py)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/kvfrans/clipdraw/blob/main/clipdraw.ipynb) | 29.04.2022 |

| DeepLab2 | Library for deep labeling, aiming to provide a unified and state-of-the-art TensorFlow codebase for dense pixel labeling tasks, including, but not limited to semantic segmentation, instance segmentation, panoptic segmentation, depth estimation, or even video panoptic segmentation | 

[Mark Weber](https://github.com/markweberdev)
 [Huiyu Wang](https://github.com/csrhddlam)
 [Siyuan Qiao](https://www.cs.jhu.edu/~syqiao/)
others[Jun Xie](https://github.com/ClaireXie)
 [Maxwell Collins](https://pages.cs.wisc.edu/~mcollins/)
 [Yukun Zhu](https://github.com/YknZhu)
 [Liangzhe Yuan](https://github.com/yuanliangzhe)
 [Dahun Kim](https://mcahny.github.io/)
 [Qihang Yu](https://yucornetto.github.io/)
 [Daniel Cremers](https://cvg.cit.tum.de/members/cremers)
 [Laura Leal-Taixé](https://dvl.in.tum.de/team/lealtaixe/)
 [Alan Yuille](https://www.cs.jhu.edu/~ayuille/index.html)
 [Florian Schroff](https://www.florian-schroff.de/)
 [Hartwig Adam](https://scholar.google.com/citations?user=fWd88tEAAAAJ)
 [Liang-Chieh Chen](http://liangchiehchen.com/)

 | [![](https://img.shields.io/github/stars/google-research/deeplab2?style=social)](https://github.com/google-research/deeplab2) 

[](https://arxiv.org/abs/2106.09748), [](https://arxiv.org/abs/2206.07704), [](https://arxiv.org/abs/2207.04044)

[](https://www.reddit.com/r/MachineLearning/comments/od2csk/p_deeplab2_a_tensorflow_library_for_deep_labeling/)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/google-research/deeplab2/blob/main/ViP_DeepLab_Demo.ipynb) | 27.04.2022 |

| CodeGen | Family of open-source model for program synthesis | 

[Erik Nijkamp](https://eriknijkamp.com/)
 [Bo Pang](https://scholar.google.com/citations?user=s9fNEVEAAAAJ)
 [Hiroaki Hayashi](https://hiroakih.me/)
others[Lifu Tu](https://lifu-tu.github.io/)
 [Huan Wang](https://huan-december.github.io/)
 [Yingbo Zhou](https://scholar.google.com/citations?user=H_6RQ7oAAAAJ)
 [Silvio Savarese](https://cvgl.stanford.edu/silvio/)
 [Caiming Xiong](http://cmxiong.com/)

 | [![](https://img.shields.io/github/stars/salesforce/CodeGen?style=social)](https://github.com/salesforce/CodeGen) 

[](https://arxiv.org/abs/2203.13474), [](https://arxiv.org/abs/2305.02309)

[](https://github.com/salesforce/jaxformer)

[](https://huggingface.co/models?search=salesforce+codegen)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1fQI8OgzMAR0bquCrvhlAtXSw6iMFbVgI) | 23.04.2022 |

| Jraph | library for graph neural networks in jax | 

[Jonathan Godwin](https://github.com/jg8610)
 [Thomas Keck](https://github.com/thomaskeck)
 [Peter Battaglia](https://scholar.google.com/citations?user=nQ7Ij30AAAAJ)
others[Victor Bapst](https://linkedin.com/in/victor-bapst-73430a89)
 [Thomas Kipf](https://tkipf.github.io/)
 [Yujia Li](https://yujiali.github.io/)
 [Kimberly Stachenfeld](https://neurokim.com/)
 [Petar Veličković](https://petar-v.com/)
 [Alvaro Sanchez-Gonzalez](https://github.com/alvarosg)

 | [![](https://img.shields.io/github/stars/google-deepmind/jraph?style=social)](https://github.com/google-deepmind/jraph) 

[](https://arxiv.org/abs/1806.01261)

[](https://jraph.readthedocs.io/en/latest/)

[](https://youtu.be/S3sRy4oqvCM)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/google-deepmind/educational/blob/master/colabs/summer_schools/intro_to_graph_nets_tutorial_with_jraph.ipynb) | 15.04.2022 |

| Text classification with RNN | This text classification tutorial trains a recurrent neural network on the IMDB large movie review dataset for sentiment analysis | [Anirudh Dubey](https://github.com/anirudh161) | 

[data](http://ai.stanford.edu/~amaas/data/sentiment/)

[link](https://developers.google.com/machine-learning/glossary/#recurrent_neural_network)

[](https://paperswithcode.com/task/text-classification)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/tensorflow/text/blob/master/docs/tutorials/text_classification_rnn.ipynb) | 17.03.2022 |

| TriMap | Dimensionality reduction technique based on triplet constraints, which preserves the global structure of the data better than the other commonly used methods such as t-SNE, LargeVis, and UMAP | 

[Ehsan Amid](https://sites.google.com/view/eamid/)
 [Manfred Warmuth](https://mwarmuth.bitbucket.io/)

 | [![](https://img.shields.io/github/stars/eamid/trimap?style=social)](https://github.com/eamid/trimap) 

[](https://arxiv.org/abs/1910.00204)

[data](https://www.cs.columbia.edu/CAVE/software/softlib/coil-100.php)

[](https://github.com/google-research/google-research/tree/master/trimap), [](https://github.com/spotify/annoy), [](https://github.com/zalandoresearch/fashion-mnist)

[](https://en.wikipedia.org/wiki/Principal_component_analysis#/media/File:GaussianScatterPCA.svg), [](https://en.wikipedia.org/wiki/MNIST_database)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/eamid/examples/blob/master/TriMap.ipynb#scrollTo=nSyGA-ymB-jN) | 17.03.2022 |

| RLDS | Reinforcement Learning Datasets and it is an ecosystem of tools to store, retrieve and manipulate episodic data in the context of Sequential Decision Making including RL, Learning for Demonstrations, Offline RL or Imitation Learning | 

[Sabela Ramos](https://github.com/sabelaraga)
 [Sertan Girgin](https://sites.google.com/site/girgint/home)
 [Léonard Hussenot](https://leonardhussenot.github.io/)
others[Damien Vincent](https://www.linkedin.com/in/damien-vincent-1958381)
 [Hanna Yakubovich](https://github.com/yakubanna)
 [Daniel Toyama](https://github.com/kenjitoyama)
 [Anita Gergely](https://www.linkedin.com/in/anita-g-318064b2/)
 [Piotr Stanczyk](https://scholar.google.com/citations?user=fKVK0dYAAAAJ)
 [Raphaël Marinier](https://github.com/RaphaelMarinier)
 [Jeremiah Harmsen](https://github.com/jharmsen)
 [Olivier Pietquin](https://research.google/people/105812/)
 [Nikola Momchev](https://scholar.google.com/citations?user=PbWgaswAAAAJ)

 | [![](https://img.shields.io/github/stars/google-research/rlds?style=social)](https://github.com/google-research/rlds) 

[](https://arxiv.org/abs/2111.02767)

[blog post](https://ai.googleblog.com/2021/12/rlds-ecosystem-to-generate-share-and.html)

[](https://github.com/deepmind/envlogger), [](https://github.com/google-research/rlds-creator), [](https://github.com/Farama-Foundation/D4RL), [](https://github.com/deepmind/dm_env/blob/master/docs/index.md)

[](http://www.tensorflow.org/datasets/catalog/overview), [](https://www.tensorflow.org/datasets/catalog/robosuite_panda_pick_place_can), [](https://www.tensorflow.org/datasets/catalog/locomotion), [](https://www.tensorflow.org/datasets/catalog/mt_opt), [](https://www.tensorflow.org/datasets/external_tfrecord?hl=en#load_dataset_with_tfds), [](https://www.tensorflow.org/api_docs/python/tf/data), [](https://www.tensorflow.org/guide/data_performance#optimize_performance), [](https://www.tensorflow.org/api_docs/python/tf/data/Dataset#shuffle), [](https://www.tensorflow.org/datasets/splits), [](https://www.tensorflow.org/datasets/api_docs/python/tfds/load)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/google-research/rlds/blob/main/rlds/examples/rlds_tutorial.ipynb) | 16.03.2022 |

| Real-Time Voice Cloning | SV2TTS with a vocoder that works in real-time | 

[Corentin Jemine](https://github.com/CorentinJ)
 [Erdene-Ochir Tuguldur](https://github.com/tugstugi)

 | [![](https://img.shields.io/github/stars/CorentinJ/Real-Time-Voice-Cloning?style=social)](https://github.com/CorentinJ/Real-Time-Voice-Cloning) 

[](https://arxiv.org/abs/1806.04558), [](https://arxiv.org/abs/1802.08435), [](https://arxiv.org/abs/1703.10135), [](https://arxiv.org/abs/1710.10467)

[](https://github.com/fatchord/WaveRNN), [](https://github.com/coqui-ai/tts), [](https://github.com/resemble-ai/Resemblyzer)

[](https://youtu.be/-O_hYhToKoA)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/tugstugi/dl-colab-notebooks/blob/master/notebooks/RealTimeVoiceCloning.ipynb) | 08.03.2022 |

| BLIP | VLP framework which transfers flexibly to both vision-language understanding and generation tasks | 

[Junnan Li](https://github.com/LiJunnan1992)
 [Dongxu Li](https://sites.google.com/view/dongxu-li/home)
 [Caiming Xiong](http://cmxiong.com/)
 [Steven Hoi](https://sites.google.com/view/stevenhoi)

 | [![](https://img.shields.io/github/stars/salesforce/BLIP?style=social)](https://github.com/salesforce/BLIP) 

[](https://arxiv.org/abs/2201.12086)

[blog post](https://blog.salesforceairesearch.com/blip-bootstrapping-language-image-pretraining/)

[](https://github.com/facebookresearch/fairscale), [](https://github.com/salesforce/ALPRO), [](https://github.com/dmlc/decord), [](https://github.com/salesforce/ALBEF), [](https://github.com/rwightman/pytorch-image-models/tree/main/timm)

[](https://youtu.be/X2k7n4FuI7c)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/salesforce/BLIP/blob/main/demo.ipynb) | 03.03.2022 |

| VideoGPT | A conceptually simple architecture for scaling likelihood based generative modeling to natural videos | 

[Wilson Yan](https://wilson1yan.github.io/)
 [Yunzhi Zhang](https://zzyunzhi.github.io/)
 [Pieter Abbeel](https://people.eecs.berkeley.edu/~pabbeel/)
 [Aravind Srinivas](https://people.eecs.berkeley.edu/~aravind/)

 | [![](https://img.shields.io/github/stars/wilson1yan/VideoGPT?style=social)](https://github.com/wilson1yan/VideoGPT) 

[](https://arxiv.org/abs/2104.10157), [](https://arxiv.org/abs/1904.10509)

[data](https://www.crcv.ucf.edu/data/UCF101.php)

[](https://huggingface.co/spaces/akhaliq/VideoGPT)

[project](https://wilson1yan.github.io/videogpt/index.html)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/wilson1yan/VideoGPT/blob/master/notebooks/Using_VideoGPT.ipynb) | 02.03.2022 |

| Silero Models | Pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple | [Silero team](https://www.silero.ai/about/) | [![](https://img.shields.io/github/stars/snakers4/silero-models?style=social)](https://github.com/snakers4/silero-models) 

[STT](https://thegradient.pub/towards-an-imagenet-moment-for-speech-to-text/), [STT](https://thegradient.pub/a-speech-to-text-practitioners-criticisms-of-industry-and-academia/), [STT](https://habr.com/ru/post/519562/)

[TTS](https://habr.com/ru/post/660571/), [TTS](https://habr.com/ru/post/549482/)

[Text Enhancement](https://habr.com/ru/post/581960/)

[VAD](https://thegradient.pub/one-voice-detector-to-rule-them-all/), [VAD](https://habr.com/ru/post/537276/)

[website](https://www.silero.ai/)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/snakers4/silero-models/blob/master/examples.ipynb) | 27.02.2022 |

| Real-CUGAN | AI super resolution model for anime images, trained in a million scale anime dataset, using the same architecture as Waifu2x-CUNet | [bilibili](https://github.com/bilibili) | [![](https://img.shields.io/github/stars/bilibili/ailab?style=social)](https://github.com/bilibili/ailab/tree/main/Real-CUGAN) 

[](https://github.com/nihui/realcugan-ncnn-vulkan), [](https://github.com/nagadomi/nunif), [](https://github.com/Justin62628/Squirrel-RIFE)

[](https://huggingface.co/spaces/mayhug/Real-CUGAN)

[](https://youtu.be/IVo19n4zFsc)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/bilibili/ailab/blob/main/Real-CUGAN/colab-demo.ipynb) | 27.02.2022 |

| ArcaneGAN | Process video in the style of the Arcane animated series | [Alexander Spirin](https://github.com/Sxela) | [![](https://img.shields.io/github/stars/Sxela/ArcaneGAN?style=social)](https://github.com/Sxela/ArcaneGAN) 

[](https://github.com/Sxela/stylegan3_blending)

[](https://youtu.be/Fi199uFW6jE), [](https://youtu.be/AJG4X7IokG8)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1r1hhciakk5wHaUn1eJk7TP58fV9mjy_W) | 17.02.2022 |

| textlesslib | A library aimed to facilitate research in Textless NLP | 

[Eugene Kharitonov](https://eugene-kharitonov.github.io/)
 [Jade Copet](https://scholar.google.com/citations?user=GRMLwjAAAAAJ)
 [Kushal Lakhotia](https://about.me/hikushalhere)
others[Nguyễn Tú Anh](https://tuanh208.github.io/)
 [Paden Tomasello](https://scholar.google.com/citations?user=sBtWMGYAAAAJ)
 [Ann Lee](https://ai.facebook.com/people/ann-lee)
 [Ali Elkahky](https://scholar.google.com/citations?user=KB3S8RoAAAAJ)
 [Wei-Ning Hsu](https://wnhsu.github.io/)
 [Abdelrahman Mohamed](https://ai.facebook.com/people/abdelrahman-mohamed/)
 [Emmanuel Dupoux](http://www.lscp.net/persons/dupoux/)
 [Yossi Adi](https://www.cs.huji.ac.il/~adiyoss/)

 | [![](https://img.shields.io/github/stars/facebookresearch/textlesslib?style=social)](https://github.com/facebookresearch/textlesslib) 

[](https://arxiv.org/abs/2202.07359)

[](https://github.com/NVIDIA/waveglow), [](https://github.com/keithito/tacotron), [](https://github.com/NVIDIA/tacotron2), [](https://github.com/pseeth/torch-stft)

[](https://paperswithcode.com/dataset/librispeech)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/facebookresearch/textlesslib/blob/main/examples/resynthesis_and_continuation.ipynb) | 15.02.2022 |

| AV-HuBERT | Self-supervised representation learning framework for audio-visual speech | 

[Bowen Shi](https://home.ttic.edu/~bshi/)
 [Wei-Ning Hsu](http://people.csail.mit.edu/wnhsu/)
 [Kushal Lakhotia](https://about.me/hikushalhere)
 [Abdelrahman Mohamed](http://www.cs.toronto.edu/~asamir/)

 | [![](https://img.shields.io/github/stars/facebookresearch/av_hubert?style=social)](https://github.com/facebookresearch/av_hubert) 

[](https://arxiv.org/abs/2201.02184), [](https://arxiv.org/abs/2201.01763), [](https://arxiv.org/abs/1810.04805), [](https://arxiv.org/abs/1911.04890)

[blog post](https://ai.facebook.com/blog/ai-that-understands-speech-by-looking-as-well-as-hearing/)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1bNXkfpHiVHzXQH8WjGhzQ-fsDxolpUjD) | 12.02.2022 |

| Lingvo | Framework for building neural networks in Tensorflow, particularly sequence models | 

[Jonathan Shen](https://github.com/jonathanasdf)
 [Patrick Nguyen](https://scholar.google.com/citations?user=38fqeIYAAAAJ)
 [Yonghui Wu](https://scholar.google.com/citations?user=55FnA9wAAAAJ)
 [Zhifeng Chen](https://github.com/zffchen78)

 | [![](https://img.shields.io/github/stars/tensorflow/lingvo?style=social)](https://github.com/tensorflow/lingvo) 

[](https://arxiv.org/abs/1902.08295), [](https://arxiv.org/abs/1508.01211), [](https://arxiv.org/abs/1412.1602), [](https://arxiv.org/abs/1602.02410), [](https://arxiv.org/abs/2006.16668), [](https://arxiv.org/abs/2106.04060)

[](https://github.com/tensorflow/lingvo/blob/master/docker/dev.Dockerfile), [](https://github.com/tensorflow/lingvo/blob/master/docker/lib.dockerfile)

[](https://tensorflow.github.io/lingvo/)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/tensorflow/lingvo/blob/master/codelabs/introduction.ipynb) | 28.01.2022 |

| DeepDream | This tutorial contains a minimal implementation of DeepDream: an experiment that visualizes the patterns learned by a neural network | 

[Alexander Mordvintsev](https://znah.net/)
 [Billy Lamberta](https://github.com/lamberta)

 | 

[](https://arxiv.org/abs/1409.4842)

[blog post](https://research.google/blog/inceptionism-going-deeper-into-neural-networks/)

[](https://medium.com/@nik.nagarajan2/deepdream-a-psychedelic-ai-experience-ab482dd5228b), [](https://towardsdatascience.com/dreaming-over-text-f6745c829cee)

[](https://www.tensorflow.org/tutorials/generative/deepdream)

[](https://en.wikipedia.org/wiki/Inception), [](https://en.wikipedia.org/wiki/DeepDream)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/tensorflow/docs/blob/master/site/en/tutorials/generative/deepdream.ipynb) | 13.01.2022 |

| FuseDream | Training-Free Text-to-Image Generation with Improved CLIP+GAN Space Optimization | 

[Xingchao Liu](https://scholar.google.com/citations?user=VOTVE0UAAAAJ)
 [Chengyue Gong](https://github.com/ChengyueGongR)
 [Lemeng Wu](https://github.com/klightz)
others[Hao Su](https://cseweb.ucsd.edu//~haosu/)
 [Qiang Liu](https://www.cs.utexas.edu/~lqiang/)

 | [](https://arxiv.org/abs/2112.01573) | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/190tKQf0aFj-Hi8STUrLc2m4DeOviv7NO) | 02.01.2022 |

| MLP | The most basic neural network architectures, a multilayer perceptron, also known as a feedforward network | [Ben Trevett](https://bentrevett.com/) | 

[NN and DL](http://neuralnetworksanddeeplearning.com/)

[](https://arxiv.org/abs/1702.03118), [](https://arxiv.org/abs/2108.12943), [](https://arxiv.org/abs/2111.04020)

[optimization](https://ruder.io/optimizing-gradient-descent/)

[](https://pytorch.org/vision/stable/transforms.html#transforms-on-pil-image-only), [](https://pytorch.org/vision/stable/transforms.html#transforms-on-torch-tensor-only)

[](https://en.wikipedia.org/wiki/Multilayer_perceptron)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/bentrevett/pytorch-image-classification/blob/master/1_mlp.ipynb) | 26.12.2021 |

| AlexNet | A neural network model that uses convolutional neural network layers and was designed for the ImageNet challenge | [Ben Trevett](https://bentrevett.com/) | [![](https://img.shields.io/github/stars/davidtvs/pytorch-lr-finder?style=social)](https://github.com/davidtvs/pytorch-lr-finder) 

[ILSVRC](https://image-net.org/challenges/LSVRC/)

[LR](https://sgugger.github.io/how-do-you-find-a-good-learning-rate.html)

[PMLR](https://proceedings.mlr.press/v9/glorot10a.html)

[](https://arxiv.org/abs/1409.0575)

[cifar-10](https://www.cs.toronto.edu/~kriz/cifar.html)

[dropout](https://sebastianraschka.com/faq/docs/dropout-activation.html)

[](https://papers.nips.cc/paper/2012/hash/c399862d3b9d6b76c8436e924a68c45b-Abstract.html)

[](https://pytorch.org/vision/stable/models.html)

[](https://paperswithcode.com/method/alexnet)

[](https://en.wikipedia.org/wiki/Regularization_(mathematics)), [](https://en.wikipedia.org/wiki/AlexNet)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/bentrevett/pytorch-image-classification/blob/master/3_alexnet.ipynb) | 26.12.2021 |

| VGG | Very Deep Convolutional Networks for Large-Scale Image Recognition | [Ben Trevett](https://bentrevett.com/) | [![](https://img.shields.io/github/stars/pytorch/vision?style=social)](https://github.com/pytorch/vision/blob/main/torchvision/models/vgg.py#L47) 

[ILSVRC](https://image-net.org/challenges/LSVRC/)

[](https://arxiv.org/abs/1409.1556), [](https://arxiv.org/abs/1506.01186), [](https://arxiv.org/abs/1801.06146), [](https://arxiv.org/abs/1502.03167), [](https://arxiv.org/abs/1805.11604)

[cifar-10](https://www.cs.toronto.edu/~kriz/cifar.html)

[](https://pytorch.org/vision/stable/models.html)

[](https://paperswithcode.com/method/vgg)

[](https://youtu.be/HR0lt1hlR6U?t=5900), [](https://youtu.be/j1jIoHN3m0s), [](https://youtu.be/RNnKtNrsrmg)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/bentrevett/pytorch-image-classification/blob/master/4_vgg.ipynb) | 26.12.2021 |

| LeNet | A neural network model that uses convolutional neural network layers and was designed for classifying handwritten characters | [Ben Trevett](https://bentrevett.com/) | 

[CNN](https://cs231n.github.io/convolutional-networks/)

[LeNet-5](http://yann.lecun.com/exdb/lenet/)

[guide](https://adeshpande3.github.io/A-Beginner%27s-Guide-To-Understanding-Convolutional-Neural-Networks/)

[paper](http://yann.lecun.com/exdb/publis/pdf/lecun-01a.pdf)

[](https://paperswithcode.com/method/lenet)

[](https://en.wikipedia.org/wiki/Convolution), [](https://en.wikipedia.org/wiki/Sobel_operator), [](https://en.wikipedia.org/wiki/Gaussian_blur)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/bentrevett/pytorch-image-classification/blob/master/2_lenet.ipynb) | 26.12.2021 |

| Music Composer | Synthesizing symbolic music in MIDI format using the Music Transformer model | [bazanovvanya](https://github.com/bazanovvanya) | [![](https://img.shields.io/github/stars/ai-forever/music-composer?style=social)](https://github.com/ai-forever/music-composer) 

[](https://arxiv.org/abs/1909.05858)

[blog post](https://habr.com/ru/company/sberbank/blog/583592/)

[data](https://magenta.tensorflow.org/datasets/maestro), [data](https://colinraffel.com//projects/lmd/)

[](https://github.com/gwinndr/MusicTransformer-Pytorch), [](https://github.com/bytedance/GiantMIDI-Piano), [](https://github.com/mdeff/fma)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/ai-forever/music-composer/blob/master/src/Music_Composer_Demo_Colab_en.ipynb) | 20.12.2021 |

| FLAML | Lightweight Python library that finds accurate machine learning models automatically, efficiently and economically | 

[Chi Wang](https://github.com/sonichi)
 [Qingyun Wu](https://qingyun-wu.github.io/)

 | [![](https://img.shields.io/github/stars/microsoft/FLAML?style=social)](https://github.com/microsoft/FLAML) 

[](https://arxiv.org/abs/2106.04815), [](https://arxiv.org/abs/2005.01571)

[](https://microsoft.github.io/FLAML/)

[paper](https://www.microsoft.com/en-us/research/publication/flaml-a-fast-and-lightweight-automl-library/)

[](https://www.youtube.com/channel/UCfU0zfFXHXdAd5x-WvFBk5A), [](https://youtu.be/euXpDYGgkGM)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/microsoft/FLAML/blob/master/notebook/flaml_automl.ipynb) | 17.12.2021 |

| CompilerGym | A reinforcement learning toolkit for compiler optimizations | 

[Chris Cummins](https://chriscummins.cc/)
 [Bram Wasti](https://github.com/bwasti)
 [Jiadong Guo](https://jd-eth.github.io/)
others[Brandon Cui](https://www.linkedin.com/in/bcui19/)
 [Jason Ansel](https://jasonansel.com/)
 [Sahir Gomez](https://github.com/sahirgomez1)
 [Olivier Teytaud](https://github.com/teytaud)
 [Benoit Steiner](http://bsteiner.info/)
 [Yuandong Tian](http://yuandong-tian.com/)
 [Hugh Leather](https://github.com/hughleat)

 | [![](https://img.shields.io/github/stars/facebookresearch/CompilerGym?style=social)](https://github.com/facebookresearch/CompilerGym) 

[](https://arxiv.org/abs/2109.08267)

[](https://facebookresearch.github.io/CompilerGym/)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/facebookresearch/CompilerGym/blob/development/examples/getting-started.ipynb) | 16.11.2021 |

| Reformer | Performs on par with Transformer models while being much more memory-efficient and much faster on long sequences | 

[Phil Wang](https://lucidrains.github.io/)
 [Nikita Kitaev](https://kitaev.com/)
 [Łukasz Kaiser](https://scholar.google.com/citations?user=JWmiQR0AAAAJ)
 [Anselm Levskaya](https://anselmlevskaya.com/)

 | [![](https://img.shields.io/github/stars/lucidrains/reformer-pytorch?style=social)](https://github.com/lucidrains/reformer-pytorch) 

[](https://arxiv.org/abs/2001.04451), [](https://arxiv.org/abs/1907.01470), [](https://arxiv.org/abs/1910.05895), [](https://arxiv.org/abs/1909.11556), [](https://arxiv.org/abs/1911.02150), [](https://arxiv.org/abs/2002.05202), [](https://arxiv.org/abs/2003.05997), [](https://arxiv.org/abs/2003.04887), [](https://arxiv.org/abs/2002.07028), [](https://arxiv.org/abs/2103.03404), [](https://arxiv.org/abs/2104.09864)

[blog post](https://ai.googleblog.com/2020/01/reformer-efficient-transformer.html)

[](https://github.com/lucidrains/routing-transformer), [](https://github.com/lucidrains/sinkhorn-transformer), [](https://github.com/lucidrains/performer-pytorch), [](https://github.com/lucidrains/linear-attention-transformer/), [](https://github.com/lucidrains/compressive-transformer-pytorch)

[](https://proceedings.neurips.cc/paper/2019/hash/9d8df73a3cfbf3c5b47bc9b50f214aff-Abstract.html), [](https://proceedings.neurips.cc/paper_files/paper/2017/hash/3f5ee243547dee91fbd053c1c4a845aa-Abstract.html)

[](https://pypi.org/project/reformer-pytorch/)

[](https://youtu.be/i4H0kjxrias), [](https://youtu.be/Kf3x3lqf9cQ), [](https://youtu.be/0eTULzrOztQ)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1awNgXYtjvUeXl1gS-v1iyDXTJJ-fyJIK) | 07.11.2021 |

| ruDALL·E | Generate images from texts in Russian | [Alex Shonenkov](https://github.com/shonenkov) | [![](https://img.shields.io/github/stars/ai-forever/ru-dalle?style=social)](https://github.com/ai-forever/ru-dalle) 

[](https://github.com/bes-dev/vqvae_dwt_distiller.pytorch), [](https://github.com/boomb0om/Real-ESRGAN-colab)

[](https://huggingface.co/spaces/multimodalart/rudalle)

[project](https://rudalle.ru/)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/ai-forever/ru-dalle/blob/master/jupyters/ruDALLE-example-generation-A100.ipynb) | 03.11.2021 |

| DeepStyle | The Neural Style algorithm synthesizes a pastiche by separating and combining the content of one image with the style of another image using convolutional neural networks | 

[Cameron Smith](https://github.com/cysmith)
 [Alexander Spirin](https://github.com/Sxela)

 | [![](https://img.shields.io/github/stars/cysmith/neural-style-tf?style=social)](https://github.com/cysmith/neural-style-tf) 

[](https://arxiv.org/abs/1604.08610), [](https://arxiv.org/abs/1606.05897), [](https://arxiv.org/abs/1508.06576)

[cvpr](https://www.cv-foundation.org/openaccess/content_cvpr_2016/papers/Gatys_Image_Style_Transfer_CVPR_2016_paper.pdf)

[](https://en.wikipedia.org/wiki/Pastiche), [](https://en.wikipedia.org/wiki/The_Starry_Night), [](https://en.wikipedia.org/wiki/YUV), [](https://en.wikipedia.org/wiki/Lab_color_space), [](https://en.wikipedia.org/wiki/YCbCr), [](https://en.wikipedia.org/wiki/CIELUV), [](https://en.wikipedia.org/wiki/Pareidolia)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/14aJ7HQPbcP0sNRIY-FRO4u6lxtlyyxI_) | 01.10.2021 |

| Text2Animation | Generate images from text phrases with VQGAN and CLIP with animation and keyframes | 

[Katherine Crowson](https://kath.io/)
 [Ryan Murdock](https://twitter.com/advadnoun)
 [Chigozie Nri](https://github.com/chigozienri)
 [Denis Malimonov](https://github.com/tg-bomze)

 | [![](https://img.shields.io/github/stars/chigozienri/VQGAN-CLIP-animations?style=social)](https://github.com/chigozienri/VQGAN-CLIP-animations) 

[](https://arxiv.org/abs/2012.09841), [](https://arxiv.org/abs/2103.00020)

[](https://www.youtube.com/channel/UCToztRy9FSTIhEen_1x4FAw)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/tg-bomze/collection-of-notebooks/blob/master/Text2Animation.ipynb) | 29.09.2021 |

| EfficientNetV2 | A family of image classification models, which achieve better parameter efficiency and faster training speed than prior arts | 

[Mingxing Tan](https://scholar.google.com/citations?user=6POeyBoAAAAJ)
 [Quoc Le](https://cs.stanford.edu/~quocle/)

 | [![](https://img.shields.io/github/stars/google/automl?style=social)](https://github.com/google/automl/tree/master/efficientnetv2) 

[](https://arxiv.org/abs/2104.00298), [](https://arxiv.org/abs/1905.11946)

[](https://github.com/NVIDIA/TensorRT/tree/master/samples/python/efficientnet)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/google/automl/blob/master/efficientnetv2/tutorial.ipynb) | 24.09.2021 |

| Clip retrieval | Easily compute clip embeddings and build a clip retrieval system with them | [Romain Beaumont](https://github.com/rom1504) | [![](https://img.shields.io/github/stars/rom1504/clip-retrieval?style=social)](https://github.com/rom1504/clip-retrieval) 

[](https://discord.gg/eq3cAMZtCC)

[](https://github.com/LAION-AI/CLIP_benchmark), [](https://github.com/rom1504/laion-prepro), [](https://github.com/dzryk/antarctic-captions), [](https://github.com/LAION-AI/CLIP-based-NSFW-Detector), [](https://github.com/ml-research/OffImgDetectionCLIP)

[](https://rom1504.medium.com/semantic-search-with-embeddings-index-anything-8fb18556443c)

[project](https://rom1504.github.io/clip-retrieval)

[](https://pypi.python.org/pypi/clip-retrieval)

[](https://en.wikipedia.org/wiki/Locality_of_reference)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/rom1504/clip-retrieval/blob/master/notebook/clip-retrieval-getting-started.ipynb) | 21.09.2021 |

| img2dataset | Easily turn large sets of image urls to an image dataset | [Romain Beaumont](https://github.com/rom1504) | [![](https://img.shields.io/github/stars/rom1504/img2dataset?style=social)](https://github.com/rom1504/img2dataset) 

[](https://discord.gg/eq3cAMZtCC)

[](https://github.com/uber/petastorm), [](https://github.com/fsspec/filesystem_spec/blob/6233f315548b512ec379323f762b70764efeb92c/fsspec/registry.py#L87), [](https://github.com/fsspec/sshfs), [](https://github.com/rom1504/cah-prepro)

[](https://huggingface.co/docs/hub/datasets-viewer), [](https://huggingface.co/docs/huggingface_hub/guides/hf_file_system)

[](https://rom1504.medium.com/semantic-search-at-billions-scale-95f21695689a)

[](https://pypi.python.org/pypi/img2dataset)

[](https://www.tensorflow.org/guide/data)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/rom1504/img2dataset/blob/master/notebook/img2dataset_getting_started.ipynb) | 17.09.2021 |

| Droidlet | A modular embodied agent architecture and platform for building embodied agents | 

[Anurag Pratik](https://github.com/anuragprat1k)
 [Soumith Chintala](https://soumith.ch/)
 [Kavya Srinet](https://github.com/kavyasrinet)
others[Dhiraj Gandhi](https://dhiraj100892.github.io/)
 [Rebecca Qian](https://github.com/Rebecca-Qian)
 [Yuxuan Sun](https://github.com/snyxan)
 [Ryan Drew](https://rdrew.dev/)
 [Sara Elkafrawy](https://github.com/saraEbrahim)
 [Anoushka Tiwari](https://www.linkedin.com/in/anoushka-tiwari)
 [Tucker Hart](https://www.linkedin.com/in/tucker-hart-05a638133)
 [Mary Williamson](https://scholar.google.com/citations?user=Ys4xB-QAAAAJ)
 [Abhinav Gupta](http://www.cs.cmu.edu/~abhinavg/)
 [Arthur Szlam](https://scholar.google.com/citations?user=u3-FxUgAAAAJ)

 | [![](https://img.shields.io/github/stars/facebookresearch/droidlet?style=social)](https://github.com/facebookresearch/droidlet) 

[](https://arxiv.org/abs/2101.10384), [](https://arxiv.org/abs/1907.08584)

[](https://facebookresearch.github.io/droidlet/)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/facebookresearch/droidlet/blob/master/examples_and_tutorials/tutorials/droidlet_for_physical_robots.ipynb) | 15.09.2021 |

| GPT-J-6B | A 6 billion parameter, autoregressive text generation model trained on The Pile | 

[Ben Wang](https://benwang.dev/)
 [Aran Komatsuzaki](https://arankomatsuzaki.wordpress.com/about-me/)
 [Janko Prester](https://www.jankoprester.com/)

 | [![](https://img.shields.io/github/stars/kingoflolz/mesh-transformer-jax?style=social)](https://github.com/kingoflolz/mesh-transformer-jax) 

[The Pile](https://pile.eleuther.ai/)

[blog post](https://arankomatsuzaki.wordpress.com/2021/06/04/gpt-j/)

[](https://github.com/EleutherAI/gpt-neox), [](https://github.com/microsoft/DeepSpeed)

[web demo](https://6b.eleuther.ai/)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/kingoflolz/mesh-transformer-jax/blob/master/colab_demo.ipynb) | 15.09.2021 |

| Lucid Sonic Dreams | Syncs GAN-generated visuals to music | [Mikael Alafriz](https://github.com/mikaelalafriz) | [![](https://img.shields.io/github/stars/mikaelalafriz/lucid-sonic-dreams?style=social)](https://github.com/mikaelalafriz/lucid-sonic-dreams) 

[](https://github.com/NVlabs/stylegan2), [](https://github.com/justinpinkney/awesome-pretrained-stylegan2)

[](https://towardsdatascience.com/introducing-lucid-sonic-dreams-sync-gan-art-to-music-with-a-few-lines-of-python-code-b04f88722de1)

[](https://youtu.be/l-nGC-ve7sI)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1Y5i50xSFIuN3V4Md8TB30_GOAtts7RQD) | 24.08.2021 |

| textgenrnn | Generate text using a pretrained neural network with a few lines of code, or easily train your own text-generating neural network of any size and complexity | [Max Woolf](https://minimaxir.com/) | [![](https://img.shields.io/github/stars/minimaxir/textgenrnn?style=social)](https://github.com/minimaxir/textgenrnn) 

[blog post](http://minimaxir.com/2018/05/text-neural-networks/)

[](https://www.youtube.com/watch?v=RW7mP6BfZuY)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1mMKGnVxirJnqDViH7BDJxFqWrsXlPSoK) | 13.07.2021 |

| DDSP | Differentiable Digital Signal Processing library, which enables direct integration of classic signal processing elements with deep learning methods | 

[Jesse Engel](https://github.com/jesseengel)
 [Lamtharn Hantrakul](https://lh-hantrakul.com/)
 [Chenjie Gu](https://scholar.google.com/citations?user=_4B6OTAAAAAJ)
 [Adam Roberts](https://github.com/adarob)

 | [![](https://img.shields.io/github/stars/magenta/ddsp?style=social)](https://github.com/magenta/ddsp) 

[](https://arxiv.org/abs/2001.04643)

[blog post](https://magenta.tensorflow.org/ddsp)

[](https://deepmind.com/blog/article/wavenet-generative-model-raw-audio)

[](https://github.com/acids-ircam/ddsp_pytorch)

[project](https://storage.googleapis.com/ddsp/index.html)

[](https://en.wikipedia.org/wiki/Chorus_effect), [](https://en.wikipedia.org/wiki/Flanging), [](https://en.wikipedia.org/wiki/Vibrato), [](https://en.wikipedia.org/wiki/Nyquist_frequency), [](https://en.wikipedia.org/wiki/Aliasing)

[](https://www.youtube.com/playlist?list=PLBUMAYA6kvGXyzrZJMACr6VNZuunv8DMU), [](https://youtu.be/EXz1TJQ-hSo), [](https://youtu.be/g1D4brEhxWw?t=1272), [](https://youtu.be/IZBlqcbpmxY)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/magenta/ddsp/blob/main/ddsp/colab/tutorials/0_processor.ipynb) | 01.07.2021 |

| BasicSR | Open Source Image and Video Restoration Toolbox for Super-resolution, Denoise, Deblurring, etc. | 

[Xintao Wang](https://xinntao.github.io/)
 [Liangbin Xie](https://liangbinxie.github.io/)
 [Ke Yu](https://github.com/yuke93)
others[Kelvin Chan](https://ckkelvinchan.github.io/)
 [Chen Change Loy](https://www.mmlab-ntu.com/person/ccloy/)
 [Chao Dong](https://scholar.google.com/citations?user=OSDCB0UAAAAJ)

 | [![](https://img.shields.io/github/stars/XPixelGroup/BasicSR?style=social)](https://github.com/XPixelGroup/BasicSR) 

[](https://arxiv.org/abs/2012.02181)

[](https://basicsr.readthedocs.io/en/latest/)

[](https://github.com/xinntao/ESRGAN), [](https://github.com/xindongzhang/ECBSR), [](https://github.com/Lotayou/Face-Renovation), [](https://github.com/csxmli2016/DFDNet), [](https://github.com/rosinality/stylegan2-pytorch), [](https://github.com/xinntao/facexlib), [](https://github.com/xinntao/HandyView), [](https://github.com/xinntao/HandyFigure), [](https://github.com/xinntao/SFTGAN), [](https://github.com/xinntao/DNI), [](https://github.com/xinntao/HandyCrawler), [](https://github.com/xinntao/HandyWriting)

[](https://youtu.be/KaMYsxWkmww)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1JQScYICvEC3VqaabLu-lxvq9h7kSV1ML) | 07.06.2021 |

| TensorFlowTTS | Real-time state-of-the-art speech synthesis architectures such as Tacotron-2, Melgan, Multiband-Melgan, FastSpeech, FastSpeech2 based-on TensorFlow 2 | 

[Minh Nguyen Quan Anh](https://github.com/dathudeptrai)
 [Eren Gölge](https://github.com/erogol)
 [Kuan Chen](https://github.com/azraelkuan)
 [Takuya Ebata](https://github.com/MokkeMeguru)

 | [![](https://img.shields.io/github/stars/TensorSpeech/TensorFlowTTS?style=social)](https://github.com/TensorSpeech/TensorFlowTTS) 

[](https://github.com/thorstenMueller/Thorsten-Voice)

[](https://huggingface.co/spaces/akhaliq/TensorFlowTTS), [](https://huggingface.co/tensorspeech)

[](https://www.kaggle.com/datasets/bryanpark/korean-single-speaker-speech-dataset)

[project](https://tensorspeech.github.io/TensorFlowTTS/)

[](https://pypi.org/project/TensorFlowTTS/)

[](https://www.tensorflow.org/model_optimization/guide/quantization/training_comprehensive_guide), [](https://www.tensorflow.org/model_optimization/guide/pruning/pruning_with_keras)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/TensorSpeech/TensorFlowTTS/blob/master/notebooks/TensorFlowTTS_FastSpeech_with_TFLite.ipynb) | 01.06.2021 |

| Hyperopt | Python library for serial and parallel optimization over awkward search spaces, which may include real-valued, discrete, and conditional dimensions | 

[James Bergstra](https://github.com/jaberg)
 [Dan Yamins](https://github.com/yamins81)
 [David Cox](https://scholar.google.com/citations?user=6S-WgLkAAAAJ)

 | [![](https://img.shields.io/github/stars/hyperopt/hyperopt?style=social)](https://github.com/hyperopt/hyperopt) 

[ICML](https://proceedings.mlr.press/v28/bergstra13.html)

[](http://hyperopt.github.io/hyperopt/)

[](https://github.com/hyperopt/hyperopt-sklearn), [](https://github.com/hyperopt/hyperopt-nnet), [](https://github.com/hyperopt/hyperopt-nnet), [](https://github.com/hyperopt/hyperopt-convnet), [](https://github.com/hyperopt/hyperopt-gpsmbo)

[](https://papers.nips.cc/paper/2011/hash/86e8f7ab32cfd12577bc2619bc635690-Abstract.html)

[](https://youtu.be/Mp1xnPfE4PY), [](https://youtu.be/tdwgR1AqQ8Y), [](https://youtu.be/tteE_Vtmrv4)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/hyperopt/hyperopt/blob/master/tutorial/01.BasicTutorial.ipynb) | 01.06.2021 |

| CNN | This tutorial demonstrates training a simple Convolutional Neural Network to classify CIFAR images | [Billy Lamberta](https://github.com/lamberta) | 

[cifar](https://www.cs.toronto.edu/~kriz/cifar.html)

[link](https://developers.google.com/machine-learning/glossary/#convolutional_neural_network)

[](https://www.tensorflow.org/tutorials/images/cnn)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/tensorflow/docs/blob/master/site/en/tutorials/images/cnn.ipynb) | 22.05.2021 |

| Custom GPT-2 + Tokenizer | Train a custom GPT-2 model for free on a GPU using aitextgen! | [Max Woolf](https://minimaxir.com/) | [![](https://img.shields.io/github/stars/minimaxir/aitextgen?style=social)](https://github.com/minimaxir/aitextgen) 

[data](https://raw.githubusercontent.com/karpathy/char-rnn/master/data/tinyshakespeare/input.txt)

[](https://docs.aitextgen.io/)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/144MdX5aLqrQ3-YW-po81CQMrD6kpgpYh) | 17.05.2021 |

| Train a GPT-2 Text-Generating Model | Retrain an advanced text generating neural network on any text dataset for free on a GPU using Colaboratory using aitextgen! | [Max Woolf](https://minimaxir.com/) | [![](https://img.shields.io/github/stars/minimaxir/aitextgen?style=social)](https://github.com/minimaxir/aitextgen) 

[data](https://raw.githubusercontent.com/karpathy/char-rnn/master/data/tinyshakespeare/input.txt)

[](https://docs.aitextgen.io/)

[](https://paperswithcode.com/task/text-generation)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/15qBZx5y9rdaQSyWpsreMDnTiZ5IlN0zD) | 17.05.2021 |

| EasyNMT | Easy to use, state-of-the-art machine translation for more than 100+ languages | [Nils Reimers](https://www.nils-reimers.de/) | [![](https://img.shields.io/github/stars/UKPLab/EasyNMT?style=social)](https://github.com/UKPLab/EasyNMT) 

[](https://arxiv.org/abs/2008.00401), [](https://arxiv.org/abs/2010.11125)

[demo](http://easynmt.net/demo/)

[](https://github.com/Helsinki-NLP/Opus-MT), [](https://github.com/facebookresearch/fairseq/tree/main/examples/multilingual)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1X47vgSiOphpxS5w_LPtjQgJmiSTNfRNC) | 26.04.2021 |

| SkinDeep | Remove Body Tattoo Using Deep Learning | [Vijish Madhavan](https://github.com/vijishmadhavan) | [![](https://img.shields.io/github/stars/vijishmadhavan/SkinDeep?style=social)](https://github.com/vijishmadhavan/SkinDeep) 

[](https://arxiv.org/abs/1805.08318), [](https://arxiv.org/abs/1710.10196), [](https://arxiv.org/abs/1707.02921), [](https://arxiv.org/abs/1603.08155)

[](https://github.com/jantic/DeOldify)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/vijishmadhavan/SkinDeep/blob/master/SkinDeep_good.ipynb) | 24.04.2021 |

| PaddleHub | Pre-trained models toolkit based on PaddlePaddle: 400+ models including Image, Text, Audio, Video and Cross-Modal with Easy Inference & Serving | 

[Zeyu Chen](https://github.com/ZeyuChen)
 [Zewu Wu](https://github.com/nepeplwu)
 [Bin Long](https://github.com/sjtubinlong)
others[Xuefei Zhang](https://github.com/Steffy-zxf)
 [Jinxuan Qiu](https://github.com/kinghuin)
 [Yuhan Shen](https://github.com/ShenYuhan)
 [Yuying Hao](https://github.com/haoyuying)
 [Xiaojie Chen](https://github.com/KPatr1ck)

 | [![](https://img.shields.io/github/stars/PaddlePaddle/PaddleHub?style=social)](https://github.com/PaddlePaddle/PaddleHub) 

[](https://paddlehub.readthedocs.io/en)

[](https://github.com/PaddlePaddle/PaddleOCR), [](https://github.com/PaddlePaddle/PaddleDetection), [](https://github.com/PaddlePaddle/PaddleGAN), [](https://github.com/CMU-Perceptual-Computing-Lab/openpose), [](https://github.com/PaddlePaddle/PaddleSeg), [](https://github.com/PaddlePaddle/PaddleClas), [](https://github.com/PaddlePaddle/ERNIE), [](https://github.com/baidu/LAC), [](https://github.com/baidu/DDParser), [](https://github.com/PaddlePaddle/PaddleSpeech)

[](https://huggingface.co/PaddlePaddle)

[](https://medium.com/analytics-vidhya/paddlehub-fdd1ec75a07b)

[website](https://www.paddlepaddle.org.cn/en)

[](https://youtu.be/9adXuF_lTSg)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/PaddlePaddle/PaddleHub/blob/develop/demo/serving/bentoml/cloud-native-model-serving-with-bentoml.ipynb) | 20.04.2021 |

| OCTIS | Framework for training, analyzing, and comparing Topic Models, whose optimal hyper-parameters are estimated using a Bayesian Optimization approach | 

[Silvia Terragni](https://silviatti.github.io/)
 [Elisabetta Fersini](https://www.unimib.it/elisabetta-fersini)
 [Antonio Candelieri](https://www.unimib.it/antonio-candelieri)
others[Pietro Tropeano](https://github.com/pietrotrope)
 [Bruno Galuzzi](https://github.com/brunoG89)
 [Lorenzo Famiglini](https://github.com/lorenzofamiglini)
 [Davide Pietrasanta](https://github.com/davidepietrasanta)

 | [![](https://img.shields.io/github/stars/mind-Lab/octis?style=social)](https://github.com/mind-Lab/octis) 

[](https://arxiv.org/abs/1703.01488)

[data](https://www.dbpedia.org/resources/ontology/), [data](https://www.statmt.org/europarl/)

[](https://github.com/estebandito22/PyTorchAVITM)

[](https://towardsdatascience.com/a-beginners-guide-to-octis-optimizing-and-comparing-topic-models-is-simple-590554ec9ba6), [](https://towardsdatascience.com/a-beginners-guide-to-octis-vol-2-optimizing-topic-models-1214e58be1e5)

[](https://papers.nips.cc/paper/2000/hash/f9d1152547c0bde01830b7e8bd60024c-Abstract.html)

[paper](https://aclanthology.org/2021.eacl-demos.31/)

[](https://paperswithcode.com/dataset/20-newsgroups)

[](https://youtu.be/nPmiWBFFJ8E)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/MIND-Lab/OCTIS/blob/master/examples/OCTIS_Optimizing_CTM.ipynb) | 19.04.2021 |

| GPT Neo | An implementation of model & data parallel GPT2 & GPT3 -like models, with the ability to scale up to full GPT3 sizes (and possibly more!), using the mesh-tensorflow library | [EleutherAI](https://www.eleuther.ai/) | [![](https://img.shields.io/github/stars/EleutherAI/gpt-neo?style=social)](https://github.com/EleutherAI/gpt-neo) 

[GPT-2](https://openai.com/blog/better-language-models/)

[](https://arxiv.org/abs/2005.14165), [](https://arxiv.org/abs/2004.05150), [](https://arxiv.org/abs/1701.06538)

[](https://github.com/tensorflow/mesh), [](https://github.com/EleutherAI/gpt-neox/)

[pretrained](https://the-eye.eu/public/AI/gptneo-release/)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/EleutherAI/GPTNeo/blob/master/GPTNeo_example_notebook.ipynb) | 28.03.2021 |

| CVAE | This notebook demonstrates how train a Variational Autoencoder on the MNIST dataset | 

[Diederik Kingma](http://www.dpkingma.com/)
 [Max Welling](https://staff.fnwi.uva.nl/m.welling/)
 [Danilo Rezende](https://danilorezende.com/about/)
others[Shakir Mohamed](https://shakirm.com/)
 [Daan Wierstra](https://scholar.google.com/citations?user=aDbsf28AAAAJ)

 | 

[](https://arxiv.org/abs/1312.6114), [](https://arxiv.org/abs/1401.4082)

[](https://www.tensorflow.org/tutorials/generative/cvae)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/tensorflow/docs/blob/master/site/en/tutorials/generative/cvae.ipynb) | 22.03.2021 |

| Big Sleep | Text to image generation, using OpenAI's CLIP and a BigGAN | [Phil Wang](https://lucidrains.github.io/) | [![](https://img.shields.io/github/stars/lucidrains/big-sleep?style=social)](https://github.com/lucidrains/big-sleep) 

[](https://arxiv.org/abs/2103.00020), [](https://arxiv.org/abs/1809.11096)

[](https://pypi.org/project/big-sleep/)

[](https://www.reddit.com/r/bigsleep/comments/lxawb4/how_to_use_some_of_the_newer_features_of/), [](https://www.reddit.com/r/bigsleep/)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1MEWKbm-driRNF8PrU7ogS5o3se-ePyPb) | 17.03.2021 |

| Deep Daze | Text to image generation using OpenAI's CLIP and Siren | [Phil Wang](https://lucidrains.github.io/) | [![](https://img.shields.io/github/stars/lucidrains/deep-daze?style=social)](https://github.com/lucidrains/deep-daze) 

[](https://arxiv.org/abs/2103.00020), [](https://arxiv.org/abs/2006.09661)

[](https://pypi.org/project/deep-daze/)

[](https://www.reddit.com/r/deepdaze/)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1_YOHdORb0Fg1Q7vWZ_KlrtFe9Ur3pmVj) | 17.03.2021 |

| DCGAN | This tutorial demonstrates how to generate images of handwritten digits using a Deep Convolutional Generative Adversarial Network | 

[Alec Radford](https://scholar.google.com/citations?user=dOad5HoAAAAJ)
 [Luke Metz](https://lukemetz.com/)
 [Soumith Chintala](https://soumith.ch/)

 | 

[](https://arxiv.org/abs/1511.06434), [](https://arxiv.org/abs/1701.00160)

[](https://www.kaggle.com/jessicali9530/celeba-dataset)

[](https://medium.com/@vedantjagtap2002/artificial-intelligence-approach-to-reduce-energy-used-for-cooling-data-centres-d2d78d92c107)

[](https://www.tensorflow.org/tutorials/generative/dcgan)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/tensorflow/docs/blob/master/site/en/tutorials/generative/dcgan.ipynb) | 12.03.2021 |

| Adversarial FGSM | This tutorial creates an adversarial example using the Fast Gradient Signed Method attack. This was one of the first and most popular attacks to fool a neural network. | 

[Ian Goodfellow](https://www.iangoodfellow.com/)
 [Jonathon Shlens](https://shlens.github.io/)
 [Christian Szegedy](https://scholar.google.com/citations?user=bnQMuzgAAAAJ)

 | 

[](https://arxiv.org/abs/1412.6572)

[imagenet](http://www.image-net.org/)

[](https://medium.com/@zachariaharungeorge/a-deep-dive-into-the-fast-gradient-sign-method-611826e34865)

[](https://www.tensorflow.org/versions/r2.0/api_docs/python/tf/keras/applications/MobileNetV2), [](https://www.tensorflow.org/tutorials/generative/adversarial_fgsm)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/tensorflow/docs/blob/master/site/en/tutorials/generative/adversarial_fgsm.ipynb) | 12.03.2021 |

| GAN steerability | We will navigate in GAN latent space to simulate various camera transformations | 

[Ali Jahanian](http://people.csail.mit.edu/jahanian/)
 [Lucy Chai](http://people.csail.mit.edu/lrchai/)
 [Phillip Isola](http://web.mit.edu/phillipi/)

 | [![](https://img.shields.io/github/stars/ali-design/gan_steerability?style=social)](https://github.com/ali-design/gan_steerability) 

[](https://arxiv.org/abs/1907.07171), [](https://arxiv.org/abs/1809.11096)

[project](https://ali-design.github.io/gan_steerability/)

[](https://youtu.be/nS0V64sF7Cw)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1kn6yG8PqD1U2bUcy32V1iAVjzlcQWcG3) | 04.03.2021 |

| Trax | End-to-end library for deep learning that focuses on clear code and speed | [Google](https://research.google/teams/brain/) | [![](https://img.shields.io/github/stars/google/trax?style=social)](https://github.com/google/trax) 

[](https://arxiv.org/abs/1910.00177)

[discuss](https://groups.google.com/u/1/g/trax-discuss)

[](https://trax-ml.readthedocs.io/en/latest/)

[](https://www.kaggle.com/abhinavwalia95/entity-annotated-corpus), [](https://www.kaggle.com/code/dschettler8845/exploration-of-trax-framework)

[](https://towardsdatascience.com/get-started-with-google-trax-for-nlp-ff8dcd3119cf), [](https://medium.com/analytics-vidhya/brief-view-of-googles-trax-library-b78eae008cb6)

[](https://www.tensorflow.org/datasets/catalog/overview), [](https://tensorflow.org/guide/tf_numpy)

[](https://youtu.be/qlTsaHAtJBY)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/google/trax/blob/master/trax/intro.ipynb) | 18.02.2021 |

| bsuite | A collection of carefully-designed experiments that investigate core capabilities of an RL agent with two main objectives | 

[Ian Osband](http://iosband.github.io/)
 [Yotam Doron](http://www.yotamdoron.com/)
 [Matteo Hessel](https://github.com/mtthss)
others[John Aslanides](https://www.aslanides.io/)
 [Eren Sezener](http://erensezener.com/)
 [Andre Saraiva](https://andresnds.wordpress.com/)
 [Katrina McKinney](https://medium.com/@katrinamckinney)
 [Tor Lattimore](http://tor-lattimore.com/)
 [Csaba Szepesvari](https://sites.ualberta.ca/~szepesva/)
 [Satinder Singh](http://web.eecs.umich.edu/~baveja/)
 [Benjamin Van Roy](https://web.stanford.edu/~bvr/)
 [Richard Sutton](http://www.incompleteideas.net/)
 [David Silver](https://www.davidsilver.uk/)
 [Hado Van Hasselt](https://hadovanhasselt.com/)

 | [![](https://img.shields.io/github/stars/deepmind/bsuite?style=social)](https://github.com/deepmind/bsuite) 

[](https://github.com/openai/gym)

[paper](https://openreview.net/forum?id=rygf-kSYwH)

[](https://youtu.be/Wcv4eU_qtZU)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1rU20zJ281sZuMD1DHbsODFr1DbASL0RH) | 13.02.2021 |

| TF-Ranking | End-to-end walkthrough of training a TensorFlow Ranking neural network model which incorporates sparse textual features | [Rama Kumar](https://github.com/ramakumar1729) | [![](https://img.shields.io/github/stars/tensorflow/ranking?style=social)](https://github.com/tensorflow/ranking) 

[](https://arxiv.org/abs/1910.09676), [](https://arxiv.org/abs/1812.00073), [](https://arxiv.org/abs/1905.08957), [](https://arxiv.org/abs/1811.04415)

[data](http://hamedz.ir/resources/)

[](https://github.com/tensorflow/serving/blob/master/tensorflow_serving/apis/input.proto#L72)

[](https://en.wikipedia.org/wiki/Mean_reciprocal_rank), [](https://en.wikipedia.org/wiki/Discounted_cumulative_gain)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/tensorflow/ranking/blob/master/tensorflow_ranking/examples/handling_sparse_features.ipynb) | 04.02.2021 |

| Toon-Me | A fun project to toon portrait images | [Vijish Madhavan](https://github.com/vijishmadhavan) | [![](https://img.shields.io/github/stars/vijishmadhavan/Toon-Me?style=social)](https://github.com/vijishmadhavan/Toon-Me) [](https://arxiv.org/abs/1710.10196), [](https://arxiv.org/abs/1707.02921), [](https://arxiv.org/abs/1603.08155) | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/vijishmadhavan/Light-Up/blob/master/Toon_Me_(Try_it_on_Colab).ipynb) | 22.01.2021 |

| TensorNetwork | A library for easy and efficient manipulation of tensor networks | [Chase Roberts](http://thenerdstation.github.io/) | [![](https://img.shields.io/github/stars/google/TensorNetwork?style=social)](https://github.com/google/TensorNetwork) 

[](https://arxiv.org/abs/1708.00006), [](https://arxiv.org/abs/1306.2164)

[](https://tensornetwork.readthedocs.io/)

[](https://www.youtube.com/watch?v=YN2YBB0viKo)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/google/TensorNetwork/blob/master/colabs/Tensor_Networks_in_Neural_Networks.ipynb) | 21.01.2021 |

| Spleeter | Deezer source separation library including pretrained models | 

[Romain Hennequin](http://romain-hennequin.fr/)
 [Anis Khlif](https://github.com/alreadytaikeune)
 [Félix Voituret](https://github.com/Faylixe)
 [Manuel Moussallam](https://mmoussallam.github.io/)

 | [![](https://img.shields.io/github/stars/deezer/spleeter?style=social)](https://github.com/deezer/spleeter) 

[blog post](https://deezer.io/releasing-spleeter-deezer-r-d-source-separation-engine-2b88985e797e)

[data](https://sigsep.github.io/datasets/musdb.html)

[project](https://research.deezer.com/projects/spleeter.html)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/deezer/spleeter/blob/master/spleeter.ipynb) | 10.01.2021 |

| Bullet Physics SDK | Real-time collision detection and multi-physics simulation for VR, games, visual effects, robotics, machine learning etc | 

[Erwin Coumans](https://github.com/erwincoumans)
 [Yunfei Bai](https://github.com/YunfeiBai)

 | [![](https://img.shields.io/github/stars/bulletphysics/bullet3?style=social)](https://github.com/bulletphysics/bullet3) 

[](https://docs.google.com/document/d/10sXEhzFRSnvFcl3XxNGhnD4N2SedqwdAvK3dsihxVUA/edit#heading=h.2ye70wns7io3)

[](https://github.com/Microsoft/vcpkg)

[](https://pypi.org/project/pybullet/)

[website](https://pybullet.org)

[](https://www.youtube.com/playlist?list=PLinBNdD-7nkNCfoEKap4z3qadLVj8QB4a), [](https://youtu.be/9p0O941opGc), [](https://youtu.be/kZxPaGdoSJY), [](https://www.youtube.com/playlist?list=PL9LUFPiB6N3YrS0O7XM_1sBVWRnSRB643)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/bulletphysics/bullet3/blob/master/examples/pybullet/notebooks/HelloPyBullet.ipynb) | 14.10.2020 |

| Person Remover | Project that combines Pix2Pix and YOLO arhitectures in order to remove people or other objects from photos | 

[Javier Gamazo](https://www.javiergamazo.com/)
 [Daryl Autar](https://github.com/Daryl149)

 | [![](https://img.shields.io/github/stars/javirk/Person_remover?style=social)](https://github.com/javirk/Person_remover) 

[](https://github.com/javirk/Person-remover-partial-convolutions), [](https://github.com/zzh8829/yolov3-tf2)

[](https://www.youtube.com/watch?v=_dRjY9gMcxE)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1JDpH8MAjaKoekQ_H9ZaxYJ9_axiDtDGm) | 22.08.2020 |

| Gin Config | Lightweight configuration framework for Python, based on dependency injection | 

[Dan Holtmann-Rice](https://github.com/dhr)
 [Sergio Guadarrama](https://github.com/sguada)
 [Nathan Silberman](http://nsilberman.com/)

 | [![](https://img.shields.io/github/stars/google/gin-config?style=social)](https://github.com/google/gin-config) 

[](https://towardsdatascience.com/stop-worrying-about-configs-with-gin-218562dd5c91)

[](https://pypi.org/project/gin-config/)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/google/gin-config/blob/master/gin/gin_intro.ipynb) | 13.08.2020 |

| Dopamine | Research framework for fast prototyping of reinforcement learning algorithms | 

[Pablo Castro](https://psc-g.github.io/)
 [Subhodeep Moitra](http://www.deepmoitra.com/)
 [Carles Gelada](https://github.com/cgel)
others[Saurabh Kumar](https://scholar.google.com/citations?user=Rkr2uT8AAAAJ)
 [Marc Bellemare](http://www.marcgbellemare.info/)

 | [![](https://img.shields.io/github/stars/google/dopamine?style=social)](https://github.com/google/dopamine) 

[](https://arxiv.org/abs/1812.06110), [](https://arxiv.org/abs/1511.05952), [](https://arxiv.org/abs/1812.05905), [](https://arxiv.org/abs/1806.06923)

[baselines](https://google.github.io/dopamine/baselines/)

[blog post](https://opensource.googleblog.com/2019/02/dopamine-2.0.html)

[](https://google.github.io/dopamine/docker/)

[](https://google.github.io/dopamine/docs/)

[](https://github.com/openai/atari-py#roms), [](https://github.com/openai/mujoco-py#install-mujoco)

[](https://medium.com/the-21st-century/google-dopamine-new-rl-framework-f84a35b7fb3f)

[](https://pypi.org/project/dopamine-rl/)

[](https://www.youtube.com/live/FWFoyFjeAaM?feature=share), [](https://youtu.be/bd4CsDp00RA)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/google/dopamine/blob/master/dopamine/colab/jax_agent_visualizer.ipynb) | 04.08.2020 |

| Analyzing Tennis Serve | We'll use the Video Intelligence API to analyze a tennis serve, including the angle of the arms and legs during the serve | [Dale Markowitz](https://daleonai.com/) | [![](https://img.shields.io/github/stars/google/making_with_ml?style=social)](https://github.com/google/making_with_ml/tree/master/sports_ai) 

[blog post](https://daleonai.com/machine-learning-for-sports)

[](https://manivannan-ai.medium.com/find-the-angle-between-three-points-from-2d-using-python-348c513e2cd)

[](https://www.youtube.com/watch?v=yLrOy2Xedgk)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/google/making_with_ml/blob/master/sports_ai/Sports_AI_Analysis.ipynb) | 14.07.2020 |

| YOLOv4 | This tutorial will help you build YOLOv4 easily in the cloud with GPU enabled so that you can run object detections in milliseconds! | [Alexey Bochkovskiy](http://www.alexeyab.com/) | [![](https://img.shields.io/github/stars/AlexeyAB/darknet?style=social)](https://github.com/AlexeyAB/darknet) 

[](https://arxiv.org/abs/2004.10934), [](https://arxiv.org/abs/2011.08036)

[](https://alexeyab84.medium.com/yolov4-the-most-accurate-real-time-neural-network-on-ms-coco-dataset-73adfd3602fe), [](https://alexeyab84.medium.com/scaled-yolo-v4-is-the-best-neural-network-for-object-detection-on-ms-coco-dataset-39dfa22fa982)

[project](https://pjreddie.com/darknet/)

[](https://pypi.org/project/yolov4/)

[](https://www.reddit.com/r/MachineLearning/comments/gydxzd/p_yolov4_the_most_accurate_realtime_neural/)

[](https://youtu.be/1_SiUOYUoOI), [](https://youtu.be/YDFf-TqJOFE)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/1_GdoqCJWXsChrOiY8sZMr_zbr_fH-0Fg) | 25.06.2020 |

| GAN Dissection | Visualizing and Understanding Generative Adversarial Networks | 

[David Bau](https://people.csail.mit.edu/davidbau/home/)
 [Jun-Yan Zhu](https://www.cs.cmu.edu/~junyanz/)
 [Hendrik Strobelt](http://hendrik.strobelt.com/)
others[Bolei Zhou](https://boleizhou.github.io/)
 [Joshua Tenenbaum](https://mitibmwatsonailab.mit.edu/people/joshua-tenenbaum/)
 [William Freeman](https://billf.mit.edu/)
 [Antonio Torralba](https://groups.csail.mit.edu/vision/torralbalab/)

 | [![](https://img.shields.io/github/stars/CSAILVision/GANDissect?style=social)](https://github.com/CSAILVision/GANDissect) 

[](https://arxiv.org/abs/1811.10597), [](https://arxiv.org/abs/1901.09887), [](https://arxiv.org/abs/1807.10221)

[demo](http://gandissect.res.ibm.com/ganpaint.html)

[](https://github.com/CSAILVision/NetDissect), [](https://github.com/junyanz/iGAN)

[project](https://gandissect.csail.mit.edu/)

[](https://www.youtube.com/watch?v=yVCgUYe4JTM)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/SIDN-IAP/global-model-repr/blob/master/notebooks/gandissect_solutions.ipynb) | 04.05.2020 |

| Sonnet | Library built on top of TensorFlow 2 designed to provide simple, composable abstractions for machine learning research | 

[Malcolm Reynolds](https://github.com/malcolmreynolds)
 [Jack Rae](https://github.com/dm-jrae)
 [Andreas Fidjeland](https://github.com/akfidjeland)
others[Fabio Viola](https://github.com/fabioviola)
 [Adrià Puigdomènech](https://github.com/adria-p)
 [Frederic Besse](https://github.com/fbesse)
 [Tim Green](http://tfgg.me/)
 [Sébastien Racanière](https://scholar.google.com/citations?user=o-h0vrQAAAAJ)
 [Gabriel Barth-Maron](https://github.com/fastturtle)
 [Diego Casas](https://github.com/diegolascasas)

 | [![](https://img.shields.io/github/stars/deepmind/sonnet?style=social)](https://github.com/deepmind/sonnet) 

[](https://www.deepmind.com/blog/open-sourcing-sonnet-a-new-library-for-constructing-neural-networks)

[](https://sonnet.readthedocs.io/en/latest/index.html)

[](https://papers.nips.cc/paper/2016/hash/fb87582825f9d28a8d42c5e5e5e8b23d-Abstract.html)

[](https://www.tensorflow.org/guide/checkpoint), [](https://www.tensorflow.org/guide/saved_model)

[](https://youtu.be/rlpQjnUvoKw)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/deepmind/sonnet/blob/v2/examples/little_gan_on_mnist.ipynb) | 17.04.2020 |

| Classification of chest vs. adominal X-rays | The goal of this tutorial is to build a deep learning classifier to accurately differentiate between chest and abdominal X-rays | [tmoneyx01](https://github.com/tmoneyx01) | [![](https://img.shields.io/github/stars/mdai/mdai-client-py?style=social)](https://github.com/mdai/mdai-client-py) 

[annotator](https://public.md.ai/annotator/project/PVq9raBJ)

[](https://docs.md.ai/)

[](https://pypi.org/project/mdai/)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/mdai/ml-lessons/blob/master/lesson1-xray-images-classification.ipynb) | 07.03.2020 |

| Earth Engine Python API and Folium Interactive Mapping | This notebook demonstrates how to setup the Earth Engine and provides several examples for visualizing Earth Engine processed data interactively using the folium library | [Qiusheng Wu](https://wetlands.io/) | [![](https://img.shields.io/github/stars/python-visualization/folium?style=social)](https://github.com/python-visualization/folium) 

[api](https://developers.google.com/earth-engine/python_install)

[](https://pypi.org/project/earthengine-api/), [](https://pypi.org/project/folium/)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/giswqs/qgis-earthengine-examples/blob/master/Folium/ee-api-folium-setup.ipynb) | 20.01.2020 |

| Tensor2Tensor | Library for deep learning models that is well-suited for neural machine translation and includes the reference implementation of the state-of-the-art Transformer model | 

[Ashish Vaswani](https://scholar.google.com/citations?user=oR9sCGYAAAAJ)
 [Samy Bengio](https://scholar.google.com/citations?user=Vs-MdPcAAAAJ)
 [Eugene Brevdo](https://ebrevdo.github.io/)
others[François Chollet](https://fchollet.com/)
 [Aidan Gomez](https://gom.ai/)
 [Stephan Gouws](https://scholar.google.com/citations?user=lLTdYUYAAAAJ)
 [Llion Jones](https://www.linkedin.com/in/llion-jones-9ab3064b)
 [Łukasz Kaiser](https://scholar.google.com/citations?user=JWmiQR0AAAAJ)
 [Nal Kalchbrenner](https://www.nal.ai/)
 [Niki Parmar](https://github.com/nikiparmar)
 [Ryan Sepassi](https://ryansepassi.com/)
 [Noam Shazeer](https://github.com/nshazeer)
 [Jakob Uszkoreit](https://scholar.google.com/citations?user=mOG0bwsAAAAJ)

 | [![](https://img.shields.io/github/stars/tensorflow/tensor2tensor?style=social)](https://github.com/tensorflow/tensor2tensor) 

[](https://arxiv.org/abs/1803.07416), [](https://arxiv.org/abs/1812.02825), [](https://arxiv.org/abs/1706.03762), [](https://arxiv.org/abs/1706.03059), [](https://arxiv.org/abs/1706.05137), [](https://arxiv.org/abs/1801.09797)

[blog post](https://ai.googleblog.com/2017/06/accelerating-deep-learning-research.html)

[data](https://research.fb.com/downloads/babi/)

[](https://towardsdatascience.com/tensor2tensor-and-one-model-to-learn-them-all-7ef3f9b61ba4)

[](https://pypi.org/project/tensor2tensor/)

[](https://tensorflow.github.io/tensor2tensor/cloud_mlengine.html), [](https://tensorflow.github.io/tensor2tensor/cloud_tpu.html)

[](https://youtu.be/O2UvKxaOH7c), [](https://youtu.be/VYQ8n3Besrw), [](https://youtu.be/cS2UZKHq4i4)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/tensorflow/tensor2tensor/blob/master/tensor2tensor/notebooks/Transformer_translate.ipynb) | 14.01.2020 |

| Traffic counting | Making Road Traffic Counting App based on Computer Vision and OpenCV | [Andrey Nikishaev](https://github.com/creotiv) | [![](https://img.shields.io/github/stars/scikit-video/scikit-video?style=social)](https://github.com/scikit-video/scikit-video) 

[](http://www.scikit-video.org/stable/)

[](https://github.com/creotiv/object_detection_projects/tree/master/opencv_traffic_counting)

[](https://medium.com/machine-learning-world/tutorial-making-road-traffic-counting-app-based-on-computer-vision-and-opencv-166937911660)

[](https://pypi.org/project/scikit-video/)

[](https://www.youtube.com/watch?v=_o5iLbRHKao)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/drive/12N4m_RYKqrpozRzh9qe7nQE_sIqQH9U8) | 10.01.2020 |

| Imagededup | This package provides functionality to make use of hashing algorithms that are particularly good at finding exact duplicates as well as convolutional neural networks which are also adept at finding near duplicates | 

[Tanuj Jain](https://github.com/tanujjain)
 [Christopher Lennan](https://github.com/clennan)
 [Dat Tran](https://dat-tran.com/)

 | [![](https://img.shields.io/github/stars/idealo/imagededup?style=social)](https://github.com/idealo/imagededup) 

[](https://arxiv.org/abs/1704.04861)

[](https://fullstackml.com/wavelet-image-hash-in-python-3504fdd282b5)

[project](https://idealo.github.io/imagededup/)

[](https://pypi.org/project/imagededup/)

 | [![Open In Colab](images/colab.svg)](https://colab.research.google.com/github/idealo/imagededup/blob/master/examples/CIFAR10_duplicates.ipynb) | 03.10.2019 |

# Best of the best

| authors | repositories | papers | packages |

|---|---|---|---|

| 


[Adam Roberts](https://github.com/adarob)
 [Jesse Engel](https://github.com/jesseengel)
 [Curtis Hawthorne](https://github.com/cghawthorne)
 [Billy Lamberta](https://github.com/lamberta)
 [Ziwei Liu](https://liuziwei7.github.io/)
 [Chen Change Loy](https://www.mmlab-ntu.com/person/ccloy/)
 [Xintao Wang](https://xinntao.github.io/)
 [Krzysztof Ostrowski](https://github.com/krzys-ostrowski)
 [Max Woolf](https://minimaxir.com/)
 [Anselm Levskaya](https://anselmlevskaya.com/)
 [Ryan Sepassi](https://ryansepassi.com/)
 [Yossi Adi](https://www.cs.huji.ac.il/~adiyoss/)
 [Gabriel Synnaeve](https://syhw.github.io/)
 [Ying Shan](https://scholar.google.com/citations?user=4oXBp9UAAAAJ)
 [Ben Trevett](https://bentrevett.com/)
 [OpenMMLab](https://openmmlab.com/)
 [Sharan Narang](https://github.com/sharannarang)
 [Noah Fiedel](https://scholar.google.com/citations?user=XWpV9DsAAAAJ)
 [Alexandre Défossez](https://ai.honu.io/)
 [Nikhila Ravi](https://nikhilaravi.com/)
 [Paden Tomasello](https://scholar.google.com/citations?user=sBtWMGYAAAAJ)
 [Shang-Wen Li](https://swdanielli.github.io/)
 [Karen Simonyan](https://scholar.google.com/citations?user=L7lMQkQAAAAJ)
 [Amit Bermano](https://www.cs.tau.ac.il/~amberman/)
 [Lei Zhang](https://www.leizhang.org/)
 [Ian Simon](https://github.com/iansimon)
 [Colin Raffel](https://colinraffel.com//)
 [Douglas Eck](https://github.com/douglaseck)
 [Mohammad Norouzi](https://norouzi.github.io/)
 [Thomas Wolf](https://thomwolf.io/)
 [Kushal Lakhotia](https://about.me/hikushalhere)
 [Patrick von Platen](https://github.com/patrickvonplaten)
 [Junyang Lin](https://justinlin610.github.io/)
 [Vijish Madhavan](https://github.com/vijishmadhavan)
 [Phil Wang](https://lucidrains.github.io/)
 [Łukasz Kaiser](https://scholar.google.com/citations?user=JWmiQR0AAAAJ)
 [Anirudh Dubey](https://github.com/anirudh161)
 [Pablo Castro](https://psc-g.github.io/)
 [Eugene Brevdo](https://ebrevdo.github.io/)
 [Glenn Jocher](https://github.com/glenn-jocher)
 [Rishabh Agarwal](https://agarwl.github.io/)
 [Gabriel Barth-Maron](https://github.com/fastturtle)
 [Hugging Face](https://huggingface.co/)
 [Younes Belkada](https://github.com/younesbelkada)
 [Lewis Tunstall](https://lewtun.github.io/blog/)
 [Han Xiao](https://hanxiao.io/)
 [Romain Beaumont](https://github.com/rom1504)
 [intel](https://www.intel.com/content/www/us/en/developer/topic-technology/open/overview.html)
 [Ion Stoica](https://people.eecs.berkeley.edu/~istoica/)
 [unsloth](https://unsloth.ai/)

 | 

stable-diffusion-webui	[![](https://img.shields.io/github/stars/AUTOMATIC1111/stable-diffusion-webui?style=social)](https://github.com/AUTOMATIC1111/stable-diffusion-webui)
 ollama	[![](https://img.shields.io/github/stars/ollama/ollama?style=social)](https://github.com/ollama/ollama)
 langchain	[![](https://img.shields.io/github/stars/langchain-ai/langchain?style=social)](https://github.com/langchain-ai/langchain)
 whisper	[![](https://img.shields.io/github/stars/openai/whisper?style=social)](https://github.com/openai/whisper)
 models	[![](https://img.shields.io/github/stars/tensorflow/models?style=social)](https://github.com/tensorflow/models)
 ComfyUI	[![](https://img.shields.io/github/stars/comfyanonymous/ComfyUI?style=social)](https://github.com/comfyanonymous/ComfyUI)
 stable-diffusion	[![](https://img.shields.io/github/stars/CompVis/stable-diffusion?style=social)](https://github.com/CompVis/stable-diffusion)
 open-interpreter	[![](https://img.shields.io/github/stars/KillianLucas/open-interpreter?style=social)](https://github.com/KillianLucas/open-interpreter)
 Real-Time-Voice-Cloning	[![](https://img.shields.io/github/stars/CorentinJ/Real-Time-Voice-Cloning?style=social)](https://github.com/CorentinJ/Real-Time-Voice-Cloning)
 yolov5	[![](https://img.shields.io/github/stars/ultralytics/yolov5?style=social)](https://github.com/ultralytics/yolov5)
 segment-anything	[![](https://img.shields.io/github/stars/facebookresearch/segment-anything?style=social)](https://github.com/facebookresearch/segment-anything)
 Fooocus	[![](https://img.shields.io/github/stars/lllyasviel/Fooocus?style=social)](https://github.com/lllyasviel/Fooocus)
 PythonDataScienceHandbook	[![](https://img.shields.io/github/stars/jakevdp/PythonDataScienceHandbook?style=social)](https://github.com/jakevdp/PythonDataScienceHandbook)
 autogen	[![](https://img.shields.io/github/stars/microsoft/autogen?style=social)](https://github.com/microsoft/autogen)
 crawl4ai	[![](https://img.shields.io/github/stars/unclecode/crawl4ai?style=social)](https://github.com/unclecode/crawl4ai)
 llama_index	[![](https://img.shields.io/github/stars/run-llama/llama_index?style=social)](https://github.com/run-llama/llama_index)
 stablediffusion	[![](https://img.shields.io/github/stars/Stability-AI/stablediffusion?style=social)](https://github.com/Stability-AI/stablediffusion)
 ultralytics	[![](https://img.shields.io/github/stars/ultralytics/ultralytics?style=social)](https://github.com/ultralytics/ultralytics)
 TTS	[![](https://img.shields.io/github/stars/coqui-ai/TTS?style=social)](https://github.com/coqui-ai/TTS)
 unsloth	[![](https://img.shields.io/github/stars/unslothai/unsloth?style=social)](https://github.com/unslothai/unsloth)
 bark	[![](https://img.shields.io/github/stars/suno-ai/bark?style=social)](https://github.com/suno-ai/bark)
 Open-Assistant	[![](https://img.shields.io/github/stars/LAION-AI/Open-Assistant?style=social)](https://github.com/LAION-AI/Open-Assistant)
 ray	[![](https://img.shields.io/github/stars/ray-project/ray?style=social)](https://github.com/ray-project/ray)
 visual-chatgpt	[![](https://img.shields.io/github/stars/microsoft/visual-chatgpt?style=social)](https://github.com/microsoft/visual-chatgpt)
 jax	[![](https://img.shields.io/github/stars/google/jax?style=social)](https://github.com/google/jax)
 detectron2	[![](https://img.shields.io/github/stars/facebookresearch/detectron2?style=social)](https://github.com/facebookresearch/detectron2)
 fairseq	[![](https://img.shields.io/github/stars/facebookresearch/fairseq?style=social)](https://github.com/facebookresearch/fairseq)
 mmdetection	[![](https://img.shields.io/github/stars/open-mmlab/mmdetection?style=social)](https://github.com/open-mmlab/mmdetection)
 Retrieval-based-Voice-Conversion-WebUI	[![](https://img.shields.io/github/stars/RVC-Project/Retrieval-based-Voice-Conversion-WebUI?style=social)](https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI)
 diffusers	[![](https://img.shields.io/github/stars/huggingface/diffusers?style=social)](https://github.com/huggingface/diffusers)
 CLIP	[![](https://img.shields.io/github/stars/openai/CLIP?style=social)](https://github.com/openai/CLIP)
 so-vits-svc	[![](https://img.shields.io/github/stars/svc-develop-team/so-vits-svc?style=social)](https://github.com/svc-develop-team/so-vits-svc)
 spleeter	[![](https://img.shields.io/github/stars/deezer/spleeter?style=social)](https://github.com/deezer/spleeter)
 supervision	[![](https://img.shields.io/github/stars/roboflow/supervision?style=social)](https://github.com/roboflow/supervision)
 gpt-2	[![](https://img.shields.io/github/stars/openai/gpt-2?style=social)](https://github.com/openai/gpt-2)
 LLaVA	[![](https://img.shields.io/github/stars/haotian-liu/LLaVA?style=social)](https://github.com/haotian-liu/LLaVA)
 pytorch_geometric	[![](https://img.shields.io/github/stars/pyg-team/pytorch_geometric?style=social)](https://github.com/pyg-team/pytorch_geometric)
 litellm	[![](https://img.shields.io/github/stars/BerriAI/litellm?style=social)](https://github.com/BerriAI/litellm)
 darknet	[![](https://img.shields.io/github/stars/AlexeyAB/darknet?style=social)](https://github.com/AlexeyAB/darknet)
 audiocraft	[![](https://img.shields.io/github/stars/facebookresearch/audiocraft?style=social)](https://github.com/facebookresearch/audiocraft)
 jina	[![](https://img.shields.io/github/stars/jina-ai/jina?style=social)](https://github.com/jina-ai/jina)
 guidance	[![](https://img.shields.io/github/stars/guidance-ai/guidance?style=social)](https://github.com/guidance-ai/guidance)
 datasets	[![](https://img.shields.io/github/stars/huggingface/datasets?style=social)](https://github.com/huggingface/datasets)
 swarm	[![](https://img.shields.io/github/stars/openai/swarm?style=social)](https://github.com/openai/swarm)
 magenta	[![](https://img.shields.io/github/stars/magenta/magenta?style=social)](https://github.com/magenta/magenta)
 magenta	[![](https://img.shields.io/github/stars/tensorflow/magenta?style=social)](https://github.com/tensorflow/magenta)
 peft	[![](https://img.shields.io/github/stars/huggingface/peft?style=social)](https://github.com/huggingface/peft)
 DeOldify	[![](https://img.shields.io/github/stars/jantic/DeOldify?style=social)](https://github.com/jantic/DeOldify)
 Qwen	[![](https://img.shields.io/github/stars/QwenLM/Qwen?style=social)](https://github.com/QwenLM/Qwen)
 voice-changer	[![](https://img.shields.io/github/stars/w-okada/voice-changer?style=social)](https://github.com/w-okada/voice-changer)

 | 
 | 

langchain	[![](https://img.shields.io/pypi/dm/langchain?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/langchain/)
 google-cloud-aiplatform	[![](https://img.shields.io/pypi/dm/google-cloud-aiplatform?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/google-cloud-aiplatform/)
 litellm	[![](https://img.shields.io/pypi/dm/litellm?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/litellm/)
 google-generativeai	[![](https://img.shields.io/pypi/dm/google-generativeai?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/google-generativeai/)
 llama-index	[![](https://img.shields.io/pypi/dm/llama-index?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/llama-index/)
 langgraph	[![](https://img.shields.io/pypi/dm/langgraph?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/langgraph/)
 catboost	[![](https://img.shields.io/pypi/dm/catboost?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/catboost/)
 folium	[![](https://img.shields.io/pypi/dm/folium?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/folium/)
 tensorflow-datasets	[![](https://img.shields.io/pypi/dm/tensorflow-datasets?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/tensorflow-datasets/)
 ollama	[![](https://img.shields.io/pypi/dm/ollama?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/ollama/)
 supervision	[![](https://img.shields.io/pypi/dm/supervision?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/supervision/)
 google-cloud-discoveryengine	[![](https://img.shields.io/pypi/dm/google-cloud-discoveryengine?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/google-cloud-discoveryengine/)
 presidio-analyzer	[![](https://img.shields.io/pypi/dm/presidio-analyzer?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/presidio-analyzer/)
 presidio-anonymizer	[![](https://img.shields.io/pypi/dm/presidio-anonymizer?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/presidio-anonymizer/)
 bert-score	[![](https://img.shields.io/pypi/dm/bert-score?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/bert-score/)
 autofaiss	[![](https://img.shields.io/pypi/dm/autofaiss?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/autofaiss/)
 unsloth	[![](https://img.shields.io/pypi/dm/unsloth?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/unsloth/)
 gin-config	[![](https://img.shields.io/pypi/dm/gin-config?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/gin-config/)
 pyglove	[![](https://img.shields.io/pypi/dm/pyglove?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/pyglove/)
 Crawl4AI	[![](https://img.shields.io/pypi/dm/Crawl4AI?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/Crawl4AI/)
 mmdet	[![](https://img.shields.io/pypi/dm/mmdet?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/mmdet/)
 earthengine-api	[![](https://img.shields.io/pypi/dm/earthengine-api?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/earthengine-api/)
 loralib	[![](https://img.shields.io/pypi/dm/loralib?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/loralib/)
 tensorflow-data-validation	[![](https://img.shields.io/pypi/dm/tensorflow-data-validation?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/tensorflow-data-validation/)
 pybullet	[![](https://img.shields.io/pypi/dm/pybullet?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/pybullet/)
 scikit-video	[![](https://img.shields.io/pypi/dm/scikit-video?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/scikit-video/)
 opik	[![](https://img.shields.io/pypi/dm/opik?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/opik/)
 sglang	[![](https://img.shields.io/pypi/dm/sglang?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/sglang/)
 mmsegmentation	[![](https://img.shields.io/pypi/dm/mmsegmentation?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/mmsegmentation/)
 transformer-engine	[![](https://img.shields.io/pypi/dm/transformer-engine?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/transformer-engine/)
 tfx	[![](https://img.shields.io/pypi/dm/tfx?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/tfx/)
 tensorflow-federated	[![](https://img.shields.io/pypi/dm/tensorflow-federated?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/tensorflow-federated/)
 guidance	[![](https://img.shields.io/pypi/dm/guidance?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/guidance/)
 langfun	[![](https://img.shields.io/pypi/dm/langfun?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/langfun/)
 composer	[![](https://img.shields.io/pypi/dm/composer?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/composer/)
 dopamine-rl	[![](https://img.shields.io/pypi/dm/dopamine-rl?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/dopamine-rl/)
 giskard	[![](https://img.shields.io/pypi/dm/giskard?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/giskard/)
 transformer-lens	[![](https://img.shields.io/pypi/dm/transformer-lens?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/transformer-lens/)
 mmpose	[![](https://img.shields.io/pypi/dm/mmpose?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/mmpose/)
 imagededup	[![](https://img.shields.io/pypi/dm/imagededup?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/imagededup/)
 sae-lens	[![](https://img.shields.io/pypi/dm/sae-lens?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/sae-lens/)
 img2dataset	[![](https://img.shields.io/pypi/dm/img2dataset?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/img2dataset/)
 reformer-pytorch	[![](https://img.shields.io/pypi/dm/reformer-pytorch?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/reformer-pytorch/)
 datachain	[![](https://img.shields.io/pypi/dm/datachain?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/datachain/)
 mistral-inference	[![](https://img.shields.io/pypi/dm/mistral-inference?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/mistral-inference/)
 dm-reverb	[![](https://img.shields.io/pypi/dm/dm-reverb?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/dm-reverb/)
 rl-games	[![](https://img.shields.io/pypi/dm/rl-games?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/rl-games/)
 tensor2tensor	[![](https://img.shields.io/pypi/dm/tensor2tensor?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/tensor2tensor/)
 presidio-image-redactor	[![](https://img.shields.io/pypi/dm/presidio-image-redactor?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/presidio-image-redactor/)
 clip-retrieval	[![](https://img.shields.io/pypi/dm/clip-retrieval?style=flat&logo=pypi&label=%E2%80%8D&labelColor=f7f7f4&color=006dad)](https://pypi.org/clip-retrieval/)

 |

[![Stargazers over time](https://starchart.cc/amrzv/awesome-colab-notebooks.svg?variant=adaptive)](https://starchart.cc/amrzv/awesome-colab-notebooks)

(generated by [generate_markdown.py](generate_markdown.py) based on [research.json](data/research.json), [tutorials.json](data/tutorials.json), [courses.json](data/courses.json))
ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/amrzv/awesome-colab-notebooks

Awesome Lists containing this project

README