Projects in Awesome Lists by AILab-CVC

https://github.com/ailab-cvc/yolo-world

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Last synced: 12 May 2025

https://github.com/ailab-cvc/videocrafter

VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models

image-to-video text-to-video video-generation

Last synced: 14 May 2025

https://github.com/AILab-CVC/YOLO-World

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Last synced: 20 Mar 2025

https://github.com/AILab-CVC/VideoCrafter

VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models

image-to-video text-to-video video-generation

Last synced: 28 Mar 2025

https://ailab-cvc.github.io/videocrafter/

VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models

image-to-video text-to-video video-generation

Last synced: 28 Mar 2025

https://github.com/ailab-cvc/unireplknet

[CVPR'24] UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition

architecture artificial-intelligence convolutional-neural-networks deep-learning multimodal-learning

Last synced: 15 May 2025

https://github.com/AILab-CVC/UniRepLKNet

[CVPR'24] UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition

architecture artificial-intelligence convolutional-neural-networks deep-learning multimodal-learning

Last synced: 20 Mar 2025

https://github.com/StevenGrove/GPT4Tools

GPT4Tools is an intelligent system that can automatically decide, control, and utilize different visual foundation models, allowing the user to interact with images during a conversation.

Last synced: 21 Apr 2025

https://github.com/AILab-CVC/GPT4Tools

GPT4Tools is an intelligent system that can automatically decide, control, and utilize different visual foundation models, allowing the user to interact with images during a conversation.

Last synced: 19 Mar 2025

https://github.com/ailab-cvc/gpt4tools

GPT4Tools is an intelligent system that can automatically decide, control, and utilize different visual foundation models, allowing the user to interact with images during a conversation.

Last synced: 04 Apr 2025