Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Projects in Awesome Lists tagged with clip

A curated list of projects in awesome lists tagged with clip .

https://github.com/easychen/pushdeer

开放源码的无App推送服务,iOS14+扫码即用。亦支持快应用/iOS和Mac客户端、Android客户端、自制设备

app clip notification-service push

Last synced: 30 Sep 2024

https://github.com/cvhub520/x-anylabeling

Effortless data labeling with AI support from Segment Anything and other awesome models.

clip deep-learning deeplearning labeling-tool llm onnx paddle pytorch resnet sam yolo

Last synced: 01 Oct 2024

https://github.com/CVHub520/X-AnyLabeling

Effortless data labeling with AI support from Segment Anything and other awesome models.

clip deep-learning deeplearning labeling-tool llm onnx paddle pytorch resnet sam yolo

Last synced: 31 Jul 2024

https://github.com/pharmapsychotic/clip-interrogator

Image to prompt with BLIP and CLIP

clip pytorch

Last synced: 29 Sep 2024

https://github.com/rom1504/clip-retrieval

Easily compute clip embeddings and build a clip retrieval system with them

ai clip deep-learning knn multimodal semantic-search

Last synced: 30 Sep 2024

https://rom1504.github.io/clip-retrieval/?back=https%3A%2F%2Fknn5.laion.ai&index=laion5B&useMclip=false

Easily compute clip embeddings and build a clip retrieval system with them

ai clip deep-learning knn multimodal semantic-search

Last synced: 03 Aug 2024

https://github.com/mbzuai-oryx/video-chatgpt

[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.

chatbot clip gpt-4 llama llava mulit-modal vicuna video-chatboat video-conversation vision-language vision-language-pretraining

Last synced: 27 Sep 2024

https://github.com/unum-cloud/uform

Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than OpenAI CLIP and LLaVA 🖼️ & 🖋️

bert clip clustering contrastive-learning cross-attention huggingface-transformers image-search language-vision llava multi-lingual multimodal neural-network openai openclip pretrained-models pytorch representation-learning semantic-search transformer vector-search

Last synced: 30 Sep 2024

https://github.com/EdVince/Stable-Diffusion-NCNN

Stable Diffusion in NCNN with c++, supported txt2img and img2img

android clip cpp diffusion executable img2img mnn ncnn onnx stable-diffusion tensorrt tnn txt2img

Last synced: 01 Aug 2024

https://github.com/haltakov/natural-language-image-search

Search photos on Unsplash using natural language

clip computer-vision image-search machine-learning photos unsplash

Last synced: 01 Aug 2024

https://github.com/haltakov/natural-language-youtube-search

Search inside YouTube videos using natural language

clip computer-vision machine-learning search youtube

Last synced: 31 Jul 2024

https://github.com/omerbt/Text2LIVE

Official Pytorch Implementation for "Text2LIVE: Text-Driven Layered Image and Video Editing" (ECCV 2022 Oral)

clip eccv2022 generative-model image-editing image-manipulation single-image single-video text-driven-editing text2live video-editing

Last synced: 31 Jul 2024

https://github.com/ArrowLuo/CLIP4Clip

An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"

activitynet clip didemo lsmdc msrvtt msvd multimodal multimodal-learning multimodality ranking retrieval retrieval-model search video-clip-retrieval video-text-retrieval

Last synced: 01 Aug 2024

https://github.com/eps696/aphantasia

CLIP + FFT/DWT/RGB = text to image/video

clip text-to-image text-to-video

Last synced: 01 Aug 2024

https://github.com/hila-chefer/Transformer-MM-Explainability

[ICCV 2021- Oral] Official PyTorch implementation for Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decoder Transformers, a novel method to visualize any Transformer-based network. Including examples for DETR, VQA.

clip detr explainability explainable-ai interpretability lxmert transformer transformers visualbert visualization vqa

Last synced: 01 Aug 2024

https://github.com/Sense-GVT/DeCLIP

Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigm

big-model clip image-text multi-model self-supervised vision-language-pretraining zero-shot

Last synced: 01 Aug 2024

https://github.com/pablosichert/react-truncate

React component for truncating multi-line spans and adding an ellipsis.

clip ellipsis react truncate

Last synced: 01 Oct 2024

https://github.com/v-iashin/video_features

Extract video features from raw videos using multiple GPUs. We support RAFT flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, and TIMM models.

audio-features clip feature-extraction i3d ig65m laion multi-gpu optical-flow parallel pytorch r2plus1d raft resnet s3d swin timm vggish video-features visual-features vit

Last synced: 01 Aug 2024

https://github.com/open-compass/vlmevalkit

Open-source evaluation toolkit of large vision-language models (LVLMs), support GPT-4v, Gemini, QwenVLPlus, 40+ HF models, 20+ benchmarks

chatgpt claude clip computer-vision evaluation gemini gpt gpt-4v gpt4 large-language-models llava llm multi-modal openai openai-api pytorch qwen vit vqa

Last synced: 02 Aug 2024

https://github.com/open-compass/VLMEvalKit

Open-source evaluation toolkit of large vision-language models (LVLMs), support GPT-4v, Gemini, QwenVLPlus, 40+ HF models, 20+ benchmarks

chatgpt claude clip computer-vision evaluation gemini gpt gpt-4v gpt4 large-language-models llava llm multi-modal openai openai-api pytorch qwen vit vqa

Last synced: 08 Aug 2024

https://github.com/yangjianxin1/CLIP-Chinese

中文CLIP预训练模型

chinese clip

Last synced: 08 Aug 2024

https://github.com/zcf0508/autocut-client

AutoCut Client

autocut clip electron video vue

Last synced: 31 Jul 2024

https://github.com/OpenGVLab/Instruct2Act

Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model

chatgpt clip llm robotics segment-anything

Last synced: 02 Aug 2024

https://github.com/opengvlab/instruct2act

Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model

chatgpt clip llm robotics segment-anything

Last synced: 02 Aug 2024

https://github.com/xmed-lab/CLIP_Surgery

CLIP Surgery for Better Explainability with Enhancement in Open-Vocabulary Tasks

clip explainability interpretability multilabel multimodal open-vocabulary sam segment-anything segmentation vision-transformer

Last synced: 31 Jul 2024

https://github.com/wisconsinaivision/vip-llava

[CVPR2024] ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts

chatbot clip cvpr2024 foundation-models gpt-4 gpt-4-vision llama llama2 llava multi-modal vision-language visual-prompting

Last synced: 27 Sep 2024

https://liruiw.github.io/gensim/

GenSim: Generating Robotic Simulation Tasks via Large Language Models

clip gpt-4 llm pybullet simulation

Last synced: 01 Aug 2024

https://github.com/haofanwang/natural-language-joint-query-search

Search photos on Unsplash based on OpenAI's CLIP model, support search with joint image+text queries and attention visualization.

attention clip computer-vision image-retrieval image-search multi-modal-search unsplash visualizations

Last synced: 01 Aug 2024

https://github.com/chao1224/MoleculeSTM

Multi-modal Molecule Structure-text Model for Text-based Editing and Retrieval, Nat Mach Intell 2023 (https://www.nature.com/articles/s42256-023-00759-6)

clip computation-chemistry drug-discovery editing foundation-model molecule-editing moleculeclip moleculestm pretraining retrieval

Last synced: 03 Aug 2024

https://github.com/mbzuai-oryx/videogpt-plus

Official Repository of paper VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding

chatbot clip dual-encoder gpt4 gpt4o image-encoder llama3 llava multimodal phi-3-mini vicuna video-chatbot video-conversation video-encoder vision-language vision-language-pretraining

Last synced: 27 Sep 2024

https://github.com/josephrocca/clip-image-sorter

Sort a folder of images according to their similarity with provided text in your browser (uses a browser-ported version of OpenAI's CLIP model and the web's new File System Access API)

clip file-system-access-api openai openai-clip

Last synced: 01 Aug 2024

https://github.com/Shishkebaboo/VodRecovery

The purpose of this script is to obtain videos or clips that are either marked as "sub-only" or have been deleted on Twitch.

broadcast clip clips commad-line commandline console development ffmpeg live m3u8 m3u8-playlist m3u8-videos mp4 python recover twitch twitchclips twitchtv vodrecovery

Last synced: 06 Aug 2024

https://github.com/Imageomics/bioclip

This is the repository for the BioCLIP model and the TreeOfLife-10M dataset [CVPR'24 Oral, Best Student Paper].

clip computer-vision imageomics knowledge-guided-machine-learning taxonomy

Last synced: 01 Aug 2024

https://github.com/HFAiLab/clip-gen

CLIP-GEN: Language-Free Training of a Text-to-Image Generator with CLIP

clip pytorch text-to-image text2image

Last synced: 01 Aug 2024

https://github.com/minimaxir/imgbeddings

Python package to generate image embeddings with CLIP without PyTorch/TensorFlow

ai clip embeddings image-processing images onnx transformers

Last synced: 03 Aug 2024

https://github.com/miccunifi/SEARLE

[ICCV 2023] - Zero-shot Composed Image Retrieval with Textual Inversion

circo cirr clip composed-image-retrieval fashion-iq knowledge-distillation multimodal-learning pytorch textual-inversion

Last synced: 01 Aug 2024

https://github.com/eddieoz/youtube-clips-automator

MARCELO: an AI powered bot to automate the editing and thumbnail creation for your Youtube clips channel

ai audio-processing automation bot clip computer-vision editing thumbnail video video-processing youtube

Last synced: 04 Aug 2024

https://github.com/DRSY/MoTIS

[NAACL 2022]Mobile Text-to-Image search powered by multimodal semantic representation models(e.g., OpenAI's CLIP)

ai clip cross-modal image-search ios-swift k-means k-means-clustering knn knowledge-distillation lsh naacl random-projection retrieval semantic-search vector-search

Last synced: 03 Aug 2024

https://github.com/ylqi/Count-Anything

This method uses Segment Anything and CLIP to ground and count any object that matches a custom text prompt, without requiring any point or box annotation.

clip count-anything segment-anything

Last synced: 28 Aug 2024

https://github.com/Ajatt-Tools/videoclip

🍗 Easily create videoclips with mpv.

addon ajatt audioclip clip mpv mpv-script videoclip

Last synced: 04 Aug 2024

https://github.com/aerobounce/trim.lua

Trim mode for mpv — Turn mpv into Lossless Audio / Video Editor

clip concat ffmpeg lossless lua lua-script mpv mpv-script trim video video-editor video-processing

Last synced: 04 Aug 2024

https://github.com/pansyjs/video-editing-timeline

Timeline for video editing(为视频编辑而写时间线)

clip cut editing timeline video video-clip video-cut video-editing

Last synced: 29 Sep 2024

https://github.com/d-tsuji/clipboard

(Experimental) Go library for multi-platform clipboard.

clip clipboard clipboard-copy clipboard-library go golang library linux macos multiplatform windows

Last synced: 02 Aug 2024

https://github.com/AgentMaker/Paddle-CLIP

A PaddlePaddle version implementation of CLIP of OpenAI.

clip paddlepaddle

Last synced: 03 Aug 2024

https://github.com/hv0905/nekoimagegallery

An AI-powered natural language & reverse Image Search Engine powered by CLIP & qdrant.

clip computer-vision image-search image-search-engine search-engine transformers

Last synced: 01 Oct 2024

https://github.com/gkjohnson/unity-clip-shader

Unity shader and scripts for rendering solid clipped geometry

3d clip geometry graphics rendering shaders unity unity3d

Last synced: 01 Oct 2024

https://github.com/amiechen/twitch-batch-loader

Batch download twitch clips in terminal

batch clip python twitch twitch-api

Last synced: 06 Aug 2024

https://github.com/yingchen001/CF-CLIP

[ACM MM 2022] Towards Counterfactual Image Manipulation via CLIP

clip image-edit manipulation sytlegan

Last synced: 01 Aug 2024

https://github.com/Jorengarenar/fauxClip

Custom registers and clipboard support for Vim without +clipboard

clip clipboard nogui tmux vim

Last synced: 06 Aug 2024

https://github.com/TheoCoombes/crawlingathome

A client library for LAION's effort to filter CommonCrawl with CLIP, building a large scale image-text dataset.

clip dall-e dataset dataset-generation image-text machine-learning

Last synced: 01 Aug 2024

https://github.com/wangrongding/WebCut

🎬 基于 web 端的音视频编辑器。(A web-based audio and video editor.)

audio audio-editor audio-processing clip cut video video-editor video-processing wasm webcodecs

Last synced: 31 Jul 2024

https://github.com/jamjamjon/usls

A Rust library integrated with ONNXRuntime, providing a collection of Computer Vison and Vision-Language models.

ai blip clip cuda dinov2 grounding-dino ml ocr onnx onnxruntime rust rust-yolo sam tensorrt yolo yolo-rs yolo-rust yolov8

Last synced: 07 Sep 2024

https://github.com/blaugold/circular_clip_route

A Flutter package which provides a page route which reveals its page by expanding a circular clip.

animation circular clip flutter page route transition ui

Last synced: 02 Oct 2024

https://github.com/scatalfo/rlivestreamfaildaily

A python bot that downloads, edits, and uploads a compilation of the daily top posts of LivestreamFail to a YouTube channel.

clip moviepy python reddit subreddit twitch-dl video youtube-channel

Last synced: 28 Sep 2024

https://github.com/lalitsinghrana/dynamic-middle-ellipsis

Framework agnostic code to dynamically truncate long text in the center.

abbreviate clip dynamic middle-ellipsis react text-overflow truncate-text vanilla-js web-components

Last synced: 30 Sep 2024

https://github.com/rainydevzz/berryvine

Libre Media/Clip Sharing Software

bun clip clips css elysia handlebars html media media-services sqlite

Last synced: 30 Sep 2024

https://github.com/phronmophobic/clip.clj

CLIP Embeddings for images and text. A clojure wrapper for clip.cpp.

clip clojure embeddings

Last synced: 28 Sep 2024

https://github.com/qin2dim/clip2onnx

Export CLIP to ONNX

clip clip-onnx onnx onnxruntime

Last synced: 01 Oct 2024

https://github.com/grkndev/twitcher

A great library that will allow you to use the Twitch API service. All you need to do is use your Token and Client Id information.

api clip clipr data javascript nodejs npm npm-package npmjs streamers streaming twitch twitch-api twitch-bot twitchtv twtich-clip user

Last synced: 28 Sep 2024

https://github.com/o770/mpvclip-mpvival-mpvloop

Run MPV with options for loops and playback start and end times set according to a pattern or intervals of the total duration of a file.

audio bash-script clip loop mpv mpv-script multimedia playback video

Last synced: 04 Aug 2024

https://github.com/cbhihe/nlp_clip-bleu-meteor

Python Implementation of lexical vector embedding similarity scoring, zero-shot classification of images and n-gram based scoring to compare textual summaries

bimodal bleu clip deep-learning image-text-matching meteor n-grams nlp-machine-learning nltk python rouge scoring-algorithm vector-embeddings

Last synced: 28 Sep 2024

https://github.com/framilano/whydiscordwhy

A video compressor that creates clip compatible with Discord non-nitro plan

clip discord ffmpeg limit nitro videos

Last synced: 26 Sep 2024

https://github.com/jannesgithub/image-recognition-pos-system

A Windows desktop application that scans products using image recognition and includes all the features of a point-of-sale system.

clip mvvm-architecture possystem wpf

Last synced: 27 Sep 2024