Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Projects in Awesome Lists tagged with clip
A curated list of projects in awesome lists tagged with clip .
https://github.com/easychen/pushdeer
开放源码的无App推送服务,iOS14+扫码即用。亦支持快应用/iOS和Mac客户端、Android客户端、自制设备
app clip notification-service push
Last synced: 30 Sep 2024
https://github.com/ofa-sys/chinese-clip
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
chinese clip computer-vision contrastive-loss coreml-models deep-learning image-text-retrieval multi-modal multi-modal-learning nlp pretrained-models pytorch transformers vision-and-language-pre-training vision-language
Last synced: 30 Sep 2024
https://github.com/marqo-ai/marqo
Unified embedding generation and search engine. Also available on cloud - cloud.marqo.ai
chatgpt clip deep-learning gpt hacktoberfest hnsw information-retrieval knn large-language-models machine-learning machinelearning multi-modal natural-language-processing search-engine semantic-search tensor-search transformers vector-search vision-language visual-search
Last synced: 01 Oct 2024
https://github.com/OFA-Sys/Chinese-CLIP
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
chinese clip computer-vision contrastive-loss coreml-models deep-learning image-text-retrieval multi-modal multi-modal-learning nlp pretrained-models pytorch transformers vision-and-language-pre-training vision-language
Last synced: 01 Aug 2024
https://github.com/cvhub520/x-anylabeling
Effortless data labeling with AI support from Segment Anything and other awesome models.
clip deep-learning deeplearning labeling-tool llm onnx paddle pytorch resnet sam yolo
Last synced: 01 Oct 2024
https://github.com/CVHub520/X-AnyLabeling
Effortless data labeling with AI support from Segment Anything and other awesome models.
clip deep-learning deeplearning labeling-tool llm onnx paddle pytorch resnet sam yolo
Last synced: 31 Jul 2024
https://github.com/open-mmlab/mmpretrain
OpenMMLab Pre-training Toolbox and Benchmark
beit clip constrastive-learning convnext deep-learning image-classification mae masked-image-modeling mobilenet moco multimodal pretrained-models pytorch resnet self-supervised-learning swin-transformer vision-transformer
Last synced: 30 Sep 2024
https://github.com/yuanzhoulvpi2017/zero_nlp
中文nlp解决方案(大模型、数据、模型、训练、推理)
bert bloom chatglm-6b clip dolly falcon gpt gpt-neox gpt2 huggingface-transformers llama2 nlp pipeline pytorch text-generation transformers
Last synced: 27 Sep 2024
https://github.com/pharmapsychotic/clip-interrogator
Image to prompt with BLIP and CLIP
Last synced: 29 Sep 2024
https://github.com/rom1504/clip-retrieval
Easily compute clip embeddings and build a clip retrieval system with them
ai clip deep-learning knn multimodal semantic-search
Last synced: 30 Sep 2024
https://rom1504.github.io/clip-retrieval/?back=https%3A%2F%2Fknn5.laion.ai&index=laion5B&useMclip=false
Easily compute clip embeddings and build a clip retrieval system with them
ai clip deep-learning knn multimodal semantic-search
Last synced: 03 Aug 2024
https://github.com/jingyi0000/vlm_survey
Collection of AWESOME vision-language models for vision tasks
clip computer-vision deep-learning knowledge-distillation multi-modal-model survey transfer-learning vision-language-model
Last synced: 30 Sep 2024
https://github.com/cambrian-mllm/cambrian
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
chatbot clip computer-vision dino instruction-tuning large-language-models llms mllm multimodal-large-language-models representation-learning
Last synced: 30 Sep 2024
https://github.com/QIN2DIM/hcaptcha-challenger
🥂 Gracefully face hCaptcha challenge with MoE(ONNX) embedded solution.
clip computer-vision hcaptcha hcaptcha-solver image-segmentation multi-modal multi-modal-learning object-detection onnx onnx-models onnxruntime opencv-python playwright solver yolo yolov5 zero-shot-classification
Last synced: 31 Jul 2024
https://github.com/qin2dim/hcaptcha-challenger
🥂 Gracefully face hCaptcha challenge with MoE(ONNX) embedded solution.
clip computer-vision hcaptcha hcaptcha-solver image-segmentation multi-modal multi-modal-learning object-detection onnx onnx-models onnxruntime opencv-python playwright solver yolo yolov5 zero-shot-classification
Last synced: 30 Sep 2024
https://github.com/mbzuai-oryx/video-chatgpt
[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.
chatbot clip gpt-4 llama llava mulit-modal vicuna video-chatboat video-conversation vision-language vision-language-pretraining
Last synced: 27 Sep 2024
https://github.com/unum-cloud/uform
Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than OpenAI CLIP and LLaVA 🖼️ & 🖋️
bert clip clustering contrastive-learning cross-attention huggingface-transformers image-search language-vision llava multi-lingual multimodal neural-network openai openclip pretrained-models pytorch representation-learning semantic-search transformer vector-search
Last synced: 30 Sep 2024
https://github.com/EdVince/Stable-Diffusion-NCNN
Stable Diffusion in NCNN with c++, supported txt2img and img2img
android clip cpp diffusion executable img2img mnn ncnn onnx stable-diffusion tensorrt tnn txt2img
Last synced: 01 Aug 2024
https://github.com/haltakov/natural-language-image-search
Search photos on Unsplash using natural language
clip computer-vision image-search machine-learning photos unsplash
Last synced: 01 Aug 2024
https://github.com/haltakov/natural-language-youtube-search
Search inside YouTube videos using natural language
clip computer-vision machine-learning search youtube
Last synced: 31 Jul 2024
https://github.com/omerbt/Text2LIVE
Official Pytorch Implementation for "Text2LIVE: Text-Driven Layered Image and Video Editing" (ECCV 2022 Oral)
clip eccv2022 generative-model image-editing image-manipulation single-image single-video text-driven-editing text2live video-editing
Last synced: 31 Jul 2024
https://github.com/ArrowLuo/CLIP4Clip
An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
activitynet clip didemo lsmdc msrvtt msvd multimodal multimodal-learning multimodality ranking retrieval retrieval-model search video-clip-retrieval video-text-retrieval
Last synced: 01 Aug 2024
https://github.com/eps696/aphantasia
CLIP + FFT/DWT/RGB = text to image/video
clip text-to-image text-to-video
Last synced: 01 Aug 2024
https://github.com/hila-chefer/Transformer-MM-Explainability
[ICCV 2021- Oral] Official PyTorch implementation for Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decoder Transformers, a novel method to visualize any Transformer-based network. Including examples for DETR, VQA.
clip detr explainability explainable-ai interpretability lxmert transformer transformers visualbert visualization vqa
Last synced: 01 Aug 2024
https://github.com/Sense-GVT/DeCLIP
Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigm
big-model clip image-text multi-model self-supervised vision-language-pretraining zero-shot
Last synced: 01 Aug 2024
https://github.com/pablosichert/react-truncate
React component for truncating multi-line spans and adding an ellipsis.
Last synced: 01 Oct 2024
https://github.com/pengsongyou/openscene
[CVPR'23] OpenScene: 3D Scene Understanding with Open Vocabularies
3d-scene-understanding clip cvpr2023 llm matterport3d nuscenes point-cloud-segmentation point-clouds scannet semantic-segmentation
Last synced: 31 Jul 2024
https://github.com/v-iashin/video_features
Extract video features from raw videos using multiple GPUs. We support RAFT flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, and TIMM models.
audio-features clip feature-extraction i3d ig65m laion multi-gpu optical-flow parallel pytorch r2plus1d raft resnet s3d swin timm vggish video-features visual-features vit
Last synced: 01 Aug 2024
https://github.com/open-compass/vlmevalkit
Open-source evaluation toolkit of large vision-language models (LVLMs), support GPT-4v, Gemini, QwenVLPlus, 40+ HF models, 20+ benchmarks
chatgpt claude clip computer-vision evaluation gemini gpt gpt-4v gpt4 large-language-models llava llm multi-modal openai openai-api pytorch qwen vit vqa
Last synced: 02 Aug 2024
https://github.com/open-compass/VLMEvalKit
Open-source evaluation toolkit of large vision-language models (LVLMs), support GPT-4v, Gemini, QwenVLPlus, 40+ HF models, 20+ benchmarks
chatgpt claude clip computer-vision evaluation gemini gpt gpt-4v gpt4 large-language-models llava llm multi-modal openai openai-api pytorch qwen vit vqa
Last synced: 08 Aug 2024
https://github.com/Chrisvin/EasyReveal
Android Easy Reveal Library
android android-library clip easy easyreveal library reveal reveal-animations
Last synced: 01 Aug 2024
https://github.com/OpenGVLab/Instruct2Act
Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model
chatgpt clip llm robotics segment-anything
Last synced: 02 Aug 2024
https://github.com/opengvlab/instruct2act
Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model
chatgpt clip llm robotics segment-anything
Last synced: 02 Aug 2024
https://github.com/xmed-lab/CLIP_Surgery
CLIP Surgery for Better Explainability with Enhancement in Open-Vocabulary Tasks
clip explainability interpretability multilabel multimodal open-vocabulary sam segment-anything segmentation vision-transformer
Last synced: 31 Jul 2024
https://github.com/wisconsinaivision/vip-llava
[CVPR2024] ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts
chatbot clip cvpr2024 foundation-models gpt-4 gpt-4-vision llama llama2 llava multi-modal vision-language visual-prompting
Last synced: 27 Sep 2024
https://liruiw.github.io/gensim/
GenSim: Generating Robotic Simulation Tasks via Large Language Models
clip gpt-4 llm pybullet simulation
Last synced: 01 Aug 2024
https://github.com/yxuansu/MAGIC
Language Models Can See: Plugging Visual Controls in Text Generation
clip gpt-2 image-captioning multimodal plug-and-play-language-models story-generation text-generation unsupervised-learning zero-shot
Last synced: 02 Aug 2024
https://github.com/haofanwang/natural-language-joint-query-search
Search photos on Unsplash based on OpenAI's CLIP model, support search with joint image+text queries and attention visualization.
attention clip computer-vision image-retrieval image-search multi-modal-search unsplash visualizations
Last synced: 01 Aug 2024
https://github.com/chao1224/MoleculeSTM
Multi-modal Molecule Structure-text Model for Text-based Editing and Retrieval, Nat Mach Intell 2023 (https://www.nature.com/articles/s42256-023-00759-6)
clip computation-chemistry drug-discovery editing foundation-model molecule-editing moleculeclip moleculestm pretraining retrieval
Last synced: 03 Aug 2024
https://github.com/mbzuai-oryx/videogpt-plus
Official Repository of paper VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding
chatbot clip dual-encoder gpt4 gpt4o image-encoder llama3 llava multimodal phi-3-mini vicuna video-chatbot video-conversation video-encoder vision-language vision-language-pretraining
Last synced: 27 Sep 2024
https://github.com/josephrocca/clip-image-sorter
Sort a folder of images according to their similarity with provided text in your browser (uses a browser-ported version of OpenAI's CLIP model and the web's new File System Access API)
clip file-system-access-api openai openai-clip
Last synced: 01 Aug 2024
https://github.com/Shishkebaboo/VodRecovery
The purpose of this script is to obtain videos or clips that are either marked as "sub-only" or have been deleted on Twitch.
broadcast clip clips commad-line commandline console development ffmpeg live m3u8 m3u8-playlist m3u8-videos mp4 python recover twitch twitchclips twitchtv vodrecovery
Last synced: 06 Aug 2024
https://github.com/Imageomics/bioclip
This is the repository for the BioCLIP model and the TreeOfLife-10M dataset [CVPR'24 Oral, Best Student Paper].
clip computer-vision imageomics knowledge-guided-machine-learning taxonomy
Last synced: 01 Aug 2024
https://github.com/HFAiLab/clip-gen
CLIP-GEN: Language-Free Training of a Text-to-Image Generator with CLIP
clip pytorch text-to-image text2image
Last synced: 01 Aug 2024
https://github.com/minimaxir/imgbeddings
Python package to generate image embeddings with CLIP without PyTorch/TensorFlow
ai clip embeddings image-processing images onnx transformers
Last synced: 03 Aug 2024
https://github.com/miccunifi/SEARLE
[ICCV 2023] - Zero-shot Composed Image Retrieval with Textual Inversion
circo cirr clip composed-image-retrieval fashion-iq knowledge-distillation multimodal-learning pytorch textual-inversion
Last synced: 01 Aug 2024
https://github.com/eddieoz/youtube-clips-automator
MARCELO: an AI powered bot to automate the editing and thumbnail creation for your Youtube clips channel
ai audio-processing automation bot clip computer-vision editing thumbnail video video-processing youtube
Last synced: 04 Aug 2024
https://github.com/DRSY/MoTIS
[NAACL 2022]Mobile Text-to-Image search powered by multimodal semantic representation models(e.g., OpenAI's CLIP)
ai clip cross-modal image-search ios-swift k-means k-means-clustering knn knowledge-distillation lsh naacl random-projection retrieval semantic-search vector-search
Last synced: 03 Aug 2024
https://github.com/ylqi/Count-Anything
This method uses Segment Anything and CLIP to ground and count any object that matches a custom text prompt, without requiring any point or box annotation.
clip count-anything segment-anything
Last synced: 28 Aug 2024
https://github.com/Ajatt-Tools/videoclip
🍗 Easily create videoclips with mpv.
addon ajatt audioclip clip mpv mpv-script videoclip
Last synced: 04 Aug 2024
https://github.com/salesforce/MUST
PyTorch code for MUST
clip masked-image-modeling self-training unsupervised-learning zero-shot-classification zero-shot-learning
Last synced: 03 Aug 2024
https://github.com/aerobounce/trim.lua
Trim mode for mpv — Turn mpv into Lossless Audio / Video Editor
clip concat ffmpeg lossless lua lua-script mpv mpv-script trim video video-editor video-processing
Last synced: 04 Aug 2024
https://github.com/sajjjadayobi/CLIPfa
CLIPfa: Connecting Farsi Text and Images
clip farsi farsi-datasets image-search openai-clip persian-nlp zero-shot-learning
Last synced: 04 Aug 2024
https://github.com/pansyjs/video-editing-timeline
Timeline for video editing(为视频编辑而写时间线)
clip cut editing timeline video video-clip video-cut video-editing
Last synced: 29 Sep 2024
https://github.com/d-tsuji/clipboard
(Experimental) Go library for multi-platform clipboard.
clip clipboard clipboard-copy clipboard-library go golang library linux macos multiplatform windows
Last synced: 02 Aug 2024
https://github.com/AgentMaker/Paddle-CLIP
A PaddlePaddle version implementation of CLIP of OpenAI.
Last synced: 03 Aug 2024
https://github.com/hv0905/nekoimagegallery
An AI-powered natural language & reverse Image Search Engine powered by CLIP & qdrant.
clip computer-vision image-search image-search-engine search-engine transformers
Last synced: 01 Oct 2024
https://github.com/amiechen/twitch-batch-loader
Batch download twitch clips in terminal
batch clip python twitch twitch-api
Last synced: 06 Aug 2024
https://github.com/yingchen001/CF-CLIP
[ACM MM 2022] Towards Counterfactual Image Manipulation via CLIP
clip image-edit manipulation sytlegan
Last synced: 01 Aug 2024
https://github.com/TheoCoombes/crawlingathome
A client library for LAION's effort to filter CommonCrawl with CLIP, building a large scale image-text dataset.
clip dall-e dataset dataset-generation image-text machine-learning
Last synced: 01 Aug 2024
https://github.com/wangrongding/WebCut
🎬 基于 web 端的音视频编辑器。(A web-based audio and video editor.)
audio audio-editor audio-processing clip cut video video-editor video-processing wasm webcodecs
Last synced: 31 Jul 2024
https://github.com/blaugold/circular_clip_route
A Flutter package which provides a page route which reveals its page by expanding a circular clip.
animation circular clip flutter page route transition ui
Last synced: 02 Oct 2024
https://github.com/koldim2001/youtube_frame_searcher
Youtube video moment searcher by text or photo
clip clip-model cosine-similarity searching-algorithms streamlit video-search webapp youtube youtube-dl youtube-search
Last synced: 28 Sep 2024
https://github.com/scatalfo/rlivestreamfaildaily
A python bot that downloads, edits, and uploads a compilation of the daily top posts of LivestreamFail to a YouTube channel.
clip moviepy python reddit subreddit twitch-dl video youtube-channel
Last synced: 28 Sep 2024
https://github.com/lalitsinghrana/dynamic-middle-ellipsis
Framework agnostic code to dynamically truncate long text in the center.
abbreviate clip dynamic middle-ellipsis react text-overflow truncate-text vanilla-js web-components
Last synced: 30 Sep 2024
https://github.com/rainydevzz/berryvine
Libre Media/Clip Sharing Software
bun clip clips css elysia handlebars html media media-services sqlite
Last synced: 30 Sep 2024
https://github.com/phronmophobic/clip.clj
CLIP Embeddings for images and text. A clojure wrapper for clip.cpp.
Last synced: 28 Sep 2024
https://github.com/thomrl/instaget
Easily save instagram video/pictures, youtube thumbnails and twitch clips
clip gallery instagram instagram-photos instagram-scraper instagram-video picture pyperclip python-3-5 python3 regex twitch twitch-clips youtube-thumbnail youtube-thumbnails
Last synced: 01 Oct 2024
https://github.com/qin2dim/clip2onnx
Export CLIP to ONNX
clip clip-onnx onnx onnxruntime
Last synced: 01 Oct 2024
https://github.com/grkndev/twitcher
A great library that will allow you to use the Twitch API service. All you need to do is use your Token and Client Id information.
api clip clipr data javascript nodejs npm npm-package npmjs streamers streaming twitch twitch-api twitch-bot twitchtv twtich-clip user
Last synced: 28 Sep 2024
https://github.com/o770/mpvclip-mpvival-mpvloop
Run MPV with options for loops and playback start and end times set according to a pattern or intervals of the total duration of a file.
audio bash-script clip loop mpv mpv-script multimedia playback video
Last synced: 04 Aug 2024
https://github.com/cbhihe/nlp_clip-bleu-meteor
Python Implementation of lexical vector embedding similarity scoring, zero-shot classification of images and n-gram based scoring to compare textual summaries
bimodal bleu clip deep-learning image-text-matching meteor n-grams nlp-machine-learning nltk python rouge scoring-algorithm vector-embeddings
Last synced: 28 Sep 2024
https://github.com/jannesgithub/image-recognition-pos-system
A Windows desktop application that scans products using image recognition and includes all the features of a point-of-sale system.
clip mvvm-architecture possystem wpf
Last synced: 27 Sep 2024