Projects in Awesome Lists tagged with caption
A curated list of projects in awesome lists tagged with caption.
https://github.com/idea-research/grounded-segment-anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
3d-whole-body-pose-estimation automatic-labeling-system caption data-generation image-editing open-vocabulary-detection open-vocabulary-segmentation speech
Last synced: 23 Apr 2025
https://github.com/IDEA-Research/Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
3d-whole-body-pose-estimation automatic-labeling-system caption data-generation image-editing open-vocabulary-detection open-vocabulary-segmentation speech
Last synced: 20 Mar 2025
https://github.com/chenyuntc/pytorch-book
PyTorch tutorials and fun projects including neural talk, neural style, poem writing, anime generation (《深度学习框架PyTorch:入门与实战》, "Deep Learning Framework PyTorch: Introduction and Practice")
autograd caption charrnn deep-learning gan image-classification jupyter-notebook neural-style neuraltalk nn pytorch pytorch-tutorials pytorch-tutorials-cn tensor tensorboard visdom
Last synced: 13 May 2025
https://github.com/smacke/ffsubsync
Automagically synchronize subtitles with video.
alignment audio caption captions fast-fourier-transform ffmpeg fft speech-detection srt srt-subtitles string-alignment subtitle subtitles sync synchronization vad video vlc vlc-media-player voice-activity-detection
Last synced: 29 Dec 2025
https://github.com/gielcobben/caption
Get Caption, start watching.
app caption electron electron-app macos subtitle subtitles
Last synced: 16 May 2025
https://github.com/kartik-v/bootstrap-star-rating
A simple yet powerful jQuery star rating plugin with fractional rating support.
bootstrap-star-rating caption change-stars css css-styles fractional javascript jquery rate-control rating rtl star star-symbols
Last synced: 14 May 2025
https://github.com/microsoft/univl
An official implementation for "UniVL: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation"
alignment caption caption-task coin joint localization msrvtt multimodal-sentiment-analysis multimodality pretrain pretraining retrieval-task segmentation video video-language video-text video-text-retrieval youcookii
Last synced: 05 Apr 2025
https://github.com/yezz123/soig
OSINT Tool gets a range of information from an Instagram account 🛠
api caption information-gathering instagram instagram-osint osint osint-python osint-tool picture-url profile python python-3 python3
Last synced: 01 Apr 2026
https://github.com/gojiplus/tuber
:sweet_potato: Access YouTube from R
access-youtube caption video youtube youtube-api youtube-oauth
Last synced: 30 Jul 2025
https://github.com/xqq/libaribcaption
Portable ARIB STD-B24 Caption Decoder/Renderer
arib aribb24 caption closedcaption cpp dtv ffmpeg
Last synced: 05 Apr 2025
https://github.com/nopol10/nekocap
Browser extension for creating & uploading community captions for YouTube, niconico and other video sharing sites.
caption captions editor extension substation-alpha subtitle subtitles typescript
Last synced: 16 Jan 2026
https://github.com/richardaecn/cvpr18-caption-eval
Learning to Evaluate Image Captioning. CVPR 2018
caption computer-vision cvpr2018 deep-learning evaluation-metrics image-captioning tensorflow
Last synced: 01 Mar 2026
https://github.com/labbeti/aac-datasets
Audio Captioning datasets for PyTorch.
audio audio-captioning caption captioning dataset datasets deep-learning pytorch
Last synced: 06 Oct 2025
https://github.com/kacky24/stylenet
A PyTorch implementation of "StyleNet: Generating Attractive Visual Captions with Styles"
caption cnn dlhacks image-captioning lstm microsoft pytorch pytorch-implemention stylenet
Last synced: 13 Apr 2025
https://github.com/ericmagnuson/captionjs
An open-source jQuery plugin to easily and semantically add captions to images.
accessibility caption captions css javascript js schema
Last synced: 07 May 2025
https://github.com/div99/image-captioning
Image Captioning with Keras
attention caption caption-generator flickr8k-dataset image-captioning keras
Last synced: 14 Apr 2025
https://github.com/danactive/history
Your personal photos/videos history story boarded and scrapbooked with albums and galleries. Associate media with meta data including geocode, caption, descriptions, articles...
album caption gallery geocode map photo photo-album photo-albums photo-book photo-gallery photo-manager photo-mapping photos video
Last synced: 05 Mar 2026
https://github.com/gielcobben/caption-core
Caption Core acts as an abstraction layer for Caption’s core functionality.
caption javascript nodejs npm package subtitles
Last synced: 14 Apr 2025
https://github.com/linto-ai/linto-studio
Transcription and annotation interface for recorded audio or video files
asr audio-transcription caption captioning-videos stt subtitle subtitles transcription-edition video-transcription virtual-scribe
Last synced: 12 Apr 2025
https://github.com/botbahlul/pyvosklivesubtitle
PySimpleGUI based DESKTOP APP that can RECOGNIZE any live streaming in 23 languages that are supported by VOSK, then TRANSLATE it (using unofficial online Google Translate API) and display it as LIVE CAPTION / LIVE SUBTITLE
auto-caption caption ffmpeg google-translate-api live-caption live-subtitle pysimplegui python speech-recognition speechrecognition subtitle voice-recognition voicerecognition vosk
Last synced: 27 Jul 2025
https://github.com/botbahlul/whisper_autosrt
A python script COMMAND LINE utility to AUTO GENERATE SUBTITLE FILE (using faster_whisper module which is a reimplementation of OpenAI Whisper module) and TRANSLATED SUBTITLE FILE (using unofficial online Google Translate API) for any video or audio file
auto-caption auto-subtitle caption faster-whisper ffmpeg google-translate-api openai openai-whisper python speech-recognition speechrecognition subtitle voice-recognition voicerecognition whisper
Last synced: 23 Oct 2025
https://github.com/Kav-K/Described
Automatically describe images sent by users on popular media platforms, incredibly useful for the visually impaired and for complicated imagery.
asyncio caption captions chatbot chatgpt describe-image discord gpt gpt4 help-wanted learn multimodal python
Last synced: 24 Mar 2025
https://github.com/kav-k/described
Automatically describe images sent by users on popular media platforms, incredibly useful for the visually impaired and for complicated imagery.
asyncio caption captions chatbot chatgpt describe-image discord gpt gpt4 help-wanted learn multimodal python
Last synced: 05 Oct 2025
https://github.com/wayou/hexo-image-caption
Add captions to images within posts
Last synced: 21 Mar 2025
https://github.com/botbahlul/android-autosrt-v2
ANDROID APP to AUTO GENERATE SUBTITLE FILE and TRANSLATED SUBTITLE FILE (using unofficial online Google Translate API) for any audio/video files using 2 ACTIVITIES
android caption chaquopy ffmpeg google-translate-api googletranslate java python speech-recognition speech-to-text speechrecognition subtitle voice-recognition voice-to-text voicerecognition
Last synced: 19 Aug 2025
https://github.com/baraja-core/doctrine-fulltext-search
Smart Doctrine search engine with ranking system.
algorithm candidates caption database-search dic-container did-you-mean doctrine doctrine-fulltext-search doctrine-search entity fulltext fulltext-search mysql-search overridden perex php-search php-search-engine search search-engine smart
Last synced: 09 Apr 2025
https://github.com/gielcobben/getcaption.co
Website for Caption app.
caption javascript nextjs react website
Last synced: 14 Apr 2025
https://github.com/jacky-xbb/shortcap
Add automatic captions to short videos (YouTube Shorts, TikTok) using AI speech recognition. Fast, customizable, and easy to use.
Last synced: 09 Mar 2026
https://github.com/naivehobo/smart-i
Smart-I is an android application aimed at helping the visually impaired using artificial intelligence and cloud computing.
andorid android android-app android-application caption captioning captioning-images captions cloud cloud-computing deep-learning deep-neural-networks image-recognition visualization
Last synced: 07 May 2025
https://github.com/aliencube/youtube-subtitles-extractor
This is the library to retrieve subtitles from a given YouTube video, inspired by https://github.com/devhims/youtube-caption-extractor
caption hacktoberfest subtitle video youtube
Last synced: 29 Jul 2025
https://github.com/botbahlul/vosk_autosrt
A python script COMMAND LINE utility to AUTO GENERATE SUBTITLE FILE (using free Vosk Speech Recognition API) and TRANSLATED SUBTITLE FILE (using unofficial online Google Translate API) for any video or audio file
auto-caption auto-subtitle caption ffmpeg google-translate-api python speech-recognition speechrecognition subtitle voice-recognition voicerecognition vosk
Last synced: 28 Oct 2025
https://github.com/vaenow/crx-coursera-caption
A friendly UI layout for Coursera captions that also translates captions from English into Chinese
caption coursera coursera-caption layout translate ui
Last synced: 08 May 2025
https://github.com/aasim-a/yt-xml2srt
Easily convert YouTube caption format from XML to SRT with zero dependencies
caption convert converter format javascript node nodejs srt xml xml2srt youtube youtube-caption youtube-xml yt-xml2srt
Last synced: 18 Aug 2025
https://github.com/aasim-a/yt-xml2vtt
Easily convert YouTube caption format from XML to VTT with zero dependencies
caption convert converter format javascript node nodejs vtt xml xml2vtt youtube youtube-caption youtube-xml yt-xml2vtt
Last synced: 18 Aug 2025
https://github.com/imdeep2905/tweet-bot
This is a fun little project created in Python. It fetches a random image and caption, then posts them on Twitter using tweepy.
caption random randomimage twitter-api twitter-bot
Last synced: 15 May 2025
https://github.com/nopol10/nekocap-server
The server used by NekoCap
caption captions extension parse-server subtitle subtitles typescript
Last synced: 16 Jan 2026
https://github.com/leeyunjai/image2text
caption generator using lavis and argostranslate
blip2 caption caption-generation caption-generator captioning-images captions image-analysis image-text img2txt
Last synced: 03 Jul 2025
https://github.com/botbahlul/rendersrt
rendersrt is a simple command line tool made with Python to render a subtitle file into any video file
caption ffmpeg python srt-subtitle subriptext subtitle subtitle-render
Last synced: 08 Oct 2025
https://github.com/h-hg/docsify-image-caption
A plugin to show image captions in docsify.
caption center docsify docsify-plugin image title
Last synced: 13 Apr 2025
https://github.com/agarzon/ollama-image-caption
caption flux llava ollama stable-diffusion
Last synced: 09 Mar 2026
https://github.com/ayx03/translation
This repository holds translation works that I completed on my own or in collaboration with others. Don't hesitate to open an issue if you find any imperfections.
caption kotlin subtitle subtitles translation
Last synced: 08 Mar 2026
https://github.com/prithivsakthiur/imgscope-ocr-2b-0527
The Imgscope-OCR-2B-0527 model is a fine-tuned version of Qwen2-VL-2B-Instruct, specifically optimized for messy handwriting recognition, document OCR, realistic handwritten OCR, and math problem solving with LaTeX formatting. This model is trained on custom datasets for document and handwriting OCR tasks and textual understanding
caption gradio huggingface-transformers llms ocr ollama-gui py qwen2-vl video-processing vlm
Last synced: 08 Aug 2025
https://github.com/gusanmaz/twcapbot
Tweet Caption Bot and CLI.
caption captioning-images cli twitter twitter-api twitter-bot
Last synced: 26 Dec 2025
https://github.com/superbderrick/smiparser
SMI(Caption) Parser Library
android-library caption ios-lib library
Last synced: 22 Aug 2025
https://github.com/neemiasbsilva/minigpt4-image-caption-generation
Streamline the creation of supervised datasets to facilitate data augmentation for deep learning architectures focused on image captioning. The core framework leverages MiniGPT-4, complemented by the pre-trained Vicuna model, which boasts 13 billion parameters.
caption image-caption-generator minigpt4
Last synced: 10 Aug 2025
https://github.com/garylab/audio-subtitler
Extract VTT and SRT subtitles using fast whisper
caption captions fast-whisper srt subtitle vtt whisper
Last synced: 12 Jan 2026
https://github.com/spacesoda/youtube2txt
youtube2txt / YouTube to TXT / YouTube Transcript Grabber — a tool to extract clean transcripts from any YouTube video!
caption localhost subtitle transcript youtube
Last synced: 21 Jan 2026
https://github.com/adrianbj/imagetextcaption
ProcessWire module for automatically creating image captions from the image description field and displayed when an image is inserted in a RTE field.
caption image processwire rte-field
Last synced: 27 Feb 2026
https://github.com/lorhansohaky/commitmessageintosubtitle
Project that generates automatic captions for Gource videos.
caption commit commit-message-into-caption commit-message-into-subtitle git gource message subtitle svn
Last synced: 26 Mar 2025
https://github.com/vaenow/chromeless-coursera-caption
Chromeless crawler for Coursera video captions / subtitles
caption chromeless coursera crawler crx subtitle
Last synced: 31 Mar 2025
https://github.com/rattletat/caption-party
Topic modeling of German parties based on YouTube video data
caption command-line-tool natural-language-processing party politics subtitles topic-modeling youtube-api youtube-dl
Last synced: 14 Apr 2026
https://github.com/sheng1111/text2srt_tts
Convert text to speech and auto-generate SRT subtitles. A CLI tool for creating synced audio and captions from plain text using multilingual TTS.
audiobook batch-processing caption multilingual speech-synthesis srt subtitle-generator text-to-speech tts video-captioning
Last synced: 05 Oct 2025
https://github.com/sourceduty/image_caption
🖼️ Add captions to images using Python.
ai ai-caption ai-picture ai-tool artificial-intelligence caption captions chatgpt custom-gpt gpt image-tool images openai photos picture-ai pictures tool
Last synced: 08 Aug 2025
https://github.com/goatandsheep/transcript-cc
Create an imprecisely-timed closed captions file based on a text transcript
Last synced: 17 Jan 2026