Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Projects in Awesome Lists tagged with video-captioning
A curated list of projects in awesome lists tagged with video-captioning .
https://github.com/yehli/xmodaler
X-modaler is a versatile and high-performance codebase for cross-modal analytics(e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsense reasoning, and cross-modal retrieval).
cross-modal-retrieval image-captioning pretraining tden video-captioning vision-and-language visual-question-answering
Last synced: 30 Sep 2024
https://github.com/YehLi/xmodaler
X-modaler is a versatile and high-performance codebase for cross-modal analytics(e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsense reasoning, and cross-modal retrieval).
cross-modal-retrieval image-captioning pretraining tden video-captioning vision-and-language visual-question-answering
Last synced: 01 Aug 2024
https://github.com/xiadingZ/video-caption.pytorch
pytorch implementation of video captioning
deep-learning pytorch video-captioning
Last synced: 03 Aug 2024
https://github.com/scopeInfinity/Video2Description
Video to Text: Natural language description generator for some given video. [Video Captioning]
audio-processing cnn-keras deep-neural-networks image-captioning lstm-neural-networks video-captioning video-processing video-to-text
Last synced: 01 Aug 2024
https://github.com/tomchang25/whisper-auto-transcribe
Auto transcribe tool based on whisper
asr deep-learning gradio gradio-interface language-model pytorch speech-processing speech-recognition speech-to-text text-to-speech video-captioning voice-activity-detection
Last synced: 04 Aug 2024
https://github.com/ParitoshParmar/MTL-AQA
What and How Well You Performed? A Multitask Learning Approach to Action Quality Assessment [CVPR 2019]
action-quality-assessment action-recognition c3d captioning dilated-c3d dilated-convolution fine-grained-action-recognition fine-grained-classification lstm mtl-aqa multitask-learning pytorch representation-learning video-captioning video-processing video-understanding
Last synced: 01 Aug 2024