Projects in Awesome Lists tagged with mscoco
A curated list of projects in awesome lists tagged with mscoco .
https://github.com/microsoft/swin-transformer
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
ade20k image-classification imagenet mask-rcnn mscoco object-detection semantic-segmentation swin-transformer
Last synced: 13 May 2025
https://github.com/microsoft/Swin-Transformer
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
ade20k image-classification imagenet mask-rcnn mscoco object-detection semantic-segmentation swin-transformer
Last synced: 16 Mar 2025
https://github.com/sgrvinod/a-pytorch-tutorial-to-image-captioning
Show, Attend, and Tell | a PyTorch Tutorial to Image Captioning
attention-mechanism computer-vision encoder-decoder image-captioning mscoco pytorch pytorch-tutorial show-attend-and-tell
Last synced: 08 Oct 2025
https://github.com/apple/ml-cvnets
CVNets: A library for training computer vision networks
ade20k classification computer-vision deep-learning detection imagenet machine-learning mscoco pascal-voc pytorch segmentation
Last synced: 15 May 2025
https://github.com/SwinTransformer/Swin-Transformer-Object-Detection
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows" on Object Detection and Instance Segmentation.
cascade mask-rcnn mscoco object-detection reppoints swin swin-transformer
Last synced: 27 Mar 2025
https://github.com/swintransformer/swin-transformer-object-detection
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows" on Object Detection and Instance Segmentation.
cascade mask-rcnn mscoco object-detection reppoints swin swin-transformer
Last synced: 06 Oct 2025
https://github.com/peteanderson80/bottom-up-attention
Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
caffe captioning-images faster-rcnn image-captioning mscoco mscoco-dataset visual-question-answering vqa
Last synced: 08 Apr 2025
https://github.com/jdai-cv/cotnet
This is an official implementation for "Contextual Transformer Networks for Visual Recognition".
contextual-transformer cotnet image-classification imagenet instance-segmentation mask-rcnn mscoco object-detection semantic-segmentation vision-transformer
Last synced: 05 Apr 2025
https://github.com/vitae-transformer/vitae-transformer
The official repo for [NeurIPS'21] "ViTAE: Vision Transformer Advanced by Exploring Intrinsic Inductive Bias" and [IJCV'22] "ViTAEv2: Vision Transformer Advanced by Exploring Inductive Bias for Image Recognition and Beyond"
ade20k deep-learning imagenet imagenet-classification mscoco object-detection semantic-segmentation vision-transformer vitae-transformer
Last synced: 09 Apr 2025
https://github.com/hustvl/bmaskr-cnn
[ECCV 2020] Boundary-preserving Mask R-CNN
boundary-detection detectron detectron2 faster-rcnn instance-segmentation mask-rcnn maskrcnn mscoco object-detection
Last synced: 21 Aug 2025
https://github.com/peteanderson80/spice
Semantic Propositional Image Caption Evaluation
captioning-images image-captioning mscoco
Last synced: 15 Apr 2025
https://github.com/hrnet/hrnet-fcos
High-resolution Networks for the Fully Convolutional One-Stage Object Detection (FCOS) algorithm
fcos hrnets mscoco object-detection
Last synced: 12 Apr 2025
https://github.com/lightly-ai/labelformat
A tool for converting computer vision label formats.
annotation bounding-boxes kitti labels mscoco object-detection pascal-voc yolo yolov8
Last synced: 05 Apr 2025
https://github.com/peteanderson80/coco-caption
Adds SPICE metric to coco-caption evaluation server codes
captioning-images image-captioning mscoco mscoco-dataset mscoco-image-dataset spice
Last synced: 20 Aug 2025
https://github.com/howardyclo/imagenet2coco
A demo for mapping class labels from ImageNet to COCO.
deep-learning detection few-shot-learning imagenet imagenet-dataset mscoco mscoco-dataset one-shot-learning zero-shot-learning
Last synced: 12 Oct 2025
https://github.com/jakarto3d/jakarnotator
The Jakarnotator is an annotation tool to create your own database for instance segmentation problem.
annotations computer-vision data database deep-learning detectron instance-segmentation mscoco training-data
Last synced: 15 May 2025
https://github.com/buaadreamer/ccrk
[KDD 2024] Improving the Consistency in Cross-Lingual Cross-Modal Retrieval with 1-to-K Contrastive Learning
cross-lingual cross-lingual-retrieval cross-modal cross-modal-retrieval iglue image-text-retrieval image-text-search kdd2024 mscoco multi30k retrieval swin-transformer vision-language-pretraining wit xflickrco xlm-roberta
Last synced: 11 Apr 2025
https://github.com/shunk031/huggingface-datasets_cocoa
COCOA: Semantic Amodal Segmentation for huggingface datasets
bsds cocoa huggingface huggingface-datasets mscoco semantic-segmentation
Last synced: 06 Feb 2026
https://github.com/shunk031/huggingface-datasets_mscoco
Microsoft COCO: Common Objects in Context for huggingface datasets
caption-generation huggingface-datasets instance-segmentation keypoint-detection microsoft-coco mscoco mscoco-dataset object-detection semantic-segmentation
Last synced: 30 Apr 2025
https://github.com/waikato-ufdl/wai-annotations
Python library for converting annotated datasets into various formats (e.g., image classification, object detection and speech datasets).
common-voice conversion deepspeech festvox image-annotation mscoco python3 tfrecords vgg
Last synced: 29 Jul 2025
https://github.com/maximumentropy/ift6266
Inpainting on MSCOCO
inpainting mscoco wasserstein-gan
Last synced: 04 Sep 2025
https://github.com/pnkvalavala/image-captioning
Image Caption Generator using a Pretrained ResNet-50 and an LSTM architecture. Trained on COCO 2017 dataset, it's accessible via a Streamlit app.
computer-vision deep-learning image-captioning lstm mscoco python pytorch resnet streamlit
Last synced: 13 May 2026
https://github.com/k2-gc/object-detection-format-converter
Object Detection Dataset Format Converter
dataset-converter deep-learning kitti kitti-dataset mscoco mscoco-dataset object-detection object-detection-datasets pascal-voc pascal-voc-dataset python python3 yolo yolo-dataset
Last synced: 25 Nov 2025
https://github.com/shunk031/huggingface-datasets_cocoapi-tools
A helper library for easily converting MSCOCO format data using the loading script of huggingface datasets.
huggingface-datasets mscoco mscoco-dataset
Last synced: 07 Mar 2026
https://github.com/shunk031/huggingface-datasets_cocostuff
COCO-Stuff dataset for huggingface datasets
coco-stuff huggingface huggingface-datasets ms-coco mscoco mscoco-dataset
Last synced: 15 May 2026
https://github.com/stiliyandr/neural-image-caption
A simple Python API (built on top of TensorFlow) for neural image captioning with MSCOCO data.
image-captioning mscoco mscoco-dataset neural-networks python-3 tensorflow2
Last synced: 12 Feb 2026