An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with mscoco

A curated list of projects in awesome lists tagged with mscoco .

https://github.com/microsoft/swin-transformer

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

ade20k image-classification imagenet mask-rcnn mscoco object-detection semantic-segmentation swin-transformer

Last synced: 13 May 2025

https://github.com/microsoft/Swin-Transformer

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

ade20k image-classification imagenet mask-rcnn mscoco object-detection semantic-segmentation swin-transformer

Last synced: 16 Mar 2025

https://github.com/SwinTransformer/Swin-Transformer-Object-Detection

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows" on Object Detection and Instance Segmentation.

cascade mask-rcnn mscoco object-detection reppoints swin swin-transformer

Last synced: 27 Mar 2025

https://github.com/swintransformer/swin-transformer-object-detection

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows" on Object Detection and Instance Segmentation.

cascade mask-rcnn mscoco object-detection reppoints swin swin-transformer

Last synced: 06 Oct 2025

https://github.com/peteanderson80/bottom-up-attention

Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome

caffe captioning-images faster-rcnn image-captioning mscoco mscoco-dataset visual-question-answering vqa

Last synced: 08 Apr 2025

https://github.com/jdai-cv/cotnet

This is an official implementation for "Contextual Transformer Networks for Visual Recognition".

contextual-transformer cotnet image-classification imagenet instance-segmentation mask-rcnn mscoco object-detection semantic-segmentation vision-transformer

Last synced: 05 Apr 2025

https://github.com/vitae-transformer/vitae-transformer

The official repo for [NeurIPS'21] "ViTAE: Vision Transformer Advanced by Exploring Intrinsic Inductive Bias" and [IJCV'22] "ViTAEv2: Vision Transformer Advanced by Exploring Inductive Bias for Image Recognition and Beyond"

ade20k deep-learning imagenet imagenet-classification mscoco object-detection semantic-segmentation vision-transformer vitae-transformer

Last synced: 09 Apr 2025

https://github.com/peteanderson80/spice

Semantic Propositional Image Caption Evaluation

captioning-images image-captioning mscoco

Last synced: 15 Apr 2025

https://github.com/hrnet/hrnet-fcos

High-resolution Networks for the Fully Convolutional One-Stage Object Detection (FCOS) algorithm

fcos hrnets mscoco object-detection

Last synced: 12 Apr 2025

https://github.com/lightly-ai/labelformat

A tool for converting computer vision label formats.

annotation bounding-boxes kitti labels mscoco object-detection pascal-voc yolo yolov8

Last synced: 05 Apr 2025

https://github.com/peteanderson80/coco-caption

Adds SPICE metric to coco-caption evaluation server codes

captioning-images image-captioning mscoco mscoco-dataset mscoco-image-dataset spice

Last synced: 20 Aug 2025

https://github.com/jakarto3d/jakarnotator

The Jakarnotator is an annotation tool to create your own database for instance segmentation problem.

annotations computer-vision data database deep-learning detectron instance-segmentation mscoco training-data

Last synced: 15 May 2025

https://github.com/shunk031/huggingface-datasets_cocoa

COCOA: Semantic Amodal Segmentation for huggingface datasets

bsds cocoa huggingface huggingface-datasets mscoco semantic-segmentation

Last synced: 06 Feb 2026

https://github.com/waikato-ufdl/wai-annotations

Python library for converting annotated datasets into various formats (e.g., image classification, object detection and speech datasets).

common-voice conversion deepspeech festvox image-annotation mscoco python3 tfrecords vgg

Last synced: 29 Jul 2025

https://github.com/maximumentropy/ift6266

Inpainting on MSCOCO

inpainting mscoco wasserstein-gan

Last synced: 04 Sep 2025

https://github.com/pnkvalavala/image-captioning

Image Caption Generator using a Pretrained ResNet-50 and an LSTM architecture. Trained on COCO 2017 dataset, it's accessible via a Streamlit app.

computer-vision deep-learning image-captioning lstm mscoco python pytorch resnet streamlit

Last synced: 13 May 2026

https://github.com/shunk031/huggingface-datasets_cocoapi-tools

A helper library for easily converting MSCOCO format data using the loading script of huggingface datasets.

huggingface-datasets mscoco mscoco-dataset

Last synced: 07 Mar 2026

https://github.com/stiliyandr/neural-image-caption

A simple Python API (built on top of TensorFlow) for neural image captioning with MSCOCO data.

image-captioning mscoco mscoco-dataset neural-networks python-3 tensorflow2

Last synced: 12 Feb 2026