https://github.com/bbenligiray/ms_coco_formatter

# MS COCO Formatter

A tool to download and format the MS COCO dataset for multilabel image classification.

It outputs a .h5 file that contains the following:

* data_types: 'train' and 'val'
* cats: names of the 80 categories

For each data type (replace x with 'train' or 'val'), it also contains:

* x_images: flattened images (not preprocessed, except that the few grayscale images have been converted to RGB)
* x_shapes: shapes of the images, used to reshape the flattened images
* x_names: file names of the images
* x_label: a binary integer vector of labels with one entry per category (multi-hot, since an image can have several labels)
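
As a rough illustration of how these fields fit together, here is a minimal Python sketch for reading one sample back. It is not part of this tool. It assumes the file is opened with `h5py`, that images were flattened in NumPy's default row-major order, that each shape is stored as (height, width, channels), and that the dataset keys are spelled exactly as listed above. The helper names (`restore_image`, `label_names`, `load_sample`) are hypothetical.

```python
import numpy as np


def _to_str(name):
    # HDF5 strings are often read back as bytes; decode them if so.
    return name.decode("utf-8") if isinstance(name, bytes) else str(name)


def restore_image(flat, shape):
    """Reshape one flattened image back to its stored shape.

    Assumes row-major (C order) flattening, which is NumPy's default.
    """
    return np.asarray(flat).reshape(tuple(int(s) for s in shape))


def label_names(label, cats):
    """Return the category names whose entry in the label vector is nonzero."""
    return [_to_str(cats[i]) for i in np.flatnonzero(np.asarray(label))]


def load_sample(path, data_type="train", index=0):
    """Read one image, its file name, and its label names from the .h5 file.

    The key names below follow the list in this README; adjust them if the
    file you generated uses different ones.
    """
    import h5py  # imported here so the helpers above work without h5py

    with h5py.File(path, "r") as f:
        image = restore_image(f[data_type + "_images"][index],
                              f[data_type + "_shapes"][index])
        name = _to_str(f[data_type + "_names"][index])
        labels = label_names(f[data_type + "_label"][index], f["cats"][:])
    return image, name, labels
```

A call such as `image, name, labels = load_sample("ms_coco.h5", "val", 0)` would then return the reshaped image array, its file name, and the list of category names present in it. The file name `ms_coco.h5` is a placeholder for whatever path the tool wrote to.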