https://github.com/bbenligiray/ms_coco_formatter
A tool to download and format MS COCO dataset for multilabel classification
- Host: GitHub
- URL: https://github.com/bbenligiray/ms_coco_formatter
- Owner: bbenligiray
- Created: 2017-07-15T18:54:22.000Z (almost 9 years ago)
- Default Branch: master
- Last Pushed: 2018-06-11T11:23:45.000Z (about 8 years ago)
- Last Synced: 2025-06-24T22:41:38.766Z (12 months ago)
- Topics: dataset-manager, image-classification, ms-coco, multilabel-classification
- Language: Python
- Size: 358 KB
- Stars: 5
- Watchers: 1
- Forks: 1
- Open Issues: 0
Metadata Files:
- Readme: README.md
# MS COCO Formatter
A tool to download and format the MS COCO dataset for multilabel image classification.
It outputs a .h5 file that contains the following:
* data_types: 'train' and 'val'
* cats: names of the 80 categories

For each data type (replace x with 'train' or 'val'):
* x_images: flattened images (not preprocessed, except for the few grayscale images that have been converted to RGB)
* x_shapes: shapes of the images, to reshape the flattened images
* x_names: file names of the images
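* x_label: a binary integer vector of labels, one entry per category (multi-hot rather than strictly one-hot, since an image can belong to several categories)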
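
The layout above suggests how an image and its labels could be read back. The following is a minimal sketch, not part of this repository: it assumes the file is read with h5py, that the dataset keys match the names listed above (for example `train_images`, `train_shapes`, `train_label`), and that each label vector has one entry per category in the same order as `cats`. The helper names `restore_image`, `decode_labels` and `load_sample` are hypothetical.

```python
import numpy as np


def restore_image(flat_pixels, shape):
    """Reshape a flattened image back to the shape stored in x_shapes."""
    shape = tuple(int(s) for s in shape)
    return np.asarray(flat_pixels).reshape(shape)


def decode_labels(label_vector, cats):
    """Return the category names whose entry in the label vector is nonzero.

    Assumes label_vector and cats are aligned, one entry per category.
    """
    names = []
    for flag, cat in zip(label_vector, cats):
        if flag:
            # Strings read from HDF5 may come back as bytes.
            names.append(cat.decode("utf-8") if isinstance(cat, bytes) else str(cat))
    return names


def load_sample(h5_path, data_type, index):
    """Read one image and its category names from the output file.

    The key names are assumptions taken from the list in this README.
    """
    import h5py  # third-party; imported here so the helpers above work without it

    with h5py.File(h5_path, "r") as f:
        image = restore_image(f[data_type + "_images"][index],
                              f[data_type + "_shapes"][index])
        labels = decode_labels(f[data_type + "_label"][index], f["cats"][:])
    return image, labels
```

With an output file at a hypothetical path such as `ms_coco.h5`, calling `load_sample("ms_coco.h5", "val", 0)` would return the first validation image as an array together with the names of its categories, provided the assumptions above hold.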