https://github.com/bbenligiray/pascal_voc2007_formatter
A tool to download and format PASCAL VOC 2007 dataset for multilabel classification
- Host: GitHub
- URL: https://github.com/bbenligiray/pascal_voc2007_formatter
- Owner: bbenligiray
- Created: 2017-07-14T17:22:31.000Z (almost 9 years ago)
- Default Branch: master
- Last Pushed: 2017-07-17T09:41:09.000Z (almost 9 years ago)
- Last Synced: 2025-05-06T06:46:52.106Z (about 1 year ago)
- Topics: dataset-manager, image-classification, multilabel-classification, pascal-voc
- Language: Python
- Size: 9.77 KB
- Stars: 10
- Watchers: 1
- Forks: 3
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
# PASCAL VOC 2007 Formatter
A tool to download and format the PASCAL VOC 2007 dataset for multilabel classification. It outputs an HDF5 (.h5) file that contains the following:
* data_types: 'train', 'val' and 'test'
* cats: names of the 20 categories

For each data type x (replace x with 'train', 'val' or 'test'):
* x_images: flattened images (h5py allows differently sized elements only if they are 1D)
* x_shapes: shapes of the images, used to reshape the flattened images
* x_names: file names of the images
* x_label: an integer indicator vector of labels, one entry per category (an image can belong to several categories, so more than one entry may be set)
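
Because the images are stored flattened, each one has to be reshaped with its entry in x_shapes before use. The following is a minimal sketch of reading one data type back, assuming the dataset keys follow the naming above (for example `train_images`, `train_shapes`). The function names `restore_image` and `load_split` and the file path passed in are hypothetical and not part of this tool; the axis order of the restored image is whatever the formatter wrote.

```python
import numpy as np


def restore_image(flat, shape):
    """Reshape a flattened 1D image back to the shape stored alongside it."""
    flat = np.asarray(flat)
    shape = tuple(int(d) for d in shape)
    if flat.size != int(np.prod(shape)):
        raise ValueError(f"cannot reshape {flat.size} values into {shape}")
    return flat.reshape(shape)


def load_split(h5_path, split="train"):
    """Yield (name, image, label) for one data type ('train', 'val' or 'test').

    Assumes the keys described in this README: <split>_images,
    <split>_shapes, <split>_names and <split>_label.
    """
    import h5py  # imported here so restore_image is usable without h5py

    with h5py.File(h5_path, "r") as f:
        images = f[f"{split}_images"]
        shapes = f[f"{split}_shapes"]
        names = f[f"{split}_names"]
        labels = f[f"{split}_label"]
        for i in range(len(names)):
            yield names[i], restore_image(images[i], shapes[i]), labels[i]
```

Iterating over `load_split(path, "val")`, where `path` points at the generated .h5 file, would then give one restored image at a time without loading the whole split into memory.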