https://github.com/ndrplz/cifar-10

Python plug-and-play wrapper to CIFAR-10 dataset.

cifar cifar-10 cifar10 computer-vision convolutional-neural-networks dataset deep-learning machine-learning python36 wrapper

Host: GitHub
URL: https://github.com/ndrplz/cifar-10
Owner: ndrplz
License: mit
Created: 2018-10-26T21:02:58.000Z (over 7 years ago)
Default Branch: master
Last Pushed: 2018-10-28T19:31:34.000Z (over 7 years ago)
Last Synced: 2025-04-05T14:51:16.615Z (over 1 year ago)
Topics: cifar, cifar-10, cifar10, computer-vision, convolutional-neural-networks, dataset, deep-learning, machine-learning, python36, wrapper
Language: Python
Homepage:
Size: 3.46 MB
Stars: 10
Watchers: 3
Forks: 3
Open Issues: 2
Metadata Files:
- Readme: README.md
- License: LICENSE

# CIFAR-10

## Usage

As simple as:

```python
from cifar import CIFAR10

# Instantiate the dataset. If the dataset is not found in `dataset_root`,
#  it is automatically downloaded and extracted there on first use.
dataset = CIFAR10(dataset_root='./cifar10')
```

and you're done.
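
For the curious, the "download it only if it's missing" behaviour can be sketched with `pathlib` roughly as follows. This is a minimal sketch, not the wrapper's actual code: the function name `ensure_dataset`, the archive file name, the "non-empty directory means already downloaded" check and the choice of the official Python-version archive URL are all assumptions made for illustration.

```python
import tarfile
import urllib.request
from pathlib import Path

# Assumption: the official "python version" archive from the CIFAR-10 page.
# The wrapper itself may fetch the data from somewhere else.
CIFAR10_URL = "https://www.cs.toronto.edu/~kriz/cifar-10-python.tar.gz"


def ensure_dataset(dataset_root, url=CIFAR10_URL,
                   download=urllib.request.urlretrieve):
    """Return the dataset directory, downloading and extracting only if needed.

    Hypothetical helper: `download` is any callable taking (url, destination),
    which makes the function easy to try out without network access.
    """
    root = Path(dataset_root)

    # Assumption: a non-empty directory means the dataset is already there.
    if root.is_dir() and any(root.iterdir()):
        return root

    root.mkdir(parents=True, exist_ok=True)
    archive = root / "cifar-10.tar.gz"  # hypothetical temporary file name

    download(url, str(archive))          # fetch the archive
    with tarfile.open(archive) as tar:   # extract it next to the archive
        tar.extractall(root)
    archive.unlink()                     # the archive is no longer needed

    return root
```

Calling `ensure_dataset('./cifar10')` twice should hit the network only the first time; the second call returns as soon as it sees the non-empty directory.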

## Why this wrapper?

The [CIFAR-10 dataset](https://www.cs.toronto.edu/~kriz/cifar.html) consists of 60000 32x32 colour images in 10 classes, with 6000 images per class. It's kind of famous in the computer vision community and is often used as a (toy) benchmark. It's a nice dataset to play with. It's a bit like [MNIST](http://yann.lecun.com/exdb/mnist/), but there are cats and dogs and frogs! And there are colors too!

Despite its fame, I did not find any easy plug-and-play wrapper around it. Of course, there are wrappers to CIFAR-10 in most deep learning frameworks ([TensorFlow](https://github.com/tensorflow/models/tree/master/tutorials/image/cifar10), [PyTorch](https://pytorch.org/docs/stable/torchvision/datasets.html)), but, you know, I usually don't want to pull in a whole deep learning framework just to play with 32x32 cat images. So that's why. And yes, I also wanted to have some fun learning [`pathlib`](https://docs.python.org/3/library/pathlib.html).

#### [video] What does the dataset look like, by the way?

## Installation

* *No installation required.* You can just clone / download / copy-paste this repository.

* I'm wondering if adding it to PyPI might be useful...

## Requirements

* Python >= 3.6

## Hello World

```python
from cifar import CIFAR10

# Instantiate the dataset. If the dataset is not found in `dataset_root`,
#  it is automatically downloaded and extracted there on first use.
dataset = CIFAR10(dataset_root='./cifar10')

# That's it. Now all examples are in the `dataset.samples` dictionary. There
#  are 50000 train examples and 10000 test examples.
print(dataset.samples['train'].shape)           # (50000,)
print(dataset.samples['test'].shape)            # (10000,)

# Each example consists of a 32x32 RGB image and its
#  corresponding label, both numeric and human readable.
print(dataset.samples['train'][0].image.shape)  # (32, 32, 3)
print(dataset.samples['train'][0].label)        # 6
print(dataset.samples['train'][0].label_hr)     # frog
print(dataset.samples['train'][0].filename)     # leptodactylus_pentadactylus_s_000004.png

# You can also directly print the example
print(dataset.samples['train'][0])              # [frog] - leptodactylus_pentadactylus_s_000004.png

# You can convert the CIFARSamples to ndarray. Images can optionally be flattened
#  and/or normalized to be centered on zero (i.e. in range [-0.5, 0.5])
x_train, y_train = CIFAR10.to_ndarray(dataset.samples['train'], normalize=True, flatten=True)
x_test, y_test = CIFAR10.to_ndarray(dataset.samples['test'], normalize=True, flatten=True)
print(x_train.shape, y_train.shape)             # (50000, 3072)  (50000,)
print(x_test.shape, y_test.shape)               # (10000, 3072)  (10000,)
```
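
If you wonder what `normalize=True` and `flatten=True` amount to, here is a rough NumPy sketch of the idea. It is an illustration under assumptions, not the code behind `CIFAR10.to_ndarray`: the function name `to_ndarray_sketch` is hypothetical, the images are assumed to be `uint8` arrays in [0, 255], and random arrays stand in for real CIFAR-10 samples.

```python
import numpy as np


def to_ndarray_sketch(images, labels, normalize=True, flatten=True):
    """Hypothetical stand-in showing one way to build (x, y) arrays."""
    x = np.stack(images).astype(np.float32)   # (N, 32, 32, 3)
    y = np.asarray(labels, dtype=np.int64)    # (N,)

    if normalize:
        # Assumption: uint8 pixels in [0, 255] are mapped to [-0.5, 0.5].
        x = x / 255.0 - 0.5

    if flatten:
        # Each 32x32x3 image becomes a vector of 32 * 32 * 3 = 3072 values.
        x = x.reshape(len(x), -1)

    return x, y


# Random images standing in for real samples (no download needed).
rng = np.random.default_rng(0)
fake_images = [rng.integers(0, 256, size=(32, 32, 3), dtype=np.uint8)
               for _ in range(4)]
fake_labels = [6, 9, 9, 4]

x, y = to_ndarray_sketch(fake_images, fake_labels)
print(x.shape, y.shape)                  # (4, 3072) (4,)
print(x.min() >= -0.5, x.max() <= 0.5)   # True True
```

The flattened shape explains the `3072` in the output above: it is simply 32 * 32 * 3.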

## Issues

Please feel free to open an issue if something doesn't look quite right!
