Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/aneeshvartakavi/GNMC
Gracenote Multi-Crop Dataset for Automatic Image Cropping
https://github.com/aneeshvartakavi/GNMC
Last synced: 3 months ago
JSON representation
Gracenote Multi-Crop Dataset for Automatic Image Cropping
- Host: GitHub
- URL: https://github.com/aneeshvartakavi/GNMC
- Owner: aneeshvartakavi
- Created: 2023-01-03T04:01:47.000Z (almost 2 years ago)
- Default Branch: main
- Last Pushed: 2023-01-03T06:32:01.000Z (almost 2 years ago)
- Last Synced: 2024-04-08T04:40:48.628Z (7 months ago)
- Language: Jupyter Notebook
- Homepage:
- Size: 2.05 MB
- Stars: 4
- Watchers: 1
- Forks: 0
- Open Issues: 1
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# GNMC
We present the Gracenote Multi-Crop (GNMC) dataset, to further research in algorithms for aesthetic image cropping. The dataset consists of a diverse collection of 10K images, each cropped in five different aspect ratios by experienced editors. GNMC is larger than existing datasets commonly used to benchmark image cropping approaches such as FCDB (1743 images) and FLMS (500 images). This dataset can enable aesthetic cropping algorithms as described in "[An Experience-Based Direct Generation Approach to Automatic Image Cropping](https://ieeexplore.ieee.org/document/9500226/)" by Christensen and Vartakavi.
![Crop Example](./images/crop_example_multiple.jpeg)
## License
This dataset is distributed under the [polyform non-commercial license](https://polyformproject.org/licenses/noncommercial/1.0.0/).
## Dataset
The dataset is hosted on Zenodo, [download here](https://zenodo.org/record/6228834).
The dataset contains a total of 10k images, split between training(8k), validation (1k) and test (2k) sets. The data for each split is available in the `./json` folder, contains data in the following format:
```
[{'crop_bboxes': {'16:9': [0.01615, 0.06691, 0.98806, 0.78996],
'3:4': [0.37921, 0.01301, 0.93118, 0.98606],
'4:3': [0.01404, 0.0158, 0.98596, 0.97955],
'2:1': [0.0014, 0.1171, 0.99157, 0.77138],
'1:1': [0.24017, 0.01022, 0.98034, 0.98885]},
'filename': '009473.jpeg'},
{'crop_bboxes': {'16:9': [0.00208, 0.0037, 0.99792, 0.99722],
'3:4': [0.0, 0.0, 0.42083, 1.0],
'4:3': [0.0, 0.05093, 0.7125, 1.0],
'2:1': [0.0, 0.11296, 1.0, 1.0],
'1:1': [0.0, 0.01944, 0.5526, 1.0]},
'filename': '008917.jpeg'},
{'crop_bboxes': {'16:9': [0.00125, 0.00222, 0.99812, 1.0],
'3:4': [0.4275, 0.0, 0.84875, 1.0],
'4:3': [0.24875, 0.04667, 0.9625, 1.0],
'2:1': [0.0, 0.07444, 0.99875, 0.96333],
'1:1': [0.34062, 0.0, 0.90188, 1.0]},
'filename': '004070.jpeg'}]
```The images can be found at `.//filename.jpeg`. The bounding boxes for each aspect ratio are stored in normalized format as float values between [0,1]. The example notebook provides details on how to work with these.
## Code
This repository contains a single introduction notebook to illustrate how to work with the dataset.
In the future, this repository may include functions for metrics and other possibly other utilities, trained models, etc.
## Citation
You can cite the dataset like so:
>Vartakavi, Aneesh. (2022). The Gracenote Multi-Crop Dataset (0.0.1) [Data set]. Zenodo. https://doi.org/10.5281/zenodo.6228834
You may also be interested in [our related work](https://ieeexplore.ieee.org/document/9500226/):
>C. L. Christensen and A. Vartakavi, "An Experience-Based Direct Generation Approach to Automatic Image Cropping," in IEEE Access, vol. 9, pp. 107600-107610, 2021, doi: 10.1109/ACCESS.2021.3100816.