Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/waikato-ufdl/ufdl-datasets
Nikola website making the UFDL datasets publicly available.
https://github.com/waikato-ufdl/ufdl-datasets
datasets deep-learning
Last synced: about 1 month ago
JSON representation
Nikola website making the UFDL datasets publicly available.
- Host: GitHub
- URL: https://github.com/waikato-ufdl/ufdl-datasets
- Owner: waikato-ufdl
- License: cc-by-sa-4.0
- Created: 2022-01-11T00:49:25.000Z (almost 3 years ago)
- Default Branch: main
- Last Pushed: 2023-02-17T04:21:08.000Z (almost 2 years ago)
- Last Synced: 2023-03-04T15:28:01.861Z (almost 2 years ago)
- Topics: datasets, deep-learning
- Language: Python
- Homepage: https://datasets.cms.waikato.ac.nz/ufdl/
- Size: 1.51 MB
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE.txt
Awesome Lists containing this project
README
# ufdl-datasets
Nikola website making the UFDL datasets publicly available.The generated site is available here:
https://datasets.cms.waikato.ac.nz/ufdl/
## Installation
For developing the site locally:
* create virtual environment
```
virtualenv -p /usr/bin/python3 venv
```* install Nikola
```
./venv/bin/pip install nikola aiohttp watchdog
```## Adding content
* [Nikola handbook](https://getnikola.com/handbook.html)
* Content is written in [reStructured Text](http://docutils.sourceforge.net/rst.html)
* Pages are located in `pages`
* News items (aka blog posts) are located in `posts`; should have a date prefix in the name### How to add/list datasets
* define an *ID* for the dataset, e.g., `dataset1`
* Dataset server
* log into `datasets.cms.waikato.ac.nz`
* change into `/var/www/html/ufdl/data/`
* create a sub-directory according to the *ID*
* upload relevant data archives* Nikola
* `files/conversion`
* create a sub-directory according to the *ID*
* upload any data/script/etc that is required for converting the original data
* `images`
* add sample image to be used as thumbnail on image page (use *ID*.jpg/png for better identification)
* `pages`
* add page for the dataset with *ID*.rst as name
* use dataset *ID* for the page *slug* as well
* Update the relevant pages (update timestamp as well!):
* `index.rst` - contains links to domain pages that list/link the actual datasets
* `image-classification.rst` - page for listing image classification datasets
* `image-segmentation.rst` - page for listing image segmentation datasets
* `object-detection.rst` - page for listing object detection datasets
* `speech.rst` - page for listing speech datasets
* When a new *domain* page is required
* create page `DOMAIN.rst` in `pages`
* update `NAVIGATION_LINKS` in `conf.py` to include the new domain page
* Add a news item in the `posts` directory, linking to the new dataset
* Deploy the site (see below)### How to announce datasets
* Add a news item in the `posts` directory (`YYYY-MM-DD-title-of-post.rst`)
* Deploy the site## Serving
The following command not only serves the website on `localhost:8000`, but also detects changes
in files and automatically rebuilds the website:```
./venv/bin/nikola auto
```## Deploying
Use the following command to build and deploy the website:
```
rm -R output && ./venv/bin/nikola clean && ./venv/bin/nikola build && ./venv/bin/nikola deploy
```