Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/cvdfoundation/kinetics-dataset
https://github.com/cvdfoundation/kinetics-dataset
Last synced: 11 days ago
JSON representation
- Host: GitHub
- URL: https://github.com/cvdfoundation/kinetics-dataset
- Owner: cvdfoundation
- Created: 2020-12-03T16:51:46.000Z (almost 4 years ago)
- Default Branch: main
- Last Pushed: 2024-05-15T05:23:03.000Z (6 months ago)
- Last Synced: 2024-08-01T10:21:10.359Z (3 months ago)
- Language: Shell
- Size: 41 KB
- Stars: 725
- Watchers: 12
- Forks: 94
- Open Issues: 29
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
- Awesome-Video-LLMs - link
README
# Kinetics Datasets Downloader
Kinetics is a collection of large-scale, high-quality datasets of URL links of up to 650,000 video clips that cover 400/600/700 human action classes, depending on the dataset version. The videos include human-object interactions such as playing instruments, as well as human-human interactions such as shaking hands and hugging. Each action class has at least 400/600/700 video clips. Each clip is human annotated with a single action class and lasts around 10 seconds.
The Kinetics project publications can be found here: https://deepmind.com/research/open-source/kinetics.
Note that it may not be safe to train / test across different versions of the dataset, for example the k400 validation set is largely part of the k700 training set so results will not be meaningful.
# Updates
5th of May: fixed k400/train/part_120.tar.gz, it was a tar file before
10th of December: add two downloader scripts for datasets automatic setup. (**k400_downloader.sh** and **k700_2020_downloader.sh**)
13th of September. K400: replaced corrupted mountain climber validation file and made available 1300+ replacement videos for existing corrupted training videos from various classes. K600: added list of links for held out test set.
4th of August 2021 -- replaced corrupted videos in the kinetics-700-2020 test set (these were typically shorter than 9s as well). There are still 5% of the videos in the test set that are shorter than 9s, but genuinely so (e.g. because they are like that in youtube).
# Download Videos
CVDF currently hosts the videos in the Kinetics-400 and Kinetics-700-2020 datasets.
### Kinetics-400
#### Kinetics-400 Download:
##### Clone repo and enter directory
```
git clone https://github.com/cvdfoundation/kinetics-dataset.git
cd kinetics-dataset
```##### Download tar gzip files
This will create two directories, k400 and k400_targz. Tar gzips will be in k400_targz, you can delete k400_targz directory after extraction.
```
bash ./k400_downloader.sh
```##### Extract tar gzip files
```
bash ./k400_extractor.sh
```#### Kinetics-400 Info:
The train/val/test splits are subdivided into many files. The lists of links to video files can be found here:https://s3.amazonaws.com/kinetics/400/train/k400_train_path.txt
https://s3.amazonaws.com/kinetics/400/val/k400_val_path.txt
https://s3.amazonaws.com/kinetics/400/test/k400_test_path.txt
It is easy to obtain a specific split (e.g. train), by:
```
bash download.sh k400_train_path.txt
```
Then, extract `*.tar.gz` files by:
```
bash extract.sh k400_train_path.txt
```The links/annotations can be found under the annotation subfolders:
https://s3.amazonaws.com/kinetics/400/annotations/train.csv
https://s3.amazonaws.com/kinetics/400/annotations/val.csv
https://s3.amazonaws.com/kinetics/400/annotations/test.csv
A readme file can be found in:
http://s3.amazonaws.com/kinetics/400/readme.md
News: users found \~1400 corrupted videos. A replacement for the vast majority can be found here:
https://s3.amazonaws.com/kinetics/400/replacement_for_corrupted_k400.tgz
### Kinetics-600
#### Kinetics-600 Download:
##### Clone repo and enter directory
```
git clone https://github.com/cvdfoundation/kinetics-dataset.git
cd kinetics-dataset
```##### Download tar gzip files
This will create two directories, k600 and k600_targz. Tar gzips will be in k600_targz, you can delete k600_targz directory after extraction.
```
bash ./k600_downloader.sh
```##### Extract tar gzip files
```
bash ./k600_extractor.sh
```#### Kinetics-600 Info:
The train/val/test splits are subdivided into many files. The lists of links to video files can be found here:https://s3.amazonaws.com/kinetics/600/train/k600_train_path.txt
https://s3.amazonaws.com/kinetics/600/val/k600_val_path.txt
https://s3.amazonaws.com/kinetics/600/test/k600_test_path.txt
The links/annotations can be found under the annotation subfolders:
https://s3.amazonaws.com/kinetics/600/annotations/train.txt
https://s3.amazonaws.com/kinetics/600/annotations/train.csv (incomplete)
https://s3.amazonaws.com/kinetics/600/annotations/val.txt
https://s3.amazonaws.com/kinetics/600/annotations/val.csv (incomplete)
https://s3.amazonaws.com/kinetics/600/annotations/test.csv
https://s3.amazonaws.com/kinetics/600/annotations/kinetics600_holdout_test.csv
A readme file can be found in:
http://s3.amazonaws.com/kinetics/600/readme.md
### Kinetics-700-2020
#### Kinetics-700-2020 Download:
##### Clone repo and enter directory
```
git clone https://github.com/cvdfoundation/kinetics-dataset.git
cd kinetics-dataset
```##### Download tar gzip files
This will create two directories, k700-2020 and k700-2020_targz. Tar gzips will be in k700-2020_targz, you can delete k700-2020_targz directory after extraction.
```
bash ./k700_2020_downloader.sh
```##### Extract tar gzip files
```
bash ./k700_2020_extractor.sh
```#### Kinetics-700-2020 Info:
The train/val/test splits are subdivided into many files. The lists of links to video files can be found here:https://s3.amazonaws.com/kinetics/700_2020/train/k700_2020_train_path.txt
https://s3.amazonaws.com/kinetics/700_2020/val/k700_2020_val_path.txt
https://s3.amazonaws.com/kinetics/700_2020/test/k700_2020_test_path.txt
The links/annotations can be found under the annotation subfolders:
https://s3.amazonaws.com/kinetics/700_2020/annotations/train.csv
https://s3.amazonaws.com/kinetics/700_2020/annotations/val.csv
https://s3.amazonaws.com/kinetics/700_2020/annotations/test.csv
A readme file can be found in:
http://s3.amazonaws.com/kinetics/700_2020/K700_2020_readme.txt
# Downstream annotations
We also host annotations for AVA-Kinetics and Countix, which both use Kinetics-700 videos.
To download annotations for AVA-Kinetics:
https://s3.amazonaws.com/kinetics/700_2020/annotations/ava_kinetics_v1_0.tar.gzTo download annotations for countix:
https://s3.amazonaws.com/kinetics/700_2020/annotations/countix.tar.gz