Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/cvdfoundation/kinetics-dataset


https://github.com/cvdfoundation/kinetics-dataset

Last synced: about 2 months ago
JSON representation

Awesome Lists containing this project

README

        

# Kinetics Datasets Downloader

Kinetics is a collection of large-scale, high-quality datasets of URL links of up to 650,000 video clips that cover 400/600/700 human action classes, depending on the dataset version. The videos include human-object interactions such as playing instruments, as well as human-human interactions such as shaking hands and hugging. Each action class has at least 400/600/700 video clips. Each clip is human annotated with a single action class and lasts around 10 seconds.

The Kinetics project publications can be found here: https://deepmind.com/research/open-source/kinetics.

Note that it may not be safe to train / test across different versions of the dataset, for example the k400 validation set is largely part of the k700 training set so results will not be meaningful.

# Updates

5th of May: fixed k400/train/part_120.tar.gz, it was a tar file before

10th of December: add two downloader scripts for datasets automatic setup. (**k400_downloader.sh** and **k700_2020_downloader.sh**)

13th of September. K400: replaced corrupted mountain climber validation file and made available 1300+ replacement videos for existing corrupted training videos from various classes. K600: added list of links for held out test set.

4th of August 2021 -- replaced corrupted videos in the kinetics-700-2020 test set (these were typically shorter than 9s as well). There are still 5% of the videos in the test set that are shorter than 9s, but genuinely so (e.g. because they are like that in youtube).

# Download Videos

CVDF currently hosts the videos in the Kinetics-400 and Kinetics-700-2020 datasets.

### Kinetics-400

#### Kinetics-400 Download:

##### Clone repo and enter directory
```
git clone https://github.com/cvdfoundation/kinetics-dataset.git
cd kinetics-dataset
```

##### Download tar gzip files
This will create two directories, k400 and k400_targz. Tar gzips will be in k400_targz, you can delete k400_targz directory after extraction.
```
bash ./k400_downloader.sh
```

##### Extract tar gzip files
```
bash ./k400_extractor.sh
```

#### Kinetics-400 Info:
The train/val/test splits are subdivided into many files. The lists of links to video files can be found here:

https://s3.amazonaws.com/kinetics/400/train/k400_train_path.txt

https://s3.amazonaws.com/kinetics/400/val/k400_val_path.txt

https://s3.amazonaws.com/kinetics/400/test/k400_test_path.txt

It is easy to obtain a specific split (e.g. train), by:
```
bash download.sh k400_train_path.txt
```
Then, extract `*.tar.gz` files by:
```
bash extract.sh k400_train_path.txt
```

The links/annotations can be found under the annotation subfolders:

https://s3.amazonaws.com/kinetics/400/annotations/train.csv

https://s3.amazonaws.com/kinetics/400/annotations/val.csv

https://s3.amazonaws.com/kinetics/400/annotations/test.csv

A readme file can be found in:

http://s3.amazonaws.com/kinetics/400/readme.md

News: users found \~1400 corrupted videos. A replacement for the vast majority can be found here:

https://s3.amazonaws.com/kinetics/400/replacement_for_corrupted_k400.tgz

### Kinetics-600

#### Kinetics-600 Download:

##### Clone repo and enter directory
```
git clone https://github.com/cvdfoundation/kinetics-dataset.git
cd kinetics-dataset
```

##### Download tar gzip files
This will create two directories, k600 and k600_targz. Tar gzips will be in k600_targz, you can delete k600_targz directory after extraction.
```
bash ./k600_downloader.sh
```

##### Extract tar gzip files
```
bash ./k600_extractor.sh
```

#### Kinetics-600 Info:
The train/val/test splits are subdivided into many files. The lists of links to video files can be found here:

https://s3.amazonaws.com/kinetics/600/train/k600_train_path.txt

https://s3.amazonaws.com/kinetics/600/val/k600_val_path.txt

https://s3.amazonaws.com/kinetics/600/test/k600_test_path.txt

The links/annotations can be found under the annotation subfolders:

https://s3.amazonaws.com/kinetics/600/annotations/train.txt

https://s3.amazonaws.com/kinetics/600/annotations/train.csv (incomplete)

https://s3.amazonaws.com/kinetics/600/annotations/val.txt

https://s3.amazonaws.com/kinetics/600/annotations/val.csv (incomplete)

https://s3.amazonaws.com/kinetics/600/annotations/test.csv

https://s3.amazonaws.com/kinetics/600/annotations/kinetics600_holdout_test.csv

A readme file can be found in:

http://s3.amazonaws.com/kinetics/600/readme.md

### Kinetics-700-2020

#### Kinetics-700-2020 Download:

##### Clone repo and enter directory
```
git clone https://github.com/cvdfoundation/kinetics-dataset.git
cd kinetics-dataset
```

##### Download tar gzip files
This will create two directories, k700-2020 and k700-2020_targz. Tar gzips will be in k700-2020_targz, you can delete k700-2020_targz directory after extraction.
```
bash ./k700_2020_downloader.sh
```

##### Extract tar gzip files
```
bash ./k700_2020_extractor.sh
```

#### Kinetics-700-2020 Info:
The train/val/test splits are subdivided into many files. The lists of links to video files can be found here:

https://s3.amazonaws.com/kinetics/700_2020/train/k700_2020_train_path.txt

https://s3.amazonaws.com/kinetics/700_2020/val/k700_2020_val_path.txt

https://s3.amazonaws.com/kinetics/700_2020/test/k700_2020_test_path.txt

The links/annotations can be found under the annotation subfolders:

https://s3.amazonaws.com/kinetics/700_2020/annotations/train.csv

https://s3.amazonaws.com/kinetics/700_2020/annotations/val.csv

https://s3.amazonaws.com/kinetics/700_2020/annotations/test.csv

A readme file can be found in:

http://s3.amazonaws.com/kinetics/700_2020/K700_2020_readme.txt

# Downstream annotations

We also host annotations for AVA-Kinetics and Countix, which both use Kinetics-700 videos.

To download annotations for AVA-Kinetics:
https://s3.amazonaws.com/kinetics/700_2020/annotations/ava_kinetics_v1_0.tar.gz

To download annotations for countix:
https://s3.amazonaws.com/kinetics/700_2020/annotations/countix.tar.gz