# kinetics-dataset **Repository Path**: fudexiang/kinetics-dataset ## Basic Information - **Project Name**: kinetics-dataset - **Description**: k400数据集 - **Primary Language**: Python - **License**: Not specified - **Default Branch**: main - **Homepage**: None - **GVP Project**: No ## Statistics - **Stars**: 0 - **Forks**: 1 - **Created**: 2025-06-10 - **Last Updated**: 2025-06-10 ## Categories & Tags **Categories**: Uncategorized **Tags**: None ## README # Kinetics Datasets Downloader Kinetics is a collection of large-scale, high-quality datasets of URL links of up to 650,000 video clips that cover 400/600/700 human action classes, depending on the dataset version. The videos include human-object interactions such as playing instruments, as well as human-human interactions such as shaking hands and hugging. Each action class has at least 400/600/700 video clips. Each clip is human annotated with a single action class and lasts around 10 seconds. The Kinetics project publications can be found here: https://deepmind.com/research/open-source/kinetics. Note that it may not be safe to train / test across different versions of the dataset, for example the k400 validation set is largely part of the k700 training set so results will not be meaningful. # Updates 5th of May: fixed k400/train/part_120.tar.gz, it was a tar file before 10th of December: add two downloader scripts for datasets automatic setup. (**k400_downloader.sh** and **k700_2020_downloader.sh**) 13th of September. K400: replaced corrupted mountain climber validation file and made available 1300+ replacement videos for existing corrupted training videos from various classes. K600: added list of links for held out test set. 4th of August 2021 -- replaced corrupted videos in the kinetics-700-2020 test set (these were typically shorter than 9s as well). There are still 5% of the videos in the test set that are shorter than 9s, but genuinely so (e.g. because they are like that in youtube). # Download Videos CVDF currently hosts the videos in the Kinetics-400 and Kinetics-700-2020 datasets. ### Kinetics-400 #### Kinetics-400 Download: ##### Clone repo and enter directory ``` git clone https://github.com/cvdfoundation/kinetics-dataset.git cd kinetics-dataset ``` ##### Download tar gzip files This will create two directories, k400 and k400_targz. Tar gzips will be in k400_targz, you can delete k400_targz directory after extraction. ``` bash ./k400_downloader.sh ``` ##### Extract tar gzip files ``` bash ./k400_extractor.sh ``` #### Kinetics-400 Info: The train/val/test splits are subdivided into many files. The lists of links to video files can be found here: https://s3.amazonaws.com/kinetics/400/train/k400_train_path.txt https://s3.amazonaws.com/kinetics/400/val/k400_val_path.txt https://s3.amazonaws.com/kinetics/400/test/k400_test_path.txt It is easy to obtain a specific split (e.g. train), by: ``` bash download.sh k400_train_path.txt ``` Then, extract `*.tar.gz` files by: ``` bash extract.sh k400_train_path.txt ``` The links/annotations can be found under the annotation subfolders: https://s3.amazonaws.com/kinetics/400/annotations/train.csv https://s3.amazonaws.com/kinetics/400/annotations/val.csv https://s3.amazonaws.com/kinetics/400/annotations/test.csv A readme file can be found in: http://s3.amazonaws.com/kinetics/400/readme.md News: users found \~1400 corrupted videos. A replacement for the vast majority can be found here: https://s3.amazonaws.com/kinetics/400/replacement_for_corrupted_k400.tgz ### Kinetics-600 #### Kinetics-600 Download: ##### Clone repo and enter directory ``` git clone https://github.com/cvdfoundation/kinetics-dataset.git cd kinetics-dataset ``` ##### Download tar gzip files This will create two directories, k600 and k600_targz. Tar gzips will be in k600_targz, you can delete k600_targz directory after extraction. ``` bash ./k600_downloader.sh ``` ##### Extract tar gzip files ``` bash ./k600_extractor.sh ``` #### Kinetics-600 Info: The train/val/test splits are subdivided into many files. The lists of links to video files can be found here: https://s3.amazonaws.com/kinetics/600/train/k600_train_path.txt https://s3.amazonaws.com/kinetics/600/val/k600_val_path.txt https://s3.amazonaws.com/kinetics/600/test/k600_test_path.txt The links/annotations can be found under the annotation subfolders: https://s3.amazonaws.com/kinetics/600/annotations/train.txt https://s3.amazonaws.com/kinetics/600/annotations/train.csv (incomplete) https://s3.amazonaws.com/kinetics/600/annotations/val.txt https://s3.amazonaws.com/kinetics/600/annotations/val.csv (incomplete) https://s3.amazonaws.com/kinetics/600/annotations/test.csv https://s3.amazonaws.com/kinetics/600/annotations/kinetics600_holdout_test.csv A readme file can be found in: http://s3.amazonaws.com/kinetics/600/readme.md ### Kinetics-700-2020 #### Kinetics-700-2020 Download: ##### Clone repo and enter directory ``` git clone https://github.com/cvdfoundation/kinetics-dataset.git cd kinetics-dataset ``` ##### Download tar gzip files This will create two directories, k700-2020 and k700-2020_targz. Tar gzips will be in k700-2020_targz, you can delete k700-2020_targz directory after extraction. ``` bash ./k700_2020_downloader.sh ``` ##### Extract tar gzip files ``` bash ./k700_2020_extractor.sh ``` #### Kinetics-700-2020 Info: The train/val/test splits are subdivided into many files. The lists of links to video files can be found here: https://s3.amazonaws.com/kinetics/700_2020/train/k700_2020_train_path.txt https://s3.amazonaws.com/kinetics/700_2020/val/k700_2020_val_path.txt https://s3.amazonaws.com/kinetics/700_2020/test/k700_2020_test_path.txt The links/annotations can be found under the annotation subfolders: https://s3.amazonaws.com/kinetics/700_2020/annotations/train.csv https://s3.amazonaws.com/kinetics/700_2020/annotations/val.csv https://s3.amazonaws.com/kinetics/700_2020/annotations/test.csv A readme file can be found in: http://s3.amazonaws.com/kinetics/700_2020/K700_2020_readme.txt # Downstream annotations We also host annotations for AVA-Kinetics and Countix, which both use Kinetics-700 videos. To download annotations for AVA-Kinetics: https://s3.amazonaws.com/kinetics/700_2020/annotations/ava_kinetics_v1_0.tar.gz To download annotations for countix: https://s3.amazonaws.com/kinetics/700_2020/annotations/countix.tar.gz