# PySlowFast

PySlowFast is an open source video understanding codebase from FAIR that provides state-of-the-art video classification models with efficient training. This repository includes implementations of the following methods:

- [SlowFast Networks for Video Recognition](https://arxiv.org/abs/1812.03982)
- [Non-local Neural Networks](https://arxiv.org/abs/1711.07971)
- [A Multigrid Method for Efficiently Training Video Models](https://arxiv.org/abs/1912.00998)
- [X3D: Progressive Network Expansion for Efficient Video Recognition](https://arxiv.org/abs/2004.04730)
- [Multiscale Vision Transformers](https://arxiv.org/abs/2104.11227)
- [A Large-Scale Study on Unsupervised Spatiotemporal Representation Learning](https://arxiv.org/abs/2104.14558)
- [MViTv2: Improved Multiscale Vision Transformers for Classification and Detection](https://arxiv.org/abs/2112.01526)
- [Masked Feature Prediction for Self-Supervised Visual Pre-Training](https://arxiv.org/abs/2112.09133)
- [Masked Autoencoders As Spatiotemporal Learners](https://arxiv.org/abs/2205.09113)
- [Reversible Vision Transformers](https://openaccess.thecvf.com/content/CVPR2022/papers/Mangalam_Reversible_Vision_Transformers_CVPR_2022_paper.pdf)



## Introduction

The goal of PySlowFast is to provide a high-performance, lightweight PyTorch codebase that offers state-of-the-art video backbones for video understanding research on different tasks (e.g., classification and detection). It is designed to support rapid implementation and evaluation of novel video research ideas. PySlowFast includes implementations of the following backbone network architectures (a toy sketch of the core SlowFast idea follows this list):

- SlowFast
- Slow
- C2D
- I3D
- Non-local Network
- X3D
- MViTv1 and MViTv2
- Rev-ViT and Rev-MViT
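
To make the two-pathway idea concrete, below is a toy PyTorch sketch of a SlowFast-style model: a fast pathway sees the densely sampled clip with few channels, a slow pathway sees a temporally subsampled clip with more channels, and the two are pooled and fused before the classifier. This is an illustrative sketch only, not the repository's implementation, which adds lateral connections between pathways and full ResNet/Transformer backbones.

```python
# Toy sketch of the SlowFast two-pathway idea in plain PyTorch.
# Not the repository's implementation; use the provided configs for real models.
import torch
import torch.nn as nn


class ToySlowFast(nn.Module):
    def __init__(self, num_classes=400, alpha=4):
        super().__init__()
        self.alpha = alpha  # temporal stride between the fast and slow pathways
        # Fast pathway: all frames, few channels.
        self.fast = nn.Sequential(
            nn.Conv3d(3, 8, kernel_size=(5, 7, 7), stride=(1, 2, 2), padding=(2, 3, 3)),
            nn.BatchNorm3d(8),
            nn.ReLU(inplace=True),
        )
        # Slow pathway: subsampled frames, more channels.
        self.slow = nn.Sequential(
            nn.Conv3d(3, 64, kernel_size=(1, 7, 7), stride=(1, 2, 2), padding=(0, 3, 3)),
            nn.BatchNorm3d(64),
            nn.ReLU(inplace=True),
        )
        self.head = nn.Linear(64 + 8, num_classes)

    def forward(self, frames):
        # frames: (batch, 3, T, H, W), a densely sampled clip.
        fast = self.fast(frames)
        slow = self.slow(frames[:, :, :: self.alpha])  # keep every alpha-th frame
        # Global average pool each pathway, then concatenate for classification.
        fast = fast.mean(dim=(2, 3, 4))
        slow = slow.mean(dim=(2, 3, 4))
        return self.head(torch.cat([slow, fast], dim=1))


if __name__ == "__main__":
    clip = torch.randn(2, 3, 32, 224, 224)  # two 32-frame clips
    print(ToySlowFast()(clip).shape)  # torch.Size([2, 400])
```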

## Updates
- We now support [Reversible Vision Transformers](https://openaccess.thecvf.com/content/CVPR2022/papers/Mangalam_Reversible_Vision_Transformers_CVPR_2022_paper.pdf). Both Reversible ViT and Reversible MViT models are released. See [`projects/rev`](./projects/rev/README.md).
- We now support [MAE for Video](https://arxiv.org/abs/2205.09113). See [`projects/mae`](./projects/mae/README.md) for more information.
- We now support [MaskFeat](https://arxiv.org/abs/2112.09133). See [`projects/maskfeat`](./projects/maskfeat/README.md) for more information.
- We now support [MViTv2](https://arxiv.org/abs/2112.01526) in PySlowFast. See [`projects/mvitv2`](./projects/mvitv2/README.md) for more information.
- We now support [A Large-Scale Study on Unsupervised Spatiotemporal Representation Learning](https://arxiv.org/abs/2104.14558). See [`projects/contrastive_ssl`](./projects/contrastive_ssl/README.md) for more information.
- We now support [Multiscale Vision Transformers](https://arxiv.org/abs/2104.11227) on Kinetics and ImageNet. See [`projects/mvit`](./projects/mvit/README.md) for more information.
- We now support [PyTorchVideo](https://github.com/facebookresearch/pytorchvideo) models and datasets. See [`projects/pytorchvideo`](./projects/pytorchvideo/README.md) for more information; a short loading sketch follows this list.
- We now support [X3D Models](https://arxiv.org/abs/2004.04730). See [`projects/x3d`](./projects/x3d/README.md) for more information.
- We now support [Multigrid Training](https://arxiv.org/abs/1912.00998) for efficiently training video models. See [`projects/multigrid`](./projects/multigrid/README.md) for more information.
- PySlowFast is released in conjunction with our [ICCV 2019 Tutorial](https://alexander-kirillov.github.io/tutorials/visual-recognition-iccv19/).
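
As a quick, hedged example of the PyTorchVideo integration mentioned above: a pretrained SlowFast model can be pulled through PyTorchVideo's `torch.hub` entrypoints. The `slowfast_r50` name and the input shapes below follow PyTorchVideo's model zoo documentation and should be checked against the current release.

```python
# Hedged sketch: load a pretrained SlowFast model via PyTorchVideo's torch.hub
# entrypoint. The entrypoint name and clip shapes are taken from PyTorchVideo's
# model zoo docs; verify them against the version you install.
import torch

model = torch.hub.load("facebookresearch/pytorchvideo", model="slowfast_r50", pretrained=True)
model = model.eval()

# SlowFast models take a list of two clips: [slow_pathway, fast_pathway].
slow = torch.randn(1, 3, 8, 256, 256)   # 8 frames for the slow pathway
fast = torch.randn(1, 3, 32, 256, 256)  # 32 frames for the fast pathway
with torch.no_grad():
    logits = model([slow, fast])
print(logits.shape)  # (1, 400): Kinetics-400 class scores
```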

## License

PySlowFast is released under the [Apache 2.0 license](LICENSE).

## Model Zoo and Baselines

We provide a large set of baseline results and trained models available for download in the PySlowFast [Model Zoo](MODEL_ZOO.md).
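
As a hedged sketch for inspecting a PyTorch-format checkpoint downloaded from the Model Zoo (the file name below is a placeholder and the `model_state` key is an assumption about the checkpoint layout, so print the keys of the file you actually download):

```python
# Hedged sketch: peek inside a downloaded checkpoint. The file name is a
# placeholder and "model_state" is an assumed key; inspect checkpoint.keys()
# to see the real layout of your file.
import torch

checkpoint = torch.load("path/to/SLOWFAST_8x8_R50.pyth", map_location="cpu")
print(list(checkpoint.keys()))
state_dict = checkpoint.get("model_state", checkpoint)
print(f"{len(state_dict)} tensors in the checkpoint's state dict")
```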

## Installation

Please find installation instructions for PyTorch and PySlowFast in [INSTALL.md](INSTALL.md). You may follow the instructions in [DATASET.md](slowfast/datasets/DATASET.md) to prepare the datasets.

## Quick Start

Follow the example in [GETTING_STARTED.md](GETTING_STARTED.md) to start training and evaluating video models with PySlowFast.
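
For orientation, below is a minimal sketch of building a model from a YAML config in Python. The module paths and the config file name are assumptions based on the repository layout; GETTING_STARTED.md documents the supported command-line workflow.

```python
# Hedged sketch: build a PySlowFast model from a config file. Module paths
# (slowfast.config.defaults, slowfast.models) and the config path are
# assumptions based on the repository layout.
from slowfast.config.defaults import get_cfg
from slowfast.models import build_model

cfg = get_cfg()
cfg.merge_from_file("configs/Kinetics/SLOWFAST_8x8_R50.yaml")  # example config
cfg.NUM_GPUS = 1  # build_model is expected to place the model on a CUDA device when NUM_GPUS > 0
model = build_model(cfg)
model.eval()
```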

## Visualization Tools

We offer a range of visualization tools for the train/eval/test processes, for model analysis, and for running inference with a trained model.
More information at [Visualization Tools](VISUALIZATION_TOOLS.md).

## Contributors
PySlowFast is written and maintained by [Haoqi Fan](https://haoqifan.github.io/), [Yanghao Li](https://lyttonhao.github.io/), [Bo Xiong](https://www.cs.utexas.edu/~bxiong/), [Wan-Yen Lo](https://www.linkedin.com/in/wanyenlo/), and [Christoph Feichtenhofer](https://feichtenhofer.github.io/).

## Citing PySlowFast
If you find PySlowFast useful in your research, please use the following BibTeX entry for citation.
```BibTeX
@misc{fan2020pyslowfast,
  author =       {Haoqi Fan and Yanghao Li and Bo Xiong and Wan-Yen Lo and
                  Christoph Feichtenhofer},
  title =        {PySlowFast},
  howpublished = {\url{https://github.com/facebookresearch/slowfast}},
  year =         {2020}
}
```