awesome-video-prediction
A curated list of awesome video prediction papers
https://github.com/dadadadawjb/awesome-video-prediction
Last synced: 2 days ago
JSON representation
-
Papers
- Deep Predictive Coding Networks for Video Prediction and Unsupervised Learning
- Video (language) modeling: a baseline for generative models of natural videos
- Unsupervised Learning of Video Representations using LSTMs
- Convolutional LSTM Network: A Machine Learning Approach for Precipitation Nowcasting
- Unsupervised learning of visual structure using predictive generative networks
- PredRNN: A Recurrent Neural Network for Spatiotemporal Predictive Learning
- PredRNN++: Towards A Resolution of the Deep-in-Time Dilemma in Spatiotemporal Predictive Learning
- Unsupervised Learning for Physical Interaction through Video Prediction
- SDC-Net: Video prediction using spatially-displaced convolution
- Decomposing Motion and Content for Natural Video Sequence Prediction
- Learning to Decompose and Disentangle Representations for Video Prediction - Fei)
- Exploring Spatial-Temporal Multi-Frequency Analysis for High-Fidelity and Temporal-Consistency Video Prediction
- Stochastic Variational Video Prediction
- Stochastic Video Generation with a Learned Prior
- Stochastic Adversarial Video Prediction
- Improved Conditional VRNNs for Video Prediction
- Greedy Hierarchical Variational Autoencoders for Large-Scale Video Prediction - Fei, Chelsea Finn)
- Deep multi-scale video prediction beyond mean square error
- Eidetic 3D LSTM: A Model for Video Prediction and Beyond - Fei)
- SimVP: Simpler yet Better Video Prediction
- Video Diffusion Models
- MCVD: Masked Conditional Video Diffusion for Prediction, Generation, and Interpolation
- Diffusion Probabilistic Modeling for Video Generation
- Flexible Diffusion Modeling of Long Videos
- Scaling Autoregressive Video Models
- Latent Video Transformer
- ConvTransformer: A Convolutional Transformer Network for Video Frame Synthesis
- VideoGPT: Video Generation using VQ-VAE and Transformers
- Video Prediction by Efficient Transformers
- MaskViT: Masked Visual Pre-Training for Video Prediction - Fei Li)
- MAGVIT: Masked Generative Video Transformer
- MOSO: Decomposing MOtion, Scene and Object for Video Prediction
- Learning Object-Centric Transformation for Video Prediction - MM 2017 PKU
-
Surveys