Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/cure-lab/Awesome-time-series
A comprehensive survey on the time series domains
https://github.com/cure-lab/Awesome-time-series
List: Awesome-time-series
anomaly-detection time-series time-series-analysis time-series-classification time-series-forecasting time-series-prediction
Last synced: 2 months ago
JSON representation
A comprehensive survey on the time series domains
- Host: GitHub
- URL: https://github.com/cure-lab/Awesome-time-series
- Owner: cure-lab
- Created: 2022-03-18T11:45:25.000Z (almost 3 years ago)
- Default Branch: main
- Last Pushed: 2024-03-22T13:58:47.000Z (10 months ago)
- Last Synced: 2024-05-23T06:02:33.728Z (8 months ago)
- Topics: anomaly-detection, time-series, time-series-analysis, time-series-classification, time-series-forecasting, time-series-prediction
- Homepage:
- Size: 953 KB
- Stars: 473
- Watchers: 21
- Forks: 40
- Open Issues: 7
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
- awesome-machine-learning-resources - **[List - lab/Awesome-time-series?style=social) (Table of Contents)
- ultimate-awesome - Awesome-time-series - A comprehensive survey on the time series domains. (Other Lists / Monkey C Lists)
README
# Awesome Time Series
- [📝 Time Series Papers](#-time-series-papers)
- [📝 Time Series Libraries](#-time-series-libraries)
- [📝 Time Series Benchmarks and Datasets](#-time-series-benchmarks-and-datasets)# 📝 Time Series Papers
A comprehensive survey on the time series papers from 2018-2022 (we will update it in time ASAP!) on the top conferences (NeurIPS, ICML, ICLR, SIGKDD, SIGIR, AAAI, IJCAI, WWW, CIKM, ICDM, WSDM, etc.)We divided these papers into several fundamental tasks as follows.
- [📝 Time Series Papers](#-time-series-paper)
- [Survey](#survey)
- [Time Series Forecasting](#time-series-forecasting)
- [Time Series Classification ](#time-series-classification)
- [Anomaly Detection ](#anomaly-detection)
- [Time series Clustering ](#time-series-clustering)
- [Time Series Segmentation](#time-series-segmentation)
- [Others ](#others)## Features
- **Up-to-date** papers
- Summarize the **contributions** in papers
- Present the **datasets** used in papers## Update
- [2022-05-31] Add papers published in ICML 2022
- [2022-05-31] Add papers published in NeurIPS, ICML, ICLR, SIGKDD, SIGIR, AAAI, IJCAI 2019!
- [2022-05-05] Add papers published in WWW 2022!
- [2022-04-25] **TS-Paper v1.0 is released!** We support the published time series papers from 2020 to 2022. Stay tuned!## TODO
- [ ] Add papers published in 2018. (v3.0)## Survey
| Paper | Conference | Year | Code | Key Contribution|
| :--------------------------: | :-------------------: | :------------------: | ----------------------- |------ |
|[Transformers in Time Series: A Survey](https://arxiv.org/pdf/2202.07125.pdf)| - | 2022 | [link](https://github.com/qingsongedu/time-series-transformers-review) |1. This work summarizes the network structures from adaptations and modification. 2. This work categorizes these methods into three tasks, e.g., forecasting, anomaly detection, and classification.
|[Time series data augmentation for deep learning: a survey](https://arxiv.org/pdf/2002.12478.pdf)| IJCAI | 2021 |-| 1. This work systematically reviews and empirically compares different data augmentation methods for time series. 2. They discuss and highlight five future directions to provide useful research guidance.
|[Neural temporal point processes: a review](https://arxiv.org/pdf/2104.03528v5.pdf)| IJCAI | 2021 | - | 1. They focus on important design choices and general principles for defining neural TPP models. 2. They provide an overview of common application areas. 3. They conclude many open challenges and important directions.
|[Time-series forecasting with deep learning: a survey](https://arxiv.org/pdf/2004.13408.pdf)| Philosophical Transactions of the Royal Society A | 2021 | - |1. They survey common encoder and decoder designs used in both one-step-ahead and multi-horizon time series forecasting– describing how temporal information is incorporated into predictions by each model. 2. They highlight recent developments in hybrid deep learning models, which combine well-studied statistical models with neural network components to improve pure methods in either category. 3. They outline some ways in which deep learning can also facilitate decision support with time series data.
|[Deep learning for time series forecasting: a survey](https://www.liebertpub.com/doi/10.1089/big.2020.0159)| Big Data | 2021 | - | 1. They formulate the time series forecasting problem along with its mathematical fundamentals. 2. They discuss the advantages and limitations in the feed forward networks, recurrent neural networks (including Elman, long-short term memory, gated recurrent units, and bidirectional networks), and convolutional neural networks.
|[DL-Traff: Survey and Benchmark of Deep Learning Models for Urban Traffic Prediction](https://arxiv.org/pdf/2108.09091.pdf)| CIKM | 2021 | [graph-data](https://github.com/deepkashiwa20/DL-Traff-Graph), [grid-data](https://github.com/deepkashiwa20/DL-Traff-Grid) |They synthetically review the deep traffic models and the widely used datasets, then build a standard benchmark to comprehensively evaluate their performances with the same settings and metrics.
|[Graph Neural Network for Traffic Forecasting: A Survey](https://arxiv.org/pdf/2101.11174v3.pdf)| - | 2021 | - |The first comprehensive survey that explores the application of graph neural networks for traffic forecasting problems (e.g. road traffic flow and speed forecasting, passenger flow forecasting in urban rail transit systems, and demand forecasting in ride-hailing platforms).
|[Deep learning for anomaly detection in time-series data: review, analysis, and guidelines](https://ieeexplore.ieee.org/abstract/document/9523565)| Access | 2021 | - | 1. This review provides a background on anomaly detection in time-series data and reviews the latest applications in the real world. 2. They comparatively analyze state-of-the-art deep-anomaly-detection models with several benchmark datasets. 3. They offer guidelines for appropriate model selection and training strategy for deep learning-based time series anomaly detection.
|[A review on outlier/anomaly detection in time series data](https://arxiv.org/pdf/2002.04236.pdf)| ACM Computing Surveys | 2021 | - |1. This review provides a structured and comprehensive state-of-the-art on outlier detection techniques in the context of time series. 2. a taxonomy is presented based on the main aspects that characterize an outlier detection technique.
|[A unifying review of deep and shallow anomaly detection](http://128.84.4.34/pdf/2009.11732)| Proceedings of the IEEE | 2021 | - |1. This work draws connections between classic ‘shallow’ and novel deep approaches and show how this relation might cross-fertilize or extend both directions. 2. They outline some critical open challenges.
|[Big Data for Traffic Estimation and Prediction: A Survey of Data and Tools](https://www.mdpi.com/2571-5577/5/1/23)| Applied System Innovation 5 | 2021 | - |This study presents an up-to-date survey of open data and big data tools used for traffic estimation and prediction.
|[Fusion in stock market prediction: A decade survey on the necessity, recent developments, and potential future directions](https://www.sciencedirect.com/science/article/pii/S1566253520303481)| Information Fusion | 2021 | - | 1. Survey information, feature, model fusion from 2011–2020. 2. Discuss their limitations and future directions are explored for various stock applications.
|[Applications of deep learning in stock market prediction: Recent progress](https://www.sciencedirect.com/science/article/pii/S0957417421009441)| ESA | 2021 | - | 1. Give a latest review of recent works on deep learning models for stock market prediction. 2. Discuss data sources, models, metrics and the implementation and reproducibility.
|[Deep Learning for Spatio-Temporal Data Mining: A Survey](https://ieeexplore.ieee.org/abstract/document/9204396)| KDD | 2020 | - | 1. A review of recent progress in applying deep learning techniques for STDM. 2. They classify existing literature based on the types of spatio-temporal data, the data mining tasks, and the deep learning models.
|[Urban flow prediction from spatiotemporal data using machine learning: A survey ](https://www.sciencedirect.com/science/article/abs/pii/S1566253519303094)| Information Fusion | 2020 | - | 1. Urban flow prediction from spatiotemporal data. 2. methods based on machine learning. 3. The difficulties and some ideas.
|[An empirical survey of data augmentation for time series classification with neural networks](https://arxiv.org/pdf/2007.15951.pdf)| - | 2020 | [link](https://github.com/uchidalab/time_series_augmentation) |1. a taxonomy and outline the four families in time series data augmentation, including transformation-based methods, pattern mixing, generative models, and decomposition methods. 2. empirically evaluate 12 time series data augmentation methods on 128 time series classification datasets with six different types of neural networks. 3. analyze the characteristics, advantages and disadvantages, and recommendations of each data augmentation method.
|[Deep Learning on Traffic Prediction: Methods, Analysis and Future Directions](https://arxiv.org/pdf/2004.08555.pdf)| - | 2020 | - | 1. They summarize the existing traffic prediction methods, widely used public datasets, give an evaluation and analysis by conducting extensive experiments to compare the performance of different methods on a real-world public dataset.
|[Neural forecasting: Introduction and literature overview](https://arxiv.org/pdf/2004.10240.pdf)| - | 2020 | - | An introduction and an overview of some of the advances of neural networks in machine learning.
|[Financial time series forecasting with deep learning : A systematic literature review: 2005–2019](https://arxiv.org/pdf/1911.13288.pdf)| ASC | 2019 | - | 1. categorized the studies according to the intended forecasting implementation areas, such as index, forex, commodity forecasting. 2. grouped them based on DL model choices, such as CNNs, Deep Belief Networks (DBNs), Long-Short Term Memory (LSTM).
|[Deep learning for time series classification: a review](https://arxiv.org/pdf/1809.04356.pdf)| Data Mining and Knowledge Discovery | 2019 | [link](https://github.com/hfawaz/dl-4-tsc) | They implemented existing approaches by training 8,730 deep learning models on 97 time series datasets.
|[Financial time series forecasting with deep learning : A systematic literature review: 2005–2019](https://arxiv.org/pdf/1911.13288.pdf)| ASC | 2019 | - | 1. They categorized the studies according to their intended forecasting implementation areas, such as index, forex, commodity forecasting. 2. They grouped DL model, such as Convolutional Neural Networks (CNNs), Deep Belief Networks (DBNs), Long-Short Term Memory (LSTM).
|[Natural language based financial forecasting: a survey](https://dspace.mit.edu/bitstream/handle/1721.1/116314/10462_2017_9588_ReferencePDF.pdf?sequence=2&isAllowed=y)| Artificial Intelligence Review | 2018 | - | They show scopes, progress and hotspots in natural language based financial forecasting (NLFF).## Time Series Forecasting
| Paper | Conference | Year | Code | Used Datasets |Key Contribution|
| :-------------------: | :----------: | :----------: | :------------------------: | ----------------------- |------ |
|[FEDformer: Frequency Enhanced Decomposed Transformer for Long-term Series Forecasting](https://arxiv.org/abs/2201.12740)| ICML | 2022 | [code](https://github.com/MAZiqing/FEDformer) | ETT, Electricity, Exchange, Weather, ILI | We propose to combine Transformer with the seasonal-trend decomposition method, in which the decomposition method captures the global profile of time series while Transformers capture more detailed structures. The proposed method, termed as Frequency Enhanced Decomposed Transformer (FEDformer), is more efficient than standard Transformer with a linear complexity to the sequence length. |
|[TACTiS: Transformer-Attentional Copulas for Time Series](https://arxiv.org/abs/2202.03528)| ICML | 2022 | [code](https://github.com/servicenow/tactis) | electricity, fred-md, kdd-cup, solar-10min, traffic | We propose a versatile method, based on the transformer architecture, that estimates joint distributions using an attentionbased decoder that provably learns to mimic the properties of non-parametric copulas.|
|[Domain Adaptation for Time Series Forecasting via Attention Sharing](https://arxiv.org/abs/2102.06828)| ICML | 2022 | [code](https://github.com/DMIRLAB-Group/SASA) | UCI, Wiki | we propose a novel domain adaptation framework, Domain Adaptation Forecaster (DAF). DAF leverages statistical strengths from a relevant domain with abundant data samples (source) to improve the performance on the domain of interest with limited data (target).|
|[Volatility Based Kernels and Moving Average Means for Accurate Forecasting with Gaussian Processes](https://proceedings.mlr.press/v162/benton22a/benton22a.pdf)| ICML | 2022 | [code](https://github.com/g-benton/Volt) | we take inspiration from well studied domains to introduce a new class of models, Volt and Magpie, that significantly outperform baselines in stock and wind speed forecasting, and naturally extend to the multitask setting. |
|[DSTAGNN: Dynamic Spatial-Temporal Aware Graph Neural Network for Traffic Flow Forecasting](https://proceedings.mlr.press/v162/lan22a/lan22a.pdf)| ICML | 2022 | [code](https://github.com/SYLan2019/DSTAGNN) | PEMS | This paper proposes a novel Dynamic Spatial-Temporal Aware Graph Neural Network (DSTAGNN) to model the complex spatial-temporal interaction in road network. | Stock Prices, Wind Speeds |
| [Multi-Granularity Residual Learning with Confidence Estimation for Time Series Prediction](https://dl.acm.org/doi/pdf/10.1145/3485447.3512056) | WWW | 2022 | [Code]() | [Electricity](https://archive.ics.uci.edu/ml/datasets/ElectricityLoadDiagrams20112014), [Stock](https://github.com/microsoft/qlib) | we design a novel residual learning net to model the prior knowledge of the fine-grained data’s distribution through the coarse-grained one. Furthermore, to alleviate the side effect of validity dif- ferences, we introduce a self-supervised objective for confidence estimation, which delivers more effective optimization without the requirement of additional annotation efforts. |
| [CAMul: Calibrated and Accurate Multi-view Time-Series Forecasting](https://dl.acm.org/doi/pdf/10.1145/3485447.3512037) | WWW | 2022 | [Code](https://github.com/AdityaLab/CAMul) | [google-symptoms](https://pair-code.github.io/covid19_symptom_dataset), covid19, power, tweet | We propose a general probabilistic multi-view forecasting framework CAMul, which can learn representations and uncertainty from diverse data sources. It integrates the information and uncertainty from each data view in a dynamic context-specific manner, assign- ing more importance to useful views to model a well-calibrated forecast distribution. |
| [EXIT: Extrapolation and Interpolation-based Neural Controlled Differential Equations for Time-series Classification and Forecasting](https://dl.acm.org/doi/pdf/10.1145/3485447.3512030) | WWW | 2022 | [Code]() | MuJoCo, Google Stock | we propose to i) generate another latent continuous path using an encoder-decoder archi- tecture, which corresponds to the interpolation process of NCDEs, i.e., our neural network-based interpolation vs. the existing explicit interpolation, and ii) exploit the generative characteristic of the de- coder, i.e., extrapolation beyond the time domain of original data if needed. |
| [RETE: Retrieval-Enhanced Temporal Event Forecasting on Unified Query Product Evolutionary Graph](https://dl.acm.org/doi/pdf/10.1145/3485447.3511974) | WWW | 2022 | - | [Yelp](https://www.yelp.com/dataset/), E-commerce | RETE efficiently and dynamically retrieves relevant entities centrally on each user as high-quality subgraphs, preventing the noise propagation from the densely evolutionary graph structures that incorporate abun- dant search queries. |
|[CATN: Cross Attentive Tree-aware Network for Multivariate Time Series Forecasting](https://www.aaai.org/AAAI22Papers/AAAI-7403.HeH.pdf)| AAAI | 2022 | - | [Traffic](https://archive.ics.uci.edu/ml/datasets/PEMS-SF), [Electricity](https://archive.ics.uci.edu/ml/datasets/ElectricityLoadDiagrams20112014), [PeMSD7(M)](https://dot.ca.gov/programs/traffic-operations/mpr/pemssource), [METR-LA](http://ceur-ws.org/Vol-2750/paper9.pdf) | studied the hierarchical and grouped correlation mining problem of multivariate time-series data and proposed CATN for multi-step forecasting. |
| [Reinforcement Learning based Dynamic Model Combination for Time Series Forecasting](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9564132)| AAAI | 2022 | - | [DATA](https://nsrdb.nrel.gov) | a novel and practically effective online ensemble aggregation framework for time-series forecasting that employs a deep reinforcement learning approach as a meta-learning technique. |
|[Conditional Local Convolution for Spatio-temporal Meteorological Forecasting](https://arxiv.org/abs/2101.01000)|AAAI | 2022 | [Code link](https://github.com/BIRD-TAO/CLCRN) | WeatherBench (Rasp et al. 2020) | a local conditional convolution to capture and imitate the meteorological flows of local patterns on the whole sphere|
| [TLogic: Temporal Logical Rules for Explainable Link Forecasting on Temporal Knowledge Graphs](https://arxiv.org/abs/2112.08025) | AAAI | 2022 | [Code link](https://github.com/liu-yushan/TLogic) | [ Integrated Cri- sis Early Warning System](https://dataverse.harvard.edu/dataverse/icews), [Split method](https://github.com/TemporalKGTeam/xERTE) |the first symbolic framework that directly learns temporal logical rules from temporal knowl- edge graphs and applies these rules for link forecasting|
| [Spatio-Temporal Recurrent Networks for Event-Based Optical Flow Estimation](https://arxiv.org/abs/2109.04871) | AAAI | 2022 | - | The MVSEC dataset (Zhu et al. 2018a) | novel input representation to effectively extract the spatio-temporal information from event input.|
| [A GNN-RNN Approach for Harnessing Geospatial and Temporal Information: Application to Crop Yield Prediction](https://arxiv.org/pdf/2111.08900.pdf) | AAAI | 2022 | - | Crop | a novel GNN-RNN framework to innovatively incorporate both geospatial and temporal knowledge into crop yield prediction.|
| [ST-GSP: Spatial-Temporal Global Semantic Representation Learning for Urban Flow Prediction](https://dl.acm.org/doi/10.1145/3488560.3498444) | WSDM | 2022 | [Code link](https://github.com/k51/STGSP) | TaxiBJ, BikeNYC | our model explicitly models the correlation among temporal dependencies of different scales to extract global temporal dependencies + new simple fusion strategy + self-supervised learning |
| [PYRAFORMER: LOW-COMPLEXITY PYRAMIDAL ATTENTION FOR LONG-RANGE TIME SERIES MODELING AND FORECASTING](https://openreview.net/pdf?id=0EXmFzUn5I) | ICLR | 2022 | [Code link](https://github.com/alipay/Pyraformer) | [Electricity](https://archive.ics.uci.edu/ml/datasets/ElectricityLoadDiagrams20112014), [Wind](https://www.kaggle.com/sohier/30-years-of-european-wind-generation), [ETT data](https://github.com/zhouhaoyi/ETDataset) and [App Flow](https://github.com/alipay/Pyraformer/tree/master/data) | a novel model based on pyramidal attention that can effectively describe both short and long temporal dependencies with low time and space complexity. |
| [DEPTS: DEEP EXPANSION LEARNING FOR PERIODIC TIME SERIES FORECASTING](https://openreview.net/pdf?id=AJAR-JgNw__) | ICLR | 2022 | [Code link](https://github.com/weifantt/DEPTS) | ELECTRICITY, TRAFFIC2, and M4(HOURLY) | model complicated periodic dependencies and to capture sophisticated compositions of diversified periods simultaneously. |
| [TAMP-S2GCNETS: COUPLING TIME-AWARE MULTIPERSISTENCE KNOWLEDGE REPRESENTATION WITH SPATIO-SUPRA GRAPH CONVOLUTIONAL NETWORKS FOR TIME-SERIES FORECASTING](https://openreview.net/pdf?id=wv6g8fWLX2q) | ICLR | 2022 | [Code link](https://www.dropbox.com/sh/n0ajd5l0tdeyb80/AABGn-ejfV1YtRwjf_L0AOsNa?dl=0.) | PeMSD3, PeMSD4, PeMSD8 and COVID-19 | The developed TAMP-S2GCNets model is shown to yield highly competitive forecasting performance on a wide range of datasets, with much lower computational costs. |
| [CoST: Contrastive Learning of Disentangled Seasonal-Trend Representations for Time Series Forecasting](https://arxiv.org/abs/2202.01575) | ICLR | 2022 | [Code link](https://github.com/salesforce/CoST) | [ETT](https://github.com/zhouhaoyi/ETDataset),[Electricity](https://archive.ics.uci.edu/ml/datasets/ElectricityLoadDiagrams20112014),[Weather](https://www.ncei.noaa.gov/data/local-climatological-data/) | proposed CoST, a contrastive learning framework that learns disentangled seasonal-trend representations for time series forecasting tasks. |
| [REVERSIBLE INSTANCE NORMALIZATION FOR ACCURATE TIME-SERIES FORECASTING AGAINST DISTRIBUTION SHIFT](https://openreview.net/pdf?id=cGDAkQo1C0p) | ICLR | 2022 | - | [ETT](https://github.com/zhouhaoyi/ETDataset), [Electricity Consuming Load (ECL)](https://archive.ics.uci.edu/ml/datasets/ElectricityLoadDiagrams20112014) | address the distribution shift problem in time series, proposing a simple yet ef- fective normalization-and-denormalization method, reversible instance normalization (RevIN) |
| [TEMPORAL ALIGNMENT PREDICTION FOR SUPERVISED REPRESENTATION LEARNING AND FEW-SHOT SEQUENCE CLASSIFICATION](https://openreview.net/pdf?id=p3DKPQ7uaAi) | ICLR | 2022 | [Code link](https://github.com/BingSu12/TAP) | MSR Action3D, MSR Daily Activity3D, “Spoken Arabic Digits (SAD)” dataset, ChaLearn | present TAP, which is a learnable distance for sequences. |
| [Deep Switching Auto-Regressive Factorization: Application to Time Series Forecasting](https://arxiv.org/abs/2009.05135) | AAAI | 2021 | - | [Pacific Ocean Temperature Dataset](http://iridl.ldeo.columbia.edu/), [Parking Birmingham Data Set](https://data.birmingham.gov.uk/dataset/birmingham-parking), ........ | it parameterizes the weights in terms of a deep switching vector auto-regressive likelihood governed with a Markovian prior |
| [Dynamic Gaussian Mixture Based Deep Generative Model for Robust Forecasting on Sparse Multivariate Time Series](https://arxiv.org/abs/2103.02164) | AAAI | 2021 | - | [USHCN](https://www.ncdc.noaa.gov/ushcn/introduction), [KDD-CUP](https://www.kdd.org/kdd2018/kdd-cup), MIMIC-III |provides a novel and general solution that explicitly defines temporal dependency between Gaussian mixture distributions at different time steps |
| [Temporal Latent Autoencoder: A Method for Probabilistic Multivariate Time Series Forecasting](https://www.aaai.org/AAAI21Papers/AAAI-3796.NguyenN.pdf) | AAAI | 2021 | -| Traffic, Electricity, Wiki | introduced a novel temporal latent auto-encoder method which enables nonlinear factorization of multivariate time series, learned end-to-end with a temporal deep learning latent space forecast model. By imposing a probabilistic latent space model, complex distributions of the input series are modeled via the decoder.|
| [Synergetic Learning of Heterogeneous Temporal Sequences for Multi-Horizon Probabilistic Forecasting](https://arxiv.org/abs/2102.00431) | AAAI | 2021 | - | Electricity(UCI), Traffic, Environment(Li, L.;Yan,J.;Yang,X.;and Jin,Y.2019a.) | presented a novel approach based on the deep conditional generative model to jointly learn from heterogeneous temporal sequences.|
| [Time-Series Event Prediction with Evolutionary State Graph](https://arxiv.org/pdf/1905.05006.pdf) | WSDM | 2021 | [Code link](https://github.com/VachelHU/EvoNet) | DJIA30, WebTraffic, NetFlow, ClockErr, and AbServe | proposed a novel represen- tation, the evolutionary state graph, to present the time-varying re- lations among time-series states. |
| [Long Horizon Forecasting With Temporal Point Processes](https://arxiv.org/pdf/2101.02815.pdf) | WSDM | 2021 | [Code link](https://github.com/pratham16cse/DualTPP) | Election, Taxi, Traffic-911, and EMS-911. | a novel MTPP model specif- ically designed for long-term forecasting of events. |
| [Modeling Inter-station Relationships with Attentive Temporal Graph Convolutional Network for Air Quality Prediction](https://dl.acm.org/doi/pdf/10.1145/3437963.3441731) | WSDM | 2021 | - | [Beijing](https://beijingair.sinaapp.com/), [Tianjin](http://urban-computing.com/data/Data-1.zip) and [POIs data](https://map.baidu.com/)| encode multiple types of inter-station relationships into graphs and design parallel GCNbased encoding and decoding modules to aggregate features from related stations using different graphs. |
| [Predicting Crowd Flows via Pyramid Dilated Deeper Spatial-temporal Network](https://dl.acm.org/doi/pdf/10.1145/3437963.3441785) | WSDM | 2021 | - | Wi-Fi connection log, [bike in New York city](https://ride.citibikenyc.com/system-data) and [taxi ride in New York](www1.nyc.gov/site/tlc/about/tlc-trip-record-data.page) | ConvLSTM + pyramid dilated residual network + integrated attention |
| [Z-GCNETs: Time Zigzags at Graph Convolutional Networks for Time Series Forecasting](https://proceedings.mlr.press/v139/chen21o.html) | ICML | 2021 | [Code link](https://github.com/Z-GCNETs/Z-GCNETs.git) | Decentraland, Bytom, PeMSD4 and PeMSD8. | The new Z-GCNETs layer allows us to track the salient timeaware topological characterizations of the data persisting over time. |
| [Explaining Time Series Predictions with Dynamic Masks](https://proceedings.mlr.press/v139/crabbe21a.html) | ICML |2021 | [Code link](https://github.com/JonathanCrabbe/Dynamask) | MIMIC-III | These masks are endowed with an insightful information theoretic interpretation and offer a neat improvement in terms of performance. |
| [End-to-End Learning of Coherent Probabilistic Forecasts for Hierarchical Time Series](https://proceedings.mlr.press/v139/rangapuram21a.html) | ICML |2021 | [Code link](https://github.com/awslabs/gluon-ts) | Labour, Traffic, Tourism, Tourism-L, and Wiki | a single, global model that does not require any adjustments to produce coherent, probabilistic forecasts, a first of its kind. |
| [Autoregressive Denoising Diffusion Models for Multivariate Probabilistic Time Series Forecasting](https://proceedings.mlr.press/v139/rasul21a.html) | ICML |2021 | [Code link](https://github.com/zalandoresearch/pytorch-ts) | [Exchange, Solar and Electricity](https://archive.ics.uci.edu/ml/datasets/ElectricityLoadDiagrams20112014), [Traffic](https://archive.ics.uci.edu/ml/datasets/PEMS-SF), [Taxi](https://www1.nyc.gov/site/tlc/about/tlc-trip-record-data.page) and [Wikipedia](https://github.com/mbohlkeschneider/gluon-ts/tree/mv_release/datasets) | a combination of improved variance schedule and an L1 loss to allow sampling with fewer steps at the cost of a small reduction in quality if such a trade-off is required. |
| [Conformal prediction interval for dynamic time-series](https://proceedings.mlr.press/v139/xu21h.html) | ICML |2021 | [Code link](https://github.com/hamrel-cxu/EnbPI) | solar and wind energy data | present a predictive inference method for dynamic time-series. |
| [RNN with Particle Flow for Probabilistic Spatio-temporal Forecasting](https://proceedings.mlr.press/v139/pal21b.html) | ICML |2021 | - | PeMSD3, PeMSD4, PeMSD7 and PeMSD8 | propose a state-space probabilistic model- ing framework for multivariate time-series prediction that can process information provided in the form of a graph that specifies (probable) predictive or causal relationships. |
| [ST-Norm: Spatial and Temporal Normalization for Multi-variate Time Series Forecasting](https://dl.acm.org/doi/pdf/10.1145/3447548.3467330) | KDD |2021 | [Code link](https://github.com/JLDeng/ST-Norm) | BikeNYC, PeMSD7 and Electricity | propose two kinds of normalization modules -- temporal and spatial normalization -- which separately refine the high-frequency component and the local component underlying the raw data. |
| [MiniRocket: A Fast (Almost) Deterministic Transform for Time Series Classification](https://dl.acm.org/doi/pdf/10.1145/3447548.3467231) | KDD |2021 | [Code link](https://github.com/angus924/minirocket) | UCR archive | reformulate Rocket into a new method, MiniRocket. MiniRocket is up to 75 times faster than Rocket on larger datasets, and almost deterministic. |
| [Dynamic and Multi-faceted Spatio-temporal Deep Learning for Traffic Speed Forecasting](https://dl.acm.org/doi/pdf/10.1145/3447548.3467275)|KDD|2021|[Code link](https://github.com/liangzhehan/DMSTGCN)| PeMSD4, PeMSD8 and England | design a dynamic graph construction method to learn the time-specific spatial dependencies of road segments. |
| [Forecasting Interaction Order on Temporal Graphs](https://dl.acm.org/doi/pdf/10.1145/3447548.3467341)|KDD|2021| [Code link](https://github.com/xiawenwen49/TAT-code)| COLLEGEMSG, EMAIL-EU and FBWALL | devise an attention mechanism to aggregate neighborhoods' information based on their representations and time encodings attached to their specific edges. |
|[Quantifying Uncertainty in Deep Spatiotemporal Forecasting](https://dl.acm.org/doi/pdf/10.1145/3447548.3467325)| KDD | 2021 | - | air quality PM2.5, road network traffic, and COVID-19 incident deaths | conduct benchmark studies on uncertainty quantification in deep spatiotemporal forecasting from both Bayesian and frequen- tist perspectives. |
|[Spatial-Temporal Graph ODE Networks for Traffic Flow Forecasting](https://arxiv.org/pdf/2106.12931.pdf)|KDD| 2021 |[Code link](https://github.com/square-coder/STGODE)|-|we capture spatial-temporal dynamics through a tensor-based ordinary differential equation (ODE), as a result, deeper networks can be constructed and spatial-temporal features are utilized synchronously.|
|[A PLAN for Tackling the Locust Crisis in East Africa: Harnessing Spatiotemporal Deep Models for Locust Movement Forecasting](https://dI.acm.org/doi/pdf/10.1145/3447548.3467184)|KDD|2021|[Code link](https://github.com/maryam-tabar/PLAN)| - | PLAN's novel spatio-temporal deep learning architecture enables representing PlantVillage's crowdsourced locust observation data using novel image-based feature representations, and its design is informed by several unique insights about this problem domain. |
| [Topological Attention for Time Series Forecasting](https://arxiv.org/pdf/2107.09031v1.pdf) | NeurIPS | 2021 | [Code link](https://github.com/ElementAl/N-BEATS) | [M4 competition dataset](https://github.com/Mcompetitions/M4-methods) | propose topological attention, which allows attending to local topological features within a time horizon of historical data. |
| [MixSeq: Connecting Macroscopic Time Series Forecasting with Microscopic Time Series Data](https://arxiv.org/pdf/2110.14354v1.pdf) | NeurIPS | 2021 | - | [Rossmann](https://www.kaggle.com/c/rossmann-store-sales), Wiki and [M5](https://www.kaggle.com/c/m5-forecasting-accuracy) | an end2end mixture model to cluster microscopic time series, where all the components come from a family of Seq2seq models parameterized by differ- ent parameters. |
| [Test-time Collective Prediction](https://arxiv.org/pdf/2106.12012v1.pdf) | NeurIPS | 2021 | - | Boston...... | Our approach takes inspiration from the literature in social science on human consensus-making. |
| [Bubblewrap: Online tiling and real-time flow prediction on neural manifolds](https://arxiv.org/pdf/2108.13941v1.pdf) | NeurIPS | 2021 | [Code link](https://github.com/pearsonlab/Bubblewrap) | - | we propose a method that combines fast, stable dimensionality reduction with a soft tiling of the resulting neural manifold, allowing dynamics to be approximated as a probability flow between tiles. |
| [Autoformer: Decomposition Transformers with Auto-Correlation for Long-Term Series Forecasting](https://arxiv.org/pdf/2106.13008v2.pdf) | NeurIPS | 2021 | [Code link](https://github.com/thuml/Autoformer) | ETT, Electricity, Exchange, and Traffic | we propose Auto- former as a novel decomposition architecture with an Auto-Correlation mechanism. |
| [Learning to Learn the Future: Modeling Concept Drifts in Time Series Prediction](https://dl.acm.org/doi/pdf/10.1145/3459637.3482271) | CIKM | 2021 | - | Climate Dataset, Stock Dataset and Synthetic Dataset | propose a novel framework called learning to learn the future. Specifically, we develop a learning method to model the concept drift during the inference stage, which can help the model generalize well in the future. |
| [AdaRNN: Adaptive Learning and Forecasting of Time Series](https://arxiv.org/pdf/2108.04443.pdf) | CIKM | 2021 | - | UCI activity, Air quality, Electric power and Stock price | AdaRNN is a general framework with flexible distribution distances integrated. |
| [Actionable Insights in Urban Multivariate Time-series](https://dl.acm.org/doi/pdf/10.1145/3459637.3482410) | CIKM | 2021 | - | Gaussian, Insect, Wikipedia and so on ...... | introduce and formalize a novel problem RaTSS that aims to find such time-series (rationalizations), which are actionable for the segmentation. We also propose an algorithm Find-RaTSS to find them for any black-box segmentation. |
| [Historical Inertia: A Neglected but Powerful Baseline for Long Sequence Time-series Forecasting](https://arxiv.org/pdf/2103.16349.pdf) | CIKM | 2021 | - | [ETT](https://github.com/zhouhaoyi/), [Electricity](https://github.com/laiguokun/multivariate-time-series-data) | introduce a new baseline for LSTF, the historical inertia (HI), which refers to the most recent historical data-points in the input time series. |
| [AGCNT: Adaptive Graph Convolutional Network for Transformer-based Long Sequence Time-Series Forecasting](https://dl.acm.org/doi/pdf/10.1145/3459637.3482054) | CIKM | 2021 | - | ETT | a probsparse adaptive graph self-attention+the stacked encoder with distilling probsparse graph self-attention integrates the graph attention mechanism+ the stacked decoder with generative inference generates all prediction values in one forward operation |
| [PIETS: Parallelised Irregularity Encoders for Forecasting with Heterogeneous Time-Series](https://arxiv.org/pdf/2110.00071.pdf) | ICDM | 2021 | - | Covid-19 | design a novel architecture, PIETS, to model heterogeneous time-series. |
| [Attentive Neural Controlled Differential Equations for Time-series Classification and Forecasting](https://arxiv.org/pdf/2109.01876.pdf) | ICDM | 2021 | [Code link](https://github.com/sheoyon-jhin/ANCDE) | Character Trajectories, PhysioNet Sepsis and Google Stock. | present Attentive Neural Controlled Differential Equations (ANCDEs) for time-series classification and forecasting, where dual NCDEs are used: one for generating attention values, and the other for evolving hidden vectors for a downstream machine learning task. |
| [SSDNet: State Space Decomposition Neural Network for Time Series Forecasting](https://arxiv.org/pdf/2112.10251.pdf) | ICDM | 2021 | - | Electricity,Exchange, Solar ...... | SSDNet combines the Transformer architecture with state space models to provide probabilistic and interpretable forecasts, including trend and seasonality components and previous time steps important for the prediction. |
| [Two Birds with One Stone: Series Saliency for Accurate and Interpretable Multivariate Time Series Forecasting](https://www.ijcai.org/proceedings/2021/0397.pdf) | IJCAI | 2021 | - | electricity, Air-quality, Industry data | Series saliency is model agnostic and performs as an adap- tive data augmentation method for training deep models. Moreover, by slightly changing the objec- tive, we optimize series saliency to find a mask for interpretable forecasting in both feature and time dimensions. |
| [TE-ESN: Time Encoding Echo State Network for Prediction Based on Irregularly Sampled Time Series Data](https://arxiv.org/pdf/2105.00412.pdf) | IJCAI | 2021 | - | MG system, SILSO, USHCN, COVID-19 | propose a novel Time En- coding (TE) mechanism. TE can embed the time information as time vectors in the complex domain. |
| [DeepFEC: Energy Consumption Prediction under Real-World Driving Conditions for Smart Cities](https://dl.acm.org/doi/pdf/10.1145/3442381.3449983) | WWW | 2021| [Code link](https://github.com/ElmiSay/DeepFEC) |[SPMD](https://catalog.data.gov/dataset/safety-pilot-model-deployment-data), [VED](https://github.com/gsoh/VED) | presents a novel framework that identifies vehicle/driving environment-dependent factors to predict energy consumption over a road network based on his- torical consumption data for different vehicle types. |
| [HINTS: Citation Time Series Prediction for New Publications viaDynamic Heterogeneous Information Network Embedding](https://dl.acm.org/doi/pdf/10.1145/3442381.3450107) | WWW | 2021| - | [the AMiner Computer Science dataset](https://aminer.org/citation) and [the American Physical Society (APS) Physics dataset](https://journals.aps.org/datasets) | a novel end-to-end deep learning framework that converts citation signals from dynamic heterogeneous information networks (DHIN) into citation time series. |
| [Variable Interval Time Sequence Modeling for Career Trajectory Prediction: Deep Collaborative Perspective](https://dl.acm.org/doi/pdf/10.1145/3442381.3449959) | WWW | 2021| - | traffic data from 1988.1 to 2018.11 | propose a unified time-aware career trajectory prediction framework, namely TACTP, which is capable of jointly providing the above three abilities for better understanding the career trajectories of talents. |
| [REST: Reciprocal Framework for Spatiotemporal coupled predictions](https://dl.acm.org/doi/pdf/10.1145/3442381.3449928) | WWW | 2021| - | a traffic dataset released by Li et al. and [a web dataset](https://dumps.wikimedia.org) | come up with a novel Reciprocal SpatioTemporal (REST) framework, which introduces Edge Inference Networks (EINs) to couple with GCNs. |
| [AutoSTG: Neural Architecture Search for Predictions of Spatio-Temporal Graph](https://dl.acm.org/doi/pdf/10.1145/3442381.3449816) | WWW | 2021| [Code link](https://github.com/panzheyi/AutoSTG) | PEMS-BAY and METR-LA | propose a novel framework, entitled AutoSTG, for automated spatio-temporal graph prediction. In our AutoSTG, spatial graph convolution and temporal convolution operations are adopted in our search space to capture complex spatio-temporal correlations. Besides, we employ the meta learning technique to learn the adjacency matrices of spatial graph convolution layers and kernels of temporal convolution layers from the meta knowledge of the attributed graph. |
| [Fine-grained Urban Flow Prediction](https://dl.acm.org/doi/pdf/10.1145/3442381.3449792) | WWW | 2021| - | TaxiBJ+, HappyValley | Spatio-Temporal Relation Net- work (STRN) to predict fine-grained urban flows. First, a backbone network is used to learn high-level representations for each cell. Second, we present a Global Relation Module (GloNet) that cap- tures global spatial dependencies much more efficiently compared to existing methods. Third, we design a Meta Learner that takes external factors and land functions (e.g., POI density) as inputs to produce meta knowledge and boost model performances. |
| [Probabilistic Time Series Forecasting with Shape and Temporal Diversity](https://proceedings.neurips.cc/paper/2020/hash/2f2b265625d76a6704b08093c652fd79-Abstract.html) | NeurIPS | 2020| [Code link](https://github.com/vincent-leguen/STRIPE) | - | Diversity is controlled via two proposed differentiable positive semi-definite kernels for shape and time and exploits a forecasting model with a disentangled latent space. |
| [Benchmarking Deep Learning Interpretability in Time Series Predictions](https://proceedings.neurips.cc/paper/2020/hash/47a3893cc405396a5c30d91320572d6d-Abstract.html) | NeurIPS | 2020| [Code link](https://github.com/ayaabdelsalam91/TS-Interpretability-Benchmark) | - | a comprehensive synthetic benchmark where positions of informative features are known. |
| [Adversarial Sparse Transformer for Time Series Forecasting](https://proceedings.neurips.cc/paper/2020/hash/c6b8c8d762da15fa8dbbdfb6baf9e260-Abstract.html) | NeurIPS | 2020| - | [electricity](https://archive.ics.uci.edu/ml/datasets/ElectricityLoadDiagrams-20112014),[traffic](https://archive.ics.uci.edu/ml/datasets/PEMS-SF), [wind](https://www.kaggle.com/sohier/30-years-of-european-wind-generation), [solar](https://www.nrel.gov/grid/solar-power-data.html), M4-Hourly | By adversarial learning, we improve the contiguous and fidelity at the sequence level. We further propose Sparse Transformer to improve the ability to pay more attention on relevant steps in time series. |
| [Deep Rao-Blackwellised Particle Filters for Time Series Forecasting](https://proceedings.neurips.cc/paper/2020/hash/afb0b97df87090596ae7c503f60bb23f-Abstract.html) | NeurIPS | 2020| - | electricity, traffic, solar, exchange, wiki | proposed an extension of the classical SGLS that addresses two weaknesses |
| [Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction](https://proceedings.neurips.cc/paper/2020/hash/12ffb0968f2f56e51a59a6beb37b2859-Abstract.html) | NeurIPS | 2020| - | - | introduced a new class of predictive model, a R-model, that is a hybrid between standard model-free and model-based mechanisms |
| [EvolveGraph: Multi-Agent Trajectory Prediction with Dynamic Relational Reasoning](https://proceedings.neurips.cc/paper/2020/hash/e4d8163c7a068b65a64c89bd745ec360-Abstract.html) | NeurIPS | 2020| - | Honda 3D Dataset (H3D), NBA SportVU Dataset (NBA), and Stanford Drone Dataset (SDD) | present a generic trajectory forecasting framework with explicit relational reasoning among multiple heterogeneous, interactive agents with a graph representation. |
| [Multi-agent Trajectory Prediction with Fuzzy Query Attention](https://proceedings.neurips.cc/paper/2020/hash/fe87435d12ef7642af67d9bc82a8b3cd-Abstract.html) | NeurIPS | 2020| [Code link](https://github.com/nitinkamra1992/FQA) | ETH-UCY, Collisions, NGsim, Charges, NBA | a general architecture designed to predict trajectories in multi-agent systems while modeling the crucial inductive biases of motion, namely, inertia, relative motion, intents and interactions. |
| [Set Functions for Time Series](http://proceedings.mlr.press/v119/horn20a.html) | ICML | 2020| [Code link](https://github.com/BorgwardtLab/Set_Functions_for_Time_Series) | MIMIC-III, Physionet 2012 Mortality Prediction Challenge | presented a novel approach for classifying time se- ries with irregularly-sampled and unaligned. |
| [Learning from Irregularly-Sampled Time Series: A Missing Data Perspective](http://proceedings.mlr.press/v119/li20k.html) | ICML | 2020| [Code link](https://github.com/steveli/partial-encoder-decoder) | MNIST..... | introduced an encoder-decoder framework for modeling general missing data problems and introduced two model families leveraging this framework: P-VAE and P-BiGAN. |
| [Unsupervised Transfer Learning for Spatiotemporal Predictive Networks](http://proceedings.mlr.press/v119/yao20a.html) | ICML | 2020| [Code link](https://github.com/thuml/transferable-memory) | - | studied a new unsupervised transfer learn- ing problem of using multiple pretrained models to im- prove the performance of a new spatiotemporal predictive learning task. |
| [Connecting the Dots: Multivariate Time Series Forecasting with Graph Neural Networks](https://arxiv.org/pdf/2005.11650.pdf) | KDD | 2020| [Code link](https://github.com/nnzhan/MTGNN) | Solar-Energy, Traffic, Electricity, Exchange-Rate and PeMS ......| a novel framework for multivariate time series forecasting |
| [Deep State-Space Generative Model For Correlated Time-to-Event Predictions](https://dl.acm.org/doi/pdf/10.1145/3394486.3403206) | KDD | 2020| [Code link](https://github.com/Google-Health/records-research/state-space-model) | MIMIC-III | proposed a deep latent state-space generative model to capture the relations between patients’ mortality risk and the associated organ failure risks. |
| [Attention based multi-modal new product sales time-series forecasting](https://dl.acm.org/doi/10.1145/3394486.3403362) | KDD | 2020| - | - | propose and empirically evaluate several novel attention-based multi-modal encoder-decoder models to forecast the sales for a new product purely based on product images, any available product attributes and also external factors like holidays, events, weather, and discount. |
| [BusTr: predicting bus travel times from real-time traffic](https://dl.acm.org/doi/pdf/10.1145/3394486.3403376) | KDD | 2020| - | - | demonstrates excellent generalization to test data that differs both spatially and temporally from the training examples we use, allowing our model to cope gracefully with the ever-changing world. |
| [CompactETA: A Fast Inference System for Travel Time Prediction](https://dl.acm.org/doi/10.1145/3394486.3403386) |KDD| 2020| - | - | encode high order spatial and temporal dependency into sophisticated representations by applying graph attention network on a spatiotemporal weighted road network graph. We further encode the sequential information of the travel route by positional encoding to avoid the recurrent network structure. |
| [DATSING: Data Augmented Time Series Forecasting with Adversarial Domain Adaptation](https://www.researchgate.net/publication/344082806_DATSING_Data_Augmented_Time_Series_Forecasting_with_Adversarial_Domain_Adaptation) | CIKM | 2020| - | - | propose a two-phased frameworkwhich first clusters similar mixed domains time series data and thenperforms a fine-tuning procedure with domain adversarial regular-ization to achieve better out-of-sample generalization. |
| [Dual Sequential Network for Temporal Sets Prediction](https://dl.acm.org/doi/pdf/10.1145/3397271.3401124) | SIGIR | 2020| - | - | addressed the problem that most of the existing methods were designed for predicting time series or temporal events, which could not be directly used for temporal sets prediction due to the difficulties of multi-level representations of items and sets, complex temporal dependencies of sets, and evolving dynamics of sequential behaviors. |
| [Think Globally, Act Locally: A Deep Neural Network Approach to High-Dimensional Time Series Forecasting](https://papers.nips.cc/paper/2019/file/3a0844cee4fcf57de0c71e9ad3035478-Paper.pdf) | NeurIPS | 2019 | [Code](https://github.com/rajatsen91/deepglo) | electricity, traffic, wik, PeMS07(M) | Our model can be trained effectively on high-dimensional but diverse time series, where different time series can have vastly different scales, without a priori normalization or rescaling. |
| [Shape and Time Distortion Loss for Training Deep Time Series Forecasting Models](https://papers.nips.cc/paper/2019/file/466accbac9a66b805ba50e42ad715740-Paper.pdf) | NeurIPS | 2019 | [Code](https://github.com/vincent-leguen/DILATE) | Synth, ECG, Traffic | We introduce a differentiable loss function suitable for training deep neural nets, and provide a custom back-prop implementation for speeding up optimization. We also introduce a variant of DILATE, which provides a smooth generalization of temporally-constrained Dynamic Time Warping (DTW). |
| [Enhancing the Locality and Breaking the Memory Bottleneck of Transformer on Time Series Forecasting](https://papers.nips.cc/paper/2019/file/6775a0635c302542da2c32aa19d86be0-Paper.pdf) | NeurIPS | 2019 | - | [solar](https://www.nrel.gov/grid/solar-power-data.html), [wind](https://www.kaggle.com/sohier/30-years-of-european-wind-generation) | we first propose convolutional self-attention by producing queries and keys with causal convolution so that local context can be better incorporated into attention mechanism. |
|[Discovering Latent Covariance Structures for Multiple Time Series](https://proceedings.mlr.press/v97/tong19a/tong19a.pdf) | ICML | 2019 | - | - | We present a pragmatic search algorithm which explores a larger structure space efficiently. |
| [BeatGAN: Anomalous Rhythm Detection using Adversarially Generated Time Series](https://www.ijcai.org/proceedings/2019/0616.pdf) | IJCAI | 2019 | [Code](https://github.com/Vniex/BeatGAN) | - | BeatGAN outputs explainable results to pinpoint the anomalous time ticks of an input beat, by comparing them to ad- versarially generated beats. |
| [Learning Interpretable Deep State Space Model for Probabilistic Time Series Forecasting](https://arxiv.org/pdf/2102.00397.pdf) | IJCAI | 2019 | [Code](https://docs.aws.amazon.com/sagemaker/latest/dg/deepar.html) | [MIT-BIH ECG dataset](https://physionet.org/cgi-bin/atm/ATM?database=mitdb), CMU Motion Capture dataset | Our approach involves a deep network based embodiment of the state space model, to allow for non-linear emission and transition models design, which is flexible to deal with arbitrary data distribution.|
| [Explainable Deep Neural Networks for Multivariate Time Series Predictions](https://www.ijcai.org/proceedings/2019/0932.pdf) | IJCAI | 2019 | - | - | We design a two stage convolutional neural network architec- ture which uses particular kernel sizes. This allows us to utilise gradient based techniques for generat- ing saliency maps for both the time dimension and the features. |## Time Series Classification
| Paper | Conference | Year | Code | Used Datasets |Key Contribution|
| :-------------------: | :----------: | :----------: | :------------------------: | ----------------------- |------ |
| [OMNI-SCALE CNNS: A SIMPLE AND EFFECTIVE KERNEL SIZE CONFIGURATION FOR TIME SERIES CLASSIFICATION](https://openreview.net/pdf?id=PDYs7Z2XFGv) | ICLR |2022 | [Code link](https://github.com/Wensi-Tang/OS-CNN) | MEG-TLE, UEA 30 archive, UCR 85 archive, UCR 128 archive | presents a simple 1D-CNN block, namely OS-block. |
| [Correlative Channel-Aware Fusion for Multi-View Time Series Classification](https://arxiv.org/abs/1911.11561) | AAAI | 2021 | - | EV-Action, NTU RGB+D, UCI Daily and Sports Activities | The global-local temporal encoders are developed to extract robust temporal representations for each view, and a learnable fusion mechanism is proposed to boost the multi-view label information. |
| [Learnable Dynamic Temporal Pooling for Time Series Classification](https://arxiv.org/abs/2104.02577) | AAAI | 2021 | - | UCR/UEA |proposes a dynamic temporal pooling + a learning framework to simultaneously optimize the network parameters of a CNN classifier and the prototypical hidden series that encodes the latent semantic of the segments. |
| [ShapeNet: A Shapelet-Neural Network Approach for Multivariate Time Series Classification](https://ojs.aaai.org/index.php/AAAI/article/view/17018) | AAAI | 2021 | [Code link](http://alturl.com/d26bo) | UEA MTS datasets | We propose Mdc-CNN to learn time series subsequences of various lengths into unified space and propose a cluster-wise triplet loss to train the network in an unsupervised fashion. We adopt MST to obtain the MST representation of time series. |
| [Joint-Label Learning by Dual Augmentation for Time Series Classification](https://ojs.aaai.org/index.php/AAAI/article/view/17071) | AAAI | 2021 | [Code](https://github.com/fchollet/keras) | [UCR](https://www.cs.ucr.edu/˜eamonn/) | a novel time-series data augmentation method. |
| [Explainable Multivariate Time Series Classification: A Deep Neural Network Which Learns To Attend To Important Variables As Well As Time Intervals](https://arxiv.org/pdf/2011.11631.pdf) | WSDM | 2021 | - | PM2.5w Seizure Movement | introduced LAXCAT, a novel, modular architecture for explainable multivariate time series classification. |
| [Voice2Series: Reprogramming Acoustic Models for Time Series Classification](http://proceedings.mlr.press/v139/yang21j.html) | ICML |2021 | [Code link](https://github.com/huckiyang/Voice2Series-Reprogramming) | Coffee,...... |a novel approach to reprogram a pre-trained acoustic model for time series classification. |
| [Learning Saliency Maps to Explain Deep Time Series Classifiers](https://thartvigsen.github.io/papers/cikm21.pdf) | CIKM | 2021 | - | Wafer, GunPoint, Computers, Earthqakes, FordA, FordB, CricketX, PTB, ECG | a method that learns to highlight the timesteps that are most responsible and the degree to which they are important for the classifier’s prediction. |
| [Gaussian Process Model Learning for Time Series Classification](https://dl.acm.org/doi/pdf/10.1145/3468791.3468839) | ICDM | 2021 | - | - | proposed a novel approach for time series classification called Local Gaussian Process Model Inference Classification (LOGIC). |
| [Contrast Profile: A Novel Time Series Primitive that Allows Classification in Real World Settings](https://link.springer.com/content/pdf/10.1007/s10618-020-00695-8.pdf) | ICDM | 2021 | [Code link]() | UCR archive | We have shown that the MPdist is more robust to noise, irrelevant data, misalignment etc., than either Euclidian distance or DTW. |
| [Attentive Neural Controlled Differential Equations for Time-series Classification and Forecasting](https://arxiv.org/pdf/2109.01876.pdf) | ICDM | 2021 | [Code link](https://github.com/sheoyon-jhin/ANCDE) | - | a novel NCDE architecture that incorporates the concept of attention. |
| [Imbalanced Time Series Classification for Flight Data Analyzing with Nonlinear Granger Causality Learning](https://dl.acm.org/doi/pdf/10.1145/3340531.3412710) | CIKM | 2020| [Code link]() | - | presented a neural network classification model for imbalanced multivariate time series by leveraging the information learned from normal class, which can also learn the nonlinear Granger causality for each class, so that we can pinpoint how time series classes differ from each other. |
| [Visualet: Visualizing Shapelets for Time Series Classification](https://dl.acm.org/doi/pdf/10.1145/3340531.3417414) | CIKM | 2020| [Code link]() | UCR archive | Such efficiency has made it possible for demo attendees to interact with shapelet discovery and explore high-quality shapelets. In this demo, we present Visualet -- a tool for visualizing shapelets, and exploring effective and interpretable ones. |
| [Learning Discriminative Virtual Sequences for Time Series Classification](https://dl.acm.org/doi/pdf/10.1145/3340531.3412099) | CIKM | 2020| [Code link]() | - | propose a novel time series classification method named Discriminative Virtual Sequence Learning (DVSL). |
| [Fast and Accurate Time Series Classification Through Supervised Interval Search](https://people.eng.unimelb.edu.au/jianzhongq/papers/ICDM2020_TimeSeriesClassificationViaIntervalSearch.pdf) | CIKM | 2020| [Code link](https://github.com/stevcabello/STSF) | - | STSF improves the classification efficiency by examining only a (set of) sub-series of the original time series, and its tree-based structure allows for interpretable outcomes. |## Anomaly Detection
| Dataset | Conference | Year | Code | Used Datasets |Key Contribution|
| :-------------------: | :----------: | :------------------------: | ----------------------- | ------------------------- |------ |
|[Unsupervised Model Selection For Time-series Anomaly Detection](https://arxiv.org/abs/2210.01078)| ICLR | 2023 | | UCR, SMD | In this paper, we explore how we can select accurate time-series anomaly detection models given an unlabeled dataset and a set of candidate models. I’d like to point out that we use adjusted F1. The adjustment ensures that once an algorithm detects even a part of the anomaly, we consider that it has detected the entire anomaly.|
|[Deep Variational Graph Convolutional Recurrent Network for Multivariate Time Series Anomaly Detection](https://proceedings.mlr.press/v162/chen22x/chen22x.pdf)| ICML | 2022 | | DND, SMD, MSL, SMAP| In this paper, we model channel dependency and stochasticity within MTS by developing an embedding-guided probabilistic generative network. We combine it with adaptive Variational Graph Convolutional Recurrent Network (VGCRN) to model both spatial and temporal fine-grained correlations in MTS. To explore hierarchical latent representations, we further extend VGCRN into a deep variational network, which captures multilevel information at different layers and is robust to noisy time series.|
| [A Semi-Supervised VAE Based Active Anomaly Detection Framework in Multivariate Time Series for Online Systems](https://dl.acm.org/doi/pdf/10.1145/3485447.3511984) | WWW | 2022 | - | online cloud server data from two different types of game business | SLA-VAE first defines anomalies based on feature extraction module, introduces semi-supervised VAE to identify anomalies in multivariate time series, and employs active learning to update the online model via a small number of uncertain samples. |
| [Towards a Rigorous Evaluation of Time-series Anomaly Detection](https://arxiv.org/abs/2109.05257) | AAAI | 2022 | - | Secure water treatment (SWaT), ...... | applying PA can severely overestimate a TAD model’s capability.|
| [DeepGPD: A Deep Learning Approach for Modeling Geospatio-Temporal Extreme Events](https://aaai-2022.virtualchair.net/poster_aaai10861) | AAAI | 2022 | [Code link](https://github.com/TylerPWilson/deepGPD) | [the Global Historical Climatology Network (GHCN)](https://www.ncdc.noaa.gov/ghcn-daily-description) |proposed a novel deep learning architecture (DeepGPD) capable of learning the parameters of the generalized Pareto distribution while satisfying the conditions placed on those parameters.|
| [Graph-Augmented Normalizing Flows for Anomaly Detection of Multiple Time Series](https://arxiv.org/abs/2202.07857) | ICLR |2022 | - | PMU-B, PMU-C, SWaT, METR-LA | propose a novel flow model by imposing a Bayesian network among constituent series. |
| [Anomaly Transformer: Time Series Anomaly Detection with Association Discrepancy](https://arxiv.org/abs/2110.02642) | ICLR |2022 | - | SMD MSL SMAP SWaT PSM | propose the Anomaly Transformer with a new Anomaly-Attention mechanism to compute the association discrepancy. A minimax strategy is devised to amplify the normal-abnormal distinguishability of the association discrepancy. |
| [Graph Neural Network-Based Anomaly Detection in Multivariate Time Series](https://www.aaai.org/AAAI21Papers/AAAI-5076.DengA.pdf) | AAAI | 2021 | - | SWaT and WADI | proposed our Graph Deviation Network (GDN) approach, which learns a graph of relationships between sensors, and detects deviations from these patterns, while incorporating sensor embeddings |
| [Time Series Anomaly Detection with Multiresolution Ensemble Decoding](https://www.aaai.org/AAAI21Papers/AAAI-5192.ShenL.pdf) | AAAI | 2021 | - | [ECG, 2D-gesture and Power-demand](http://www.cs.ucr.edu/∼eamonn/discords/), [Yahoo’s S5](https://webscope.sandbox.yahoo.com/) | Its core is to use lower-resolution information to help long-range decoding at layers with higher resolutions. This is achieved by jointly learning multiple recurrent decoders where each decoder is with a different decoding length.|
| [Outlier Impact Characterization for Time Series Data](https://par.nsf.gov/servlets/purl/10272499) | AAAI | 2021 | [benchmark](https://github.com/numenta/NAB) | [Webscope](http://labs.yahoo.com/Academic-Relations), [Physionet](https://physionet.org/content/chfdb/1.0.0/) | study recurring outliers in time series data and aim to provide a systematic way of measuring the impact of such outliers on time series analysis |
| [F-FADE: Frequency Factorization for Anomaly Detection in Edge Streams](https://arxiv.org/pdf/2011.04723.pdf) | WSDM | 2021 | [Code link](http://snap.stanford.edu/f-fade/) | - | a new approach for detection of anomalies in edge streams, which uses a novel frequency-factorization technique to efficiently model the time-evolving distributions of frequencies of interactions between node-pairs. |
| [FluxEV: A Fast and Effective Unsupervised Framework for Time-Series Anomaly Detection](https://dl.acm.org/doi/pdf/10.1145/3437963.3441823) | WSDM | 2021 | [Code link](https://github.com/DawnsonLi/EVT) | - | By converting the non-extreme anomalies to extreme values, our framework addresses the limitation of SPOT and achieves a huge improvement in the detection accuracy. Moreover, Method of Moments is adopted to speed up the parameter estimation in the automatic thresholding. |
| [Event Outlier Detection in Continuous Time](https://proceedings.mlr.press/v139/liu21g.html) | ICML |2021 | [Code link](https://github.com/siqil/CPPOD) | MIMIC III |we develop outlier detection methods based on point processes thatcan take context information into account. Our methods are based on Bayesian decision theory and hypothesis testing with theoretical guarantees. |
| [Multivariate Time Series Anomaly Detection and Interpretation using Hierarchical Inter-Metric and Temporal Embedding](https://dl.acm.org/doi/10.1145/3447548.3467075) | KDD |2021 | [Code link](https:/github.com/zhhlee/lnterFusion) | - |Its core idea is to model the normal patterns inside MTS data through hierarchical Variational AutoEncoder with two stochastic latent variables, each of which learns low-dimensional inter-metric or temporal embeddings. Furthermore, we propose an MCMC-based method to obtain reasonable embeddings and reconstructions at anomalous parts for MTS anomaly interpretation. |
| [Practical Approach to Asynchronous Multi-variate Time Series Anomaly Detection and Localization](https://dl.acm.org/doi/10.1145/3447548.3467174) | KDD |2021 | [Code link](https://github.com/eBay/RANSyncoders) | - |Our solution is designed to leverage this behavior. The solution utilizes spectral analysis on the latent representation of a pre-trained autoencoder to extract dominant frequencies across the signals, which are then used in a subsequent network that learns the phase shifts across the signals and produces a synchronized representation of the raw multivariate. |
| [Time Series Anomaly Detection for Cyber-physical Systems via Neural System Identification and Bayesian Filtering](https://dl.acm.org/doi/10.1145/3447548.3467137) |KDD | 2021 | [Code link](https://github.com/NSIBF/NSIBF) | - | a specially crafted neural network architecture is posed for system identification, i.e., capturing the dynamics of CPS in a dynamical state-space model; then a Bayesian filtering algorithm is naturally applied on top of the "identified" state-space model for robust anomaly detection by tracking the uncertainty of the hidden state of the system recursively over time.|
| [Multi-Scale One-Class Recurrent Neural Networks for Discrete Event Sequence Anomaly Detection](https://dl.acm.org/doi/10.1145/3447548.3467125) | KDD |2021 | [Code link](https://github.com/wzwtrevor/Multi-Scale-One-Class-Recurrent-Neural-Networks) | - |a multi-scale one-class recurrent neural network for detecting anomalies in discrete event sequences. |
| [Online false discovery rate control for anomaly detection in time series](https://proceedings.neurips.cc/paper/2021/file/def130d0b67eb38b7a8f4e7121ed432c-Paper.pdf) | NeurIPS | 2021 | - | - | The methods proposed in this article overcome short-comings of previous FDRC rules in the context of anomaly detection, in particular ensuring that power remains high even when the alternative is exceedingly rare (typical in anomaly detection) and the test statistics are serially dependent (typical in time series). |
| [Detecting Anomalous Event Sequences with Temporal Point Processes](https://arxiv.org/pdf/2106.04465.pdf) | NeurIPS | 2021 | - | LOGS,STEAD | The proposed method can be combined with various TPP models, such as neural TPPs, and is easy to implement. |
| [You Only Look at One Sequence: Rethinking Transformer in Vision through Object Detection](https://arxiv.org/pdf/2106.00666.pdf) | NeurIPS | 2021 | [Code link](https://github.com/hustvl/YoLos) | - | have explored the transferability of the vanilla ViT pre-trained on mid-sized ImageNet-1k dataset to the more challenging COCO object detection benchmark |
| [Drop-DTW: Aligning Common Signal Between Sequences While Dropping Outliers](https://arxiv.org/pdf/2108.11996.pdf)| NeurIPS | 2021 | - | MNIST | introduced an extension to the classic DTW algorithm, which relaxes the constraints of matching endpoints of paired sequences and the continuity of the path cost. |
| [Conditional Score-based Diffusion Models for Probabilistic Time Series Imputation](https://arxiv.org/pdf/2107.03502.pdf)| NeurIPS | 2021 | [Code link](https:/github.com/ermongroup/CsDl) | healthcare, air quality | a novel approach to impute multivariate time series with conditional diffusion models. |
| [SDFVAE: Static and Dynamic Factorized VAE for Anomaly Detection of Multivariate CDN KPIs](https://dl.acm.org/doi/pdf/10.1145/3442381.3450013) | WWW | 2021| - | - | Our key insight is that different KPIs are constrained by certain time-invariant characteristics of the underlying system, and that explicitly modelling such invariance may help resist noise in the data. We thus propose a novel anomaly detection method called SDFVAE, short for Static and Dynamic Factorized VAE, that learns the representations of KPIs by explicitly factorizing the latent variables into dynamic and static parts. |
| [Time-series Change Point Detection with Self-Supervised Contrastive Predictive Coding](https://arxiv.org/pdf/2011.14097.pdf) | WWW | 2021| - | [Yahoo!Benchmark](https://webscope.sandbox.yahoo.com/), [HASC](http://hasc.jp/hc2011), [USC-HAD](http://sipi.usc.edu/had) | propose a novel self-supervised CPD method, 𝑇𝑆 − 𝐶𝑃2 for time series. 𝑇𝑆 − 𝐶𝑃2 learns an embedded representation predict a future interval of a times series from historical samples. |
| [NTAM: Neighborhood-Temporal Attention Model for Disk Failure Prediction in Cloud Platforms](https://dl.acm.org/doi/pdf/10.1145/3442381.3449867) | WWW | 2021| - | industrial datasets collected from millions of disks in Microsoft Azure,....... | NTAM is a novel approach that not only utilizes a disk’s own status data, but also considers its neighbors’ status data. Moreover, NTAM includes a novel attention-based temporal component to capture the temporal nature of the disk status data. Besides, we propose a data enhancement method, called Temporal Progressive Sampling (TPS), to handle the extreme data imbalance issue. |
| [Improving Irregularly Sampled Time Series Learning with Time-Aware Dual-Attention Memory-Augmented Networks](https://dl.acm.org/doi/pdf/10.1145/3459637.3482079) | CIKM | 2021 | [Code link]() | - | The proposed model can leverage both time irregularity, multi-sampling rates and global temporal patterns information inherent in IASS-MTS so as to learn more effective representations for improving prediction performance. |
| [BiCMTS: Bidirectional Coupled Multivariate Learning of Irregular Time Series with Missing Values](https://dl.acm.org/doi/pdf/10.1145/3459637.3482064) | CIKM | 2021 | - | - | BiCMTS method to represent both forward and backward value couplings within a time series by RNNs and between MTS by self-attention networks; the learned bidirectional intra- and inter-time series coupling representations are fused to estimate missing values. |
| [Timeseries Anomaly Detection using Temporal Hierarchical One-Class Network](https://proceedings.neurips.cc/paper/2020/hash/97e401a02082021fd24957f852e0e475-Abstract.html) | NeurIPS | 2020| - | 2D-gesture, Power demand, KDD-Cup99 data, SWaT, MSL, SMAP | based on a set of hierarchical structured hyperspheres. The solution uses a probabilistic relevance on cluster centers to help the model access the whole temporal history. A center orthogonality loss and a temporal self-supervision loss are also introduced for improved feature representation. |
| [USAD : UnSupervised Anomaly Detection on multivariate time series](https://dl.acm.org/doi/pdf/10.1145/3394486.3403392) | KDD | 2020| - | SWaT, WADI, SMD, SMAP, MSL | adversely trained autoencoders |
| [Application Performance Anomaly Detection with LSTM on Temporal Irregularities in Logs](https://hal.archives-ouvertes.fr/hal-03117074/document) | CIKM | 2020| [Code link]() | - | present a new method to perform anomaly detection, while maintaining the quantitative aspect of time, using a count of event types over time. |
| [Multivariate Time-series Anomaly Detection via Graph Attention Network](https://arxiv.org/pdf/2009.02040.pdf) | ICDM | 2020| [Code link]() | SMAP, MSL, TSA | propose a novel framework based on graph attention network for multivariate time-series anomaly detection. |
| [MERLIN: Parameter-Free Discovery of Arbitrary Length Anomalies in Massive Time Series Archives](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9338376) | ICDM | 2020| - | - | an algorithm that can efficiently and exactly find discords of all lengths in massive time series archives. |
| [Outlier Detection for Time Series with Recurrent Autoencoder Ensembles](https://www.ijcai.org/proceedings/2019/0378.pdf) | IJCAI | 2019 | [Code]( https://github.com/tungk/OED) | Traffic,...... | The two solutions are ensemble frameworks, specifically an indepen- dent framework and a shared framework, both of which combine multiple S-RNN based autoencoders to enable outlier detection. |
| [Regularization for Time Series Trend Filtering](https://arxiv.org/pdf/1906.03751.pdf) | IJCAI | 2019 | - | - | we adopt the Hu- ber loss to suppress outliers, and utilize a combina- tion of the first order and second order difference on the trend component as regularization to cap- ture both slow and abrupt trend changes. Further- more, an efficient method is designed to solve the proposed robust trend filtering based on majoriza- tion minimization (MM) and alternative direction method of multipliers (ADMM). |## Time series Clustering
| Paper | Conference | Year | Code | Used Datasets |Key Contribution|
| :-------------------: | :----------: | :----------: | :------------------------: | ----------------------- |------ |
| [Clustering Interval-Censored Time-Series for Disease Phenotyping](https://arxiv.org/pdf/2102.07005v4.pdf) | AAAI | 2022 |- | - | present our method, SubLign, to learn latent representations of disease progression that correct for temporal misalignment in real-world observations and consider conditions for identifiability of subtype and alignment values. |
| [Corsets for Time Series Clustering](https://arxiv.org/pdf/2110.15263.pdf) | NeurIPS| 2021 | - | synthetic data |address the problem of constructing coresets for time series data generated from Gaussian mixture models with auto-correlations across time.|
| [Temporal Phenotyping using Deep Predictive Clustering of Disease Progression](http://proceedings.mlr.press/v119/lee20h/lee20h.pdf) | ICML | 2020| [Code link](https://github.com/chl8856/AC_TPC) | [UKCF](https://www.cysticfibrosis.org.uk), [Alzheimer’s Disease Neuroimaging Initiative (ADNI)](https://adni.loni.usc.edu) | defined novel loss functions to encourage each cluster to have homogeneous future outcomes and designed optimization procedures to avoid trivial solutions in identifying cluster as- signments and the centroids. |
| [Learning low-dimensional state embeddings and metastable clusters from time series data](https://papers.nips.cc/paper/2019/file/c0e90532fb42ac6de18e25e95db73047-Paper.pdf) | NeurIPS | 2019 | - | simulated diffusion processes | his idea also leads to a kernel reshaping method for more accurate nonparametric estimation of the transition function. State embedding can be used to cluster states into metastable sets, thereby identifying the slow dynamics. Sharp statistical error bounds and misclassification rate are proved. |
| [Learning Representations for Time Series Clustering](https://papers.nips.cc/paper/2019/file/1359aa933b48b754a2f54adb688bfa77-Paper.pdf) | NeurIPS | 2019 | - | - | ere we propose a novel unsupervised temporal representation learning model, named Deep Temporal Clustering Representation (DTCR), which integrates the temporal reconstruction and K-means objective into the seq2seq model. |## Time series Segmentation
| Paper | Conference | Year | Code | Used Datasets |Key Contribution|
| :-------------------: | :----------: | :----------: | :------------------------: | ----------------------- |------ |
| [ClaSP-Time Series Segmentation](https://dl.acm.org/doi/pdf/10.1145/3459637.3482240)| CIKM | 2021 | - | 98 datasets.... | a novel and highly accurate method for TSS. ClaSP hierarchically splits a TS into two parts, where each split point is determined by training a binary TS classifier for each possible split point and selecting the one with highest accuracy |
| [Multi-series Time-aware Sequence Partitioning for Disease Progression Modeling](https://www.ijcai.org/proceedings/2021/0493.pdf)| IJCAI | 2021 | - | sEMG | improved the TICC by incorporating multi- series input (M-TICC) and time-awareness (MT-TICC). |
| [Linear Time Complexity Time Series Clustering with Symbolic Pattern Forest](https://www.ijcai.org/proceedings/2019/0406.pdf) | IJCAI | 2019 | [Code]( http://mason.gmu.edu/∼xli22/SPF) | - | This paper presents a novel time series clustering algorithm that has linear time complex- ity. The proposed algorithm partitions the data by checking some randomly selected symbolic pat- terns in the time series. |
| [Similarity Preserving Representation Learning for Time Series Clustering](https://arxiv.org/pdf/1702.03584.pdf) | IJCAI | 2019 | - | - | In this paper, we bridge this gap by proposing an efficient representation learning framework that is able to convert a set of time series with various lengths to an instance-feature matrix. |## Others
| Paper | Conference | Year | Code | Used Datasets |Key Contribution|
| :-------------------: | :----------: | :----------: | :------------------------: | ----------------------- |------ |
| [Adaptive Conformal Predictions for Time Series](https://arxiv.org/pdf/2202.07282.pdf)| ICML | 2022| [code](https://github.com/mzaffran/adaptiveconformalpredictionstimeseries) | | Uncertainty quantification of predictive models is crucial in decision-making problems. Conformal prediction is a general and theoretically sound answer. However, it requires exchangeable data, excluding time series. While recent works tackled this issue, we argue that Adaptive Conformal Inference (ACI, Gibbs and Candes` , 2021), developed for distribution-shift time series, is a good procedure for time series with general dependency. We theoretically analyse the impact of the learning rate on its efficiency in the exchangeable and auto-regressive case. We propose a parameter-free method, AgACI, that adaptively builds upon ACI based on online expert aggregation. We lead extensive fair simulations against competing methods that advocate for ACI’s use in time series. We conduct a real case study: electricity price forecasting. The proposed aggregation algorithm provides efficient prediction intervals for day-ahead forecasting. All the code and data to reproduce the experiments is made available.|
|[Modeling Irregular Time Series with Continuous Recurrent Units](https://arxiv.org/pdf/2111.11344.pdf)| ICML | 2022| [code](https://github.com/boschresearch/Continuous-Recurrent-Units) | Pendulum Images, Climate Data (USHCN), Electronic Health Records (Physionet) | In many datasets (e.g. medical records) observation times are irregular and can carry important information. To address this challenge, we propose continuous recurrent units (CRUs) – a neural architecture that can naturally handle irregular intervals between observations. |
|[Unsupervised Time-Series Representation Learning with Iterative Bilinear Temporal-Spectral Fusion](https://arxiv.org/abs/2202.04770) | ICML | 2022| | HAR, SleepEDF, ECG Waveform, ETT, Weather, SaaT, WADI, SMD, SMAP, MSL | We devise a novel iterative bilinear temporal-spectral fusion to explicitly encode the affinities of abundant time-frequency pairs, and iteratively refines representations in a fusion-and-squeeze manner with Spectrum-to-Time (S2T) and Time-to-Spectrum (T2S) Aggregation modules |
|[Utilizing Expert Features for Contrastive Learning of Time-Series Representations](https://arxiv.org/abs/2206.11517)| ICML | 2022| [code](https://github.com/boschresearch/expclr)| HAR, SleepEDF, ECG Waveform | We present an approach that incorporates expert knowledge for time-series representation learning. Our method employs expert features to replace the commonly used data transformations in previous contrastive learning approaches. |
| [Neural Predicting Higher-order Patterns in Temporal Networks](https://dl.acm.org/doi/pdf/10.1145/3485447.3512181) | WWW | 2022 | [Code](https://github.com/Graph-COM/Neural_Higher-order_Pattern_Prediction) | Tags-math-sx, Tags-ask-ubuntu, Congress-bills, DAWN, Threads-ask-ubuntu | we proposed the first model HIT to predict higher- order patterns in temporal hypergraphs to answer what type of, when, and why interactions may expand in a node triplet. HIT can be further generalized to predict even higher-order patterns. |
| [ONBRA: Rigorous Estimation of the Temporal Betweenness Centrality in Temporal Networks](https://dl.acm.org/doi/pdf/10.1145/3485447.3512204) | WWW | 2022 | [Code](https://github.com/iliesarpe/ONBRA) | [data](http://www.sociopatterns.org/), [data2](https://snap.stanford.edu/temporal-motifs/data-html) | In this work we present ONBRA, the first sampling-based approximation algorithm for estimating the temporal betweenness centrality values of the nodes in a temporal network, providing rigorous probabilistic guar- antees on the quality of its output. |
| [Knowledge-based Temporal Fusion Network for Interpretable Online Video Popularity Prediction](https://dl.acm.org/doi/pdf/10.1145/3485447.3511934) | WWW | 2022 | [Code]() | medium-video dataset and a micro-video dataset from the server logs of [Xigua](https://www.ixigua.com/) and [Douyin](https://www.douyin.com/recommend) | In this paper, we propose a Knowledge-based Temporal Fusion Network (KTFN) that incorporates knowledge graph representation to address the aforementioned challenges in the task of online video popularity prediction. |
| [STAM: A Spatiotemporal Aggregation Method for Graph Neural Network-based Recommendation](https://dl.acm.org/doi/pdf/10.1145/3485447.3512041) | WWW | 2022 | [Code](https://github.com/zyang-16/STAM) | MovieLens, Amazon, Taobao | In this work, we propose a spatiotemporal aggregation method STAM to efficiently incorporate temporal information into neighbor embedding learning. |
| [A Graph Temporal Information Learning Framework for Popularity Prediction](https://www2022.thewebconf.org/PaperFiles/45.pdf) | WWW | 2022 | [Code]() | Sina Weibo | In this paper, we propose a graph temporal information learning framework based on an improved graph convolutional network (GTGCN), which can capture both the temporal information governing the spread of information in a snapshot, and the inherent temporal dependencies among different snapshots. |
| [PREP: Pre-training with Temporal Elapse Inference for Popularity Prediction](https://www2022.thewebconf.org/PaperFiles/61.pdf) | WWW | 2022 | [Code]() | Sina Weibo, Twitter | We design a novel pretext task for pre-training, i.e., temporal elapse inference for two ran- domly sampled time slices of popularity dynamics, impelling the representation model to learn intrinsic knowledge about popularity dynamics. |
| [Conditional Loss and Deep Euler Scheme for Time Series Generation](https://arxiv.org/abs/2102.05313v5) | AAAI | 2022 | - | - | - |
| [TS2Vec: Towards Universal Representation of Time Series](https://arxiv.org/abs/2106.10466) | AAAI | 2022 | [Code link](https://github.com/yuezhihan/ts2vec) | [128 UCR datasets](https://www.cs.ucr.edu/~eamonn/time_series_data_2018/),[30 UEA datasets](http://www.timeseriesclassification.com/), [3 ETT datasets](https://github.com/zhouhaoyi/ETDataset), [Electricity](https://archive.ics.uci.edu/ml/datasets/ElectricityLoadDiagrams20112014), [Yahoo dataset](https://webscope.sandbox.yahoo.com/catalog.php?datatype=s&did=70), [KPI dataset](http://test-10056879.file.myqcloud.com/10056879/test/20180524_78431960010324/KPI%E5%BC%82%E5%B8%B8%E6%A3%80%E6%B5%8B%E5%86%B3%E8%B5%9B%E6%95%B0%E6%8D%AE%E9%9B%86.zip) | performs contrastive learning in a hierarchical way over augmented context views|
| [Time Masking for Temporal Language Models](https://dl.acm.org/doi/abs/10.1145/3488560.3498529) | WSDM | 2022 | [Code link](https://github.com/guyrosin/tempobert) | - |- |
| [Long Short-Term Temporal Meta-learning in Online Recommendation](https://dl.acm.org/doi/10.1145/3488560.3498371) | WSDM | 2022 | - | - |- |
| [Structure Meets Sequences: Predicting Network of Co-evolving Sequences](https://dl.acm.org/doi/10.1145/3488560.3498411) | WSDM | 2022 | [Code link](https://github.com/SoftWiser-group/SeeS) | - |- |
| [EvoKG: Jointly Modeling Event Time and Network Structure for Reasoning over Temporal Knowledge Graphs](https://dl.acm.org/doi/10.1145/3488560.3498451) | WSDM | 2022 | [Code link](https://namyongpark.github.io/evokg) | - | - |
| [FILLING THE GAPS: MULTIVARIATE TIME SERIES IMPUTATION BY GRAPH NEURAL NETWORKS](https://openreview.net/pdf?id=kOu3-S3wJ7) | ICLR |2022 | - | Air quality, Traffic, and Smart Grids | - |
| [PSA-GAN: PROGRESSIVE SELF ATTENTION GANS FOR SYNTHETIC TIME SERIES](https://openreview.net/pdf?id=Ix_mh42xq5w) | ICLR |2022 | [Code link](https://github.com/mbohlkeschneider/psa-gan), [Code on glueonts](https://github.com/awslabs/gluon-ts.) | Electricty, M4, Solar energy, Traffic | - |
| [Generative Semi-Supervised Learning for Multivariate Time Series Imputation](https://www.aaai.org/AAAI21Papers/AAAI-7391.MiaoX.pdf) | AAAI | 2021 | - | - | - |
| [Temporal Cross-Effects in Knowledge Tracing](https://dl.acm.org/doi/pdf/10.1145/3437963.3441802) | WSDM | 2021 | - | - |- |
| [Learning Dynamic Embeddings for Temporal Knowledge Graphs](https://dl.acm.org/doi/pdf/10.1145/3437963.3441741) | WSDM | 2021 | - | - |- |
| [Temporal Meta-path Guided Explainable Recommendation](https://arxiv.org/pdf/2101.01433.pdf) | WSDM | 2021 | - | - |- |
| [Generative Adversarial Networks for Markovian Temporal Dynamics: Stochastic Continuous Data Generation](https://proceedings.mlr.press/v139/park21d.html) | ICML | 2021 | [Code link]() | - |- |
| [Discrete-time Temporal Network Embedding via Implicit Hierarchical Learning](https://dl.acm.org/doi/10.1145/3447548.3467422) | KDD | 2021 |[Code link](https://github.com/marlin-codes/HTGN-KDD21) | - |- |
| [Time-series Generation by Contrastive Imitation](https://neurips.cc/Conferences/2021/ScheduleMultitrack?event=26999) | NeurIPS | 2021 | - | - |- |
|[Adjusting for Autocorrelated Errors in Neural Networks for Time Series](https://arxiv.org/pdf/2101.12578.pdf)| NeurIPS | 2021 | [Code link](https://github.com/Daikon-Sun/AdjustAutocorrelation) | - |- |
| [Spikelet: An Adaptive Symbolic Approximation for Finding Higher-Level Structure in Time Series](https://ieeexplore.ieee.org/document/9679141) | ICDM | 2021 | - | - | - |
| [STING: Self-attention based Time-series Imputation Networks using GAN](https://ieeexplore.ieee.org/document/9679183) | ICDM | 2021 | [Code link]() | - | - |
| [SMATE: Semi-Supervised Spatio-Temporal Representation Learning on Multivariate Time Series](https://arxiv.org/pdf/2110.00578.pdf) | ICDM | 2021 | [Code link]() | - | - |
| [TCube: Domain-Agnostic Neural Time-series Narration](https://arxiv.org/pdf/2110.05633.pdf) | ICDM | 2021 | [Code link]() | - | - |
| [Towards Interpretability and Personalization: A Predictive Framework for Clinical Time-series Analysis](https://ieeexplore.ieee.org/document/9679181) | ICDM | 2021 | [Code link]() | - | - |
| [Continual Learning for Multivariate Time Series Tasks with Variable Input Dimensions](https://arxiv.org/pdf/2203.06852.pdf) | ICDM | 2021 | [Code link]() | - | - |
| [CASPITA: Mining Statistically Significant Paths in Time Series Data from an Unknown Network](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9679098) | ICDM | 2021 | [Code link]() | - | - |
| [Multi-way Time Series Join on Multi-length Patterns](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9679018) | ICDM | 2021 | [Code link]() | - | - |
| [Temporal Event Profiling based on Multivariate Time Series Analysis over Long-term Document Archives](https://link.springer.com/content/pdf/10.1186/s13059-015-0639-8.pdf) | SIGIR | 2021 | [Code link]() | - | - |
| [Time-Aware Multi-Scale RNNs for Time Series Modeling](https://www.ijcai.org/proceedings/2021/0315.pdf) | ICDM | 2021 | [Code link]() | - | - |
| [Deep reconstruction of strange attractors from time series](https://proceedings.neurips.cc/paper/2020/hash/021bbc7ee20b71134d53e20206bd6feb-Abstract.html) | NeurIPS | 2020| [Code link](https://github.com/williamgilpin/fnn) | - | - |
| [One Detector to Rule Them All: Towards a General Deepfake Attack Detection Framework](https://arxiv.org/pdf/2105.00187.pdf) | WWW | 2021| [Code link]() | - | - |
| [High-recall causal discovery for autocorrelated time series with latent confounders](https://proceedings.neurips.cc/paper/2020/hash/94e70705efae423efda1088614128d0b-Abstract.html) | NeurIPS | 2020| [Code link](https://github.com/jakobrunge/tigramite) | - | - |
| [Learning Long-Term Dependencies in Irregularly-Sampled Time Series](https://arxiv.org/abs/2006.04418) | NeurIPS | 2020| [Code link](https://github.com/mlech26l/ode-lstms) | - | - |
| [ARMA Nets: Expanding Receptive Field for Dense Prediction](https://proceedings.neurips.cc/paper/2020/hash/cd10c7f376188a4a2ca3e8fea2c03aeb-Abstract.html) | NeurIPS | 2020| [Code link](https://github.com/umd-huang-lab/ARMA-Networks) | - | - |
| [Learnable Group Transform For Time-Series](http://proceedings.mlr.press/v119/cosentino20a.html) | ICML | 2020| [Code link](https://github.com/Koldh/LearnableGroupTransform-TimeSeries) | - | - |
| [Fast RobustSTL: Efficient and Robust Seasonal-Trend Decomposition for Time Series with Complex Patterns](https://dl.acm.org/doi/10.1145/3394486.3403271) | KDD | 2020| - | - | - |
| [Matrix Profile XXI: A Geometric Approach to Time Series Chains Improves Robustness](https://dl.acm.org/doi/10.1145/3394486.3403164) | KDD | 2020| [Code link](https://sites.google.com/site/timeserieschains) | - | - |
| [Multi-Source Deep Domain Adaptation with Weak Supervision for Time-Series Sensor Data](https://dl.acm.org/doi/10.1145/3394486.3403228) | KDD | 2020| [Code link](https://github.com/floft/codats) | - | - |
| [Personalized Imputation on Wearable-Sensory Time Series via Knowledge Transfer](https://dl.acm.org/doi/pdf/10.1145/3340531.3411879) | CIKM | 2020| [Code link]() | - | - |
| [Hybrid Sequential Recommender via Time-aware Attentive Memory Network](https://dl.acm.org/doi/pdf/10.1145/3340531.3411869) | CIKM | 2020| [Code link]() | - | - |
| [Order-Preserving Metric Learning for Mining Multivariate Time Series](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9338310) | ICDM | 2020| [Code link]() | - | - |
| [Fast Automatic Feature Selection for Multi-period Sliding Window Aggregate in Time Series](https://arxiv.org/pdf/2012.01037.pdf) | ICDM | 2020| - | Tianchi, PLAsTiCC, NFL, MotionSense, Gas Sensors | a framework to fill the gap of the end-to-end automatic sliding window aggregate feature selection for time series |
| [Matrix Profile XXII: Exact Discovery of Time Series Motifs Under DTW](https://ieeexplore.ieee.org/document/9338266) | ICDM | 2020| [Code link]() | - | - |
| [Inductive Granger Causal Modeling for Multivariate Time Series](https://arxiv.org/pdf/2102.05298.pdf) | ICDM | 2020 | - | [Finance](http://www.skleinberg.org/data.html), FMRI, Synthetic data | - |
| [Mining Recurring Patterns in Real-Valued Time Series using the Radius Profile](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9338407) | ICDM | 2020 | - | - | - |
| [Learning Periods from Incomplete Multivariate Time Series](cs.albany.edu/~petko/lab/papers/zgzb2020icdm.pdf) | ICDM | 2020 | - | - | - |
| [FilCorr: Filtered and Lagged Correlation on Streaming Time Series](https://ieeexplore.ieee.org/document/9338257) | ICDM | 2020 | - | - | - |
| [Unsupervised Scalable Representation Learning for Multivariate Time Series ](https://papers.nips.cc/paper/2019/file/53c6de78244e9f528eb3e1cda69699bb-Paper.pdf) | NeurIPS | 2019 | [Code]() | [Datasets]() | - |
| [Latent Ordinary Differential Equations for Irregularly-Sampled Time Series](https://papers.nips.cc/paper/2019/file/42a6845a557bef704ad8ac9cb4461d43-Paper.pdf) | NeurIPS | 2019 | - | Human Activity dataset | We generalize RNNs to have continuous-time hidden dynamics defined by ordinary differential equations (ODEs), a model we call ODE-RNNs. Furthermore, we use ODE-RNNs to replace the recognition network of the recently-proposed Latent ODE model. |
| [GRU-ODE-Bayes: Continuous Modeling of Sporadically-Observed Time Series](https://papers.nips.cc/paper/2019/file/455cb2657aaa59e32fad80cb0b65b9dc-Paper.pdf) | NeurIPS | 2019 | [Code]() | [Datasets]() | - |
| [Interpolation-Prediction Networks for Irregularly Sampled Time Series](https://openreview.net/pdf?id=r1efr3C9Ym) | ICLR | 2019 | [Code](https://github.com/mlds-lab/interp-net) | [MIMIC-III](https://mimic.physionet.org/), [UWaveGestureLibraryAll](http://timeseriesclassification.com) | In this paper, we have presented a new framework for dealing with the problem of supervised learn- ing in the presence of sparse and irregularly sampled time series. The proposed framework is fully modular.|
| [SOM-VAE: Interpretable Discrete Representation Learning on Time Series](https://openreview.net/pdf?id=rygjcsR9Y7) | ICLR | 2019 | -| - | The SOM-VAE can recover topologically interpretable state representations on time series and static data. It provides an improvement to standard methods in terms of clustering performance and offers a way to learn discrete two-dimensional representations of the data manifold in concurrence with the reconstruction task. |
|[U-Time: A Fully Convolutional Network for Time Series Segmentation Applied to Sleep Staging](https://papers.nips.cc/paper/2019/file/57bafb2c2dfeefba931bb03a835b1fa9-Paper.pdf) | NeurIPS | 2019 | [Code]() | [Datasets]() | - |
| [E^2GAN: End-to-End Generative Adversarial Network for Multivariate Time Series Imputation](https://www.ijcai.org/proceedings/2019/0429.pdf)| IJCAI | 2019 | -| - | - |# 📝 Time Series Libraries
| Name | Company | Stars | Explanation |
| :--------------------------: | :-------------------: | :------------------: |:------ |
|[📚 Darts](https://github.com/unit8co/darts)| Unit8 | ⭐️ 5.3K | Darts is a Python library for user-friendly forecasting and anomaly detection on time series. It contains a variety of models, from classics such as ARIMA to deep neural networks. The forecasting models can all be used in the same way, using fit() and predict() functions, similar to scikit-learn. The library also makes it easy to backtest models, combine the predictions of several models, and take external data into account. Darts supports both univariate and multivariate time series and models. The ML-based models can be trained on potentially large datasets containing multiple time series, and some of the models offer a rich support for probabilistic forecasting.
|[📚 Prophet](https://github.com/facebook/prophet)| Meta (Facebook) | ⭐️ 15.5K | Prophet is a procedure for forecasting time series data based on an additive model where non-linear trends are fit with yearly, weekly, and daily seasonality, plus holiday effects. It works best with time series that have strong seasonal effects and several seasons of historical data. Prophet is robust to missing data and shifts in the trend, and typically handles outliers well.
|[📚 Neural Prophet](https://github.com/ourownstory/neural_prophet)| - | ⭐️ 2.8K | A Neural Network based Time-Series model, inspired by Facebook Prophet and AR-Net, built on PyTorch.
|[📚 GluonTS](https://github.com/awslabs/gluonts)| AWS | ⭐️ 3.3K | GluonTS is a Python package for probabilistic time series modeling, focusing on deep learning based models, based on PyTorch and MXNet.
|[📚 stumpy](https://github.com/TDAmeritrade/stumpy)| TD Ameritrade | ⭐️ 2.5K | STUMPY is a powerful and scalable Python library that efficiently computes something called the matrix profile, which is just an academic way of saying "for every subsequence within your time series, automatically identify its corresponding nearest-neighbor". What's important is that once you've computed your matrix profile (middle panel above) it can then be used for a variety of time series data mining tasks.
|[📚 tsfresh](https://github.com/blue-yonder/tsfresh)| Blue Yonder GmbH | ⭐️ 7K | The package provides systematic time-series feature extraction by combining established algorithms from statistics, time-series analysis, signal processing, and nonlinear dynamics with a robust feature selection algorithm. In this context, the term time-series is interpreted in the broadest possible sense, such that any types of sampled data or even event sequences can be characterised.
|[📚 SKTIME](https://github.com/sktime/sktime)| - | ⭐️ 6.1K | sktime is a library for time series analysis in Python. It provides a unified interface for multiple time series learning tasks. Currently, this includes time series classification, regression, clustering, annotation and forecasting. It comes with time series algorithms and scikit-learn compatible tools to build, tune and validate time series models.
|[📚 pmdarima](https://github.com/alkaline-ml/pmdarima)| - | ⭐️ 1.3K | Pmdarima (originally pyramid-arima, for the anagram of 'py' + 'arima') is a statistical library designed to fill the void in Python's time series analysis capabilities.
|[📚 tslearn](https://github.com/tslearn-team/tslearn)| - | ⭐️ 2.4K | The machine learning toolkit for time series analysis in Python.
|[📚 PyTorch Forecasting](https://github.com/jdb78/pytorch-forecasting)| - | ⭐️ 2.6K | PyTorch Forecasting is a PyTorch-based package for forecasting time series with state-of-the-art network architectures. It provides a high-level API for training networks on pandas data frames and leverages PyTorch Lightning for scalable training on (multiple) GPUs, CPUs and for automatic logging.
|[📚 StatsForecast](https://github.com/Nixtla/statsforecast)| - | ⭐️ 2.2K | StatsForecast offers a collection of widely used univariate time series forecasting models, including automatic ARIMA, ETS, CES, and Theta modeling optimized for high performance using numba. It also includes a large battery of benchmarking models.
|[📚 Streamz](https://github.com/python-streamz/streamz)| - | ⭐️ 1.1K | Streamz helps you build pipelines to manage continuous streams of data. It is simple to use in simple cases, but also supports complex pipelines that involve branching, joining, flow control, feedback, back pressure, and so on.
# 📝 Time Series Benchmarks and Datasets
| Paper | Conference | Year | Code | Key Contribution|
| :--------------------------: | :-------------------: | :------------------: | ----------------------- |------ |
|[Monash Time Series Forecasting Repository](https://forecastingdata.org/)| NeurIPS | 2021 | [paper link](https://openreview.net/pdf?id=wEc1mgAjU-) |There have been many deep time series evaluated on the same datasets in recent years. Even though this works for basic benchmarking, it may not hold up when applied to a variety of temporal tasks. Its goal is to create a "master list" of different time series datasets and serve as an authoritative benchmark. Over 20 different datasets are included in the repository, spanning industries as diverse as health, retail, ride-share, and demographics.
|[Revisiting Time Series Outlier Detection: Definitions and Benchmarks](https://openreview.net/forum?id=r8IvOsnHchr)| NeurIPS | 2021 | [link](https://github.com/datamllab/tods/tree/benchmark) |This paper critiques many existing time series anomaly/outlier detection datasets and proposes 35 brand-new synthetic datasets and 4 real-world datasets for benchmarking purposes.
|[Subseasonal Forecasting Microsoft](https://www.microsoft.com/en-us/research/project/subseasonal-climate-forecasting/)| Microsoft | 2021 | [link](https://www.microsoft.com/en-us/research/project/subseasonal-climate-forecasting/downloads/) |Microsoft has released a dataset to facilitate machine learning for improving subseasonal forecasting (e.g. two to six weeks in the future). Forecasting subseasonally helps government agencies and farmers prepare for weather events. In general, deep learning models performed quite poorly compared to other methods in Microsoft's benchmark. A simple feed-forward model proved to be the most accurate DL model, while the Informer performed poorly.## Contributing
We appreciate all contributions to improve this paper repo!Please feel free to pull requests, open an issue or send me email ([email protected]) to add awesome papers.