Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/mhamilton723/tseries
A library for time series analysis with sklearn.
https://github.com/mhamilton723/tseries
Last synced: 2 months ago
JSON representation
A library for time series analysis with sklearn.
- Host: GitHub
- URL: https://github.com/mhamilton723/tseries
- Owner: mhamilton723
- Created: 2015-12-15T01:11:58.000Z (about 9 years ago)
- Default Branch: master
- Last Pushed: 2017-09-20T03:42:15.000Z (over 7 years ago)
- Last Synced: 2024-10-14T09:38:09.259Z (3 months ago)
- Language: Jupyter Notebook
- Homepage:
- Size: 7.29 MB
- Stars: 80
- Watchers: 9
- Forks: 13
- Open Issues: 2
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# tseries
Classes for working with time series in python using the sklearn ecosystem. Contains:
[![DOI](https://zenodo.org/badge/48011339.svg)](https://zenodo.org/badge/latestdoi/48011339)
- TimeSeriesRegressor: Turn any sklearn estimator into a time series estimator. Eg Linear Regression -> Vector Auto Regression (VAR)
- Delta Transformer: Transform your time series to predict additive changes. When pipelining this one can create "Integrated" time series models like the VARI
- Double Pipeline: Add invertable transformations to your labels so that you can regress in different spaces and pull back to the original
- Functions for proper cross validation of time series.
- Other utilities ive found usefulDocumentation is currently lagging behind development and pull requests are greatly appreciated!
# Install
The latest stable release can be installed with:
`pip install tseries`
## Requires
Numpy, Pandas, SciKit-Learn,pickle## Usage
To make a predictor of the stock market that maps the previous two days of the s&p500 stock prices and
predicts the next day's price of AAPL stock try the following:
```
from TimeSeriesEstimator import TimeSeriesRegressor, time_series_split
from sklearn.linear_model import LinearRegression,Lasso
from utils import datasetsX = datasets('sp500')
y = X['AAPL']
X_train, X_test = time_series_split(X)
y_train, y_test = time_series_split(y)n_prev=2
tsr = TimeSeriesRegressor(Lasso(), n_prev=n_prev)
tsr.fit(X_train, y_train)
pred_train = tsr.predict(X_train) #outputs a numpy array of length: len(X_train)-n_prev
pred_test = tsr.predict(X_test)
```To forecast all stocks in the s&p500 100 days into the future use the forecast method:
```
tsr = TimeSeriesRegressor(LinearRegression(), n_prev=2)
tsr.fit(X_train)
fc = tsr.forecast(X_train, 100)
```
See the ipython notebook for a longer interactive example!## Install
Clone this repo and call directly as a module. Have not added automatic install support yet.## Mechanics
The TSR works by taking in a single (X) or two datasets (X,Y) of equal length.
In the single dataset case, the TSR assumes you would like to predict the next element in the dataset using the previous elements.
In either case it forms a dataset by taking the previous n timesteps and flattening them into a vector.Dataset X
Feature 1
Feature 21
1.52
2.53
3.54
4.55
5.5New X with n_prev = 2
Feature 1
Feature 2
Feature 3
Feature 41
1.5
2
2.52
2.5
3
3.53
3.5
4
4.5New Y with n_prev = 2
Feature 1
Feature 23
3.54
4.55
5.5Now the X and Y datasets can be fit by any regression technique in sklearn.
If the technique cannot handle vectors as outputs, use the "parallel_models" input to predict
each feature sequence with its own multi to single dim regressor.