https://github.com/tansudasli/analytics-sandbox

From a statistical approach to machine learning
feature-engineering machine-learning matplotlib numpy opencv pandas probability regex scikit-learn seaborn statistics

# analytics-sandbox

Machine learning models w/ scikit-learn

## structure

- / (root): **ML** on real scenarios
- /dataset: .csv files
- /probability: core probability exercises
- /statistics: core statistics exercises
- /pandas: core pandas concepts, overlap between pandas and numpy
- /opencv: core image and video concepts w/ opencv
- /numpy: core _image concepts_ in matplotlib and numpy
- /scikit-learn: core _ML_ concepts w/ scikit-learn

## models at / (root)

- `car_sales_lineer_regression` covers all ML steps
- `advertising_EDA` uses a pre-cleaned **click dataset**.
- the real estate dataset for NYC is **very messy**!
- `real_estate_of_nyc_EDA` covers exploratory data analysis
- `real_estate_of_nyc_lineer_regression` covers sale_price _prediction_
- `real_estate_of_nyc_knn` _fills in empty_ neighborhoods
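
As a rough illustration of the idea behind `real_estate_of_nyc_knn`, the sketch below fills empty neighborhood values with a k-nearest-neighbors classifier trained on the rows where the neighborhood is known. The toy data, the `latitude`/`longitude` feature columns, and the choice of `n_neighbors=3` are assumptions made for this example; the notebook itself may use different columns and settings.

```python
import pandas as pd
from sklearn.neighbors import KNeighborsClassifier

# Toy stand-in for the NYC data. Column names and values are hypothetical.
sales = pd.DataFrame({
    "latitude": [40.70, 40.71, 40.72, 40.80, 40.81, 40.82, 40.705, 40.815],
    "longitude": [-74.00, -74.01, -74.00, -73.95, -73.96, -73.95, -74.005, -73.955],
    "neighborhood": ["FINANCIAL", "FINANCIAL", "FINANCIAL",
                     "HARLEM", "HARLEM", "HARLEM", None, None],
})

features = ["latitude", "longitude"]

# Split rows into those with a known neighborhood and those left empty.
known = sales[sales["neighborhood"].notna()]
missing = sales[sales["neighborhood"].isna()]

# Train on the known rows, then predict a label for each empty row
# by majority vote among its 3 nearest known rows.
knn = KNeighborsClassifier(n_neighbors=3)
knn.fit(known[features], known["neighborhood"])

filled = sales.copy()
filled.loc[missing.index, "neighborhood"] = knn.predict(missing[features])

print(filled)
```

On real data the features would usually need scaling and the value of `n_neighbors` would be picked by validation, since KNN is sensitive to both.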