Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/nikoshet/new-york-city-taxi-fare-prediction-machine-learning

Project for course 'Machine Learning' for M.Sc. 'Data Science and Machine Learning' in NTUA
https://github.com/nikoshet/new-york-city-taxi-fare-prediction-machine-learning

data-science keras-tensorflow machine-learning numpy pandas python

Last synced: about 2 months ago
JSON representation

Project for course 'Machine Learning' for M.Sc. 'Data Science and Machine Learning' in NTUA

Awesome Lists containing this project

README

        

# Machine Learning M.Sc. Course Project

Dataset: "New York City Taxi Fare Prediction" (Kaggle).

More can be found [here](https://www.kaggle.com/c/new-york-city-taxi-fare-prediction/)

## Contributors

[Nick Nikitas](https://github.com/nikoshet)

[Filippos Markolefas](https://github.com/filippos93)

[Myron Sampsakis](https://github.com/sbmyron)

## Notebook Structure
```
~/Machine_Learning_Project.ipynb

└── Connection With Kaggle

└── Load Dataset

└── Preprocessng
| Feature Extraction
| Clean Data
| General
| Geospatial Data

└── Exploratory Data Analysis
| General Analysis
| Feature Interaction
| Plots

└── Machine Learning
| Split Data Into Train, Test set
| Functions
| Algorithms
| Linear Regression
| Random Forest
| Epsilon-Support Vector Regression
| XGBoost
| Voting Regressor
| Bagging Regressor
| Histogram-based Gradient Boosting Regression Tree
| Neural Network
| Results
```
(The files JFK2.txt, LaGuardia2.txt, Manhattan.txt, Newark2.txt contain the coordinates for the NYC airports and Manhattan)
## Libraries Used
pandas, numpy, matplotlib, seaborn, shapely, sklearn, keras
### Licence
Copyright © 2020 Nick Nikitas, Filippos Markolefas, Myron Sampsakis