An open API service indexing awesome lists of open source software.

https://github.com/mehrab-kalantari/book-price-prediction

Book price dataset analysis and modeling
https://github.com/mehrab-kalantari/book-price-prediction

bivariate-analysis eda feature-engineering feature-extraction feature-importance nlp one-hot-encoding ordinal-encoding outlier-detection random-forest-regressor scaling univariate-analysis

Last synced: 3 months ago
JSON representation

Book price dataset analysis and modeling

Awesome Lists containing this project

README

        

# Book Prices Prediction
## Contents
### Data Understanding
Understanding data and features

![info](sample/info.png)

### Data Cleaning For EDA
* Renaming columns
* Categorical to numerical
* Feature expansion
* Feature extraction
* Null values handling

### Exploratory Data Analysis
* Univariate Analysis
* Target analysis
* Numerical features
* Categorical features

* Bivariate Analysis
* Year analysis
* Population analysis
* Price analysis

### Data Preprocessing
* Target Log
* Encoding
* Ordinal
* One Hot
* Count Vectorizer

* Discretization
* Normalization
* Scaling
* Standardization
* Outlier Detection

### Modeling
Random forest regressor is used.

### Evaluation
MSE for train and test datasets
* Train MSE
* ![tr](sample/train.png)

* Test MSE
* ![te](sample/test.png)

### Feature Importance
![f](sample/feat.png)