Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/gaurav-van/house_price_predictor_streamlit_web_app

Data Science Project to Predict House Prices in Bangalore using the concept of Regression. This Repository is used for Deployment of the Project
https://github.com/gaurav-van/house_price_predictor_streamlit_web_app

data-analysis data-science exploratory-data-analysis machine-learning prediction python regression streamlit

Last synced: 9 days ago
JSON representation

Data Science Project to Predict House Prices in Bangalore using the concept of Regression. This Repository is used for Deployment of the Project

Awesome Lists containing this project

README

        

# House_Price_Predictor_Streamlit_web_app



Note: This Repository is required for deployment of this project on Streamlit Cloud.




Web App Link: https://gaurav-van-house-price-predictor-streamlit-heroku-app-g56zmy.streamlitapp.com/


Project Repo: https://github.com/Gaurav-Van/Data_Science__Machine_Learning-Projects


This Model / Project / Web app predicts the price of a Real Estate property / House on the basis of Features like : (area_type, location,
total_sqft, balcony, bathroom, BHK)

Model Used - Linear Regression (Multiple Linear Regression)

The Project / Web App is built in Python using the Following Libraries:

* numpy
* pandas
* matplotlib
* seaborn
* sklearn
* pickle
* flask
* streamlit
* json


## Concept Used
1. Data Collection - From Kaggle: https://www.kaggle.com/datasets/amitabhajoy/bengaluru-house-price-data


2. Data Pre-Procesing

* Removing Not-so-important columns
* Checking and removing or replacement of null values
* Outlier detection using Box plot, Outlier treatment using Flooring and Capping
* Adding new data on the basis of Domain Knowledge


3. EDA - Performing Data analysis on the basis of Domain Knowledge [ do check the jupyter file ]

4. Model Building

* Encoding
* As i am dealing with Regression problem, that too linear models so no need of Feature Scalling
* Dividing the data by Train test split
* Testing Model's Score on divided data [ train_test_split and cross_val_score]
* Model Used - Linear Regression (Multiple Linear Regression)

5. Deployment - Building web app with the help of streamlit and deploying it on heroku cloud