https://github.com/sonu275981/insurance-prediction
Exploratory Data Analysis & predicting medical insurance cost with machine learning.
https://github.com/sonu275981/insurance-prediction
insurance insurance-premium-predictor linear-regression machine-learning mse python r2score random-forest
Last synced: 7 months ago
JSON representation
Exploratory Data Analysis & predicting medical insurance cost with machine learning.
- Host: GitHub
- URL: https://github.com/sonu275981/insurance-prediction
- Owner: sonu275981
- Created: 2021-11-29T14:28:35.000Z (almost 4 years ago)
- Default Branch: main
- Last Pushed: 2021-12-05T12:00:40.000Z (almost 4 years ago)
- Last Synced: 2025-02-15T20:29:27.395Z (9 months ago)
- Topics: insurance, insurance-premium-predictor, linear-regression, machine-learning, mse, python, r2score, random-forest
- Language: Jupyter Notebook
- Homepage:
- Size: 617 KB
- Stars: 1
- Watchers: 3
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Insurance prediction
As shown above, this Project leverages machine learning models to predict your insurance charges, and helps the customer understand how smoking or decreasing your BMI affects insurance premiums.
As we see the value of gross insurance premiums worldwide continue to skyrocket past 5 trillion dollars, we know that most of these costs are preventable. For example, just by eliminating smoking, and lowering your BMI by a few points could mean shaving thousands of dollars off of your premium charges. In this application, we study the effects of age, smoking, BMI, gender, and region to determine how much of a difference these factors can make on your insurance premium. By using our application, customers see the radical difference their lifestyle choices make on their insurance charges. By leveraging AI and machine learning, we help customers understand just how much smoking increases their premium, by predicting how much they will have to pay within seconds.
## Project Overview
- Seek insight from the dataset with Exploratory Data Analysis
- Performed data processing, data engineering to prepare data before modeling
- Built a model to predict Insurance Cost based on the features
## Data Processing
- Check missing value - there are none
- Feature transformation Encoding ```sex```, ```region``` Ordinal encoding ```smoker```
- Modeling - Separating target & features - Splitting train & test data
- Modeling using Linear Regression, Random Forest, Decision Tree algorithm
## Conclusion
Based on the predictive modeling, Decision Tree algorithm has the best score compared to the others, Decision Tree algorithm is fit based on the train & test accuracy.