https://github.com/namansnghl/medical-expense-prediction-linear-reg
Medical Insurance data EDA and premium prediction
https://github.com/namansnghl/medical-expense-prediction-linear-reg
analysis data-visualization regression-models
Last synced: 4 months ago
JSON representation
Medical Insurance data EDA and premium prediction
- Host: GitHub
- URL: https://github.com/namansnghl/medical-expense-prediction-linear-reg
- Owner: namansnghl
- License: mit
- Created: 2024-01-28T00:47:37.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2024-01-28T05:02:04.000Z (over 1 year ago)
- Last Synced: 2025-01-08T08:47:49.287Z (5 months ago)
- Topics: analysis, data-visualization, regression-models
- Language: Jupyter Notebook
- Homepage:
- Size: 2.81 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Predicting Medical Expense - Linear Regression with Scikit Learn

## Problem Statement
ACME Insurance Inc. offers affordable health insurance to thousands of customer all over the United States. As the lead data scientist at ACME, **you're tasked with creating an automated system to estimate the annual medical expenditure for new customers**, using information such as their age, sex, BMI, children, smoking habits and region of residence.
Estimates from your system will be used to determine the annual insurance premium (amount paid every month) offered to the customer. Due to regulatory requirements, you must be able to explain why your system outputs a certain prediction.
You're given a [CSV file](https://raw.githubusercontent.com/JovianML/opendatasets/master/data/medical-charges.csv) containing verified historical data, consisting of the aforementioned information and the actual medical charges incurred by over 1300 customers.Dataset source: https://github.com/stedy/Machine-Learning-with-R-datasets