Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/manisharora96/medical-cost-analysis-for-smokers-and-non-smokers

In this Project i make a cost analysis on the medical charges spent on Smokers and for differentiating it to show the cost spent on smoker and non-smoker accordingly
https://github.com/manisharora96/medical-cost-analysis-for-smokers-and-non-smokers

Last synced: 9 days ago
JSON representation

In this Project i make a cost analysis on the medical charges spent on Smokers and for differentiating it to show the cost spent on smoker and non-smoker accordingly

Awesome Lists containing this project

README

        

# Medical-Cost-Analysis-for-Smokers-and-Non-smokers

This Data analysis and data visualization project consists of the fact that the average medicine charges can be changed for the smokers and non-smokers.

### Dataset :

Download the dataset from, [insurance.csv](https://www.kaggle.com/mirichoi0218/insurance?select=insurance.csv)

This dataset is used to deploy the medical cost analysis of smokers and non-smokers and also used for predicting the average cost of the smokers as compared to the non-smokers.

### Algorithms Used for predicting the average medical charges for the smokers :
1. Linear Regression
2. Polynomial Regression of 2nd degree
3. Lasso Regression
4. Ridge Regression
5. SVM or, Support Vector Machine
6. Random Forest Regression
7. Decision tree regression

Finally, based on these several algorithms and machine learning methods I have done a comparative analysis of all those algorithms and evaluated them based on the Cross validation Score.

### Exploratory Data Analysis :
1. Distribution of charges for the smokers and non-smokers
2. Charges for Men and Women
3. BMI Distribution
4. Age Distribution of the smokers and non-smokers
5. 18+years smokers and nonsmokers
6. Joint plot and Scatter Plot showing the distribution
7. Distribution of charges of the patients
8. Health effects on children due to the parents, who are smoking
9. Region based smoker analysis

These features are being evaluated and determined based on the datasets, which provides the exploratory data analysis of this project.

### Result Analysis :
1. Error management of various algorithms
2. Checking the scores of the algorithms, such as, R2 score, cross validation score, RMSE, mean squared error.
3. Evaluation of the best model based on the CV score.
4. Evaluation of the best model based on the R2 score.
5. Evaluation of the best model based on the RMSE

### Conclusion :
1.The impact of smoking on medical care use was examined in a 30-month prospective population-based cohort study in Japan (N = 43 408).

2.Male smokers incurred 11% more medical costs than ‘never smokers’ but for female smokers and never smokers the costs were almost thesame.

3.This difference was mainly attributable to the increased use of inpatient medical care among smokers, especially in males, where per monthcost of inpatient care was 33% higher in smokers.

***********************************************************************