Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/shahira-sadat/portfolio-project-u.s.-medical-insurance
https://github.com/shahira-sadat/portfolio-project-u.s.-medical-insurance
Last synced: 6 days ago
JSON representation
- Host: GitHub
- URL: https://github.com/shahira-sadat/portfolio-project-u.s.-medical-insurance
- Owner: shahira-sadat
- License: mit
- Created: 2024-01-30T06:34:09.000Z (10 months ago)
- Default Branch: main
- Last Pushed: 2024-01-30T07:03:06.000Z (10 months ago)
- Last Synced: 2024-01-30T08:25:04.146Z (10 months ago)
- Language: Jupyter Notebook
- Size: 30.3 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Portfolio-Project-U.S.-Medical-Insurance
## Project Overview
The goal with this project will be to analyze various attributes within insurance.csv to learn more about the patient information in the file and gain insight into potential use cases for the dataset.
## Project Objectives
1. Import a dataset
2. Analyze a dataset by building out functions or class methods
3. Use libraries to assist in analysis## Prerequisites
Ensure that you have a solid understanding of the following topic:
- Python Fundamentals
## Files
The repository includes the following files:
- us-medical-insurance-costs.ipynb: Jupyter Notebook with analysis description, code and models
- insurance.csv: Containing a CSV file with data used by us-medical-insurance-costs.ipynb## Getting Started
1. **Clone the repository:**
```bash
git clone [email protected]:shahira-sadat/Portfolio-Project-U.S.-Medical-Insurance.git```
2. **Navigate to the project directory:**
```bash
cd Portfolio-Project-U.S.-Medical-Insurance```
3. **Open the Jupyter Notebook:**
```bash
jupyter notebook```
4. **Start exploring the OKCupid_Data_Analysis.ipynb notebook:**
```bash
us-medical-insurance-costs.ipynb
```## Overview
The script does the following:
1. Loading Data:
- Imports the CSV file and initializes lists for different attributes (ages, sexes, bmis, num_children, smoker_statuses, regions, insurance_charges).
2. Analyzing Data:
- Calculates the average patient age.
- Counts the number of female and male patients.
- Identifies unique regions.
- Computes the average yearly medical insurance charges.
- Determines the region with the most patients.
- Finds the maximum and minimum BMI values.
- Calculates the maximum and minimum insurance charges.
- Identifies the maximum and minimum number of children.3. Creating a Dictionary:
- Organizes the data into a dictionary for easy access.
4. Displaying Results:
- Prints the results of the analyses.## Example Output:
```bash
Average Patient Age: 39.21 years
Count for female: 662
Count for male: 676
['southeast', 'northwest', 'southwest', 'northeast']
Average Yearly Medical Insurance Charges: $13270.42
Region with the most patients: southeast (364 patients)
Maximum BMI: 53.13
Minimum BMI: 15.96
Maximum Insurance Charges: $63770.43
Minimum Insurance Charges: $1121.87
Maximum Number of Children: 5
Minimum Number of Children: 0
```Feel free to modify and extend the script according to your needs.
## Author
👤 Shahira Sadat
- GitHub: [Shahira Sadat](https://github.com/shahira-sadat)
- Twitter: [Shahira Sadat](https://twitter.com/SadatShahira)
- Linkedin: [Shahira Sadat](https://www.linkedin.com/in/shahira-sadat-49b402199)
- Gmail: [email protected]Contributions, issues, and feature requests are welcome!
## Show your support
Give a ⭐️ if you like this project!