https://github.com/akash1070/project---applied-statistics-
To dive deep into this data & find some valuable insights.
- Host: GitHub
- URL: https://github.com/akash1070/project---applied-statistics-
- Owner: Akash1070
- Created: 2022-08-29T07:04:38.000Z (over 2 years ago)
- Default Branch: main
- Last Pushed: 2022-08-29T11:49:00.000Z (over 2 years ago)
- Last Synced: 2024-04-05T11:45:29.392Z (9 months ago)
- Topics: data-analysis, data-science, python, statistics
- Language: Jupyter Notebook
- Homepage:
- Size: 513 KB
- Stars: 1
- Watchers: 2
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
# **Applied Statistics**
Leveraging customer information is paramount for most businesses.
In the case of an insurance company, customer attributes like the ones listed below can be crucial in making business decisions. Hence, knowing how to explore such data and generate value from it is an invaluable skill.
**Objective**:
We want to see if we can dive deep into this data to find some valuable insights.
a. Shape of the data
b. Data type of each attribute
c. Checking the presence of missing values
d. Five-point summary of numerical attributes
e. Distribution of ‘bmi’, ‘age’ and ‘charges’ columns.
f. Measure of skewness of ‘bmi’, ‘age’ and ‘charges’ columns
g. Checking the presence of outliers in ‘bmi’, ‘age’ and ‘charges’ columns
h. Distribution of categorical columns (include children)
i. Pair plot that includes all the columns of the data frame
j. Do charges of people who smoke differ significantly from the people who don't?
k. Does bmi of males differ significantly from that of females?
l. Is the proportion of smokers significantly different in different genders?
m. Is the distribution of bmi across women with no children, one child and two children, the same?
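
The descriptive checks in items a to g can be sketched with pandas alone. This is a minimal illustration, not the notebook's actual code: the data frame below is synthetic (made-up values), and the column names `sex` and `smoker`, the helper names `make_demo_frame` and `iqr_outliers`, and the 1.5 × IQR outlier rule are all assumptions made for the example.

```python
import numpy as np
import pandas as pd


def make_demo_frame(n=200, seed=0):
    """Build a synthetic stand-in for the insurance data (values are made up)."""
    rng = np.random.default_rng(seed)
    smoker = rng.choice(["yes", "no"], size=n, p=[0.2, 0.8])
    # Right-skewed charges, shifted upward for smokers (purely illustrative).
    charges = rng.lognormal(mean=9, sigma=0.5, size=n) + np.where(smoker == "yes", 20000, 0)
    return pd.DataFrame({
        "age": rng.integers(18, 65, size=n),
        "sex": rng.choice(["male", "female"], size=n),
        "bmi": rng.normal(30, 6, size=n),
        "children": rng.integers(0, 4, size=n),
        "smoker": smoker,
        "charges": charges,
    })


def iqr_outliers(series, k=1.5):
    """Return the values lying outside [Q1 - k*IQR, Q3 + k*IQR]."""
    q1, q3 = series.quantile([0.25, 0.75])
    iqr = q3 - q1
    return series[(series < q1 - k * iqr) | (series > q3 + k * iqr)]


df = make_demo_frame()
numeric_cols = ["bmi", "age", "charges"]

print(df.shape)         # (a) number of rows and columns
print(df.dtypes)        # (b) data type of each attribute
print(df.isna().sum())  # (c) missing values per column

# (d) five-point summary: min, Q1, median, Q3, max
print(df[numeric_cols].describe().loc[["min", "25%", "50%", "75%", "max"]])

# (f) skewness of each numeric column
print(df[numeric_cols].skew())

# (g) number of IQR-rule outliers per numeric column
outlier_counts = {col: len(iqr_outliers(df[col])) for col in numeric_cols}
print(outlier_counts)
```

The plotting items (e, h, i) would typically use seaborn functions such as `sns.histplot`, `sns.countplot` and `sns.pairplot` on the same data frame.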
## Authors
- [@Akash Kumar Jha](https://github.com/Akash1070)
## Installation
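This project uses the libraries imported below. If any are missing, they can be installed with `pip install numpy pandas matplotlib seaborn statsmodels scipy`. Then import them at the top of the notebook:
```python
import numpy as np                    # numerical arrays
import pandas as pd                   # data frames
from matplotlib import pyplot as plt  # plotting
import seaborn as sns                 # statistical plots
import statsmodels.api as sm          # statistical models
import scipy.stats as stats           # hypothesis tests
```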
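
The significance questions in the objective list (items j to m) map onto standard tests in `scipy.stats`. The sketch below is one plausible approach, not necessarily the one used in the notebook: it assumes Welch's t-test for the two-group comparisons, a chi-square test of independence for the proportion question, and one-way ANOVA for the three-group comparison. The data are synthetic, and the column names `sex` and `smoker` and the 0.05 significance level are assumptions.

```python
import numpy as np
import pandas as pd
from scipy import stats

# Synthetic stand-in for the insurance data (values are made up).
rng = np.random.default_rng(1)
n = 300
df = pd.DataFrame({
    "sex": rng.choice(["male", "female"], size=n),
    "smoker": rng.choice(["yes", "no"], size=n, p=[0.25, 0.75]),
    "children": rng.integers(0, 3, size=n),
    "bmi": rng.normal(30, 6, size=n),
})
# Give smokers visibly higher charges so the first test has an effect to find.
df["charges"] = rng.normal(9000, 2000, size=n) + np.where(df["smoker"] == "yes", 20000, 0)

alpha = 0.05  # assumed significance level

# (j) Do charges differ between smokers and non-smokers? Welch's t-test.
t_j, p_j = stats.ttest_ind(
    df.loc[df["smoker"] == "yes", "charges"],
    df.loc[df["smoker"] == "no", "charges"],
    equal_var=False,
)

# (k) Does bmi differ between males and females? Welch's t-test.
t_k, p_k = stats.ttest_ind(
    df.loc[df["sex"] == "male", "bmi"],
    df.loc[df["sex"] == "female", "bmi"],
    equal_var=False,
)

# (l) Is the proportion of smokers different across genders?
# Chi-square test of independence on the sex x smoker contingency table.
table = pd.crosstab(df["sex"], df["smoker"])
chi2, p_l, dof, expected = stats.chi2_contingency(table)

# (m) Is bmi distributed the same for women with 0, 1 and 2 children?
# One-way ANOVA compares the group means.
women = df[df["sex"] == "female"]
groups = [women.loc[women["children"] == k, "bmi"] for k in (0, 1, 2)]
f_m, p_m = stats.f_oneway(*groups)

for label, p in [("j", p_j), ("k", p_k), ("l", p_l), ("m", p_m)]:
    verdict = "reject H0" if p < alpha else "fail to reject H0"
    print(f"({label}) p-value = {p:.4f}: {verdict}")
```

ANOVA compares means rather than whole distributions and assumes roughly normal groups with similar variances; if those assumptions look doubtful, a rank-based test such as `stats.kruskal` is a common alternative.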
## Running Flask API
To run tests, run the following command:
```bash
python app.py
```
## Documentation
[Documentation](https://www.kaggle.com/mirichoi0218/insurance/downloads/insurance.zip/1)
## 🚀 About Me
Data Science Enthusiast | Petroleum Engineering Graduate | Solving Problems Using Data
# Hi, I'm Akash! 👋
## 🔗 Links
[![github](https://img.shields.io/badge/github-000?style=for-the-badge&logo=ko-fi&logoColor=white)](https://github.com/Akash1070)
[![linkedin](https://img.shields.io/badge/linkedin-0A66C2?style=for-the-badge&logo=linkedin&logoColor=white)](https://www.linkedin.com/in/akashkumar107/)
## Other Common GitHub Profile Sections
👩‍💻 I’m interested in Petroleum Engineering
🧠 I’m currently learning Data Science
👯‍♀️ I’m looking to collaborate on Ideas & Data
📫 How to reach me - https://www.linkedin.com/in/akashkumar107/
## 🛠 Skills
1. Data Scientist
2. Data Analyst
3. Business Analyst
4. Machine Learning