Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/imnotannamaria/ia-ead-pandas
EDA (Exploratory Data Analysis) concepts with pandas
https://github.com/imnotannamaria/ia-ead-pandas
matplotlib pandas python
Last synced: about 10 hours ago
JSON representation
EDA (Exploratory Data Analysis) concepts with pandas
- Host: GitHub
- URL: https://github.com/imnotannamaria/ia-ead-pandas
- Owner: imnotannamaria
- Created: 2023-11-01T14:01:16.000Z (about 1 year ago)
- Default Branch: main
- Last Pushed: 2023-11-27T19:40:10.000Z (12 months ago)
- Last Synced: 2023-11-28T17:53:46.928Z (11 months ago)
- Topics: matplotlib, pandas, python
- Language: Jupyter Notebook
- Homepage:
- Size: 470 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# EDA (Exploratory Data Analysis) with Pandas
## Database info
Exploratory analysis is of the telecom segment. The company has three sets of customer and service data, with a variable that determines whether the customer has endorsed (churn) or not the telecom company..
The purpose is that with these data sets and based on some hypotheses that we will formulate and that will be answered by the EDA, can obtain some initial insights for the construction of artificial intelligence that can "predict" the abandonment of still active customers.
- churn_customers.csv = Client data
- churn_contracts.csv = Contracts data
- churn_services.csv = Services data
## Univariate Analysis
Univariate analysis is a fundamental statistical technique in the field of Artificial Intelligence (AI), used to examine and understand the behavior of a single variable in a dataset. In other words, it focuses on the analysis of one feature or variable at a time, without considering relationships with other variables.
### Hypotheses
1. Client age group has a strong association with Churn
2. A client who com less than 6 months of active contract is more prone to Churn
3. Customer with monthly contracts are prone to Churn## Bivariate Analysis
Bivariate Analysis is a statistical approach that focuses on the relationship between two variables in a dataset. Unlike Univariate Analysis, which explores individual features, Bivariate Analysis seeks to understand how two variables are interconnected and whether there is any significant relationship between them.