Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/imnotannamaria/ia-ead-pandas

EDA (Exploratory Data Analysis) concepts with pandas
https://github.com/imnotannamaria/ia-ead-pandas

matplotlib pandas python

Last synced: about 10 hours ago
JSON representation

EDA (Exploratory Data Analysis) concepts with pandas

Awesome Lists containing this project

README

        

# EDA (Exploratory Data Analysis) with Pandas

## Database info

Exploratory analysis is of the telecom segment. The company has three sets of customer and service data, with a variable that determines whether the customer has endorsed (churn) or not the telecom company..

The purpose is that with these data sets and based on some hypotheses that we will formulate and that will be answered by the EDA, can obtain some initial insights for the construction of artificial intelligence that can "predict" the abandonment of still active customers.

- churn_customers.csv = Client data

- churn_contracts.csv = Contracts data

- churn_services.csv = Services data

## Univariate Analysis

Univariate analysis is a fundamental statistical technique in the field of Artificial Intelligence (AI), used to examine and understand the behavior of a single variable in a dataset. In other words, it focuses on the analysis of one feature or variable at a time, without considering relationships with other variables.

### Hypotheses

1. Client age group has a strong association with Churn
2. A client who com less than 6 months of active contract is more prone to Churn
3. Customer with monthly contracts are prone to Churn

## Bivariate Analysis

Bivariate Analysis is a statistical approach that focuses on the relationship between two variables in a dataset. Unlike Univariate Analysis, which explores individual features, Bivariate Analysis seeks to understand how two variables are interconnected and whether there is any significant relationship between them.