An open API service indexing awesome lists of open source software.

https://github.com/nouraalgohary/fifa-world-cup-data-analysis


https://github.com/nouraalgohary/fifa-world-cup-data-analysis

data dataanalysis powerbi powerbi-visuals

Last synced: 3 months ago
JSON representation

Awesome Lists containing this project

README

          

# FIFA-World-Cup-Data-Analysis

## Power BI Dashboard
You can view my interactive Power BI Report [here](https://app.powerbi.com/groups/me/reports/41614564-3ab7-4418-998e-ae25f0e83747/ReportSection?experience=power-bi).

![Capture](https://github.com/NouraAlgohary/FIFA-World-Cup-Data-Analysis/assets/103903785/b998f44b-43f1-4a99-bb63-ebfb0552d672)

## Project Overview:
This project involves extracting, profiling, and analyzing FIFA World Cup data from the Wikipedia website. The primary focus is on creating a comprehensive report that includes key information such as the year, host country, champion, runner-up, final score, and the number of participating teams. The project aims to provide insightful visualizations for better understanding and interpretation of the data.

## Project Steps:
### 1. Data Extraction:

Extract data from the relevant tables on the Wikipedia page: [FIFA World Cup Wikipedia](https://en.wikipedia.org/wiki/FIFA_World_Cup) using Power BI import mode.

![image](https://github.com/NouraAlgohary/FIFA-World-Cup-Data-Analysis/assets/103903785/22a6e5f8-eb84-40c4-b6b1-614e1a57cfb4)

### 2. Data Profiling:
#### Data Types and Formats:
- Ed. (Edition):
Data Type: Numeric
Description: Represents the edition or number of the FIFA World Cup. It indicates the sequential order of each tournament.

- Year:
Data Type: Numeric
Description: Indicates the calendar year in which the FIFA World Cup tournament took place.

- Host:
Data Type: Categorical
Description: Specifies the country that hosted the FIFA World Cup for a particular edition.

- Final, Third-place play-off, Champion, Runner-up, Third, Fourth:
Data Type: Categorical
Description: These columns provide information about the teams involved in different stages of the tournament:
"Final" indicates the teams that reached the final match.
"Third-place play-off" indicates the teams that competed for the third-place position.
"Champion" indicates the team that won the championship.
"Runner-up" indicates the team that finished as the runner-up.
"Third" indicates the team that finished in third place.
"Fourth" indicates the team that finished in fourth place.

- No. of teams:
Data Type: Numeric
Description: Represents the total number of teams that participated in a specific edition of the FIFA World Cup.

#### Statistical Summary:
Number of editions of the FIFA World Cup

![image](https://github.com/NouraAlgohary/FIFA-World-Cup-Data-Analysis/assets/103903785/d3fb47a6-65c8-40c2-b182-48b2d224c981)

Distribution of the World Cup years

![image](https://github.com/NouraAlgohary/FIFA-World-Cup-Data-Analysis/assets/103903785/93fb98db-d669-4590-a34b-2a43ae82f16f)

Frequency of hosting by countries

![image](https://github.com/NouraAlgohary/FIFA-World-Cup-Data-Analysis/assets/103903785/1bb7eb8d-5260-4075-810b-fd9074c00623)

Frequency of teams winning the championship

![image](https://github.com/NouraAlgohary/FIFA-World-Cup-Data-Analysis/assets/103903785/20fe045a-b17b-44dc-a299-1c305ed5aaec)

No. of teams
![image](https://github.com/NouraAlgohary/FIFA-World-Cup-Data-Analysis/assets/103903785/86f5c2ed-b2d9-4031-84ce-7cf2f9f27e9e)

### 3. Data Cleaning and ETL:
- Profile and clean the extracted data.
- Perform Extract, Transform, and Load (ETL) operations to structure the data for analysis.
- Load the following fields:
Year
Host Country
Champion
Runner-up
Final Score
Number of Teams

![image](https://github.com/NouraAlgohary/FIFA-World-Cup-Data-Analysis/assets/103903785/31d23f60-34fb-4b4b-a778-20b4c507acba)

### 4. Report Visualization:
I chose appropriate visuals to convey the following insights:
a. Host Country Frequency:
 Display the number of times the championship was hosted by the host country on a map.
b. Top 5 World Cup Winners:
 Highlight the top 5 countries that have won the World Cup the most.
c. Teams Participation:
 Showcase the number of teams that participated in each championship.
d. Detailed Table:
 Create a table displaying all the relevant fields in a clear and organized manner.
e. Yearly Filtering:
 Implement a filter mechanism allowing users to filter the dashboard by year.

_This documentation serves as a guide for project execution, ensuring a systematic approach to extracting, processing, and visualizing FIFA World Cup data for meaningful insights._