https://github.com/nouraalgohary/fifa-world-cup-data-analysis
https://github.com/nouraalgohary/fifa-world-cup-data-analysis
data dataanalysis powerbi powerbi-visuals
Last synced: 3 months ago
JSON representation
- Host: GitHub
- URL: https://github.com/nouraalgohary/fifa-world-cup-data-analysis
- Owner: NouraAlgohary
- Created: 2024-01-23T15:49:00.000Z (over 2 years ago)
- Default Branch: main
- Last Pushed: 2024-02-03T05:04:01.000Z (over 2 years ago)
- Last Synced: 2025-04-08T18:37:36.271Z (about 1 year ago)
- Topics: data, dataanalysis, powerbi, powerbi-visuals
- Homepage:
- Size: 153 KB
- Stars: 0
- Watchers: 2
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# FIFA-World-Cup-Data-Analysis
## Power BI Dashboard
You can view my interactive Power BI Report [here](https://app.powerbi.com/groups/me/reports/41614564-3ab7-4418-998e-ae25f0e83747/ReportSection?experience=power-bi).

## Project Overview:
This project involves extracting, profiling, and analyzing FIFA World Cup data from the Wikipedia website. The primary focus is on creating a comprehensive report that includes key information such as the year, host country, champion, runner-up, final score, and the number of participating teams. The project aims to provide insightful visualizations for better understanding and interpretation of the data.
## Project Steps:
### 1. Data Extraction:
Extract data from the relevant tables on the Wikipedia page: [FIFA World Cup Wikipedia](https://en.wikipedia.org/wiki/FIFA_World_Cup) using Power BI import mode.

### 2. Data Profiling:
#### Data Types and Formats:
- Ed. (Edition):
Data Type: Numeric
Description: Represents the edition or number of the FIFA World Cup. It indicates the sequential order of each tournament.
- Year:
Data Type: Numeric
Description: Indicates the calendar year in which the FIFA World Cup tournament took place.
- Host:
Data Type: Categorical
Description: Specifies the country that hosted the FIFA World Cup for a particular edition.
- Final, Third-place play-off, Champion, Runner-up, Third, Fourth:
Data Type: Categorical
Description: These columns provide information about the teams involved in different stages of the tournament:
"Final" indicates the teams that reached the final match.
"Third-place play-off" indicates the teams that competed for the third-place position.
"Champion" indicates the team that won the championship.
"Runner-up" indicates the team that finished as the runner-up.
"Third" indicates the team that finished in third place.
"Fourth" indicates the team that finished in fourth place.
- No. of teams:
Data Type: Numeric
Description: Represents the total number of teams that participated in a specific edition of the FIFA World Cup.
#### Statistical Summary:
Number of editions of the FIFA World Cup

Distribution of the World Cup years

Frequency of hosting by countries

Frequency of teams winning the championship

No. of teams

### 3. Data Cleaning and ETL:
- Profile and clean the extracted data.
- Perform Extract, Transform, and Load (ETL) operations to structure the data for analysis.
- Load the following fields:
Year
Host Country
Champion
Runner-up
Final Score
Number of Teams

### 4. Report Visualization:
I chose appropriate visuals to convey the following insights:
a. Host Country Frequency:
Display the number of times the championship was hosted by the host country on a map.
b. Top 5 World Cup Winners:
Highlight the top 5 countries that have won the World Cup the most.
c. Teams Participation:
Showcase the number of teams that participated in each championship.
d. Detailed Table:
Create a table displaying all the relevant fields in a clear and organized manner.
e. Yearly Filtering:
Implement a filter mechanism allowing users to filter the dashboard by year.
_This documentation serves as a guide for project execution, ensuring a systematic approach to extracting, processing, and visualizing FIFA World Cup data for meaningful insights._