https://github.com/zainulabdeenofficial/data-visualization
This project showcases the use of Pandas for data visualization and analysis. It involves working with a movie dataset from TMDB (The Movie Database), and the goal is to analyze and present insights from the dataset through various visualizations.
data-visualization
- Host: GitHub
- URL: https://github.com/zainulabdeenofficial/data-visualization
- Owner: ZainulabdeenOfficial
- Created: 2024-09-19T18:17:39.000Z (almost 2 years ago)
- Default Branch: main
- Last Pushed: 2024-09-26T06:17:41.000Z (over 1 year ago)
- Last Synced: 2025-04-01T19:33:38.652Z (about 1 year ago)
- Topics: data-visualization
- Language: Jupyter Notebook
- Homepage:
- Size: 15.8 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
# Data Visualization Using Pandas
This project showcases the use of **Pandas** for data visualization and analysis. It involves working with a movie dataset from TMDB (The Movie Database), and the goal is to analyze and present insights from the dataset through various visualizations.
## Instructor
**Muhammad Jamil**
GitHub: [Jamil226](https://github.com/Jamil226)
## Project Details
This repository contains a Jupyter notebook developed in **Google Colab** to demonstrate data visualization techniques. The project uses a CSV dataset and focuses on utilizing **Pandas** for data processing and analysis.
### Colab Notebook
The notebook was built in [Google Colab](https://colab.research.google.com), so it can be run and shared without any local setup.
### Dataset
The dataset used in this project is **TMDB 5000 Credits**, which provides credit information (cast and crew) for the movies it covers.
**Important:**
The CSV file is larger than 25 MB, which exceeds GitHub's limit for files uploaded through the browser, so the dataset is stored in the repository as a ZIP file. Extract the ZIP file before running the notebook.
- **Dataset File:** `tmdb_5000_credits.zip`
- **Extracted File:** `tmdb_5000_credits.csv`
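The extraction step can also be done from inside the notebook. The following is a minimal sketch, not the notebook's actual code: the helper names `extract_dataset` and `load_credits` are hypothetical, and it assumes the ZIP file sits in the current working directory.

```python
import zipfile
from pathlib import Path


def extract_dataset(zip_path, dest_dir="."):
    """Extract every CSV member of the archive into dest_dir and return their paths."""
    dest = Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)
    extracted = []
    with zipfile.ZipFile(zip_path) as archive:
        for name in archive.namelist():
            # Skip anything that is not a CSV (for example, notes or folders).
            if name.lower().endswith(".csv"):
                extracted.append(Path(archive.extract(name, path=dest)))
    return extracted


def load_credits(csv_path):
    """Read an extracted CSV into a DataFrame (pandas is imported only when called)."""
    import pandas as pd

    return pd.read_csv(csv_path)
```

In a notebook cell this might be used as `paths = extract_dataset("tmdb_5000_credits.zip")` followed by `df = load_credits(paths[0])`. If the archive holds a single CSV, `pd.read_csv("tmdb_5000_credits.zip")` should also work, because pandas infers ZIP compression from the file extension.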
### Tools & Libraries
- **Pandas:** For data analysis and manipulation.
- **Matplotlib:** For basic plotting and visualizations.
- **Seaborn:** For enhanced visualizations and statistical plotting.
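As a rough illustration of how the three libraries fit together, the sketch below counts cast members per movie and draws a bar chart. It is an assumption-laden example rather than the notebook's own analysis: it assumes the CSV has a `title` column and a `cast` column holding a JSON list, the sample rows are made-up placeholders, and the function names are hypothetical.

```python
import json

import matplotlib

matplotlib.use("Agg")  # non-interactive backend; drop this line inside a notebook
import matplotlib.pyplot as plt
import pandas as pd
import seaborn as sns

# Placeholder rows standing in for the real data (assumed column names).
sample = pd.DataFrame(
    {
        "title": ["Movie A", "Movie B", "Movie C"],
        "cast": [
            '[{"name": "Actor 1"}, {"name": "Actor 2"}]',
            '[{"name": "Actor 3"}]',
            "[]",
        ],
    }
)


def add_cast_size(df, cast_col="cast"):
    """Return a copy with a cast_size column counting entries in the JSON list."""
    out = df.copy()
    out["cast_size"] = out[cast_col].apply(lambda raw: len(json.loads(raw)))
    return out


def plot_cast_size(df):
    """Draw a horizontal bar chart of cast size per title and return the Axes."""
    fig, ax = plt.subplots(figsize=(6, 3))
    sns.barplot(data=df, x="cast_size", y="title", ax=ax)
    ax.set_xlabel("Number of credited cast members")
    ax.set_ylabel("")
    fig.tight_layout()
    return ax


sized = add_cast_size(sample)
ax = plot_cast_size(sized)
```

With the real dataset, `sample` would be replaced by the DataFrame read from `tmdb_5000_credits.csv`, most likely restricted to a handful of rows so the chart stays readable.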
### Instructions to Use
1. Download or clone this repository.
2. Extract the `tmdb_5000_credits.zip` file.
3. Open the notebook in Google Colab or Jupyter Notebook.
4. Follow the instructions in the notebook and execute each cell to visualize and analyze the dataset.
### GitHub Links
- **Instructor's GitHub:** [https://github.com/Jamil226](https://github.com/Jamil226)
- **Author's GitHub:** [https://github.com/ZainulabdeenOfficial](https://github.com/ZainulabdeenOfficial)
## Author
This project was created by **M Zain UL Abideen** as part of an educational data visualization project guided by **Muhammad Jamil**.
## License
This project is licensed under the [MIT License](LICENSE).