https://github.com/mohd-mb2205153/data-science-project
A data analysis project using Python to study climate patterns and genocide-related datasets from Gaza and the West Bank.
https://github.com/mohd-mb2205153/data-science-project
climate-change data-science matplotlib numpy palestine python3
Last synced: 4 months ago
JSON representation
A data analysis project using Python to study climate patterns and genocide-related datasets from Gaza and the West Bank.
- Host: GitHub
- URL: https://github.com/mohd-mb2205153/data-science-project
- Owner: mohd-mb2205153
- License: gpl-3.0
- Created: 2025-04-16T21:16:40.000Z (about 1 year ago)
- Default Branch: main
- Last Pushed: 2025-05-24T07:30:57.000Z (about 1 year ago)
- Last Synced: 2025-06-12T03:19:14.102Z (about 1 year ago)
- Topics: climate-change, data-science, matplotlib, numpy, palestine, python3
- Language: Jupyter Notebook
- Homepage:
- Size: 31.5 MB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# ๐ Climate Change & Genocide in Gaza
Welcome to our Data Science project repository. This project is divided into two major parts that explore the applications of data science in two critical real-world contexts: **climate change** and the **ongoing genocide in Gaza**.
---
## ๐ Project Overview
### ๐ Part A: Climate Change โ Temperatures and Precipitation
In this part, we explore real climate data to identify long-term trends in temperature and precipitation. Using techniques from data science and statistical inference, we:
- Clean and prepare raw climate datasets.
- Visualize temporal and seasonal changes.
- Apply hypothesis testing to analyze climate phenomena.
- Draw inferences from natural processes using simulation and random sampling.
### ๐ต๐ธ Part B: Genocide in Gaza โ Telling the Story through Data
This part applies data science to the urgent humanitarian crisis in Gaza since October 7th, 2023. Using official datasets on casualties, infrastructure damage, and more, we:
- Preprocess and clean multiple datasets (CSV, JSON, PDF).
- Visualize key patterns and events from the data.
- Pose and test hypotheses using statistical techniques.
- Present powerful narratives backed by data, raising awareness and documenting the impact of the genocide.
---
## ๐ Datasets Used
### ๐ Climate Change
- Global temperature and precipitation datasets from Berkeley Earth and related sources.
### ๐ต๐ธ Genocide in Gaza
- `daily-casualties-gaza.csv`
- `daily-casualties-westbank.csv`
- `killed-in-gaza.csv` (converted from PDF)
- `infrastructure-damaged.csv` (converted from JSON)
- Additional PDF: `killed-in-gaza_moh_2025-03-23.pdf` (used for extended analysis)
Sources: [TechForPalestine Data Repository](https://data.techforpalestine.org/)
---
## ๐ Tools & Libraries
- Python 3
- `datascience` package ( by UC Berkley )
- `numpy` (functions covered in the course)
- `otter-grader` (for testing)
- Jupyter Notebook
---
## ๐ Visualizations and Analyses
- Line and bar plots of daily and monthly casualties.
- Histograms of infrastructure damage.
- Overlay charts showing correlations.
- Aggregated statistics by gender, age group, region, and time.
- Hypothesis tests comparing means between Gaza and West Bank, trends before/after major events, and more.
---
## โ
Project Requirements
- ๐ Exploratory Data Analysis (EDA) with 6 insightful plots.
- ๐งช Three full hypothesis testing procedures.
- ๐ Well-structured project report (PDF).
- ๐ Complete Jupyter Notebooks with documented code and analysis.
---
## ๐ Acknowledgements
- Gaza data provided by Tech for Palestine.
- Climate data courtesy of Berkeley Earth.
- Developed as part of CMPS 360: Data Science Fundamentals @ Qatar University, by Dr. Tamer Elsayed.
---
## ๐ฌ Contact
For any clarifications or questions, you can email us at:
- muhtasim2k2@gmail.com
- fahrelazki@gmail.com
## ๐ License
This project is licensed under the **GNU General Public License v3.0 (GPL-3.0)**.
You are free to:
- **Use** the code for any purpose.
- **Study** how the code works and adapt it to your needs.
- **Share** the code and any modifications you make.
- **Modify** the code and distribute your versions under the same license.
However, under this license, you **must**:
- **Disclose source**: If you distribute the code (modified or unmodified), you must include the source code or a way to obtain it.
- **License your modifications** under the same GPL-3.0 terms.
- **State changes**: If you make changes to the code, you must clearly state what you changed.
> โ ๏ธ There is **no warranty**, and the license does **not grant trademark rights**.
For more details, refer to the [full license text](https://www.gnu.org/licenses/gpl-3.0.en.html).