https://github.com/shahira-sadat/biodiversity-in-national-parks-portfolio-project

Code Academy Data Science Path Portfolio Project Biodiversity in National Parks

codeacademy-pro data-science jupyter-notebook


Host: GitHub
URL: https://github.com/shahira-sadat/biodiversity-in-national-parks-portfolio-project
Owner: shahira-sadat
Created: 2024-06-02T03:40:42.000Z (9 months ago)
Default Branch: main
Last Pushed: 2024-06-02T04:03:52.000Z (9 months ago)
Last Synced: 2024-11-13T17:12:07.086Z (3 months ago)
Topics: codeacademy-pro, data-science, jupyter-notebook
Language: Jupyter Notebook
Homepage:
Size: 480 KB
Stars: 0
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md

# Biodiversity in National Parks Portfolio Project
Codecademy Data Science Path portfolio project: Biodiversity in National Parks.

## Project Overview

This project explores biodiversity in national parks using species data and observations. The analysis merges two datasets: species_info.csv, which describes each species (category, scientific name, common names, and conservation status), and observations.csv, which records species observations across different national parks.
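As a rough illustration of the loading and merging described above, here is a minimal pandas sketch. The column names (`category`, `scientific_name`, `common_names`, `conservation_status`, `park_name`, `observations`) and every row are assumptions made for this example, not values taken from the real files; the inline CSV text only stands in for species_info.csv and observations.csv so the snippet runs on its own.

```python
import io

import pandas as pd

# Tiny inline stand-ins for species_info.csv and observations.csv.
# Column names and all rows are illustrative assumptions.
species_csv = io.StringIO(
    "category,scientific_name,common_names,conservation_status\n"
    "Mammal,Species alpha,Alpha mammal,Endangered\n"
    "Bird,Species beta,Beta bird,\n"
    "Vascular Plant,Species gamma,Gamma plant,\n"
)
observations_csv = io.StringIO(
    "scientific_name,park_name,observations\n"
    "Species alpha,Park A,60\n"
    "Species alpha,Park B,40\n"
    "Species beta,Park A,90\n"
    "Species gamma,Park B,120\n"
)

# With the real files this would be pd.read_csv("species_info.csv"), etc.
species = pd.read_csv(species_csv)
observations = pd.read_csv(observations_csv)

# Inspect the shapes before merging.
print(species.shape, observations.shape)

# Left-join each observation to its species record on the assumed shared key.
merged = observations.merge(species, on="scientific_name", how="left")
print(merged[["scientific_name", "park_name", "observations", "category"]])
```

A left join keeps every observation row even if a species record is missing, which makes unmatched names easy to spot afterwards.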

## Project Objectives

1. Data Loading and Inspection
2. Exploratory Data Analysis (EDA)
3. Data Integration and Analysis
4. Visualizations
5. Summary and Insights

## Prerequisites

Ensure that you have a solid understanding of the following topics:

- Python Fundamentals
- Data Acquisition and Preprocessing
- Data Visualization with Matplotlib and Seaborn
- Exploratory Data Analysis (EDA)
- Pandas for Data Manipulation

## Files

The repository includes the following files:

- species_info.csv: Dataset containing information about species.
- observations.csv: Dataset with observations data from national parks.
- biodiversity_analysis.ipynb: Jupyter Notebook containing detailed analysis description, code, and visualizations.

## Getting Started

1. **Clone the repository:**

```bash
git clone git@github.com:shahira-sadat/biodiversity-in-national-parks-portfolio-project.git
```

2. **Navigate to the project directory:**

```bash
cd biodiversity-in-national-parks-portfolio-project
```

3. **Open the Jupyter Notebook:**

```bash
jupyter notebook
```

4. **Open the biodiversity_analysis.ipynb notebook from the Jupyter file browser, or launch it directly:**

```bash
jupyter notebook biodiversity_analysis.ipynb
```

## Overview

The notebook does the following:

1. Loading Data:

- Load species_info.csv and observations.csv.
- Check data shapes and basic statistics.

2. Exploratory Data Analysis (EDA):

- Visualize species distribution across categories and conservation statuses.
- Analyze observations per park and species category.
- Explore mean observations per park and conservation status.

3. Data Integration and Visualization:

- Merge datasets to analyze species observations in parks.
- Plot total observations of species in national parks.
- Visualize mean observations per park and per species category.

4. Insights and Conclusions:

- Summarize findings on species distribution, observations, and conservation statuses.
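The aggregation steps listed above could be sketched with pandas roughly as follows. This is not the notebook's actual code: the merged table, its column names, and all values are assumptions made up for illustration.

```python
import pandas as pd

# Illustrative merged table; column names and values are assumptions.
merged = pd.DataFrame(
    {
        "scientific_name": ["Species alpha", "Species alpha", "Species beta", "Species gamma"],
        "category": ["Mammal", "Mammal", "Bird", "Bird"],
        "conservation_status": ["Endangered", "Endangered", None, None],
        "park_name": ["Park A", "Park B", "Park A", "Park B"],
        "observations": [60, 40, 90, 120],
    }
)

# Number of distinct species in each category.
species_per_category = merged.groupby("category")["scientific_name"].nunique()

# Total and mean observations per park.
per_park = merged.groupby("park_name")["observations"].agg(["sum", "mean"])

# Mean observations per park and species category, as a park x category table.
park_by_category = merged.pivot_table(
    index="park_name", columns="category", values="observations", aggfunc="mean"
)

# Mean observations per conservation status; dropna=False keeps rows
# whose status is missing instead of silently dropping them.
per_status = merged.groupby("conservation_status", dropna=False)["observations"].mean()

print(species_per_category, per_park, park_by_category, per_status, sep="\n\n")
```

Passing `dropna=False` is a deliberate choice in this sketch: species without a recorded status would otherwise disappear from the per-status summary.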

Feel free to modify and extend the notebook according to your needs.
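As one possible starting point for such changes, a "total observations per park" bar chart might be sketched with Matplotlib as below. The data frame, its column names, and its values are hypothetical, and the notebook's own plots may be built differently (for example with Seaborn).

```python
import matplotlib

matplotlib.use("Agg")  # non-interactive backend so the sketch also runs outside a notebook

import matplotlib.pyplot as plt
import pandas as pd

# Hypothetical observations; column names and values are assumptions.
observations = pd.DataFrame(
    {
        "park_name": ["Park A", "Park B", "Park A", "Park B"],
        "observations": [60, 40, 90, 120],
    }
)

# Sum observations per park, largest first.
totals = (
    observations.groupby("park_name")["observations"]
    .sum()
    .sort_values(ascending=False)
)

# One bar per park.
fig, ax = plt.subplots(figsize=(6, 4))
ax.bar(totals.index, totals.values)
ax.set_xlabel("National park")
ax.set_ylabel("Total observations")
ax.set_title("Total species observations per park")
fig.tight_layout()
```

Inside Jupyter, the `matplotlib.use("Agg")` line can be dropped so the figure renders inline.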

## Author

👤 Shahira Sadat

- GitHub: [Shahira Sadat](https://github.com/shahira-sadat)
- Twitter: [Shahira Sadat](https://twitter.com/SadatShahira)
- LinkedIn: [Shahira Sadat](https://www.linkedin.com/in/shahira-sadat)
- Gmail: [email protected]

Contributions, issues, and feature requests are welcome!

## Show your support

Give a ⭐️ if you like this project!