Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/danpoynor/pet-shelter-data-analysis-notebook
Demonstration of skills analyzing data from a pet shelter. The CSV data contains tables detailing the incoming and outgoing animals and I use my knowledge of Pandas to gather and present the requested information.
https://github.com/danpoynor/pet-shelter-data-analysis-notebook
csv data-analysis data-cleaning data-science jupyter-notebook matplotlib numpy pandas pet-shelter tabular-data
Last synced: 1 day ago
JSON representation
Demonstration of skills analyzing data from a pet shelter. The CSV data contains tables detailing the incoming and outgoing animals and I use my knowledge of Pandas to gather and present the requested information.
- Host: GitHub
- URL: https://github.com/danpoynor/pet-shelter-data-analysis-notebook
- Owner: danpoynor
- Created: 2022-09-22T16:32:14.000Z (about 2 years ago)
- Default Branch: main
- Last Pushed: 2022-10-05T12:00:30.000Z (about 2 years ago)
- Last Synced: 2024-01-24T15:03:05.014Z (10 months ago)
- Topics: csv, data-analysis, data-cleaning, data-science, jupyter-notebook, matplotlib, numpy, pandas, pet-shelter, tabular-data
- Language: Jupyter Notebook
- Homepage:
- Size: 8.37 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Pet Shelter Data Analysis Notebook
Demonstration of skills analyzing data from a pet shelter. The CSV data contains tables detailing incoming and outgoing animals and I use my knowledge of Pandas to gather and present the requested information.
This notebook was created using JupyterLab application through Anaconda.
---
## Notebook Organization
The notebook is divided into the following sections. Note some sections aren't necessary to the analysis but included for completeness.
A. Setup and import data
B. Initial investigation of the data
* View the head() of each DataFrame
* View the info() of each DataFrame
* View the describe() output for each DataFrame
* View the length of each DataFrame
* View the shape() for each DataFrame
* View the dtypes() for each DataFrameC. Clean the data and prepare it for analysis
* Handle Duplicates: Repeated data
* Combine the two DataFrames
* Handle Formatting: Misspellings, extra whitespace, differences after combining multiple datasets
* Handle Missing Data: Data labeled as unknown, Nan, or empty
* Handle Type Issues: Data that is a different type than expected
* Handle Nonsensical Data: Data that does not make sense
* Handle Saturated Data: Data that is at the extremes of the measurement
* Handle Confidential Data: Personally identifiable information
* Handle Individual Errors: Errors that affect a single value
* Handle Systematic Errors: Errors that affect all or large portions of the data
* Create Additional Columns for AnalysisD. Analyze the data
1. Is there an area where more pets are found?
2. What is the average number of pets found in a month in the year 2015? Are there months where there is a higher number of animals found?
3. What is the ratio of incoming pets vs. adopted pets?
4. What is the distribution of the types of animals in the shelter?
5. What are the adoption rates for specific breeds?
6. What are the adoption rates for different colorings?
7. About how many animals are spayed/neutered each month?Additional Solutions for Exceeds Rating
1. How many animals in the shelter are repeats? Which animal was returned to the shelter the most?
2. What are the adoption rates for the following age groups?* baby: 4 months and less
* young: 5 months - 2 years
* adult: 3 years - 10 years
* senior: 11+3. If spay/neuter for a dog costs 100 dollars and a spay/neuter for a cat costs 50 dollars, how much did the shelter spend in 2015 on these procedures?
---
## Data
Source: