An open API service indexing awesome lists of open source software.

https://github.com/odessaz/portfolioprojects

This is a repository I have created to showcase skills, share projects and track my progress in Data Analytics and Data Science
https://github.com/odessaz/portfolioprojects

applied-mathematics data-analysis data-science excel jupyter-notebook matplotlib-pyplot pandas portfolio python r r-studio seaborn sql statistics

Last synced: 2 months ago
JSON representation

This is a repository I have created to showcase skills, share projects and track my progress in Data Analytics and Data Science

Awesome Lists containing this project

README

        

# Data Analyst Portfolio Project Repository
## About Me:
Hi!, I'm Odessa, a third-year Mathematics and Its Applications student at Toronto Metropolitan University.

This repository showcases my skills, shares projects, and tracks my progress in Data Analytics and Data science-related topics. I will update it as I create more projects!

## Table of Contents
- [About Me](https://github.com/OdessaZ/PortfolioProjects/tree/main?tab=readme-ov-file#about-me)

- [Portfolio Projects](https://github.com/OdessaZ/PortfolioProjects?tab=readme-ov-file#portfolio-projects)

- Python
- [Examining Key Determinants of Movie Success](https://github.com/OdessaZ/PortfolioProjects?tab=readme-ov-file#examining-key-determinants-of-movie-success)
- [Finance Tracker](https://github.com/OdessaZ/PortfolioProjects/tree/main#2-finance-tracker)
- [Linear Regression with Python](https://github.com/OdessaZ/PortfolioProjects#4-linear-regression-with-python)
- Microsoft Excel
- [Bike Sales Data Dashboard: Interactive Excel Analysis](https://github.com/OdessaZ/PortfolioProjects#3-bike-sales-data-dashboard-interactive-excel-analysis)
- R Studio
- SQL
- Tableau

- [Education](https://github.com/OdessaZ/PortfolioProjects?tab=readme-ov-file#education)

- [Contact](https://github.com/OdessaZ/PortfolioProjects?tab=readme-ov-file#contact)

## Portfolio Projects
In this section, I will list my data analysis projects and briefly describe each one.

### 1. Examining Key Determinants of Movie Success
**Code**: [Examining Key Determinants of Movie Success](https://github.com/OdessaZ/PortfolioProjects/blob/main/Python%20Movie%20Correlation.ipynb)

**Goal**: To determine what factors contribute the most to a movie's success.

**Description**: This project explores the correlation between various movie features (e.g., budget, revenue, rating) using Python. By analyzing a dataset of movies, the project applies data wrangling techniques, correlation analysis, and data visualization to uncover patterns and relationships between different movie attributes. The goal is to better understand how certain factors impact movie success, providing insights that could be valuable for filmmakers, marketers, and data enthusiasts.

**Skills**: Data cleaning, Data analysis, Correlation matrices, Hypothesis Testing, Data Visualization.

**Technology**: Python, Pandas, Numpy, Seaborn, Matplotlib, SciPy.

**Results**: Using Python functions, the analysis revealed that votes and budget have the highest correlation with gross earnings.

### 2. Finance Tracker
**Code**: [Finance Tracker](https://github.com/OdessaZ/PortfolioProjects/blob/main/Expense_tracker.py)

**Goal**: Get expense data from a user (via the terminal), save and categorize each expense into an Excel spreadsheet that is automatically created and updated.

**Description**: This project is a simple finance tracker designed to help users view their finances effectively. It allows users to record, and categorize their expenses while staying within their budget. This project demonstrates skills in Python programming, and data handling while showcasing a practical approach to personal finance management. It highlights the potential of coding to simplify everyday tasks and improve productivity.

**Skills**: Loops, Dictionaries, Variables and Data Types, Operators, Lists, User Input, Functions, Classes, Tuples and Sets.

**Technology**: Python.

### 3. Bike Sales Data Dashboard: Interactive Excel Analysis
**Code**: [Bike Sales Data Dashboard: Interactive Excel Analysis](https://github.com/OdessaZ/PortfolioProjects/blob/main/Excel%20Project.xlsx)

**Goal**: To create an interactive dashboard using Excel that visualizes demographic data and allows users to filter and analyze trends in bike-related purchases based on factors like marital status, education, and region.

**Description**: This project demonstrates the creation of an interactive, data-driven dashboard using Microsoft Excel. It includes the integration of multiple pivot charts, slicers for filtering key demographic data, and various visual enhancements to create a user-friendly interface. The dashboard allows for detailed analysis of various data fields, such as marital status, education level, and region, with the ability to slice and dice the information for deeper insights. This project showcases proficiency in Excel data visualization and manipulation, ideal for showcasing in a portfolio to demonstrate Excel skills.

**Skills**: Data Visualization, Data Cleaning, Pivot Tables, Slicers, Report Connections, and Filtering and Analyzing Data Based on Multiple Variables.

**Technology**: Microsoft Excel

**Results**: An interactive and visually appealing dashboard that allows users to explore and analyze key data insights, such as demographic trends, purchasing behavior, and regional patterns, through the use of pivot tables and slicers in Excel. The dashboard enables dynamic filtering and comparison across various variables, offering valuable insights for decision-making.

### 4. Linear Regression with Python
**Code**: [Linear Regression with Python](https://github.com/OdessaZ/PortfolioProjects/blob/main/Linear%20Regression%20with%20Python.ipynb)

**Goal**: To understand and implement linear regression using Python, applying statistical learning concepts to analyze data and make predictions. It involved preprocessing data, building and evaluating a regression model, and visualizing results to gain insights.

**Description**: This project focuses on implementing linear regression in Python to explore the relationship between variables and make predictions. It begins with data preprocessing using libraries like Pandas and NumPy, followed by building and training a regression model with scikit-learn. The project emphasizes evaluating model performance through metrics such as R-squared and Mean Squared Error, ensuring accurate predictions. Data visualizations created with Matplotlib and Seaborn provide insights into trends and the model's fit. This project demonstrates practical skills in data analysis, statistical modeling, and machine learning applications.

**Skills**: Data Visualization, Statistical Modeling, Model Development and Evaluation, and Scikit-learn.

**Technology**: Python

**Results**: The linear regression model showed a strong relationship between the variables, making accurate predictions possible. The R-squared value confirmed the model explained the data well, and residual analysis showed it met key assumptions. This proved the model worked effectively for the dataset.

## Education
Toronto Metropolitan University: Bachelor's Degree, Mathematics and Its Applications, 2022 - 2026

## Contact
- LinkedIn: [@odessazang](https://www.linkedin.com/in/odessa-zang/)
- Email: [[email protected]](mailto:[email protected])