An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with data-analysis-python

A curated list of projects in awesome lists tagged with data-analysis-python .

https://github.com/mahmoudwal27/e-commerce-data-analysis

A collection of data analysis and visualization projects focused on ecommerce datasets. Using Python in Google Colab for analysis and Excel for exploration, these projects uncover key insights and trends, showcasing expertise in data manipulation and visualization to inform business decisions.

analytics data-analysis data-analysis-python data-set google-cloud python

Last synced: 21 Apr 2026

https://github.com/edwinrlambert/investigating-netflix-movies

Demonstrates data analysis and visualization techniques for Netflix movies using Python in a Jupyter notebook. This is a DataCamp project.

data-analysis data-analysis-python netflix python

Last synced: 25 Apr 2026

https://github.com/robinhosz/traffic-data-analyzer

This repository presents a project for urban traffic signal optimization using Reinforcement Learning (RL) to improve traffic flow, increase safety at intersections, and promote energy efficiency in cities. Using the "Traffic Volume Counts" dataset from Kaggle.

data-analysis-python kaggle-dataset numpy pandas python traffic-analysis

Last synced: 28 Apr 2026

https://github.com/matheusafonseca/python-data-visualization-matplotlib-seaborn-masterclass-udemy

This repository is dedicated to storing the code developed during the "Python Data Visualization: Matplotlib & Seaborn Masterclass" course on Udemy.

charts data-analysis data-analysis-python data-science data-visualization database graphics graphics-programming jupyter-notebook matplotlib matplotlib-plots python python3 seaborn seaborn-plots

Last synced: 28 Apr 2026

https://github.com/mr-dhan/eda-sales-customer-transactions

Dalam dunia bisnis ritel yang kompetitif, pemahaman mendalam terhadap perilaku pelanggan merupakan fondasi penting untuk pengambilan keputusan strategis. Namun, data transaksi pelanggan seringkali berjumlah besar dan kompleks, sehingga memerlukan proses analisis yang efektif untuk mengungkap insight yang berharga.

dashboard data data-analysis data-analysis-python data-science data-visualization eda python

Last synced: 29 Apr 2026

https://github.com/abdoomohamedd/python-data-analysis-projects

A collection of data analysis projects that I have worked on. Each project involves cleaning data, performing exploratory data analysis (EDA), and creating visualizations to extract insights. The projects utilize various Python libraries, including pandas, numpy, matp

data-analysis data-analysis-python data-cleaning data-visualization matplotlib matplotlib-pyplot numpy pandas python

Last synced: 01 May 2026

https://github.com/satyamtripathi8/tools_for_data_science

Introduction to Data Science Tools(Python)

data-analysis-python matplotlib-pyplot numpy pandas

Last synced: 01 May 2026

https://github.com/drill-n-bass/dealavo-project

Cartesian product from dictionary to list of dictionaries and faster methods for finding index than the `index` method.

data-analysis data-analysis-python matplotlib pandas python python3 random timeit

Last synced: 06 May 2026

https://github.com/korniichuk/pydatan-homework

Python Data Analysis course homework

course data-analysis data-analysis-python python python3

Last synced: 06 May 2026

https://github.com/bnvulpe/regression-and-time-series

This work centers on assessing and comparing predictive models for regression and time series prediction using specific datasets, with the goal of selecting the most effective methodology for unseen test data.

colab data-analysis data-analysis-python data-science data-visualization forecasting jupyter-notebook machine-learning model-evaluation predictive-modeling python regression sarima sarimax time-series-analysis time-series-analysis-and-forecasting

Last synced: 08 May 2026

https://github.com/imsalione/sql-fundamentals-farsi

This project offers a versatile and automated solution for generating database reports in structured formats.

data-analysis-python data-engineering database mssql-database python ssis

Last synced: 08 May 2026

https://github.com/arfazrll/data-analyst-dashboard

Data Analyst Dashboard is an interactive tool designed to help data analysts explore, analyze, and visualize datasets with ease. Using Dash and Plotly.

csv-files dashboards data-analysis-python python streamlit

Last synced: 08 May 2026

https://github.com/nahiyanhkhan/data-processing-and-visualization

Loan Data Processing using Python's numpy and pandas libraries. For data visualization, matplotlib and seaborn are used.

data-analysis-python data-visualization matplotlib numpy pandas seaborn

Last synced: 09 May 2026

https://github.com/drill-n-bass/ovh-project

The goal of this task is to prepare statistical analysis of set of data from disks.

anaconda analysis data-analysis data-analysis-python jupyter-notebook matplotlib-python pandas python3 seaborn-plots

Last synced: 09 May 2026

https://github.com/gui-sitton/prepaid

In this project I work as an analyst for the telecommunications company Megaline. The company offers its customers prepaid plans, Surf and Ultimate. The sales department wants to know which plans bring in the most revenue in order to adjust the advertising budget

data data-analysis data-analysis-python data-science data-visualization python

Last synced: 22 May 2026

https://github.com/aaleksandraristic/machine-learning-predictive-models---ga-time-series-prediction

Developing an accurate and reliable financial prediction model for the next 5 years using historical data, to assist investors, traders, and financial analysts make informed decisions about buying or selling stocks in a dynamic market.

data-analysis-python data-management data-visualization-python forecasting-time-series machine-learning machine-learning-algorithms prediction-model python time-series-analysis

Last synced: 14 May 2026

https://github.com/sumit-sinha9/uber-rides-data-analysis

This project aims to analyze Uber ride data to understand various aspects of ride usage, such as the distribution of rides across different categories, purposes, months, days, and times.

data-analysis-python data-analytics data-visualization pandas-python powerbi python rec uber

Last synced: 15 May 2026

https://github.com/smsraj2001/sds-datathon

A simple data science project/hackathon done as part of SDS course

data-analysis data-analysis-python data-cleaning data-science statistics statistics-for-data-science

Last synced: 16 Jul 2025

https://github.com/sadia-khan13/modern_arts_data_cleaning

Welcome to the Data Cleaning project! This repository is dedicated to showcasing best practices and techniques for cleaning data using Pandas within Jupyter Notebook

data-analysis data-analysis-python data-cleaning data-science jupyter-notebook pandas-python

Last synced: 10 May 2026

https://github.com/mahmoudwal27/brazilian_ecommerce

This project explores and cleans the Olist Brazilian E-Commerce dataset using Python (Pandas) to prepare it for Power BI visualization. The process includes loading data, performing exploratory analysis, handling missing values and duplicates, formatting key columns, and exporting clean datasets.

analytics data-analysis data-analysis-python google-cloud python

Last synced: 16 May 2026

https://github.com/narpat78/instagram-user-analytics

An analytics project integrating Python (Jupyter Notebook) with MySQL to extract insights for Instagram’s marketing team and investors, covering loyal users, inactive users, hashtags, fake accounts, and user engagement.

data-analysis-python eda mysql-connector

Last synced: 09 Sep 2025

https://github.com/athari22/multivariable_regression_and_valuation_model_

Multivariable regression model using Python to analyze and predict Boston housing prices based on various socioeconomic and environmental features.

data-analysis data-analysis-python housing-prices housing-prices-competition machine-learning pandas pandas-python plotly python regression-models seaborn seaborn-python sklearn

Last synced: 17 Jun 2025

https://github.com/nishumehta/london-bike-rides-analysis

Analyzed London bike ride patterns using Python (Pandas, Matplotlib) and Tableau, creating interactive dashboards with time series and heatmaps to identify trends and correlations.

dashboard data-analysis-python data-visualization-with-tableau excel jupyter-notebook python tableau tableau-public

Last synced: 17 May 2026

https://github.com/fatihilhan42/wnba-draft-player-dataanalysis-1997-2022-with-python

In this project, the statistics of the players in the WNBA drafts from 1997 to 2022 were examined. The data in the dataset, which you can find in the repo, was first organized using data cleaning algorithms. These cleaned data were then graphically extracted using data visualization algorithms.

data-analysis data-analysis-python data-visualization jupyter-notebook python

Last synced: 17 May 2026

https://github.com/k-bloch/cafe-rewards-offers-analysis

This project explores customer interaction patterns with a café rewards program using Python and Jupyter Notebook, focusing on offer completion rates, demographic trends, and visualizations to enhance marketing strategies.

data-analysis-python jupyter-notebook seaborn sql sqlite3

Last synced: 17 May 2026

https://github.com/sabdikay/telco-customer-churn-analysis-ibm-dataset

This project explores customer churn trends for a company in California using an IBM dataset. Built in a Jupyter Notebook, it employs pandas, NumPy, matplotlib, seaborn, plotly, and scipy to clean, analyze, and visualize data. Through statistical tests and interactive maps, it uncovers key drivers behind customer cancellations

business-intelligence customer-churn data-analysis data-analysis-python data-visualization exploratory-data-analysis jupyter-noteboook matplotlib numpy pandas plotly predictive-modeling python scipy seaborn statistical-analysis

Last synced: 07 Apr 2026

https://github.com/gui-sitton/y.music

In this project I compared the musical preferences of the citizens of Springfild and Shelbyville. I examined real Y.Music data to test hypotheses and compare the behavior of users in these two cities.

data data-analysis data-analysis-python data-science data-visualization python

Last synced: 18 May 2026

https://github.com/cyprianfusi/new-york-city-public-schools-and-sat-scores

One of the most controversial issues in the U.S. educational system is the efficacy of standardized tests and whether they're unfair to certain groups. We could correlate SAT scores with factors like race, gender, income, and more.

data-analysis-python data-cleaning data-visualization data-wrangling

Last synced: 21 Mar 2025

https://github.com/yusuf4030/the-data-analyst-toolkit

📊 Explore essential data analysis tools organized by role and task, empowering users from students to professionals with quick access to valuable resources.

budget budget-management business-intelligence charts cookbook cureated-list data data-analysis-python data-visualization internet-of-everything internet-of-transport large-language-models nse open-source python selenium stock-market traffic-analysis

Last synced: 18 May 2026

https://github.com/gui-sitton/games

Identify patterns that determine whether a game is successful or not. This will allow you to identify potential big winners and plan advertising campaigns.

data data-analysis data-analysis-python data-science data-visualization python

Last synced: 18 May 2026

https://github.com/michelereginabora/cienciadacomputacaopython

Este repositório é resultado do meu primeiro passo para a análise de dados com Python. Aqui contém todos os exercícios realizados no curso de Introdução à Ciência da Computação em Python pela plataforma Coursera, em janeiro de 2023.

data-analysis-python

Last synced: 29 Mar 2025

https://github.com/abdoomohamedd/data-science-projects

A collection of data science projects ranging from exploratory data analysis to predictive modeling and clustering. Each project is designed to solve specific problems or explore particular datasets using various data science techniques and tools.

data-analysis data-analysis-python data-cleaning data-science data-visualization machine-learning machine-learning-algorithms

Last synced: 14 May 2025

https://github.com/prak112/ibm_datasciencecertification

Projects (Assignment projects for each course) related to IBM Data Science Certification courses

data-analysis-python data-cleaning data-visualization-python machine-learning-algorithms

Last synced: 04 Mar 2025

https://github.com/anuj-kshatriya/iphone_sales_data_analysis_project_using_python

This project explores Apple product sales data using Python and Pandas in Jupyter Notebook. It focuses on data cleaning, analysis, and visualization, providing insights into product performance, customer trends, and revenue generation.

data-analysis-python dataanalysisusingpython graphical-data pandas python pythonlibrarires pythonproject

Last synced: 11 May 2026

https://github.com/gui-sitton/carsells

In this project I am an analyst on the Crankshaft List. Hundreds of free vehicle advertisements are published on the site every day. I need to study the data collected over the last few years and determine which factors influence the price of a vehicle.

data data-analysis data-analysis-python data-science data-visualization python

Last synced: 20 May 2026

https://github.com/nrhartnett/passwordstrengthapp

This project combines machine learning with data science implemented in a python environment. Completed as my Master's degree practicum, it displays skills in data analytics as well as intermediate level python development using serializing and de-serializing a Python object structure (Pickle File).

data-analysis-python data-science excel machine-learning-algorithms orange-data-mining pickle-file python python-filehandling

Last synced: 22 Mar 2025

https://github.com/gui-sitton/bank-loans

In this project I will prepare a report for a bank's loan division. I find out whether a customer's marital status and number of children have an impact on loan default, as well as other factors

data data-analysis data-analysis-python data-science data-visualization python

Last synced: 21 May 2026

https://github.com/l0rd-inquisit0r/data-analytics

A repository of data analytics implementations in Python

ai data-analysis data-analysis-python data-analytics

Last synced: 18 Jun 2025

https://github.com/faizantkhan/automated-eda

This repository showcases tools for automatic Exploratory Data Analysis (EDA) in Python. These tools help you quickly understand your datasets and generate insightful reports.

automatic automation autoviz data-analysis data-analysis-python data-science data-visualization dtale dtale-library eda exploratory-data-analysis ml pandas pandas-profiling python python-library sweetviz

Last synced: 18 Apr 2026

https://github.com/shruti23-ui/diwali_sales_analysis

Diwali Sales Analysis: A data analysis project exploring Diwali sales trends, focusing on demographic insights like age and gender-based purchasing behavior. Uses Python for data cleaning, visualization, and insights extraction.

data-analysis-python dataanalysis-projects matplotlib-pyplot python3 seaborn-plots

Last synced: 15 May 2026

https://github.com/emmanuel-dominic/devops-microservices-kubernetes

Project having a pre-trained, sklearn model that has been trained to predict housing prices in Boston according to several features, such as average rooms in a home and data about highway access, teacher-to-pupil ratios, and so on. This project tests your ability to operationalize a Python flask app in a provided file, `app.py` that serves out predictions (inference) about housing prices through API calls. This project could be extended to any pre-trained machine learning model, such as those for image recognition and data labeling.

ci-cd circleci data-analysis-python devops docker kubernetes microservices

Last synced: 18 May 2026

https://github.com/bulbatronik/network-data-analysis

Repo containing homeworks and the project files for "Network Measurements & Data Analysis Lab" course taken during academic year 2022-2023 summer semester of Master of Telecommunication Engineering program at Politecnico di Milano.

anomaly-detection clustering data-analysis-python explainable-ai machine-learning network-measurement

Last synced: 09 Apr 2025

https://github.com/makuche/multi-fidelity-bo

This repository contains data analysis scripts that I have used for my Master thesis research on multi-fidelity Bayesian optimization. The origin of the raw experimental data can be found in the thesis (not in this repository).

data-analysis-python gaussian-processes materials-science multi-fidelity-data multi-task-learning transfer-learning

Last synced: 06 Sep 2025

https://github.com/vikktor93/global-air-pollution

This repository contains an initial data analysis of the global_air_pollution dataset. This dataset measures air quality levels in different cities around the world.

air-quality anaconda contaminants data-analysis-python data-science jupyter-notebook python python3

Last synced: 19 Sep 2025

https://github.com/controldata23/product-sales-from-amazon

This is an Exploratory Data Analysis done on the Amazon Product Sales dataset from kaggle.

data-analysis-python data-cleaning data-exploration data-visualisation eda matplotlib

Last synced: 02 Aug 2025

https://github.com/awais11227/pandas_import_export

Practical examples of importing and exporting different file formats using Pandas, including CSV, Excel, JSON, and more.

csv data-analysis-python data-export data-import excel json pandas-tutorial pandas-tutorial-for-2025

Last synced: 27 Sep 2025

https://github.com/isabella13022012/lucas_kent

Personal portfolio for Lucas Kent. Forked from lenpaul's portfolio-jekyll-theme.

data-analysis-python data-analytics data-visualisation kurtosis matplotlib pandas pyplot python sklearn

Last synced: 03 Oct 2025

https://github.com/keziatbnn/supervised-regression-salaryprediction

Make salary predictions based on years of experience using supervised regression.

data data-analysis-python data-prediction data-science python

Last synced: 11 Aug 2025

https://github.com/ct83/become-a-data-analyst-udacity

This repository contains all of the code, projects and reports that I wrote as I pursued my Udacity - Data Analyst NanoDegree.

data-analysis data-analysis-python data-analyst data-visualisation data-visualization-project datascience python udacity udacity-data-analyst-nanodegree

Last synced: 12 Aug 2025

https://github.com/jprmaulion/cholera-gedeo-ethiopia-spatial-analysis

Exploratory spatial analysis and visualization of cholera case clusters in Gedeo Zone, Ethiopia that integrates demographic and geographic data to identify environmental risk patterns and inform public health interventions. Includes geospatial mapping of cholera incidence relative to waterways and administrative boundaries.

cholera data-analysis data-analysis-python epidemiology ethiopia openstreetmap python spatial-analysis

Last synced: 12 Apr 2026

https://github.com/oguzhansarigol/Expected-Goals-xG-Data-Analysis

We will analyze in our code the expected goal locations of football players based on which areas of the field, at what times, with which feet, using which parts of their bodies, and from which angles and distances they are most likely to score.

data-analysis-python expected-goals numpy pandas-python python

Last synced: 20 Aug 2025

https://github.com/asghar-rizvi/eda_hotel_cancellations

A comprehensive analysis of hotel booking data to understand factors influencing cancellations. This project explores trends in booking patterns, room rates, and cancellation rates using Python and data visualization libraries.

data-analysis-project data-analysis-python data-analysis-real-world-problem data-science python real-world-problem-solving real-world-project report visualization

Last synced: 12 May 2026

https://github.com/aksshri2004/tesla-stock-integration

Implements API to analyse Tesla Stocks and sends a mail and text message when the price fluctuates by 5%.

api-implementation data-analysis-python smptlib twilio-api

Last synced: 23 Mar 2025

https://github.com/farhad-here/data-visualization-analysis-dva

This is my data analysis project. Users can use this project to clean and preprocessing the date or data visualization. Individuals can impute or ecnode ther dataset.

altair bokeh data-analysis data-analysis-python io matplotlib numpy pandas plotly python sklearn streamlit

Last synced: 11 Apr 2026

https://github.com/abhinavbammidi1401/covid-19_analytics

A very comprehensive notebook of statistical models to analyze Covid-19 data and visualization.

analytics covid-19 data-analysis-python data-analytics data-science data-visualization jupyter-notebook predictive-modeling python

Last synced: 19 May 2026

https://github.com/shwetam19/data-analysis-projects

This repository contains 4 projects for analyzing and visualizing data.

data-analysis-python data-science

Last synced: 30 Jun 2025

https://github.com/samjoesilvano/adventureworks_sales_performance_dashboard

Createed an interactive 4-page dashboard for AdventureWorks that visualizes key sales metrics—including revenue, profit, orders, and return rates—across 2020 to 2022. Featuring dynamic geographic analysis and detailed customer insights, this dashboard empowers data-driven decision-making and enhances business performance.

business-intelligence data-analysis-python data-analytics data-driven-decisions data-modeling data-visualization geographic-analysis interactive-dashboards kpi-metrics powerbi sales-performance-analysis

Last synced: 05 Jan 2026

https://github.com/teamtonic/vectonic

🌟Vectonic - Optimized App Creator And Publisher🌟 produces an optimized application and shares it privately and securely to huggingface so that junior executives can immediately produce a high performance knowledge retrieval application for enterprise.

data-analysis-python gradio huggingface-hub togetherai tonic-ai vectara vectara-cli

Last synced: 24 Mar 2025

https://github.com/marlaugustin/fall2024-machine_learning

This is a repository containing the different labs and projects that I had to do for my machine learning class for my fall semester

data-analysis-python python-pandas

Last synced: 24 Mar 2025

https://github.com/administroot/ehs_scrutinizer_preview

一款EHS数据分析软件,用于降低报告出错率(Preview版)

data-analysis-python ehs excel sqlite3

Last synced: 16 Mar 2025

https://github.com/sajjad425/missingvalue

This repository provides a guide on handling missing values in Python, covering identification methods, imputation techniques (mean, median, mode, fill, interpolation), advanced methods (KNN, multiple imputation), and best practices. It includes practical examples for both numerical and categorical data.

data data-analysis-python data-science missing-value-handling missing-value-imputation

Last synced: 04 Apr 2025

https://github.com/camara94/data_analyse_series_temporelles

Dans ce tutoriel, nous allons répondre aux questions suivantes: 1. Lire les données Microsoft à l'aide du package **Pandas Data reader** 2. Obtenez le **prix maximum** de l'action de **2017 à 2022** 3. Quelle est la **date du cours le plus élevé** de l'action ? 4. Quelle est la **date du cours le plus bas** de l'action ?

data-analysis data-analysis-python data-science data-structures-and-algorithms data-visualization serie series-forecasting

Last synced: 09 Apr 2025

https://github.com/controldata23/shopping-data-from-istanbul

This analysis is an EDA done on Istanbul Shopping dataset from kaggle.

data-analysis-python data-cleaning data-exploration descriptive-statistics eda jupyter-notebook

Last synced: 16 Feb 2026

https://github.com/giyanellow/time-series-analysis-on-philippine-debt-and-inflation

A Time Series Analysis on the Philippine Inflation Rate with some predictions using RandomForest.

data-analysis data-analysis-python machine-learning python random-forest

Last synced: 18 Mar 2026

https://github.com/camara94/web-scraping-with-requests-beautifulsoup-and-selenium

Dans ce tutoriel, nous allons découvrir les techniques de web-scraping en request, beautiful-soup et sélénium

beautifulsoup data-analysis-python requests requests-library-python selenium web-scraping

Last synced: 07 Sep 2025

https://github.com/ljadhav25/world-population-analysis-1990-2023-

This repository contains data and analysis related to the world population from 1990 to 2023. The objective is to explore population trends, identify patterns, and visualize demographic changes across different countries and continents over the past few decades.

data-analysis-python data-visualization matplotlib numpy-library pandas-library seaborn

Last synced: 08 Oct 2025

https://github.com/pragati928/cancer-severity-prediction-ml

📊 End-to-end data science project predicting cancer severity using Python, EDA, and Random Forests — focusing on lifestyle and genetic factors.

data-analysis-python data-science-projects eda machine-learning pandas-python random-forest scikit-learn visualizations

Last synced: 08 Oct 2025