An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with data-analysis-python

A curated list of projects in awesome lists tagged with data-analysis-python .

https://github.com/sumit-sinha9/uber-rides-data-analysis

This project aims to analyze Uber ride data to understand various aspects of ride usage, such as the distribution of rides across different categories, purposes, months, days, and times.

data-analysis-python data-analytics data-visualization pandas-python powerbi python rec uber

Last synced: 12 Jun 2025

https://github.com/camara94/data_analyse_series_temporelles

Dans ce tutoriel, nous allons répondre aux questions suivantes: 1. Lire les données Microsoft à l'aide du package **Pandas Data reader** 2. Obtenez le **prix maximum** de l'action de **2017 à 2022** 3. Quelle est la **date du cours le plus élevé** de l'action ? 4. Quelle est la **date du cours le plus bas** de l'action ?

data-analysis data-analysis-python data-science data-structures-and-algorithms data-visualization serie series-forecasting

Last synced: 09 Apr 2025

https://github.com/prateek5525/teco-churn-analysis

This repository analyzes customer churn patterns for Teco, aiming to identify key factors driving churn and to provide actionable insights for retention strategies. Key variables include payment methods, contract types, customer tenure, and demographics, offering a data-driven approach to understanding and mitigating customer churn risks.

data-analysis-python jupyter-notebook python

Last synced: 05 Sep 2025

https://github.com/controldata23/shopping-data-from-istanbul

This analysis is an EDA done on Istanbul Shopping dataset from kaggle.

data-analysis-python data-cleaning data-exploration descriptive-statistics eda jupyter-notebook

Last synced: 16 Feb 2026

https://github.com/yusuf4030/the-data-analyst-toolkit

📊 Explore essential data analysis tools organized by role and task, empowering users from students to professionals with quick access to valuable resources.

budget budget-management business-intelligence charts cookbook cureated-list data data-analysis-python data-visualization internet-of-everything internet-of-transport large-language-models nse open-source python selenium stock-market traffic-analysis

Last synced: 06 Sep 2025

https://github.com/faizantkhan/automated-eda

This repository showcases tools for automatic Exploratory Data Analysis (EDA) in Python. These tools help you quickly understand your datasets and generate insightful reports.

automatic automation autoviz data-analysis data-analysis-python data-science data-visualization dtale dtale-library eda exploratory-data-analysis ml pandas pandas-profiling python python-library sweetviz

Last synced: 05 Dec 2025

https://github.com/giyanellow/time-series-analysis-on-philippine-debt-and-inflation

A Time Series Analysis on the Philippine Inflation Rate with some predictions using RandomForest.

data-analysis data-analysis-python machine-learning python random-forest

Last synced: 20 Feb 2025

https://github.com/fimblo/leanstats

Compute lean metrics on Kanban ticket data. Analyse cycle-time, throughput, and more.

data-analysis-python kanban lean-metrics

Last synced: 10 Mar 2025

https://github.com/abdoomohamedd/python-data-analysis-projects

A collection of data analysis projects that I have worked on. Each project involves cleaning data, performing exploratory data analysis (EDA), and creating visualizations to extract insights. The projects utilize various Python libraries, including pandas, numpy, matp

data-analysis data-analysis-python data-cleaning data-visualization matplotlib matplotlib-pyplot numpy pandas python

Last synced: 06 Sep 2025

https://github.com/camara94/web-scraping-with-requests-beautifulsoup-and-selenium

Dans ce tutoriel, nous allons découvrir les techniques de web-scraping en request, beautiful-soup et sélénium

beautifulsoup data-analysis-python requests requests-library-python selenium web-scraping

Last synced: 07 Sep 2025

https://github.com/edwinrlambert/investigating-netflix-movies

Demonstrates data analysis and visualization techniques for Netflix movies using Python in a Jupyter notebook. This is a DataCamp project.

data-analysis data-analysis-python netflix python

Last synced: 12 Mar 2025

https://github.com/ljadhav25/world-population-analysis-1990-2023-

This repository contains data and analysis related to the world population from 1990 to 2023. The objective is to explore population trends, identify patterns, and visualize demographic changes across different countries and continents over the past few decades.

data-analysis-python data-visualization matplotlib numpy-library pandas-library seaborn

Last synced: 08 Oct 2025

https://github.com/pragati928/cancer-severity-prediction-ml

📊 End-to-end data science project predicting cancer severity using Python, EDA, and Random Forests — focusing on lifestyle and genetic factors.

data-analysis-python data-science-projects eda machine-learning pandas-python random-forest scikit-learn visualizations

Last synced: 08 Oct 2025

https://github.com/hgabrali/masterschool-python-data-analysis-starter

A standardized, best-practice, and bilingual curriculum template for Data Analysis projects. Focuses on mastering core Python libraries (Pandas, NumPy) and the **CRISP-DM** methodology, covering essential steps from Data Assessment to advanced Data Cleaning and Integration. **Content is structured for both Turkish and English learners.*

data-analysis-python data-cleaning data-science data-wrangling datascience english masterschool multilingual multilingual-translations pandas pandas-dataframe python starter-template turkce-kaynak turkish

Last synced: 09 Oct 2025

https://github.com/rightfulcode/customer-segmentation-rfm

This project performs customer segmentation using Recency, Frequency, and Monetary (RFM) metrics to identify key customer groups and provide actionable marketing insights.

data-analysis-python data-visualization elevvo-internship jupyter-notebook matplotlib pandas python rfm-analysis seaborn

Last synced: 10 Oct 2025

https://github.com/controldata23/population-of-countries

An Exploratory Data Analysis done on a Countries dataset from kaggle

data-analysis-python data-cleaning data-exploration eda jupyter-notebook pandas

Last synced: 19 Jan 2026

https://github.com/thinzarhninyu/dap

Notes and Projects for Data Analysis with Python course from FreeCodeCamp.org

data-analysis data-analysis-python ipynb jupyter-notebook python

Last synced: 18 Feb 2026

https://github.com/satyamtripathi8/tools_for_data_science

Introduction to Data Science Tools(Python)

data-analysis-python matplotlib-pyplot numpy pandas

Last synced: 19 Oct 2025

https://github.com/srinidhijai/cafe-sales_data-cleaning-eda

Data cleaning and EDA project using real-world cafe sales & transaction data

data-analysis-python exploratory-data-analysis exploratory-data-analysis-eda matplotlib pandas python3 seaborn

Last synced: 27 Oct 2025

https://github.com/raulmaulidhino-dev/ml_modelling_regression

There are many factors that influence the grades/scores of students. One of the factors is study hours. In this mini analysis project, there are 3 models that will learn and predict the relation between study hours of students and their scores in an exam/test. This project will result the best ML model to solve the problem.

data data-analysis-python data-science eda machine-learning scikit-learn

Last synced: 28 Jan 2026

https://github.com/toluwalase-taiwo/altschoolafrica

This repository showcases my projects and learnings from my one-year diploma course with AltSchool Africa School of Data, Data Science Track. The projects demonstrate my skills and knowledge in data science, machine learning, and programming.

data-analysis-python data-science jupyter-notebook python-programming

Last synced: 29 Jan 2026

https://github.com/ayuub34/siesta-smart

This project is designed to be a AIaaS 2-Way-PMS optimized with AI and ML models. The goal is to enhance the efficiency of the system through advanced technology solutions.

ai data-analysis-python data-processing data-science jupyter-python machine-learning matplotlib numpy pandas powerbi-report pyqt5 scipy seaborn sklearn

Last synced: 25 Feb 2026

https://github.com/xre22zax/biodiversity---national-parks

National Parks Service about endangered species

data-analysis-python data-visualization ipynb python python3

Last synced: 04 Mar 2026

https://github.com/gui-sitton/games

Identify patterns that determine whether a game is successful or not. This will allow you to identify potential big winners and plan advertising campaigns.

data data-analysis data-analysis-python data-science data-visualization python

Last synced: 18 Mar 2025

https://github.com/grindelfp/borrowers-investigation

An analysis of a dataset of borrowers, EDA and identification of dependences between debts and other features.

data-analysis-python ipynb-notebook mlda

Last synced: 04 Mar 2026

https://github.com/gui-sitton/bank-loans

In this project I will prepare a report for a bank's loan division. I find out whether a customer's marital status and number of children have an impact on loan default, as well as other factors

data data-analysis data-analysis-python data-science data-visualization python

Last synced: 18 Mar 2025

https://github.com/gui-sitton/prepaid

In this project I work as an analyst for the telecommunications company Megaline. The company offers its customers prepaid plans, Surf and Ultimate. The sales department wants to know which plans bring in the most revenue in order to adjust the advertising budget

data data-analysis data-analysis-python data-science data-visualization python

Last synced: 18 Mar 2025

https://github.com/gui-sitton/carsells

In this project I am an analyst on the Crankshaft List. Hundreds of free vehicle advertisements are published on the site every day. I need to study the data collected over the last few years and determine which factors influence the price of a vehicle.

data data-analysis data-analysis-python data-science data-visualization python

Last synced: 18 Mar 2025

https://github.com/sonu275981/uber-rides-data-analysis

Analysis of Uber's Ridership Data for NYC.

data-analysis-python flask machine-learning numpy pandas uber

Last synced: 10 Apr 2025

https://github.com/smsraj2001/sds-datathon

A simple data science project/hackathon done as part of SDS course

data-analysis data-analysis-python data-cleaning data-science statistics statistics-for-data-science

Last synced: 16 Jul 2025

https://github.com/thirza258/course_da_id

Berisi hasil pengerjaan terhadap course dari mostly Freecodecamp

backend data-analysis-python data-science javascript jupyter-notebook python

Last synced: 09 Apr 2025

https://github.com/fatihilhan42/wnba-draft-player-dataanalysis-1997-2022-with-python

In this project, the statistics of the players in the WNBA drafts from 1997 to 2022 were examined. The data in the dataset, which you can find in the repo, was first organized using data cleaning algorithms. These cleaned data were then graphically extracted using data visualization algorithms.

data-analysis data-analysis-python data-visualization jupyter-notebook python

Last synced: 29 Oct 2025

https://github.com/k-bloch/cafe-rewards-offers-analysis

This project explores customer interaction patterns with a café rewards program using Python and Jupyter Notebook, focusing on offer completion rates, demographic trends, and visualizations to enhance marketing strategies.

data-analysis-python jupyter-notebook seaborn sql sqlite3

Last synced: 19 Mar 2025

https://github.com/sadia-khan13/modern_arts_data_cleaning

Welcome to the Data Cleaning project! This repository is dedicated to showcasing best practices and techniques for cleaning data using Pandas within Jupyter Notebook

data-analysis data-analysis-python data-cleaning data-science jupyter-notebook pandas-python

Last synced: 20 Mar 2025

https://github.com/narpat78/instagram-user-analytics

An analytics project integrating Python (Jupyter Notebook) with MySQL to extract insights for Instagram’s marketing team and investors, covering loyal users, inactive users, hashtags, fake accounts, and user engagement.

data-analysis-python eda mysql-connector

Last synced: 09 Sep 2025

https://github.com/idjy/python-weather-app

# 🌤️ Python Weather AppA simple desktop weather application built with Python and Tkinter. It fetches real-time weather data from OpenWeatherMap for Karbala, Iraq, and displays it in a clean interface. 🌍

beginner-friendly bootstrap data data-analysis-python django good-first-issue hacktoberfest hacktoberfest2023 matplotlib python-application python-desktop-application python-framework python-library terminal twinkle weather-api web website

Last synced: 17 Jun 2025

https://github.com/athari22/multivariable_regression_and_valuation_model_

Multivariable regression model using Python to analyze and predict Boston housing prices based on various socioeconomic and environmental features.

data-analysis data-analysis-python housing-prices housing-prices-competition machine-learning pandas pandas-python plotly python regression-models seaborn seaborn-python sklearn

Last synced: 17 Jun 2025

https://github.com/arfazrll/data-analyst-dashboard

Data Analyst Dashboard is an interactive tool designed to help data analysts explore, analyze, and visualize datasets with ease. Using Dash and Plotly.

csv-files dashboards data-analysis-python python streamlit

Last synced: 20 Jan 2026

https://github.com/drill-n-bass/dealavo-project

Cartesian product from dictionary to list of dictionaries and faster methods for finding index than the `index` method.

data-analysis data-analysis-python matplotlib pandas python python3 random timeit

Last synced: 30 Oct 2025

https://github.com/korniichuk/pydatan-homework

Python Data Analysis course homework

course data-analysis data-analysis-python python python3

Last synced: 18 Jul 2025

https://github.com/aishwaryagm1999/electric-vehicles-dataset-data-analaysis

Performed Data Cleaning and Data Analysis of the Electric Vehicles Dataset to find the relationship between the features in the dataset and visualized the findings using matplotlib and seaborn.

data-analysis-python data-visualization electric-vehicle feature-analysis matplotlib numpy pandas python seaborn

Last synced: 30 Dec 2025

https://github.com/aaleksandraristic/machine-learning-predictive-models---ga-time-series-prediction

Developing an accurate and reliable financial prediction model for the next 5 years using historical data, to assist investors, traders, and financial analysts make informed decisions about buying or selling stocks in a dynamic market.

data-analysis-python data-management data-visualization-python forecasting-time-series machine-learning machine-learning-algorithms prediction-model python time-series-analysis

Last synced: 30 Oct 2025

https://github.com/sabdikay/telco-customer-churn-analysis-ibm-dataset

This project explores customer churn trends for a company in California using an IBM dataset. Built in a Jupyter Notebook, it employs pandas, NumPy, matplotlib, seaborn, plotly, and scipy to clean, analyze, and visualize data. Through statistical tests and interactive maps, it uncovers key drivers behind customer cancellations

business-intelligence customer-churn data-analysis data-analysis-python data-visualization exploratory-data-analysis jupyter-noteboook matplotlib numpy pandas plotly predictive-modeling python scipy seaborn statistical-analysis

Last synced: 30 Dec 2025

https://github.com/cyprianfusi/new-york-city-public-schools-and-sat-scores

One of the most controversial issues in the U.S. educational system is the efficacy of standardized tests and whether they're unfair to certain groups. We could correlate SAT scores with factors like race, gender, income, and more.

data-analysis-python data-cleaning data-visualization data-wrangling

Last synced: 21 Mar 2025

https://github.com/saurabh9136/data-analysis_using_pandas-numpy

A beginner-friendly repository exploring data analysis using NumPy and Pandas. Covers fundamental operations, data manipulation, and real-world dataset analysis.

data-analysis-python numpy pandas python scipy

Last synced: 05 Apr 2025

https://github.com/robinhosz/traffic-data-analyzer

This repository presents a project for urban traffic signal optimization using Reinforcement Learning (RL) to improve traffic flow, increase safety at intersections, and promote energy efficiency in cities. Using the "Traffic Volume Counts" dataset from Kaggle.

data-analysis-python kaggle-dataset numpy pandas python traffic-analysis

Last synced: 21 Mar 2025

https://github.com/michelereginabora/cienciadacomputacaopython

Este repositório é resultado do meu primeiro passo para a análise de dados com Python. Aqui contém todos os exercícios realizados no curso de Introdução à Ciência da Computação em Python pela plataforma Coursera, em janeiro de 2023.

data-analysis-python

Last synced: 29 Mar 2025

https://github.com/abdoomohamedd/data-science-projects

A collection of data science projects ranging from exploratory data analysis to predictive modeling and clustering. Each project is designed to solve specific problems or explore particular datasets using various data science techniques and tools.

data-analysis data-analysis-python data-cleaning data-science data-visualization machine-learning machine-learning-algorithms

Last synced: 14 May 2025

https://github.com/prak112/ibm_datasciencecertification

Projects (Assignment projects for each course) related to IBM Data Science Certification courses

data-analysis-python data-cleaning data-visualization-python machine-learning-algorithms

Last synced: 04 Mar 2025

https://github.com/anuj-kshatriya/iphone_sales_data_analysis_project_using_python

This project explores Apple product sales data using Python and Pandas in Jupyter Notebook. It focuses on data cleaning, analysis, and visualization, providing insights into product performance, customer trends, and revenue generation.

data-analysis-python dataanalysisusingpython graphical-data pandas python pythonlibrarires pythonproject

Last synced: 29 Mar 2025

https://github.com/danicaalana/wine-dataset-decision-tree

This project is developed as part of Digital Skill Fair (DSF) 35.0 - Data Science by Dibimbing. I am using Wine Recognition Dataset from scikit-learn, which is the results of a chemical analysis of wines grown in the same region in Italy by three different cultivators.

data data-analysis-python data-science decision-tree-classification machine-learning python scikit-learn wine-dataset

Last synced: 01 Nov 2025

https://github.com/nrhartnett/passwordstrengthapp

This project combines machine learning with data science implemented in a python environment. Completed as my Master's degree practicum, it displays skills in data analytics as well as intermediate level python development using serializing and de-serializing a Python object structure (Pickle File).

data-analysis-python data-science excel machine-learning-algorithms orange-data-mining pickle-file python python-filehandling

Last synced: 22 Mar 2025

https://github.com/gui-sitton/y.music

In this project I compared the musical preferences of the citizens of Springfild and Shelbyville. I examined real Y.Music data to test hypotheses and compare the behavior of users in these two cities.

data data-analysis data-analysis-python data-science data-visualization python

Last synced: 18 Mar 2025

https://github.com/l0rd-inquisit0r/data-analytics

A repository of data analytics implementations in Python

ai data-analysis data-analysis-python data-analytics

Last synced: 18 Jun 2025

https://github.com/shruti23-ui/diwali_sales_analysis

Diwali Sales Analysis: A data analysis project exploring Diwali sales trends, focusing on demographic insights like age and gender-based purchasing behavior. Uses Python for data cleaning, visualization, and insights extraction.

data-analysis-python dataanalysis-projects matplotlib-pyplot python3 seaborn-plots

Last synced: 23 Jul 2025

https://github.com/emmanuel-dominic/devops-microservices-kubernetes

Project having a pre-trained, sklearn model that has been trained to predict housing prices in Boston according to several features, such as average rooms in a home and data about highway access, teacher-to-pupil ratios, and so on. This project tests your ability to operationalize a Python flask app in a provided file, `app.py` that serves out predictions (inference) about housing prices through API calls. This project could be extended to any pre-trained machine learning model, such as those for image recognition and data labeling.

ci-cd circleci data-analysis-python devops docker kubernetes microservices

Last synced: 06 Sep 2025

https://github.com/mahmoudwal27/brazilian_ecommerce

This project explores and cleans the Olist Brazilian E-Commerce dataset using Python (Pandas) to prepare it for Power BI visualization. The process includes loading data, performing exploratory analysis, handling missing values and duplicates, formatting key columns, and exporting clean datasets.

analytics data-analysis data-analysis-python google-cloud python

Last synced: 23 Jul 2025

https://github.com/imsalione/sql-fundamentals-farsi

This project offers a versatile and automated solution for generating database reports in structured formats.

data-analysis-python data-engineering database mssql-database python ssis

Last synced: 30 Mar 2025

https://github.com/bulbatronik/network-data-analysis

Repo containing homeworks and the project files for "Network Measurements & Data Analysis Lab" course taken during academic year 2022-2023 summer semester of Master of Telecommunication Engineering program at Politecnico di Milano.

anomaly-detection clustering data-analysis-python explainable-ai machine-learning network-measurement

Last synced: 09 Apr 2025

https://github.com/nishumehta/london-bike-rides-analysis

Analyzed London bike ride patterns using Python (Pandas, Matplotlib) and Tableau, creating interactive dashboards with time series and heatmaps to identify trends and correlations.

dashboard data-analysis-python data-visualization-with-tableau excel jupyter-notebook python tableau tableau-public

Last synced: 18 Sep 2025

https://github.com/makuche/multi-fidelity-bo

This repository contains data analysis scripts that I have used for my Master thesis research on multi-fidelity Bayesian optimization. The origin of the raw experimental data can be found in the thesis (not in this repository).

data-analysis-python gaussian-processes materials-science multi-fidelity-data multi-task-learning transfer-learning

Last synced: 06 Sep 2025

https://github.com/vikktor93/global-air-pollution

This repository contains an initial data analysis of the global_air_pollution dataset. This dataset measures air quality levels in different cities around the world.

air-quality anaconda contaminants data-analysis-python data-science jupyter-notebook python python3

Last synced: 19 Sep 2025

https://github.com/sahilsapariya/sem_vi

All the material for the sem VI is available here including code of labs

compiler-design data-analysis-python hibernate-jpa html-css-javascript operating-system reactjs

Last synced: 02 Aug 2025

https://github.com/controldata23/product-sales-from-amazon

This is an Exploratory Data Analysis done on the Amazon Product Sales dataset from kaggle.

data-analysis-python data-cleaning data-exploration data-visualisation eda matplotlib

Last synced: 02 Aug 2025

https://github.com/awais11227/pandas_import_export

Practical examples of importing and exporting different file formats using Pandas, including CSV, Excel, JSON, and more.

csv data-analysis-python data-export data-import excel json pandas-tutorial pandas-tutorial-for-2025

Last synced: 27 Sep 2025

https://github.com/isabella13022012/lucas_kent

Personal portfolio for Lucas Kent. Forked from lenpaul's portfolio-jekyll-theme.

data-analysis-python data-analytics data-visualisation kurtosis matplotlib pandas pyplot python sklearn

Last synced: 03 Oct 2025

https://github.com/keziatbnn/supervised-regression-salaryprediction

Make salary predictions based on years of experience using supervised regression.

data data-analysis-python data-prediction data-science python

Last synced: 11 Aug 2025

https://github.com/ct83/become-a-data-analyst-udacity

This repository contains all of the code, projects and reports that I wrote as I pursued my Udacity - Data Analyst NanoDegree.

data-analysis data-analysis-python data-analyst data-visualisation data-visualization-project datascience python udacity udacity-data-analyst-nanodegree

Last synced: 12 Aug 2025

https://github.com/jprmaulion/cholera-gedeo-ethiopia-spatial-analysis

Exploratory spatial analysis and visualization of cholera case clusters in Gedeo Zone, Ethiopia that integrates demographic and geographic data to identify environmental risk patterns and inform public health interventions. Includes geospatial mapping of cholera incidence relative to waterways and administrative boundaries.

cholera data-analysis data-analysis-python epidemiology ethiopia openstreetmap python spatial-analysis

Last synced: 04 Oct 2025

https://github.com/oguzhansarigol/Expected-Goals-xG-Data-Analysis

We will analyze in our code the expected goal locations of football players based on which areas of the field, at what times, with which feet, using which parts of their bodies, and from which angles and distances they are most likely to score.

data-analysis-python expected-goals numpy pandas-python python

Last synced: 20 Aug 2025

https://github.com/imprvhub/ecommerce-data-analysis

perform in-depth data analysis from two different next.js projects using python, with flask and gunicorn deployed on azure. [Implementation].

azure data-analysis-python flask gunicorn implementation mysql python

Last synced: 23 Aug 2025

https://github.com/drill-n-bass/ovh-project

The goal of this task is to prepare statistical analysis of set of data from disks.

anaconda analysis data-analysis data-analysis-python jupyter-notebook matplotlib-python pandas python3 seaborn-plots

Last synced: 05 Nov 2025