An open API service indexing awesome lists of open source software.

Data analysis

Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.

https://github.com/vasishta03/econovisionai

A simple Python desktop app to search and explore OECD economic data (CSV) and report summaries (TXT/JSON) using a modern CustomTkinter GUI—no SQL or web frameworks needed.

csv customtkinter data-analysis desktop-app economic-data gui json local-app oecd pandas python search tkinter

Last synced: 10 May 2026

https://github.com/yeopster/datascience_notebook

A compilation of my notebooks based on Kaggle datasets

data-analysis data-science kaggle notebook python

Last synced: 10 May 2026

https://github.com/datasqlsantosh/global-energy-consumption-renewable-generation-python-data-analysis-portfolio

This project focuses on analyzing global energy consumption patterns and trends in renewable energy generation using Python data analysis libraries such as Seaborn and NumPy. The analysis aims to explore energy consumption data from various regions worldwide and examine the contribution of renewable energy sources over time.

data data-analysis data-visualization pandas seaborn

Last synced: 10 May 2026

https://github.com/szuzick/us-immigration-presidential-analysis

Power BI dashboard analyzing 40 years of U.S. immigration data across presidential administrations (1981-2020)

dashboard data-analysis data-visualization government-data immigration powerbi powerbi-dashboards powerbi-visuals presidential-analysis

Last synced: 10 Jun 2026

https://github.com/luca-02/credit-card-fraud-detection

This is a small master's degree project for the New Generation Data Models and DBMSs course (academic year 2024/25).

data-analysis database nosql python

Last synced: 10 Jun 2026

https://github.com/sdley/tp2_datascience

Practical exercise in data processing with Python

data-analysis pandas python

Last synced: 11 May 2026

https://github.com/monarch1108/customerinsights-kmeans

Understanding customers using KMeans and RFM (recency, frequency, and monetary) analysis

data-analysis data-visualization kmeans-clustering machine-learning matplotlib numpy pandas scikit-learn

Last synced: 11 May 2026

https://github.com/szuzick/hr-analytics-pipeline

End-to-end HR analytics solution using PostgreSQL, dbt, and Power BI

data-analysis data-visualization database-maintenance dbt hr-analytics insights postgresql powerbi sql

Last synced: 10 Jun 2026

https://github.com/parthds02/customer-segmentation-with-kmeans-clustering

Analyze customer behavior using Python and KMeans Clustering on transactional data. Features RFM analysis, data cleaning, clustering insights, and actionable visualizations to support business decision-making.

data-analysis data-visualization feature-engineering kmeans-clustering numpy pandas vscode

Last synced: 11 May 2026

https://github.com/hrosicka/czechpopulationestimation

This GitHub repository contains Python code for data analysis and population prediction in the Czech Republic up to the year 2050, using the Pandas and Matplotlib libraries.

data-analysis data-visualization matplotlib matplotlib-figures matplotlib-pyplot pandas pandas-dataframe pandas-library pandas-python python python3

Last synced: 11 May 2026

https://github.com/sferez/gradient_descent

Multiple Linear Regression, Gradient Descent with Python

data-analysis data-science gradient-descent linear-regression python

Last synced: 12 May 2026

https://github.com/deliprofesor/amazon-movie-analysis-and-visualization

"Amazon Movie Analysis and Visualization" is a Python project that analyzes and visualizes movie data from Amazon.com, including ratings, directors, actors, release years, MPAA ratings, and pricing. The project provides insights into movie trends and popular films, helping users explore key patterns through interactive visualizations.

data-analysis data-visualization matplotlib pandas python

Last synced: 12 May 2026

https://github.com/leticia-ducatti/sales-dashboard-project

Interactive sales dashboard built with Python and Streamlit — shows KPIs, allows filtering, and visualizes sales data.

data-analysis pandas plotly python streamlit

Last synced: 12 May 2026

https://github.com/jayita11/customer-engagement-insights-for-yelp-restaurant-business-success

This project analyzes Yelp restaurant data using SQLite, Python, and Tableau to explore user engagement, reviews, and ratings. It provides insights into restaurant success across cities, regions, and user behavior.

customer-engagement data-analysis interactive-visualizations json python ratings review sqlite3 tableau-dashboards-for-data-visualization yelp-restaurants

Last synced: 12 May 2026

https://github.com/ygalvao/bra_scraper_2022

A web scraper bot for the 2nd round of the 2022 Brazilian Federal Elections.

data-analysis data-analytics selenium web-scraper webscraper

Last synced: 12 May 2026

https://github.com/leopeng1995/neuralsql

Make DataStore More Intelligent

data-analysis mongodb sql

Last synced: 12 May 2026

https://github.com/sakan811/honkai-star-rail-a-few-fun-insights-with-data-analysis

This project offers insights into the stats of all Honkai Star Rail characters available as of the given date.

data data-analysis data-science data-visualization docker flask game honkai honkai-star-rail honkai-starrail seaborn webscraping webscraping-data webscraping-selenium

Last synced: 10 Jun 2026

https://github.com/sebastian-diaz-berdecia/analisis-popularidad-de-series-y-generos-de-series

SQL queries for analyzing the popularity of series and series genres in the NetflixDB database.

business-analytics bussiness-intelligence data data-analysis database mysql mysql-database sql

Last synced: 12 May 2026

https://github.com/johannaschmidle/amazon-cat-couch

Customer product reviews + ratings analysis and visualization [Python, Excel, Tableau, R]

data-analysis data-visualization jupyter-notebook python-notebook r-markdown sentiment-analysis text-analysis web-scraping

Last synced: 11 Jun 2026

https://github.com/parthds02/-daily-calorie-count-meal-plan-generator-

Welcome to the Daily Calorie Count Meal Plan Generator project! This Streamlit web application is designed to create personalized meal plans based on user inputs such as age, weight, gender, and calorie goals. It also allows users to download their customized meal plans as PDFs.

calories-tracker data-analysis data-science pdf-generation streamlit vscode

Last synced: 13 May 2026

https://github.com/roland045/smart_fluid_sedimentation_tester

Control program for custom developed smart fluid sedimentation tester system

arduino data-analysis instrumentation measurement sensor

Last synced: 13 May 2026

https://github.com/manukot/sturdy-engine-python-

I've learnt not only various theoretical concepts but also practical projects in my Master's coursework

data-analysis data-visualization python3

Last synced: 13 May 2026

https://github.com/lucs1590/agidatatest

This is a repository with data analysis and data science tests.

data-analysis data-science python test

Last synced: 13 May 2026

https://github.com/madhurragarwal/advertising-data-set---eda-and-ml

Logistic Regression and EDA done on Advertising Data set

data-analysis machine-learning

Last synced: 13 May 2026

https://github.com/deliprofesor/joblocationmapper

JobLocationMapper is a Python tool that visualizes job listings on an interactive map. It uses city and state data to place job markers accurately and color-codes them by occupation (Software, Marketing, Design). The map clusters markers for better organization, and users can click on them to view job details.

clustrered-markers data-analysis data-visualization folium geocoding geographical-visualization interactive-map job-listings map-visualization pandas python

Last synced: 14 May 2026

https://github.com/yeonjaee/data-analytics

Converts raw data into actionable insights

data-analysis text-mining

Last synced: 11 Jun 2026

https://github.com/satvikpraveen/matplotlibmasterpro

📷 MatplotlibMasterPro is a complete, portfolio-ready project to master data visualization using matplotlib. Includes 16 notebooks, real datasets, exportable plots, custom themes, Streamlit dashboard, and Docker support. Ideal for learners and data professionals.

charts custom-plots dashboarding data-analysis data-science data-visualization educational-project interactive-visualizations jupyter-notebook matplotlib notebooks open-source plotting portfolio-project python python-utilities reproducible-research subplots time-series-analysis visualization-tools

Last synced: 14 May 2026

https://github.com/yashsingh43/lung-cancer-biomarker-analysis

Gene expression analysis to identify biomarkers for early lung cancer detection (SCLC & NSCLC)

bioinformatics biomarkers cancer cytoscape data-analysis gene-expression gsea nsclc r sclc

Last synced: 11 Jun 2026

https://github.com/iamsainikhil/web-data-scraping

Data scraping from a webpage using Python

beautiful-soup data-analysis data-scraping python

Last synced: 11 Jun 2026

https://github.com/skuschel/postexperiment

Postprocessor for experimental (event-based) data.

data-analysis eventstore hacktoberfest postprocessing

Last synced: 12 Jun 2026

https://github.com/sambit-mondal/stockx

StockX is a full-stack application designed to help store owners efficiently manage their inventory, track purchases, and analyze stock levels. The system integrates MongoDB, Express, React, and Flask (Python) to provide a seamless experience.

artificial-intelligence data-analysis inventory-management-system machine-learning mern-stack

Last synced: 12 Jun 2026

https://github.com/luizassimoes/q5ga-latency-and-throughput

Quick 5G Analyser: PyQT5 software developed to help with simple graphical analysis and chart generation for ping and iperf3 tests.

data-analysis data-visualization pyqt5 python

Last synced: 13 Jun 2026

https://github.com/nob101/lotto-analyzer

A Node.js and SQLite web application to analyze, track, and evaluate lottery (Euromillionen) and Joker results.

backend css data-analysis express html5 javascript nodejs sqlite statistical-analysis

Last synced: 14 Jun 2026

https://github.com/brunomontezano/sleep-quality-cognition

💤 Analysis of the paper "Associations between general sleep quality and measures of functioning and cognition in subjects recently diagnosed with bipolar disorder".

bipolar-disorder cognition data-analysis sleep-analysis sleep-research

Last synced: 15 Jun 2026

https://github.com/marielachirinosr/nyc-taxi-trip-exploration-2019-2020

Explores passenger behavior & impact of COVID-19 on NYC taxi industry (Q1 2019-2020).

bigquery data data-analysis data-visualization python sql tableau

Last synced: 15 Jun 2026

https://github.com/juanse0330/registro-pacientes-terapia-python

Python project to automate the registration and analysis of patients in home-based occupational therapy. A tool aimed at the healthcare sector.

automatizacion data-analysis python salud terapia-ocupacional

Last synced: 17 Jun 2026

https://github.com/fahadnasir13/financial_data-analyzer_tool

A Python-based framework for analyzing, cleaning, and reconciling financial data stored in Excel workbooks.

data-analysis excel financial python store

Last synced: 17 Jun 2026

https://github.com/ibttf/bayborhood

Interactive map to find the ideal neighborhood in San Francisco based on data.

data data-analysis data-visualization gis mapbox react

Last synced: 18 Jun 2026

https://github.com/duoan/ds-nbs

Data analysis and machine learning notebook.

data-analysis data-scientists deep-learning kaggle-competition machine-learning

Last synced: 18 Jun 2026

https://github.com/httpsnooow/graphs-analysis-neo4j

Challenges from the "Neo4J - Data Analysis with Graphs" course by Digital Innovation One (DIO).

challenge data-analysis data-engineering data-science graph neo4j neo4j-database neo4j-graph

Last synced: 18 Jun 2026

https://github.com/alinababer/covid19-timeseries-cases-and-deaths-forecasting-

This study is based on confirmed cases and deaths collected from Pakistan. Results demonstrate the promising potential of time series models in forecasting COVID-19 cases and highlight the superior performance of the time series models compared to the LSTM. We apply AI-based forecasting models such as time series ARIMA, LSTM, Prophet, and VAR.

arima covid-19 data-analysis data-science data-visualization fbprophet forecasting lstm rnn time-series var vectorautoregression

Last synced: 19 Jun 2026

https://github.com/angelmtenor/idafc

Udacity's Intro to Data Analysis

data-analysis

Last synced: 20 Jun 2026

https://github.com/an4pdm/relatorio-de-vendas

This project was built with the tools offered by Power BI in order to improve my knowledge of ETL. The data used came from the Kaggle website.

data-analysis data-visualization database etl powerbi

Last synced: 20 Jun 2026

https://github.com/sakan811/stress-pattern-occurrence-in-english-words

This project is intended to provide English learners with data that allows them to make a data-driven guess when they encounter words they aren't sure how to stress.

data-analysis data-visualization english english-language english-learning language powerbi powerbi-report powerbi-visuals

Last synced: 20 Jun 2026

https://github.com/dcs-training/intro-to-statistics

Intro to Statistics workshop. In this repo, you are going to find the code and files we are going to use for the practical part of the workshop, together with the ppt associated with this training. Go to the readme file

data-analysis data-visualisation data-wrangling r statistics

Last synced: 20 Jun 2026

https://github.com/aonurakman/data-analysis-and-ml-algorithms

An exploration of data analysis techniques and standard ML algorithms on QSAR oral toxicity dataset. - 2021 - Yıldız Technical University

classification clustering data-analysis data-mining isolation-forest python regression

Last synced: 20 Jun 2026

https://github.com/jayavarshini-jayakumaran/nba-exploratory-data-analysis

A data analytics project that explores NBA game and player data using Python and Power BI. Features data preprocessing, EDA, feature engineering, and an interactive dashboard for visualizing team and player performance trends.

data-analysis data-visualization exploratory-data-analysis powerbi python3

Last synced: 20 Jun 2026

https://github.com/haseebn19/urban-housing-demand

A full-stack web application for visualizing housing and labour market data

data-analysis data-visualization docker full-stack gradle statistics web webapp

Last synced: 22 Jun 2026

https://github.com/emaleckova/emaleckova.github.io

My personal website created with Quarto

biology data-analysis data-viz quarto r

Last synced: 23 Jun 2026

https://github.com/anburocky3/cbse-schools-data

Fetch CBSE Schools in seconds and use it for your data projects

cbse data data-analysis data-science grabber nextjs

Last synced: 24 Jun 2026

https://github.com/vbhvsingh0/coulombic_dyn_formaltetra

The Python code simulates a formaldehyde tetra-cation molecule using Coulombic forces

data-analysis physics-simulation python shell-scripting

Last synced: 24 Jun 2026

https://github.com/imosudi/unsupervised-ml-kmeans-analysis

K-Means clustering analysis using synthetic datasets generated with scikit-learn, including meshgrid visualisation, silhouette score evaluation, and investigation of cluster count and random seed effects.

clustering data-analysis jupyter-notebook kmeans kmeans-clustering machine-learning matplotlib python3 scikit-learn silhouette-score unsupervised-learning

Last synced: 25 Jun 2026

https://github.com/parsabordbar/ctx3docs

Documentation for the Context Tree project.

ai-tools context ctx3 ctx3-docs data-analysis documentation tree workflow

Last synced: 25 Jun 2026

https://github.com/vevdokimovm/python-course-notebooks

Python course practice scripts, Jupyter notebooks and deep learning exercises from Grokking Deep Learning

data-analysis deep-learning jupyter python

Last synced: 27 Jun 2026

https://github.com/soyuid/bakery-data-analyst

This Bakery Data Analysis project was created to help bakery owners understand their sales patterns. With in-depth data analysis, it is expected to provide useful insights to improve sales and operational strategies.

bakery data-analysis python sales visualization

Last synced: 24 Mar 2025

https://github.com/vikpires/ds_tips-dataset

Individual project for the Avanti 2024.2 data science bootcamp, aimed at analyzing and observing patterns in the "Tips" dataset.

data-analysis data-science data-visualization exploratory-data-analysis jupyter-notebook matplotlib numpy pandas python seaborn tips

Last synced: 17 Sep 2025

https://github.com/wardenkenny/data-analyst-portfolio

A repository I have created to show and explore data analytics.

data-analysis excel r spreadsheets sql tableau

Last synced: 02 Apr 2025

https://github.com/datastalker/survival-cox

This repository contains an R script for performing survival analysis on breast cancer surgery data from the University of Chicago's Billings Hospital. The analysis includes Kaplan-Meier estimation and Cox Proportional Hazards modeling to assess patient survival.

breast-cancer-prediction cox-model data-analysis data-science data-visualization epidemiology kaplan-meier r survival-analysis

Last synced: 02 Apr 2025

https://github.com/hari7261/data-visualization

Python-based application built using CustomTkinter for the graphical user interface (GUI) and Matplotlib for data visualization. It allows users to import datasets, perform real-time data visualization, and analyze data using various chart types and machine learning techniques.

data-analysis data-visualization export hari7261 import python realtime-visualization

Last synced: 17 Jun 2025

https://github.com/gaboelc/analysis-of-the-employment-situation-in-costa-rica-2018-2022

This is an analysis with data extracted from the INEC in order to identify the changes that occurred in the Costa Rican labor market before, during and after the COVID-19 pandemic.

costa-rica data-analysis empleo employment

Last synced: 24 Mar 2025

https://github.com/shreeparab1890/chat-analyzer

This is a data analysis project for analyzing WhatsApp chats.

data-analysis numpy pandas python

Last synced: 12 Apr 2026

https://github.com/m4tice/qm_project

Bicycle project crowd evaluation.

data-analysis data-engineering data-visualization

Last synced: 16 Mar 2025

https://github.com/giuleo129/dataanalysis

This folder contains two projects focused on data analysis and statistical learning using R, covering exploratory data analysis, modeling, and predictive techniques.

data data-analysis data-science statistical-learning

Last synced: 25 Jan 2026

https://github.com/saiteja-talluri/data-analytics-assignement

Report on World Happiness Data (Data Analysis and Visualisation of the data)

data-analysis data-visualization ipynb-jupyter-notebook

Last synced: 20 Jan 2026

https://github.com/dhruvil-26/powerbi-projects

This repository contains Power BI projects showcasing data analysis and interactive dashboards. Each project includes detailed visualizations and insights on diverse topics such as loan analysis, sales performance, and customer behavior.

customer-behavior-analysis data data-analysis interactive-dashboards loan-analysis powerbi sales-performance visualization

Last synced: 04 Feb 2026

https://github.com/danielafishwickinacap/coderhouse_da

Data analyst Final Project files

data-analysis

Last synced: 18 Jan 2026

https://github.com/azaz9026/car_price_prediction_model

This repository contains a machine learning model designed to predict car prices based on various features. Using historical data on car attributes such as make, model, year, mileage, and other relevant factors, the model aims to provide accurate and reliable price estimates for used cars.

data-analysis data-engineering liner-regestion machine-learning modeling numpy pandas python3 rendering

Last synced: 09 Apr 2026

https://github.com/bhushan148/finance-domain-bank-loan-report-tableau

I analyzed 🏦 bank loan data to reveal trends, KPIs, and insights. Using Tableau 📈 for dashboards and SQL 🗃️ for data extraction, I visualized loan applications, borrower profiles, and repayment behaviors 💡.

bussiness-intelligence dashboard-design data-analysis data-visualization excel figma sql sqlqueries tableau

Last synced: 08 Apr 2025

https://github.com/shridhar1504/tableau-visualization-viz.-project-

This repository contains visualization projects built with Tableau. The visualizations yield insights and strategies that help a business grow its profit margins and, in some cases, provide social value by estimating the damage and intensity of calamities.

dashboards data-analysis data-science data-visualization exploratory-data-analysis tableau tableau-dashboards tableau-public tableau-workbooks visualization

Last synced: 04 Feb 2026

https://github.com/aran203/fluxease

Python package for eddy flux data post processing

data-analysis data-science eddy-covariance python

Last synced: 03 Apr 2025

https://github.com/aldrinjenson/smart-qa

Query any structured data and find relations using natural language

data-analysis llm nlp sql

Last synced: 06 Apr 2025