An open API service indexing awesome lists of open source software.

Data analysis

Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.

https://github.com/billy-enrizky/yelpfusion

Finding All restaurants in the Maryland area using YelpFusion API

data-analysis pandas yelp-api yelpfusion

Last synced: 28 Apr 2026

https://github.com/mikasenghaas/covid19-analysis

analysis of correlation between covid-19 infection numbers and weather data from the beginning of the pandemic until april 2021

data-analysis statistical-analysis

Last synced: 14 Feb 2026

https://github.com/md-emon-hasan/1-simple-stock-price-ml-app

A simple mahcine learning application for stock prices, demonstrating data preprocessing, model training, and deployment using scikit-learn.

data-analysis data-science eda ml-app streamlit-webapp time-series time-series-analysis webapp

Last synced: 31 May 2026

https://github.com/vyjayanthipolapragada/early_sepsis_detection_ml

An end-to-end project leveraging clinical datasets (PhysioNet, MIMIC-IV, MIMIC-IV-ED) to develop and compare ML and LSTM-based models for early sepsis prediction.

data-analysis data-visualization deep-learning healthcare jupyter-notebook keras-tensorflow lstm-neural-networks machine-learning neural-network python

Last synced: 28 Apr 2026

https://github.com/kyleprotho/analysistoolbox

Analysis Tool Box (i.e. "analysistoolbox") is a collection of tools in Python for data collection and processing, statisitics, analytics, and intelligence analysis.

analytics data-analysis open-source-intelligence python3 r research snippets statistics

Last synced: 22 Aug 2025

https://github.com/alexandrelamarre/fission

Data analytics & Structured streaming optimized for the Edge

data-analysis data-engineering rust structured-data unstructured-data

Last synced: 08 Jun 2026

https://github.com/pheithar/socialdata_madridcentral

Social data and visualization course at DTU - 2022. Effectiveness of Madrid Central

data-analysis data-visualization jupyer-notebook madrid python

Last synced: 28 Apr 2026

https://github.com/affec-ds/netflix-recommender-system

Sistema de recomendación de títulos de Netflix basado en contenido. Incluye filtros por título, género y tipo de contenido (películas o series) con interfaz interactiva en Jupyter Notebook.

content-based-recommendation data-analysis eda ipywidgets jupyter-notebook machine-learning movies netflix portfolio-project python recommender-system

Last synced: 28 Apr 2026

https://github.com/allanotieno254/powerbi-chocolate-sales-analysis-dax-calculations-80-

This Power BI project analyzes **chocolate sales performance using advanced DAX calculations and interactive visualizations. The report provides insights into monthly revenue, top-selling products, sales trends, and market performance.

business-intelligence data-analysis dax powerbi powerbi-dashboards powershell-module sales-analysis visualization

Last synced: 13 Feb 2026

https://github.com/thevinh-ha-1710/diabetes-predictive-model

This project aims to train a predictive model to diagnose diabetes on women patients.

data-analysis data-science data-visualization model-training-and-evaluation python

Last synced: 13 Feb 2026

https://github.com/dsrodrigovieira/houserocketsales

Este repositório contém um projeto desenvolvido para praticar habilidades de análise de dados utilizando Python

data-analysis data-visualization heroku kaggle-dataset python

Last synced: 29 Apr 2026

https://github.com/garcane/london-housing-price-dashboard

This Excel-based Housing Visual Dashboard provides a comprehensive view of average house prices across various boroughs in London from 1996 to 2013. The dashboard is designed to offer insights into housing market trends and price variations across different areas of London over time.

data data-analysis data-visualization excel visual

Last synced: 13 Feb 2026

https://github.com/sunnybibyan/call_centre_power_bi_dashboard

Create a dashboard in Power BI to visualize relevant KPIs and metrics that will help the call center manager understand trends.

call-centre-analysis dashboard data-analysis data-visualization powerbi

Last synced: 19 Mar 2026

https://github.com/pxaris/expenditure-analyzer

Application for analyzing expenditure data over time

data-analysis data-visualization docker python statistics

Last synced: 29 Apr 2026

https://github.com/priboy313/pandasflow

A set of custom python modules for friendly workflow on pandas

catboost data-analysis data-science pandas phik python scikit-learn shap

Last synced: 20 Jan 2026

https://github.com/macdon112/layoff-analysis

SQL data cleaning & analysis of global layoffs

data-analysis data-cleaning data-exploration sql

Last synced: 21 Feb 2026

https://github.com/carolinedotxyz/dp_sgd_classification

A hands-on educational walkthrough of training a CelebA (Eyeglasses) image classifier with Differentially Private SGD using PyTorch and Opacus. The focus of this repo is on clarity and reproducibility through balanced subsets, deterministic preprocessing, and side-by-side baseline vs. DP training, while acknowledging real trade-offs.

celeba-dataset classification data-analysis dp-sgd machine-learning opacus python pytorch

Last synced: 16 May 2026

https://github.com/luminati-io/shopee-dataset-samples

A sample dataset of over 1000 Shopee products, extracted using the Bright Data API, ideal for pricing optimization, gap analysis, and market strategy refinement..

api data-analysis data-mining datasets products shopee web-scraping

Last synced: 12 Feb 2026

https://github.com/rcv911/lyapunov-indicators

Calculating Lyapunov indicators with multiprocessing in Python

data-analysis lyapunov lyapunov-indicators multiprocessing

Last synced: 18 Jan 2026

https://github.com/manwithacap/by-the-metric-match

🎲🃏 A game data tracker for your board/card/video games!

data-analysis data-visualization games jupyter-notebook python utility

Last synced: 29 Apr 2026

https://github.com/faisal-khann/diwali-sales-analysis

The "Diwali Sales Analysis" project aims to analyze the sales data during the Diwali festival period to uncover insights and trends that can help improve marketing strategies and sales performance in the future

csv data-analysis eda jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 11 Apr 2026

https://github.com/dcs-training/good-data-visualisation-with-r

Our guide on how we create data visualisations through R. Go to the readme file

data-analysis data-visualisation r rmarkdown

Last synced: 16 Jun 2026

https://github.com/azmainadel/twitter-data-neo4j

Playing with graph database on a large dataset of twitter data.

data-analysis data-visualization neo4j-database snap

Last synced: 06 Apr 2025

https://github.com/fer-aguirre/taller-cookiecutter

Taller sobre cómo usar Cookiecutter para análisis de datos.

cookiecutter data-analysis project-template workshop

Last synced: 19 Mar 2026

https://github.com/varshithdupati/yelp-business-analysis

Big Data analysis on Yelp reviews/businesses for Arizona. Using Hadoop, Spark, PySpark.

arizona-state-university big-data big-data-analytics data-analysis hadoop pyspark spark yelp

Last synced: 04 May 2026

https://github.com/rayyan9477/household-transactions-analysis-and-clustering

This project involves analyzing household transaction data to gain insights into spending patterns and behaviors. The analysis includes data cleaning, exploratory data analysis (EDA), clustering using K-Means, and visualization of customer segments.

customer-segmentation data-analysis data-cleaning data-science exploratory-data-analysis kmeans-clustering machine-learning

Last synced: 27 Feb 2025

https://github.com/phillbertnevinemmanuel/dataprofessionalquestionnaireanalysis_pbix

This project delves into the demographics and job satisfaction of data professionals, presenting insights through a user-friendly dashboard built with Power BI. The dataset has been graciously provided by Alex The Analyst

dashboard data-analysis powerbi visualization

Last synced: 19 Mar 2026

https://github.com/aadityatamrakar/futures_spread_chart

Cash Market & Futures Daily Spread Chart - NSE Stocks

data data-analysis data-mining expressjs nodejs requests

Last synced: 10 Apr 2026

https://github.com/ansh420/mcdonald_case-study

It is basically depend on the market Segment Analysis. It is a case study of mcDonald.

algorithms-implemented data-analysis python3 segmentation

Last synced: 12 Apr 2026

https://github.com/rajnish93/jpandas

A lightweight JavaScript library for working with tabular data, inspired by Pandas in Python. Built with TypeScript, it provides an intuitive API for data manipulation and analysis.

data-analysis data-analytics data-manipulation data-science dataframe javascript pandas stream-processing table typescript

Last synced: 11 Jun 2025

https://github.com/nurfakhri/e-commerce-data-analyst

E-commerce data analysis supported by data wrangling, EDA, and web dashboard

dashboard data-analysis e-commerce flask-application python

Last synced: 10 Feb 2026

https://github.com/robinmillford/analytics_for_fashion_supply_management

This Streamlit dashboard provides a comprehensive analysis of supply chain data, focusing on key metrics such as production volumes, stock levels, order quantities, revenue, manufacturing costs, lead times, shipping costs, transportation routes, risk factors, and sustainability factors

dashboard data-analysis data-visualization streamlit supply-chain-management

Last synced: 07 Sep 2025

https://github.com/alemalvarez/data-analysis-web-project

Web-app providing a simple interface for data storage,

data-analysis data-science javascript react webapp

Last synced: 29 Apr 2026

https://github.com/aritrakar/statpy

A simple package containing some functions for analysing Gaussian and Binomial distributions. Created for the Udacity AWS MLE Foundations 2021 course.

data-analysis python statistics

Last synced: 24 Oct 2025

https://github.com/rani-sikdar/python-data-structures

A repository for sharing Python implementations of common data structures and algorithms, including arrays, linked lists, stacks, queues, trees, graphs, and sorting/searching techniques. Perfect for learning, revisiting fundamentals, and collaborating on computer science concepts. Contributions welcome! 🚀

data-analysis data-structure data-visualization numpy-library pandas-dataframe python python-3

Last synced: 30 Mar 2025

https://github.com/raccoon-hero/gender-equality-tracker

A web application visualizing gender equality metrics with a focus on Ukraine. Built with Flask, it's powered by live data from global open sources, with dynamic research insights and analysis.

chartjs css dashboard data-analysis data-visualization flask frontend gender-equality global-metrics html linked-data openalex opendata python representation semantic-web ukraine webapp wikidata world-bank-api

Last synced: 07 May 2026

https://github.com/ralstonraphael/ralston_ethan_datastory

Mapping Change: Advancing Housing Equity and Stability in Connecticut's Communities

census-data data-analysis data-visualization datawrapper google-sheets housing html json

Last synced: 09 Feb 2026

https://github.com/sejalmankar1012/yuvaco_data_analysis_assessment

This assignment involves writing a Python script to calculate the cost of package deliveries based on provided data and a cost grid. The script takes package details such as weight, distance, and delivery type, applies the cost calculation rules, and saves the results in an output file. You can also run the script in Google Colab for convenience.

csv-file-handling data-analysis google-colab package-delivery python python-scripting

Last synced: 29 Apr 2026

https://github.com/iamjuniorb/data_structures_and_algorithms

I'm working on Data Structures and Algorithms I C949 class in school and decided to write up all of these searching algorithms, sorting algorithms, strutures, and so on to get a better understanding. These can be used with large datasets to test their space and time complexities.

data data-analysis data-science data-structures datastructures datastructures-algorithms datastructuresandalgorithm math mathematics programming python python-app python-library python3

Last synced: 08 Jun 2026

https://github.com/ismailtekin05/caloriedetectingai

🍎🔍 Smart AI system that identifies food items in photos and calculates their calorie content automatically. Built with TensorFlow, YOLOv8, CUDA and computer vision for accurate nutrition tracking.

ai aimodel calorie-calculator computer-vision cuda data-analysis data-science data-segmentation data-visualization dataset dataset-generation image-processing image-recognition python segmentation-models tensorflow ultralytics yaml yolo yolov8

Last synced: 29 Apr 2026

https://github.com/geo-y20/uber-rides-data-analysis

This project aims to analyze Uber ride data to understand various aspects of ride usage, such as the distribution of rides across different categories, purposes, months, days, and times.

dashboard dashboard-templates data data-analysis data-analysis-python data-analytics data-visualization pandas powerbi python recommendation-system rides uber

Last synced: 13 Apr 2026

https://github.com/pngo1997/axa-xl-insurance-bi-dashboard

Provides a comprehensive analysis of insurance submissions, approvals, compliance rates, and profitability for AXA XL Insurance.

bi-analytics bi-dashboard business-analytics data-analysis filtering performance-analysis powerbi segmentation visualization

Last synced: 08 Feb 2026

https://github.com/ddihora1604/iitk_esg

Researching and Analyzing key ESG (Environmental, Social, Governance indicators) metrics and their impact on stock performance and market behavior. Leveraging AI techniques (like Machine Learning and NLP) in finance to extract insights from ESG disclosures, enhancing financial predictions and sustainable investment strategies.

data-analysis data-visualization esg python yahoo-finance

Last synced: 24 Apr 2025

https://github.com/jakobzmrzlikar/fake-news-analysis

An analysis of the FakeNewsNet dataset using NLP techniques.

data-analysis fake-news ipynb-jupyter-notebook nlp-machine-learning

Last synced: 05 Mar 2026

https://github.com/maugus0/sats-flight-data-fetcher

A simple Python tool to fetch and analyze flight data for 15+ major airlines using the AirLabs API.

airline-data cli-tool data-analysis flight-data python3

Last synced: 17 Mar 2026

https://github.com/mafda/seattle_airbnb_data_analysis

This repository contains a comprehensive analysis of the Seattle Airbnb dataset, conducted using the CRISP-DM (Cross Industry Standard Process for Data Mining) methodology.

crisp-dm data-analysis data-science jupyter-notebook pandas-python seattle-data

Last synced: 29 May 2026

https://github.com/is-leeroy-jenkins/sherpa

A budget execution & data analysis tool based on Winforms, .NET 6, and written in C# for EPA analysts

budget-management data-analysis data-science data-visualization federal-government

Last synced: 13 May 2026

https://github.com/arielle0222/data_analysis

📊 Data analysis projects for autonomous driving and smart mobility engineering using Python and SQL.

autonomous-driving composite data-analysis electric-vehicles environmental-data python visualizatoin

Last synced: 30 Apr 2026

https://github.com/albertomorini/policesviolence

Repository for the project of the course Data Science (Fondamenti di Scienza dei Dati) at UniUD.

data-analysis data-science data-visualization r

Last synced: 31 May 2026

https://github.com/elissorokin/data-analyst-portfolio-rus

Это репозиторий, в котором я демонстрирую свои навыки, делюсь проектами и отслеживаю прогресс в области анализа данных и Data Science.

ab-testing data data-analysis datalense matplotlib numpy pandas plotly portfolio postgresql python scipy seaborn sql statistical-analysis

Last synced: 25 Feb 2026

https://github.com/gappeah/global-shipping-analytics-dashboard

This Tableau project provides a comprehensive visual analysis of global sales, shipping costs, and quality metrics across different regions and countries.

data data-analysis data-analyst data-visualization metrics tableau

Last synced: 25 Feb 2025

https://github.com/phillbertnevinemmanuel/movieindustryanalysis-correlation

This project is a comprehensive data analysis endeavor within the Movie Industry, spanning from Data Cleaning to Exploratory Data Analysis, Correlation Analysis, and Temporal Analysis. The dataset was sourced from Kaggle, purportedly scraped using the IMDb API. Python was the primary tool utilized for analysis.

data-analysis data-cleaning python

Last synced: 30 Apr 2026

https://github.com/ibnaleem/cyberchef-discord

A versatile Discord bot that implements CyberChef's features for encoding, decoding, encrypting, compressing, analysing data directly and more in your Discord server

compression cti cyberchef cybersecurity data-analysis data-manipulation discord-bot discord-js encoding encryption hashing infosec parsing redteam

Last synced: 28 Jan 2026

https://github.com/programmer-rd-ai/moviedatascraper

Explore the cinematic universe with our IMDb web scraping project! Dive into movie data with ease, uncovering insights from cast to critical reviews. With dynamic visualizations and reliable data, let's journey through the world of movies like never before. Lights, camera, analysis!

beautifulsoup beautifulsoup4 data data-analysis jupyter-notebook matplotlib numpy pandas programming python python3 scraping seaborn software web

Last synced: 01 Mar 2025

https://github.com/vatshayan/hospital-discharge-analysis

Analysis of Hospitalization Discharge Rates in Lake County, Illinois of various attributes like Anxiety, Alcohol, mood, Diabetes, Asthma, etc

data-analysis data-visualization jupyter-notebook machine machine-learning machine-learning-algorithms scikit-learn

Last synced: 04 Mar 2025

https://github.com/alrza2003/alrza2003.github.io

This repository contains the source files for my personal portfolio website. It highlights my background as a data analyst and radiology student, and showcases real-world projects, tools I use, and ways to connect with me. The site is based on a pre-built template that I customized to reflect my profile and experience.

data data-analysis data-visualization portfolio portfolio-website python

Last synced: 30 Apr 2026

https://github.com/hifza-khalid/book-management-system-sql

A Book Management System SQL project 📚 featuring tables for Authors ✍️, Books 📖, Customers 👤, and Orders 🛒. Includes sample queries for tracking book sales 💰, pricing by genre 🎭, and customer order history 📅.

book-management data-analysis database-management sql sql-queries

Last synced: 03 Feb 2026

https://github.com/cworld1/novel-analysis

A simple project for analyzing Chinese novels

data-analysis novel

Last synced: 17 Mar 2025

https://github.com/thevinh-ha-1710/rstudio-statistics

This project deeply studies 2 datasets using applied statistics techniques.

applied-statistics data-analysis data-science data-visualization rmarkdown rstudio

Last synced: 31 Jan 2026

https://github.com/swarnim1812/crime_project

AI-Driven Crime Forecasting Across Indian States — A pioneering machine learning project that harnesses time series modeling (SARIMAX, Ridge Regression) to uncover patterns and forecast crime trends using real-world multi-state temporal and socio-economic data.

analytics crime-locator crime-prediction data-analysis deep-learning machine-learning prophet-facebook sarimax-model time-series-forecasting

Last synced: 31 Jan 2026

https://github.com/saidsef/ff18

A complete catalog of all the players in Fifa 2018 and their complete statistics

data-analysis data-visualization fifa18 machine-learning machine-learning-algorithms world-cup-2018 world-cup-ranking

Last synced: 29 May 2026

https://github.com/gher-uliege/bluecloud-plankton

Spatial interpolation of plankton data using a neural network

data data-analysis data-visualization neural-network oceanography

Last synced: 30 Mar 2025

https://github.com/hetuvpatel/research-chatgpt

Research and data analysis project evaluating the social, ethical, and educational impacts of ChatGPT using survey-driven insights and Python-powered data analysis. 📚🤖

data-analysis matplotlib pandas python seaborn

Last synced: 01 May 2026

https://github.com/sing-group/bew

Public repository for Biofilmfs Experiment Workbench (BEW).

aibench data-analysis data-management java jfreechart workbench

Last synced: 03 Jul 2025

https://github.com/programmer-rd-ai/dimensionality-reduction

DimRed is a comprehensive Python toolkit for advanced dimensionality reduction, integrating with major machine learning libraries and featuring real-time performance monitoring to enhance data analysis and model efficiency.

analytics data-analysis data-science lightgbm machine-learning matplotlib numpy pandas programming python python3 sklearn university xgboost

Last synced: 01 Mar 2025

https://github.com/scarblase/salary-comparison

Submission for the DataCamp Salary Competition(1 level). 🏆

data data-analysis data-science data-visualization engineering python sql structured-data

Last synced: 01 May 2026

https://github.com/nmsby/pca-machine-learning-lab

Principal Component Analysis (PCA) implementation and analysis lab for Machine Learning. Features manual PCA implementation, scikit-learn applications, data compression, and feature extraction with detailed visualizations.

data-analysis dimensionality-reduction jupyter-notebook machine-learning numpy pca python scikit-learn visualization

Last synced: 01 May 2026

https://github.com/nafisalawalidris/international-breweries

This GitHub readme provides an overview of data analysis using SQL on the International Breweries dataset, including dataset description, analysis questions, example SQL queries, and key insights derived from the analysis.

data-analysis insights international-breweries-dataset queries sql

Last synced: 31 Jan 2026

https://github.com/nakshjainsonigara/vba-canteenmanagementsystem

The Canteen Management System is a comprehensive software solution designed to modernize and optimize canteen operations. It aims to simplify the complexities of managing a canteen by automating key processes such as order management, payment processing, and report generation.

canteen canteen-mangement-system charts data-analysis email excel microsoft payment-gateway vba vba-excel vba-macros word

Last synced: 30 Jan 2026

https://github.com/priyanshubiswas-tech/aws-mwaa-elt-airflow-sql-dbt-superset-project

This project was created as part of an assessment for DigitalXC AI. It demonstrates a cloud-based ELT pipeline using AWS MWAA, Airflow, dbt, PostgreSQL, and Superset. The pipeline automates data ingestion from S3, transformation with dbt, and visualization through Superset, following modern data engineering practices on a scalable AWS architecture.

apache-airflow apache-superset aws-s3 dag data-analysis data-engineering-pipeline data-visualization dbt elt-pipeline python rds-postgres

Last synced: 03 Jul 2025

https://github.com/billgewrgoulas/hypothesis-testing-on-37-seasons-of-nba

First assignment for the course Data Mining @CSE.UOI

data-analysis data-science numpy scipy seaborn statistics

Last synced: 01 May 2026

https://github.com/gesiscss/wikipedia-language-olga-master

Measuring Gender Inequalities of German Professions on Wikipedia

bias crowdflower data-analysis data-science gender images python statistics wikipedia

Last synced: 10 Apr 2026

https://github.com/ifibla/adsdb-project

Algorithms, Data Structures and Databases Project

data-analysis data-engineering python

Last synced: 12 Apr 2026

https://github.com/riddhis2226/titanic-survival-data-analysis

Titanic-Survival-Data-Analysis : Analyze passenger data from the Titanic to predict survival based on features like age, gender, class, and fare.

data-analysis data-mining data-science data-visualization database jupyter-notebook machine-learning-models machinelearning-python plotlyjs python3

Last synced: 01 May 2026

https://github.com/priyanshubiswas-tech/deloitte-daikibo-forensic-analysis-task-2

Forensic pay equity analyzer for Deloitte. Processes compensation data to classify gender equality scores into Fair/Unfair/Discriminative tiers. Outputs modified Excel with 3-tier evaluation system.

data data-analysis deloitte excel forensic-analysis

Last synced: 06 Feb 2026

https://github.com/vitia-fritelle/analise_dieese

Análise realizada com base nos dados extraídos do site https://www.dieese.org.br/analisecestabasica/salarioMinimo.html

data-analysis economic-data

Last synced: 09 Apr 2025

https://github.com/webuccinoco/mysql-pivot-tables

Build complex MySQL pivot tables without touching a single line of code. This free PHP tool lets you visually connect your database and map out your data sources with a few simple clicks.

business-analytics business-intelligence crosstab data-analysis data-analytics data-visualization mysql mysql-database mysql-pivot-table mysql-reports mysql-virtualization php php-pivot-table php-reports pivot-tables reporting-tools

Last synced: 04 Feb 2026

https://github.com/seekinginfiniteloop/fedcal

A feature-rich Python calendar that enables time series analyses of changes in federal workforce schedules and shifts in executive department funding status.

data-analysis data-science econometrics economic-data economics federal federal-government hr pandas pandas-library pandas-python pydata python

Last synced: 15 Apr 2026

https://github.com/nikhilash45/live_ipl_report

This repository hosts the source code for an interactive IPL (Indian Premier League) Dashboard built using PowerBI. The dashboard provides real-time updates on ongoing matches, including live scores, batting and bowling statistics for both teams, and the points table.

analysts cleaning-data cricket-data dashboard data data-analysis data-visualization dax powerbi

Last synced: 19 Mar 2026

https://github.com/vasishth/lecturesintrobayes

Please go to the website for these online lectures:

bayesian-inference brms data-analysis stan

Last synced: 06 Feb 2026

https://github.com/siddharthbadal/sql-case-studies-data-analysis

Data Analysis case studies on various databases using SQL . Demonstrating proficiency in solving diverse business problems. Projects cover sales, orders, products, finance, healthcare and other sectors, and highlight my ability to analyze complex datasets through SQL queries, data manipulation, and visualization techniques.

data-analysis sql sql-query sql-server sqlserver

Last synced: 08 Jan 2026

https://github.com/virajbhutada/global-universities-success-analysis-powerbi-sql-excel

This capstone project conducts in-depth analysis using Power BI, SQL, and Excel to explore complex dynamics shaping global university success. Integrating data from diverse ranking systems and criteria, our aim is to unravel the factors influencing universities worldwide.

capstone capstoneproject data-analysis data-analytics data-insights data-science data-science-projects data-visualization excel exploratory-data-analysis mece mysql powerbi powerpoint sql

Last synced: 20 Jun 2025