An open API service indexing awesome lists of open source software.

Data analysis

Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.

https://github.com/flexmonster/svelte-flexmonster

Svelte wrapper for Flexmonster Pivot Table & Charts

data-analysis data-visualization frontend pivot-tables svelte sveltekit

Last synced: 27 Feb 2026

https://github.com/fer-aguirre/taller-cookiecutter

Taller sobre cómo usar Cookiecutter para análisis de datos.

cookiecutter data-analysis project-template workshop

Last synced: 19 Mar 2026

https://github.com/allanotieno254/employee-performance-tracker-excel-

An Excel-based tool to track and evaluate employee performance, compliance, and skills assessments with summary statistics and visual charts

compliance-tracker data-analysis employee-performance-analysis excel human-resources

Last synced: 19 Mar 2026

https://github.com/allanotieno254/road-accident-data-analysis-dashboard-using-excel

This repository contains the Road Accident Data Analysis Dashboard, a comprehensive Excel-based tool designed to provide in-depth analysis and visualization of road accident data.

dashboards-excel data-analysis excel kpi visualization

Last synced: 19 Mar 2026

https://github.com/luminati-io/shopee-dataset-samples

A sample dataset of over 1000 Shopee products, extracted using the Bright Data API, ideal for pricing optimization, gap analysis, and market strategy refinement..

api data-analysis data-mining datasets products shopee web-scraping

Last synced: 12 Feb 2026

https://github.com/arhcoder/base-hackathon-2022

💸 Sistema que analiza las facturas de compra-venta de una empresa de importaciones y exportaciones, y crea una base de conocimiento con la que crea sugerencias de abastecimiento para las empresas clientes de Banco BASE, con el fin de ahorrarles dinero.

algorithms bank companies data-analysis decision-making exportation hackaton importation javascript mysql python suggestions

Last synced: 16 Apr 2026

https://github.com/mattdelaune/retail_rfm_analysis

Power BI multi-page report leveraging advanced data visualization for RFM analysis. Delivers deep analytical insights into customer behavior, engagement, and spending patterns, driving strategic business decisions.

data-analysis dax powerbi report rfm-analysis sales-data visualization

Last synced: 19 Mar 2026

https://github.com/allanotieno254/powerbi-chocolate-sales-analysis-dax-calculations-80-

This Power BI project analyzes **chocolate sales performance using advanced DAX calculations and interactive visualizations. The report provides insights into monthly revenue, top-selling products, sales trends, and market performance.

business-intelligence data-analysis dax powerbi powerbi-dashboards powershell-module sales-analysis visualization

Last synced: 13 Feb 2026

https://github.com/mikasenghaas/covid19-analysis

analysis of correlation between covid-19 infection numbers and weather data from the beginning of the pandemic until april 2021

data-analysis statistical-analysis

Last synced: 14 Feb 2026

https://github.com/dina-hosny/telco-customer-churn-analysis-using-power-bi

An interactive dashboard to represent some analysis of "Telco customer churn" data and the reasons that made customers churn using Microsoft Power BI.

business-intelligence data-analysis data-modeling data-visualization power-bi powerbi

Last synced: 19 Mar 2026

https://github.com/spaghettifunk/gvb

Analysis of GVB in Amsterdam

data-analysis public-transportation

Last synced: 28 Feb 2026

https://github.com/gab-182/market-analysis-report-for-national-clothing-chain

Using custom M and DAX codes in Power BI, I conducte a thorough market analysis for a national clothing chain. The insights gathered from customer data and US Census Bureau statistics led to the formulation of a targeted marketing strategy, contributing to enhanced sales and customer satisfaction.

data-analysis power-bi

Last synced: 19 Mar 2026

https://github.com/szapp/magistdataanalysisvisualization

Case study: Data analysis and visualization to evaluate and recommend a business partnership. Team project for data-driven business with SQL and Tableau

dashboard data-analysis data-science data-visualization data-viz sql tableau-public

Last synced: 19 Mar 2026

https://github.com/edisedis777/duckdb-analyzer

A powerful tool for analyzing large CSV datasets using DuckDB.

csv data-analysis database duckdb

Last synced: 16 Apr 2026

https://github.com/leosimoes/datascienceacademy-powerbi-3.0

Projetos do curso Microsoft Power BI Para Data Science Versão 3.0 da DataScienceAcademy. Dashboards para diversos casos de negócios.

business-intelligence dashboards data-analysis data-visualization microsoft-power-bi

Last synced: 19 Mar 2026

https://github.com/shadan100/sales-prediction-analysis

The aim is to build a predictive model and find out the sales of each product at a particular store. Using this model, BigMart will try to understand the properties of products and stores which play a key role in increasing sales.

artificial-intelligence data-analysis data-science django django-framework jupyter-notebook machine-learning matplotlib pandas predictive-modeling python sales-prediction

Last synced: 01 Mar 2026

https://github.com/antononcube/wl-datareshapers-paclet

Wolfram Language (aka Mathematica) paclet for data reshaping functions, like, long- and wide form, cross tabulation, etc.

contingency-table cross-tabulation data-analysis data-transformation long-form wide-form

Last synced: 20 Mar 2026

https://github.com/lijesh010/globalsuperstoresalesanalysis

The Global Superstore Sales Analysis repository showcases a comprehensive Power BI dashboard that provides valuable insights into sales performance. This project is designed to present key information and trends to stakeholders, enabling informed decision-making.

dashboard data-analysis data-visualization msexcel power-bi sales-analysis

Last synced: 19 Mar 2026

https://github.com/tnleite/projeto_king_lift

Este projeto apresenta uma análise detalhada dos dados financeiros da King Lift, uma empresa de locação de empilhadeiras. Utilizando Microsoft Excel, Power Query e Power Pivot, desenvolvi um dashboard interativo, também em Excel, que ajuda a empresa a obter insights valiosos para melhorar a eficiência operacional e aumentar o faturamento.

data-analysis data-science data-visualization excel

Last synced: 19 Mar 2026

https://github.com/wrighang/shipping-data-analysis

Independent Project: Transit time trends analysis following a major shipping process change.

data-analysis matplotlib numpy pandas python

Last synced: 18 Apr 2026

https://github.com/mayankyadav23/air-bnb-data-analysis

Data analysis and insights from NYC Airbnb listings, focusing on key metrics such as host performance, neighborhood trends, pricing, and customer reviews. Comprehensive documentation of ETL processes and analytical methodologies is provided. Perfect for understanding Airbnb dynamics and decision-making in the NYC market.

advanced-excel business-intelligence data-analysis data-analytics data-visualization power-bi ppt

Last synced: 19 Mar 2026

https://github.com/sleeplessglory/big-data

Projects regarding big data analysis, presented within Jupyter Notebook

big-data data-analysis data-visualization jupyter python

Last synced: 16 Apr 2026

https://github.com/snehilk1312/data_science

This Repository contains the Data Science things I have done in recent times along with visualization , cleaning , models, statistics, Courses, Datasets. :=)

data-analysis data-science glove natural-language-processing nlp nltk statistics word2vec

Last synced: 02 Apr 2026

https://github.com/reza-saeedi-coding/netflix-data-analysis

A complete end-to-end Netflix dataset analysis using Python, SQL, and Matplotlib. Explores genres, content ratings, and trends using exploratory data analysis and visualizations.

data-analysis data-cleaning eda matplotlib netflix pandas portfolio-project python sql sqlite

Last synced: 17 Apr 2026

https://github.com/harshmule1/school-data-analysis-

School Data Analysis Using SQL

data-analysis mssql sql

Last synced: 04 Apr 2026

https://github.com/alfikiafan/air-quality-analysis

This repository contains a comprehensive data analysis project on Air Quality Dataset, covering the complete data analysis process from data gathering, cleaning, exploratory data analysis (EDA), to building a fully interactive dashboard using Streamlit.

air-quality data-analysis dicoding

Last synced: 17 Apr 2026

https://github.com/kalfasyan/filoma

profiling files, directories, image data

data-analysis profiler validation

Last synced: 05 Apr 2026

https://github.com/jrbourbeau/cr-composition

IceCube cosmic-ray composition analysis

cosmic-rays data-analysis machine-learning physics python

Last synced: 20 Apr 2026

https://github.com/rakumar99/power-bi-projects

This repository contains various power bi projects and dashboards of Humaan Resources , Financial Analysis using Power BI Desktop.

dashboards data-analysis data-visualization databases datacleaning datamodeling etl powerbi powerquery reports

Last synced: 04 Jun 2026

https://github.com/aravind-selvam/bikeshare-company-analysis

Google Data Analytics Professional Certificate program's Capstone project, of a bike sharing company

analytics business-analytics business-intelligence data data-analysis data-visualization dataanalytics google-data-analytics postgresql sql sql-server

Last synced: 22 Apr 2026

https://github.com/virajbhutada/movie-rental-store-analytics-sql-powerbi-excel

Dive into the DVD rental industry with my Capstone project, Movie Rental Analytics. Analyzing the Sakila DVD Rental Store Database, I extract insights through exploratory data analysis (EDA) and Power BI visualizations. Findings inform strategies for optimizing film inventory, enhancing business operations, and customer experiences.

business-intelligence capstone-project customer-behavior-analysis data-analysis data-science excel exploratory-data-analysis film-ratings mece movie-database movie-rental mysql powerbi powerbi-visuals revenue-analysis sql sql-database

Last synced: 05 Jun 2026

https://github.com/datavil/framex

A light-weight, dataset obtaining library for fast prototyping, tutorial creation, and experimenting.

data-analysis data-fetching data-science dataframe datasets visualization

Last synced: 06 Jun 2026

https://github.com/chandansoren/diabetics_prediction

Predicting that whether the patient has diabetes or not on the basis of the features we will provide to our machine learning model.

data-analysis machine-learning python svm

Last synced: 06 Jun 2026

https://github.com/mr-chang95/loan_data_visualization

Data Visualization Project for Udacity's Data Analyst Program. Using Python in Jupyter Notebook.

data-analysis data-visualization jupyter-notebook loans python udacity-data-analyst-nanodegree

Last synced: 24 Apr 2026

https://github.com/leosimoes/uerj-tcc-analisador-dados

Trabalho de conclusão de curso (TCC) em Engenharia de Computação. Aplicativo Web para preparação e análise de dados, criação de gráficos e modelos de regressão linear e logistica.

computer-engineer data-analysis data-science data-visualization linear-logistic linear-regression python streamlit

Last synced: 24 Apr 2026

https://github.com/flyingfathead/neurograph-framework

A versatile tool for visualizing entropy loss in TensorFlow-based neural network training, providing insightful scatter plots with annotations.

data-analysis data-analysis-python data-visualization entropy graph graphs neural-network neural-networks neural-networks-visualization nn python python3 tensorflow tensorflow2 training visualization visualization-tools

Last synced: 24 Apr 2026

https://github.com/asifdotexe/quickvu

Quick VU: No-code, data cleaning analysis and visualization tool built on Streamlit. Quickly clean, visualize, explore, and understand data relationships and correlations with ease. Perfect for analysts, business users, and anyone looking to gain data insights—without writing a single line of code.

automation data-analysis data-cleaning data-visualization python3 streamlit-application toolkit

Last synced: 06 Jun 2026

https://github.com/faezeh-gholamrezaie/visual-google-scholar-search

A Python script that searches Google Scholar for specific keywords and visually presents the results in various chart formats, enabling researchers to analyze trends and insights in academic literature.

academic academic-research academic-trends ai ai-research bibliometrics data-analysis data-visualization google-scholar publication-analysis python research-trends scholarly scholarly-data word-cloud

Last synced: 25 Apr 2026

https://github.com/airdac/sim-ames_housing

Prediction of house prices with linear regression in R. Team project from UPC's Master's Degree in Data Science

data-analysis data-science linear-regression r statistical-models upc

Last synced: 07 Jun 2026

https://github.com/sarthak-0-sach/drivermasterdata_database_table

This code enables data integration from multiple sources and ensures a single source for all driver-related attributes. Designed for scalability and pipeline compatibility, this project supports clean data transformations, validations, and storage-ready outputs. Ideal for quick analytics, created using python & airflow, automated using cronjob.

apache-airflow-etl-pipeline data-analysis data-visualization database-management python

Last synced: 27 Apr 2026

https://github.com/jongan69/potion-leaderboard

Start of Entry for potion leaderboard contest

data-analysis leaderboard potion trading

Last synced: 11 Jun 2026

https://github.com/alxrm/scent-of-literature

Russian literature sentiment analysis in terms of very small dataset

classification data-analysis sentiment-analysis sklearn tf-idf

Last synced: 28 Apr 2026

https://github.com/vyjayanthipolapragada/early_sepsis_detection_ml

An end-to-end project leveraging clinical datasets (PhysioNet, MIMIC-IV, MIMIC-IV-ED) to develop and compare ML and LSTM-based models for early sepsis prediction.

data-analysis data-visualization deep-learning healthcare jupyter-notebook keras-tensorflow lstm-neural-networks machine-learning neural-network python

Last synced: 28 Apr 2026

https://github.com/alexandrelamarre/fission

Data analytics & Structured streaming optimized for the Edge

data-analysis data-engineering rust structured-data unstructured-data

Last synced: 08 Jun 2026

https://github.com/pheithar/socialdata_madridcentral

Social data and visualization course at DTU - 2022. Effectiveness of Madrid Central

data-analysis data-visualization jupyer-notebook madrid python

Last synced: 28 Apr 2026

https://github.com/affec-ds/netflix-recommender-system

Sistema de recomendación de títulos de Netflix basado en contenido. Incluye filtros por título, género y tipo de contenido (películas o series) con interfaz interactiva en Jupyter Notebook.

content-based-recommendation data-analysis eda ipywidgets jupyter-notebook machine-learning movies netflix portfolio-project python recommender-system

Last synced: 28 Apr 2026

https://github.com/faris771/investigate_a_dataset

This repository contains a Jupyter Notebook that investigates a dataset using data analysis techniques.

data-analysis

Last synced: 29 Apr 2026

https://github.com/jhrcook/wagenmaker-data-analysis

Analysis of Registered Replication Report: Strack, Martin, & Stepper (1988) by Wagenmaker et al.

data-analysis r r-project statistics

Last synced: 08 Jun 2026

https://github.com/iamjuniorb/data_structures_and_algorithms

I'm working on Data Structures and Algorithms I C949 class in school and decided to write up all of these searching algorithms, sorting algorithms, strutures, and so on to get a better understanding. These can be used with large datasets to test their space and time complexities.

data data-analysis data-science data-structures datastructures datastructures-algorithms datastructuresandalgorithm math mathematics programming python python-app python-library python3

Last synced: 08 Jun 2026

https://github.com/27ahmad/foreign-direct-investment-analytics

This repository contains an exploratory data analysis (EDA) and visualization project on a dataset of Foreign Direct Investment (FDI) by companies. The objective is to analyze FDI trends and present key insights through an interactive Tableau dashboard.

data-analysis eda matplotlib pandas python seaborn tableau

Last synced: 29 Apr 2026

https://github.com/ayu-hack/ayu-hack

Enthusiastic learner passionate about building software and exploring the world of technology. Eager to contribute to open-source projects and collaborate with the developer community. Continuously developing my skills in Python,SQL,HTML,CSS,PowerBI, MacOS. Always open to feedback and excited to keep growing!

config css data-analysis github-config html powerbi-desktop python3 sql

Last synced: 30 Apr 2026

https://github.com/alcestide/scianalytics

Playground for Data Analysis and Visualization for Research and Scientifical Purposes with Pandas and Plotly.

csv data-analysis data-science data-visualization pandas plotly python science-research statistics

Last synced: 30 Apr 2026

https://github.com/mkk-1817/hr-attrition

This project, conducted during my internship at MeriSKILL, focuses on HR Attrition Prediction using advanced Machine Learning models. The initiative includes the development of a dynamic Dashboard and in-depth Analysis to offer actionable insights for proactive human resource strategies.

data-analysis data-science data-visualization jupyter-notebook machine-learning-algorithms powerbi python

Last synced: 03 May 2026

https://github.com/hetuvpatel/research-chatgpt

Research and data analysis project evaluating the social, ethical, and educational impacts of ChatGPT using survey-driven insights and Python-powered data analysis. 📚🤖

data-analysis matplotlib pandas python seaborn

Last synced: 01 May 2026

https://github.com/billgewrgoulas/hypothesis-testing-on-37-seasons-of-nba

First assignment for the course Data Mining @CSE.UOI

data-analysis data-science numpy scipy seaborn statistics

Last synced: 01 May 2026

https://github.com/lisa-ho/breadit

Respository for scraping and analysing data from the Reddit/Sourdough community to explore lockdown baking trends.

data-analysis data-viz nltk python reddit-api sentiment-analysis web-scraping

Last synced: 01 May 2026

https://github.com/riddhis2226/titanic-survival-data-analysis

Titanic-Survival-Data-Analysis : Analyze passenger data from the Titanic to predict survival based on features like age, gender, class, and fare.

data-analysis data-mining data-science data-visualization database jupyter-notebook machine-learning-models machinelearning-python plotlyjs python3

Last synced: 01 May 2026

https://github.com/com-480-data-visualization/project-2023-choo-choo-data-darlings

This repository contains the source code for our data visualization project, an interactive platform designed to explore the intricate Swiss transportation network. Developed by the Choo Choo Data Darlings team at EPFL, the project provides an in-depth view into the vast array of Swiss transportation operations, including trains, buses, and trams.

boats buses data-analysis data-science data-visualisation data-visualization epfl metro public-transport public-transportation switzerland trains trams

Last synced: 01 May 2026

https://github.com/henrylin03/china-gdp

Analysis and visualisation of China GDP data using Python.

data data-analysis data-visualisation dataset kaggle pandas

Last synced: 01 May 2026

https://github.com/emso-exe/reclamacoes_de_consumidores_com_empresa_de_telecomunicacoes

Projeto de análise de reclamações de consumidores com empresa de telecomunicações no 1º semestre de 2021 com base nos dados do site consumidor.gov.br.

analise-de-dados ciencia-de-dados data-analysis data-science datascience python python-3 python3

Last synced: 02 May 2026

https://github.com/dangerousfish/uk-climate-trends-dashboard-metoffice

A data pipeline and Streamlit dashboard that aggregates, cleans and visualises historical UK Met Office station data - interactive charts, heatmaps and maps for temperature, rainfall and sunshine.

climate climate-analysis climate-change climate-data climate-science data-analysis data-visualization metoffice metofficeweather streamlit temperature weather

Last synced: 02 May 2026

https://github.com/melogabriel/nubank-expenses-analysis

This project consolidates monthly credit card statement data from Nubank into a single CSV file using Python, enabling data visualization through a Google Sheets dashboard in Looker Studio.

data-analysis data-visualization googlesheets lookerstudio pandas python

Last synced: 02 May 2026

https://github.com/msthamizh/phonepe-pulse-data-visualization-and-exploration

Developing a Streamlit application that allows users to explore and analyze transaction data from the PhonePe Pulse dataset. The project aims to provide insights into digital payment trends across India.

data-analysis data-visualization dataframe mysql pandas plotly python streamlit

Last synced: 02 May 2026

https://github.com/seankwarren/water-quality-analysis

An examination of water quality in the Atlanta watershed with a focus on identifying neglected areas and potential strategies for improving water quality monitoring

analytics data-analysis jupyter-notebook python

Last synced: 03 May 2026

https://github.com/fybex/chatgpt-conversations-analysis

Analysis of 89,000 ChatGPT conversations to understand interaction patterns and response behaviors.

chatgpt conversation-analysis data-analysis data-visualization language-analysis prompt-patterns sentiment-analysis

Last synced: 02 May 2026

https://github.com/ferrangarciarovira/premier-league-betting-analysis

Comprehensive Python analysis of Premier League betting market inefficiencies (2005–2024). Evaluates bookmaker biases, betting strategies, and market efficiency using statistical methods and Monte Carlo simulations.

betting-strategies bias-detection data-analysis market-efficiency monte-carlo-simulation premier-league python sports-analytics

Last synced: 03 May 2026

https://github.com/cs-joy/pandasv2.0.3

learn data analysis with pandas

data-analysis pandas pandas-learning

Last synced: 03 May 2026

https://github.com/manikantasanjay/time_series_data_analysis_on_stocks

Time Series Data Analysis project on Daily Stock Prices of the following companies(Apple, Microsoft, Google, Amazon) for a span of 5 years.

data-analysis pandas stock time-series time-series-analysis

Last synced: 03 May 2026

https://github.com/theairbend3r/mice-memory-response

Effect of memory on current response in mice using methods from computational neuroscience and machine learning.

computational-neuroscience data-analysis data-science machine-learning neuroscience python

Last synced: 09 Jun 2026

https://github.com/zeynepcol/data-analysis-visualization

Data visualization and interactive analytics - Olympics Dataset

data-analysis data-science data-visualization matplotlib pandas plotly python scipy seaborn streamlit

Last synced: 03 May 2026

https://github.com/0xjeremy/me-18-final

Data collection and Analysis tools for IMUs

data-analysis imu raspberry-pi

Last synced: 03 May 2026

https://github.com/shelton-beep/predicting-gpa-using-lifestyle-factors

Predicting student GPA using lifestyle factors like study habits, sleep, and stress levels. A machine learning model built to help students and educators understand the impact of lifestyle choices on academic performance.

data-analysis data-preprocessing data-science feature-engineering gpa-prediction machine-learning model-interpretability predictive-modeling python regression-analysis student-performance xgboost

Last synced: 04 May 2026

https://github.com/mystique85/altseason-ethereum-analysis

Altcoin season analysis relative to Ethereum – price comparisons, technical indicators, and historical market trends

altcoins bitcoin blockchain crypto data-analysis ethereum investing

Last synced: 04 May 2026

https://github.com/ruchit0807/heart_disease_prediction

An interactive ML-powered web app that predicts the risk of heart disease based on clinical inputs like age, chest pain, cholesterol, ECG, and more. Built using Python, Streamlit, and scikit-learn, it offers early risk assessment in a simple and accessible way—just enter your health metrics and get instant feedback.

data-analysis data-science knn-regression pandas streamlit

Last synced: 04 May 2026

https://github.com/gowthamsundaresan/eigenscan

blockexplorer for eigenlayer

crypto data-analysis eigenlayer nextjs web3

Last synced: 04 May 2026

https://github.com/ehtisham-sadiq/building-an-ml-based-heart-disease-diagnosis-system-with-flask

It is an end-to-end project that combines machine learning to create a user-friendly Heart Disease Diagnosis System, powered by Flask.

data-analysis exploratory-data-analysis feature-engineering flask machine-learning model-building model-evaluation pipelines python3 rest-api

Last synced: 04 May 2026

https://github.com/scarblase/sales_insights

A data-driven analysis of 15,000 sales records using Python, Pandas, and visualizations to uncover trends, optimize strategies, and enhance business performance. 🚀📊

data-analysis data-visualization dataset matplotlib-pyplot pandas python3 sales-analysis seaborn

Last synced: 05 May 2026

https://github.com/kimtth/agent-data-analyst-stream-chainlit

⚡️Chainlit-based Data Analyst Chat Agent (Responses API, Server Sent Events) 📈

agent azure-openai chainlit code-interpreter data-analysis server-sent-events stream-response

Last synced: 09 Jun 2026

https://github.com/shuddha2021/stellar-candidate-selector

A sophisticated candidate selection algorithm leveraging multi-criteria analysis and machine learning to identify top software engineering candidates. This tool features flexible filtering, score adjustment, and detailed visualizations to streamline the recruitment process.

candidate-selection data-analysis data-visualization machine-learning pandas plotting-in-python python python-data-analysis recruitment scikit-learn

Last synced: 05 May 2026

https://github.com/elcaiseri/udacity-advanced-data-analysis

UDACITY - Advanced-Data-Analysis Track Project

data-analysis python

Last synced: 05 May 2026