An open API service indexing awesome lists of open source software.

Data analysis

Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.

https://github.com/jamiemagee/rhi

Collating the data on the Renewable Heat Incentive scheme, and presenting it in a more readable format.

data-analysis open-data open-government rhi

Last synced: 25 Feb 2026

https://github.com/stastnypremysl/lsql-csv

lsql-csv is a tool for small CSV file data querying from a shell with short queries. It makes it possible to work with small CSV files like with a read-only relational databases. The tool implements a new language LSQL similar to SQL, specifically designed for working with CSV files in a shell. LSQL aims to be a more lapidary language than SQL.

csv data-analysis data-processing haskell language linux-shell lsql lsql-csv new-language query-language relational-database sql unix-command unix-philosophy unix-shell

Last synced: 25 Feb 2026

https://github.com/nakshjainsonigara/vba-canteenmanagementsystem

The Canteen Management System is a comprehensive software solution designed to modernize and optimize canteen operations. It aims to simplify the complexities of managing a canteen by automating key processes such as order management, payment processing, and report generation.

canteen canteen-mangement-system charts data-analysis email excel microsoft payment-gateway vba vba-excel vba-macros word

Last synced: 30 Jan 2026

https://github.com/nafisalawalidris/international-breweries

This GitHub readme provides an overview of data analysis using SQL on the International Breweries dataset, including dataset description, analysis questions, example SQL queries, and key insights derived from the analysis.

data-analysis insights international-breweries-dataset queries sql

Last synced: 31 Jan 2026

https://github.com/tynoee/record_company-database

A record company database with multiple query commands using SQL

data-analysis sql

Last synced: 31 Jan 2026

https://github.com/swarnim1812/crime_project

AI-Driven Crime Forecasting Across Indian States — A pioneering machine learning project that harnesses time series modeling (SARIMAX, Ridge Regression) to uncover patterns and forecast crime trends using real-world multi-state temporal and socio-economic data.

analytics crime-locator crime-prediction data-analysis deep-learning machine-learning prophet-facebook sarimax-model time-series-forecasting

Last synced: 31 Jan 2026

https://github.com/thevinh-ha-1710/rstudio-statistics

This project deeply studies 2 datasets using applied statistics techniques.

applied-statistics data-analysis data-science data-visualization rmarkdown rstudio

Last synced: 31 Jan 2026

https://github.com/victor-antoniassi/junior_data_analyst_test_01

Solution developed for a technical assessment that analyzed video game sales data to support gaming partnership decisions.

asses assessment-project data-analysis data-analysis-project data-analyst duckdb etl prefect python

Last synced: 01 Jun 2026

https://github.com/phammings/sales-management-analysis

Sales management analysis and Power BI dashboard for sample business request and user stories

data-analysis excel powerbi sql

Last synced: 01 Feb 2026

https://github.com/elissorokin/data-analyst-portfolio-rus

Это репозиторий, в котором я демонстрирую свои навыки, делюсь проектами и отслеживаю прогресс в области анализа данных и Data Science.

ab-testing data data-analysis datalense matplotlib numpy pandas plotly portfolio postgresql python scipy seaborn sql statistical-analysis

Last synced: 25 Feb 2026

https://github.com/madeiradata/microsoft-data-analysts-club

Open-source Repository of Useful Scripts and Solutions for Microsoft Data Analysts

data-analysis data-visualization microsoft-data-analysis powerbi powerbi-report

Last synced: 19 Mar 2026

https://github.com/ac12644/fractz-ai-data-analyst

Analyze data and gain insights instantly with FRACTZ's AI Data Analyst. Flexible, fast analytics tailored to your needs.

ai data-analysis data-visualization

Last synced: 01 Feb 2026

https://github.com/uchida16104/healthanalysis

It abstracts the health status of each device from its operational time calculated from RescueTime and analyzes the data.

data-analysis portfolio portfolio-website security security-tool

Last synced: 02 Feb 2026

https://github.com/maugus0/sats-flight-data-fetcher

A simple Python tool to fetch and analyze flight data for 15+ major airlines using the AirLabs API.

airline-data cli-tool data-analysis flight-data python3

Last synced: 17 Mar 2026

https://github.com/akash1070/project---applied-statistics-

To dive deep into this data & find some valuable insights.

data-analysis data-science python statistics

Last synced: 30 Apr 2026

https://github.com/jakobzmrzlikar/fake-news-analysis

An analysis of the FakeNewsNet dataset using NLP techniques.

data-analysis fake-news ipynb-jupyter-notebook nlp-machine-learning

Last synced: 05 Mar 2026

https://github.com/pngo1997/axa-xl-insurance-bi-dashboard

Provides a comprehensive analysis of insurance submissions, approvals, compliance rates, and profitability for AXA XL Insurance.

bi-analytics bi-dashboard business-analytics data-analysis filtering performance-analysis powerbi segmentation visualization

Last synced: 08 Feb 2026

https://github.com/chahiriabderrahmane/carpricepredictor

🚗 Cars Exploration & Price Prediction | Analyzing Cars.com Listings

data-analysis data-science data-visualization machine-learning python streamlit web-scraping

Last synced: 08 Feb 2026

https://github.com/ralstonraphael/ralston_ethan_datastory

Mapping Change: Advancing Housing Equity and Stability in Connecticut's Communities

census-data data-analysis data-visualization datawrapper google-sheets housing html json

Last synced: 09 Feb 2026

https://github.com/nurfakhri/e-commerce-data-analyst

E-commerce data analysis supported by data wrangling, EDA, and web dashboard

dashboard data-analysis e-commerce flask-application python

Last synced: 10 Feb 2026

https://github.com/flexmonster/svelte-flexmonster

Svelte wrapper for Flexmonster Pivot Table & Charts

data-analysis data-visualization frontend pivot-tables svelte sveltekit

Last synced: 27 Feb 2026

https://github.com/phillbertnevinemmanuel/dataprofessionalquestionnaireanalysis_pbix

This project delves into the demographics and job satisfaction of data professionals, presenting insights through a user-friendly dashboard built with Power BI. The dataset has been graciously provided by Alex The Analyst

dashboard data-analysis powerbi visualization

Last synced: 19 Mar 2026

https://github.com/patriloto/intro_r_para_reinventartec_2021

Material del taller Primeros pasos en R para el análisis de datos

data-analysis rstats

Last synced: 12 Feb 2026

https://github.com/fer-aguirre/taller-cookiecutter

Taller sobre cómo usar Cookiecutter para análisis de datos.

cookiecutter data-analysis project-template workshop

Last synced: 19 Mar 2026

https://github.com/allanotieno254/employee-performance-tracker-excel-

An Excel-based tool to track and evaluate employee performance, compliance, and skills assessments with summary statistics and visual charts

compliance-tracker data-analysis employee-performance-analysis excel human-resources

Last synced: 19 Mar 2026

https://github.com/allanotieno254/road-accident-data-analysis-dashboard-using-excel

This repository contains the Road Accident Data Analysis Dashboard, a comprehensive Excel-based tool designed to provide in-depth analysis and visualization of road accident data.

dashboards-excel data-analysis excel kpi visualization

Last synced: 19 Mar 2026

https://github.com/luminati-io/shopee-dataset-samples

A sample dataset of over 1000 Shopee products, extracted using the Bright Data API, ideal for pricing optimization, gap analysis, and market strategy refinement..

api data-analysis data-mining datasets products shopee web-scraping

Last synced: 12 Feb 2026

https://github.com/arhcoder/base-hackathon-2022

💸 Sistema que analiza las facturas de compra-venta de una empresa de importaciones y exportaciones, y crea una base de conocimiento con la que crea sugerencias de abastecimiento para las empresas clientes de Banco BASE, con el fin de ahorrarles dinero.

algorithms bank companies data-analysis decision-making exportation hackaton importation javascript mysql python suggestions

Last synced: 16 Apr 2026

https://github.com/mattdelaune/retail_rfm_analysis

Power BI multi-page report leveraging advanced data visualization for RFM analysis. Delivers deep analytical insights into customer behavior, engagement, and spending patterns, driving strategic business decisions.

data-analysis dax powerbi report rfm-analysis sales-data visualization

Last synced: 19 Mar 2026

https://github.com/garcane/london-housing-price-dashboard

This Excel-based Housing Visual Dashboard provides a comprehensive view of average house prices across various boroughs in London from 1996 to 2013. The dashboard is designed to offer insights into housing market trends and price variations across different areas of London over time.

data data-analysis data-visualization excel visual

Last synced: 13 Feb 2026

https://github.com/thevinh-ha-1710/diabetes-predictive-model

This project aims to train a predictive model to diagnose diabetes on women patients.

data-analysis data-science data-visualization model-training-and-evaluation python

Last synced: 13 Feb 2026

https://github.com/allanotieno254/powerbi-chocolate-sales-analysis-dax-calculations-80-

This Power BI project analyzes **chocolate sales performance using advanced DAX calculations and interactive visualizations. The report provides insights into monthly revenue, top-selling products, sales trends, and market performance.

business-intelligence data-analysis dax powerbi powerbi-dashboards powershell-module sales-analysis visualization

Last synced: 13 Feb 2026

https://github.com/nikhilash45/power-bi-vsualisation-of-joins

In This Power Bi Report User Can Visualis Join By Themselves , and it is easy to understand joins now.

business-analytics business-intelligence data data-analysis data-visualization joins powerbi sql visualization

Last synced: 19 Mar 2026

https://github.com/mikasenghaas/covid19-analysis

analysis of correlation between covid-19 infection numbers and weather data from the beginning of the pandemic until april 2021

data-analysis statistical-analysis

Last synced: 14 Feb 2026

https://github.com/daniel1kp/openrtb-dashboard

This is a demo project designed to illustrate using Rill to analyze programmatic bid logs using the canonical open RTB framework.

data-analysis openrtb real-time-bidding rill

Last synced: 19 Mar 2026

https://github.com/dina-hosny/telco-customer-churn-analysis-using-power-bi

An interactive dashboard to represent some analysis of "Telco customer churn" data and the reasons that made customers churn using Microsoft Power BI.

business-intelligence data-analysis data-modeling data-visualization power-bi powerbi

Last synced: 19 Mar 2026

https://github.com/spaghettifunk/gvb

Analysis of GVB in Amsterdam

data-analysis public-transportation

Last synced: 28 Feb 2026

https://github.com/garcane/british-airways-analysis

This project focuses on analyzing and visualising travel data from British Airways using Tableau. The goal is to extract insights and present them in an interactive and visually appealing manner.

data data-analysis data-visualization tableau

Last synced: 19 Mar 2026

https://github.com/gab-182/market-analysis-report-for-national-clothing-chain

Using custom M and DAX codes in Power BI, I conducte a thorough market analysis for a national clothing chain. The insights gathered from customer data and US Census Bureau statistics led to the formulation of a targeted marketing strategy, contributing to enhanced sales and customer satisfaction.

data-analysis power-bi

Last synced: 19 Mar 2026

https://github.com/chiemekaifemegbulem/make.com

A curated portfolio of Make.com automation workflows engineered to streamline operations and ensure precision. Featuring solutions for e-commerce, data integration, marketing, and bespoke business processes, it exemplifies expertise in designing scalable, efficient, and dependable automated systems.

api automate automated automation business data-analysis data-science dataengineering integration integromat make scenario software-engineering upwork workflows

Last synced: 15 Feb 2026

https://github.com/prashver/dashboard-gallery

These dashboards provide insights across diverse domains, including cryptocurrency sales, workforce challenges, disease impact analysis, and retail trends. Leveraging tools like Power BI and Excel, they offer actionable insights for decision-making.

cryptocurrency dashboards data-analysis data-profession data-visualization market-segmentation-analysis microsoft-excel monkey-pox powerbi product-analysis retail-trends

Last synced: 15 Feb 2026

https://github.com/szapp/magistdataanalysisvisualization

Case study: Data analysis and visualization to evaluate and recommend a business partnership. Team project for data-driven business with SQL and Tableau

dashboard data-analysis data-science data-visualization data-viz sql tableau-public

Last synced: 19 Mar 2026

https://github.com/pradeepchegur/seamantic_web_design

We designed a semantic web for Instagram in Wix platform.

data-analysis framework instagram semantic-web website-design wix

Last synced: 19 Mar 2026

https://github.com/nafisalawalidris/hici-african-foods

HiCi African Foods: Excel dashboard & pivot table analysis of EU food rejection data to identify risks & recommend focus areas for market expansion.

data-analysis data-cleaning data-visualization eu-food-rejection excel-dashboard hici-african-foods market-expansion pivot-tables

Last synced: 19 Mar 2026

https://github.com/edisedis777/duckdb-analyzer

A powerful tool for analyzing large CSV datasets using DuckDB.

csv data-analysis database duckdb

Last synced: 16 Apr 2026

https://github.com/leosimoes/datascienceacademy-powerbi-3.0

Projetos do curso Microsoft Power BI Para Data Science Versão 3.0 da DataScienceAcademy. Dashboards para diversos casos de negócios.

business-intelligence dashboards data-analysis data-visualization microsoft-power-bi

Last synced: 19 Mar 2026

https://github.com/shadan100/sales-prediction-analysis

The aim is to build a predictive model and find out the sales of each product at a particular store. Using this model, BigMart will try to understand the properties of products and stores which play a key role in increasing sales.

artificial-intelligence data-analysis data-science django django-framework jupyter-notebook machine-learning matplotlib pandas predictive-modeling python sales-prediction

Last synced: 01 Mar 2026

https://github.com/antononcube/wl-datareshapers-paclet

Wolfram Language (aka Mathematica) paclet for data reshaping functions, like, long- and wide form, cross tabulation, etc.

contingency-table cross-tabulation data-analysis data-transformation long-form wide-form

Last synced: 20 Mar 2026

https://github.com/antononcube/wl-quantileregression-paclet

Wolfram Language (aka Mathematica) paclet that provides various Quantile Regression functions.

data-analysis machine-learning quantile-regression time-series time-series-analysis

Last synced: 20 Mar 2026

https://github.com/lijesh010/globalsuperstoresalesanalysis

The Global Superstore Sales Analysis repository showcases a comprehensive Power BI dashboard that provides valuable insights into sales performance. This project is designed to present key information and trends to stakeholders, enabling informed decision-making.

dashboard data-analysis data-visualization msexcel power-bi sales-analysis

Last synced: 19 Mar 2026

https://github.com/tnleite/projeto_king_lift

Este projeto apresenta uma análise detalhada dos dados financeiros da King Lift, uma empresa de locação de empilhadeiras. Utilizando Microsoft Excel, Power Query e Power Pivot, desenvolvi um dashboard interativo, também em Excel, que ajuda a empresa a obter insights valiosos para melhorar a eficiência operacional e aumentar o faturamento.

data-analysis data-science data-visualization excel

Last synced: 19 Mar 2026

https://github.com/harshmule1/store-sales-analysis

Sales Analysis Using Power Bi

data-analysis powerbi

Last synced: 19 Mar 2026

https://github.com/rakumar99/jp-morgan-chase-virtual-internship

This repository contains the various tasks assigned by JPMorgan Chase & Co. Virtual Internship on Microsoft Excel

conditional-formatting dashboard data-analysis data-visualization hlookup pivot-tables presentation vba-macros vlookup

Last synced: 02 Mar 2026

https://github.com/wrighang/shipping-data-analysis

Independent Project: Transit time trends analysis following a major shipping process change.

data-analysis matplotlib numpy pandas python

Last synced: 18 Apr 2026

https://github.com/sunnybibyan/marketing_campaign_analysis_power_bi_dashboard

Campaign Performance Analysis This project analyzes the performance of Spring, Summer, and Fall marketing campaigns, revealing key insights and actionable recommendations.

data-analysis data-visualization dax marketing-campaign powerbi

Last synced: 19 Mar 2026

https://github.com/kalebers/economic_analysis_data_science

Data Analysis Python project using economy data base to predict percentage of good and bad payers

data-analysis data-science machine-learning pandas python scipy sklearn-library

Last synced: 18 Apr 2026

https://github.com/denisecase/nlp-03-text-exploration

Exploratory analysis of text corpora using tokenization, frequency, co-occurrence, and bigrams to reveal structure in text.

bigrams co-occurence corpus-analysis data-analysis nlp python text-analysis text-exploration tokenization

Last synced: 02 Jun 2026

https://github.com/anthonybench/datapeek

Peek summary of datafile in a succinct, opinionated manner.

cli data data-analysis

Last synced: 02 Mar 2026

https://github.com/mohamedhany99/human-voice-identifier-counter

the application developed in (KIVY) it can identify the users imported into the dataset based on the support vector machine training model it has two features ( Importing new voice - Detection to detect the human voices and count them)

android android-app android-application automation automation-framework data data-analysis data-mining data-science data-visualization datascience kivy kivy-framework machine-learning python

Last synced: 27 Mar 2026

https://github.com/xuri/excelize-cs

Excelize is a C# port of Go Excelize library that allow you to write to and read from XLAM / XLSM / XLSX / XLTM / XLTX files.

agent ai chart csharp data-analysis data-science data-visualization excel excelize formula microsft office ooxml parser spreadsheet xlsm xlsx

Last synced: 03 Mar 2026

https://github.com/mayankyadav23/air-bnb-data-analysis

Data analysis and insights from NYC Airbnb listings, focusing on key metrics such as host performance, neighborhood trends, pricing, and customer reviews. Comprehensive documentation of ETL processes and analytical methodologies is provided. Perfect for understanding Airbnb dynamics and decision-making in the NYC market.

advanced-excel business-intelligence data-analysis data-analytics data-visualization power-bi ppt

Last synced: 19 Mar 2026

https://github.com/santiagortiiz/snowflake-data-warehousing

Snowflake University. Snowflake Data Warehousing. Foundamentals

big-data data-analysis data-warehouse olap snowflake

Last synced: 19 Mar 2026

https://github.com/kinshuk-code-1729/data-visualisation-using-python

This Repository consists of several python snippets for creating Two-Dimensional (2D) Graphics

data-analysis data-science data-visualization matplotlib visualization

Last synced: 02 Jun 2026

https://github.com/steno-aarhus/legliv

Substitution of red meat with legumes and risk of primary liver cancer in UK Biobank participants: A prospective cohort study

cancer-research data-analysis epidemiology nutritional-epidemiology nutritional-science open-science reproducibility reproducible-research rstats ukbiobank

Last synced: 03 Mar 2026

https://github.com/sleeplessglory/big-data

Projects regarding big data analysis, presented within Jupyter Notebook

big-data data-analysis data-visualization jupyter python

Last synced: 16 Apr 2026

https://github.com/wewoc/garmin_local_archive

Secure, local-first archive for Garmin Connect health data (HRV, sleep, activities). Private & offline. Structured for local analysis (Excel, HTML-Dashboard, Ollama, Open WebUI, AnythingLLM). Your data stays on your machine.

backup dashboard data-analysis fitness-tracker garmin garmin-connect ollama open-webui privacy privacy-enhancing-technologies privacy-first privacy-focused python self-hosted

Last synced: 16 Apr 2026

https://github.com/mg380/ibm-applied-data-science-capstone

This Capstone is the 10th (final) course in IBM Data Science Professional Certificate specialization, and it actually summarises in the form of project all materials that have been learned during this specialization

capstone data data-analysis data-science datascience ibm machine-learning plotly python scikit-learn sql

Last synced: 05 Mar 2026

https://github.com/dcs-training/intromachinelearning

This course is aimed at providing an introduction to machine learning for those with some beginner level python skills. Go to the readme file

data-analysis data-wrangling machine-learning python statistics

Last synced: 06 Mar 2026

https://github.com/snehilk1312/data_science

This Repository contains the Data Science things I have done in recent times along with visualization , cleaning , models, statistics, Courses, Datasets. :=)

data-analysis data-science glove natural-language-processing nlp nltk statistics word2vec

Last synced: 02 Apr 2026

https://github.com/sadia-khan13/supervised_machine_learning

This repository is meant to document my hands-on experience with supervised learning algorithms and techniques. It includes a variety of exercises, and experiments using different types of data and tools. Each file represents a step forward in building my machine learning skills.

data-analysis data-science jupyter-notebook machine-learning machine-learning-algorithms python sciket-learn supervised-machine-learning

Last synced: 06 Mar 2026

https://github.com/archie-cm/a-b-testing-mobile-games

This project have objective to examine what happens when the first gate in the game was moved from level 30 to level 40. When a player installed the game, he or she was randomly assigned to either gate30 or gate40.

abtesting data-analysis python retention-rate

Last synced: 17 Apr 2026

https://github.com/carmoreno/analisisaccidentalidadbogota

Data Analysis about traffic accidents at Bogotá, Colombia.

data-analysis data-science jupyer-notebook matplotlib numpy pandas scikit-learn

Last synced: 17 Apr 2026

https://github.com/reza-saeedi-coding/netflix-data-analysis

A complete end-to-end Netflix dataset analysis using Python, SQL, and Matplotlib. Explores genres, content ratings, and trends using exploratory data analysis and visualizations.

data-analysis data-cleaning eda matplotlib netflix pandas portfolio-project python sql sqlite

Last synced: 17 Apr 2026

https://github.com/sandk21/detection_faux_billets

Algorithme de détection de faux billets selon leurs dimensions géométriques et application web pour générer les prédictions

data-analysis data-science data-visualization machine-learning pandas python scipy sklearn streamlit

Last synced: 03 Apr 2026

https://github.com/harshmule1/school-data-analysis-

School Data Analysis Using SQL

data-analysis mssql sql

Last synced: 04 Apr 2026

https://github.com/hebaqaisar/movie-recommender-system

AI Recommender System - Recommends you similar movies based on Directors, Tags, Name, Type, Actors, Genre etc

artificial-intelligence data-analysis data-mining data-science jupyter-notebook machine-learning machine-learning-algorithms ml movies-rate pycharm python

Last synced: 17 Apr 2026

https://github.com/pradipece/weather_forecast_data_analysis

Using decision trees and random forest algorithms to solve real-world data analysis. "sklearn_decision_trees_random_forests"

data-analysis data-science data-visualization git github python python3

Last synced: 19 Apr 2026

https://github.com/shsiddhant/memory.fm

A Python library, CLI tool, and web-based dashboard for exploring music listening history from Last.fm and Spotify.

analytics data-analysis data-visualization memories music

Last synced: 04 Apr 2026

https://github.com/alfikiafan/air-quality-analysis

This repository contains a comprehensive data analysis project on Air Quality Dataset, covering the complete data analysis process from data gathering, cleaning, exploratory data analysis (EDA), to building a fully interactive dashboard using Streamlit.

air-quality data-analysis dicoding

Last synced: 17 Apr 2026

https://github.com/cdilga/knn-c

C implementation of a K-Nearest Neighbour algorithm

data-analysis knn

Last synced: 04 Apr 2026