An open API service indexing awesome lists of open source software.

Data analysis

Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.

https://github.com/sarathchandranpm/restaurant_order_analysis

This project entails an in-depth analysis of a restaurant's order and menu data. The focus is on exploring customer ordering behaviors, menu item attributes, and order specifics. By investigating the connections between order details, menu items, and order dates, the project seeks to generate valuable insights into the restaurant's operations.

data-analysis mysql sql

Last synced: 10 Apr 2025

https://github.com/nafisalawalidris/hotel-reservation-analysis

This project analyses hotel reservation data from Resort Hotels and City Hotels to uncover booking trends and insights. Utilising Microsoft Excel for initial data cleaning, PostgreSQL for data analysis and Tableau for creating visualisations, the project aims to deliver a comprehensive dashboard that highlights key metrics such as booking status.

data-analysis data-cleaning data-visualisation hotel-reservations microsoft-excel postgresql sql tableau tableau-dashboards tableau-desktop tableau-public

Last synced: 06 Jul 2025

https://github.com/joannescode/data-series-with-kaggle

Repositório de notebooks práticos sobre tratamento e análise de datasets

data-analysis matplotlib pandas python

Last synced: 13 Mar 2025

https://github.com/jerela/mola

A Python library for matrix algebra

data-analysis linear-algebra-library matrix-algebra python

Last synced: 14 Jan 2026

https://github.com/jcaperella29/stock_evaluation_python

A Python script to classify companies based on financial metrics like Piotroski F-Score and Stock Valuation, using CSV financial data for analysis and output.

ai-in-finance artificial-intelligence classification csv-processing data-analysis expert-system finance financial-analysis financial-analysis-tools piotroski-f-score python quantitative-analysis rule-based-classifier stock-analysis stock-valuation

Last synced: 07 Sep 2025

https://github.com/jeniljani-4444/end-to-end-world-cup-analysis-web-app

Our streamlined Streamlit web app fetches and processes ESPN CricInfo data delivering dynamic graphs for a quick and engaging cricket experience. Deployed on AWS EC2 with CI/CD pipelines.

aws-ec2 data-analysis plotly preprocessing streamlit-webapp

Last synced: 02 Apr 2025

https://github.com/erickkhosasi/thelook-data_analysis

Final project for my SQL mini bootcamp. This project explores an e-commerce dataset to uncover key business insights. Data insights were queried in Google BigQuery and visualized with Google Sheets.

bigquery data-analysis e-commerce sql

Last synced: 05 Oct 2025

https://github.com/abhinavsharma07/fraud_analytics-credit_card_fraud_detection

The aim of this project is to predict fraudulent credit card transactions with the help of different machine learning models.

banking data-analysis decision-trees hyperparameter-optimization machine-learning-algorithms pipelines random-forest-classifier svm-classifier xgboost-classifier

Last synced: 06 Oct 2025

https://github.com/raccoon-hero/gender-equality-tracker

A web application visualizing gender equality metrics with a focus on Ukraine. Built with Flask, it's powered by live data from global open sources, with dynamic research insights and analysis.

chartjs css dashboard data-analysis data-visualization flask frontend gender-equality global-metrics html linked-data openalex opendata python representation semantic-web ukraine webapp wikidata world-bank-api

Last synced: 07 May 2026

https://github.com/ahammadshawki8/playing-with-pandas

🐼 Pandas is one of my favourite library in python. It is well-known for "Analyzing" data. Learn basics and beyond the basics of Pandas from this repository. 🤍🖤

beginner-friendly data-analysis favourite-library pandas python

Last synced: 17 Apr 2026

https://github.com/sunnybibyan/exploratory-data-analysis-eda

Welcome to the Titanic Dataset - Exploratory Data Analysis (EDA) project repository! This project aims to uncover insights from the Titanic dataset using Python and Jupyter Notebook. By analyzing key variables such as age, gender, and class, we aim to visualize relationships between passenger characteristics and survival rates.

data-analysis data-visualization jupyter-notebook python titanic-dataset

Last synced: 18 Jan 2026

https://github.com/targetta/ankaflow

YAML-based data pipeline framework that runs both locally and fully in-browser designed for data engineers, ML teams, and SaaS developers who need flexible, SQL-powered pipelines.

bigquery clickhouse data-analysis dataops deltalake duckdb elt-pipeline etl etl-automation motherduck parquet python sql

Last synced: 09 Oct 2025

https://github.com/muneeb1030/webscrapper_mastodon

The Mastodon Social Platform Scraper is a Python-based web scraping tool designed to explore and extract valuable data from the Mastodon social platform.

data-analysis data-collection mastodon python3 scrapy scrapy-spider selenium-python webscraping

Last synced: 09 Oct 2025

https://github.com/kamanhang/sqldatawarehousedataengineeringproject

This project delivers a modern data warehouse which focuses on building clean, organized data pipeline which covers important aspects such as ETL Pipeline Development, Data Cleaning, Data Modelling and Data Analytics

customer-analytics data-analysis data-cleaning data-engineering data-modeling data-pipeline data-visualization datascience etl-pipeline postgresql powerbi powerbidashboard sales-analysis sql

Last synced: 10 Oct 2025

https://github.com/cyberoctane29/unicorn-companies-analysis

This project explores unicorn companies, private startups valued at over $1 billion, using Python for data analysis. It covers industry trends, geographic distribution, and investment patterns through EDA, including data cleaning, handling missing values, datetime transformations, and visualizations to uncover key insights.

data-analysis eda numpy pandas python

Last synced: 02 May 2026

https://github.com/selcuk05/forbes_top_100_celebrities_data_analysis

Forbes Top 100 Celebrities since 2005 Data Analysis and Visualization

data-analysis data-science

Last synced: 11 Oct 2025

https://github.com/allanotieno254/awsome-chocolate-company-sales-analysis-dashboard

This repository contains an in-depth analysis of chocolate consumption trends, focusing on various factors influencing consumer preferences, production, and market performance.

data-analysis data-science data-transformation measures powerbi sales-analysis visualization

Last synced: 23 Feb 2026

https://github.com/famarks/grafarg

Grafarg is an interactive data analytics and graphical data visualization application. Grafarg being a progressive fork of Grafana 7.5.17 continues to be available under open source Apache 2.0 License

analytics charts data data-analysis data-science data-visualization grafana grafarg graph

Last synced: 19 Jan 2026

https://github.com/shreeparab1890/flipkart-laptops-analysis-eda

This ipython notebook is the Exploratory data analysis (EDA) of the Laptops listed on Flipkart.

data-analysis eda exploratory-data-analysis matplotlib-pyplot numpy pandas plotly

Last synced: 14 Apr 2026

https://github.com/titanscouting/tra-superscript

The Red Alliance data analysis package

data-analysis frc-scouting hacktoberfest python

Last synced: 11 Oct 2025

https://github.com/abhi-lab2/ipl-data-analysis

IPL data analysis for future predictions

data-analysis data-science python

Last synced: 14 Apr 2026

https://github.com/strampelligiovanni/straklip

An HST pipeline for reducing wide-field imaging observations not specifically designed for High Contrast Imaging analysis. Published in Strampelli et al. 2022.

binaries data-analysis data-reduction direct-imaging exoplanets high-contrast-imaging hst wide-field-surveys

Last synced: 12 Oct 2025

https://github.com/listiangr/ecommerce_sales_data_analysis

Proyek ini menganalisis data penjualan e-commerce untuk membantu bisnis memahami tren penjualan, performa produk, dan segmen pelanggan. Tujuan utamanya adalah memberikan wawasan yang dapat meningkatkan strategi pemasaran dan pengelolaan produk.

dashboard data-analysis data-cleaning data-collection data-penjualan data-visualization exploratory-data-analysis microsoft-excel

Last synced: 19 Jan 2026

https://github.com/ayobami6/tweet-data-analysis

WeRateDogs Tweets Scrape using twitter Api

data-analysis data-science twitter webscraping

Last synced: 31 May 2026

https://github.com/bristolmyerssquibb/blockr.workshop

R in Pharma 2024 blockr workshop

data-analysis nocode r

Last synced: 18 Apr 2026

https://github.com/raad07/sql_project-world_layoffs_dataset

This is a SQL project which comprises the Data Cleaning in the first part and Exploratory Data Analysis (EDA) in the second part.

data-analysis database mysql sql

Last synced: 27 Jan 2026

https://github.com/agustinmusanti/delitosencaba-proyectofinal-dataanalytics-coderhouse

En este repositorio muestro mi proyecto final en el curso "Data Analytics" de Coderhouse.

data-analysis excel powerbi

Last synced: 22 Jan 2026

https://github.com/rkolehov/retail-sales-analysis-project

End-to-end e-commerce analysis showcasing SQL and data visualization skills. Tracks sales, customer behavior, product performance, and delivery efficiency. Interactive dashboards provide actionable insights for business decision-making

analytics dashboard data-analysis ecommerce jupyter-notebook postgresql python sql tableau vscode

Last synced: 19 Apr 2026

https://github.com/nafisalawalidris/springforth-university-foodbank

Springforth University Food Bank: A collaborative initiative with UNESCO to address student food insecurity. Contains code and resources for the web application, data analysis, and insights into the prevalence and impact of food insecurity on academic performance.

academic-performance collaborative-initiative data-analysis data-visualization excel pivot-tables powerbi springforth-university-food-bank student-food-insecurity unesco

Last synced: 17 Feb 2026

https://github.com/vruddhi18/e-commerce_data_analysis_powerbi_dashboard

The E-Commerce Data Analysis project leverages Power BI to analyze sales and customer insights from Blinkit, Zepto, Myntra, and Flipkart, providing interactive dashboards to enhance e-commerce strategies.

data-analysis powerbi

Last synced: 27 Feb 2026

https://github.com/jfjlaros/spreadscript

SpreadScript: Use a spreadsheet as a function.

automation command-line data-analysis evaluation function interface spreadsheet

Last synced: 16 Oct 2025

https://github.com/bishtrishu/pizza_sales_data_analysis_sql

This project is a comprehensive data analysis of pizza sales, aimed at uncovering key insights and trends to inform business decisions. Using a combination of SQL, Python, and data visualization tools, the project analyzes sales data to understand customer preferences, peak sales periods, and the most popular pizza types.

cloud data data-analysis data-science data-visualization dataanalytics database mysql oracle-database

Last synced: 14 Apr 2026

https://github.com/as16082023/coffee-bean-sales-analysis

Analyzing coffee bean sales data to optimize consumer targeting, product offerings, and strategic marketing in the coffee industry.

coffee-bean-sales dashboard data-analysis data-visualization ms-excel

Last synced: 22 Jan 2026

https://github.com/gallillio/unsupervised_clustering_music_recommendation_system

Music Recommendation System using Unsupervised Machine Learning Clustering Methods using K-Means, Fuzzy C Mean DBSCAN, Gaussian Mixture Model, BIRCH and Agglomerative Clustering

affinity-propagation agglomerative-clustering birch-clustering data-analysis data-visualization dbscan-clustering fuzzy-cmeans-clustering gaussian-mixture-models k-means-clustering pca unsupervised-machine-learning

Last synced: 19 Oct 2025

https://github.com/moscarde/pyproductivity

Application uptime tracker that monitors active windows, automatically generating daily usage reports.

daily-report data-analysis python tracker

Last synced: 19 Oct 2025

https://github.com/renanmoliveir/analise_de_dados_bikestore_power-bi_atualizan-o

Projeto de análise de dados do banco de dados Bike Store com Power BI.

data-analysis dax-languague powerbi query

Last synced: 15 Mar 2026

https://github.com/gattiharishkumar/blinkit-sales-analysis-dashboard

This project presents a comprehensive sales analysis dashboard for Blinkit, an Indian last-minute delivery app. The dashboard was created using Power BI and provides a detailed overview of the company's sales performance across various outlets and product categories.

dashboard data-analysis data-transformation data-visualization ms-excel-data-analytics power-query powerbi powerbi-visuals

Last synced: 19 Mar 2026

https://github.com/viper373/163-buff

爬取网易BUFF平台CS:GO武器皮肤交易数据

163 arima crawler-python csgo data-analysis prediction python

Last synced: 24 Oct 2025

https://github.com/tunjis/global-superstore_dashboard_tableau

Tableau dashboard with 4 different types of visualisations

charts dashboard data-analysis data-visualisation excel tableau

Last synced: 23 Jan 2026

https://github.com/dcs-training/null-hypothesis-testing-with-r

This two-class course will focus on developing theoretical and practical skills for null hypothesis testing in R. Go to the readme file

data-analysis data-wrangling r statistics

Last synced: 24 Oct 2025

https://github.com/ayenpure/stockmeup

This is a class project for 'CIS 610 : Data Science' where I try and validate Stock Market recommendations.

data-analysis data-mining data-science java mapreduce mapreduce-java

Last synced: 24 Oct 2025

https://github.com/ivanildobarauna-dev/currency-quote

Complete solution for extracting currency pair quotes data with comprehensive testing, parameter validation, flexible configuration management, Hexagonal Architecture, CI/CD pipelines, code quality tools, and detailed documentation.

data-analysis data-analytics data-engineering library pypi-packages python

Last synced: 27 Oct 2025

https://github.com/sustentarea/gs-data-analysis-report-3

📓 Exploring potential associations between childhood undernutrition and the Standardized Precipitation Evapotranspiration Index (SPEI) in Brazilian municipalities (2008–2019)

brazil climate-change data-analysis data-science food-systems global-syndemic ibge malnutrition nutrition obesity r rstats sisvan spei sustainable-eating wasting worldclim

Last synced: 27 Oct 2025

https://github.com/vhawk19/ambaan

just wants the average analyst to be happi

data-analysis duckdb-wasm sql vue

Last synced: 01 Mar 2026

https://github.com/vasishth/lecturesintrobayes

Please go to the website for these online lectures:

bayesian-inference brms data-analysis stan

Last synced: 06 Feb 2026

https://github.com/seekinginfiniteloop/fedcal

A feature-rich Python calendar that enables time series analyses of changes in federal workforce schedules and shifts in executive department funding status.

data-analysis data-science econometrics economic-data economics federal federal-government hr pandas pandas-library pandas-python pydata python

Last synced: 15 Apr 2026

https://github.com/nafisalawalidris/international-breweries

This GitHub readme provides an overview of data analysis using SQL on the International Breweries dataset, including dataset description, analysis questions, example SQL queries, and key insights derived from the analysis.

data-analysis insights international-breweries-dataset queries sql

Last synced: 31 Jan 2026

https://github.com/tynoee/record_company-database

A record company database with multiple query commands using SQL

data-analysis sql

Last synced: 31 Jan 2026

https://github.com/victor-antoniassi/junior_data_analyst_test_01

Solution developed for a technical assessment that analyzed video game sales data to support gaming partnership decisions.

asses assessment-project data-analysis data-analysis-project data-analyst duckdb etl prefect python

Last synced: 01 Jun 2026

https://github.com/elissorokin/data-analyst-portfolio-rus

Это репозиторий, в котором я демонстрирую свои навыки, делюсь проектами и отслеживаю прогресс в области анализа данных и Data Science.

ab-testing data data-analysis datalense matplotlib numpy pandas plotly portfolio postgresql python scipy seaborn sql statistical-analysis

Last synced: 25 Feb 2026

https://github.com/uchida16104/healthanalysis

It abstracts the health status of each device from its operational time calculated from RescueTime and analyzes the data.

data-analysis portfolio portfolio-website security security-tool

Last synced: 02 Feb 2026

https://github.com/maugus0/sats-flight-data-fetcher

A simple Python tool to fetch and analyze flight data for 15+ major airlines using the AirLabs API.

airline-data cli-tool data-analysis flight-data python3

Last synced: 17 Mar 2026

https://github.com/jakobzmrzlikar/fake-news-analysis

An analysis of the FakeNewsNet dataset using NLP techniques.

data-analysis fake-news ipynb-jupyter-notebook nlp-machine-learning

Last synced: 05 Mar 2026

https://github.com/chahiriabderrahmane/carpricepredictor

🚗 Cars Exploration & Price Prediction | Analyzing Cars.com Listings

data-analysis data-science data-visualization machine-learning python streamlit web-scraping

Last synced: 08 Feb 2026

https://github.com/flexmonster/svelte-flexmonster

Svelte wrapper for Flexmonster Pivot Table & Charts

data-analysis data-visualization frontend pivot-tables svelte sveltekit

Last synced: 27 Feb 2026

https://github.com/fer-aguirre/taller-cookiecutter

Taller sobre cómo usar Cookiecutter para análisis de datos.

cookiecutter data-analysis project-template workshop

Last synced: 19 Mar 2026

https://github.com/mattdelaune/retail_rfm_analysis

Power BI multi-page report leveraging advanced data visualization for RFM analysis. Delivers deep analytical insights into customer behavior, engagement, and spending patterns, driving strategic business decisions.

data-analysis dax powerbi report rfm-analysis sales-data visualization

Last synced: 19 Mar 2026

https://github.com/allanotieno254/powerbi-chocolate-sales-analysis-dax-calculations-80-

This Power BI project analyzes **chocolate sales performance using advanced DAX calculations and interactive visualizations. The report provides insights into monthly revenue, top-selling products, sales trends, and market performance.

business-intelligence data-analysis dax powerbi powerbi-dashboards powershell-module sales-analysis visualization

Last synced: 13 Feb 2026

https://github.com/nikhilash45/power-bi-vsualisation-of-joins

In This Power Bi Report User Can Visualis Join By Themselves , and it is easy to understand joins now.

business-analytics business-intelligence data data-analysis data-visualization joins powerbi sql visualization

Last synced: 19 Mar 2026

https://github.com/leosimoes/datascienceacademy-powerbi-3.0

Projetos do curso Microsoft Power BI Para Data Science Versão 3.0 da DataScienceAcademy. Dashboards para diversos casos de negócios.

business-intelligence dashboards data-analysis data-visualization microsoft-power-bi

Last synced: 19 Mar 2026

https://github.com/tnleite/projeto_king_lift

Este projeto apresenta uma análise detalhada dos dados financeiros da King Lift, uma empresa de locação de empilhadeiras. Utilizando Microsoft Excel, Power Query e Power Pivot, desenvolvi um dashboard interativo, também em Excel, que ajuda a empresa a obter insights valiosos para melhorar a eficiência operacional e aumentar o faturamento.

data-analysis data-science data-visualization excel

Last synced: 19 Mar 2026

https://github.com/kalebers/economic_analysis_data_science

Data Analysis Python project using economy data base to predict percentage of good and bad payers

data-analysis data-science machine-learning pandas python scipy sklearn-library

Last synced: 18 Apr 2026

https://github.com/mohamedhany99/human-voice-identifier-counter

the application developed in (KIVY) it can identify the users imported into the dataset based on the support vector machine training model it has two features ( Importing new voice - Detection to detect the human voices and count them)

android android-app android-application automation automation-framework data data-analysis data-mining data-science data-visualization datascience kivy kivy-framework machine-learning python

Last synced: 27 Mar 2026

https://github.com/steno-aarhus/legliv

Substitution of red meat with legumes and risk of primary liver cancer in UK Biobank participants: A prospective cohort study

cancer-research data-analysis epidemiology nutritional-epidemiology nutritional-science open-science reproducibility reproducible-research rstats ukbiobank

Last synced: 03 Mar 2026

https://github.com/wewoc/garmin_local_archive

Secure, local-first archive for Garmin Connect health data (HRV, sleep, activities). Private & offline. Structured for local analysis (Excel, HTML-Dashboard, Ollama, Open WebUI, AnythingLLM). Your data stays on your machine.

backup dashboard data-analysis fitness-tracker garmin garmin-connect ollama open-webui privacy privacy-enhancing-technologies privacy-first privacy-focused python self-hosted

Last synced: 16 Apr 2026

https://github.com/mg380/ibm-applied-data-science-capstone

This Capstone is the 10th (final) course in IBM Data Science Professional Certificate specialization, and it actually summarises in the form of project all materials that have been learned during this specialization

capstone data data-analysis data-science datascience ibm machine-learning plotly python scikit-learn sql

Last synced: 05 Mar 2026

https://github.com/sadia-khan13/supervised_machine_learning

This repository is meant to document my hands-on experience with supervised learning algorithms and techniques. It includes a variety of exercises, and experiments using different types of data and tools. Each file represents a step forward in building my machine learning skills.

data-analysis data-science jupyter-notebook machine-learning machine-learning-algorithms python sciket-learn supervised-machine-learning

Last synced: 06 Mar 2026

https://github.com/sandk21/detection_faux_billets

Algorithme de détection de faux billets selon leurs dimensions géométriques et application web pour générer les prédictions

data-analysis data-science data-visualization machine-learning pandas python scipy sklearn streamlit

Last synced: 03 Apr 2026

https://github.com/hebaqaisar/movie-recommender-system

AI Recommender System - Recommends you similar movies based on Directors, Tags, Name, Type, Actors, Genre etc

artificial-intelligence data-analysis data-mining data-science jupyter-notebook machine-learning machine-learning-algorithms ml movies-rate pycharm python

Last synced: 17 Apr 2026

https://github.com/cdilga/knn-c

C implementation of a K-Nearest Neighbour algorithm

data-analysis knn

Last synced: 04 Apr 2026

https://github.com/hayatiyrtgl/data_analysis_project

Financial data analysis: preprocess, visualize, calculate technical indicators.

data-analysis data-analysis-python data-science dataframe numpy pandas python python3 stock-price-prediction talib trade-analysis

Last synced: 04 Apr 2026

https://github.com/kalfasyan/filoma

profiling files, directories, image data

data-analysis profiler validation

Last synced: 05 Apr 2026

https://github.com/jrbourbeau/cr-composition

IceCube cosmic-ray composition analysis

cosmic-rays data-analysis machine-learning physics python

Last synced: 20 Apr 2026

https://github.com/rakumar99/power-bi-projects

This repository contains various power bi projects and dashboards of Humaan Resources , Financial Analysis using Power BI Desktop.

dashboards data-analysis data-visualization databases datacleaning datamodeling etl powerbi powerquery reports

Last synced: 04 Jun 2026

https://github.com/ganeshkumartk/ncov-2019

[EDA] Statistical modelling of Novel Coronavirus breakout nCoV-2019

corona data-analysis ncov ncov-2019 statistics wuhan wuhan-coronavirus wuhan-virus

Last synced: 05 Jun 2026

https://github.com/chandansoren/diabetics_prediction

Predicting that whether the patient has diabetes or not on the basis of the features we will provide to our machine learning model.

data-analysis machine-learning python svm

Last synced: 06 Jun 2026

https://github.com/savinrazvan/degrees

A program that computes the "degrees of separation" between two actors by identifying the sequence of movies connecting them, inspired by the Six Degrees of Kevin Bacon game. Uses IMDb-based datasets for actors, movies, and their relationships.

actor-connections ai data-analysis degrees-of-separation educational-project graph-theory imdb movie-database python six-degrees-of-kevin-bacon

Last synced: 24 Apr 2026

https://github.com/anastasius21/imdb-movie-analysis

Analysis of IMDb's Top 1000 Movies dataset using Pandas, Matplotlib, and Seaborn. It provides visualizations and insights into various aspects of movies, such as ratings, genres, directors, and release years.

data-analysis data-exploration data-science data-visualization imdb imdb-dataset jupyter-notebook python

Last synced: 25 Apr 2026

https://github.com/cdeweyx/game-of-thrones-s7e1-eda

Exploratory data analysis of scraped tweets related to Game of Thrones S7E1

data-analysis data-visualization python twitter-api

Last synced: 26 Apr 2026

https://github.com/alejo1630/sport_stats

Data analysis of information from the summer and winter Olympic games over the years. UC Davis SQL Specialization Final Project

data-analysis jupyter-notebook olympics-dataset plotly python seaborn sql

Last synced: 26 Apr 2026

https://github.com/adrija-debnath/ideas-isi-data-science-internship

Topic of the Project - Predictive Maintenance Analysis, Data Science Internship at IDEAS - Institute of Data Engineering, Analytics and Science Foundation Technology Innovation Hub at Indian Statistical Institute, Kolkata.

data-analysis data-science predictive-analytics predictive-maintenance streamlit

Last synced: 27 Apr 2026

https://github.com/antonio-f/big-data-analysis-with-scala-and-spark

Coding assignments from the course "Big Data Analysis with Scala and Spark" (Coursera).

big-data bigdata coursera data-analysis scala spark

Last synced: 27 Apr 2026

https://github.com/jongan69/potion-leaderboard

Start of Entry for potion leaderboard contest

data-analysis leaderboard potion trading

Last synced: 11 Jun 2026