An open API service indexing awesome lists of open source software.

Data analysis

Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.

https://github.com/gmbeddard/ee152-realtime_embedded_systems-finalproject

An STM32-based implementation of the Pan-Tompkins algorithm for real-time QRS detection. Includes robust debugging tools, heart rate monitoring, and live ECG signal support via a python graphing script.

cpp-programming data-analysis ecg embedded-c freertos stm32

Last synced: 21 Apr 2026

https://github.com/foxriver76/iobroker.intelliflow

Stream data analysis adapter for ioBroker.

data-analysis iobroker machine-learning streaming-data

Last synced: 04 Apr 2025

https://github.com/frankelavsky/political-polarization-challenge

I had 8 hours to build a solution to the research claim that "politics have become more divided in the past 50 years." You can navigate views of congressional voting patterns using arrows. I used d3, require, MVC pattern, and vanilla js. Pre-processed the data in node.js. Data is from DW-NOMINATE: ftp://k7moa.com/junkord/HANDSL01114A20_STAND_ALONE_30.DAT

client-side css d3 d3js data-analysis data-visualization frontend frontend-app html interactive interactive-visualizations javascript modular nodejs political-science politics requirejs research single-page-app visualization

Last synced: 06 Apr 2026

https://github.com/victorherdz10/rainsense-iot

Sistema IoT de detección temprana de lluvias con Arduino. Monitorea condiciones meteorológicas usando sensores DHT22/BMP280 y algoritmos de predicción multivariable para alertas en tiempo real. Procesa datos y los envía via HTTP/JSON.

arduino bmp280 data-analysis dht22 embedded-systems iot platformio rain-detection real-time sensor-network weather-prediction weather-station

Last synced: 17 Apr 2026

https://github.com/yard1/linearordering

An R package. Provides various methods of linear ordering of data. Supports weights and positive/negative impacts.

data-analysis data-analysis-in-r data-analysis-r data-science r

Last synced: 21 May 2026

https://github.com/sharathsphd/coffee_causality

Data-driven analysis of coffee shop sales using correlation, regression, and causal inference. A Jupyter Book project exploring foot traffic, weather patterns, and business analytics.

business-analytics causal-inference correlation data-analysis foot-traffic forecasting github-pages jupyter-notebook machine-learning open-source python regression retail-analytics statistics storytelling time-series visualization weather-analysis

Last synced: 18 May 2026

https://github.com/phillbertnevinemmanuel/coviddeathvaceda

an exploratory data analysis based on dataset of covid statisics from 2020-2022

data-analysis database sql

Last synced: 09 Apr 2025

https://github.com/patilni3/matplotlib-in-depth

Python's Matplotlib Library for Data Analysis, Machine Learning, Data Science and many more...

data-analysis data-representation data-science data-visualization matplotlib matplotlib-pyplot plots-in-python powerbi seaborn

Last synced: 03 Apr 2025

https://github.com/vivienneforreal/covid4eu-sorbonne

Economy: “Analysis of Labor Market decisions of men and women during the COVID-19 pandemic in the 4EU+ countries”.

covid-19 data-analysis data-science data-visualization pandas

Last synced: 20 Mar 2025

https://github.com/fbraza/paris_airbnb

Analysis of Paris AirBnB data using R and Shiny

analysis data data-analysis paris-airbnb r shiny

Last synced: 21 Mar 2025

https://github.com/hafeez-urrehman/mental-health-analyzer

Mental-Health-Analyzer is an AI-Based project for predicting mental health disorders such as stress, anxiety, depression, and loneliness. By applying machine learning techniques, this project analyzes user inputs and behavioral data to provide accurate predictions, aiming to support mental well-being and early intervention.

data-analysis data-science early-diagnonosis machine-learning mental-health mental-wellbeing predictive-modeling python

Last synced: 17 May 2026

https://github.com/diacod-i/bournetokill

Analysis on inhibition assay data for Monoamine Oxidase protein family

data-analysis data-science data-visualization python3

Last synced: 21 May 2026

https://github.com/cosmoduende/r-uber-trips-analyisis

Explore your activity on Uber with R: How to analyze and visualize your personal data history. Find out how you consume the Uber App using a copy of your data.

analisis-de-data data-analysis data-analytics data-science data-visualisation data-visualization data-viz eda flexdashboard ggmap ggplot2 mobility-as-a-service qmplot r-language r-programming ridesharing uber uber-data visualizacion-de-datos

Last synced: 14 Jul 2025

https://github.com/shubhamgoyal575/diwali-sankranti-promotion-sales

This Power BI dashboard analyzes sales performance during Diwali and Sankranti festivals. It provides insights into revenue trends, top-selling products, regional sales distribution, and customer purchasing behavior to help optimize festive season sales strategies. 🚀

buisness-intelligence dashboard data-analysis data-visualization diwali-sankranti-sales-analysis excel fast-moving-consumers-goods fmcg microsoft-power-bi mysql power-query powerbi revenue-insights sales-dashboard sales-insights sql

Last synced: 02 Mar 2026

https://github.com/athul64/exploratory-data-analysis

To preprocess and analyze the given employee dataset, present the findings graphically, and derive meaningful insights to help better understand the company’s workforce.

colab-notebook data-analysis data-visualization matplotlib numpy pandas python seaborn statistical-analysis

Last synced: 25 Feb 2026

https://github.com/grypesc/graduateadmissions

Visualization, analysis and predictive modeling of a Kaggle graduate admissions dataset.

data-analysis data-mining data-science data-visualization dataset

Last synced: 08 Jul 2025

https://github.com/markoshb/machine-learning-subject

Implementation of multiclass classification problems in R

classification-model data-analysis r

Last synced: 14 Mar 2025

https://github.com/dogoncouch/dhcptranslate

Parses ISC DHCP server config, performs DNS resolution as needed, and outputs lease data in CSV format.

configuration csv-format data-analysis isc-dhcp isc-dhcp-server migration-tool

Last synced: 20 Mar 2025

https://github.com/onome-joseph/ml-fraud-dectection

This project is designed to identify fraudulent transactions with high accuracy.

classfication-model data-analysis data-science machine-learning problem-solving

Last synced: 06 Apr 2025

https://github.com/v6ntage/sql-sales_data-analytics-project

This repository contains a SQL scripts demonstration analytical techniques.

analytics business-analytics data data-analysis database query sql sql-server

Last synced: 12 Apr 2026

https://github.com/jahnavigupta06/zepto-delivery-customer-analytics

Real-time SQL + Power BI Analytics Project replicating Zepto's customer & delivery insights.

business-intelligence churn-analysis customer-segmentation data-analysis data-visualization powerbi sql-server

Last synced: 02 Aug 2025

https://github.com/ituvtu/Data-Science-AB-Testing

This project focuses on conducting A/B testing to evaluate the effectiveness of two marketing campaigns. Using statistical analysis and hypothesis testing, we determine which campaign is more effective in improving conversion rates.

a-b-testing data-analysis data-analysis-python data-mining ipynb jupyter jupyter-notebook python

Last synced: 26 Sep 2025

https://github.com/rdrahul123/sales-dashboard

The Sales Analysis Dashboard was developed to provide insights into sales, profits, and product performance across different categories, timeframes, and geographic locations. By leveraging Power BI, the project aimed to transform raw data into actionable visualizations, facilitating better decision-making for stakeholders.

data-analysis data-science data-visualization dax powerbi

Last synced: 06 Jan 2026

https://github.com/rayyan9477/multiple-disease-prediction-system

This repository contains a Multiple Disease Prediction System leveraging machine learning techniques for accurate predictions. It utilizes Python, Pandas, Scikit-learn, and Flask for data preprocessing, model building, and web deployment. Explore the project and connect on LinkedIn for collaborations.

data-analysis data-science machine-learning python streamlit

Last synced: 10 Apr 2026

https://github.com/shriram-vibhute/data-analysis

This repository offers a comprehensive collection of data analysis techniques using NumPy Pandas, Matplotlib and Seaborn.

data-aggregation data-analysis data-visualization data-wrangling matplotlib numpy pandas seaborn

Last synced: 02 Aug 2025

https://github.com/oguzgn/a-case-study-for-a-livestreaming-platform

This project aims to analyze livestream watch times of users across different regions. The goal is to identify the top 5 users with the highest watch time for each region. The analysis involves multiple SQL transformations to extract meaningful insights from the data.

bigquery data data-analysis data-modeling live-streaming sql

Last synced: 23 Jun 2025

https://github.com/mariam-badr-mb/gtc-land-type-classification

This project develops a machine learning model to classify land cover types in Egypt using Sentinel-2 satellite imagery. The system detects categories such as agriculture, water bodies, urban areas, deserts, roads, and tree cover.

data-analysis data-visualization deep-neural-networks eda machine-learning model-architecture streamlit

Last synced: 12 Jun 2026

https://github.com/sarathchandranpm/vehicle_theft_analysis

This project is a comprehensive data analysis of vehicle theft patterns, utilizing advanced SQL techniques to explore when, which, and where vehicles are most likely to be stolen. The analysis provides deep insights into vehicle theft characteristics through systematic, multi-dimensional exploration.

data-analysis mysql sql

Last synced: 02 Aug 2025

https://github.com/airscholar/data_analysis_with_ai

A repository showing how to use AI and ChatGPT for Data Analysis with Pandas and Python

chatgpt data-analysis gpt4 openai pandas pandasai python

Last synced: 10 Apr 2026

https://github.com/reusjimenez/data-analysis-labs

Casos completos y ejercicios prácticos de análisis de datos. 📊

data-analysis data-visualization jupyter-notebook machine-learning matplotib numpy panel python sklearn

Last synced: 04 Apr 2025

https://github.com/msthamizh/airbnb_analysis

Developing a Streamlit application enabling users to explore and analyze Airbnb listing data. This application allows users to interactively visualize geospatial distributions of listings, analyze pricing trends, and explore availability patterns across different locations. Integrates MongoDB Atlas for data storage and PowerBi for advanced insights

data-analysis data-cleaning data-visualization json mongodb pandas-dataframe plotly powerbi python streamlit

Last synced: 11 Apr 2026

https://github.com/hfxbse/dhbw-data-analysis

Exploratory data analysis R notebook for the module T3INF4333 "Grundlagen Data Science" held in 2024 by Lothar B. Blum at the DHBW Stuttgart.

data-analysis data-science dhbw dhbw-stuttgart ggplot2 r r-notebook

Last synced: 04 May 2026

https://github.com/t-mohamed-shafeek/data-analysis-on-tamil-nadu-road-accidents

The "Data Analysis on Tamil Nadu Road Accidents" is a project deals with analysis of data on Road Accidents encountered by Tamil Nadu ( one of the states of India ) in the year of 2020 and 2021. But the dataset is most recently created (created on February 15, 2023 with source form TN Police).

dashboard data-analysis data-science data-visualization jupyter-notebook tableau

Last synced: 03 Aug 2025

https://github.com/virajbhutada/article-recommendation-system

This project aims to redefine content discovery by delivering personalized article recommendations tailored to individual user preferences. We use advanced machine learning techniques like PCA and K-means clustering to analyze user behavior and article characteristics to provide highly accurate recommendations.

anaconda article-recommendation clustering-algorithm data-analysis data-science keras-tensorflow machine-learning machine-learning-algorithms ml-models numpy pandas plotly python scikit-learn scipy

Last synced: 06 Jan 2026

https://github.com/ilchen/eu_economic_data_analysis

Jupyter notebooks for analysis of Eurozone GDP, yields on government bonds, inflation expectations, unemployment and participation rates, money supply, personal consumption and savings, stock market. Using APIs from Eurostat, ECB, OECD and Yahoo-Finance.

data-analysis disposable-income finance gdp hicp inflation interest-rates jupyter-notebook money-supply participation-rate risk-free-interest-rate savings stock-market unemployment-rate

Last synced: 10 Oct 2025

https://github.com/juliusmarkwei/iris-dataset-analysis

Data analysis, data visualization and model training using the popular Iris Dataset

data-analysis data-visualisation linear-regression machine-learning

Last synced: 03 Aug 2025

https://github.com/alxrm/scent-of-literature

Russian literature sentiment analysis in terms of very small dataset

classification data-analysis sentiment-analysis sklearn tf-idf

Last synced: 28 Apr 2026

https://github.com/aymane-maghouti/sentiment-analysis-for-jumia-reviews-and-smartphone-price-prediction-system

The project focuses on customer sentiment analysis for Jumia, aiding informed online decisions. It collects and analyzes product comments to determine sentiments and implements a decision-making algorithm. Additionally, it includes product price prediction system using regression techniques.

beutifulsoup data-analysis data-cleaning data-collection data-preprocessing data-scraping data-visualization eda falsk machine-learning python web-application

Last synced: 18 Apr 2026

https://github.com/sanam2405/chatinfo

Analysing the WhatsApp Chat with my crush over a 6M period

data-analysis data-visualization python

Last synced: 27 Apr 2026

https://github.com/gesiscss/wikipedia-language-olga-master

Measuring Gender Inequalities of German Professions on Wikipedia

bias crowdflower data-analysis data-science gender images python statistics wikipedia

Last synced: 10 Apr 2026

https://github.com/ipanalytics/vpn-provider-overlap-intelligence

Aggregate VPN provider infrastructure overlap analysis: exact-IP overlap, shared /24 prefixes, hosting dependency, and provider relationship clusters. No raw VPN IP lists.

anti-fraud asn cybersecurity data-analysis fraud-detection infrastructure ip ip-intelligence ip-reputation network-analysis network-intelligence osint proxy-detection threat-intelligence vpn vpn-detection

Last synced: 25 May 2026

https://github.com/nirmit27/book-recommender-system

This is a book recommendation system based on item-based Collaborative Filtering memory-based model created using Flask.

data-analysis data-science flask python python3 recommender-system render

Last synced: 05 May 2026

https://github.com/chelseammatta/nopd-cad-data-analysis

Analysis of 911 call data from New Orleans' 3rd & 4th police districts (2019-2022) using BigQuery

911-calls 911-data bigquery cad-data crime-analysis data-analysis emergency-response new-orleans public-safety sql

Last synced: 01 Jul 2025

https://github.com/walkerdustin/vergleich-von-messmethoden-fuer-punktwolken

Bei der Vermessung eines physischen Raumes ist das Ergebnis eine Punktwolke. Diese Punktwolke beschreibt dann ausgewählte Punkte im Raum, zum Beispiel auf den Wänden und der Decke. Wenn diese Punkte in zwei seperaten Messungen gemessen werden, vielleicht sogar von unterschiedlichen Geräten, soll hinterher herausgefunden werden wie genau diese Punktwolken übereinstimmen. Dafür gibt es zwei grundsätzlich verschiedene Methoden. Diese sollen hier verglichen werden.

3d-models accuracy-metrics data-analysis data-visualization kaggle measure-distance numpy point-cloud pointcloudprocessing punkte python science-research simulation statistics

Last synced: 11 Apr 2026

https://github.com/olob0/badwords-pt-br

💬 Wordlist com palavrões em pt-BR para análise de dados, filtros, ou texto considerado "evitável"

badword-filter badwords brasil data-analysis filter filter-lists filterlist portugues portuguese text-analysis wordlist

Last synced: 06 Jan 2026

https://github.com/ayu-hack/ayu-hack

Enthusiastic learner passionate about building software and exploring the world of technology. Eager to contribute to open-source projects and collaborate with the developer community. Continuously developing my skills in Python,SQL,HTML,CSS,PowerBI, MacOS. Always open to feedback and excited to keep growing!

config css data-analysis github-config html powerbi-desktop python3 sql

Last synced: 30 Apr 2026

https://github.com/mxagar/airbnb_data_analysis

An analysis of the AirBnB dataset from Euskadi / the Basque Country.

airbnb data-analysis data-science eda feature-engineering modeling pandas regression

Last synced: 25 Apr 2026

https://github.com/dual-points/dplearn

A Python package for data analysis.

data-analysis data-science python python-package

Last synced: 16 Oct 2025

https://github.com/mehulcode12/atliq-bank_creditcard_transaction_analysis

The credit card project at Atliq Bank comprises two key phases: market identification and trial. This initiative aims to leverage mathematical and statistical concepts to analyze data related to demographics, income, credit scores, and spending patterns in order to identify the target audience for the credit card.

codebasics data-analysis data-science data-visualization mathematics python python3 statistics

Last synced: 30 Apr 2026

https://github.com/chen0040/pyspark-advanced-algorithms

Samples of Advanced Algorithms and Data Analysis implemented in pyspark

advanced-algorithms data-analysis map-reduce pyspark

Last synced: 12 Jan 2026

https://github.com/bishtrishu/pizza_sales_data_analysis_sql

This project is a comprehensive data analysis of pizza sales, aimed at uncovering key insights and trends to inform business decisions. Using a combination of SQL, Python, and data visualization tools, the project analyzes sales data to understand customer preferences, peak sales periods, and the most popular pizza types.

cloud data data-analysis data-science data-visualization dataanalytics database mysql oracle-database

Last synced: 14 Apr 2026

https://github.com/realorangeone/docker-cyberchef

A containerized deployment of CyberChef, with additional protections

cyberchef data-analysis data-manipulation docker encoding

Last synced: 24 Aug 2025

https://github.com/jiachengwang-punch/predictive-analytics-skill

A reusable, multi-model, language-adaptive methodology for end-to-end machine learning analysis of tabular data.

claude-skill codex-skill data-analysis data-science deepseek feature-engineering lightgbm llm machine-learning methodology prompt-engineering tabular-data

Last synced: 30 May 2026

https://github.com/lightbridge-ks/zoominterface

A data analysis Shiny app of program Zoom report files.

data-analysis r shiny-apps zoom-class zoom-meetings

Last synced: 01 Jun 2026

https://github.com/gallillio/unsupervised_clustering_music_recommendation_system

Music Recommendation System using Unsupervised Machine Learning Clustering Methods using K-Means, Fuzzy C Mean DBSCAN, Gaussian Mixture Model, BIRCH and Agglomerative Clustering

affinity-propagation agglomerative-clustering birch-clustering data-analysis data-visualization dbscan-clustering fuzzy-cmeans-clustering gaussian-mixture-models k-means-clustering pca unsupervised-machine-learning

Last synced: 19 Oct 2025

https://github.com/ifibla/adsdb-project

Algorithms, Data Structures and Databases Project

data-analysis data-engineering python

Last synced: 12 Apr 2026

https://github.com/0xpr03/clantool

CF Management & Data Analysis Tool, crawler backend in rust

backend-server crawler data-analysis rust

Last synced: 05 Feb 2026

https://github.com/ivanildobarauna-dev/api-to-dataframe

Python library that simplifies obtaining data from API endpoints by converting them directly into Pandas DataFrames. This library offers robust features, including retry strategies for failed requests.

data-analysis data-analytics data-engineering library pypi-packages python

Last synced: 06 Mar 2025

https://github.com/com-480-data-visualization/project-2023-the-vizards

Lausanne Transportation : a data visualization of the Lausanne Transportation network. Developed by the Vizards team as part of the EPFL Data Visualization course project (COM-480).

buses data-analysis data-science data-visualization epfl lausanne map metro public-transport public-transportation switzerland webgl

Last synced: 01 May 2026

https://github.com/alicankaya192/ai-jobs-market-2025-2026-salaries

🤖 Global AI & LLM jobs market analysis (2025–2026). Salary trends, remote work premiums, top paying skills, and LLM engineering vs traditional AI comparisons. 📈

ai-jobs data-analysis data-science data-visualization eda exploratory-data-analysis generative-ai jobs jupyter-notebook llm-learning market-analysis matplotlib pandas salary-analysis statistics

Last synced: 21 Jun 2026

https://github.com/renanmoliveir/analise_de_dados_bikestore_power-bi_atualizan-o

Projeto de análise de dados do banco de dados Bike Store com Power BI.

data-analysis dax-languague powerbi query

Last synced: 15 Mar 2026

https://github.com/emso-exe/reclamacoes_de_consumidores_com_empresa_de_telecomunicacoes

Projeto de análise de reclamações de consumidores com empresa de telecomunicações no 1º semestre de 2021 com base nos dados do site consumidor.gov.br.

analise-de-dados ciencia-de-dados data-analysis data-science datascience python python-3 python3

Last synced: 02 May 2026

https://github.com/rkirlew/workoutrecommendationsdataset

This repository contains a synthetic dataset designed for building personalized workout recommendation models. The data is generated for educational and experimental purposes, allowing users to practice machine learning techniques such as classification, k-NN, and clustering, as well as explore fitness-related data analysis.

classification data-analysis dataset k-nearest-neighbours machine-learning

Last synced: 08 May 2026

https://github.com/cano1998/eda-survival-of-the-titanic

This project focuses on Exploratory Data Analysis (EDA) to identify the key determinants that influenced survival during the infamous Titanic accident.

data-analysis data-cleaning data-preprocessing data-visualization exploratory-data-analysis jupyter-notebook titanic-survival-exploration

Last synced: 21 Jun 2026

https://github.com/atymri/linqsimulator

LINQ Simulator is an interactive C# console application designed to let you experiment with LINQ queries in real time.

console csharp data data-analysis linq query sql

Last synced: 23 Oct 2025

https://github.com/prime-infinity/type-one

Software to visualize and analyze GitHub repos based on certain statistics such as stars, forks and issues

data-analysis data-visualization

Last synced: 03 Feb 2026

https://github.com/as16082023/atliq-hospitality-analysis

This project presents an overview of AtliQ Grands' performance in the hospitality industry using Power BI.

atliqgrand codebasicsresumeprojectchallenge data-analysis data-visualization powerbi revenueinsights

Last synced: 23 Jan 2026

https://github.com/mgobeaalcoba/matplotlib_y_seaborn

Aquí dejaré trabajos de visualización realizados con ambas librerías de Python.

data-analysis data-science data-visualization dataset matplotlib numpy pandas python seaborn

Last synced: 13 Apr 2026

https://github.com/parmeetbhamrah/air-quality-india-analysis

Exploratory data analysis of real-time air quality data from Indian cities using Python, Pandas, Matplotlib, and Seaborn.

air-quality data-analysis eda exploratory-data-analysis government-data india matplotlib numpy pandas python seaborn

Last synced: 05 May 2026

https://github.com/cs-joy/pandasv2.0.3

learn data analysis with pandas

data-analysis pandas pandas-learning

Last synced: 03 May 2026

https://github.com/discdiver/new-belgium-ratings

Find the most popular New Belgium beers of all time!

beautifulsoup data-analysis pandas python seaborn webscraping

Last synced: 10 Apr 2026

https://github.com/manikantasanjay/time_series_data_analysis_on_stocks

Time Series Data Analysis project on Daily Stock Prices of the following companies(Apple, Microsoft, Google, Amazon) for a span of 5 years.

data-analysis pandas stock time-series time-series-analysis

Last synced: 03 May 2026

https://github.com/antonijn/polyfit

Fits a polygon to a given data input

c data-analysis linear-algebra toy

Last synced: 16 Jul 2025

https://github.com/mohamed3nan/udacity

Udacity Data Analysis Nanodegree Program

data-analysis data-visualization numpy pandas python

Last synced: 10 Apr 2026

https://github.com/ivanildobarauna-dev/currency-quote

Complete solution for extracting currency pair quotes data with comprehensive testing, parameter validation, flexible configuration management, Hexagonal Architecture, CI/CD pipelines, code quality tools, and detailed documentation.

data-analysis data-analytics data-engineering library pypi-packages python

Last synced: 27 Oct 2025

https://github.com/sustentarea/gs-data-analysis-report-3

📓 Exploring potential associations between childhood undernutrition and the Standardized Precipitation Evapotranspiration Index (SPEI) in Brazilian municipalities (2008–2019)

brazil climate-change data-analysis data-science food-systems global-syndemic ibge malnutrition nutrition obesity r rstats sisvan spei sustainable-eating wasting worldclim

Last synced: 27 Oct 2025

https://github.com/ruchit0807/heart_disease_prediction

An interactive ML-powered web app that predicts the risk of heart disease based on clinical inputs like age, chest pain, cholesterol, ECG, and more. Built using Python, Streamlit, and scikit-learn, it offers early risk assessment in a simple and accessible way—just enter your health metrics and get instant feedback.

data-analysis data-science knn-regression pandas streamlit

Last synced: 04 May 2026