An open API service indexing awesome lists of open source software.

Data analysis

Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.

https://github.com/renanmoliveir/analise_de_dados_bikestore_power-bi_atualizan-o

Data analysis project on the Bike Store database using Power BI.

data-analysis dax-languague powerbi query

Last synced: 15 Mar 2026

https://github.com/as16082023/restaurant-order-analysis

Analyzing order data to identify the most and least popular menu items and types of cuisine

data-analysis maven-analytics mysql restaurant-order sql

Last synced: 10 Apr 2025

https://github.com/ayaanjawaid/google_playstore_data_analysis

This project provides an in-depth analysis of Google Play Store apps and user reviews, focusing on understanding app performance, user sentiment, and key trends in app categories. Using Python, I performed data cleaning, feature engineering, and exploratory data analysis (EDA) on app data and reviews.

data-analysis eda html numpy pandas-dataframe plotly python vizualisation

Last synced: 24 Feb 2026

https://github.com/tks18/pyquery

PyQuery is a local-first data operating system built on lazy execution that processes 100GB+ files while you doomscroll. No cap. 🧢

data-analysis data-science etl hdfs parquet pipeline polars python

Last synced: 14 Jan 2026

https://github.com/rara-ch/data-analysis-portfolio

This repository stores my data analytics projects, showcasing my skills in SQL and Python.

data-analysis mathematics matplotlib numpy pandas portfolio probability python seaborn sql statistics

Last synced: 12 Mar 2025

https://github.com/gattiharishkumar/blinkit-sales-analysis-dashboard

This project presents a comprehensive sales analysis dashboard for Blinkit, an Indian last-minute delivery app. The dashboard was created using Power BI and provides a detailed overview of the company's sales performance across various outlets and product categories.

dashboard data-analysis data-transformation data-visualization ms-excel-data-analytics power-query powerbi powerbi-visuals

Last synced: 19 Mar 2026

https://github.com/aadityatamrakar/futures_spread_chart

Cash Market & Futures Daily Spread Chart - NSE Stocks

data data-analysis data-mining expressjs nodejs requests

Last synced: 10 Apr 2026

https://github.com/parmeetbhamrah/air-quality-india-analysis

Exploratory data analysis of real-time air quality data from Indian cities using Python, Pandas, Matplotlib, and Seaborn.

air-quality data-analysis eda exploratory-data-analysis government-data india matplotlib numpy pandas python seaborn

Last synced: 05 May 2026

https://github.com/cano1998/eda-survival-of-the-titanic

This project focuses on Exploratory Data Analysis (EDA) to identify the key determinants that influenced survival during the infamous Titanic accident.

data-analysis data-cleaning data-preprocessing data-visualization exploratory-data-analysis jupyter-notebook titanic-survival-exploration

Last synced: 22 Oct 2025

https://github.com/jeniljani-4444/end-to-end-world-cup-analysis-web-app

Our streamlined Streamlit web app fetches and processes ESPN CricInfo data, delivering dynamic graphs for a quick and engaging cricket experience. Deployed on AWS EC2 with CI/CD pipelines.

aws-ec2 data-analysis plotly preprocessing streamlit-webapp

Last synced: 02 Apr 2025

https://github.com/bcko/ud-da-stroopeffect

Udacity Data Analyst Nanodegree Project: Test a Perceptual Phenomenon (Stroop Effect)

data-analysis data-analyst-nanodegree stroop-effect udacity udacity-data-analyst-nanodegree

Last synced: 04 Jul 2025

https://github.com/ajimaulana123/e-commerce-data-analis

Analysis of an e-commerce dataset to determine which products are purchased most by customers

data-analysis

Last synced: 28 Apr 2026

https://github.com/lmuffato/analise-de-diarias-prefeituas-do-es

This code is part of a project to discover and combat corruption schemes by processing and cross-referencing open data made available by various city halls in Espírito Santo through the transparency portal. It joins and analyzes several tables imported from CSV.

data-analysis personal-project r rstudio

Last synced: 12 Jun 2025

https://github.com/jatin-mehra119/bike-rentals-dataset

This repository focuses on optimizing bike rental availability during peak hours and days using machine learning techniques. Leveraging publicly available data from the UCI Machine Learning Repository, it includes scripts for data preprocessing, model training, and visualization, along with detailed observations and results.

data-analysis data-science ensemble-model pandas scikitlearn-machine-learning

Last synced: 15 Apr 2026

https://github.com/markmusic27/data-statistics-calculator

💣 This method (made in JavaScript / Python) can find the mean, median, mode, range, and standard deviation.

data-analysis standard-deviation statistics statistics-calculator

Last synced: 23 Oct 2025

https://github.com/atymri/linqsimulator

LINQ Simulator is an interactive C# console application designed to let you experiment with LINQ queries in real time.

console csharp data data-analysis linq query sql

Last synced: 23 Oct 2025

https://github.com/anurag-kumar-molankala/blinkit-grocery-sales-dashboard

The BlinkIT Grocery Sales Dashboard is an interactive Power BI dashboard that provides insights into grocery sales performance. It includes key KPIs, sales trends, and outlet performance analysis.

business-intelligence dashboards data-analysis data-visualization dax excel kpi-dashboard power-bi powerquery slicers-kpi-card-multirow-card sql-server ssis ssms ssrs-reports

Last synced: 09 Apr 2025

https://github.com/viper373/163-buff

Scrapes CS:GO weapon skin trading data from the NetEase BUFF platform

163 arima crawler-python csgo data-analysis prediction python

Last synced: 24 Oct 2025

https://github.com/colburncodes/se_pudding_2023

This project is a React app designed to showcase research conducted by a team of data scientists and data analysts. The app uses React and React-Chartjs-2.

chartjs-2 data-analysis data-science data-visualization react-chartjs-2 reactjs

Last synced: 11 May 2026

https://github.com/mr-vozhyk/test-tasks

Completed test assignments (those not prohibited from publication)

analysis data-analysis digital-analysis e-commerce excel google-colab google-sheets marketing marketplace power-bi power-query python sql

Last synced: 07 May 2026

https://github.com/paezha/isdas

Companion package for An Introduction to Spatial Data Analysis and Statistics with R

data-analysis gis rstats spatial-analysis spatial-statistics

Last synced: 04 Jan 2026

https://github.com/sivas-2/food-demand

This project aims to predict food demand based on historical data, leveraging various statistical methods to achieve accurate forecasts.

data-analysis data-science dataanalysis food-demand-forecasting statistics

Last synced: 12 Aug 2025

https://github.com/tunjis/global-superstore_dashboard_tableau

Tableau dashboard with 4 different types of visualisations

charts dashboard data-analysis data-visualisation excel tableau

Last synced: 23 Jan 2026

https://github.com/lmuffato/project-ting-trybe

Ting project - Trybe graded project for Block 37: Data Structures II: Lists, Queues, and Stacks

data data-analysis python queue read-file stack trybe trybe-projects

Last synced: 12 Jun 2025

https://github.com/nafisalawalidris/sales-performance-dashboard

Sales Performance Dashboard: Analyze and visualize sales data using Power BI. Gain insights into trends, customer segments, product performance, and geographic distribution. Make data-driven decisions to optimize sales strategies and maximize revenue.

analytics-revenue dashboard-power-bi data data-analysis intelligence-sales optimization performance sales visualization-business

Last synced: 03 Feb 2026

https://github.com/lmuffato/dados-meteorologicos-inmet-tratamento

Processing and analysis of meteorological data from local stations provided by INMET, using the R language

data-analysis personal-project r rstudio

Last synced: 12 Jun 2025

https://github.com/akash1070/data-science-virtual-internship-by-anz

Exploratory data analysis and prediction of annual salary for customers from the dataset provided by ANZ.

data-analysis data-science predictive-analytics presentation-slides

Last synced: 24 Mar 2025

https://github.com/RickContreras/StudentPerformancePredictionSaberPro

Classification model to predict student performance on the Saber Pro tests in Colombia. Includes exploratory data analysis, preprocessing, and machine learning models.

classification colombia data-analysis data-science education educational-assessment exploratory-data-analysis jupyter-notebook machine-learning python saber-pro scikit-learn student-performance

Last synced: 24 Oct 2025

https://github.com/as16082023/atliq-hospitality-analysis

This project presents an overview of AtliQ Grands' performance in the hospitality industry using Power BI.

atliqgrand codebasicsresumeprojectchallenge data-analysis data-visualization powerbi revenueinsights

Last synced: 23 Jan 2026

https://github.com/santiagohermo/data.tools

Convenience functions for data manipulation

data-analysis data-cleaning data-science data-wrangling

Last synced: 24 Oct 2025

https://github.com/ryanfranklin237/data-cleansing

A group of python scripts that clean large data sets by removing duplicate data, putting data in correct formats, and removing redundant cells

data-analysis data-cleaning data-science extract-transform-load pandas-dataframe python

Last synced: 28 Feb 2025

https://github.com/dcs-training/null-hypothesis-testing-with-r

This two-class course will focus on developing theoretical and practical skills for null hypothesis testing in R. Go to the readme file

data-analysis data-wrangling r statistics

Last synced: 24 Oct 2025

https://github.com/ayenpure/stockmeup

This is a class project for 'CIS 610 : Data Science' where I try and validate Stock Market recommendations.

data-analysis data-mining data-science java mapreduce mapreduce-java

Last synced: 24 Oct 2025

https://github.com/rkirlew/workoutrecommendationsdataset

This repository contains a synthetic dataset designed for building personalized workout recommendation models. The data is generated for educational and experimental purposes, allowing users to practice machine learning techniques such as classification, k-NN, and clustering, as well as explore fitness-related data analysis.

classification data-analysis dataset k-nearest-neighbours machine-learning

Last synced: 08 May 2026

https://github.com/airdac/sim-telco_customer_churn

Prediction of customer churn with logistic regression in R. Team project from UPC's Master's Degree in Data Science

classification data-analysis data-science logistic-regression r statistical-models upc

Last synced: 28 May 2026

https://github.com/allanotieno254/british-airways-review-dashboard-using-tableau

An interactive dashboard built using Tableau, visualizing British Airways customer review data. This project analyzes customer sentiment, satisfaction levels, and other key metrics to provide insights into passenger experiences and identify improvement areas for British Airways.

customer-satisfaction-analysis data-analysis data-visualisation tableau-dashboards tableau-public

Last synced: 19 Mar 2026

https://github.com/alphonsg/bimana

Package for performing automated bio-image analysis tasks.

bioimage-analysis bioinformatics data-analysis deep-learning edge-detection image-analysis image-processing

Last synced: 25 Oct 2025

https://github.com/nirmit27/book-recommender-system

This is a book recommendation system built with Flask, using a memory-based, item-based collaborative filtering model.

data-analysis data-science flask python python3 recommender-system render

Last synced: 05 May 2026

https://github.com/jabhij/fbi_nics-firearm-background-checks

This project is an attempt to showcase the use of guns across the US.

data-analysis data-analytics data-science data-visualization tableau

Last synced: 23 Feb 2026

https://github.com/guruakaashjn/te_project_microsoft_ai

Statistical analysis of land-use plastic pollution in India using AI/ML techniques.

artificial-intelligence data-analysis data-analytics data-science data-visualization machine-learning powerbi

Last synced: 27 Feb 2025

https://github.com/mkk-1817/adhd-prediction

This project focuses on leveraging machine learning techniques to predict Attention-Deficit/Hyperactivity Disorder (ADHD) in children. Accurate and early diagnosis is crucial for effective intervention and support.

adhd data-analysis data-science jupyter-notebook machine-learning machine-learning-algorithms prediction python

Last synced: 09 May 2026

https://github.com/albertomorini/policesviolence

Repository for the project of the course Data Science (Fondamenti di Scienza dei Dati) at UniUD.

data-analysis data-science data-visualization r

Last synced: 31 May 2026

https://github.com/priyanshubiswas-tech/pwc-power-bi-task-1-2

Power BI dashboards analyzing Phonenow's call center performance and customer retention. Task 1 focuses on KPIs like satisfaction rating, call count, and agent efficiency. Task 2 analyzes retention trends and customer behavior to enhance loyalty. Built using Power BI, DAX, and Excel.

dashboard data data-analysis dax-measures excel powerbi powerbidashboard

Last synced: 23 Jan 2026

https://github.com/mohamed3nan/udacity

Udacity Data Analysis Nanodegree Program

data-analysis data-visualization numpy pandas python

Last synced: 10 Apr 2026

https://github.com/soumya-thoutam/revenue-and-demand-forecasting-analysis

End-to-end data analysis project with SQL and Power BI for revenue and demand forecasting in bike-sharing.

data-analysis data-visualization powerbi sql sql-server

Last synced: 16 Mar 2025

https://github.com/gustavo-zamai/shop_data_analisys

Analysis of sales at different shopping malls

data-analysis openpyxl pandas python3 pywin32

Last synced: 01 Mar 2025

https://github.com/amstuta/cpp-neural-network

Simple implementation of a feedforward neural network in C++

data-analysis deep-learning machine-learning neural-network

Last synced: 08 Apr 2025

https://github.com/carusel02/sequential-data-processing-and-analysis

Sequential data processing and analysis using a linked list in C

data-analysis data-processing linked-list

Last synced: 26 Oct 2025

https://github.com/unnatmalik/dattavism-ai-powered-data-insight-generator-

Dattavism is an AI-powered data insight platform that transforms raw CSV files into comprehensive, contextualized reports, complete with visualizations, statistical summaries, and natural language insights. Dattavism is designed to handle datasets across diverse domains. It is built using Python, Streamlit, Gemini API, Pandas, Matplotlib, NumPy,

data-analysis python streamlit

Last synced: 24 Jul 2025

https://github.com/abdul-wahab-318/pakistani-news-sentiment-analysis

This project involves performing sentiment analysis on Pakistani news articles collected over the past month (August-September 2024). The primary goal is to understand media sentiments regarding various topics and events covered in the news. A total of 800+ articles were scraped from multiple news sources.

data-analysis machine-learning pakistan pakistani-politics sentiment-analysis

Last synced: 26 Oct 2025

https://github.com/jpotter80/notebook-examples

This repository demonstrates a systematic approach to cleaning and standardizing e-commerce product data using DuckDB. The notebook serves as a detailed walkthrough of our data cleaning methodology, showcasing how we handle common data quality challenges in e-commerce datasets.

data-analysis data-cleaning jupyter-notebook

Last synced: 12 Jun 2025

https://github.com/2kabhishek/pyramen

Data Analysis for Ramen 🍜💹

csv data data-analysis fun python report

Last synced: 26 Oct 2025

https://github.com/jinkogule/multi-analyst

Multi Analyst is an easy-to-use data analysis tool that uses artificial intelligence to interpret the results of the analyses performed, returning useful insights to users.

apriori-algorithm bootstrap css data-analysis django html numpy open-ai pandas python web-application

Last synced: 12 Apr 2026

https://github.com/gustavo-zamai/product_return_data_analysis

Analysis of product returns across different stores

data-analysis excel pandas plotly-express python3 pywin32

Last synced: 13 May 2026

https://github.com/vatshayan/hospital-discharge-analysis

Analysis of hospitalization discharge rates in Lake County, Illinois, across attributes such as anxiety, alcohol, mood, diabetes, asthma, etc.

data-analysis data-visualization jupyter-notebook machine machine-learning machine-learning-algorithms scikit-learn

Last synced: 04 Mar 2025

https://github.com/gholamrezadar/favourite-youtube-channels

This program goes through your YouTube watch history and sorts channels based on how many of their videos you have watched!

data-analysis data-visualization python

Last synced: 16 Jan 2026

https://github.com/ivanildobarauna-dev/currency-quote

Complete solution for extracting currency pair quotes data with comprehensive testing, parameter validation, flexible configuration management, Hexagonal Architecture, CI/CD pipelines, code quality tools, and detailed documentation.

data-analysis data-analytics data-engineering library pypi-packages python

Last synced: 27 Oct 2025

https://github.com/programmer-rd-ai/moviedatascraper

Explore the cinematic universe with our IMDb web scraping project! Dive into movie data with ease, uncovering insights from cast to critical reviews. With dynamic visualizations and reliable data, let's journey through the world of movies like never before. Lights, camera, analysis!

beautifulsoup beautifulsoup4 data data-analysis jupyter-notebook matplotlib numpy pandas programming python python3 scraping seaborn software web

Last synced: 01 Mar 2025

https://github.com/sarathchandranpm/restaurant_order_analysis

This project entails an in-depth analysis of a restaurant's order and menu data. The focus is on exploring customer ordering behaviors, menu item attributes, and order specifics. By investigating the connections between order details, menu items, and order dates, the project seeks to generate valuable insights into the restaurant's operations.

data-analysis mysql sql

Last synced: 10 Apr 2025

https://github.com/foggy-projects/foggy-data-mcp-bridge

MCP Data Bridge for Java. Enabling safe Text-to-Query via a semantic layer, making enterprise data accessible to AI Agents.

agent data-analysis java llm mcp semantic-layer spring-boot text-to-sql

Last synced: 16 Mar 2026

https://github.com/rcv911/lyapunov-indicators

Calculating Lyapunov indicators with multiprocessing in Python

data-analysis lyapunov lyapunov-indicators multiprocessing

Last synced: 18 Jan 2026

https://github.com/sustentarea/gs-data-analysis-report-3

📓 Exploring potential associations between childhood undernutrition and the Standardized Precipitation Evapotranspiration Index (SPEI) in Brazilian municipalities (2008–2019)

brazil climate-change data-analysis data-science food-systems global-syndemic ibge malnutrition nutrition obesity r rstats sisvan spei sustainable-eating wasting worldclim

Last synced: 27 Oct 2025

https://github.com/shadowk29/cusumtools

An eclectic collection of python scripts I have found to be useful in processing nanopore data

data-analysis data-visualization time-series-analysis

Last synced: 16 Mar 2026

https://github.com/dcs-training/good-data-visualisation-with-r

Our guide on how we create data visualisations through R. Go to the readme file

data-analysis data-visualisation r rmarkdown

Last synced: 16 Jun 2026

https://github.com/abhik1711/material-classification-and-energy-band-prediction---excavate-25

A Two-Stage Machine Learning Pipeline: A Binary Classifier to identify insulators with high accuracy and a Stacking Regressor to predict precise band gap values for insulators by leveraging advanced feature engineering techniques and ensemble learning methods

data-analysis machine-learning python

Last synced: 04 Sep 2025

https://github.com/yonatanadam/film-success-prediction

Analyzing Hollywood movie success based on genre, target audience, and runtime using machine learning

data-analysis ipynb machine-learning

Last synced: 25 Jan 2026

https://github.com/quantitext/quantitext

Official repository for QuantiText applications in the .NET ecosystem.

api aspnet-core csharp data-analysis dotnet-core mvc-architecture

Last synced: 30 Mar 2025

https://github.com/5ekastanx/data-analysis

Extracting data by parsing (for example, in a hacking-like fashion) with Python, using all sorts of functions and methods

data-analysis html python

Last synced: 14 Mar 2025

https://github.com/programmer-rd-ai/dimensionality-reduction

DimRed is a comprehensive Python toolkit for advanced dimensionality reduction, integrating with major machine learning libraries and featuring real-time performance monitoring to enhance data analysis and model efficiency.

analytics data-analysis data-science lightgbm machine-learning matplotlib numpy pandas programming python python3 sklearn university xgboost

Last synced: 01 Mar 2025

https://github.com/mxagar/airbnb_data_analysis

An analysis of the AirBnB dataset from Euskadi / the Basque Country.

airbnb data-analysis data-science eda feature-engineering modeling pandas regression

Last synced: 25 Apr 2026

https://github.com/jossimmar/ensa-ss25

Repository for managing consumption data of the Major Customers (Clientes Mayores) of ENSA, part of Grupo Distriluz.

data-analysis electrical-engineering python sqlite

Last synced: 30 Mar 2025

https://github.com/jcaperella29/stock_evaluation_python

A Python script to classify companies based on financial metrics like Piotroski F-Score and Stock Valuation, using CSV financial data for analysis and output.

ai-in-finance artificial-intelligence classification csv-processing data-analysis expert-system finance financial-analysis financial-analysis-tools piotroski-f-score python quantitative-analysis rule-based-classifier stock-analysis stock-valuation

Last synced: 07 Sep 2025

https://github.com/oguzgn/fully-automated-performance-marketing-dashboard

This project integrates data from multiple ad platforms with Google Analytics to track marketing campaigns. It uses a structured naming system and UTM tags. Data is visualized in Looker Studio dashboards to analyze campaign performance and ad spend.

bigquery data-analysis data-engineering data-modeling marketing-analytics marketing-automation marketing-data-science marketingdata sql

Last synced: 24 Mar 2025

https://github.com/matthewgrosman/messenger-analytics

Project that ingests Facebook Messenger conversations and generates analytics.

analytics data-analysis excel facebook facebook-messenger java mongodb

Last synced: 15 Apr 2025

https://github.com/as16082023/music-store-analysis

This project involves analyzing music store data using SQL queries in MySQL workbench to enhance decision-making, identify trends, and understand customer behavior

data-analysis music-store-analysis mysql sql

Last synced: 06 Jul 2025

https://github.com/kentlouisetonino/sw-project-data-analysis

My project for AMA MATH 6200 course.

data-analysis python school-project

Last synced: 28 Feb 2025

https://github.com/vipul2001/cousera-courses

This repo covers solutions to the assignments of various courses on algorithms, deep learning, and data analytics

coursera-courses data-analysis data-analytics-ibm deep-learning divide-and-conquer neural-network

Last synced: 29 May 2026

https://github.com/ivanildobarauna-dev/api-to-dataframe

Python library that simplifies obtaining data from API endpoints by converting them directly into Pandas DataFrames. This library offers robust features, including retry strategies for failed requests.

data-analysis data-analytics data-engineering library pypi-packages python

Last synced: 06 Mar 2025

https://github.com/whis99/userfunnelanalysis

An e-commerce user funnel conversion analysis with Matplotlib and Python.

data-analysis data-analysis-python data-analyst data-visualization google-colab jupyter-notebook matplotlib python

Last synced: 13 Apr 2026

https://github.com/gorhkdwj/da_portfolio

Kim Jae Chun's DA_Portfolio

data data-analysis python sql

Last synced: 20 Feb 2026