An open API service indexing awesome lists of open source software.

Data analysis

Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.

https://github.com/cai-lab-at-university-of-michigan/ncorrect

A toolkit for the correction and normalization of SWC files from neuron morphology experiments.

data-analysis neuron-morphology swc

Last synced: 05 May 2026

https://github.com/andreaschandra/who-suicides-statistics

Exploratory Data Analysis for Suicides using Python

data-analysis data-science eda python

Last synced: 27 Apr 2026

https://github.com/invictusaman/socioeconomic-indicators-in-chicago-sql-python

This project displays how to create a database connection in notebook, update database using python and how to run Python program and SQL queries together. It uses SQLite and Chicago dataset for analysis.

data-analysis jupyter-notebook python sql sql-queries sqlite

Last synced: 12 Feb 2026

https://github.com/shivamswarnkar/tesla-stock-prediction

Making prediction of close prices of Tesla Stocks using different regression methods.

data-analysis data-visualization plotly regression regularization sklearn stock-price-prediction

Last synced: 05 May 2026

https://github.com/aisurjyasamantaray/sales-perfomance-analysis-dashboard

A comprehensive sales performance analysis dashboard built using Python, and visualization tools. This project includes data cleaning, descriptive statistics, correlation analysis, and insights into sales trends, profitability, and the impact of discounts. Key features include interactive visualizations using Seaborn, and Matplot

analytics annova data data-analysis data-visualization-project dataproject eda hypothesis-testing pandas-dataframe python sales-performance-analysis statistics

Last synced: 04 Apr 2026

https://github.com/billy-enrizky/kimia-farma-sales-management-database-replica-project

SQL Database Management, Then Visualizing it on Tableau!

analytics data-analysis data-visualization sql

Last synced: 27 Feb 2026

https://github.com/supertetelman/kaggle-public

A collection of Python and Matlab projects aimed at utilizing various machine learning techniques to solve big data problems.

cnn data-analysis deep-learning machine-learning matlab python

Last synced: 29 Apr 2026

https://github.com/ezzz-lui/rsm-evaluationproject

Este repositorio es donde esta documentado nuestro proyecto para RSM por parte de actividad final para el bootcamp Data Analyst

data-analysis python

Last synced: 13 Feb 2026

https://github.com/manmolecular/http-response-clustering

:chart_with_downwards_trend: Clustering of HTTP responses using k-means++ and the elbow method

data-analysis elbow-method elbow-plot jupyter k-means-plus-plus python3

Last synced: 29 Apr 2026

https://github.com/jku-vds-lab/marjorie

Marjorie is a web-based approach to visualize and explore patterns in type 1 diabetes data.

data-analysis diabetes pattern-recognition visualization

Last synced: 09 May 2026

https://github.com/abeltavares/nps_performance_analysis

Analyzing and Monitoring Net Promoter Score (NPS) Performance for Healthcare Companies using SQL and Power BI

customer-satisfaction dashboard data-analysis data-visualization healthcare net-promoter-score nps-analysis performance-monitoring power-bi sql

Last synced: 19 Mar 2026

https://github.com/sivkri/imagecoloranalysis

ImageColorAnalysis is a repository with a Python script for color analysis in images using ImageMagick. It generates bash scripts for individual JPG images to analyze specific colors. It provides a flexible solution for extracting color information from images, applicable in various domains such as image classification and data analysis.

bash-scripts color-analysis computer-vision data-analysis image-classification image-processing imagemagick pavement pavement-images python-scripting stomata stomatal-index

Last synced: 13 Feb 2026

https://github.com/ehtisham-sadiq/ai-pioneers-datascience-arena

This repository is dedicated to the AI Amigos team's participation in the Artificial Intelligence (AI) competition with a focus on Data Science.

artificial-intelligence competition data-analysis data-science data-visualization machine-learning model-building model-evaluation numpy pandas python3 supervised-learning unsupervised-learning

Last synced: 05 May 2026

https://github.com/dcs-training/introcausalinference

This is a repository for the Introduction to Causal Inference course provided by Chris Oldnall for the CDCS. Go to the readme file

data-analysis python r statistics

Last synced: 05 May 2026

https://github.com/GeiserX/secciones-nacionalidades

Foreign Insight - WebApp providing insights about nationalities in Spain (Source: Instituto Nacional de Estadística)

census-data dashboard data-analysis data-visualization demographics geospatial government-data immigration ine nationalities open-data population r self-hosted shiny shinydashboard spain spanish statistics webapp

Last synced: 08 Apr 2026

https://github.com/omarsar/energy_stats

Analyzing energy production with Kibana Lens

data-analysis data-science data-visualization elasticsearch kibana

Last synced: 29 Apr 2026

https://github.com/kaustubhgupta/data-analysis-hub

This is where all my Data Analysis notebooks are present. All the notebooks are either fully explored and have an explanatory readme or a medium article has been published which is linked in the README.

data-analysis data-science google-play-store kaggle matplotlib pandas seaborn

Last synced: 10 May 2026

https://github.com/gattiharishkumar/employee-attendance-leaves-analytics-dashboard

This project showcases a Power BI dashboard created to analyze employee attendance and leaves over a three-month period. The data was sourced from Excel datasets available on the Codebasics website.

dashboards data-analysis data-cleaning data-transformation data-visualization power-query-editor powerbi

Last synced: 19 Mar 2026

https://github.com/python-opendata-analysis/opendata-casebook

オープンデータや公的統計の分析・活用の事例とサンプルコードを公開しています。

data-analysis opendata python statistics

Last synced: 04 Apr 2026

https://github.com/mylethidiem/zero-to-hero

Project for learning, practicing code: Python, SQL, C/C++, Data science/Data Analysis, AI/Machine learning

ai cpp data-analysis data-science deep-learning machine-learning mlops python sql

Last synced: 02 Mar 2026

https://github.com/mk2112/minicorpus

Reproducing, then improving MiniPile with PyTorch and HuggingFace

data-analysis huggingface pytorch subset-construction subset-selection

Last synced: 20 Apr 2026

https://github.com/pitmonticone/covid-italy

References for COVID-19 situation in Italy.

coronavirus covid-19 covid-19-italy data data-analysis documentation testing

Last synced: 05 Apr 2026

https://github.com/cworld1/da-learning

Some notes and code about CWorld learning Data Analysis

data-analysis data-science jupyter-book jupyter-notebook python r

Last synced: 18 Apr 2026

https://github.com/uts58/international-student-job-insights-usa

Data-driven insights on job hunting for international students in the USA, analyzing listings, roles, and trends.

career-insights cpt data-analysis eb1 eb2 eb3 h1b handshake job-analytics job-trends jobs jupyter-notebook opt python work-visa

Last synced: 25 Apr 2026

https://github.com/mirokeimioniemi/optimizing-insulin-injection-timing

Data processing and analysis for "Determining the optimal timing for insulin injection to minimize glucose level variability after a meal in ideal conditions" - a research project for the IB Standard Level Mathematics Analysis and Approaches course inspired by my type 1 diabetes.

cgm data-analysis data-science dexcom dexcom-g6 diabetes exploration ib insulin insulin-timing international-baccalaureate mathematics optimization python type-1-diabetes

Last synced: 09 May 2026

https://github.com/shashankbansal6/signal-analysis-for-patient-monitoring

A reliable patient monitoring system which analyzes the correlated physiological signals collected from the patient's body, and generates alarms for abnormalities.

data-analysis patient-monitoring

Last synced: 18 Mar 2026

https://github.com/verbasik/yandex.practicum.datascience

Портфолио проектов Data Science, выполненных в рамках профессиональной переподготовки в Яндекс.Практикум. Включает исследования в области финансов, недвижимости, кинопроката и других, с использованием статистики, машинного обучения и анализа данных.

data-analysis data-science machine-learning yandex-praktikum

Last synced: 29 Jan 2026

https://github.com/sabujxi/python-scraper-and-data-analysts-admin-panel-in-django

A data scraper from texas govt site and a helping web app for managing, reviewing and editing the data

analyst data data-analysis data-entry data-scraper django django-application python python-scraper real-estate regex scraper texas

Last synced: 30 Apr 2026

https://github.com/freepicheep/nu-salesforce

A nushell module to interact with Salesforce data through the Salesforce REST API.

data-analysis nu nushell salesforce salesforce-api scripting shell

Last synced: 03 Mar 2026

https://github.com/idaraabasiudoh/vehicle-co2emission_model

Predicts CO2 emissions from vehicle fuel consumption using a multiple linear regression model trained on sklearn, based on a dataset of engine sizes and corresponding CO2 emissions in Canada.

data-analysis jupyter-notebook machine-learning python3 scikit-learn

Last synced: 06 May 2026

https://github.com/dain55788/ibm-data-engineer-lecture-note

Lecture Notes and Practice Materials of IBM Data Engineering Course

data-analysis database dataengineering datawarehouse ibm

Last synced: 01 Mar 2026

https://github.com/dcs-training/2024-11-18-cdcs-carpentry-social-sciences

This repo contains the material produced for a course run by the Centre in November 2024

data-analysis data-visualisation data-wrangling intro-to-programming r

Last synced: 14 Feb 2026

https://github.com/iantomasinicola/portfoliodataanalyst

Progetto di Data analysis con Python, Microsoft Sql Server e Excel

data-analysis excel python sql

Last synced: 12 May 2026

https://github.com/kalebers/data_streams_parametric_t-sne

Research for Parametric T-SNE in high to low dimensional data stream, published in 2021 by Kalebe Rodrigues Szlachta and Andre de Macedo Wlodkovski, oriented by Jean Paul Barddal, Computer Science graduation from Pontifical Catholic University of Parana (PUCPR)

classifier data-analysis data-science data-visualization machinelearning parametric parametric-tsne python tsne-algorithm tsne-visualization

Last synced: 11 Feb 2026

https://github.com/dcs-training/effectivedatavisualisation

This repository hosts the material connected to a training course developed by Dave Elsmore (Edina) for CDCS on good data visualisation. Go to the readme file

data-analysis data-visualisation data-wrangling python

Last synced: 11 Feb 2026

https://github.com/mrjxtr/tokyo_airbnb_analysis_project

Full project case study and analysis to show potential opportunities to start an AirBnb business in Tokyo, Japan.

data-analysis data-cleaning data-science data-visualization pandas python3

Last synced: 24 Feb 2026

https://github.com/alejandrodumas/traintestdiff

Explore the distribution of your train/validation/test datasets

data-analysis matplotlib pandas seaborn

Last synced: 28 Apr 2026

https://github.com/mahmoudparsian/data-management-for-business-analytics

Data Management for Business Analytics: This course focuses on database management systems and procedures with an emphasis on the design and development of efficient business information systems. MySQL is used to teach the basics of relational database systems, structures, and database queries by using SQL.

analytics business-analytics business-intelligence data-analysis data-visualization database mysql python-data-analysis relational-databases relational-model sql

Last synced: 26 Feb 2026

https://github.com/juliamanifolds/multivariatedataanalysis.jl

Multivariate data analysis using geometric algorithms made easy!

data-analysis geometric-algorithms julia multivariate-statistics

Last synced: 11 Feb 2026

https://github.com/karlyndiary/global-electronics-retailer-sales-and-customer-insights

Developed an analysis using Python, SQL, and Excel to examine sales and customer demographics for a Global Electronics Retailer. The findings aim to enhance business strategies and improve overall performance.

dashboard data-analysis data-cleaning-and-preprocessing data-pipeline data-visualization etl microsoft-excel microsoft-sql-server python sql

Last synced: 14 Feb 2026

https://github.com/vultair/vultair-platform

An automated tool for forensic investigations of social media accounts. Supports platforms like Facebook, Twitter, Instagram, Telegram, WhatsApp, etc.

android automation data-analysis data-parsing forensics-tools investigation social-media

Last synced: 03 Jun 2026

https://github.com/antononcube/wl-outlieridentifiers-paclet

Wolfram Language (aka Mathematica) paclet that provides outlier identifier functions.

data-analysis hampel outlier-detection outliers

Last synced: 20 Mar 2026

https://github.com/ishanoshada/lottery-predict

Predict lottery numbers with this Flask-powered web app! Upload Excel data, get real-time analysis, and see animated predictions. Try it now! 🎰

data-analysis flask lottery lottery-prediction machine-learning prediction prediction-model predictor python

Last synced: 28 Apr 2026

https://github.com/zevnda/browser-history-extractor

A simple TypeScript tool for extracting and analyzing browser history data from Firefox and Chrome browsers

browser-history chrome data-analysis firefox

Last synced: 25 Apr 2026

https://github.com/tushar2704/stats-mosaic-streamlit

Stats-Mosaic-Streamlit is a comprehensive GitHub repository that aims to provide a growing collection of curated content and projects centered around statistics and its intersection with data science, machine learning, and artificial intelligence.

artificial-intelligence bivariate-analysis data-analysis data-science hypothesis-testing machine-learning statistical-learning statistics streamlit streamlit-tushar2704 univariate-analysis

Last synced: 28 Apr 2026

https://github.com/w-edward/youtube-keyword-popularity-analyzer

An effort to discover the top trending keywords on Youtube.

data-analysis node-js numpy python webscraping youtube-api

Last synced: 15 Apr 2026

https://github.com/labrinyang/apple-health-analysis

Mayo Clinic-grade Apple Health data analysis — Claude Code skill with 20 peer-reviewed statistical methods and 35+ SVG visualizations

apple-health cgm claude-code claude-skill data-analysis health-analytics heart-rate statistical-methods

Last synced: 19 Apr 2026

https://github.com/zelosleone/finncorr

A .NET Core financial analysis tool/API for calculating correlations between time series data with interactive visualizations powered by ML.NET and Plotly.js.

aspnet-core correlation-analysis csv-parser data-analysis dotnet financial-analysis machine-learning ml-net plotly rest-api statistical-analysis swagger time-series visualization

Last synced: 09 Feb 2026

https://github.com/sidhuk/metaboheatmap

A R/Shiny based app for visualizing metabolomics data through heatmaps

data-analysis data-visualization heatmap metabolomics shiny

Last synced: 26 Feb 2026

https://github.com/quantumudit/analyzing-yell-cafes

This project focuses on scraping data related to cafes and coffee shops in London, England from the Yellow Pages (Yell.com) website; performing necessary transformations on the scraped data and then analyzing & visualizing it using Jupyter Notebook and Power BI.

data-analysis data-science data-transformation data-visualization etl jupyter-notebook power-bi python webscraping

Last synced: 02 May 2026

https://github.com/nafiealhilaly/analyze-coderhub-sa

A simple web app to analyze/explore coderhub.sa API data, this project was my first real react app.

backend data-analysis eda frontend python react reactjs

Last synced: 19 Apr 2026

https://github.com/praveendecode/product_sentiment_analysis

This project employs NLTK, Prowebscraper, and Python for sentiment analysis on online product reviews. Through web scraping, EDA, and NLP, it evaluates user satisfaction by comparing actual ratings and sentiment scores

data-analysis data-visualization natural-language-processing nltk-python product-analysis python sentiment-analysis

Last synced: 03 May 2026

https://github.com/alexandregazagnes/unilasalle-public-resources

UniLaSalle-Public-Ressources : This public repository contains the notebooks and the data used for both : 2nd Year - Practical Statistical Tests 4th Year - Data Analysis with Python

data data-analysis data-analytics data-cleaning data-storytelling education educational exploratory-data-analysis python python3 r r-programming rstudio statistics visualization

Last synced: 28 Apr 2026

https://github.com/rajshrestha86/police-brutality-data-analysis

In this project, we analyze the events after George Floyd’s death. The protests and riots across the United States and sentiments of news articles of three different news sources that have different political leaning. We will see how these media reacted after Floyd’s death and see the effect of media bias on the sentiments of news for #BlackLivesMatter and #AllLivesMatter movement. We will also see if there is a correlation between the police budget and the number of protests. This analysis will help us to see if there is really a need for defunding police to reduce police brutality and casualties. We will also see the correlation of partisan segregation and number of deaths to see if political preference has an effect on the number of deaths by police.

data-analysis matplotlib pandas python sentiment-analysis web-scraping

Last synced: 17 Apr 2026

https://github.com/seyedhosseinzadeh/ws_tm

Weather web scraping and Time series model to predict temperature, humidity and barometer

data-analysis deep-learning lstm-model machine-learning prediction prediction-model weather web-scraping

Last synced: 10 Jun 2026

https://github.com/tddschn/whatsapp-chat-analyze

Command Line Tool to Generate Pretty Charts from Whatsapp Exported Chats

data-analysis data-visualization plotly python whatsapp whatsapp-data

Last synced: 04 Jun 2026

https://github.com/juliasouz/dashboard-vendas

Dashboard interativo de vendas do Xbox Game Pass, criado no Excel para análise e visualização de dados de assinaturas.

business-intelligence dashboard data-analysis excel sales-data visualizacao-de-dados xbox-game-pass

Last synced: 31 Jan 2026

https://github.com/worst001/note_machine_learning

整理了机器学习相关资料与手册,包括数学基础、机器学习模型实现示例、神经网络。

ai data-analysis deep-learning development guide learning machine-learning markdown mkdocs note notebook

Last synced: 08 May 2026

https://github.com/willie-conway/datavista-command-line-application

A robust 🐍Python application for data analysis that provides a wide range of tools for 🔃loading, 🧹cleaning, and 🔃preprocessing data. It includes features for 📈statistical analysis, 👨🏿‍🔬hypothesis testing, 🦾machine learning, clustering, ⏳time series forecasting, and 📊data visualization, all designed to enhance your analytical workflow.

analytics big-data command-line data-analysis data-cleaning data-driven data-mining data-pipeline data-preprocessing data-science data-scientist data-visualization data-wrangling exploratory-data-analysis machine-learning pandas predictive-analytics python statistics visualization-tools

Last synced: 01 May 2026

https://github.com/1luvc0d3/metabase-mcp

MCP server connecting Claude to Metabase for natural language data analysis, dashboard management, and SQL queries

anthropic claude data-analysis mcp metabase model-context-protocol natural-language sql

Last synced: 21 Apr 2026

https://github.com/saranshbansal/spam-detection-analytics-tool

This is a nice tool to read chunks of sms data from a csv and understand how different algorithms (pre-implemented) perform in identifying spam messages.

analytics data-analysis data-science data-visualization mysql spring-boot

Last synced: 01 May 2026

https://github.com/frostrain5015/investory

Multi-market portfolio tracker with quant analytics, backtesting & AI copilot.

data-analysis finance investment

Last synced: 13 Jun 2026

https://github.com/quantumudit/analyzing-quotes

This project focuses on scraping all the quotes and their related data from the "Quotes To Scrape" website; performing necessary transformations on the scraped data and then analyzing & visualizing it using Jupyter Notebook and Power BI.

data-analysis data-science data-transformation data-visualization etl jupyter-notebook power-bi python webscraping

Last synced: 01 May 2026

https://github.com/carocardenas0699/pi02-data-analysis

Proyecto Individual 2 de la carrera Data Science. Se realizó un análisis de homicidios en siniestros viales en la ciudad de Buenos Aires. Incluye: ETL, EDA, Dashboard interactivo con resultados

data-analysis data-science data-visualization eda etl powerbi python

Last synced: 15 Jun 2026

https://github.com/quantumudit/thereyougo-store-analysis

This project focuses on scraping all the products and their related info from the "There You Go" website; performing necessary transformations on the scraped data and then analyzing & visualizing it using Jupyter Notebook and Power BI.

data-analysis data-science data-transformation data-visualization etl jupyter-notebook power-bi python webscraping

Last synced: 21 Apr 2026

https://github.com/aliakbar-omidi/digikala-data-analysis

Analysis of the behavior of Digikala customers in shopping at different times

collection data-analysis matplotlib numpy pandas persiantools python seaborn

Last synced: 15 Apr 2026

https://github.com/mindlessmuse666/client-data-analysing-tool

Инструмент для анализа данных. Приложение позволяет пользователям загружать данные в формате CSV, фильтровать их, визуализировать ключевые показатели с помощью графиков и создавать отчеты.

data-analysis desktop-application matplotlib pandas pyqt6 pyqt6-desktop-application python sqlite student-project

Last synced: 26 Apr 2026

https://github.com/revogati/ecommerce_consumer_behaviour

This is a Full Data Analytics project From data cleaning, preparation, exploration, Interpretation of insights up to Presentation of findings and recommendations..

data-analysis data-exploration ecommerce jupyter-notebook python sql tableau-public visualization

Last synced: 16 Apr 2026

https://github.com/pratishtha-abrol/astronomy-dataanalysis

A key technique in Data Driven Astronomy

astronomy astropy crossmatch data-analysis

Last synced: 15 Jun 2026

https://github.com/asifdotexe/covidporfolioproject

This is a SQL + Tableau Project on real world Covid 19 Dataset from the start of recorded case to 2nd March 2022 i.e My birthday XD

dashboard data-analysis data-exploration data-visualization sql sql-server tableau

Last synced: 08 Jun 2026

https://github.com/andr3w03/bike-sharing-dashboard

Bike Sharing Data Analysis Streamlit Dashboard

dashboard data-analysis data-visualization python streamlit

Last synced: 01 May 2026

https://github.com/anil951/early-detection-of-mental-health

This project develops a predictive model to identify early signs of mental health issues in adolescents using social media activity, school performance, health records, and an AI chatbot. It analyzes emotional tone, academic changes, and health data, offering personalized recommendations and resources for mental wellness.

data-analysis deep-learning early-detection lstm mental-health sentiment-analysis social-media

Last synced: 28 Jan 2026

https://github.com/kirrrto/amazon-market-research-dashboard

Amazon product research dashboard for spreadsheet imports, supplier product pages, specification matrices, gap analysis, requirement drafts and supplier follow-up exports.

amazon-product-research data-analysis ecommerce market-research pandas product-development python specification-analysis streamlit

Last synced: 17 Jun 2026

https://github.com/chahiriabderrahmane/carpricepredictor

🚗 Cars Exploration & Price Prediction | Analyzing Cars.com Listings

data-analysis data-science data-visualization machine-learning python streamlit web-scraping

Last synced: 08 Feb 2026

https://github.com/alcestide/scianalytics

Playground for Data Analysis and Visualization for Research and Scientifical Purposes with Pandas and Plotly.

csv data-analysis data-science data-visualization pandas plotly python science-research statistics

Last synced: 30 Apr 2026

https://github.com/spaghettifunk/gvb

Analysis of GVB in Amsterdam

data-analysis public-transportation

Last synced: 28 Feb 2026