An open API service indexing awesome lists of open source software.

Data analysis

Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.

https://github.com/mohammadkarbalaee/python-for-data-analysis-book

All the practice and code that I am doing while I read the book called, Python for data analysis

data-analysis data-science python

Last synced: 27 Mar 2025

https://github.com/mggg/gerrytools

Tools for evaluating districting plans.

data-analysis tooling

Last synced: 10 Sep 2025

https://github.com/alhankeser/citibike-analysis

Extracting and Transforming Citi Bike Data for Analysis

citibike data-analysis data-science data-visualization etl sql

Last synced: 25 Jan 2026

https://github.com/mo-karbalaee/python-for-data-analysis-book

All the practice and code that I am doing while I read the book called, Python for data analysis

data-analysis data-science python

Last synced: 02 Aug 2025

https://github.com/mynenik/xyplot-32

Extensible Plotting and Data Analysis Program for 32-bit x86 GNU/Linux

cpp data-analysis data-manipulation data-visualization forth linux-app motif xwindows

Last synced: 02 Aug 2025

https://github.com/artdgn/pages

Auto-updating dashboards about COVID-19 https://artdgn.github.io/pages

covid-19 data-analysis modeling

Last synced: 17 Jan 2026

https://github.com/michaelnabil230/laravel-analytics

A Laravel package to retrieve pageviews and other data from Database

data-analysis data-structures database laravel php

Last synced: 25 Jan 2026

https://github.com/tideland/go-cells

Light-weight event-processing based on the idea of meshed cells with different pluggable behaviors

cep data-analysis data-stream event-processing events golang

Last synced: 05 Apr 2025

https://github.com/benjamindpb/wikidata-preprocessing

Wikidata dump preprocessing & analysis of georreferencial entities

data-analysis preprocessing wikidata wikidata-dump

Last synced: 15 Jul 2025

https://github.com/tanaylab/naryn

Native Access medical record Retriever for high Yield aNalytics

data-analysis medical-records

Last synced: 20 Jul 2025

https://github.com/omarelgabry/insights.py

A Python package for reading, storing, & analyzing data from Public Data APIs

data-analysis

Last synced: 14 Jul 2025

https://github.com/cosmoduende/r-holy-books-sentiment-data-analysis

What's the most positive or negative religion? . Sentiment and Data Analysis of Holy Books with R. Analysis of religious dogmas by exploring their Holy Books (The Bible, The Quran, The Dhammapada, and The Book of Mormon) with R

bible book-of-mormon data-analysis data-analytics data-visualisation data-visualization dataviz dhammapada holy-scriptures quran religions-studies religious religious-studies sentiment-analysis sentiment-polarity sentimental-analysis text-analysis text-analytics text-mining text-mining-analysis

Last synced: 07 Mar 2026

https://github.com/romac/adaproject

🔬 Project proposal for the Applied Data Analysis course at EPFL

data-analysis

Last synced: 31 Dec 2025

https://github.com/andreantonacci/eu2019

Social Network Analysis of Twitter Topic-Network Structures during the 2019 European Elections

data-analysis election-analysis european-elections gephi latex master-thesis social-network-analysis

Last synced: 17 Jul 2025

https://github.com/yashika-malhotra/strategic-analysis-of-retail-brand-in-south-america-using-sql

Leveraged Big Query and MySQL to analyze 100K records for sales optimization, trend identification, and enhancing customer satisfaction for a retail brand in South America and to provide insights and recommendations to improve their userbase and improve their services

bigquery data-analysis data-science database database-schema google-bigquery mysql-server sql

Last synced: 18 Apr 2026

https://github.com/farfarfun/fundata

数据处理工具包 - 提供数据清洗、转换和分析功能

data-analysis data-processing farfarfun numpy pandas python

Last synced: 17 Feb 2026

https://github.com/cafferychen777/microbiomestat-turtorial-professional-version

MicrobiomeStat Tutorial Repository: This is a comprehensive resource for learning how to use the MicrobiomeStat package. It provides a step-by-step guide to effectively analyze complex microbiome data.

16s 16s-rrna 16s-seq analysis data-analysis metagenomes metagenomic-analysis metagenomic-pipeline metagenomics microbiome microbiome-analysis microbiome-analysis-pipelines microbiome-data microbiome-workflow omics omics-data omics-data-analysis omics-data-integration visualization

Last synced: 03 Jan 2026

https://github.com/louis-heraut/aeag_toolbox

🛠️ R toolbox to provide a simple way of interacting with all the code necessary to carry out hydrological stationnarity analysis for the Agence de l'Eau Adour-Garonne (AEAG)

climate climate-change climate-data data-analysis data-visualization environment hydrology inrae low-water mann-kendall mann-kendall-tests r stationarity-test statistics

Last synced: 08 Apr 2026

https://github.com/quantumudit/analyzing-cleanaway-services

This project focuses on scraping all the service locations across Australia and their associated attributes from "Cleanaway" website; performing necessary transformations on the scraped data and then analyzing & visualizing it using Jupyter Notebook and Power BI.

data-analysis data-science data-transformation data-visualization etl jupyter-notebook power-bi python webscraping

Last synced: 25 Jan 2026

https://github.com/manikantasanjay/loan_repayment_regression_project

Prediction Of Loan Repayment using Sequential Neural Networks on Lending Club Dataset.

data-analysis data-visualization lending-club loan-repayment matplotlib numpy pandas-library seaborn tensorboard tensorflow

Last synced: 04 Apr 2025

https://github.com/priyanka7411/dataspark-electronics-retail-analytics

DataSpark is a data analysis project using Python, SQL, and Power BI to analyze global electronics retail sales, focusing on customer behavior, sales performance, product profitability, and store performance to optimize sales strategies.

analytics-providers business-intelligence customer-segmentation data data-analysis electronics-industry global-sales pandas powerbi powerbi-visuals product-profitability python retail-analytics sales-performance sql store-analysis visualization

Last synced: 10 Jul 2025

https://github.com/quantumudit/analyzing-suez-services

This project focuses on scraping all the service locations across Australia & New Zealand and their associated attributes from "Suez" website; performing necessary transformations on the scraped data and then analyzing & visualizing it using Jupyter Notebook and Power BI.

data-analysis data-science data-transformation data-visualization etl jupyter-notebook power-bi python webscraping

Last synced: 18 May 2026

https://github.com/cfneves/turma-visualizacao-de-dados

Repositório oficial da turma de Visualização de Dados · Lab365 / SENAI SC

data-analysis data-visualization jupyter lab365 matplotlib pandas plotly python seaborn senai

Last synced: 05 May 2026

https://github.com/chanmeng666/water-quality-testing-data-analysis

Statistical analysis and predictive modeling of water quality parameters using Python, pandas, scikit-learn, and statsmodels

data-analysis data-science data-visualization environmental-monitoring jupyter-notebook machine-learning pandas python scikit-learn seaborn statistics water-quality

Last synced: 18 Apr 2026

https://github.com/pottekkat/matplotlib-basics

A jupyter notebook that walks you through the basics of matplotlib for data science applications

data-analysis data-visualization machine-learning matplotlib pyplot python

Last synced: 09 May 2026

https://github.com/mutasim77/dbt-analytics

🍉 Repo for analytics engineering with dbt, transforming raw data into actionable insights.

big-query data data-analysis dbt warehouse

Last synced: 25 Feb 2026

https://github.com/kwokhing/exploratory-data-analysis-on-smrt-tweets

Demo on performing exploratory data analysis (EDA) on train service disruptions based on scrapped (user generated contents) tweets from the train operator's (SMRT) twitter account

data-analysis data-cleaning data-collection data-preparation exploratory-data-analysis exploratory-data-visualizations folium geospatial-data leaflet-map python python3 regex scraping selenium selenium-python social-media text-processing user-generated-content web-scraping webscraping

Last synced: 27 Jul 2025

https://github.com/realbou17/wusf1

A Python-based GUI tool for visualizing F1 telemetry with multiple data representations.

data-analysis f1 fastf1 formula1 formula1-data-analysis python student-project

Last synced: 18 Apr 2026

https://github.com/josmarcristello/goprotimeocr

Python-based OCR tool using EasyOCR and OpenCV for automated text extraction from images. Customizable image preprocessing steps and options for GPU acceleration make this a versatile and efficient solution for various OCR tasks

computer-vision data-analysis easyocr gopro gopro-camera ocr opencv pytesseract python

Last synced: 09 May 2026

https://github.com/surajv311/data_analysis-food_recipes_ds

Data preprocessing, cleaning, <Analysis> & plotting 📊 of Food Recipies Dataset (from Kaggle). 🐍 Libraries used: Pandas, Matplotlib, Seaborn, Plotly.📈

data-analysis kaggle-dataset matplotlib numpy pandas plotly seaborn

Last synced: 03 May 2026

https://github.com/lussierc/foodborneillnessdataanalysis

A data analysis of foodborne illnesses using R Scripting methods.

data-analysis database foodborne-disease-outbreaks foodborne-illnesses rstudio

Last synced: 03 May 2026

https://github.com/iremaydas/base-r-skill

Provides base R programming guidance covering data structures, data wrangling, statistical modeling, visualization, and I/O

claude-skills data-analysis r skills

Last synced: 10 Jun 2026

https://github.com/brews/riverpca

Companion repository to Malevich S.B., and C.A. Woodhouse (2017), Pacific SSTs, mid-latitude atmospheric circulation, and widespread interannual anomalies in Western US streamflow, Geophys. Res. Lett., 44, doi:10.1002/2017GL073536.

analysis data-analysis paper pca python river streamflow visualization

Last synced: 28 Apr 2026

https://github.com/felipeheide/technicalcryptobot

Python script to analyze and trade Bitcoin (BTC) based on technical indicators like RSI, MACD, MMS, and support/resistance levels. Fetches prices from CryptoCompare and CoinGecko APIs. Allows investing a specified amount and displays potential profit/loss. Runs continuously, updating every minute.

api bitcoin coingecko cryptocompare cryptocurrency data-analysis python trading-bot

Last synced: 10 May 2026

https://github.com/gallillio/webscraping-datacleaning-imdb_videogame_webscrapper

IMDb Web Scrapper using Python. The scraped data is automatically data cleaned to address missing or wrong data, it dynamically searches Steam and Google to fill in the gaps or correct the data, This tool streamlines the process of aggregating video game data, facilitating analysis and insights for enthusiasts, researchers, and developers alike.

automation data-analysis data-cleaning data-wrangling jupyter-notebook python web-scraping webscraping

Last synced: 30 Apr 2026

https://github.com/walidalsafadi/house-prices

Ask a home buyer to describe their dream house, and they probably won't begin with the height of the basement ceiling or the proximity to an east-west railroad. With 79 explanatory variables describing (almost) every aspect of residential homes in Ames, Iowa, this competition challenges you to predict the final price of each home.

cross-validation data-analysis data-science data-visualization decision-trees eda house-price-prediction house-prices jupyter linear-regression machine-learning machine-learning-algorithms mlp-regressor plot python random-forest-regression regression svr xgboost-regression

Last synced: 11 May 2026

https://github.com/martinboller/cc-build

Builds latest version of CyberChef and install it with NGINX on another system. CyberChef is a simple, intuitive web app for analyzing and decoding data without having to deal with complex tools or programming languages.

analysis blueteam compression cyberchef data-analysis data-manipulation decode encode encryption hashing parsing virtual-machine

Last synced: 15 Feb 2026

https://github.com/arfc/pride

(P)lan for (R)ap(I)d (DE)carbonization

data-analysis quantitative-recommendations reactor

Last synced: 27 Mar 2026

https://github.com/y1-studio/statbooru

Statbooru is an analytics tool built on Danbooru metadata. It helps artists and researchers discover stable, recognizable characters to draw, identify subjects that may attract attention, compare character popularity, analyze popular formats and tag patterns, and track how popularity changes over time.

danbooru data-analysis dataset python

Last synced: 17 May 2026

https://github.com/dataket/dataket

Propuesta de proyecto para el Datatón Anticorrupción 2021. Equipo Dataket

corruption data-analysis dataviz open-government

Last synced: 02 Apr 2026

https://github.com/vandita2020/merra2_nasa_wind_speed_analysis

In this study, we aim to explore the vulnerability of power grids in the south-east region of the USA with the help of data analysis tools and machine learning algorithms

data-analysis data-science machine-learning-algorithms python

Last synced: 09 Jun 2026

https://github.com/lebrancconvas/personal-saved-datasets

Saved Datasets for doing stuff in the area of Statistics & Data Science.

data-analysis data-science database datasets excel kaggle kaggle-dataset microsoft-excel sample-dataset

Last synced: 12 May 2026

https://github.com/grand-27-master/data-science-course

One-stop repo for learning data science along with roadmap!

data-analysis data-science machine-learning python statistics

Last synced: 12 May 2026

https://github.com/reiniiriarios/squirrel-table

Desktop application to run MySQL queries over SSH and generate CSV and XLSX files. Useful for QA where queries need to be run repeatedly and files handed off.

csv csv-files data-analysis desktop-application electron excel mariadb mysql nodejs qa quality-assurance quality-control quality-control-assurance sql xlsx

Last synced: 15 Apr 2026

https://github.com/virajbhutada/coursera-google-data-analytics-capstone

A repository containing the Capstone project for the Google Data Analytics Professional Certificate, focusing on analyzing FitBit fitness tracker usage data to derive insights relevant to Bellabeat, a wellness technology company.

coursera data-analysis data-visualization fitbit fitness-tracker google google-data-analytics-capstone-project python-programming

Last synced: 10 Jun 2026

https://github.com/vidhi1290/robust-yield-prediction-

"Predicting a Greener Future 🌾📊 Delve into the world of agriculture and data science with our Yield Prediction project. We harness machine learning and weather data to forecast crop yields accurately. Join us in cultivating smarter farming practices for a sustainable tomorrow."

artificial-intelligence data-analysis data-cleaning-and-preprocessing data-science data-visualization dataexploration devops docker machine-learning machine-learning-algorithms matplotlib matplotlib-pyplot pandas python scikit-learn scikitlearn-machine-learning streamlit yield-prediction-for-food-processing

Last synced: 15 Apr 2026

https://github.com/qathom/crawlx

crawlx allows to analyze product data on Amazon. It is a simple and lightweight tool to act on product issues.

data-analysis electron es6-javascript vue vuejs2 vuex2 webpack

Last synced: 14 Feb 2026

https://github.com/totonga/odsbox-jaquel-mcp

MCP server to work with odsbox to access ASAM ODS HTTP API.

ai-agents asam data-analysis data-science mcp ods

Last synced: 11 Apr 2026

https://github.com/rickyxume/data_statistic_analysis

2021数据统计与分析大赛全国一等奖方案

data-analysis data-mining data-science

Last synced: 15 Jun 2026

https://github.com/virajbhutada/power-bi-resources

Comprehensive Power BI resources covering interview questions, group training materials, project portfolio ideas, and a cheat sheet for quick reference. Elevate your Power BI skills with curated content designed to enhance your proficiency and boost project success.

data-analysis data-modeling data-visualization dax-expression interview-preparation portfolio-projects powerbi quick-reference visualization-techniques

Last synced: 03 Mar 2026

https://github.com/iondv/metrics

IONDV. Framework application: Metrics is to collect and show the metrics data.

collecting data data-analysis iondv iondv-app metrics

Last synced: 10 Feb 2026

https://github.com/monteirooscar98/salario-minimo-brasil

Extração de dados através de API do Banco Central, WebScraping no site do Dieese e Análise dos dados.

api brasil data-analysis opendata python webscraping

Last synced: 15 Jun 2026

https://github.com/avrtt/sentiment-analysis

Sentiment analysis of reviews with a simple web interface in Flask

data-analysis data-mining data-science flask machine-learning nlp parsing text-mining

Last synced: 01 May 2026

https://github.com/joamag/pandas

Loads of pandas data from China with awesome data

data data-analysis jupyter notebook pandas

Last synced: 25 Apr 2026

https://github.com/fx2y/datanarrate

[WIP] LLM-powered agent for adaptive data analysis across multiple sources. Uses natural language for complex queries, visualizations, and insights. Features autonomous planning, SQL/Elasticsearch generation, and AI storytelling. Built with LangChain, GPT-4, FastAPI, and React.

ai data-analysis data-visualization elasticsearch fastapi gpt-4 langchain machine-learning nlp react sql

Last synced: 31 Jan 2026

https://github.com/ndleah/self-quantified

😊 EDA, self-quantified data analysis

data-analysis personal-data-analysis r self-quantify

Last synced: 08 Jun 2026

https://github.com/pnnl/archive_walker

Archive Walker Software to read and examine PMU data to detect events and conditions for further analysis.

data-analysis pmu synchrophasor

Last synced: 09 May 2026

https://github.com/michenriksen/inspectra

A simple web app for data inspection.

data-analysis decoding web-tool

Last synced: 08 May 2026

https://github.com/imranr98/brokeetl

Parse transactions from bank statement PDFs into a JSON array.

automation bank-statement banking data-analysis data-mining data-ownership etl finance json lifestyle pdf pdf-converter tracking

Last synced: 27 Apr 2026

https://github.com/josechirif/reviews-and-satisfaction-analysis-of-airbnb-brazil-and-mexico-from-june-2010-to-february-2021

This project analyzes the reviews and satisfaction of customers who used AirBnB services. It also studies if there is a relationship between another variables.

data data-analysis data-visualization powerbi sql-server

Last synced: 25 Feb 2026

https://github.com/willie-conway/meta-data-analyst-portfolio

A comprehensive 📚portfolio showcasing projects and skills developed during the Meta Data Analyst Professional Certificate 🎓course, featuring 📈data analysis, 📊visualization, and 👨🏿‍💻management using various ⚙️tools.

big-data business-intelligence data-analysis data-cleaning data-driven-decisions data-management data-mining data-visualization exploratory-data-analysis jupyter-notebook machine-learning pandas porfolio predictive-modeling python spreadsheet-analysis sql statistics tableau visualization-tools

Last synced: 11 Apr 2026

https://github.com/dcs-training/summerschool2024-stream2

Welcome to the repository for all the materials for Stream 2 of the CDCS 2024 summer school. Go to the readme file

data-analysis data-visualisation data-wrangling geographical-data r sentiment-analysis statistics text-analysis web-scraping

Last synced: 25 Apr 2025

https://github.com/linuxto5re/gateiodatafilteration

Gate.io API analysis: spot, margin, futures; filters high volume, identifies common cryptos.

arbitrage-bot centralized-exchanges cryptocurrency data-analysis gate gateio-api python rest trading-strategies

Last synced: 23 Oct 2025

https://github.com/prathamesh693/04_sales-and-demand-forecasting-for-retail-chains

This project aims to forecast weekly sales for retail stores using historical sales and economic data. By applying advanced time series forecasting models, we enable better inventory management, demand planning, and revenue optimization for retail chains. The project includes both traditional statistical models and deep learning techniques.

arima-forecasting data-analysis data-visualization exploratory-data-analysis jupyter-notebook prophet-facebook python spyder-python-ide streamlit-webapp

Last synced: 01 Jul 2025

https://github.com/startreedata/metabase-pinot-driver

Apache Pinot driver for the Metabase business intelligence front-end

apache data-analysis data-visualization metabase pinot

Last synced: 10 Mar 2026

https://github.com/3bdalrhmans3d/dataqualityproject

An interactive web application for data quality analysis, machine learning, and conversational AI, built with Streamlit.

data-analysis data-visualization ml numpy ollama pandas python seaborn streamlit

Last synced: 23 Feb 2026

https://github.com/bishopce16/stroke_prediction_analysis

The purpose of this project is to derive insight on characteristics and statistics regarding the dataset to see which factors influence whether or not a patient has had a stroke.

data-analysis data-visualization machine-learning nump pandas plpgsql python seaborn tableau

Last synced: 26 Apr 2025

https://github.com/tushar2704/instagram-user-analytics

This project revolves around the exploration and analysis of user engagement patterns on the popular social media platform, Instagram. By delving into user data and interaction metrics, this project aims to provide valuable insights into user behavior, content performance, and trends.

artificial-intelligence data-analysis data-science instagram project tushar2704

Last synced: 23 Jan 2026

https://github.com/pmsipilot/intercom2dw

Intercom2DW is an attempt at loading all the data available in an Intercom application.

data-analysis docker intercom

Last synced: 11 Apr 2026

https://github.com/arnavk-09/world-population

🌏 Taipy demo to explore world population data...

data-analysis python taipy world-population

Last synced: 31 Mar 2025

https://github.com/nhsdigital/sde_example_analysis

Example of what you can do in Databricks in the Secure Data Environment (SDE) using Python, SQL, and R.

data-analysis data-science databricks-notebooks machine-learning mlflow

Last synced: 25 Oct 2025

https://github.com/petulla/readroper

Read single and multi-card ASCII polling datasets in R

ascii data-analysis polling-data r

Last synced: 31 Mar 2025

https://github.com/roaldarbol/anibehavr

🪲 An R package for Analysis of Animal Behaviour

animal-behavior behavioural-states data-analysis r

Last synced: 11 Oct 2025

https://github.com/tsffarias/ibge-nomes-brasil

Análise de dados dos nomes no Brasil utilizando a base de dados disponibilizada pelo IBGE (2010)

data-analysis ibge python

Last synced: 05 Jul 2025

https://github.com/tirendazacademy/data-sets

Data sets for Tirendaz Akademi Youtube

data-analysis dataset

Last synced: 12 Sep 2025

https://github.com/wtbates99/stock-indicators

A comprehensive self-hosted stock analysis platform combining FastAPI and React. Features interactive stock visualizations, real-time data aggregation, and customizable technical indicators with a modern, grid-based interface.

backend data-analysis data-mining data-science fastapi financial-analysis frontend investment python reactjs sqlite3 statstistics stock-market stock-price-prediction time-series

Last synced: 15 Oct 2025

https://github.com/garrettj403/albertaenergysources

Get grid data from Alberta Electric System Operator (AESO)

alberta canada data-analysis energy-data

Last synced: 10 Oct 2025

https://github.com/erictleung/microbiome-analysis-resources

:microscope: Resources and notes on studying the human microbiome

bioinformatics data-analysis gut-microbiome microbiology microbiome notes resources

Last synced: 18 Jan 2026

https://github.com/enzon3/tws-dataset_gen

Program that slowly generates a dataset of news titles and the sentiment of the titles.

data-analysis data-science dataset

Last synced: 14 Apr 2026

https://github.com/silveirinhajuan/rotinapy

RotinaPy: Simplify your daily life and maximize productivity with an integrated app for task management, study tracking, flashcards, and more. Built with Streamlit and Python.

data-analysis flashcards llm-integration llm-ui machine-learning ollama productivity python streamlit study study-project study-tracker task-management task-manager

Last synced: 13 Feb 2026