An open API service indexing awesome lists of open source software.

Data analysis

Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.

https://github.com/tideland/go-cells

Light-weight event-processing based on the idea of meshed cells with different pluggable behaviors

cep data-analysis data-stream event-processing events golang

Last synced: 05 Apr 2025

https://github.com/louis-heraut/aeag_toolbox

🛠️ R toolbox to provide a simple way of interacting with all the code necessary to carry out hydrological stationnarity analysis for the Agence de l'Eau Adour-Garonne (AEAG)

climate climate-change climate-data data-analysis data-visualization environment hydrology inrae low-water mann-kendall mann-kendall-tests r stationarity-test statistics

Last synced: 08 Apr 2026

https://github.com/bdslab-upv/dashi

A flexible and powerful Python toolkit for dataset shift analysis and characterization, providing supervised and unsupervised evaluation of temporal and multi-source data shifts, visualization tools, and statistical insights for data integrity and model performance monitoring

data-analysis data-science dataset-shift python temporal-analysis

Last synced: 13 Dec 2025

https://github.com/ouhscbbmc/data-science-practices-1

Collection of publicly available practices of data science and analysis

bbmc-investigation data-analysis data-science python r sql

Last synced: 27 Jul 2025

https://github.com/nragland37/event-optimization-tool

R-based Shiny application that maps availability and identifies optimal engagement times to enhance participation within an organization

data-analysis data-cleaning data-preparation heatmap r shiny shiny-app tidyverse

Last synced: 02 Feb 2026

https://github.com/lucs1590/strava-analysis

🏃📊 Using strava to do personal analyses and to practice data scientist skills.

data-analysis data-science github jupyter-notebook python3 strava strava-api

Last synced: 08 Mar 2026

https://github.com/michaelnabil230/laravel-analytics

A Laravel package to retrieve pageviews and other data from Database

data-analysis data-structures database laravel php

Last synced: 25 Jan 2026

https://github.com/jpvt/data_science

Portfolio with my Data Science Projects.

data-analysis data-science deep-learning machine-learning portfolio xgboost

Last synced: 19 Jun 2025

https://github.com/romac/adaproject

🔬 Project proposal for the Applied Data Analysis course at EPFL

data-analysis

Last synced: 31 Dec 2025

https://github.com/mynenik/xyplot-32

Extensible Plotting and Data Analysis Program for 32-bit x86 GNU/Linux

cpp data-analysis data-manipulation data-visualization forth linux-app motif xwindows

Last synced: 02 Aug 2025

https://github.com/antrubtor/socialstats

Analyze your personal data from Discord, Instagram, Snapchat, and WhatsApp. View your call durations, response times, voice messages, and messaging activity in clean Excel charts.

call-duration chat-export data-analysis data-visualization discord excel instagram message-frequency message-history snapchat social-networks-statistics whatsapp

Last synced: 28 Jul 2025

https://github.com/DataHerb/dataherb-python

Python Package for DataHerb: create, search, and load datasets.

data data-analysis data-mining database dataset python

Last synced: 08 May 2025

https://github.com/tanaylab/naryn

Native Access medical record Retriever for high Yield aNalytics

data-analysis medical-records

Last synced: 20 Jul 2025

https://github.com/rikulauttia/ai-commercial-decisionmaking

AI-Driven Large Dataset Analysis & Commercial Decision-Making: Research on predictive analytics, machine learning strategies, and real-world business applications [Python, TensorFlow, PyTorch] 🤖📊

artificial-intelligence big-data business-intelligence business-strategy commercial-decision-making data-analysis data-science decision-making deep-learning machine-learning neural-networks predictive-analytics python research thesis

Last synced: 03 Mar 2026

https://github.com/farfarfun/fundata

数据处理工具包 - 提供数据清洗、转换和分析功能

data-analysis data-processing farfarfun numpy pandas python

Last synced: 17 Feb 2026

https://github.com/mo-karbalaee/python-for-data-analysis-book

All the practice and code that I am doing while I read the book called, Python for data analysis

data-analysis data-science python

Last synced: 02 Aug 2025

https://github.com/victoku1/social-network-analyzer

Social Network Analyzer is a Flask-powered app that combines AI-driven insights with social media links (e.g., Twitter, Instagram). Validate profiles, add user context, and receive intelligent personality summaries via OpenAI’s cutting-edge models. Perfect for exploring the power of AI in an interactive way!

ai api-integration backend data-analysis flask frontend information-extraction oauth openai psycology python social-media social-network social-network-analysis web-scraping

Last synced: 07 May 2026

https://github.com/gabriel-dp/mineirando_github

Project to mine and analyze public GitHub data. Practical work of the Social Network Mining and Analysis subject at UFSJ

data-analysis data-mining github ufsj

Last synced: 07 Mar 2026

https://github.com/super-lou/card

🎴 Card of Analyse and Diagnostic in R for a user-friendly experience of data aggregation with EXstat

aggregation climate-change climate-data climate-science data-analysis data-science diagnostic environment environment-variables hydrology hydrology-statistical inrae r statistics tools user-friendly

Last synced: 13 Apr 2025

https://github.com/louis-heraut/AEAG_toolbox

🛠️ R toolbox to provide a simple way of interacting with all the code necessary to carry out hydrological stationnarity analysis for the Agence de l'Eau Adour-Garonne (AEAG)

climate climate-change climate-data data-analysis data-visualization environment hydrology inrae low-water mann-kendall mann-kendall-tests r stationarity-test statistics

Last synced: 30 Oct 2025

https://github.com/msyriac/alhazen

Tools for gravitational lensing: now deprecated by pixell+orphics+symlens. Leaving it here in case anyone needs to reproduce old stuff.

analysis cmb data-analysis gravitational-lensing lensing

Last synced: 16 Mar 2025

https://github.com/cjunwon/youtube-data-analysis

End-to-end Youtube data analysis project using Youtube Data API, MySQL, AWS, Flask

aws-rds data-analysis datapipeline flask nlp pandas python shell sql vader-sentiment-analysis youtube youtube-api

Last synced: 17 Feb 2026

https://github.com/benjamindpb/wikidata-preprocessing

Wikidata dump preprocessing & analysis of georreferencial entities

data-analysis preprocessing wikidata wikidata-dump

Last synced: 15 Jul 2025

https://github.com/mohammadkarbalaee/python-for-data-analysis-book

All the practice and code that I am doing while I read the book called, Python for data analysis

data-analysis data-science python

Last synced: 27 Mar 2025

https://github.com/cosmoduende/r-holy-books-sentiment-data-analysis

What's the most positive or negative religion? . Sentiment and Data Analysis of Holy Books with R. Analysis of religious dogmas by exploring their Holy Books (The Bible, The Quran, The Dhammapada, and The Book of Mormon) with R

bible book-of-mormon data-analysis data-analytics data-visualisation data-visualization dataviz dhammapada holy-scriptures quran religions-studies religious religious-studies sentiment-analysis sentiment-polarity sentimental-analysis text-analysis text-analytics text-mining text-mining-analysis

Last synced: 07 Mar 2026

https://github.com/thecoderpinar/credit-card-fraud-detection-project

This project focuses on the detection of credit card fraud using various data science and machine learning techniques. The dataset includes a record of credit card transactions over a specific period, with the goal of accurately identifying fraudulent activities. 🚀✨

anamoly-detection classification-algorithms credit-card-transactions data-analysis data-preprocessing data-science data-visualization fraud-detection machine-learning python

Last synced: 30 Apr 2025

https://github.com/alhankeser/citibike-analysis

Extracting and Transforming Citi Bike Data for Analysis

citibike data-analysis data-science data-visualization etl sql

Last synced: 25 Jan 2026

https://github.com/artdgn/pages

Auto-updating dashboards about COVID-19 https://artdgn.github.io/pages

covid-19 data-analysis modeling

Last synced: 17 Jan 2026

https://github.com/bradleyboehmke/uc-bana-6043

Additional resources for the UC BANA 6043 Statistical Computing course

data-analysis data-science data-visualization python

Last synced: 10 Jul 2025

https://github.com/omarelgabry/insights.py

A Python package for reading, storing, & analyzing data from Public Data APIs

data-analysis

Last synced: 14 Jul 2025

https://github.com/kylejgillett/stevepy

A Space Weather data analysis tool for Python.

astronomy aurora data-analysis physics python space-weather space-weather-research

Last synced: 22 Mar 2025

https://github.com/rickyxume/data_statistic_analysis

2021数据统计与分析大赛全国一等奖方案

data-analysis data-mining data-science

Last synced: 23 Feb 2025

https://github.com/andreantonacci/eu2019

Social Network Analysis of Twitter Topic-Network Structures during the 2019 European Elections

data-analysis election-analysis european-elections gephi latex master-thesis social-network-analysis

Last synced: 17 Jul 2025

https://github.com/zackakil/hot-shot-basketball-tracker

Mini web app for displaying basketball practice metrics.

basketball chartjs data-analysis html sport visualization

Last synced: 28 Jan 2026

https://github.com/cafferychen777/microbiomestat-turtorial-professional-version

MicrobiomeStat Tutorial Repository: This is a comprehensive resource for learning how to use the MicrobiomeStat package. It provides a step-by-step guide to effectively analyze complex microbiome data.

16s 16s-rrna 16s-seq analysis data-analysis metagenomes metagenomic-analysis metagenomic-pipeline metagenomics microbiome microbiome-analysis microbiome-analysis-pipelines microbiome-data microbiome-workflow omics omics-data omics-data-analysis omics-data-integration visualization

Last synced: 03 Jan 2026

https://github.com/forhadulislam/sna-project

A project for Social Network Analysis. Analyzed yahoo query logs

data-analysis data-mining social-network

Last synced: 13 May 2026

https://github.com/nikolas-virionis/polynomial-regression

Python package that analyses the given datasets and comes up with the best regression representation with either the smallest polynomial degree possible, to be the most reliable without overfitting or other models such as exponentials and logarithms

data-analysis exponential-regression flexibility logarithmic-regression logistic-regression polynomial-regression python sinusoisdal-regression statistics

Last synced: 06 Apr 2026

https://github.com/AnonCatalyst/WebHound

WebHound is your Python-powered command-line assistant for sharp and efficient web searches! It sniffs out data from major search engines and detects social platforms, helping you uncover valuable insights and stay ahead of the game. 🌐🔍✨

awareness data-analysis data-visualization information-retrieval osint osint-python osint-reconnaissance osint-resources osint-tool osint-tools osinttools web-osint webscraping

Last synced: 29 Jul 2025

https://github.com/magnaopus1/synthron-cfd-trader-pro

SYNTHRON CFD Trader PRO is a cutting-edge trading platform featuring raw, custom-designed machine learning models. From reinforcement learning for dynamic strategies to predictive analytics, sentiment analysis, and optimization techniques, it empowers trading across stocks, forex, indices, commodities, futures, and crypto with precision.

ai backtesting cfd commodities data-analysis data-science data-structures forex futures indices machine-learning trading

Last synced: 30 Apr 2025

https://github.com/storopoli/r_scripts

Couple of handy R Scripts that I use in a daily basis for Scientific Research

data-analysis data-science data-visualization r scientific

Last synced: 08 Jul 2025

https://github.com/mirdan08/crafty

Data analysis project i've developed for the web scraping course.

blockchain data-analysis webscraping

Last synced: 30 Jul 2025

https://github.com/yashika-malhotra/strategic-analysis-of-retail-brand-in-south-america-using-sql

Leveraged Big Query and MySQL to analyze 100K records for sales optimization, trend identification, and enhancing customer satisfaction for a retail brand in South America and to provide insights and recommendations to improve their userbase and improve their services

bigquery data-analysis data-science database database-schema google-bigquery mysql-server sql

Last synced: 18 Apr 2026

https://github.com/baci-ak/b-vista

Interactive EDA tool to explore pandas DataFrames — via Python, notebooks & Docker

analytics data-analysis data-science data-visualization dataframe docker eda flask flexible ipython jupyter notebook pandas python react visualization

Last synced: 22 Jul 2025

https://github.com/valeriopagliarino/tcf-2021-unito-public

Exam project of the course "Computing Tecniques for Physics" - Università degli Studi di Torino - Physics department - 2021

cern-root data-analysis geant4-simulation monte-carlo-simulation object-oriented-programming physics

Last synced: 27 Mar 2025

https://github.com/amrrs/introduction-to-eda-with-python

Introduction to EDA with Python Session Files

data-analysis eda python

Last synced: 12 Sep 2025

https://github.com/quantumudit/analyzing-suez-services

This project focuses on scraping all the service locations across Australia & New Zealand and their associated attributes from "Suez" website; performing necessary transformations on the scraped data and then analyzing & visualizing it using Jupyter Notebook and Power BI.

data-analysis data-science data-transformation data-visualization etl jupyter-notebook power-bi python webscraping

Last synced: 18 May 2026

https://github.com/ac-gomes/data-engineering-with-databricks

A simple boilerplate for data engineering and data analysis training in Databricks.

data-analysis data-engineering databricks databricks-notebooks pyspark python unit-testing

Last synced: 30 Apr 2025

https://github.com/jaybird1291/anki-llm-review-stats-exporter

Export your Anki review history (revlog) as JSONL so you can analyze it with an LLM (ChatGPT, Claude, local models, etc.) without using any API.

anki anki-addon chatgpt data-analysis data-export jsonl llm review-stats revlog statistics

Last synced: 24 Dec 2025

https://github.com/abhiksark/udacity-dataanalyst-nanodegree

All my codes that were submitted during Udacity Nanodegree - Data Analyst Course.

data-analysis data-visualization matplotlib pandas python seaborn statistics udacity udacity-nanodegree

Last synced: 07 Apr 2026

https://github.com/quantumudit/analyzing-cleanaway-services

This project focuses on scraping all the service locations across Australia and their associated attributes from "Cleanaway" website; performing necessary transformations on the scraped data and then analyzing & visualizing it using Jupyter Notebook and Power BI.

data-analysis data-science data-transformation data-visualization etl jupyter-notebook power-bi python webscraping

Last synced: 25 Jan 2026

https://github.com/lafayettegabe/nlp-resume-extraction

📝 NER (Named Entity Recognition) project aimed at solving the problem of manually shortlisting resumes by automating the process. This project proposes using NLP techniques and NER model to classify and extract relevant entities from resumes such as person name, college name, academics information, relevant experiences, skill set, etc.

big-data data data-analysis data-science eda ner nlp resume-extractor

Last synced: 03 Apr 2025

https://github.com/josmarcristello/geokmlanalyzer

The Geo Kml Analyzer is a Python-based tool designed to process and analyze elevation data from KML files. Uses google maps to obtain elevation data (interpolates if necessary) and spherical trigonometry equations to calculate the distance.

data-analysis geolocation geospatial gis google-maps-api gps kml python

Last synced: 30 Jul 2025

https://github.com/gher-uliege/seadatacloud

Tools and interfaces to work with DIVA interpolation software tool.

data-analysis data-visualization interpolation nco netcdf ocean-sciences oceanography

Last synced: 30 Mar 2025

https://github.com/rafaelbroseghini/data-analysis-visualizations-ml

:bar_chart: NLP, Regression Models, Story telling, Time Series Analysis ... in Python

data-analysis data-science machine-learning machine-learning-algorithms time-series visualization

Last synced: 03 Oct 2025

https://github.com/varunbanka/data-insights

Data Insights is a user-friendly tool for analyzing large CSV files. Its advanced analytics helps uncover hidden patterns and trends, making it perfect for data scientists and analysts.

artificial-intelligence automation data-analysis data-science dataanalysis datahive numpy pandas python

Last synced: 22 Jun 2025

https://github.com/crafterkolyan/applied-statistical-data-analysis

Курс прикладного статистического анализа данных. ВМК МГУ. Весна 2020

autoexec-scripts autotest data-analysis github-actions statistics university

Last synced: 17 Jun 2025

https://github.com/manikantasanjay/loan_repayment_regression_project

Prediction Of Loan Repayment using Sequential Neural Networks on Lending Club Dataset.

data-analysis data-visualization lending-club loan-repayment matplotlib numpy pandas-library seaborn tensorboard tensorflow

Last synced: 04 Apr 2025

https://github.com/elkronos/anovatoolbox

This GitHub repository contains a collection of functions for performing various statistical analyses and generating visualizations. The functions are designed to work with different types of data and provide comprehensive outputs for data analysis.

anova anova-model data-analysis r statistics

Last synced: 17 Mar 2025

https://github.com/priyanka7411/dataspark-electronics-retail-analytics

DataSpark is a data analysis project using Python, SQL, and Power BI to analyze global electronics retail sales, focusing on customer behavior, sales performance, product profitability, and store performance to optimize sales strategies.

analytics-providers business-intelligence customer-segmentation data data-analysis electronics-industry global-sales pandas powerbi powerbi-visuals product-profitability python retail-analytics sales-performance sql store-analysis visualization

Last synced: 10 Jul 2025

https://github.com/danielpuentee/outdpik

The fundamental toolkit for outliers search and visualization. It aims to be the fundamental high-level package for this purpose.

data-analysis matplotlib numpy python

Last synced: 16 Aug 2025

https://github.com/juangesino/behaviouraleconomics

All the files and data for the experiment performed during the course Behavioural Economics @ University of Amsterdam

behavioral-economics behavioural-economics data-analysis economics game-theory statistics

Last synced: 28 Oct 2025

https://github.com/mggg/gerrytools

Tools for evaluating districting plans.

data-analysis tooling

Last synced: 10 Sep 2025

https://github.com/al-ghaly/airline-company-data-warehouse

Data Warehouse modeling, design, implementation, and analysis for an Airline Company.

data-analysis data-warehousing database-modeling sql-server

Last synced: 14 Apr 2025

https://github.com/vandita2020/merra2_nasa_wind_speed_analysis

In this study, we aim to explore the vulnerability of power grids in the south-east region of the USA with the help of data analysis tools and machine learning algorithms

data-analysis data-science machine-learning-algorithms python

Last synced: 09 Jun 2026

https://github.com/dataket/dataket

Propuesta de proyecto para el Datatón Anticorrupción 2021. Equipo Dataket

corruption data-analysis dataviz open-government

Last synced: 02 Apr 2026

https://github.com/ndleah/self-quantified

😊 EDA, self-quantified data analysis

data-analysis personal-data-analysis r self-quantify

Last synced: 08 Jun 2026

https://github.com/kwokhing/exploratory-data-analysis-on-smrt-tweets

Demo on performing exploratory data analysis (EDA) on train service disruptions based on scrapped (user generated contents) tweets from the train operator's (SMRT) twitter account

data-analysis data-cleaning data-collection data-preparation exploratory-data-analysis exploratory-data-visualizations folium geospatial-data leaflet-map python python3 regex scraping selenium selenium-python social-media text-processing user-generated-content web-scraping webscraping

Last synced: 27 Jul 2025

https://github.com/josechirif/reviews-and-satisfaction-analysis-of-airbnb-brazil-and-mexico-from-june-2010-to-february-2021

This project analyzes the reviews and satisfaction of customers who used AirBnB services. It also studies if there is a relationship between another variables.

data data-analysis data-visualization powerbi sql-server

Last synced: 25 Feb 2026

https://github.com/martinboller/cc-build

Builds latest version of CyberChef and install it with NGINX on another system. CyberChef is a simple, intuitive web app for analyzing and decoding data without having to deal with complex tools or programming languages.

analysis blueteam compression cyberchef data-analysis data-manipulation decode encode encryption hashing parsing virtual-machine

Last synced: 15 Feb 2026

https://github.com/felipeheide/technicalcryptobot

Python script to analyze and trade Bitcoin (BTC) based on technical indicators like RSI, MACD, MMS, and support/resistance levels. Fetches prices from CryptoCompare and CoinGecko APIs. Allows investing a specified amount and displays potential profit/loss. Runs continuously, updating every minute.

api bitcoin coingecko cryptocompare cryptocurrency data-analysis python trading-bot

Last synced: 10 May 2026

https://github.com/pottekkat/matplotlib-basics

A jupyter notebook that walks you through the basics of matplotlib for data science applications

data-analysis data-visualization machine-learning matplotlib pyplot python

Last synced: 09 May 2026

https://github.com/vidhi1290/robust-yield-prediction-

"Predicting a Greener Future 🌾📊 Delve into the world of agriculture and data science with our Yield Prediction project. We harness machine learning and weather data to forecast crop yields accurately. Join us in cultivating smarter farming practices for a sustainable tomorrow."

artificial-intelligence data-analysis data-cleaning-and-preprocessing data-science data-visualization dataexploration devops docker machine-learning machine-learning-algorithms matplotlib matplotlib-pyplot pandas python scikit-learn scikitlearn-machine-learning streamlit yield-prediction-for-food-processing

Last synced: 15 Apr 2026

https://github.com/walidalsafadi/house-prices

Ask a home buyer to describe their dream house, and they probably won't begin with the height of the basement ceiling or the proximity to an east-west railroad. With 79 explanatory variables describing (almost) every aspect of residential homes in Ames, Iowa, this competition challenges you to predict the final price of each home.

cross-validation data-analysis data-science data-visualization decision-trees eda house-price-prediction house-prices jupyter linear-regression machine-learning machine-learning-algorithms mlp-regressor plot python random-forest-regression regression svr xgboost-regression

Last synced: 11 May 2026