An open API service indexing awesome lists of open source software.

Data analysis

Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.

https://github.com/mo-karbalaee/python-for-data-analysis-book

All the practice exercises and code I write while reading the book Python for Data Analysis

data-analysis data-science python

Last synced: 02 Aug 2025

https://github.com/rickyxume/data_statistic_analysis

National first-prize solution for the 2021 Data Statistics and Analysis Competition

data-analysis data-mining data-science

Last synced: 23 Feb 2025

https://github.com/bradleyboehmke/uc-bana-6043

Additional resources for the UC BANA 6043 Statistical Computing course

data-analysis data-science data-visualization python

Last synced: 10 Jul 2025

https://github.com/mggg/gerrytools

Tools for evaluating districting plans.

data-analysis tooling

Last synced: 10 Sep 2025

https://github.com/louis-heraut/aeag_toolbox

🛠️ R toolbox to provide a simple way of interacting with all the code necessary to carry out hydrological stationarity analysis for the Agence de l'Eau Adour-Garonne (AEAG)

climate climate-change climate-data data-analysis data-visualization environment hydrology inrae low-water mann-kendall mann-kendall-tests r stationarity-test statistics

Last synced: 08 Apr 2026

https://github.com/bdslab-upv/dashi

A flexible and powerful Python toolkit for dataset shift analysis and characterization, providing supervised and unsupervised evaluation of temporal and multi-source data shifts, visualization tools, and statistical insights for data integrity and model performance monitoring

data-analysis data-science dataset-shift python temporal-analysis

Last synced: 13 Dec 2025

https://github.com/quantumudit/analyzing-cleanaway-services

This project scrapes all service locations across Australia and their associated attributes from the Cleanaway website, applies the necessary transformations to the scraped data, and then analyzes and visualizes it using Jupyter Notebook and Power BI.

data-analysis data-science data-transformation data-visualization etl jupyter-notebook power-bi python webscraping

Last synced: 25 Jan 2026

https://github.com/quantumudit/analyzing-suez-services

This project scrapes all service locations across Australia and New Zealand and their associated attributes from the Suez website, applies the necessary transformations to the scraped data, and then analyzes and visualizes it using Jupyter Notebook and Power BI.

data-analysis data-science data-transformation data-visualization etl jupyter-notebook power-bi python webscraping

Last synced: 18 May 2026

https://github.com/abhiksark/udacity-dataanalyst-nanodegree

All the code I submitted during the Udacity Data Analyst Nanodegree course.

data-analysis data-visualization matplotlib pandas python seaborn statistics udacity udacity-nanodegree

Last synced: 07 Apr 2026

https://github.com/elkronos/anovatoolbox

This GitHub repository contains a collection of functions for performing various statistical analyses and generating visualizations. The functions are designed to work with different types of data and provide comprehensive outputs for data analysis.

anova anova-model data-analysis r statistics

Last synced: 17 Mar 2025

https://github.com/storopoli/r_scripts

A couple of handy R scripts that I use on a daily basis for scientific research

data-analysis data-science data-visualization r scientific

Last synced: 08 Jul 2025

https://github.com/AnonCatalyst/WebHound

WebHound is your Python-powered command-line assistant for sharp and efficient web searches! It sniffs out data from major search engines and detects social platforms, helping you uncover valuable insights and stay ahead of the game. 🌐🔍✨

awareness data-analysis data-visualization information-retrieval osint osint-python osint-reconnaissance osint-resources osint-tool osint-tools osinttools web-osint webscraping

Last synced: 29 Jul 2025

https://github.com/yashika-malhotra/strategic-analysis-of-retail-brand-in-south-america-using-sql

Leveraged BigQuery and MySQL to analyze 100K records for sales optimization, trend identification, and customer satisfaction for a retail brand in South America, providing insights and recommendations to grow its user base and improve its services

bigquery data-analysis data-science database database-schema google-bigquery mysql-server sql

Last synced: 18 Apr 2026

https://github.com/DataHerb/dataherb-python

Python Package for DataHerb: create, search, and load datasets.

data data-analysis data-mining database dataset python

Last synced: 08 May 2025

https://github.com/super-lou/card

🎴 Card of Analyse and Diagnostic in R for a user-friendly experience of data aggregation with EXstat

aggregation climate-change climate-data climate-science data-analysis data-science diagnostic environment environment-variables hydrology hydrology-statistical inrae r statistics tools user-friendly

Last synced: 13 Apr 2025

https://github.com/josmarcristello/geokmlanalyzer

The Geo Kml Analyzer is a Python-based tool designed to process and analyze elevation data from KML files. It uses Google Maps to obtain elevation data (interpolating if necessary) and spherical trigonometry equations to calculate distance.

data-analysis geolocation geospatial gis google-maps-api gps kml python

Last synced: 30 Jul 2025

https://github.com/cjunwon/youtube-data-analysis

End-to-end YouTube data analysis project using the YouTube Data API, MySQL, AWS, and Flask

aws-rds data-analysis datapipeline flask nlp pandas python shell sql vader-sentiment-analysis youtube youtube-api

Last synced: 17 Feb 2026

https://github.com/farfarfun/fundata

Data processing toolkit - provides data cleaning, transformation, and analysis functions

data-analysis data-processing farfarfun numpy pandas python

Last synced: 17 Feb 2026

https://github.com/crafterkolyan/applied-statistical-data-analysis

Applied statistical data analysis course. CMC MSU (Faculty of Computational Mathematics and Cybernetics, Moscow State University). Spring 2020

autoexec-scripts autotest data-analysis github-actions statistics university

Last synced: 17 Jun 2025

https://github.com/nikolas-virionis/polynomial-regression

Python package that analyzes a given dataset and finds the best regression representation, using either the smallest possible polynomial degree (to stay reliable without overfitting) or other models such as exponentials and logarithms

data-analysis exponential-regression flexibility logarithmic-regression logistic-regression polynomial-regression python sinusoisdal-regression statistics

Last synced: 06 Apr 2026

https://github.com/forhadulislam/sna-project

A project for Social Network Analysis that analyzes Yahoo query logs

data-analysis data-mining social-network

Last synced: 13 May 2026

https://github.com/valeriopagliarino/tcf-2021-unito-public

Exam project of the course "Computing Techniques for Physics" - Università degli Studi di Torino - Physics department - 2021

cern-root data-analysis geant4-simulation monte-carlo-simulation object-oriented-programming physics

Last synced: 27 Mar 2025

https://github.com/mohammadkarbalaee/python-for-data-analysis-book

All the practice exercises and code I write while reading the book Python for Data Analysis

data-analysis data-science python

Last synced: 27 Mar 2025

https://github.com/varunbanka/data-insights

Data Insights is a user-friendly tool for analyzing large CSV files. Its advanced analytics helps uncover hidden patterns and trends, making it perfect for data scientists and analysts.

artificial-intelligence automation data-analysis data-science dataanalysis datahive numpy pandas python

Last synced: 22 Jun 2025

https://github.com/artdgn/pages

Auto-updating dashboards about COVID-19 https://artdgn.github.io/pages

covid-19 data-analysis modeling

Last synced: 17 Jan 2026

https://github.com/al-ghaly/airline-company-data-warehouse

Data Warehouse modeling, design, implementation, and analysis for an Airline Company.

data-analysis data-warehousing database-modeling sql-server

Last synced: 14 Apr 2025

https://github.com/magnaopus1/synthron-cfd-trader-pro

SYNTHRON CFD Trader PRO is a cutting-edge trading platform featuring raw, custom-designed machine learning models. From reinforcement learning for dynamic strategies to predictive analytics, sentiment analysis, and optimization techniques, it empowers trading across stocks, forex, indices, commodities, futures, and crypto with precision.

ai backtesting cfd commodities data-analysis data-science data-structures forex futures indices machine-learning trading

Last synced: 30 Apr 2025

https://github.com/rikulauttia/ai-commercial-decisionmaking

AI-Driven Large Dataset Analysis & Commercial Decision-Making: Research on predictive analytics, machine learning strategies, and real-world business applications [Python, TensorFlow, PyTorch] 🤖📊

artificial-intelligence big-data business-intelligence business-strategy commercial-decision-making data-analysis data-science decision-making deep-learning machine-learning neural-networks predictive-analytics python research thesis

Last synced: 03 Mar 2026

https://github.com/louis-heraut/AEAG_toolbox

🛠️ R toolbox to provide a simple way of interacting with all the code necessary to carry out hydrological stationarity analysis for the Agence de l'Eau Adour-Garonne (AEAG)

climate climate-change climate-data data-analysis data-visualization environment hydrology inrae low-water mann-kendall mann-kendall-tests r stationarity-test statistics

Last synced: 30 Oct 2025

https://github.com/danielpuentee/outdpik

The fundamental toolkit for outliers search and visualization. It aims to be the fundamental high-level package for this purpose.

data-analysis matplotlib numpy python

Last synced: 16 Aug 2025

https://github.com/lucs1590/strava-analysis

🏃📊 Using Strava for personal analyses and to practice data science skills.

data-analysis data-science github jupyter-notebook python3 strava strava-api

Last synced: 08 Mar 2026

https://github.com/msyriac/alhazen

Tools for gravitational lensing: now deprecated by pixell+orphics+symlens. Leaving it here in case anyone needs to reproduce old stuff.

analysis cmb data-analysis gravitational-lensing lensing

Last synced: 16 Mar 2025

https://github.com/juangesino/behaviouraleconomics

All the files and data for the experiment performed during the course Behavioural Economics @ University of Amsterdam

behavioral-economics behavioural-economics data-analysis economics game-theory statistics

Last synced: 28 Oct 2025

https://github.com/kylejgillett/stevepy

A Space Weather data analysis tool for Python.

astronomy aurora data-analysis physics python space-weather space-weather-research

Last synced: 22 Mar 2025

https://github.com/priyanka7411/dataspark-electronics-retail-analytics

DataSpark is a data analysis project using Python, SQL, and Power BI to analyze global electronics retail sales, focusing on customer behavior, sales performance, product profitability, and store performance to optimize sales strategies.

analytics-providers business-intelligence customer-segmentation data data-analysis electronics-industry global-sales pandas powerbi powerbi-visuals product-profitability python retail-analytics sales-performance sql store-analysis visualization

Last synced: 10 Jul 2025

https://github.com/manikantasanjay/loan_repayment_regression_project

Prediction Of Loan Repayment using Sequential Neural Networks on Lending Club Dataset.

data-analysis data-visualization lending-club loan-repayment matplotlib numpy pandas-library seaborn tensorboard tensorflow

Last synced: 04 Apr 2025

https://github.com/jpvt/data_science

Portfolio with my Data Science Projects.

data-analysis data-science deep-learning machine-learning portfolio xgboost

Last synced: 19 Jun 2025

https://github.com/benjamindpb/wikidata-preprocessing

Wikidata dump preprocessing & analysis of georeferenced entities

data-analysis preprocessing wikidata wikidata-dump

Last synced: 15 Jul 2025

https://github.com/victoku1/social-network-analyzer

Social Network Analyzer is a Flask-powered app that combines AI-driven insights with social media links (e.g., Twitter, Instagram). Validate profiles, add user context, and receive intelligent personality summaries via OpenAI’s cutting-edge models. Perfect for exploring the power of AI in an interactive way!

ai api-integration backend data-analysis flask frontend information-extraction oauth openai psycology python social-media social-network social-network-analysis web-scraping

Last synced: 07 May 2026

https://github.com/cosmoduende/r-holy-books-sentiment-data-analysis

What's the most positive or negative religion? Sentiment and Data Analysis of Holy Books with R. Analysis of religious dogmas by exploring their Holy Books (The Bible, The Quran, The Dhammapada, and The Book of Mormon) with R

bible book-of-mormon data-analysis data-analytics data-visualisation data-visualization dataviz dhammapada holy-scriptures quran religions-studies religious religious-studies sentiment-analysis sentiment-polarity sentimental-analysis text-analysis text-analytics text-mining text-mining-analysis

Last synced: 07 Mar 2026

https://github.com/jaybird1291/anki-llm-review-stats-exporter

Export your Anki review history (revlog) as JSONL so you can analyze it with an LLM (ChatGPT, Claude, local models, etc.) without using any API.

anki anki-addon chatgpt data-analysis data-export jsonl llm review-stats revlog statistics

Last synced: 24 Dec 2025

https://github.com/andreantonacci/eu2019

Social Network Analysis of Twitter Topic-Network Structures during the 2019 European Elections

data-analysis election-analysis european-elections gephi latex master-thesis social-network-analysis

Last synced: 17 Jul 2025

https://github.com/tanaylab/naryn

Native Access medical record Retriever for high Yield aNalytics

data-analysis medical-records

Last synced: 20 Jul 2025

https://github.com/rafaelbroseghini/data-analysis-visualizations-ml

📊 NLP, Regression Models, Storytelling, Time Series Analysis ... in Python

data-analysis data-science machine-learning machine-learning-algorithms time-series visualization

Last synced: 03 Oct 2025

https://github.com/thecoderpinar/credit-card-fraud-detection-project

This project focuses on the detection of credit card fraud using various data science and machine learning techniques. The dataset includes a record of credit card transactions over a specific period, with the goal of accurately identifying fraudulent activities. 🚀✨

anamoly-detection classification-algorithms credit-card-transactions data-analysis data-preprocessing data-science data-visualization fraud-detection machine-learning python

Last synced: 30 Apr 2025

https://github.com/zackakil/hot-shot-basketball-tracker

Mini web app for displaying basketball practice metrics.

basketball chartjs data-analysis html sport visualization

Last synced: 28 Jan 2026

https://github.com/nragland37/event-optimization-tool

R-based Shiny application that maps availability and identifies optimal engagement times to enhance participation within an organization

data-analysis data-cleaning data-preparation heatmap r shiny shiny-app tidyverse

Last synced: 02 Feb 2026

https://github.com/mirdan08/crafty

Data analysis project I developed for the web scraping course.

blockchain data-analysis webscraping

Last synced: 30 Jul 2025

https://github.com/lafayettegabe/nlp-resume-extraction

📝 NER (Named Entity Recognition) project that automates the manual shortlisting of resumes. It proposes using NLP techniques and an NER model to classify and extract relevant entities from resumes, such as person name, college name, academic information, relevant experience, skill set, etc.

big-data data data-analysis data-science eda ner nlp resume-extractor

Last synced: 03 Apr 2025

https://github.com/michaelnabil230/laravel-analytics

A Laravel package to retrieve pageviews and other data from the database

data-analysis data-structures database laravel php

Last synced: 25 Jan 2026

https://github.com/alhankeser/citibike-analysis

Extracting and Transforming Citi Bike Data for Analysis

citibike data-analysis data-science data-visualization etl sql

Last synced: 25 Jan 2026

https://github.com/tideland/go-cells

Light-weight event-processing based on the idea of meshed cells with different pluggable behaviors

cep data-analysis data-stream event-processing events golang

Last synced: 05 Apr 2025

https://github.com/baci-ak/b-vista

Interactive EDA tool to explore pandas DataFrames — via Python, notebooks & Docker

analytics data-analysis data-science data-visualization dataframe docker eda flask flexible ipython jupyter notebook pandas python react visualization

Last synced: 22 Jul 2025

https://github.com/romac/adaproject

🔬 Project proposal for the Applied Data Analysis course at EPFL

data-analysis

Last synced: 31 Dec 2025

https://github.com/ac-gomes/data-engineering-with-databricks

A simple boilerplate for data engineering and data analysis training in Databricks.

data-analysis data-engineering databricks databricks-notebooks pyspark python unit-testing

Last synced: 30 Apr 2025

https://github.com/cafferychen777/microbiomestat-turtorial-professional-version

MicrobiomeStat Tutorial Repository: This is a comprehensive resource for learning how to use the MicrobiomeStat package. It provides a step-by-step guide to effectively analyze complex microbiome data.

16s 16s-rrna 16s-seq analysis data-analysis metagenomes metagenomic-analysis metagenomic-pipeline metagenomics microbiome microbiome-analysis microbiome-analysis-pipelines microbiome-data microbiome-workflow omics omics-data omics-data-analysis omics-data-integration visualization

Last synced: 03 Jan 2026

https://github.com/amrrs/introduction-to-eda-with-python

Introduction to EDA with Python Session Files

data-analysis eda python

Last synced: 12 Sep 2025

https://github.com/ouhscbbmc/data-science-practices-1

Collection of publicly available practices of data science and analysis

bbmc-investigation data-analysis data-science python r sql

Last synced: 27 Jul 2025

https://github.com/gher-uliege/seadatacloud

Tools and interfaces for working with the DIVA interpolation software.

data-analysis data-visualization interpolation nco netcdf ocean-sciences oceanography

Last synced: 30 Mar 2025

https://github.com/omarelgabry/insights.py

A Python package for reading, storing, & analyzing data from Public Data APIs

data-analysis

Last synced: 14 Jul 2025

https://github.com/mynenik/xyplot-32

Extensible Plotting and Data Analysis Program for 32-bit x86 GNU/Linux

cpp data-analysis data-manipulation data-visualization forth linux-app motif xwindows

Last synced: 02 Aug 2025

https://github.com/gabriel-dp/mineirando_github

Project to mine and analyze public GitHub data. Practical work of the Social Network Mining and Analysis subject at UFSJ

data-analysis data-mining github ufsj

Last synced: 07 Mar 2026

https://github.com/antrubtor/socialstats

Analyze your personal data from Discord, Instagram, Snapchat, and WhatsApp. View your call durations, response times, voice messages, and messaging activity in clean Excel charts.

call-duration chat-export data-analysis data-visualization discord excel instagram message-frequency message-history snapchat social-networks-statistics whatsapp

Last synced: 28 Jul 2025

https://github.com/cfneves/turma-visualizacao-de-dados

Official repository of the Data Visualization class · Lab365 / SENAI SC

data-analysis data-visualization jupyter lab365 matplotlib pandas plotly python seaborn senai

Last synced: 05 May 2026

https://github.com/virajbhutada/power-bi-resources

Comprehensive Power BI resources covering interview questions, group training materials, project portfolio ideas, and a cheat sheet for quick reference. Elevate your Power BI skills with curated content designed to enhance your proficiency and boost project success.

data-analysis data-modeling data-visualization dax-expression interview-preparation portfolio-projects powerbi quick-reference visualization-techniques

Last synced: 03 Mar 2026

https://github.com/reiniiriarios/squirrel-table

Desktop application to run MySQL queries over SSH and generate CSV and XLSX files. Useful for QA where queries need to be run repeatedly and files handed off.

csv csv-files data-analysis desktop-application electron excel mariadb mysql nodejs qa quality-assurance quality-control quality-control-assurance sql xlsx

Last synced: 15 Apr 2026

https://github.com/michenriksen/inspectra

A simple web app for data inspection.

data-analysis decoding web-tool

Last synced: 08 May 2026

https://github.com/imranr98/brokeetl

Parse transactions from bank statement PDFs into a JSON array.

automation bank-statement banking data-analysis data-mining data-ownership etl finance json lifestyle pdf pdf-converter tracking

Last synced: 27 Apr 2026

https://github.com/pottekkat/matplotlib-basics

A Jupyter notebook that walks you through the basics of matplotlib for data science applications

data-analysis data-visualization machine-learning matplotlib pyplot python

Last synced: 09 May 2026

https://github.com/avrtt/sentiment-analysis

Sentiment analysis of reviews with a simple web interface in Flask

data-analysis data-mining data-science flask machine-learning nlp parsing text-mining

Last synced: 01 May 2026

https://github.com/dataket/dataket

Project proposal for the Datatón Anticorrupción 2021. Team Dataket

corruption data-analysis dataviz open-government

Last synced: 02 Apr 2026

https://github.com/kwokhing/exploratory-data-analysis-on-smrt-tweets

Demo of exploratory data analysis (EDA) on train service disruptions, based on scraped tweets (user-generated content) from the Twitter account of the train operator (SMRT)

data-analysis data-cleaning data-collection data-preparation exploratory-data-analysis exploratory-data-visualizations folium geospatial-data leaflet-map python python3 regex scraping selenium selenium-python social-media text-processing user-generated-content web-scraping webscraping

Last synced: 27 Jul 2025

https://github.com/vidhi1290/robust-yield-prediction-

"Predicting a Greener Future 🌾📊 Delve into the world of agriculture and data science with our Yield Prediction project. We harness machine learning and weather data to forecast crop yields accurately. Join us in cultivating smarter farming practices for a sustainable tomorrow."

artificial-intelligence data-analysis data-cleaning-and-preprocessing data-science data-visualization dataexploration devops docker machine-learning machine-learning-algorithms matplotlib matplotlib-pyplot pandas python scikit-learn scikitlearn-machine-learning streamlit yield-prediction-for-food-processing

Last synced: 15 Apr 2026

https://github.com/josmarcristello/goprotimeocr

Python-based OCR tool using EasyOCR and OpenCV for automated text extraction from images. Customizable image preprocessing steps and options for GPU acceleration make this a versatile and efficient solution for various OCR tasks

computer-vision data-analysis easyocr gopro gopro-camera ocr opencv pytesseract python

Last synced: 09 May 2026

https://github.com/ndleah/self-quantified

😊 EDA, self-quantified data analysis

data-analysis personal-data-analysis r self-quantify

Last synced: 08 Jun 2026

https://github.com/totonga/odsbox-jaquel-mcp

MCP server to work with odsbox to access ASAM ODS HTTP API.

ai-agents asam data-analysis data-science mcp ods

Last synced: 11 Apr 2026