An open API service indexing awesome lists of open source software.

Data analysis

Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.

https://github.com/stimulsoft/stimulsoft.dashboards.php

Dashboards.PHP is a complete software package for designing and viewing dashboards. Includes the JS data analysis engine, dashboard designer and viewer. Support PHP 5, PHP 7, and PHP 8 versions.

charts dashboard-builder dashboards data-analysis data-grid data-visualization datatable dynamic-dashboard interactive-dashboards live-data mysql-data php php-bi-tools php-dashboard php-kpi php7 php8 pivot-tables sql-datasources statistics

Last synced: 14 Oct 2025

https://github.com/yash22222/tata-data-visualisation-virtual-internship

Data Visualisation: Empowering Business with Effective Insights Gain insights into leveraging data visualisations as a tool for making informed business decisions.

basics ceo charts cmo data-analysis data-interpretation data-science data-visualization graphs machine-learning mcq microsoft-excel microsoft-power-bi microsoft-word powerpoint-presentations python tableau tata tata-data-visualisation

Last synced: 22 Jul 2025

https://github.com/rhenkin/visxhclust

A Shiny app and functions for visual exploration of hierarchical clustering.

clustering data-analysis data-science r r-package r-shiny rstats shiny-apps

Last synced: 02 Apr 2025

https://github.com/davidzajac1/reptoro

A Data Visualization and Analytics Platform for the Reptile Industry

analytics data-analysis data-visualization plotly-dash python

Last synced: 15 May 2026

https://github.com/cosmoduende/r-ufo-sightings

Are we alone in the universe? - Data Analysis and Data Visualization of UFO sightings with R. How to analyze and visualize data of UFO sightings of the last century in the USA and the rest of the world with R language.

data-analysis data-analytics data-science data-visualisation data-visualization data-visualizations dataviz ovni ovni-dataset r-code r-language r-programming r-stats ufo ufo-analysis ufo-dataset ufo-sighting ufo-sightings

Last synced: 13 May 2025

https://github.com/jimbrig/lossrunAnalyzer

R Package and Shiny App to Analyze Insurance Lossruns

actuarial data-analysis data-mining data-science insurance r record-linkage risk-management shiny

Last synced: 30 Jul 2025

https://github.com/thecoderpinar/earthquake_prediction_analysis_project

🌍 Welcome to the Earthquake Prediction Analysis Project! 🚀 This project aims to predict earthquake magnitudes using LSTM neural networks and analyze seismic data. Explore, analyze, and forecast earthquakes with ease! 📈🔮

analysis data-analysis data-science earthquake-prediction geocoding geology lstm lstm-neural-networks machine-learning matlab matlab-deep-learning open-source time-series visualization

Last synced: 16 Aug 2025

https://github.com/negativenagesh/spam-ham_email_detection_machine_learning

This project focuses on classifying spam/ham emails, using machine learning algorithms like LGR, NB, RF, DT etc.. and based on the accuracy score and precision score I chose logistic regression for the classification. And I have used streamlit for frontend.

app data-analysis data-cleaning data-engineering data-science data-visualization data-visualizations jupyter-notebook logistic-regression machine-learning modeling naive-bayes-classifier nlp python

Last synced: 12 Apr 2025

https://github.com/pepe-god/dataprophet

Extracts the identity information citizens from MySQL, creates a family network based on TC ID No. and exports it to CSV

101m 109m adres data-analysis data-extraction database-connector family-tree genealogy gsm hsys identity mysql-database python-script pyton

Last synced: 13 Jul 2025

https://github.com/quantumudit/basketball-players-analysis

The project focuses on analyzing salaries and various other in-game metrics of top NBA basketball players from 2005-14 by performing exploratory data analysis with Python and Jupyter Notebook and by visualizing the data in an insightful dashboard made with Power BI

data-analysis jupyter-notebook power-bi python

Last synced: 17 May 2026

https://github.com/nicucalcea/raise

An R library that uses ChatGPT / GPT to generate data

chatgpt chatgpt-api chatgpt-app data-analysis gpt gpt-35-turbo openai openai-chatgpt parsing r

Last synced: 05 Mar 2025

https://github.com/scottgriv/river-charts

🌊 📉 A Python, Django, Plotly, and Pandas web application that visualizes river data in real time, pulled using an API from the United States Geological Survey (USGS).

api charts data data-analysis data-visualization dataset django pandas plotly python usgs usgs-api visualization webapp

Last synced: 12 Aug 2025

https://github.com/louis-heraut/exstat

🌾 R package to provide an efficient and simple solution to aggregate and analyze the stationarity of time series

climate climate-change climate-data data-analysis data-visualization environment hydrology inrae low-water mann-kendall mann-kendall-tests r stationarity-test statistics time-series

Last synced: 16 Jun 2025

https://github.com/franpog859/top-of-the-world

🌍🔝 Proof that your country is the top of the world using GeoTIFF images and a little bit of geometry. Data mining project

data data-analysis data-mining elevation geometry geotiff image-processing matplotlib nvector rasterio

Last synced: 16 Feb 2026

https://github.com/louis-heraut/card

🎴 Card of Analyse and Diagnostic in R for a user-friendly experience of data aggregation with parametrisation file.

aggregation climate-change climate-data climate-science data-analysis data-science diagnostic environment environment-variables hydrology hydrology-statistical inrae r statistics tools user-friendly

Last synced: 09 Mar 2026

https://github.com/csparpa/last.fm-stats

Exercise on Last.fm data aggregation

data-analysis exercise lastfm lastfm-api python

Last synced: 21 May 2026

https://github.com/ynikitenko/lena

Lena is an architectural framework for data analysis

analysis-framework analysis-pipeline data-analysis data-science

Last synced: 30 Apr 2025

https://github.com/astrodynamic/retailanalitycs-in-postgresql

Develop a SQL script to create a database with tables, views, roles, and functions. Form personalized offers to increase average check, frequency of visits, and cross-selling.

bd csv data-analysis data-export data-input data-manipulation data-validation database-management functions git margin offers postgresql retail role-permission-management selling sql transaction tsv views

Last synced: 06 Apr 2026

https://github.com/johnsell620/sentiment-analysis-goodreads-reviews

Document-level sentiment analysis of book reviews scraped from the Goodreads website. Technologies used include TensorFlow, Spark, HDFS, Sqoop, Scrapy, and D3.js.

data-analysis data-visualization recurrent-neural-networks web-scraping

Last synced: 30 Apr 2025

https://github.com/rikard-helgegren/leverage_analysis_tool

Analyst tool for portfolio construction. How can levereged certificates be used to increase returns in a portfolio while keeping the risk as low as possible. Use the tool and find out.

cpp data-analysis investment kivy-framework python3

Last synced: 12 Apr 2025

https://github.com/apoorvalal/lalrutils

Misc utility functions in R for personal use.

data-analysis r r-package

Last synced: 07 Mar 2026

https://github.com/kellyjadams/spotify-data-analyze

A serverless data pipeline that logs my Spotify listening history to BigQuery using Cloud Run, then visualizes trends with Looker Studio. Built with Python, Flask, Docker, and GCP..

data-analysis data-engineering

Last synced: 07 May 2025

https://github.com/avinashkranjan/basic-data-analysis-and-visualization-in-python

📊 Some of the most important python tools in data science for Data Analysis and Data Visualization.

data-analysis data-science matplotlib matplotlib-pyplot numpy pandas plotly seabourne

Last synced: 30 Oct 2025

https://github.com/saksham-joshi/sentiment_analyzer

Analyze the sentiment of a text stored in a string or file and understand the reason why your blogs and posts are not ranking up.

data-analysis data-analytics python sentiment-analyser sentiment-analysis sentiment-analysis-without-nltk

Last synced: 22 Aug 2025

https://github.com/sushant1827/traffic-forecasting-using-iot-sensor-data

Demonstrates how to utilize XGBoost for traffic forecasting using data gathered from IoT sensors, highlighting its efficiency in processing complex datasets and delivering accurate predictions.

data-analysis data-visualization exploratory-data-analysis feature-engineering feature-importance feature-selection gridsearchcv hyperparameter-optimization hyperparameter-tuning iot random-search xgboost-regression

Last synced: 08 Mar 2026

https://github.com/alexeyev/hse-spb-bigdata-python-fall2016

Материалы к курсу по программированию и инструментам анализа данных, прочитанному в петербургском филиале НИУ ВШЭ осенью 2016 года

course-materials data-analysis numpy pandas python scikit-learn sklearn

Last synced: 07 Apr 2026

https://github.com/asifdotexe/sentimentscoringmodel

This project focuses on performing sentiment analysis on Amazon reviews using natural language processing (NLP) techniques. It includes various steps, from data exploration and preprocessing to building and evaluating sentiment models.

data-analysis data-visualization natural-language-processing sentiment-analysis

Last synced: 28 Oct 2025

https://github.com/orkunaktas/sofascore-webscraping

⚽️I scraped the shot data of the Fenerbahçe - Adana Demirspor match from Sofascore⚽️

beautifulsoup data-analysis football-analytics football-data selenium webscraping

Last synced: 28 Oct 2025

https://github.com/jimmymugendi/email-sms-spam_classifier

Email SMS Spam Classifier is a cutting-edge machine learning solution designed to combat spam messages in email and SMS communications. Leveraging advanced Natural Language Processing (NLP) techniques, the system preprocesses text data by tokenizing, removing stopwords, and stemming, ensuring the most accurate classification results.

data-analysis data-visualization machine-learning-algorithms pandas seaborn-plots sklearn-library

Last synced: 25 Sep 2025

https://github.com/lisa-ho/three-investigators

Respository for scraping and analysing fan data on a German audio drama called 'Die Drei Fragezeichen' (the three investigators).

data-analysis data-viz datawrapper python webscraping

Last synced: 25 Oct 2025

https://github.com/i10mm/gpt-arxiv-fetcher

Revolutionize your research with our GitHub repository, where GPT meets arXiv API for seamless access and analysis of the latest academic papers!

artificial data-analysis intelligence llm machine-learning

Last synced: 14 Jul 2025

https://github.com/irfanchahyadi/ml-notes

Complete personal notes for performing Data Analysis, Preprocessing, and Training ML model.

data-analysis machine-learning plotting python

Last synced: 11 Jul 2025

https://github.com/coderjolly/player-market-value-prediction

There is an intense transfer speculation that surrounds all major player transfers today. An important part of negotiations is predicting the fair market price for a player. Therefore, we are predicting this Market Value of a player using the data provided in csv format.

data-analysis data-visualization decision-tree-regression machine-learning xgboost-regression

Last synced: 22 Jun 2025

https://github.com/c0deta1ker/matbasex

MatBaseX is an all-in-one database and analytical tool for photoelectron spectroscopy (PES) analysis, focused on materials and their X-ray interactions. It offers features like a Materials Properties Database, IMFP & XPS Sensitivity Factor Calculator, and PES N-Layer Simulations & Curve Fitting utilities. Explore its powerful capabilities today!

cross-sections crystal-structure crystallography data-analysis data-fitting database electron imfp imfp-calculator-matlab material material-database matlab matlab-application matlab-gui matlab-toolbox pes-modelling photoelectron-spectroscopy photoionization simulation xps

Last synced: 01 Jul 2025

https://github.com/amrrs/introduction-to-eda-with-python

Introduction to EDA with Python Session Files

data-analysis eda python

Last synced: 12 Sep 2025

https://github.com/storopoli/r_scripts

Couple of handy R Scripts that I use in a daily basis for Scientific Research

data-analysis data-science data-visualization r scientific

Last synced: 08 Jul 2025

https://github.com/bdslab-upv/dashi

A flexible and powerful Python toolkit for dataset shift analysis and characterization, providing supervised and unsupervised evaluation of temporal and multi-source data shifts, visualization tools, and statistical insights for data integrity and model performance monitoring

data-analysis data-science dataset-shift python temporal-analysis

Last synced: 13 Dec 2025

https://github.com/ouhscbbmc/data-science-practices-1

Collection of publicly available practices of data science and analysis

bbmc-investigation data-analysis data-science python r sql

Last synced: 27 Jul 2025

https://github.com/valeriopagliarino/tcf-2021-unito-public

Exam project of the course "Computing Tecniques for Physics" - Università degli Studi di Torino - Physics department - 2021

cern-root data-analysis geant4-simulation monte-carlo-simulation object-oriented-programming physics

Last synced: 27 Mar 2025

https://github.com/ac-gomes/data-engineering-with-databricks

A simple boilerplate for data engineering and data analysis training in Databricks.

data-analysis data-engineering databricks databricks-notebooks pyspark python unit-testing

Last synced: 30 Apr 2025

https://github.com/antrubtor/socialstats

Analyze your personal data from Discord, Instagram, Snapchat, and WhatsApp. View your call durations, response times, voice messages, and messaging activity in clean Excel charts.

call-duration chat-export data-analysis data-visualization discord excel instagram message-frequency message-history snapchat social-networks-statistics whatsapp

Last synced: 28 Jul 2025

https://github.com/thecoderpinar/credit-card-fraud-detection-project

This project focuses on the detection of credit card fraud using various data science and machine learning techniques. The dataset includes a record of credit card transactions over a specific period, with the goal of accurately identifying fraudulent activities. 🚀✨

anamoly-detection classification-algorithms credit-card-transactions data-analysis data-preprocessing data-science data-visualization fraud-detection machine-learning python

Last synced: 30 Apr 2025

https://github.com/bradleyboehmke/uc-bana-6043

Additional resources for the UC BANA 6043 Statistical Computing course

data-analysis data-science data-visualization python

Last synced: 10 Jul 2025

https://github.com/AnonCatalyst/WebHound

WebHound is your Python-powered command-line assistant for sharp and efficient web searches! It sniffs out data from major search engines and detects social platforms, helping you uncover valuable insights and stay ahead of the game. 🌐🔍✨

awareness data-analysis data-visualization information-retrieval osint osint-python osint-reconnaissance osint-resources osint-tool osint-tools osinttools web-osint webscraping

Last synced: 29 Jul 2025

https://github.com/danielpuentee/outdpik

The fundamental toolkit for outliers search and visualization. It aims to be the fundamental high-level package for this purpose.

data-analysis matplotlib numpy python

Last synced: 16 Aug 2025

https://github.com/nikolas-virionis/polynomial-regression

Python package that analyses the given datasets and comes up with the best regression representation with either the smallest polynomial degree possible, to be the most reliable without overfitting or other models such as exponentials and logarithms

data-analysis exponential-regression flexibility logarithmic-regression logistic-regression polynomial-regression python sinusoisdal-regression statistics

Last synced: 06 Apr 2026

https://github.com/quantumudit/analyzing-suez-services

This project focuses on scraping all the service locations across Australia & New Zealand and their associated attributes from "Suez" website; performing necessary transformations on the scraped data and then analyzing & visualizing it using Jupyter Notebook and Power BI.

data-analysis data-science data-transformation data-visualization etl jupyter-notebook power-bi python webscraping

Last synced: 18 May 2026

https://github.com/mirdan08/crafty

Data analysis project i've developed for the web scraping course.

blockchain data-analysis webscraping

Last synced: 30 Jul 2025

https://github.com/forhadulislam/sna-project

A project for Social Network Analysis. Analyzed yahoo query logs

data-analysis data-mining social-network

Last synced: 13 May 2026

https://github.com/quantumudit/analyzing-cleanaway-services

This project focuses on scraping all the service locations across Australia and their associated attributes from "Cleanaway" website; performing necessary transformations on the scraped data and then analyzing & visualizing it using Jupyter Notebook and Power BI.

data-analysis data-science data-transformation data-visualization etl jupyter-notebook power-bi python webscraping

Last synced: 25 Jan 2026

https://github.com/priyanka7411/dataspark-electronics-retail-analytics

DataSpark is a data analysis project using Python, SQL, and Power BI to analyze global electronics retail sales, focusing on customer behavior, sales performance, product profitability, and store performance to optimize sales strategies.

analytics-providers business-intelligence customer-segmentation data data-analysis electronics-industry global-sales pandas powerbi powerbi-visuals product-profitability python retail-analytics sales-performance sql store-analysis visualization

Last synced: 10 Jul 2025

https://github.com/manikantasanjay/loan_repayment_regression_project

Prediction Of Loan Repayment using Sequential Neural Networks on Lending Club Dataset.

data-analysis data-visualization lending-club loan-repayment matplotlib numpy pandas-library seaborn tensorboard tensorflow

Last synced: 04 Apr 2025

https://github.com/victoku1/social-network-analyzer

Social Network Analyzer is a Flask-powered app that combines AI-driven insights with social media links (e.g., Twitter, Instagram). Validate profiles, add user context, and receive intelligent personality summaries via OpenAI’s cutting-edge models. Perfect for exploring the power of AI in an interactive way!

ai api-integration backend data-analysis flask frontend information-extraction oauth openai psycology python social-media social-network social-network-analysis web-scraping

Last synced: 07 May 2026

https://github.com/josmarcristello/geokmlanalyzer

The Geo Kml Analyzer is a Python-based tool designed to process and analyze elevation data from KML files. Uses google maps to obtain elevation data (interpolates if necessary) and spherical trigonometry equations to calculate the distance.

data-analysis geolocation geospatial gis google-maps-api gps kml python

Last synced: 30 Jul 2025

https://github.com/louis-heraut/aeag_toolbox

🛠️ R toolbox to provide a simple way of interacting with all the code necessary to carry out hydrological stationnarity analysis for the Agence de l'Eau Adour-Garonne (AEAG)

climate climate-change climate-data data-analysis data-visualization environment hydrology inrae low-water mann-kendall mann-kendall-tests r stationarity-test statistics

Last synced: 08 Apr 2026

https://github.com/mohammadkarbalaee/python-for-data-analysis-book

All the practice and code that I am doing while I read the book called, Python for data analysis

data-analysis data-science python

Last synced: 27 Mar 2025

https://github.com/autuanliu/rlang

科学统计实验与研究(数据方面)

data-analysis r

Last synced: 30 Jun 2026

https://github.com/juangesino/behaviouraleconomics

All the files and data for the experiment performed during the course Behavioural Economics @ University of Amsterdam

behavioral-economics behavioural-economics data-analysis economics game-theory statistics

Last synced: 28 Oct 2025

https://github.com/baci-ak/b-vista

Interactive EDA tool to explore pandas DataFrames — via Python, notebooks & Docker

analytics data-analysis data-science data-visualization dataframe docker eda flask flexible ipython jupyter notebook pandas python react visualization

Last synced: 22 Jul 2025

https://github.com/lafayettegabe/nlp-resume-extraction

📝 NER (Named Entity Recognition) project aimed at solving the problem of manually shortlisting resumes by automating the process. This project proposes using NLP techniques and NER model to classify and extract relevant entities from resumes such as person name, college name, academics information, relevant experiences, skill set, etc.

big-data data data-analysis data-science eda ner nlp resume-extractor

Last synced: 03 Apr 2025

https://github.com/rafaelbroseghini/data-analysis-visualizations-ml

:bar_chart: NLP, Regression Models, Story telling, Time Series Analysis ... in Python

data-analysis data-science machine-learning machine-learning-algorithms time-series visualization

Last synced: 03 Oct 2025

https://github.com/zackakil/hot-shot-basketball-tracker

Mini web app for displaying basketball practice metrics.

basketball chartjs data-analysis html sport visualization

Last synced: 28 Jan 2026

https://github.com/kylejgillett/stevepy

A Space Weather data analysis tool for Python.

astronomy aurora data-analysis physics python space-weather space-weather-research

Last synced: 22 Mar 2025

https://github.com/mggg/gerrytools

Tools for evaluating districting plans.

data-analysis tooling

Last synced: 10 Sep 2025

https://github.com/super-lou/card

🎴 Card of Analyse and Diagnostic in R for a user-friendly experience of data aggregation with EXstat

aggregation climate-change climate-data climate-science data-analysis data-science diagnostic environment environment-variables hydrology hydrology-statistical inrae r statistics tools user-friendly

Last synced: 13 Apr 2025

https://github.com/alhankeser/citibike-analysis

Extracting and Transforming Citi Bike Data for Analysis

citibike data-analysis data-science data-visualization etl sql

Last synced: 25 Jan 2026

https://github.com/mo-karbalaee/python-for-data-analysis-book

All the practice and code that I am doing while I read the book called, Python for data analysis

data-analysis data-science python

Last synced: 02 Aug 2025

https://github.com/abhiksark/udacity-dataanalyst-nanodegree

All my codes that were submitted during Udacity Nanodegree - Data Analyst Course.

data-analysis data-visualization matplotlib pandas python seaborn statistics udacity udacity-nanodegree

Last synced: 07 Apr 2026

https://github.com/msyriac/alhazen

Tools for gravitational lensing: now deprecated by pixell+orphics+symlens. Leaving it here in case anyone needs to reproduce old stuff.

analysis cmb data-analysis gravitational-lensing lensing

Last synced: 16 Mar 2025

https://github.com/mynenik/xyplot-32

Extensible Plotting and Data Analysis Program for 32-bit x86 GNU/Linux

cpp data-analysis data-manipulation data-visualization forth linux-app motif xwindows

Last synced: 02 Aug 2025

https://github.com/lucs1590/strava-analysis

🏃📊 Using strava to do personal analyses and to practice data scientist skills.

data-analysis data-science github jupyter-notebook python3 strava strava-api

Last synced: 08 Mar 2026

https://github.com/romac/adaproject

🔬 Project proposal for the Applied Data Analysis course at EPFL

data-analysis

Last synced: 31 Dec 2025

https://github.com/artdgn/pages

Auto-updating dashboards about COVID-19 https://artdgn.github.io/pages

covid-19 data-analysis modeling

Last synced: 17 Jan 2026

https://github.com/jaybird1291/anki-llm-review-stats-exporter

Export your Anki review history (revlog) as JSONL so you can analyze it with an LLM (ChatGPT, Claude, local models, etc.) without using any API.

anki anki-addon chatgpt data-analysis data-export jsonl llm review-stats revlog statistics

Last synced: 24 Dec 2025

https://github.com/jpvt/data_science

Portfolio with my Data Science Projects.

data-analysis data-science deep-learning machine-learning portfolio xgboost

Last synced: 19 Jun 2025

https://github.com/michaelnabil230/laravel-analytics

A Laravel package to retrieve pageviews and other data from Database

data-analysis data-structures database laravel php

Last synced: 25 Jan 2026