An open API service indexing awesome lists of open source software.

Data analysis

Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.

https://github.com/mindful-ai-assistants/credit-card-prediction

💳 This repository focuses on building a predictive model to assess the likelihood of credit card defaults. The project includes data analysis, feature engineering, and machine learning to provide accurate default predictions.

artificial-intelligence data-analysis data-science jupyter logistic-regression machine-learning oneness-consciousness predictive-modeling python3 scikit-learn

Last synced: 12 Apr 2025

https://github.com/jimbrig/EDA

Exploratory Data Analysis R Package and Shiny App

data-analysis data-visualization eda r shiny

Last synced: 30 Jul 2025

https://github.com/avinesh-masih/data-analytics-assignment

Complete PW Skills Data Analytics Assignments: This repository contains all PW Skills Data Analytics assignments, covering topics like Python, SQL, Statistics, Data Visualization, and more. It includes well-structured solutions with notebooks and queries, ideal for learners seeking clarity and hands-on practice.

ai api data-analysis data-science data-visualization eda flask jupyter-notebook machine-learning matplotlib numpy pandas pw pw-assignment pw-skills-assignment pwskills python seaborn sql statistics

Last synced: 13 Jun 2025

https://github.com/freekatz/english-reading

考研英语(10-19)数据集及相关数据分析

data-analysis dataset

Last synced: 16 Jul 2025

https://github.com/DCS-training/intromachinelearning

This course is aimed at providing an introduction to machine learning for those with some beginner level python/Rstudio skills. Go to the readme file

data-analysis data-wrangling machine-learning python statistics

Last synced: 25 Apr 2025

https://github.com/lafayettegabe/g2m-insight-for-cab-investment-firm

📊 Exploratory Data Analysis (EDA) on multiple datasets related to the cab industry in the US, to provide actionable insights and recommendations to a private firm looking to invest in the market. The analysis includes data cleaning, transformation, visualization, and hypothesis testing.

big-data data data-analysis data-science data-visualization eda gotomarket

Last synced: 13 Jun 2025

https://github.com/asifdotexe/timeseriesanalysis

This repository serves as a central hub for all of my projects related to time series analysis. Here, you'll find a collection of projects, code samples, and resources that explore various aspects of time series data and its analysis.

data-analysis feature-engineering jupyter-notebook pandas python time-series-analysis visualization

Last synced: 14 Apr 2026

https://github.com/aravind-selvam/covid_dashboard

With Covid death and vaccine data. I have created a dashboard.

covid-19 data-analysis data-science data-visualization tableau tableau-public visualization

Last synced: 08 Mar 2026

https://github.com/luminati-io/Amazon-dataset-samples

A sample dataset of over 1,000 Amazon product listings, extracted using the Bright Data API, perfect for competitive analysis, market trends, and eCommerce insights.

amazon api data-analysis data-science dataset ecommerce products web-scraping

Last synced: 09 Apr 2025

https://github.com/code-jl/nfl-point-kicker-data-scraper

A Python-based web scraping toolkit that extracts and processes NFL kicking statistics from Pro-Football-Reference. This project automates the collection of comprehensive game data, with a particular focus on field goal attempts and environmental conditions.

automation beautifulsoup csv data-analysis data-collection field-goals football-statistics kicking-stats nfl python selenium sports-analysis statistics weather-data web-scraping

Last synced: 06 Sep 2025

https://github.com/camille-maslin/simulfcimage

🔍 SimulFCImage: A professional multispectral image processing application developed for ImViA Laboratory.

academic-project computer-vision data-analysis gui-application image-processing image-viewer multispectral-images pyqt5 python scientific-visualization spectral-analysis

Last synced: 05 Feb 2026

https://github.com/mustafah/dream-my-plots

Create visual plots in Python with the help of text prompting popular LLMs through langchain

ai artificial-intelligence automation data-analysis data-visualization langchain llms machine-learning plotting python

Last synced: 13 Apr 2026

https://github.com/scailfin/flowserv-core

Reproducible and Reusable Data Analysis Workflow Server

benchmarks data-analysis reproducibility reusability workflows

Last synced: 14 Jan 2026

https://github.com/tathithienthanh/dataanalysis_diagnosis-of-diabetes-based-on-data-set-of-blood-test-result

Implement all learned knowledge about data analysis and data mining to make a complete project about Diagnosis of diabetes based on data set of blood test result

blood-test classification clustering data-analysis data-processing decision-tree diabetes-prediction diagnosis exercise google-colab health hierarchical ipynb kmeans knn knns py python smote-sampling visualization

Last synced: 08 Jan 2026

https://github.com/tillbiskup/cwepr

A Python package based on the ASpecD framework for handling cwEPR data.

continuous-wave data-analysis data-processing electron-paramagnetic-resonance reproducible-research reproducible-science

Last synced: 06 Sep 2025

https://github.com/agungbudiwirawan/supermarket_sales_dashboard-sales-analysis-using-pivot-table-and-chart

The objective of this project is to analyze supermarket sales using pivot table and chart in Microsoft Excel. After doing the analysis, I created an interactive dashboard that aims to make it easier for the audience to explore data.

dashboard data-analysis data-science data-visualization excel sales-dashboard spreadsheet

Last synced: 08 Jan 2026

https://github.com/is-leeroy-jenkins/badger

A data science & analysis toolkit for federal analysts with the environmental protection agency based on WPF, Net 8, and is written in C#.

ai budget budget-management data-analysis data-science data-visualization federal-government large-language-models machine-learning

Last synced: 14 Oct 2025

https://github.com/ernestaroozoo/memestocks.net

MemeStocks.net is a Python web app that tracks the historical popularity of specific stocks by monitoring Reddit mentions. Users can search for a stock symbol and view information such as the stock's name, price, and historical popularity data. The data is gathered using the Pushshift API and stored in a PostgreSQL database.

dashboard data-analysis financial-data meme-stock python reddit-scraper scraping sql stock streamlit

Last synced: 06 Mar 2026

https://github.com/devexpress-examples/web-forms-pivot-grid-bind-to-sql-data-source

This example demonstrates how to create an ASPxPivotGrid and bind it to data via code.

asp-net-web-forms data-analysis dotnet pivot-grid pivot-grid-for-web-forms

Last synced: 06 Jul 2025

https://github.com/ashwinpn/visualization

Data Visualization using Matplotlib, Pandas Visualization, Seaborn, ggplot, and Plotly.

analysis data data-analysis data-science data-visualization graphs plots python python3 visualization

Last synced: 13 Apr 2026

https://github.com/arzan101/ola-data-analytics

Ola - Identified the reason and trends for ride cancellation. Process - Cleaned and Processed Data from multiple sources, applied Sql queries and visualized data using PoweBi . Motive - To reduce the cancellation rate

dashboard data-analysis data-mining data-visualization dataanalytics excel powerbi sql

Last synced: 06 Jan 2026

https://github.com/llnl/hdtopology

High-dimensional topological data analysis library for NDDAV

analysis cpp data-analysis data-viz high-dimensional-data topological-data-analysis visualization

Last synced: 29 Apr 2025

https://github.com/its-kanii/predictive-maintenance-for-healthcare-equipment

Predictive Maintenance for Healthcare Equipment utilizes machine learning to analyze operational metrics and predict equipment failures. This project leverages a dataset of usage hours, temperature, and maintenance history to enhance equipment reliability and reduce downtime.

data-analysis data-science failure-prediction feature-engineering healthcare-equipment jupyter-notebook machine-learning predictive-maintenance python time-series-analysis

Last synced: 09 May 2026

https://github.com/henrylin03/video-games

Using Python and SQL to clean, analyse and visualise video games' data from Metacritic. Includes scraping using BeautifulSoup.

analysis beautifulsoup beautifulsoup4 data data-analysis data-science eda jupyter-notebook pandas python sql sqlite3 video-game video-games

Last synced: 14 Apr 2026

https://github.com/naso7y/students-performance-analysis

A project analyzing students' academic performance to identify trends and factors affecting outcomes. Built with Python, using data visualization and statistical techniques to derive actionable insights.

data-analysis data-visualization machine-learning python

Last synced: 23 Feb 2026

https://github.com/ivanildobarauna-dev/data-pipeline-sync-ingest

ETL Process for Currency Quotes Data" project is a complete solution dedicated to extracting, transforming and loading (ETL) currency quote data. This project uses several advanced techniques and architectures to ensure the efficiency and robustness of the ETL process.

business-intelligence data-analysis data-analytics data-engineering data-pipeline data-visualization etl-pipeline python

Last synced: 28 Oct 2025

https://github.com/jpquast/icp-ms-data-explorer

A shiny app for the exploration of ICP-MS data.

data-analysis icp-ms r shiny shiny-apps

Last synced: 17 Jan 2026

https://github.com/emso-c/stream-analyser

A tool that analyses YouTube live streams.

cli data-analysis guessing highlights python youtube-video

Last synced: 18 Jan 2026

https://github.com/markmelnic/scalg

List scoring algorithm. Analyse data using a range based procentual proximity algorithm.

algorithm data-analysis pypi pypi-package score scorer scoring scoring-algorithm

Last synced: 08 Oct 2025

https://github.com/markmelnic/carsen-desktop

A python dashboard app for scraping and tracking cars for sale on websites such as mobile.de

automation dashboard data-analysis interface scraping scraping-websites tkinter-python

Last synced: 08 Oct 2025

https://github.com/fenghaojiang/ethereum-etl

ETL(Extract, Transform, Load) data from Ethereum like EVM Block chain

data-analysis ethereum etl

Last synced: 14 Jan 2026

https://github.com/lacerbi/vbmc

Variational Bayesian Monte Carlo (VBMC) algorithm for posterior and model inference in MATLAB (old location)

bayesian-inference data-analysis gaussian-processes machine-learning matlab variational-inference

Last synced: 11 Oct 2025

https://github.com/briatte/asr

Applied Stats with R and RStudio (first-year social-science tutorials)

course data-analysis data-science data-visualization r statistics

Last synced: 14 Apr 2026

https://github.com/codebypinar/reta

🍃 Explore the world of renewable energy production, analyze historical data, and predict sustainable energy trends. Join us on the journey to a greener future!

arima clean-energy data-analysis data-science data-visualization energy-future forecasting-models innovation renewable-energy sustainability time-series

Last synced: 12 Oct 2025

https://github.com/1sumer/sql

This repository contains SQL scripts and data for various analytical and database management tasks. The project is designed to demonstrate SQL capabilities in handling complex queries, data analysis, and database design. It includes datasets related to e-commerce and streaming services, with a focus on real-world scenarios and use cases.

analytics data data-analysis data-storage sql vscode

Last synced: 19 Jan 2026

https://github.com/hvignolo87/ortex-programming-challenge

Coding challenges required for the Python Developer and Data Engineer job positions.

challenge data-analysis finance pandas python scripting sql sqlalchemy

Last synced: 17 May 2026

https://github.com/winter000boy/dsa-practice

This repository holds my solutions for LeetCode’s Pandas playlists. Each section includes code and notes on using Pandas to handle real-world data tasks efficiently. Perfect for anyone looking to deepen their understanding of data manipulation with Pandas.

data-analysis data-science leetcode leetcode-python pandas-python python3

Last synced: 06 Feb 2026

https://github.com/pizofreude/data-career-navigator

An interactive dashboard providing deep insights into career opportunities for data-related roles, utilizing a comprehensive dataset sourced from LinkedIn. Features include analysis of experience levels, salaries, key skills, job locations, and industry trends, aiding job seekers and professionals in exploring and identifying optimal career paths.

codeinplace data-analysis data-visualization standford-university

Last synced: 13 Mar 2026

https://github.com/quantumudit/alteryx-weekly-challenges

This repository contains Alteryx solutions to the weekly challenges published in Alteryx Community

alteryx alteryx-workflow data-analysis data-science data-transformation data-visualization etl

Last synced: 27 Jan 2026

https://github.com/parisaroozgarian/ibm-data-analyst-professional-certificate

The IBM Data Analyst Professional Certificate, consisting of 9 courses, equips with essential skills in Excel, SQL, Python, data visualization, and analysis techniques

big-data business-analysis business-communication communication data-analysis data-management data-structures data-visualization databases general-statistics human-resources planning python-programming spreedsheet sql

Last synced: 27 Jan 2026

https://github.com/yeisonmontoya1815/machine-learning_prediction_can_inflation

we aim to predict trends in the Canadian market basket using sentiment analysis techniques. Sentiment analysis involves analyzing text data to determine the sentiment expressed, whether positive, negative, or neutral.

algorithms-and-data-structures data data-analysis data-science data-visualization feature-engineering machine-learning matplotlib-pyplot numerical-analysis numpy pandas pipelines python sklearn structured-data super unsupervised-learning

Last synced: 05 Feb 2026

https://github.com/dcs-training/bayesian-statistics

Materials for the CDCS Introduction to Bayesian Statistics course. Go to the readme file

bayesian-statistics data-analysis r statistics

Last synced: 05 Feb 2026

https://github.com/rvalla/chessevolution

Some code to analyze my chess games using the Lichess API.

chess data-analysis lichess lichess-api python

Last synced: 23 Oct 2025

https://github.com/trybnetic/tu7-acceleration-sleep-wake-classification

Supporting material for the paper ''Discrimination of sleep and wake periods from a hip-worn raw acceleration sensor using recurrent neural networks''

accelerometer accelerometry actigraphy data-analysis sensors sleep

Last synced: 01 Jun 2026

https://github.com/kevinschoon/qviz

QViz Interactive Plotting

data-analysis data-visualization go gonum qframe yaegi

Last synced: 01 Jun 2026

https://github.com/dcs-training/r-qgisintegratingspatialanalysis

This was an intermediate course of three sessions with a focus on developing skills in data visualisation, analysis and integration using both R studio and QGIS. Go to the readme file

data-analysis data-visualisation data-wrangling gis qgis r spatial-analysis

Last synced: 28 Jan 2026

https://github.com/thennen/py-ivtools

A package for reproducible measurement and analysis of current-voltage characteristics of electronic devices.

current-voltage data-analysis data-visualization electrical-engineering emerging-technology instrumentation measurements

Last synced: 23 Jan 2026

https://github.com/rgalyeon/machine_learning_and_data_analysis

Machine Learning and Data Analysis specialization by Yandex and MIPT

coursera data-analysis data-science machine-learning mipt python yandex

Last synced: 03 Mar 2025

https://github.com/muzammil-13/data_analysis-inmakes

A data-driven project that leverages machine learning to predict Bitcoin price trends. Using historical Bitcoin data, this analysis provides 30-day price forecasts through advanced statistical modeling.

data-analysis data-science learning-by-doing machine-learning numpy pandas python python-library task

Last synced: 19 Feb 2026

https://github.com/cosmoduende/r-marvel-vs-dc

DC Comics vs Marvel Comics - Exploratory Data Analysis and Data Visualization with R. Who has the smartest, strongest, fastest, or most powerful hero or villain? How to answer this and more questions with R

comics data-analysis data-analysis-r data-analytics data-visualization dataviz dc-characters dc-comics eda exploratory-analysis exploratory-data-analysis exploratory-data-visualizations marvel-characters marvel-comics marvel-vs-dc shdb superherodb superheroes superheros

Last synced: 11 Apr 2025

https://github.com/tameronline/tameronline

Showcasing Projects on Data Analysis, Programming, and AI — Developed Using Python and Modern Frameworks

data-analysis deep-learning flask machine-learning numpy pandas python3 sql web-development

Last synced: 11 Jun 2025

https://github.com/michaelcurrin/water-crisis-scraper

Scrape and explore data related to Cape Town's water crisis (Python3 application)

cape-town cron csv dam-levels data-analysis html open-data python3 schedule scraping south-africa water-crisis water-level webscraping

Last synced: 28 Jul 2025

https://github.com/quantumudit/uk-student-accommodation-analysis

This project focuses on scraping student properties related data from the UK Student Accommodation website; performing necessary transformations on the scraped data and then analyzing & visualizing it using Jupyter Notebook and Power BI.

data-analysis data-science data-transformation data-visualization etl jupyter-notebook power-bi python webscraping

Last synced: 27 Apr 2026

https://github.com/atxtechbro/flightradar24

Advanced Python application leveraging the power of APIs and the pandas library to retrieve and perform in-depth analysis of flight data from Flightradar24. It uncovers insights such as the most common departure and arrival cities, contributing to the field of aviation data science.

api-integration aviation-data data-analysis data-science data-visualization flightradar24-api pandas-library python requests-library web-scraping

Last synced: 21 Mar 2025

https://github.com/thealphadollar/messiah

Messiah: The Mighty Son Of God Is Here To Help You Through Times Of Calamity

azure backend data data-analysis flask frontend materialize natural-disasters

Last synced: 19 Jan 2026

https://github.com/poga/dat-ipynb-demo

use ipython notebook to analyze data in dat archive

dat data-analysis distributed jupyter-notebook

Last synced: 17 Aug 2025

https://github.com/avrtt/paysage

Pandas add-on library: find data quality issues and clean/improve dataframes in one line using scikit-learn transformer

data-analysis data-cleaning data-compression data-profiling data-quality data-quality-checks data-reporting pandas pandas-dataframe schema-validation scikit-learn scikit-learn-transformer

Last synced: 14 May 2026

https://github.com/simoneas02/data-science

🐍 A planning study to become a data scientist and to improve my current skills. 🤘🏼🌻

data data-analysis data-science data-visualization deep-learning machine-learning pandas python3 r sql

Last synced: 12 Apr 2026

https://github.com/zachlagden/spotify-listening-analyzer

A comprehensive Python tool for analyzing your Spotify listening history data.

analytics data-analysis pandas python spotify-web-api spotipy

Last synced: 31 Jul 2025

https://github.com/techytushar/india-odi-analysis

Analysis of ODI cricket matches of Indian Team

cricket data-analysis data-science pandas plotting python3

Last synced: 05 May 2026

https://github.com/elysian01/ml-eda-and-modelling-using-streamlit

Beautiful Web interface made using Streamlit for quick Exploratory Data Analysis and building classification models which are implemented from scratch.

data-analysis data-visualization eda exploratory-data-analysis knn-classification logistic-regression matplotlib ml-model-on-web ml-models naive-bayes-classifier pandas seaborn streamlit streamlit-webapp

Last synced: 12 Apr 2025

https://github.com/kaguya163/ankara_coffee_sales_analysis

"Coffee shop sales analysis in Ankara. SQL, Tableau, Python, Data Analytics"

data-analysis mysql python sql tableau

Last synced: 17 Feb 2026

https://github.com/leonism/customer-predictive-analysis

Explore this repository, a comprehensive resource offering an in-depth guide to conducting customer predictive analysis using cutting-edge machine learning techniques, all within the intuitive framework of Dataiku.

data-analysis data-model data-science data-visualization dataiku machine-learning predictive-modeling

Last synced: 28 Mar 2025

https://github.com/dcs-training/from-spss-to-r-how-to-make-your-statistical-analysis-reproducible

Comfortable/aware of how to run your stats in SPSS? Curious to learn how to run them in R? You've come to the right place. Go to the readme file

data-analysis data-visualisation data-wrangling good-practices-digital-research r rmarkdown spss statistics

Last synced: 25 Jan 2026

https://github.com/prathmesh2507/coffee-sales-powerbi-dashboard

Interactive Coffee Shop Sales Dashboard built using Power BI

business-intelligence dashboard data-analysis data-visualization dax powerbi

Last synced: 16 May 2026

https://github.com/depressioncenter/data-and-design-core

Code developed by the EFDC Data and Design Core team to support mental health research.

data-analysis data-science efdc inference r statistical-analysis umich

Last synced: 19 May 2026

https://github.com/juicedata/juicefs-deeplearning-tutorials

Deep Learning and Data Analytics Techniques with the help of JuiceFS.

data-analysis deep-learning filesystem juicefs machine-learning

Last synced: 07 Jul 2025

https://github.com/prathmesh2507/india-superstore-powerbi-dashboard

Interactive India Superstore Sales Dashboard built using Power BI

business-intelligence dashboard data-analysis data-visualization powerbi

Last synced: 16 May 2026

https://github.com/thecoderpinar/reta

🍃 Explore the world of renewable energy production, analyze historical data, and predict sustainable energy trends. Join us on the journey to a greener future!

arima clean-energy data-analysis data-science data-visualization energy-future forecasting-models innovation renewable-energy sustainability time-series

Last synced: 03 Apr 2025

https://github.com/rapidsurveys/oldr

An Implementation of the Rapid Assessment Method for Older People (RAM-OP)

assessment data-analysis odk r ram-op rapid-assessment

Last synced: 12 Apr 2025

https://github.com/thecoderpinar/worldpopulationanalysis2024

World Population Analysis 2024: An In-Depth Exploration of Urban and Rural Populations and Infrastructure Accessibility

data-analysis data-science economic-indicators machine-learning population-growth prophet-forecasting

Last synced: 03 Apr 2025

https://github.com/i4ds/ecallisto_ng

Ecallisto NG is a Python package tailored for interacting with Ecallisto data.

data-analysis data-visualization e-callisto ecallisto-international-network numpy pandas python spectrometer

Last synced: 13 Oct 2025