An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/shubhamgoyal575/credit-card-fraud-detection

📌 Credit Card Fraud Detection using Machine Learning This project focuses on detecting fraudulent credit card transactions using machine learning models like Random Forest, XGBoost, and Deep Learning. The dataset is preprocessed to handle class imbalance, and multiple models are evaluated based on ROC AUC Score and F1 Score.

adaboost-classifier artificial-neural-networks credit-card-fraud data-analysis data-cleaning data-preprocessing data-science data-visualization deep-learning exploratory-data-analysis lightgbm machine-learning machine-learning-algorithms random-forest-classifer scikit-learn tensorflow xgboost

Last synced: 08 Feb 2026

https://github.com/imnotamr/datasets-used

A comprehensive collection of datasets for machine learning and data science projects, covering topics from advertising and sales to health and sports analytics

ai classification data-analysis data-science data-visualization deep-learning jupyter-notebook machine-learning models python regression-models

Last synced: 19 May 2026

https://github.com/syarwinaaa09/exploring-airbnb-market-trends

a data analysis project exploring NYC Airbnb listings, using data visualization and pandas for price trends, room types, and reviews.

airbnb data-analysis data-science data-visualization jupyter-notebook new-york-city nyc pandas price-analysis reviews room-types

Last synced: 30 Apr 2026

https://github.com/samir-atra/share-lm_dataset_analysis

Analysis, studies and optimizations on the ShareLM extension dataset

data-analysis data-visualization gemma3n huggingface huggingface-transformers pandas

Last synced: 19 May 2026

https://github.com/kartikey2807/bike-classification-1rt700

Binary classification problem involving Logistic regression, SMOTE and feature expansion.

data-analysis data-engineering data-visualization logistic-regression

Last synced: 30 Jul 2025

https://github.com/danasilver/twacker

Track your Twitter friends (following) and followers.

data-visualization heroku-app twitter

Last synced: 12 Jul 2025

https://github.com/shuyib/london_weather_prediction

The London Weather Project aims to predict the mean temperature in London using historical weather data, involving data cleaning, feature engineering, and modeling with techniques like imputation, transformation, scaling, and the use of Mlflow for tracking model performance and hyperparameters.

data-cleaning data-lab data-science data-visualization datacamp-projects environmental-science feature-engineering forecasting jupyter-notebook machine-learning mlflow open-data python random-forest regression-analysis time-series weather-prediction

Last synced: 29 Mar 2025

https://github.com/hooopo/ossinsight-pick

Handpicks, features, or highlights a selection of open-source repositories each week. We cherry-pick the best, trending, or otherwise interesting repositories, providing an in-depth analysis you won't find elsewhere, thus enabling developers to discover, learn from, and contribute to these noteworthy projects.

analytics data-visualization github open-source trending-repositories visualization

Last synced: 30 Jul 2025

https://github.com/mariarodr1136/numdynamics

NumDynamics is an advanced random number generation system that combines C's performance with Python's analytical power. It generates, analyzes, and visualizes random numbers across various statistical distributions, offering a precise and efficient toolkit for random number analysis. 📊

algorithm-design c-programming data-science data-visualization histograms numerical-computing numerical-simulations probability-distributions python-scripts random-number-generator statistical-modeling statistics

Last synced: 04 Jul 2026

https://github.com/tapiwamakandigona/react-analytics-dashboard

Real-time analytics dashboard with charts, KPIs, and data tables | React + TypeScript + Recharts

analytics charts dashboard data-visualization react recharts tailwindcss typescript vite

Last synced: 06 Apr 2026

https://github.com/civicdatalab/hp-fiscal-data-explorer-frontend

Frontend for Himachal Pradesh Fiscal Data explorer for Open Budgets India Platform

budget contributions-welcome data-visualization hacktoberfest open-budgets open-source opensource reactjs

Last synced: 10 Apr 2025

https://github.com/elitay152/assemblyai_audio_project

Audio analysis project using AssemblyAI's API

audio-analysis data-visualization machine-learning

Last synced: 10 Apr 2025

https://github.com/tushar2704/tableau-portfolio

Collection of Power BI projects and dashboards that demonstrate my skills and expertise in data visualization, business intelligence, and analytics using Power BI.

artificial-intelligence dashboard data-science data-visualization tableau

Last synced: 25 Jan 2026

https://github.com/stat-by-tish/house-insurance-data-analysis

Fraud detection in house insurance using MATLAB – EDA, classification (trees, KNN, SVM, RF), and clustering. Built for a student project.

classification classification-trees clustering data-visualization exploratory-data-analysis house-data insurance-claims kmeans-clustering knn-classification matlab

Last synced: 26 Jun 2025

https://github.com/no-country-simulation/c21-55-n-data-bi

Trabajo de análisis estadístico en Power Bi, sobre la deserción de alumnos en carreras culturales universitarias de argentina.

data-visualization

Last synced: 18 Feb 2026

https://github.com/prady2309/stock-analysis

Analysis on the stock prices of Apple, Google, Microsoft and Amazon

data-analysis data-science data-visualization python stock-market

Last synced: 19 May 2026

https://github.com/ezeparziale/analisis-uso-bicicletas-caba

:biking_man: Análisis de como afecto la pandemia el uso de las bicicletas en CABA.

data data-science data-visualization

Last synced: 14 Mar 2025

https://github.com/manoharvit/personality-prediction-test

Career professionals and psychologists use this information in personality career tests for recruitment and candidate assessment. Accurate personality estimation is delicate and can be falsified to some extent by a seeker, as I did. Given that employment is frequently associated with significant fiscal and social benefits, job campaigners are incen

data-mining data-science data-visualization machine-learning

Last synced: 30 Jul 2025

https://github.com/amaravivian/client-project-analysis

"Comprehensive data analysis project for a new client to provide data-driven recommendations."

data-science data-structures data-visualization r tableau

Last synced: 02 Apr 2025

https://github.com/rohitblaze10/microsoft_stock_analysis-2025-kaggle

A comprehensive analysis of Microsoft's (MSFT) stock data from 1986 to 2025, covering trends, volatility, and interactive visualizations using Python

data-science data-visualization eda python

Last synced: 08 Nov 2025

https://github.com/ismailhakkii/data_visualization

Yapay veri seti oluşturma, veri görselleştirme, veri setini eğiltme, test, lojistik regresyon ve modelin değerlendirilmesi.

data-set-creation data-visualization logistic-regression model-training

Last synced: 24 Apr 2026

https://github.com/annnieglez/computer-vision-parking-lot

This project leverages computer vision techniques to analyze parking lot occupancy. The goal is to detect available parking spaces in real-time using image and video input.

computer-vision data-analysis data-science data-visualization google-colab image-classification image-processing machine-learning python transfer-learning

Last synced: 15 May 2026

https://github.com/bytelabss/frontend-vue-api5

Frontend of the data visualization project for the 5th semester of Banco de Dados at FATEC-São José dos Campos

data-visualization vuejs

Last synced: 17 Mar 2025

https://github.com/nivasharmaa/friskwatch

A Java program for analyzing stop-and-frisk data from the NYPD. Features data import, organization, and statistical analysis to compare occurrences during and after policy implementation.

data-analysis data-visualization dataprocessing datascience file-io java java-oop nypd-data

Last synced: 19 May 2026

https://github.com/analyticalnahid/seaborn-tutorial

A complete Notebook on Seaborn for Data Science

data-visualization seaborn seaborn-tutorial

Last synced: 23 Aug 2025

https://github.com/saketr3/voting-policy-impact-visualizer

Data visualization web app where users can compare voter turnout of different demographics with states’ voting policy fairness scores

data-visualization voting

Last synced: 14 Mar 2025

https://github.com/madhursinghbhadoriya/data_analysis_sales_insights_using_tableau

• Performed Data Cleaning using MySQL. • Data analysis and ETL in Tableau. • Created an Interactive Dashboard with significant information about the Sales Insights, Profit and Revenue Analysis.

data-analysis data-visualization dataanalysis etl mysql tableau-dashboards tableau-desktop

Last synced: 09 Apr 2025

https://github.com/sumit-sinha9/uber-rides-data-analysis

This project aims to analyze Uber ride data to understand various aspects of ride usage, such as the distribution of rides across different categories, purposes, months, days, and times.

data-analysis-python data-analytics data-visualization pandas-python powerbi python rec uber

Last synced: 15 May 2026

https://github.com/alan-oliveir/previsao_cartao_fidelidade

Projeto de ciência de dados para previsão de plano de fidelidade para clientes de uma companhia aérea.

data-science data-visualization database gradio python sql

Last synced: 04 May 2026

https://github.com/furkankarakuz/turkey_earthquake

This project focuses on analyzing and visualizing earthquake data specific to Turkey. It aims to provide insightful visualizations on topics such as earthquake frequency, location, and magnitude using data obtained from Boğaziçi University Kandilli Observatory and Earthquake Research Institute.

api data data-visualization earthquake python python3 request streamlit turkey turkey-earthquake

Last synced: 20 May 2026

https://github.com/touradbaba/women-parliament-representation-dashboard

A data visualization dashboard built with Dash and Plotly to explore women's representation in parliaments worldwide and is deployed on Heroku and PythonAnywhere.

dash dashboard data-visualization exploratory-data-analysis heroku heroku-deployment plotly pythonanywhere

Last synced: 15 Jun 2025

https://github.com/the-ethan-hunt/dekh-data

Playground for data visualization notebooks

data-visualization jupyter-notebook python

Last synced: 28 Mar 2025

https://github.com/otsaloma/pollen-chart

Helsinki pollen count visualization

data-visualization javascript lambda pollen python

Last synced: 17 Apr 2026

https://github.com/saisathvik07/e-commerce-sales-analysis-using-sql-and-powerbi

This repository provides an extensive examination of Amazon Sales Data utilizing SQL

analytics data-science data-visualization mysql-database powerbi

Last synced: 22 Mar 2025

https://github.com/prsdthkr/viz-design-demo

📈 This repo houses my experiments with D3 (mostly inspired by other's work) for information visualization class project and demo.

d3 data-visualization lgbtq

Last synced: 06 May 2026

https://github.com/Akhil-krishnan-r/super_market_analysis

The growth of supermarkets in most populated cities are increasing and market competitions are also high. This dataset is one of the historical sales of supermarket company which has recorded in 3 different branches for 3 months data. Predictive data analytics methods are easy to apply with this dataset

data-visualization matplotlib numpy pandas seaborn

Last synced: 03 Feb 2026

https://github.com/alrza2003/google-data-analysis-case-study-cyclistic

This project analyzes Cyclistic’s trip data to identify patterns in bike usage between casual riders and annual members. The findings help optimize marketing strategies and membership conversions.

business-task cyclistic-bike-share-analysis-case-study data-analysis data-science data-visualization google-data-analytics google-data-analytics-capstone-project google-data-analytics-professional jupyter-notebook python rmarkdown tableau

Last synced: 09 May 2026

https://github.com/jofaval/iris-flowers

Multilabel Classification of the famous Iris Flowers Dataset from Ronald Aylmer Fisher in 1936

classification data-analysis data-science data-visualization google-colab iris-flowers kaggle machine-learning python scikit-learn xgboost

Last synced: 05 Apr 2026

https://github.com/cudavailable/gdp-data-visualization

A data visualization project for GDP data

data-visualization gdp vue

Last synced: 20 May 2026

https://github.com/vaxdata22/customer-churn-data-analytics-etl-pipeline-by-airflow-on-ec2

This is an end-to-end AWS Cloud ETL project. This orchestration uses Apache Airflow on AWS EC2 as well as AWS Glue. It demonstrates how to build ETL pipeline that would perform data transform using Glue job/crawler as well as loading into a Redshift table. It also shows how to connect Amazon Athena to Glue Data Catalog, and Power BI to Redshift.

amazon-athena amazon-redshift apache-airflow aws-ec2 aws-glue aws-s3 business-intelligence customer-churn-analytics dags data-visualization etl-pipeline orchestration power-bi python3

Last synced: 18 Jun 2025

https://github.com/srinibas-masanta/hotel-revenue-analysis-dashboard

This project focuses on analyzing hotel booking data to uncover key metrics and insights that drive revenue management decisions. By creating an interactive Power BI dashboard, the project aims to improve strategic decision-making, optimize occupancy rates, and enhance overall financial performance within the hospitality industry.

business-analytics data-analysis data-science data-visualization dax-functions hospitality powerbi

Last synced: 12 Jan 2026

https://github.com/prakhar-code/house_sales_analysis

House Sales Analysis Of King County, Washington, USA and Clean Visualization.

data-cleaning data-visualization excel tableau tableau-dashboards tableau-public

Last synced: 12 Jan 2026

https://github.com/albertofaraujo/excel_dashboard_rh

É um painel de controle de Recursos Humanos que permite monitorar informações essenciais da empresa, oferecendo recursos de filtragem por áreas, tipos de contratos e períodos (em meses).

analise-de-dados dashboards data-visualization excel figma

Last synced: 27 Mar 2025

https://github.com/jbalooshie/bikesharing

Repo showing a Tableau story I created using Citi Bike data. Visualizations were created in Tableau showing rental duration, popular days/times to travel, and user breakdowns by gender/age.

citibike data-visualization tableau

Last synced: 09 Apr 2025

https://github.com/shivasairam1706/mlops-project1

End-to-end ML-Ops project using PySpark and AWS, covering environment setup, model training, deployment with data capture, execution, and analysis. CI/CD pipelines (AWS CodePipeline) and monitoring (CloudWatch) ensure automated deployment, performance tracking, and model retraining for production-ready ML solutions.

aws aws-lambda aws-s3 data-engineering data-science data-visualization delta-lake docker forcasting mlops-project pyspark unix-shell

Last synced: 20 May 2026

https://github.com/asimpson/is-steph-mvp

🏀 Compare the 2018 NBA MVP contenders against Steph Curry's historic, unanimous, 2016 MVP season.

data-visualization nba reactjs

Last synced: 20 May 2026

https://github.com/alra-code/data-analytics-com-power-bi

Desafio de projetos do Boocamp Data Analytics realizado pela Dio Me em 2024

analytics data-visualization desafios-resolvidos dio-bootcamp powerbi pt-br

Last synced: 25 Jan 2026

https://github.com/karanch10/fraudshield

FraudShield is a machine learning credit card fraud detection system that analyzes transaction attributes to identify suspicious activities in real time. Built with Python, SQL, and Django, it provides a user-friendly interface for fraud prediction using OpenBanking APIs and advanced detection techniques. Ideal for businesses and individuals.

data-analysis data-science data-visualization machine-learning python3

Last synced: 20 May 2026

https://github.com/youssef-saaed/activity-recognition-using-various-ml-algorithms

This project involves a comprehensive comparative analysis of various machine learning models to classify activities based on a given dataset. The analysis follows a structured approach, including data exploration, model training, model evaluation, and results interpretation to identify the best performing model.

activity-recognition comparative-analysis cross-validation data-exploration data-visualization machine-learning model-evaluation model-training neural-networks

Last synced: 22 Mar 2025

https://github.com/hs094/data.science.094

This repository features my collection of Python notebooks, highlighting a diverse range of data science tasks, projects, and innovative ideas. It will be continuously updated with new insights, experiments, and explorations as I expand my knowledge and expertise in the field.

data-analytics data-science data-visualization python

Last synced: 16 Feb 2026

https://github.com/even-wei/chartdown

A blazingly fast markdown-to-PDF converter that transforms CSV data into beautiful charts, written in Zig. Create data-rich documents with simple syntax.

charts cli-tool csv data-visualization markdown pdf zig

Last synced: 15 May 2026

https://github.com/kaustubh-indulkar/te-it-dsbda-assignmnets

This repository contains the solutions for a series of assignments covering Data Science And Big Data Analytics concepts.

big-data big-data-analytics data-analytics data-science data-visualization sppu-2019-pattern sppu-it-dept

Last synced: 29 Mar 2025

https://github.com/djsprenk/djsprenk.github.io

GitHub Pages site for DJ Sprenk

d3 d3-visualization data-visualization dj music python

Last synced: 20 May 2026

https://github.com/thergh/aisd-lab

Programs made for the Algorithms and Data Structures class

algorithms data-structures data-visualization

Last synced: 03 Apr 2025

https://github.com/xkomil/datasciencesummerstudy

I want to document in this repository all my studying data science topics

data-science data-visualization machine-learning meteostat numpy pandas seaborn sklearn streamlit-webapp

Last synced: 05 May 2026

https://github.com/c2r0b/2q

Manage data and relationships with AI

data-visualization graphql relationships rust tauri

Last synced: 09 Apr 2026

https://github.com/jpgiant/training_project

Analyzing whether there is a difference between the average death ages of left handers and right handers using Bayesian Conditional Probability Theorem.

bayesian-statistics data-analysis data-visualization numpy pandas-dataframe python

Last synced: 30 Apr 2026

https://github.com/dianannperla/trangs-web-thong-ke

Một ứng dụng web đơn giản để thống kê dữ liệu và hiển thị biểu đồ bằng Vue.js.

chartjs data-visualization frontend vuejs web-development

Last synced: 23 Sep 2025

https://github.com/irishmorales/ph-poverty-statistics

An exploratory data analysis of Philippine poverty data. Data includes given 1991-2015 data, appended FIES 2018 & 2021 data, and 2024 & 2027 poverty estimates calculated using ARIMA.

data-visualization exploratory-data-analysis philippines poverty-alleviation

Last synced: 22 Mar 2025

https://github.com/ranxi2001/predicting-mental-health-risk

数据分析案例-精神健康预测(数据来源kaggle)

data-analysis data-visualization eda

Last synced: 27 Jun 2025

https://github.com/samruddhi3012/rfm-analysis

Hi there! In this project I have performed Sales Analysis (RFM Analysis) using SQL and Tableau.

data-analysis data-visualization mssqlserver rfm-analysis segmentation tableau

Last synced: 27 Jun 2025

https://github.com/catalina2820/inteligencia-de-negocios

This repository contains materials and resources for the Business Intelligence course. It includes notes, workshops, and practical exercises that cover essential concepts and applications in data science, data visualization, machine learning, and big data.

bigdata data-cleaning data-science data-visualization web-scraping

Last synced: 04 Apr 2025

https://github.com/hannahgsimon/halmodeling2024

Developed code using the Hybrid Automata Library (HAL) to create a spatial agent-based model of radio-immune response to spatially fractionated radiotherapy. This project was in association with the Cleveland Clinic Lerner Research Institute, Jacob Scott Lab.

agent-based-model bifurcation-analysis cancer-models computational-biology data-visualization hybrid-automata immune-response mathematical-modelling ordinary-differential-equations radiation-therapy spatial-model statistics systems-biology

Last synced: 23 Nov 2025

https://github.com/mkk-1817/cvip-ds-breast-cancer-prediction

Completed Phase 1 Project: Breast Cancer Prediction at CodersCave as a Data Science Intern. Achieved outstanding results with Logistic Regression (96.49% accuracy), SVM (93.85%), ANN (92.98%), and Random Forest (94.73%). Integrated a user-friendly Flask UI for accessibility, contributing to impactful healthcare solutions.

data-science data-visualization flask-application jupyter-notebook machine-learning machine-learning-algorithms python

Last synced: 30 Jun 2026

https://github.com/zacoppotamus/cityshade

[WIP] Studies in GPU-based cartography

cartography data-visualization glsl

Last synced: 13 May 2026

https://github.com/gappeah/solana-ml-forecast

This project uses machine learning, specifically an XGBoost regressor, to predict the price of Solana (SOL) based on historical data and engineered features.

cryptocurrency data-visualization machine-learning solana xgboost

Last synced: 25 Feb 2025

https://github.com/jwalsh/jwalsh

GitHub profile with automated topic analysis and visualization of my open source contributions

automation data-visualization github-api github-profile makefile org-mode

Last synced: 06 Apr 2026

https://github.com/vlad1343/data-visualisation

Python project showcasing interactive and static visualizations using Plotly and Matplotlib. It includes analysis of CSV, JSON, and API data, turning complex datasets into clear, insightful charts.

anova api csv-files data-analysis data-visualization json matplotlib matplotlib-pyplot pandas pandas-python plotly python3 seaborn seaborn-python

Last synced: 08 Apr 2026

https://github.com/faizantkhan/python_matplotlib

Matplotlib is a powerful Python library for creating visualizations and plots. It’s widely used for data representation, making complex information more accessible and interpretable. It offers various types of plots, including line graphs, scatter plots, bar charts, histograms, and more

data-analysis data-analytics data-engineering data-science data-visualization deep-learning graphs line machine-learning machine-learning-algorithms matplotlib matplotlib-pyplot matplotlib-python python

Last synced: 20 May 2026

https://github.com/vzamboulingame/data-portfolio

This repository showcases my projects in Python and SQL, highlighting my skills in data analysis & visualization.

data-analysis data-portfolio data-science data-science-portfolio data-science-projects data-visualization jupyter-notebook portfolio python sql

Last synced: 20 May 2026

https://github.com/vipulbunny/ml-learning_projects

A collection of machine learning projects implemented in Python, showcasing core concepts like regression, classification, clustering, and model evaluation techniques. Ideal for learners and data science enthusiasts.

classification clustering data-analysis data-science data-visualization decision-trees jupyter-notebook machine-learning model-evaluation random-forest regression supervised-learning unsupervised-learning

Last synced: 23 Jul 2025

https://github.com/bhargav-joshi/gradient-descent-in-linear-regression

Gradient Descent is the process of minimizing a function by following the gradients of the cost function. This involves knowing the form of the cost as well as the derivative so that from a given point you know the gradient and can move in that direction, e.g. downhill towards the minimum value.

3d-visualization data-visualization gradient-descent-algorithm gradient-descent-implementation

Last synced: 22 Jun 2025

https://github.com/zhouzhuofei/juliadl

learning Julia, write some notebooks, like machine learning and data science, visualization.

data-science data-visualization julia mxnet

Last synced: 21 Apr 2026

https://github.com/ravish1729/meteoroids-landing

Plotting coordinate where meteoroids have fallen on world map.

basemap data-visualization matplotlib nasa

Last synced: 15 May 2026

https://github.com/cecoeco/fcc_d3_certification

Data Visualization Certification (freeCodeCamp)

d3js data-visualization javascript

Last synced: 07 Sep 2025