An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/leandrocollares/employment-insurance-beneficiaries

A responsive line chart that shows regular Employment Insurance beneficiaries in Canada between 2019 and 2021

d3 data-visualization svelte

Last synced: 07 May 2026

https://github.com/amarlearning/exploring-67-years-of-lego

In this project, I have explored database of every LEGO set ever built.

data-manipulation data-visualization importing-and-cleaning-data jupyter-notebook pandas python

Last synced: 07 May 2026

https://github.com/muthukumar0908/-singapore-resale-flat-prices-predicting

This project is to develop a machine learning model and deploy it as a user-friendly web application that predicts the resale prices of flats in Singapore.

data-analysis data-visualization mechine-learing plotly python streamlit

Last synced: 07 May 2026

https://github.com/moustafamohamed01/mall-customer-segmentation-data

Customer segmentation using K-Means clustering based on annual income and spending score.

data-science data-visualization k-means-clustering machine-learning python scikit-learn unsupervised-learning

Last synced: 08 May 2026

https://github.com/ropaxyz/octobot-octopus-energy-discord-bot

A Discord bot for Octopus Energy users to track and visualize their energy consumption. Integrates with Octopus Energy's API to fetch and display personalized energy data, costs, and usage charts.

asyncio data-visualization discord-bot energy-monitoring graphql matplotlib octopus-energy octopus-energy-api python rest-api sqlite

Last synced: 08 May 2026

https://github.com/moiri-gamboni/vintedcalculator

📊 Smart pricing calculator for Vinted sellers - optimize your listings based on storage, seasonality, and market dynamics

clothing-resale data-visualization e-commerce inventory-management pricing-calculator react recharts sales-optimization seasonal-pricing shadcn-ui tailwindcss typescript vinted

Last synced: 08 May 2026

https://github.com/samjoesilvano/password_strength_prediction_using_nlp

Developed a predictive model to categorize passwords as Strong, Good, or Weak, enhancing security and reducing breach risks. The project involves cleaning and analyzing data from an SQL database, using the TF-IDF technique for transformation, and implementing a Logistic Regression model to achieve accurate classifications.

data-analysis data-classification data-cleaning data-visualization logistic-regression machine-learning natural-language-processing pandas password-security password-strength python scikit-learn sql tf-idf

Last synced: 08 May 2026

https://github.com/rightfulcode/retail-sales-breakdown

Time Series Analysis of Walmart Retail Sales – Internship project analyzing sales trends, seasonal patterns, and revenue breakdowns using Pandas, Matplotlib, and Seaborn.

data-analytics data-visualization elevvo-internship matplotlib pandas python retail-sales seaborn time-series-analysis

Last synced: 08 May 2026

https://github.com/iyashwantsaini/911_capstone

For this capstone project we will be analyzing some 911 call data from Kaggle.

capstone data-science data-visualization python3

Last synced: 10 Jun 2026

https://github.com/monajemi-arman/object-detection-utils

Utility scripts used in object detection model training and testing

coco data-visualization dataset-visualizer deep-learning object-detection visualizer yolo

Last synced: 10 Jun 2026

https://github.com/walkerdustin/ml-notebook-template

This is a template to Kickstart your ML project Let this be a starting point for your next Data analysis project

data-science data-visualization machine-learning machine-learning-algorithms ml notebook python

Last synced: 09 May 2026

https://github.com/abhroroy365/market_analysis

This project explores customer segmentation and market analysis in the context of online retail using an online retail dataset. By applying advanced analytics, we aim to uncover insights that can drive strategic decisions and enhance business performance.

clustering data data-analysis data-visualization kmeans-clustering machine-learning market-analysis python silhouette-analysis

Last synced: 09 May 2026

https://github.com/smahala02/materials-science-image-analysis

Image analysis for materials science with a focus on particle diameter measurement and image scaling using Python.

data-visualization image-analysis materials-science particle-measurement python

Last synced: 09 May 2026

https://github.com/shyamkumarnagilla/big-sales-prediction

The "Big Sales Prediction" model is a machine learning project that aims to accurately forecast sales for a given period. The model utilizes the Random Forest Regressor algorithm, a powerful ensemble learning technique, to analyze historical sales data and make predictions. It can be valuable for businesses looking to optimize sales forecasting.

data-analytics data-preprocessing data-science data-visualization machine-learning model-evaluation model-training

Last synced: 09 May 2026

https://github.com/abhinav330/msc-project

AI-Powered Chatbot for University Websites This project enhances the usability of university websites by providing an AI-driven chatbot powered by advanced Large Language Models (LLMs) and Retrieval-Augmented Generation (RAG).

chatbot data-science data-visualization finetuning-llms gemma2 llama3 llama3-finetune llm llm-inference mistral-7b nlp ollama phi-3-mini rag research-project

Last synced: 09 May 2026

https://github.com/chauxvive/fcctreemap

A responsive treemap visualization built with D3.js to display hierarchical data in an interactive format. Created as part of the FreeCodeCamp Data Visualization Certification.

d3 d3js data-visualization dataviz treemap

Last synced: 10 May 2026

https://github.com/rohan3122k/social-media-sentiment-analysis-of-finance-defence-and-healthcare-in-the-usa

This project provides a comprehensive, data-driven analysis of three critical sectors - Finance, Defense, and Healthcare , under the administrations of Donald Trump and Joe Biden.

api aws data-visualization datamining financial-analysis healthcare-application nytimes-api python reddit-api sentiment-analysis wordcloud-visualization

Last synced: 11 May 2026

https://github.com/ceia-prefeitura/urban-lit-tracker-etl

UrbanLitTracker coleta artigos acadêmicos sobre mudanças urbanas via OpenAlex API, processa e armazena em MongoDB. Oferece dashboard interativo com Dash, exibindo dados como trabalhos mais relevantes, autores e palavras-chave frequentes, facilitando a análise e visualização da literatura urbana.

academic-research bibliometrics data-analysis data-pipeline data-visualization etl openalex-api urban-studies

Last synced: 11 May 2026

https://github.com/dannykyungh/data-analytics-portfolio

This is a repository that I have created to showcase skills, share projects and track my progress in Data Analytics / Data Science related topics.

advanced-excel data-cleaning data-modeling data-visualization data-warehousing google-sheets looker-studio python r sql tableau

Last synced: 12 May 2026

https://github.com/magnus0969/gdp-analysis

An in-depth exploration of global GDP trends using Python and data science techniques. This project involves data preprocessing, exploratory data analysis (EDA), statistical insights, and interactive visualizations to understand economic patterns and correlations.

data-science data-visualization gdp-analysis plotly python3

Last synced: 12 May 2026

https://github.com/devanshsahu47/prime-content-analytics

Prime Data Explorer analyzes Amazon Prime's content and credits data to uncover trends in release years, genres, and ratings. It cleans, merges, and visualizes the data to provide actionable insights for optimizing content strategy and boosting audience engagement.

data-analysis data-visualization exploratory-data-analysis jupyter-notebook python3

Last synced: 13 May 2026

https://github.com/ivandobrovolsky/crimeaisukraine

How do maps, data libraries, streaming platforms, travel services, and internet infrastructure classify Crimea? We audited 200 digital platforms across 12 categories.

bigdata data-science data-visualization database machine-learning python

Last synced: 13 May 2026

https://github.com/silkiemoth/eds-240-class-examples

Repository for in-class work assignments and notes in EDS-240 Data Visualization and Communication at UCSB.

classwork data-visualization r ucsb-meds

Last synced: 13 May 2026

https://github.com/mathyouf/kaggle-notebook-code

Code and Images which I used in Kaggle Notebooks. Mostly for style and code clarity.

data-visualization kaggle

Last synced: 14 May 2026

https://github.com/ewels/contributor-graphs

Contributor timelines for any git or GitHub repo: a publication-ready SVG and an interactive HTML page

cli contributors data-visualization git github open-source rust svg timeline visualization

Last synced: 11 Jun 2026

https://github.com/jorgeatgu/d3-bundle

Importando solo los módulos necesarios de d3

d3js d3v4 data-visualization

Last synced: 13 Jun 2026

https://github.com/jameshulse/are-we-winning

An interactive map representing which countries are winning or losing their battle against Covid-19

coronavirus coronavirus-tracking covid-19 data-visualization vue vuejs

Last synced: 13 Jun 2026

https://github.com/bpazy/my_running_page

Make your own running home page

codoon data-visualization garmin gpx keep nike strava

Last synced: 17 Jun 2026

https://github.com/leftcoastnerdgirl/webscraping_and_beautifulsoup

This project uses Beautiful Soup to create scrap data from a news website.

beautifulsoup data-visualization jupyter-notebook splinter webscraping

Last synced: 17 Jun 2026

https://github.com/tanmayborse/institionistic_fuzzy_approx_space

This model introduces a hybrid approach that utilizes rough sets on intuitionistic fuzzy approximation spaces for pre-processing and soft sets for post-processing, resulting in an effective decision-making solution.

data-cleaning-and-preprocessing data-science data-visualization decision-making fuzzy-logic

Last synced: 17 Jun 2026

https://github.com/mattsebastianh/Make-a-Line-Chart

Data Visualization with Matplotlib | Matplotlib Fundamentals

data-visualization matplotlib pandas-dataframe python

Last synced: 18 Jun 2026

https://github.com/dineshram0212/youtube-analysis

This YouTube Analysis Package provides tools for analyzing YouTube video data, including metrics on views, likes, comments, and engagement trends. Ideal for gaining insights into video performance and audience interaction patterns.

data data-visualization pandas python webscraping youtube-api-v3

Last synced: 19 Jun 2026

https://github.com/an4pdm/relatorio-de-vendas

O presente projeto foi feito através das ferramentas oferecidas pelo Power BI afim de aprimorar meus conhecimentos sobre ETL. Os dados utilizados foram de origem do site "Kaggle".

data-analysis data-visualization database etl powerbi

Last synced: 20 Jun 2026

https://github.com/orvn/some-visualizations

Just some visualizations of concepts and data

d3js data-visualization math statistics

Last synced: 24 Jun 2026

https://github.com/lu-m-dev/biostatistics-eda

Exploratory data analysis and visualization system for biostatistical research

biostatistics data-analysis data-visualization eda

Last synced: 25 Jun 2026

https://github.com/hamza-ali-shahjahan/thousandworlds-explorer

An explorable map of every world humanity has discovered — built on the NASA Exoplanet Archive.

astronomy data-visualization exoplanets nasa react space typescript vite

Last synced: 29 Jun 2026

https://github.com/pkjjoshi/behind-the-menu-uncovering-insights-from-restaurant-data

Discover hidden patterns in dining data — from popular cuisine pairings to geographic restaurant clusters

data-analysis data-visualization insights jupyter-notebook pandas python restaurant-data

Last synced: 05 Jul 2025

https://github.com/mikkelrask/henryrollins-scraper

FANATIC! A dataset of Henry Rollins' listens on his KRCW radio show, with data dating back to 2017 - 496 episodes of weird and rare finds, fast paced punk and frog sounds. Includes a scraper that keeps the data up-to-date with henryrollins.com

archive data-analysis data-visualization music

Last synced: 29 Jun 2026

https://github.com/nazir20/scraping-tweets-using-python-and-preprocessing-tweets-for-sentiment-analysis

This is repo is about how to scrape tweets from Twitter using Python and also proprocessing tweets for sentiment analysis

data-cleaning data-visualization jupyter-notebook python twitter-sentiment-analysis

Last synced: 13 Apr 2026

https://github.com/estebanrucan/reporte-comunas-tasa-defuncion-alta_2017

El fin de este reporte es indicar cuales son las mayores causas de defunción en las comunas de Chile en el año 2017, el material queda a libre disposición para que se puedan tomar medidas.

chile data-visualization ggplot2 plotly rmarkdown

Last synced: 04 Feb 2026

https://github.com/saifalibaig/covid-19-infection-rate-analysis-using-python

Analysis of Covid-19 Infection rate and the world happiness report to identify if there is any relationship between infection rate and happiness

data-analysis data-visualization jupyter-notebook numpy pandas python3 sns

Last synced: 18 Apr 2026

https://github.com/parthivnaresh/facilyst

Facilyst is a library that makes using data science and machine learning tools easier.

data-science data-visualization deep-learning machine-learning mock-data neural-network python

Last synced: 18 Mar 2025

https://github.com/bretsw/eme6356-ss23-module5

Slide deck for EME6356, Module 5: Data Visualization (Spring 2023)

analytics data-analytics data-visualization slides visualization

Last synced: 08 Jan 2026

https://github.com/ndiplacide7/r-project

Explore diverse data analysis techniques using R programming combined with advanced machine learning algorithms to uncover insights and create powerful predictive models.

data-analysis data-visualization machine-learning-algorithms r

Last synced: 25 Mar 2025

https://github.com/cartervr/taxdatabase-sql-tableau

End-to-end process for building an SQL Azure database, performing data analysis with SQL and Python, and visualizing data with Tableau.

azure data-science data-visualization database-architecture database-deployment database-management databse-design datanalysis erdiagram sql tableau

Last synced: 13 Mar 2026

https://github.com/rafaelmoura23/capella-info-ai

CapellaInfo is a Laravel-based application designed for automation, data, and AI projects. Its primary goal is to store and manage personal projects efficiently, providing a centralized platform for innovation and development.

artificial-intelligence automation data-science data-visualization laravel neural-network

Last synced: 28 Apr 2026

https://github.com/brazer27/iris-classification

A Python implementation of Naive Bayes algorithm for Iris flower classification. Features include cross-validation, data preprocessing, and prediction capabilities. Built from scratch without ML libraries, achieving ~95% accuracy on the classic Iris dataset.

cross-validation data-science data-visualization flower-classification iris-dataset machine-learning naive-bayes python

Last synced: 06 Sep 2025

https://github.com/djeada/data-visualization

This repository is dedicated to the exploration of various data visualization frameworks through bite-sized code snippets, as well as providing insights on effective data visualization techniques and principles.

altair data-visualization matplotlib plotly

Last synced: 08 Jan 2026

https://github.com/bertiewooster/ipywidgets

Interactive data visualizations in a Jupyter Notebook per tutorial https://python.plainenglish.io/interactive-visualizations-with-pandas-seaborn-and-ipywidgets-173e5d7d6a5e

data-analysis data-science data-visualization ipython-notebook ipywidgets juypter-notebook python

Last synced: 06 Mar 2026

https://github.com/ak-abhilash/insightcat

📊 One-click open-source EDA tool for CSV, Excel, JSON

csv- data-analysis- data-visualization eda- fastapi- open-source- pandas- react-

Last synced: 14 Jun 2025

https://github.com/balajimohan18/milk-production-time-series-forecasting-datascience-project

This project uses time series forecasting to predict future milk production. The data used in this project is monthly milk production data from January 1962 to December 1975. The ARIMA (autoregressive integrated moving average) model is used to forecast the milk production. The model is evaluated using various metric.

acf adf data-analysis data-cleaning data-science data-visualization eda exploratory-data-analysis machine-learning pacf seasonality time-series trends

Last synced: 30 May 2026

https://github.com/quocduyenanhnguyen/california-gas-prices

In this project, I scrapped data from a website to collect different types of gas data and their prices in California.

csv-files data-analytics data-cleaning data-visualization gas-prices mysql python3 tableau tableau-dashboards tableau-public

Last synced: 13 May 2026

https://github.com/alainamariajoe/netflix-data-visualizer

A simple data visualization project using Netflix movies and TV shows dataset to create basic insights and visual representations.

data-visualization matplotlib pandas python seaborn

Last synced: 17 May 2026

https://github.com/suresh-chelani/crop-data-visualization

This project implements data visualization tasks using TypeScript, Vite, Apache ECharts, and Mantine v7. The goal is to process agricultural data, handle missing values, and render a table and a bar chart based on the dataset.

apache-echarts data-visualization mantine-v7 typescript vite

Last synced: 01 Mar 2025

https://github.com/spriggancg/hishiryo

A package to generate a picture representation of any csv file.

csv data-visualization dataset heatmap package pipy python python3

Last synced: 14 Jan 2026

https://github.com/greatwoman23/hotel_reservation_analysis

In this project, we delve into the intricate world of hotel reservations, utilizing a multifaceted analytical approach to uncover valuable insights. Through a combination of SQL queries and Tableau visualizations, we meticulously dissect a rich dataset comprising booking details, customer demographics, and reservation statuses.

data-analysis data-science data-visualization hotel hotel-reservation publications sql sql-query sqlite3 tableau

Last synced: 15 May 2026

https://github.com/sssshefer/covid-map

Interactive map showing covid data implemented on R language

big-data data-visualization r r-studio

Last synced: 01 Mar 2025

https://github.com/thomas-basham/ps-creel

This web application fetches fishing report data from the Washington Department of Fish and Wildlife (WDFW) Creel Reports page and displays it on an interactive map.

creel creel-survey data-science data-visualization database fish fishing nextjs postgresql puget-sound-data pugetsound react sql website

Last synced: 13 Apr 2026

https://github.com/sakeenanavavi/larana_diamonds

A Diamond Price Predictor uses multiple machine learning algorithms to predict the price of a diamond based on a its attributes.

data-visualization knn machine-learning-algorithms random-forest regression support-vector-regression xgboost

Last synced: 10 Jan 2026

https://github.com/aykutsp/world-infrastructure-data-hub

Interactive world map of fuel prices, electricity, EV charging costs and CO2 emissions — daily-refreshed open data pipeline.

choropleth climate-data co2-emissions data-visualization electricity-prices energy-data ev-charging fuel-prices github-actions leaflet open-data react sustainability typescript vite world-map

Last synced: 05 Apr 2026

https://github.com/joaopalmeiro/vscode-altair-snippets

A VS Code extension for scaffolding Altair charts for data visualization.

altair data-visualization python vscode vscode-extension vscode-snippets

Last synced: 09 May 2026

https://github.com/satyam4229/prediction-of-cement-compressive-strength

Prediction of cement compressive strength is a model which is based on Regression model, Here we predict that how much is the compressive strength of the particular cement has with variety of mixtures of its component.

data-analysis data-science data-visualization jupyter-notebook kaggle python

Last synced: 13 Apr 2026

https://github.com/pratik-khose/realtime-sales-simulation

Power BI: Realtime Sales Simulation using SQL Server and Direct Query

data-analysis data-analytics data-visualization dax-query powerbi sql sql-server sqlserver

Last synced: 10 Jun 2026

https://github.com/auliannee/customer-analysis-with-tableau

This repository contains the data source and the tableau workbook.

data-analysis data-visualization tableau

Last synced: 12 Mar 2026

https://github.com/ryanbbrown/volleyball-analysis-project

Analyzes 10 years of self-collected men's NCAA volleyball player height and team wins data to determine the importance of height for success.

data-analysis data-visualization python volleyball

Last synced: 31 May 2026

https://github.com/esther-poniatowski/multitask-context-dependent-behavior

Data analysis of neuronal recordings in naive and trained animals performing multiple tasks in active and passive attentional states

cognitive-neuroscience computational-neuroscience data-analysis data-visualization information-processing

Last synced: 26 Mar 2025

https://github.com/bowenfu/cylindervtk

Generate deformed flexible cylinder based on given displacements

data-visualization python vtk

Last synced: 07 Jul 2025

https://github.com/madhuresh2011/hr-analytics-using-power-bi

HR Analytics Dashboard, leveraging the power of Power BI to transform data into actionable insights.

analysis dashboard data-analytics data-visualization excel-dataset insights power-query powerbi

Last synced: 07 Jan 2026

https://github.com/deliprofesor/health-score-prediction-model-the-impact-of-lifestyle-and-demographic-factors

A machine learning project predicting health scores based on lifestyle and demographic factors like age, BMI, diet, and exercise. Techniques include Random Forest, Polynomial Regression, and Linear Regression, with a focus on model performance and actionable health insights.

cross-validation data data-science data-visualization feature-engineering linear-regression machine-learning polynomial-regression random-forest

Last synced: 10 Apr 2025

https://github.com/archanakokate/ml_mercedes_benz_greener_manufacturing_project

This project involves reducing testing time for car configurations. The tasks include removing columns with zero variance, checking for null values, applying label encoding, performing dimensionality reduction, and using XGBoost to predict testing time.

data-visualization dimentionality-reduction encoding exploratory-data-analysis machine-learning-algorithms

Last synced: 17 Mar 2025

https://github.com/freya135/personal-finance-manager

This project is a web-based personal finance manager dashboard built using Next.js and Vercel PostgreSQL. The dashboard aggregates essential financial data to help users track metrics like profits, sales, and customer activity, and it provides easy-to-read visualizations to support data-driven decision-making.

data-visualization nextjs personal-finance-manager postgresql vercel webdashboard

Last synced: 13 Apr 2026

https://github.com/seblehner/feldprakt

Collection of plotting routines for a field exercise work using different measurement tools and Hobo weather stations.

data-analysis data-visualization jupyter-notebook python

Last synced: 05 Oct 2025

https://github.com/jianxi-erin/bigdata-machinelearning-lab

本项目是一个综合性的大数据与机器学习实验平台,包含两个主要任务,每个任务涵盖三个关键技术模块:大数据处理、数据分析和机器学习。项目基于真实的竞赛设计,提供完整的数据处理模拟和建模实践。

data-analysis data-visualization hadoop machine-learning python spark sql

Last synced: 03 May 2026

https://github.com/subhamghimire/dataanavis

Learning Data analysis and visualization

data-analysis data-science data-visualization dataset

Last synced: 06 Oct 2025

https://github.com/tsbarr/belly-button-challenge

Using front-end development tools (javascript, html and css) I built an interactive dashboard to explore the Belly Button Biodiversity dataset, which catalogs the microbes that colonize human navels.

data data-visualization javascript

Last synced: 04 Mar 2026

https://github.com/boss294/credisync

CrediSync is a comprehensive web application designed to manage and track your financial transactions, including credits, debits, creditors, and debtors. With modern UI/UX and advanced features, it provides a seamless experience for managing your financial records.

credit-tracker css data-structures data-visualization dsa html html3 js management-system money-management web-dev website websoftware

Last synced: 17 May 2026

https://github.com/bdice/signac-micde-cnsccs-2018

Slides and demos for the MICDE CNSCCS Symposium, October 15, 2018

data-management data-visualization demo signac workflow-automation

Last synced: 07 Oct 2025

https://github.com/gabboraron/biostatisztika_es_alkalmazasai

"A statisztika a matematika azon ága, melynek feladata, hogy eszközt adjon a politikusok kezébe, mellyel tetszőleges állítás és annak ellentéte is tudományos alapon igazolható"

biostatistics data-analysis data-visualization r statistics statistics-course

Last synced: 24 Oct 2025

https://github.com/ljadhav25/world-population-analysis-1990-2023-

This repository contains data and analysis related to the world population from 1990 to 2023. The objective is to explore population trends, identify patterns, and visualize demographic changes across different countries and continents over the past few decades.

data-analysis-python data-visualization matplotlib numpy-library pandas-library seaborn

Last synced: 08 Oct 2025

https://github.com/aymanmomin/excel-coffee-data-analytics-exploring-coffee-orders-dataset

This project utilizes a coffee orders dataset to perform comprehensive data analytics and gain insights into customer preferences, popular items, and sales trends. The analysis aims to provide valuable information for coffee shop owners and enthusiasts, facilitating data-driven decision-making and improved customer satisfaction.

data-analysis data-visualization excel project

Last synced: 18 Jan 2026

https://github.com/syncfusionexamples/how-to-add-arrows-to-the-chart-axis-in-wpf-chart

Learn how to enhance WPF charts by adding arrows to the chart axes using annotations for improved visualization and clarity.

axis-with-arrows chart-annotations chart-axis chart-customization charting-library charts data-visualization line-annotation wpf-char wpf-sfcharts

Last synced: 08 Oct 2025

https://github.com/tyriek-cloud/power-bi-nyc-housing-financial-report

This report was conducted to provide a comprehensive analysis of various NYC housing and financial data.

dashboard data-analysis data-visualization financial-analysis powerbi statistics

Last synced: 21 Jan 2026

https://github.com/juanes0023/dashboard-mtp

🚗 Track user activity and revenue in real-time with the Mileage Tracker Pro Dashboard for clear insights and growth trends.

analytics business-intelligence dashboard data-visualization plotly python real-time-analytics saas streamlit supabase

Last synced: 20 Apr 2026

https://github.com/priyanshubiswas-tech/priyanshubiswas-tech

SWE-Data Engineer @ EDN | Kubeflow-MLOps | Kubernetes | Databricks | AWS EMR-Lambda-Glue, Eventbridge, SQS-SNS | OCI Multi-Cloud Architect Professional | GCP GA4 | Gen AI | IEEE Brand Amb. | Ex-Chair, PES | Ex-Sec, SB

apache-spark aws data-analysis data-engineering data-visualization dbt hadoop kubernetes python3 sql

Last synced: 21 Jan 2026

https://github.com/ianhaggerty/final-capstone

This represents the final capstone project in my HyperionDev Data Science (fundamentals) course. A dataset of Amazon customer reviews is analysed using natural language processing.

amazon data-analytics data-science data-visualization dataset matplotlib nlp nlp-machine-learning numpy pandas plotly reviews seaborn spacy tabulate textblob wordcloud

Last synced: 19 Jan 2026