An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/amg-ai-labs/petrol_station_finder

A Python script to find nearby petrol stations and fuel prices using UK government data.

api data-visualization fuel geo python uk

Last synced: 13 Jun 2025

https://github.com/eduardorodriguesf/youtube-trending-scraper

Scraper program that searches youtube trending videos categories

data-visualization matplotlib pandas seaborn selenium

Last synced: 05 May 2026

https://github.com/analysisbyvivek/Road-Accident

Analyzes road accident patterns, exploring factors like lighting, weather, speed limits, time of day, and road conditions to uncover trends in severity and frequency.

data-analysis data-visualization eda jupyter-notebook kaggle tableau-public

Last synced: 29 Jan 2026

https://github.com/wilkerhop/vanguard-anime-critique

Neo-Brutalist web application demonstrating the Vanguard Analytical Framework for anime critique with interactive data visualizations and comparative analysis.

anime article chartjs critical-analysis css data-visualization github-pages neo-brutalism web-design

Last synced: 29 May 2026

https://github.com/wilkerhop/linestream

A dynamic line visualization using HTML, JavaScript, and SVG. Each point has a vertical position based on its currentPosition, and all points are connected. New points can be added dynamically, updating the visual representation in real time. This project explores JavaScript, DOM manipulation, and SVG rendering.

data-visualization dynamic-graphics frontend html interactive-ui javascript proof-of-concept svg web-development

Last synced: 29 May 2026

https://github.com/hassanislam463/data-cleaning-and-modelling-top-5-categories-analysis-forage

This project involves cleaning, merging, and analyzing datasets to identify the top 5 performing categories based on aggregate popularity scores. It includes cleaned datasets, a final merged dataset, visualizations, and a presentation summarizing the tasks and results. Tools used: Microsoft Excel, Python, and PowerPoint.

data-analysis data-visualization microsoft-excel

Last synced: 07 Jan 2026

https://github.com/athul64/tmdb-dataset-analysis

This data set contains information about 10,000 movies extracted from TMDB. The dataset contains movies from 1960 to 2015. Including user ratings and revenue. Original data from Kaggle.

data-visualization dataframe eda numpy pandas python

Last synced: 14 Apr 2026

https://github.com/sco1/xbmini-py

Python Toolkit for the GCDC HAM

data-analysis data-visualization python python3

Last synced: 07 May 2025

https://github.com/quangandrei1003/france_air_pollution_pipeline

End-to-end air pollution data pipeline for French metropolitan cities using Airflow, Python, dbt, BigQuery.

airflow bigquery data data-analytics data-engineering data-modeling data-visualization dbt docker etl pandas python terraform

Last synced: 13 Apr 2026

https://github.com/anarya22/accenture-north-america-data-analytics-and-visualization-job-simulation-on-forage

Completed a simulation focused on advising a hypothetical social media client as a Data Analyst at Accenture. Cleaned, modelled and analyzed 7 datasets to uncover insights into content trends to inform strategic decisions. Prepared a PowerPoint deck and video presentation to communicate key insights for the client and internal stakeholders.

analyzing-visualization data-cleaning data-visualization numpy pandas powerbi powerpoint-presentations

Last synced: 09 May 2026

https://github.com/samruddhi3012/tata-data-visualization

Hi! This repo contains the dashboard I created using Tableau for TATA Data Visualization Training!

data-analysis data-visualization tableau tata

Last synced: 07 Jan 2026

https://github.com/dmarks84/coursework_project_text-mining-topic-modeling

Project for University of Michigan Applied Data Science Specialization -- Developed functions to score similarity between text passages.

data-modeling data-reporting data-visualization databases eda nlp numpy pandas python statistics text-mining

Last synced: 12 Apr 2026

https://github.com/mr-chang95/udacity_movie_project

Movie Data Analysis and Visualization Project for Udacity's Data Analyst Program. Using Python in Jupyter Notebook.

data-analysis data-visualization jupyter-notebook movie python

Last synced: 13 Apr 2026

https://github.com/femincan/d3-choropleth-map

My solution for the Visualize Data with a Choropleth Map project on FCC.

css3 d3js data-visualization html5 javascript

Last synced: 13 Apr 2026

https://github.com/danaelshrbiny10/gold-prices

The Egypt Gold Prices project is a data analysis and visualization initiative that focuses on tracking and understanding the daily gold prices in Egyptian pounds per gram.

data-visualization docker docker-compose matplotlib mongodb numpy pandas powerbi python3 webscraping

Last synced: 13 Apr 2026

https://github.com/neelimabonangi/real-time-weather-data-processing

Processes and analyzes near real-time weather data using the Kappa architecture,Apache Kafka,Spark,Cassandra,docker,AWS EC2,spring boot API

aws cassandra data-visualization dataanalysis dataprocessing docker ec2 json kafka kappa-architecture machine-learning restapi spark springboot-api xml

Last synced: 13 Apr 2026

https://github.com/derrmru/whats-in-the-news

Data Visualisation of News Content

data-visualization nlp react scraped-data

Last synced: 17 May 2026

https://github.com/badranalyst/tips-dataset-analysis-dashboard-with-streamlit-and-plotly

Interactive Streamlit dashboard analyzing the Seaborn 'tips' dataset, which records information on restaurant bills, including total bill amounts, tips, customer demographics (e.g., gender, smoking status), and dining details (e.g., day, time). Visualized with Plotly for insights into tipping patterns.

data-analysis data-analytics data-visualization dataset eda exploratory-data-analysis matplotlib matplotlib-pyplot numpy pandas plotly python seaborn streamlit

Last synced: 13 Apr 2026

https://github.com/farhashaad/farhashaad98

This is a repository to showcase my skills, share projects and track my progress in Data Science related projects.

data data-visualization dataanalysis matplotlib pandas python seaborn sql tableau

Last synced: 24 Apr 2026

https://github.com/brunomontezano/simple-thrive-user-growth-plot

📱 The repo contains a simple line plot for a presentation about the "Thrive: combata a depressão" app showing the user growth from April to October in 2022.

data-visualization data-viz datavisualization dataviz depression ggplot2 plots presentation-materials r-programming thrive-app user-growth

Last synced: 10 Jun 2026

https://github.com/pkjjoshi/behind-the-menu-uncovering-insights-from-restaurant-data

Discover hidden patterns in dining data — from popular cuisine pairings to geographic restaurant clusters

data-analysis data-visualization insights jupyter-notebook pandas python restaurant-data

Last synced: 05 Jul 2025

https://github.com/melih0132/all-my-projects

This repository showcases projects from my computer science journey, covering technologies like web development and interactive applications.

csharp data-visualization database game-development html-css ia javascript kotlin-android python software-development swift unity web-development

Last synced: 05 Apr 2026

https://github.com/nazir20/scraping-tweets-using-python-and-preprocessing-tweets-for-sentiment-analysis

This is repo is about how to scrape tweets from Twitter using Python and also proprocessing tweets for sentiment analysis

data-cleaning data-visualization jupyter-notebook python twitter-sentiment-analysis

Last synced: 13 Apr 2026

https://github.com/saisurajmatta/airbnb-data-visualisation-project

Explored and visualized Seattle Airbnb data to gain insights into pricing, geographic trends, and optimal listing strategies for hosts.

data-analytics data-visualization excel tableau tableau-dashboards tableau-public tableu-workbook

Last synced: 05 Feb 2026

https://github.com/happymary16/data-visualization-labs

Solutions for all labs from 'Data Visualization' course

data-visualization jupiter-notebook python

Last synced: 13 Apr 2026

https://github.com/marianamartiyns/rfm-cluster-analysis

Customer behavior and sales analysis, including data cleaning, RFM calculation, churn analysis and customer clustering.

cluster-analysis data-analysis data-cleaning data-visualization pyhton

Last synced: 16 Mar 2025

https://github.com/mkaspulanwar/p6_bigdata_realtime_largescale_visualization

Praktikum Week 6 Big Data: Real-time analytics dan visualisasi data skala besar menggunakan PySpark Structured Streaming, Parquet Data Lake, dan Streamlit untuk monitoring mobilitas dan traffic smart city.

big-data data-visualization pyspark spark-streaming streamlit traffic-analytics

Last synced: 13 Apr 2026

https://github.com/deliprofesor/breast-cancer-detection-using-svm-with-smote-and-model-optimization

This project analyzes health and lifestyle factors influencing heart attack risk using statistical methods and machine learning, with Ridge Regression identified as the best predictive model.

classification data data-preprocessing data-science data-visualization gridsearchcv machine-learning python roc-curve smote svm

Last synced: 10 Apr 2025

https://github.com/akansharajput280799/covid19-impact-analysis-usa

Data Analysis and Predictive Modeling to study COVID-19 impact across age groups, regions, and seasons in the USA.

classification-algorithm clustering-algorithm data-preprocessing data-visualization descriptive-statistics exploratory-data-analysis matplotlib numpy pandas seaborn

Last synced: 13 Apr 2026

https://github.com/leandrocollares/nyc-film-permits

NYC film permits: an exploratory data analysis

data-analysis data-visualization pandas plotly

Last synced: 05 Jul 2025

https://github.com/deliprofesor/virtual-reality-in-education-impact-analysis-and-insights

This project examines the impact of Virtual Reality (VR) on education, focusing on its effects on student engagement, learning outcomes, and creativity. It uses data analysis techniques like descriptive statistics, correlation analysis, and clustering to assess VR's effectiveness in enhancing learning.

clustering data data-analysis data-science data-visualization exploratory-data-analysis hypothesis-testing machine-learning python regression-analysis virtual-reality

Last synced: 14 Jun 2025

https://github.com/naveen88112/clustering_customer_invoice_data

Customer Invoice Data Clustering This project uses clustering methods on customer invoice data for segmentation analysis. It preprocesses data, normalizes features, and uses K-Means and DBSCAN to cluster customers according to spending habits and shared locations.

clustering data-preprocessing data-visualization numpy pandas python silhouette-score standardization

Last synced: 13 Apr 2026

https://github.com/saifalibaig/covid-19-infection-rate-analysis-using-python

Analysis of Covid-19 Infection rate and the world happiness report to identify if there is any relationship between infection rate and happiness

data-analysis data-visualization jupyter-notebook numpy pandas python3 sns

Last synced: 18 Apr 2026

https://github.com/shoebjoarder/superstore

A Dash app to analyze Superstore dataset.

dashboard data-analysis data-visualization python-3

Last synced: 02 Apr 2025

https://github.com/aditishenoy35/netflix_analysis

An interactive data visualization project exploring Netflix content using Python and Jupyter Notebook.

data-visualization jupyter-notebook python

Last synced: 20 Apr 2026

https://github.com/shellynagar27/business-insights-360-project

A comprehensive Dashboard which provides better understanding of the business's market standing, key focus areas for optimization, underperforming customers, and year-wise financial insights, aiding in better inventory planning and performance tracking. Further it can be used in answering n number of why questions based on the situations.

dashboard data-analysis data-visualization dax-languague dax-studio excel performance-optimization power-bi reporting sql storage-manager

Last synced: 27 Jan 2026

https://github.com/cyber-security-tech/top10-movies-web

Feature-rich full-stack Flask web app that lets users search, rate, and review movies via TMDb API, with smart genre filtering, interactive statistics (Chart.js), form validation (Flask-WTF), star-based ratings, and a polished UI/UX designed for real-world deployment.

api-integration bootstrap chartjs crud-app data-visualization flask flask-blueprints flask-wtf form-validation fullstack genre-filtering jinja movie-database python responsive-design sqlalchemy sqlite tmdb-api ui-ux web-app

Last synced: 08 Apr 2026

https://github.com/bretsw/eme6356-ss23-module5

Slide deck for EME6356, Module 5: Data Visualization (Spring 2023)

analytics data-analytics data-visualization slides visualization

Last synced: 08 Jan 2026

https://github.com/muichi-mon/fxplot

A simple JavaFX-based plotting library for quick and easy data-visualization.

data-visualization javafx plot series-data

Last synced: 16 May 2026

https://github.com/fazzaan/gitbook-sciencing

GitBook sync for Sciencing publishing & training projects

data-presentation data-visualization ebook gitbook science science-communication science-research

Last synced: 08 Jan 2026

https://github.com/cartervr/taxdatabase-sql-tableau

End-to-end process for building an SQL Azure database, performing data analysis with SQL and Python, and visualizing data with Tableau.

azure data-science data-visualization database-architecture database-deployment database-management databse-design datanalysis erdiagram sql tableau

Last synced: 13 Mar 2026

https://github.com/wadeChriestenson/Main_Application

A Django application to host my personal resume.

data-analysis data-visualization django plotly python ui-design

Last synced: 11 Mar 2025

https://github.com/cronware/predictive-maintenance

The Predictive Maintenance System is a C# WinForms application designed to monitor and analyze sensor data from industrial equipment in real time. It integrates machine learning (ML.NET) and MongoDB to detect anomalies, predict failures, and optimize maintenance schedules before equipment breakdown occurs.

csharp data-visualization dotnet machine-learning mlnet mongodb predictive-maintenance winforms

Last synced: 13 Apr 2026

https://github.com/treyhamilton/ds-project-1

A compilation of various programming concepts written in Python/R covering the topics listed below

covid19-data data-science data-visualization exploratory-data-analysis

Last synced: 06 Jul 2025

https://github.com/terilios/automated_data_scientist

Automated Data Scientist: An intelligent, adaptive data analysis tool that leverages AI-driven automation to dynamically plan, execute, and refine data science workflows. Automatically handles data preparation, analysis planning, code generation, and result interpretation using advanced language models.

adaptive-analytics ai-driven-analytics ai-powered-data-tools api-integration automated-data-science automation data-insights data-preparation data-science-workflow data-visualization dynamic-analysis-planning exploratory-data-analysis intelligent-data-processing language-models machine-learning ml-ops openai-gpt python scalable-data-analysis

Last synced: 23 Jun 2025

https://github.com/dbriane208/python-for-data-science

Machine Learning and Data Science repository. Love crafting Machine Learning models.

data-analysis data-science data-visualization machine-learning numpy pandas python seaborn

Last synced: 13 Apr 2026

https://github.com/mituskillologies/data-science-mar25

Programs of Data Science batch @ MITU Skillologies, March 2025

data-analytics data-science data-visualization machine-learning python

Last synced: 16 Mar 2025

https://github.com/master-helix/music-queries

This is a beginner Data Analyst Portfolio Project aimed at providing data insights based on a music store dataset

data-analytics data-visualization ms-excel postgresql sql

Last synced: 06 Sep 2025

https://github.com/vatshayan/pokemon-analysis

Visualization, Analysis & Predicting the accuracy of finding Pokemon power, attack & speed through Machine Learning

artificial-intelligence data data-analysis data-science data-visualization dataset machine-learning machine-learning-algorithms pokemon scikit-learn

Last synced: 30 May 2026

https://github.com/abhisek-13/whatsapp-chat-analyzer

The WhatsApp Chat Analyzer is a data analysis project that provides insights into WhatsApp chats. It analyzes chat data to show metrics like the number of lines, most used letter, chatting duration, media files shared, most used emojis, and group member activity. The results are displayed on a user-friendly dashboard built with Streamlit.

data-analysis data-mining data-visualization eda machine-learning machine-learning-algorithms matplotlib numpy pandas python seaborn sklearn

Last synced: 13 Apr 2026

https://github.com/nittygritty-zzy/quantlab

🚀 Professional quantitative trading research platform with ML-powered backtesting, multi-source options analysis, portfolio management, and interactive Plotly visualizations. Built on qlib with CLI interface.

algorithmic-trading backtesting cli data-visualization financial-analysis machine-learning options-trading plotly portfolio-management python qlib quantitative-finance

Last synced: 14 Jan 2026

https://github.com/bertiewooster/ipywidgets

Interactive data visualizations in a Jupyter Notebook per tutorial https://python.plainenglish.io/interactive-visualizations-with-pandas-seaborn-and-ipywidgets-173e5d7d6a5e

data-analysis data-science data-visualization ipython-notebook ipywidgets juypter-notebook python

Last synced: 06 Mar 2026

https://github.com/superskyyy/stackoverseer

This is a StackOverflow monitor where you can easily access the most trending and up-to-date questions on a particular set of tags. This project can be modified to support wider range of tags and provide functionalities.

charts data-visualization stackoverflow-api stackoverflow-questions

Last synced: 08 Jan 2026

https://github.com/k8hertweck/tidytuesdaydataviz

data viz for TidyTuesday lunch meetup at the Hutch

data-visualization tidytuesday

Last synced: 30 May 2026

https://github.com/quocduyenanhnguyen/california-gas-prices

In this project, I scrapped data from a website to collect different types of gas data and their prices in California.

csv-files data-analytics data-cleaning data-visualization gas-prices mysql python3 tableau tableau-dashboards tableau-public

Last synced: 13 May 2026

https://github.com/srinibas-masanta/electric-vehicle-analysis-dashboard

This repository features an interactive Tableau dashboard that visualizes electric vehicle (EV) adoption trends in the U.S. 🚗⚡ Explore EV growth, top manufacturers, regional distribution, and the impact of incentives—all in one dynamic view. 📊 Use filters to dive deeper into the data and uncover key insights! 🚀

dashboards data-analysis data-visualization tableau

Last synced: 15 Jan 2026

https://github.com/suresh-chelani/crop-data-visualization

This project implements data visualization tasks using TypeScript, Vite, Apache ECharts, and Mantine v7. The goal is to process agricultural data, handle missing values, and render a table and a bar chart based on the dataset.

apache-echarts data-visualization mantine-v7 typescript vite

Last synced: 01 Mar 2025

https://github.com/omar7001-b/data-miner

DataMiner is an interactive web application for data mining and machine learning. It helps users upload, clean, transform, and analyze datasets while building predictive models — all through a simple and powerful Streamlit interface.

data-cleaning data-mining data-preprocessing data-science data-visualization interactive-dashboards pandas python scikit-learn streamlit

Last synced: 28 Apr 2025

https://github.com/izadoraluz/uber-twitter-feedback-analysis

Uma pesquisa exploratória ccom análise de feedback positivo sobre a empresa Uber no Twitter (X) usando visualização de dados, com o objetivo de criar um projeto prático usando PLN e um dashboard intuitivo

dashboard data-visualization pln

Last synced: 05 Feb 2026

https://github.com/diogocarrola/freecodecamp-projects

A collection of projects completed as part of the freeCodeCamp curriculum. This repository showcases my progress and skills in web development, including HTML, CSS, JavaScript and more.

apis data-visualization front-end javascript responsive-design

Last synced: 26 Mar 2025

https://github.com/thomas-basham/ps-creel

This web application fetches fishing report data from the Washington Department of Fish and Wildlife (WDFW) Creel Reports page and displays it on an interactive map.

creel creel-survey data-science data-visualization database fish fishing nextjs postgresql puget-sound-data pugetsound react sql website

Last synced: 13 Apr 2026

https://github.com/samruddhi3012/public-health-data-analysis

Hi! This repo involves analyzing the Healthcare analytics using Advanced Microsoft Excel.

dashboard data-analysis data-visualization healthcare microsoft-excel pivot-chart pivot-tables vlookup

Last synced: 05 Feb 2026

https://github.com/nmelgar/healthy_child_dataviz

Data visualization project to analyze what a healthy child is.

analysis data data-analysis data-science data-visualization dataviz research tableau visualization

Last synced: 23 Feb 2026

https://github.com/nurulashraf/telco-customer-churn-prediction-model

This repository contains a Telco Customer Churn Prediction project using machine learning. It includes data preprocessing, exploratory data analysis, feature engineering, and model development to predict customer churn. Key tools used are Python, Pandas, NumPy, Matplotlib, Seaborn, and scikit-learn.

churn-prediction classification-model customer-churn data-visualization exploratory-data-analysis machine-learning predictive-analytics python scikit-learn

Last synced: 16 Mar 2025

https://github.com/robinmillford/cardiac-care-performance-dashboard

This project presents a comprehensive data analysis and interactive dashboard focused on Cardiac Surgery and Percutaneous Coronary Interventions (PCI) performance by hospital, spanning from 2008 onwards.

cardiac data-analysis data-visualization plotly-express streamlit-dashboard tableau tableau-public

Last synced: 07 Sep 2025

https://github.com/auliannee/customer-analysis-with-tableau

This repository contains the data source and the tableau workbook.

data-analysis data-visualization tableau

Last synced: 12 Mar 2026

https://github.com/esther-poniatowski/multitask-context-dependent-behavior

Data analysis of neuronal recordings in naive and trained animals performing multiple tasks in active and passive attentional states

cognitive-neuroscience computational-neuroscience data-analysis data-visualization information-processing

Last synced: 26 Mar 2025

https://github.com/satyam4229/prediction-of-different-diseases

Prediction of the different diseases with the help of different symptoms express the diseases in the real time. In the dataset, there are 132+ different symptoms on which the model is trained to give the best result of the disease.

data-analysis data-science data-visualization jupyter-notebook kaggle python

Last synced: 13 Apr 2026

https://github.com/ankitrai259/sales_insight_dashboard

Sales Insight: Using SQL for data cleaning and Power BI for making interactive dashboard

dashboard data data-visualization datacleaning postgresql powerbi sql

Last synced: 17 Mar 2025

https://github.com/fbarffmann/mycitibike

Built an interactive Leaflet.js map visualizing over 750 Citi Bike station locations in NYC. Analyzed usage patterns, station density, and user navigation across the network.

citibike data-analysis data-visualization geojson geospatial interactive-map javascript leaflet nyc web-mapping

Last synced: 07 Jul 2025

https://github.com/samruddhi3012/rfm-sales-analysis

Hi there! In this project I have performed Sales Analysis (RFM Analysis) using SQL and Tableau.

data-analysis data-visualization mssqlserver rfm-analysis segmentation tableau

Last synced: 12 Mar 2025

https://github.com/subratamondal1/heart-attack-prediction

Heart Attack Prediction of patients based on the required data. Data Ingestion - Data Preparation - Exploratory Data Analysis (EDA) - Modelling - Evaluation.

data-analysis data-science data-visualization kaggle-dataset machine-learning matplotlib-pyplot numpy pandas python3 scikit-learn seaborn

Last synced: 09 Apr 2026

https://github.com/deliprofesor/health-status-and-heart-attack-risk-eda-regression-and-hypothesis-testing-analysis

This project analyzes health and lifestyle factors influencing heart attack risk using statistical methods and machine learning, with Ridge Regression identified as the best predictive model.

data-cleaning data-visualization exploratory-data-analysis healthcare-insights hypothesis-testing machine-learning ridge-regression

Last synced: 10 Apr 2025

https://github.com/lvsvendsen/shime-monitor-r

R script for visualizing pH and pump activity in SHIME gut microbiome experiments.

data-visualization microbiome r research-tool shime

Last synced: 13 Sep 2025

https://github.com/jansim/ridges

R package for downloading and visualizing topographical elevation data.

data-visualization geospatial r ridgeline

Last synced: 02 Mar 2025

https://github.com/dcostachar/bellabeat-case-study

An analysis of Fitbit Fitness Tracker data with R to examine user behaviour and conduct a competitor analysis to optimize Bellabeat's product marketing strategies.

consumer-behaviour-analysis data-visualization exploratory-data-analysis ggplot2 health-data marketing-analytics r statistical-analysis tidyverse

Last synced: 02 Apr 2025

https://github.com/albanecoiffe/uber_data_visu_streamlit

Tableau de bord interactif avec Streamlit permettant d'explorer les données des trajets Uber de janvier 2015 à New York.

data-visualization streamlit

Last synced: 02 May 2026

https://github.com/01110011011101010110010001101111/tigergraph_cosmos_template

Template for TigerGraph and Cosmograph Projects with pyTigerGraph, Fast API, and Cosmos

cosmograph data-visualization tigergraph

Last synced: 26 Mar 2025

https://github.com/nero103/airbnb-destination

This is and end-to-end project to uncover the ideal destination based on listings and hosts. Strategy included: Data workflow-SQL analysis-Data modeling-Data Visualization-Findings

data-analysis data-modeling data-visualization etl etl-pipeline excel microsoft-sql-server powerpoint sql tableau

Last synced: 27 Mar 2026

https://github.com/mohsinraza2999/new-york-taxi-fare-analysis

This project analyzes and predicts taxi fares estimate fares in advance using Regression Analysis. Conducted EDA, hypothesis testing, to identify key variables. Developed ML models (Random Forest, XGBoost) with GridSearchCV for hyperparameter tuning to predict generous tip giver accurately.

ab-testing data-un data-visualization exploratory-data-analysis fea random-forest regression-analysis sklearn xgboost

Last synced: 17 May 2026

https://github.com/jianxi-erin/bigdata-machinelearning-lab

本项目是一个综合性的大数据与机器学习实验平台,包含两个主要任务,每个任务涵盖三个关键技术模块:大数据处理、数据分析和机器学习。项目基于真实的竞赛设计,提供完整的数据处理模拟和建模实践。

data-analysis data-visualization hadoop machine-learning python spark sql

Last synced: 03 May 2026