Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Data visualization
Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.
- GitHub: https://github.com/topics/data-visualization
- Wikipedia: https://en.wikipedia.org/wiki/Data_visualization
- Created by: Charles Joseph Minard
- Aliases: dataviz,
- Last updated: 2024-11-08 00:07:04 UTC
- JSON Representation
https://github.com/mzprog/datatables
create a better tables with few lines of code.
data-visualization datatables laravel-package livewire
Last synced: 14 Oct 2024
https://github.com/manju1807/advanced-weather-app-nextjs
Advanced Weather App | Next.js 14 Weather Dashboard
advanced-weather-app data-visualization interactive-maps nextjs-14 openweather-api react reactjs real-time-weather redux-toolkit server-side-rendering tailwindcss typescript weather-app weather-app-react weather-application weather-dashboard weather-forecast weather-monitoring
Last synced: 08 Nov 2024
https://github.com/abhipatel35/diabetes_ml_classification
Predict diabetes using machine learning models. Experiment with logistic regression, decision trees, and random forests to achieve accurate predictions based on health indicators. Complete lifecycle of ML project included.
classification data-analysis data-science data-visualization descision-tree diabetes-prediction jupiter-notebook logistic-regression machine-learning model-evaluation open-source pandas pycharm-ide python random-forest scikit-learn
Last synced: 31 Oct 2024
https://github.com/cfelde/raspberry-pi-temperature
Raspberry Pi temperature vs room temperature
data-visualization kotlin raspberry-pi sigbla temperature-sensor
Last synced: 29 Oct 2024
https://github.com/saikiran76/titanicdata-analysis-eda
In this notebook, we're going to analyse the famous Titanic dataset from Kaggle. The dataset is meant for supervised machine learning, but we're only going to do some exploratory analysis at this stage. We'll try to answer some questions using metrics and EDA.:
analysis data-science data-visualization eda python
Last synced: 08 Nov 2024
https://github.com/smahala02/materials-science-introduction
Introduction to Materials Science concepts using Python for array manipulation and visualization with NumPy and Matplotlib.
data-visualization materials-science matplotlib numpy python scientific-computing
Last synced: 31 Oct 2024
https://github.com/kamiviolet/d3_collections
As decribed, all kinds of chart and data visualisation with D3
Last synced: 06 Nov 2024
https://github.com/cainmagi/dash-json-grid
Dash porting version of the react project React JSON Grid. Provide structured and nested grid table view of complicated JSON objects/arrays.
dash data-visualization json json-table json-viewer plotly-dash python python-dash python-libary python3
Last synced: 09 Oct 2024
https://github.com/samkazan/fraud-detection-ml
Machine learning models for enhanced fraud detection in e-commerce transactions, exploring feature engineering, distance prediction, and clustering analysis.
clustering data-science data-visualization dataanalytics dbscan eda hierarchical-clustering kmeans-clustering knn-imputer matplotlib mlxtend python scikit-learn seaborn xgboost
Last synced: 05 Nov 2024
https://github.com/abeltavares/postql
Python library and command-line interface (CLI) tool for interacting with PostgreSQL databases, providing simplified database management, query execution, and result export functionalities.
cli command-line-interface data-analysis data-engineering data-export data-management data-processing data-visualization database database-administration database-tools etl oop postgres postgresql psycopg2 python sql sqlalchemy wrapper
Last synced: 31 Oct 2024
https://github.com/amarianowski/angular-scores-chart
json-server data visualised in d3js diagrams. Built with Angular
angular charts d3 d3-visualization d3js data-visualization frontend javascript json-server typescript typescript-library visualization
Last synced: 27 Oct 2024
https://github.com/anshuthopsee/india-mutual-fund
A simple web app to view the historical NAV of Indian Mutual Funds. You can search for a fund by name and view its historical NAV in a chart and also see the percentage change in the NAV over time. Site built with React, Material UI and D3. Data source: https://mfapi.in
api chart d3 data-visualization finance graph india material-ui mutual-funds net-asset-value react redux redux-toolkit reduxtoolkit stock-data stock-market stocks
Last synced: 27 Oct 2024
https://github.com/prsdthkr/viz-design-demo
📈 This repo houses my experiments with D3 (mostly inspired by other's work) for information visualization class project and demo.
Last synced: 16 Oct 2024
https://github.com/coslynx/project-1723965725283-5jnlm6
Project: Initial Prototype for Personalized Content Recommendation. Created at https://spectra.codes, which is owned by @Drix10
chartjs code-generation community-features data-visualization developer-tools devops fitness-tracker goal-setting machine-learning mvp mvp-development next-auth nextjs postgresql prisma progress-tracking software-development typescript user-interface zustand
Last synced: 15 Oct 2024
https://github.com/coslynx/fittrack-ipgtb7
Project: Prototype for Seamless User Experience. Created at https://spectra.codes, which is owned by @Drix10
api-integration code-generation data-visualization developer-tools devops fitness-community fitness-tracker goal-setting machine-learning mvp nextjs postgresql progress-tracking react social-sharing software-development tailwindcss typescript user-authentication zustand
Last synced: 15 Oct 2024
https://github.com/tanaybhadula/avacado-analytics
It is a dashboard built using Dash.It shows graphs of dataset about sales and prices of avocados in the United States between 2015 and 2018.
css dash data-analytics data-visualization python
Last synced: 11 Oct 2024
https://github.com/tanaybhadula/twitter-trends-dashboard
An interactive dashboard to visualizes data on current Twitter trends by country and globally. Collects data of over 60 countries using the python Tweepy library, processed it,and visualized it in the form of bar chart and pie chart using the Plotly Dash framework.
dash dashboard data-analysis data-visualization plotly python trends twitter
Last synced: 11 Oct 2024
https://github.com/stynw7/data_mining
Provides Computer Science material especially at Database Major which is Data Mining ⛏️⛏️
classification clustering data-mining data-visualization database outlier-detection r-programming
Last synced: 13 Oct 2024
https://github.com/qubitpi/peitho-data
An opinionated Python package on Big Data Analytics & Machine Learning
artificial-intelligence data-visualization machine-learning python3
Last synced: 27 Oct 2024
https://github.com/eudesgccunha/desafios-dnc
Desafios desenvolvidos durante a formação em Cientista de Dados da Escola DNC.
big-data classification-algorithm clustering data data-analysis data-science data-visualization database excel machine-learning nlp powerbi python recommendation-system regression-models sql
Last synced: 29 Sep 2024
https://github.com/tsbarr/belly-button-challenge
I built an interactive dashboard to explore the Belly Button Biodiversity dataset, which catalogs the microbes that colonize human navels.
data data-visualization javascript
Last synced: 08 Nov 2024
https://github.com/docuvesta/retinoids-skincare-analysis
Using data to learn about retinoids in skincare 🧴
cosmetics cream data-analysis-python data-visualization eyecare mask notion products retinoid retinol serum skincare streamlit
Last synced: 12 Oct 2024
https://github.com/adadalshabab/data-engineering-gcp-project
An end-to-end modern data engineering project, including deployment of ETL pipeline on Google Cloud Platform, using BigQuery for data analysis and leveraging Looker to generate an insight dashboard.
bigquery data data-science data-visualization databases dataengineering-a engineering etl-pipeline looker-studio powerbi
Last synced: 31 Oct 2024
https://github.com/vvipjain/hockey-tournament-analysis
Hockey Tournament Analysis
beautifulsoup data data-analysis data-visualization databases pandas pandas-dataframe powerbi python python-library python-script requests-library-python sql sql-server sqlalchemy
Last synced: 31 Oct 2024
https://github.com/panagiotischaviaropoulos/google-data-analytics-case-study
bigquery data-visualization sql
Last synced: 13 Oct 2024
https://github.com/moeabbas6/dbt_analytics_engine
An end-to-end project using dbt to demonstrate data transformations, testing, and visualization with Google BigQuery, and Looker Studio. It showcases a complete data pipeline from extraction/generation to deployment.
analytics-engineering bigquery data data-pipeline data-transformation data-visualization dbt testing
Last synced: 12 Oct 2024
https://github.com/irsol/udacity-data-foundations-nd
data data-analysis data-visualization exel sql udacity udacity-data udacity-nanodegree
Last synced: 11 Oct 2024
https://github.com/suryakaranraja/listview-application-for-zoho-interview
This repository has the files of basic application for designed to retrieve details about the top 50 happiest countries in the world for the year 2022.
data-visualization desktop desktop-app happiness-score interview-with-zoho list-view surface-tablet winui3
Last synced: 12 Oct 2024
https://github.com/manishkr1754/food_delivery_dashboard_tableau
Food Delivery Company Sales Analytics using Tableau
analytics business-analytics business-intelligence dashboard data-visualization meal-delivery-service tableau tableau-dashboards tableau-prep tableau-story
Last synced: 23 Oct 2024
https://github.com/vidyadnina/cyclistic-sql-tableau-project
Trip data analysis for a bike-sharing service company using SQL and Tableau.
bigquery dashboard data-analysis data-analytics-sql data-cleaning data-visualization sql
Last synced: 12 Oct 2024
https://github.com/manishkr1754/loan_eligibility_prediction_python
Predict Loan Eligibility for a Finance company
bivariate-analysis data-visualization decision-trees evaluation-metrics exploratory-data-analysis feature-engineering hypothesis-generation logistic-regression matplotlib model-building numpy pandas random-forest seaborn sklearn univariate-analysis xgboost
Last synced: 23 Oct 2024
https://github.com/manishkr1754/food_delivery_dashboard_powerbi
Food Delivery Company Sales Analytics using PowerBI
analytics business-intelligence dashboard data-visualization dax meal-delivery-service pareto pareto-principle powerbi powerbi-report
Last synced: 23 Oct 2024
https://github.com/timjjting/escaping-flatland
A demo of optimization techniques for plotting large datasets in a 3D space using Three.js + Svelte integration
big-data data-visualization glsl-shaders lod octree svelte sveltekit three-js
Last synced: 27 Oct 2024
https://github.com/manishkr1754/nifty50_data_analysis_nsetools_nsepy_python
NIFTY50 Data Analysis from scratch (Data Extraction & Visualization to Investment Insights)
candlestick-chart cufflinks data-analysis data-visualization heikin-ashi interactive-visualizations moving-average nsepy nsetools plotly portfolio-analysis portfolio-analytics portfolio-and-investment-analysis python sharpe-ratio technical-analysis
Last synced: 23 Oct 2024
https://github.com/mlund2k/project-1-baseball-performance-vs.-attendance
Project assets for my first exploratory data analysis: Baseball Performance vs. Attendance.
bigquery data-analysis data-cleaning data-visualization excel rstudio sql tableau tidyverse
Last synced: 12 Oct 2024
https://github.com/dharininadkar/covid-data-dashboard
Data Analysis of Covid-19 Cases
data-analysis data-mining data-science data-visualization ms-excel ms-sql-server
Last synced: 12 Oct 2024
https://github.com/dharininadkar/movies-data-dashboard
Data Analysis of Movies data
data-analysis data-mining data-science data-visualization ms-excel ms-sql-server tableau
Last synced: 12 Oct 2024
https://github.com/melvinjwallace/melvinjw.github.io
A portfolio of a host of projects completed using python and sql.
data data-analysis data-cleaning data-loading data-mining data-preparation data-processing data-science data-transformation data-visualization dataset matplotlib microsoft-sql-server pandas-python seaborn
Last synced: 12 Oct 2024
https://github.com/patricialjohnson/sql-database-project
SQL Music Database Store
business-analytics chinook-database data-analytics data-visualization database kaggle kaggle-dataset microsoft-excel microsoft-sql-server sql sqlite
Last synced: 12 Oct 2024
https://github.com/lewismakau/portfolio-projects
This repository contains file data and SQL files for projects used for my Portfolio.
data-analysis data-cleaning data-structures data-visualization database google-analytics microsoft-sql-server mysql powerbi tableau
Last synced: 12 Oct 2024
https://github.com/1401dev/iowa-liquor-retail-sales-analysis
This repository contains the analysis of Iowa liquor retail sales data, aimed at uncovering sales trends and forecasting future sales patterns. The project involves data cleaning, preparation, and advanced time series analysis using Microsoft SQL Server and Google Colab.
customer-behavior data-analysis data-cleaning data-science data-visualization exploratory-data-analysis forecasting google-colab machine-learning microsoft-sql-server pandas prophet python retail-analytics retail-sales sales-forecasting sales-performance sql statsmodels time-series-analysis
Last synced: 12 Oct 2024
https://github.com/anhmiuhv/android_realtime_graph_view
An small android module to display a data stream in realtime
android data-stream data-visualization graph realtime
Last synced: 29 Oct 2024
https://github.com/kishorereddypudi/codex-energy-drink-analysis
Codex-energy-drink-analysis
dashboard data-visualization dataanalysis database mssql powerbi
Last synced: 12 Oct 2024
https://github.com/alinababer/data-science-and-insight-agent-rag-llama3-lava-llm
Data-Science-and-Insight-Agent-RAG-LLama3-Lava-LLM-Django-WebApplication is an advanced AI-driven chatbot designed to assist in data science, document analysis, and image interpretation. This repository contain the Datascience Agent of this project.
artificial-neural-networks classifcation data-analysis data-engineering data-visualization datascience large-language-models llama2 lstm machine-learning python random-forest regression
Last synced: 12 Oct 2024
https://github.com/Sparsh7082/Data-Analysis-Portfolio
This repository is dedicated to showcasing my skills, sharing projects, and tracking my progress in Data Analytics and Data Science.
canva data-analytics data-manipulation data-visualization database-querying google-slides power-bi power-point presentation-tools programming-language python r spreadsheets sql tableau
Last synced: 23 Oct 2024
https://github.com/faizan-khan-iit/dm_specialization
Data Mining Specialization on Coursera
coursera data-science data-visualisation data-visualization data-viz r
Last synced: 27 Oct 2024
https://github.com/faizan-khan-iit/data_viz_assignment
Data Visualization Assignment on Coursera
coursera data-visualization graph network-graph network-visualization r r-graphics visnetwork
Last synced: 27 Oct 2024
https://github.com/elaaatif/data-visualisation
This project aims to visualize the popularity of programming languages on GitHub from 2011 to 2021. We use data obtained from BigQuery's public `github_repos` and `githubarchive` datasets, focusing on public repositories, pull requests (PRs), and issues.
css d3-visualization d3js data-visualization
Last synced: 06 Nov 2024
https://github.com/abhash-rai/analyzing-credit-card-eligibility
This work was performed as part of BCU undergraduate course.
data-analysis data-visualization ggplot ggplot2 latex r
Last synced: 02 Nov 2024
https://github.com/ineelhere/shinyDwight
A dashboard depicting the great battle of Dwight Schrute and the Computer | The Office
bslib data-visualization imola r rshiny rshiny-application sass server shiny-apps ui-components
Last synced: 02 Nov 2024
https://github.com/tynandebold/daylight
Amount of daylight in select locations around the world.
data-visualization data-viz daylight javascript react time
Last synced: 13 Oct 2024
https://github.com/dmarks84/ind_project_obesity-multi-class-classification--kaggle
Independent Project - Kaggle Competition -- I worked on the obesity classification data set as part of a Kaggle Competition of the same name, scoring (for accuracy) above 0.9
classification correlation-analysis cross-validation data-modeling data-visualization dataframes eda gridsearchcv matplotlib multiclass-classification numpy pandas python seaborn sklearn statistics supervised-ml
Last synced: 05 Nov 2024
https://github.com/dmarks84/ind_project_data-science-london-scikit-learn--kaggle
Independent Project - Kaggle Competition -- I worked on the Data Science London data set for the Data Science London + Scikit-learn competition.
classification cross-validation data-modeling data-reporting data-visualization dataframes eda grid-search matplotlib numpy pandas python sklearn statistics supervised-ml
Last synced: 05 Nov 2024
https://github.com/vshelke/databot
:robot: A databot made for jaano india iniative.
data-visualization flask-application search-engine
Last synced: 29 Oct 2024
https://github.com/kaczmarj/car-safety-shiny
An R Shiny app -- final project for BMI 530
cars data-visualization nhtsa shiny visualization
Last synced: 27 Oct 2024
https://github.com/xilinjia/csv-analyser
Visualize and analyze data in csv files. A multiplatform project runnable on Linux, Android, Windows, MacOS
android compose csv data-science data-visualization kotlin kotlin-multiplatform linux macos multiplatform windows
Last synced: 29 Oct 2024
https://github.com/shoebjoarder/superstore
A Dash app to analyze Superstore dataset.
dashboard data-analysis data-visualization python-3
Last synced: 27 Oct 2024
https://github.com/dmarks84/ind_project_superstore-sales-time-series-analysis--kaggle
Independent Project - Kaggle Dataset-- I worked on the Superstore Sales Dataset, performing (as Part 1) data cleaning and preparation and exploratory data analysis. The main task was to make predictions for future sales based on time-series analysis, which is found in Part 2.
chloropleth data-modeling data-visualization eda linear-regression matplotlib numpy pandas python seaborn sklearn statistics statsmodels supervised-ml time-series-analysis
Last synced: 05 Nov 2024
https://github.com/tynandebold/day-length-line-chart
A day-length line chart, charting the length of daylight from different locations around the world.
d3 d3-visualization d3js data-visualization data-viz
Last synced: 13 Oct 2024
https://github.com/dmarks84/ind_project_california-housing-data--kaggle
Independent Project - Kaggle Dataset-- I worked on the California Housing dataset, performing data cleaning and preparation; exploratory data analysis; feature engineering; regression model buildings; model evaluation.
cross-validation data-modeling data-reporting data-visualization eda folium grid-search matplotlib model-evaluation numpy pandas pca python seaborn sklearn statistics supervised-ml unsupervised-ml
Last synced: 05 Nov 2024
https://github.com/markjacksonfishing/pipedreams
A play on pipelines, with a focus on making data accessible and insightful.
backend data-engineering data-processing data-visualization deployment etl frontend machine-learning python streamlit
Last synced: 02 Nov 2024
https://github.com/aran203/cricanalytics
ADSC Fall 24 Project for cricket analytics with hawkeye data
data-engineering data-visualization python streamlit
Last synced: 02 Nov 2024
https://github.com/karlyndiary/coffee-shop-sales-analysis
Analyzing coffee shop sales using Pandas for data cleaning and exploratory data analysis (EDA), and Streamlit for data visualization dashboards.
data-analysis data-cleaning data-preprocessing data-visualization eda pandas streamlit streamlit-dashboard
Last synced: 02 Nov 2024
https://github.com/tralahm/parliament-2017-dataset
Concise, Clean data sets of the 2017 Kenyan General Election results for the Members of the Senate and National Assembly Composition
csv-parsing data-analysis data-visualization datasets election-data ipynb-jupyter-notebook kaggle-dataset kenya-constituencies kenya-counties matplotlib python3 tralahtek
Last synced: 05 Nov 2024
https://github.com/chandkund/spam-email-detection
This project focuses on detecting spam emails using a fine-tuned DistilBERT model, a lighter version of the BERT model. The model is trained to classify email text into two categories: spam (1) and not spam (0). The dataset consists of email texts labeled as either spam or non-spam.
data-visualization datapreprocessing matplotlib pandas python pytorch sklearn transformer
Last synced: 03 Nov 2024
https://github.com/karlyndiary/smartphone-price-analytics
A data pipeline for analyzing smartphone pricing by retrieving data from Flipkart using RapidAPI, transforming it, and visualizing insights using SQL Server and Excel.
beautifulsoup data-analysis data-pipeline data-visualization data-visualization-dashboard etl microsoft microsoft-excel microsoft-sql-server python smartphone-price-analysis
Last synced: 12 Oct 2024
https://github.com/coslynx/fit-track-mvp
Project: Track fitness goals, log workouts, and share progress with friends.. Created at https://coslynx.com
api-integration code-generation data-visualization developer-tools devops fitness-tracker goal-setting machine-learning mvp mvp-development nextjs postgresql prisma progress-tracking social-features software-development tailwindcss typescript user-authentication zustand
Last synced: 03 Nov 2024
https://github.com/chandkund/housing-price-prediction
Predict housing prices using the Boston Housing Dataset. Covers data loading, cleaning, preprocessing, EDA, normalization, standardization, and regression models (Linear Regression, Decision Tree, Random Forest, Extra Trees). Evaluated with Mean Squared Error (MSE). Tech: Python, Pandas, NumPy, Scikit-learn, Seaborn, Matplotlib.
data-science data-visualization matplotlib numpy pandas pyhton sklearn sklearn-library sklearn-metrics
Last synced: 03 Nov 2024
https://github.com/bhavinpatel4199/artificial-intelligence--algorithm-and-mathematics
This repository focuses on AI with an emphasis on algorithms and mathematical foundations. It includes projects on data processing, fundamental AI algorithms, and mathematical concepts like linear algebra and optimization. Hands-on work with various frameworks provides practical model-building experience.
algorithms-and-data-structures data-structures data-visualization mathematic probability problem-solving python3 sklearn
Last synced: 03 Nov 2024
https://github.com/bhavinpatel4199/machine-learning-programming
This repository serves as a central hub for various machine learning projects and experiments. It contains multiple sub-repositories, each focusing on different aspects of machine learning, from data preprocessing to advanced deep learning techniques.
data-structures data-visualization machine-learning machine-learning-algorithms pandas-dataframe python3 sklearn
Last synced: 03 Nov 2024
https://github.com/badranalyst/student-tests-data-analysis-application
Python-based analysis of student test scores in math, reading, and writing, examining correlations with parental education, lunch type, and test preparation. Includes data cleaning, visualization, and statistical insights into factors influencing academic performance.
data-analysis data-visualization dataset matplotlib numpy pandas python sklearn
Last synced: 03 Nov 2024
https://github.com/jaguzmana/colombia-covid-analysis
A project proposed to enhance SQL proficiency and develop skills in data visualization using Tableau.
data-visualization mssql-database tableau
Last synced: 12 Oct 2024
https://github.com/coslynx/digital-library-backend-mvp-streamlined
A core library management system enabling book cataloging and user account management... Created at https://coslynx.com
api-development book-cataloging borrowing-system code-generation data-visualization developer-tools devops digital-library fastapi jwt-authentication library-management machine-learning mvp mvp-backend postgresql python software-development sqlalchemy usage-analytics user-authentication
Last synced: 31 Oct 2024
https://github.com/rohetoric/text-vector-visualisation
Website: https://rohetoric.github.io/text-vector-visualisation/
data-science data-visualization fasttext fasttext-embeddings machine-learning python3 spacy spacy-nlp tensorflow tensorflow-examples tensorflow-experiments tensorflow-tutorials tensorflow1 tensorflow2
Last synced: 03 Nov 2024
https://github.com/sanju-srivatsa/vizcraft-data-science-app
Please find the Demo Link for the DS App
data-science data-visualization eda pandas plotly python streamlit
Last synced: 03 Nov 2024
https://github.com/suhailsallam/tips_dashboard
Dashboard using Python & Streamlit
dashboard data-analysis data-analytics data-science data-scientist data-visualization python streamlit streamlit-dashboard streamlit-webapp
Last synced: 03 Nov 2024
https://github.com/emilyjspencer/coronavirus-daily-deaths-chart.js
Visualizing the Uk's daily deaths from Coronavirus between March and June 2020
Last synced: 30 Oct 2024
https://github.com/emilyjspencer/data-visualizations
📊 Creating data visualisations with Chart.js
Last synced: 30 Oct 2024
https://github.com/lukakerr/us-surnames
US Surname data visualisation using R. Displays top 25 US surnames and race/ethnic percentage per name.
Last synced: 30 Oct 2024
https://github.com/zayarhtet/visualising-election-with-data
Visualisation of Myanmar Election 2020 with past data
data-mining data-science data-visualization
Last synced: 06 Nov 2024
https://github.com/vi/csvdimreduce
Command-line tool to run a dimensionality reduction algorithm on CSV files
cli command-line-tool csv csv-files data-science data-visualization dimension-reduction dimensionality-reduction
Last synced: 16 Oct 2024
https://github.com/mayurasandakalum/ipl-data-engineering-spark-databricks
Comprehensive data engineering and analytics project using IPL dataset with Amazon S3, Apache Spark, Databricks, and SQL. Includes data storage, transformation, analysis, and visualization.
amazon-s3 apache-spark aws big-data cricket-analytics data-analytics data-engineering data-visualization databricks etl-pipeline ipl-dataset machine-learning python sql
Last synced: 12 Oct 2024
https://github.com/dissorial/prx21_erikz
Analysis of self-tracked data: interactive visualizations & predictive algorithms
analytics data-analysis data-science data-visualization machine-learning matplotlib pandas python python3 visualization
Last synced: 30 Oct 2024
https://github.com/ayabh/portfolio
This repository is to show my Data Analytics, Science & Engineering skills, share projects, and track my progress.
aws data-engineering data-science data-visualization etl pyspark python sql
Last synced: 30 Oct 2024
https://github.com/pathak-ashutosh/sentiment-analysis-yelp-reviews
Perform sentiment analysis on Yelp dataset with Apache Spark
apache-spark big-data data-engineering data-pipeline data-visualization hadoop hdfs natural-language-processing pyspark sentiment-analysis spark-mllib spark-nlp spark-sql
Last synced: 12 Oct 2024
https://github.com/travelxml/apache-spark-pyspark-databricks
APACHE SPARK: Data Analysis, Transformation, and Visualisation with PySpark, IPL Data Analysis
apache-spark data-science data-visualization databricks databricks-notebooks dataframe ipl machine-learning pyspark pyspark-mllib pyspark-notebook pyspark-python pyspark-tutorial
Last synced: 12 Oct 2024
https://github.com/sayamalt/steel-energy-consumption-prediction-using-pyspark
Successfully established a machine learning model using PySpark which can precisely predict the energy consumption of the steel industry, up to an r2 score of approximately 99.5%.
apache-spark big-data-analytics big-data-processing cross-validation data-visualization exploratory-data-analysis hyperparameter-tuning machine-learning model-training-and-evaluation python regression spark sql
Last synced: 11 Oct 2024
https://github.com/shinjimc/simulated_annealing_tsp
This project applies Simulated Annealing to solve the Traveling Salesman Problem using Peru's departments as nodes. Through iterative refinement, it finds the shortest route visiting each department once. Visual feedback enhances understanding and debugging, resulting in an optimal solution displayed with total distance.
data-visualization geospatial-analysis simulated-annealing simulated-annealing-algorithm simulated-annealing-edge-detection traveling-salesman-problem traveling-salesman-problem-solver
Last synced: 06 Nov 2024
https://github.com/mariarodr1136/atmosphereanalyzer
Atmosphere Analyzer is a smart environmental monitoring system that simulates real-time sensor data using Python, integrates AWS S3 for storage, and employs AWS Lambda for processing. Its Django API serves a React frontend, providing an interactive dashboard for visualizing key environmental metrics. 🌡️
aws data-visualization django environmental-monitoring iot lambda python react real-time-data s3-bucket serverless sustainable-resourse
Last synced: 03 Nov 2024
https://github.com/olekscode/colorpalettes
Color palettes for Pharo
color color-palette data-visualization pharo smalltalk
Last synced: 31 Oct 2024
https://github.com/solygambas/d3-firebase
5 small projects to understand D3.js basics using Firebase and Materialize.
d3 d3js data-visualization firebase firestore javascript materialize materializecss
Last synced: 03 Nov 2024
https://github.com/stanleynguyen/so.cube
World map visualisation of World's Cube Association data 🌏
cas cube data-visualization leaftlet map
Last synced: 30 Oct 2024
https://github.com/37743/ml-starterkit
This project is designed to streamline the data science workflow by allowing users to input a file path and receive a comprehensive analysis tailored to their needs. The tool automates key stages in the data science pipeline, including Exploratory Data Analysis (EDA), data preprocessing, and model training.
data-preprocessing data-visualization exploratory-data-analysis machine-learning python
Last synced: 03 Nov 2024
https://github.com/bhiogade/customer-purchase-analysis
Comprehensive Customer Purchase Analysis Across Multiple Dimensions
data-analysis data-visualization tableau tableau-desktop
Last synced: 23 Oct 2024
https://github.com/bhiogade/tlc-trip-analysis
NYC Taxi and Limousine Commission (TLC) Trip Analysis
data-analysis data-cleaning data-collection data-visualization pandas-python tableau tableau-desktop
Last synced: 23 Oct 2024
https://github.com/bhiogade/hpi-analysis
House price index (HPI) Analysis
data-analysis data-cleaning data-visualization gathering-data numpy pandas-python
Last synced: 23 Oct 2024
https://github.com/shubhamdeepkeshav/visualization-on-tips
📊 Data visualization project analyzing tipping behavior in restaurants using Python. 🍽️ Explores insights based on ⏰ time, 👥 party size, 🧑🤝🧑 gender, and 🚬 smoker status with Matplotlib and Seaborn.
data-visualization dataanalysis eda matplotlib python seaborn
Last synced: 04 Nov 2024
https://github.com/kunalthakur204/visualization-on-flower
🌸 Flower Dataset Visualization Visualizing patterns and relationships in flower data through charts and plots. Perfect for exploring floral characteristics and trends! 📊
data data-visualization dataanalysis flowerdataset python
Last synced: 04 Nov 2024
https://github.com/allanreda/automated-k-means-clustering-engine
An interactive K-Means clustering tool built with Flask and Scikit-Learn, supporting Excel file uploads, cluster analysis, and data export, deployed on Google Cloud Run via Docker with CI/CD integration.
cicd css data-visualization deployment docker flask google-cloud-run html javascript k-means-clustering machine-learning matplotlib numpy pandas python scikit-learn
Last synced: 31 Oct 2024
https://github.com/aanastasiou/neoads
Abstract Data Structures over neo4j
data-modeling data-science data-structures data-visualization graph-theory ogm
Last synced: 30 Oct 2024