Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/mzprog/datatables

create a better tables with few lines of code.

data-visualization datatables laravel-package livewire

Last synced: 14 Oct 2024

https://github.com/abhipatel35/diabetes_ml_classification

Predict diabetes using machine learning models. Experiment with logistic regression, decision trees, and random forests to achieve accurate predictions based on health indicators. Complete lifecycle of ML project included.

classification data-analysis data-science data-visualization descision-tree diabetes-prediction jupiter-notebook logistic-regression machine-learning model-evaluation open-source pandas pycharm-ide python random-forest scikit-learn

Last synced: 31 Oct 2024

https://github.com/cfelde/raspberry-pi-temperature

Raspberry Pi temperature vs room temperature

data-visualization kotlin raspberry-pi sigbla temperature-sensor

Last synced: 29 Oct 2024

https://github.com/saikiran76/titanicdata-analysis-eda

In this notebook, we're going to analyse the famous Titanic dataset from Kaggle. The dataset is meant for supervised machine learning, but we're only going to do some exploratory analysis at this stage. We'll try to answer some questions using metrics and EDA.:

analysis data-science data-visualization eda python

Last synced: 08 Nov 2024

https://github.com/smahala02/materials-science-introduction

Introduction to Materials Science concepts using Python for array manipulation and visualization with NumPy and Matplotlib.

data-visualization materials-science matplotlib numpy python scientific-computing

Last synced: 31 Oct 2024

https://github.com/kamiviolet/d3_collections

As decribed, all kinds of chart and data visualisation with D3

charts d3 data-visualization

Last synced: 06 Nov 2024

https://github.com/cainmagi/dash-json-grid

Dash porting version of the react project React JSON Grid. Provide structured and nested grid table view of complicated JSON objects/arrays.

dash data-visualization json json-table json-viewer plotly-dash python python-dash python-libary python3

Last synced: 09 Oct 2024

https://github.com/samkazan/fraud-detection-ml

Machine learning models for enhanced fraud detection in e-commerce transactions, exploring feature engineering, distance prediction, and clustering analysis.

clustering data-science data-visualization dataanalytics dbscan eda hierarchical-clustering kmeans-clustering knn-imputer matplotlib mlxtend python scikit-learn seaborn xgboost

Last synced: 05 Nov 2024

https://github.com/abeltavares/postql

Python library and command-line interface (CLI) tool for interacting with PostgreSQL databases, providing simplified database management, query execution, and result export functionalities.

cli command-line-interface data-analysis data-engineering data-export data-management data-processing data-visualization database database-administration database-tools etl oop postgres postgresql psycopg2 python sql sqlalchemy wrapper

Last synced: 31 Oct 2024

https://github.com/anshuthopsee/india-mutual-fund

A simple web app to view the historical NAV of Indian Mutual Funds. You can search for a fund by name and view its historical NAV in a chart and also see the percentage change in the NAV over time. Site built with React, Material UI and D3. Data source: https://mfapi.in

api chart d3 data-visualization finance graph india material-ui mutual-funds net-asset-value react redux redux-toolkit reduxtoolkit stock-data stock-market stocks

Last synced: 27 Oct 2024

https://github.com/prsdthkr/viz-design-demo

📈 This repo houses my experiments with D3 (mostly inspired by other's work) for information visualization class project and demo.

d3 data-visualization lgbtq

Last synced: 16 Oct 2024

https://github.com/tanaybhadula/avacado-analytics

It is a dashboard built using Dash.It shows graphs of dataset about sales and prices of avocados in the United States between 2015 and 2018.

css dash data-analytics data-visualization python

Last synced: 11 Oct 2024

https://github.com/tanaybhadula/twitter-trends-dashboard

An interactive dashboard to visualizes data on current Twitter trends by country and globally. Collects data of over 60 countries using the python Tweepy library, processed it,and visualized it in the form of bar chart and pie chart using the Plotly Dash framework.

dash dashboard data-analysis data-visualization plotly python trends twitter

Last synced: 11 Oct 2024

https://github.com/stynw7/data_mining

Provides Computer Science material especially at Database Major which is Data Mining ⛏️⛏️

classification clustering data-mining data-visualization database outlier-detection r-programming

Last synced: 13 Oct 2024

https://github.com/qubitpi/peitho-data

An opinionated Python package on Big Data Analytics & Machine Learning

artificial-intelligence data-visualization machine-learning python3

Last synced: 27 Oct 2024

https://github.com/tsbarr/belly-button-challenge

I built an interactive dashboard to explore the Belly Button Biodiversity dataset, which catalogs the microbes that colonize human navels.

data data-visualization javascript

Last synced: 08 Nov 2024

https://github.com/adadalshabab/data-engineering-gcp-project

An end-to-end modern data engineering project, including deployment of ETL pipeline on Google Cloud Platform, using BigQuery for data analysis and leveraging Looker to generate an insight dashboard.

bigquery data data-science data-visualization databases dataengineering-a engineering etl-pipeline looker-studio powerbi

Last synced: 31 Oct 2024

https://github.com/moeabbas6/dbt_analytics_engine

An end-to-end project using dbt to demonstrate data transformations, testing, and visualization with Google BigQuery, and Looker Studio. It showcases a complete data pipeline from extraction/generation to deployment.

analytics-engineering bigquery data data-pipeline data-transformation data-visualization dbt testing

Last synced: 12 Oct 2024

https://github.com/suryakaranraja/listview-application-for-zoho-interview

This repository has the files of basic application for designed to retrieve details about the top 50 happiest countries in the world for the year 2022.

data-visualization desktop desktop-app happiness-score interview-with-zoho list-view surface-tablet winui3

Last synced: 12 Oct 2024

https://github.com/vidyadnina/cyclistic-sql-tableau-project

Trip data analysis for a bike-sharing service company using SQL and Tableau.

bigquery dashboard data-analysis data-analytics-sql data-cleaning data-visualization sql

Last synced: 12 Oct 2024

https://github.com/timjjting/escaping-flatland

A demo of optimization techniques for plotting large datasets in a 3D space using Three.js + Svelte integration

big-data data-visualization glsl-shaders lod octree svelte sveltekit three-js

Last synced: 27 Oct 2024

https://github.com/mlund2k/project-1-baseball-performance-vs.-attendance

Project assets for my first exploratory data analysis: Baseball Performance vs. Attendance.

bigquery data-analysis data-cleaning data-visualization excel rstudio sql tableau tidyverse

Last synced: 12 Oct 2024

https://github.com/lewismakau/portfolio-projects

This repository contains file data and SQL files for projects used for my Portfolio.

data-analysis data-cleaning data-structures data-visualization database google-analytics microsoft-sql-server mysql powerbi tableau

Last synced: 12 Oct 2024

https://github.com/1401dev/iowa-liquor-retail-sales-analysis

This repository contains the analysis of Iowa liquor retail sales data, aimed at uncovering sales trends and forecasting future sales patterns. The project involves data cleaning, preparation, and advanced time series analysis using Microsoft SQL Server and Google Colab.

customer-behavior data-analysis data-cleaning data-science data-visualization exploratory-data-analysis forecasting google-colab machine-learning microsoft-sql-server pandas prophet python retail-analytics retail-sales sales-forecasting sales-performance sql statsmodels time-series-analysis

Last synced: 12 Oct 2024

https://github.com/anhmiuhv/android_realtime_graph_view

An small android module to display a data stream in realtime

android data-stream data-visualization graph realtime

Last synced: 29 Oct 2024

https://github.com/alinababer/data-science-and-insight-agent-rag-llama3-lava-llm

Data-Science-and-Insight-Agent-RAG-LLama3-Lava-LLM-Django-WebApplication is an advanced AI-driven chatbot designed to assist in data science, document analysis, and image interpretation. This repository contain the Datascience Agent of this project.

artificial-neural-networks classifcation data-analysis data-engineering data-visualization datascience large-language-models llama2 lstm machine-learning python random-forest regression

Last synced: 12 Oct 2024

https://github.com/Sparsh7082/Data-Analysis-Portfolio

This repository is dedicated to showcasing my skills, sharing projects, and tracking my progress in Data Analytics and Data Science.

canva data-analytics data-manipulation data-visualization database-querying google-slides power-bi power-point presentation-tools programming-language python r spreadsheets sql tableau

Last synced: 23 Oct 2024

https://github.com/elaaatif/data-visualisation

This project aims to visualize the popularity of programming languages on GitHub from 2011 to 2021. We use data obtained from BigQuery's public `github_repos` and `githubarchive` datasets, focusing on public repositories, pull requests (PRs), and issues.

css d3-visualization d3js data-visualization

Last synced: 06 Nov 2024

https://github.com/abhash-rai/analyzing-credit-card-eligibility

This work was performed as part of BCU undergraduate course.

data-analysis data-visualization ggplot ggplot2 latex r

Last synced: 02 Nov 2024

https://github.com/ineelhere/shinyDwight

A dashboard depicting the great battle of Dwight Schrute and the Computer | The Office

bslib data-visualization imola r rshiny rshiny-application sass server shiny-apps ui-components

Last synced: 02 Nov 2024

https://github.com/tynandebold/daylight

Amount of daylight in select locations around the world.

data-visualization data-viz daylight javascript react time

Last synced: 13 Oct 2024

https://github.com/dmarks84/ind_project_obesity-multi-class-classification--kaggle

Independent Project - Kaggle Competition -- I worked on the obesity classification data set as part of a Kaggle Competition of the same name, scoring (for accuracy) above 0.9

classification correlation-analysis cross-validation data-modeling data-visualization dataframes eda gridsearchcv matplotlib multiclass-classification numpy pandas python seaborn sklearn statistics supervised-ml

Last synced: 05 Nov 2024

https://github.com/dmarks84/ind_project_data-science-london-scikit-learn--kaggle

Independent Project - Kaggle Competition -- I worked on the Data Science London data set for the Data Science London + Scikit-learn competition.

classification cross-validation data-modeling data-reporting data-visualization dataframes eda grid-search matplotlib numpy pandas python sklearn statistics supervised-ml

Last synced: 05 Nov 2024

https://github.com/vshelke/databot

:robot: A databot made for jaano india iniative.

data-visualization flask-application search-engine

Last synced: 29 Oct 2024

https://github.com/kaczmarj/car-safety-shiny

An R Shiny app -- final project for BMI 530

cars data-visualization nhtsa shiny visualization

Last synced: 27 Oct 2024

https://github.com/xilinjia/csv-analyser

Visualize and analyze data in csv files. A multiplatform project runnable on Linux, Android, Windows, MacOS

android compose csv data-science data-visualization kotlin kotlin-multiplatform linux macos multiplatform windows

Last synced: 29 Oct 2024

https://github.com/shoebjoarder/superstore

A Dash app to analyze Superstore dataset.

dashboard data-analysis data-visualization python-3

Last synced: 27 Oct 2024

https://github.com/dmarks84/ind_project_superstore-sales-time-series-analysis--kaggle

Independent Project - Kaggle Dataset-- I worked on the Superstore Sales Dataset, performing (as Part 1) data cleaning and preparation and exploratory data analysis. The main task was to make predictions for future sales based on time-series analysis, which is found in Part 2.

chloropleth data-modeling data-visualization eda linear-regression matplotlib numpy pandas python seaborn sklearn statistics statsmodels supervised-ml time-series-analysis

Last synced: 05 Nov 2024

https://github.com/tynandebold/day-length-line-chart

A day-length line chart, charting the length of daylight from different locations around the world.

d3 d3-visualization d3js data-visualization data-viz

Last synced: 13 Oct 2024

https://github.com/dmarks84/ind_project_california-housing-data--kaggle

Independent Project - Kaggle Dataset-- I worked on the California Housing dataset, performing data cleaning and preparation; exploratory data analysis; feature engineering; regression model buildings; model evaluation.

cross-validation data-modeling data-reporting data-visualization eda folium grid-search matplotlib model-evaluation numpy pandas pca python seaborn sklearn statistics supervised-ml unsupervised-ml

Last synced: 05 Nov 2024

https://github.com/markjacksonfishing/pipedreams

A play on pipelines, with a focus on making data accessible and insightful.

backend data-engineering data-processing data-visualization deployment etl frontend machine-learning python streamlit

Last synced: 02 Nov 2024

https://github.com/aran203/cricanalytics

ADSC Fall 24 Project for cricket analytics with hawkeye data

data-engineering data-visualization python streamlit

Last synced: 02 Nov 2024

https://github.com/karlyndiary/coffee-shop-sales-analysis

Analyzing coffee shop sales using Pandas for data cleaning and exploratory data analysis (EDA), and Streamlit for data visualization dashboards.

data-analysis data-cleaning data-preprocessing data-visualization eda pandas streamlit streamlit-dashboard

Last synced: 02 Nov 2024

https://github.com/tralahm/parliament-2017-dataset

Concise, Clean data sets of the 2017 Kenyan General Election results for the Members of the Senate and National Assembly Composition

csv-parsing data-analysis data-visualization datasets election-data ipynb-jupyter-notebook kaggle-dataset kenya-constituencies kenya-counties matplotlib python3 tralahtek

Last synced: 05 Nov 2024

https://github.com/chandkund/spam-email-detection

This project focuses on detecting spam emails using a fine-tuned DistilBERT model, a lighter version of the BERT model. The model is trained to classify email text into two categories: spam (1) and not spam (0). The dataset consists of email texts labeled as either spam or non-spam.

data-visualization datapreprocessing matplotlib pandas python pytorch sklearn transformer

Last synced: 03 Nov 2024

https://github.com/karlyndiary/smartphone-price-analytics

A data pipeline for analyzing smartphone pricing by retrieving data from Flipkart using RapidAPI, transforming it, and visualizing insights using SQL Server and Excel.

beautifulsoup data-analysis data-pipeline data-visualization data-visualization-dashboard etl microsoft microsoft-excel microsoft-sql-server python smartphone-price-analysis

Last synced: 12 Oct 2024

https://github.com/chandkund/housing-price-prediction

Predict housing prices using the Boston Housing Dataset. Covers data loading, cleaning, preprocessing, EDA, normalization, standardization, and regression models (Linear Regression, Decision Tree, Random Forest, Extra Trees). Evaluated with Mean Squared Error (MSE). Tech: Python, Pandas, NumPy, Scikit-learn, Seaborn, Matplotlib.

data-science data-visualization matplotlib numpy pandas pyhton sklearn sklearn-library sklearn-metrics

Last synced: 03 Nov 2024

https://github.com/bhavinpatel4199/artificial-intelligence--algorithm-and-mathematics

This repository focuses on AI with an emphasis on algorithms and mathematical foundations. It includes projects on data processing, fundamental AI algorithms, and mathematical concepts like linear algebra and optimization. Hands-on work with various frameworks provides practical model-building experience.

algorithms-and-data-structures data-structures data-visualization mathematic probability problem-solving python3 sklearn

Last synced: 03 Nov 2024

https://github.com/bhavinpatel4199/machine-learning-programming

This repository serves as a central hub for various machine learning projects and experiments. It contains multiple sub-repositories, each focusing on different aspects of machine learning, from data preprocessing to advanced deep learning techniques.

data-structures data-visualization machine-learning machine-learning-algorithms pandas-dataframe python3 sklearn

Last synced: 03 Nov 2024

https://github.com/badranalyst/student-tests-data-analysis-application

Python-based analysis of student test scores in math, reading, and writing, examining correlations with parental education, lunch type, and test preparation. Includes data cleaning, visualization, and statistical insights into factors influencing academic performance.

data-analysis data-visualization dataset matplotlib numpy pandas python sklearn

Last synced: 03 Nov 2024

https://github.com/jaguzmana/colombia-covid-analysis

A project proposed to enhance SQL proficiency and develop skills in data visualization using Tableau.

data-visualization mssql-database tableau

Last synced: 12 Oct 2024

https://github.com/emilyjspencer/coronavirus-daily-deaths-chart.js

Visualizing the Uk's daily deaths from Coronavirus between March and June 2020

chartjs data-visualization

Last synced: 30 Oct 2024

https://github.com/emilyjspencer/data-visualizations

📊 Creating data visualisations with Chart.js

chartjs data-visualization

Last synced: 30 Oct 2024

https://github.com/lukakerr/us-surnames

US Surname data visualisation using R. Displays top 25 US surnames and race/ethnic percentage per name.

data data-visualization r

Last synced: 30 Oct 2024

https://github.com/zayarhtet/visualising-election-with-data

Visualisation of Myanmar Election 2020 with past data

data-mining data-science data-visualization

Last synced: 06 Nov 2024

https://github.com/vi/csvdimreduce

Command-line tool to run a dimensionality reduction algorithm on CSV files

cli command-line-tool csv csv-files data-science data-visualization dimension-reduction dimensionality-reduction

Last synced: 16 Oct 2024

https://github.com/mayurasandakalum/ipl-data-engineering-spark-databricks

Comprehensive data engineering and analytics project using IPL dataset with Amazon S3, Apache Spark, Databricks, and SQL. Includes data storage, transformation, analysis, and visualization.

amazon-s3 apache-spark aws big-data cricket-analytics data-analytics data-engineering data-visualization databricks etl-pipeline ipl-dataset machine-learning python sql

Last synced: 12 Oct 2024

https://github.com/dissorial/prx21_erikz

Analysis of self-tracked data: interactive visualizations & predictive algorithms

analytics data-analysis data-science data-visualization machine-learning matplotlib pandas python python3 visualization

Last synced: 30 Oct 2024

https://github.com/ayabh/portfolio

This repository is to show my Data Analytics, Science & Engineering skills, share projects, and track my progress.

aws data-engineering data-science data-visualization etl pyspark python sql

Last synced: 30 Oct 2024

https://github.com/sayamalt/steel-energy-consumption-prediction-using-pyspark

Successfully established a machine learning model using PySpark which can precisely predict the energy consumption of the steel industry, up to an r2 score of approximately 99.5%.

apache-spark big-data-analytics big-data-processing cross-validation data-visualization exploratory-data-analysis hyperparameter-tuning machine-learning model-training-and-evaluation python regression spark sql

Last synced: 11 Oct 2024

https://github.com/shinjimc/simulated_annealing_tsp

This project applies Simulated Annealing to solve the Traveling Salesman Problem using Peru's departments as nodes. Through iterative refinement, it finds the shortest route visiting each department once. Visual feedback enhances understanding and debugging, resulting in an optimal solution displayed with total distance.

data-visualization geospatial-analysis simulated-annealing simulated-annealing-algorithm simulated-annealing-edge-detection traveling-salesman-problem traveling-salesman-problem-solver

Last synced: 06 Nov 2024

https://github.com/mariarodr1136/atmosphereanalyzer

Atmosphere Analyzer is a smart environmental monitoring system that simulates real-time sensor data using Python, integrates AWS S3 for storage, and employs AWS Lambda for processing. Its Django API serves a React frontend, providing an interactive dashboard for visualizing key environmental metrics. 🌡️

aws data-visualization django environmental-monitoring iot lambda python react real-time-data s3-bucket serverless sustainable-resourse

Last synced: 03 Nov 2024

https://github.com/solygambas/d3-firebase

5 small projects to understand D3.js basics using Firebase and Materialize.

d3 d3js data-visualization firebase firestore javascript materialize materializecss

Last synced: 03 Nov 2024

https://github.com/stanleynguyen/so.cube

World map visualisation of World's Cube Association data 🌏

cas cube data-visualization leaftlet map

Last synced: 30 Oct 2024

https://github.com/37743/ml-starterkit

This project is designed to streamline the data science workflow by allowing users to input a file path and receive a comprehensive analysis tailored to their needs. The tool automates key stages in the data science pipeline, including Exploratory Data Analysis (EDA), data preprocessing, and model training.

data-preprocessing data-visualization exploratory-data-analysis machine-learning python

Last synced: 03 Nov 2024

https://github.com/bhiogade/customer-purchase-analysis

Comprehensive Customer Purchase Analysis Across Multiple Dimensions

data-analysis data-visualization tableau tableau-desktop

Last synced: 23 Oct 2024

https://github.com/shubhamdeepkeshav/visualization-on-tips

📊 Data visualization project analyzing tipping behavior in restaurants using Python. 🍽️ Explores insights based on ⏰ time, 👥 party size, 🧑‍🤝‍🧑 gender, and 🚬 smoker status with Matplotlib and Seaborn.

data-visualization dataanalysis eda matplotlib python seaborn

Last synced: 04 Nov 2024

https://github.com/kunalthakur204/visualization-on-flower

🌸 Flower Dataset Visualization Visualizing patterns and relationships in flower data through charts and plots. Perfect for exploring floral characteristics and trends! 📊

data data-visualization dataanalysis flowerdataset python

Last synced: 04 Nov 2024

https://github.com/allanreda/automated-k-means-clustering-engine

An interactive K-Means clustering tool built with Flask and Scikit-Learn, supporting Excel file uploads, cluster analysis, and data export, deployed on Google Cloud Run via Docker with CI/CD integration.

cicd css data-visualization deployment docker flask google-cloud-run html javascript k-means-clustering machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 31 Oct 2024