Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/vatshayan/pokemon-analysis

Visualization, Analysis & Predicting the accuracy of finding Pokemon power, attack & speed through Machine Learning

artificial-intelligence data data-analysis data-science data-visualization dataset machine-learning machine-learning-algorithms pokemon scikit-learn

Last synced: 15 Nov 2024

https://github.com/cano1998/data-visualization-project

A project focused on data visualization to explore various aspects of a car dataset. The visualizations provide insights into car performance, efficiency, and characteristics based on different manufacturers and features.

bar-pl bar-plot data-analysis data-visualization histogram jupyter-notebook line-plot

Last synced: 09 Nov 2024

https://github.com/mohamedmetwalli5/breastcancerdiagnosis

Breast cancer diagnosis using machine learning via the XGBoost Algorithm after visualizing the data set & exploring it.

cancer data-visualization machine-learning

Last synced: 11 Nov 2024

https://github.com/jatin-mehra119/insurance_dataset

The objective of this project is to predict insurance charges based on various factors.

data-visualization dataanalysis prediction-model python regression-models

Last synced: 16 Nov 2024

https://github.com/victorowinoke/custmer-segmentation-using-rfm-python-

Customer Segmentation using the Recency, Frequency and Monetary Values

customer-segmentation data data-visualization python3 science time-series-analysis

Last synced: 15 Nov 2024

https://github.com/swapnil-jain/whatsapp-chat-statistics

Web Application showing common statistics, graphs and various charts about the uploaded Whatsapp chat.

data-analytics data-science data-visualization heroku-deployment streamlit whatsapp

Last synced: 15 Nov 2024

https://github.com/blankscreen-exe/data-mining-lab

Course Code: CS626, MCS Batch-2019 (Final Year) Evening

data-visualization datamining datamining-algorithms datascience

Last synced: 11 Nov 2024

https://github.com/chingu-voyages/v30-geckos-team-04

An educational application dedicated to informing people about the issues involved in air quality | Voyage-30-Geckos-Team-04 | https://clean-the-air.herokuapp.com/

air-quality data-visualization

Last synced: 14 Nov 2024

https://github.com/aniket965/crime-against-women-india

Some Data collected and visualisations of crime against women in India

crime-data data-visualization india women

Last synced: 15 Nov 2024

https://github.com/leandrocollares/infant-mortality-in-africa

An interactive choropleth map that shows infant mortality rates in Africa between 1960 and 2018

d3 data-visualization react

Last synced: 11 Nov 2024

https://github.com/vaxdata22/salifort-motors-and-waze-churn

Employee retention predictive model development for Salifort Motors and Waze. This is a terminal project I did to earn the Google Advanced Data Analytics Professional Certificate.

data-analytics data-visualization model-development predictive-analytics python statistical-analysis

Last synced: 15 Nov 2024

https://github.com/leandrocollares/population-in-dutch-provinces

A responsive bar chart showing the population of Dutch provinces

d3 data-visualization svelte

Last synced: 11 Nov 2024

https://github.com/leandrocollares/street-cherry-trees-in-vancouver

Street cherry trees in Vancouver: an exploratory data analysis

data-analysis data-visualization folium pandas plotly-express

Last synced: 11 Nov 2024

https://github.com/leandrocollares/foreign-born-population-in-canada

Responsive bar chart that shows the percentage of foreign-born population in Canada between 1871 and 2011

d3 data-visualization svelte

Last synced: 11 Nov 2024

https://github.com/leandrocollares/ei-beneficiaries-in-canada

A responsive line chart showing regular Employment Insurance beneficiaries in Canada and its provinces and territories between 2019 and 2021

d3 data-visualization svelte

Last synced: 11 Nov 2024

https://github.com/muhammadadilnaeem/machine-learning-project-student-performance-indicator

This project explores how students' performance (exam scores) is influenced by variables such as Gender, Ethnicity, Parental Level of Education, Lunch Type, and Test Preparation Course.

data-analysis data-science data-visualization dataset flas html-css machine-learning machine-learning-algorithms python

Last synced: 16 Nov 2024

https://github.com/leandrocollares/beyond-the-3-point-arc

A responsive scatter plot that shows the percentage of points scored by NBA teams via 3-point and mid-range field goals

d3 data-visualization react

Last synced: 11 Nov 2024

https://github.com/leandrocollares/employment-insurance-beneficiaries

A responsive line chart that shows regular Employment Insurance beneficiaries in Canada between 2019 and 2021

d3 data-visualization svelte

Last synced: 11 Nov 2024

https://github.com/leandrocollares/urbanization-versus-income

A responsive scatter plot that shows urban population percentages and GDP per capita in Americas.

d3 data-visualization svelte

Last synced: 11 Nov 2024

https://github.com/casperkristiansson/finance-tracker

A project which solved an issue of mine which was tracking my finance. This Finance Tracking application gives overviews of expenses and income to give its users an easy way to explore their data.

dashboard data-visualization finance-management firebase-auth react

Last synced: 15 Nov 2024

https://github.com/leandrocollares/temperatures-in-victoria

Responsive line chart that shows maximum daily temperatures in Victoria, BC in June 2021

d3 data-visualization svelte

Last synced: 11 Nov 2024

https://github.com/leandrocollares/nyc-film-permits

NYC film permits: an exploratory data analysis

data-analysis data-visualization pandas plotly

Last synced: 11 Nov 2024

https://github.com/leandrocollares/long-range-brilliance

A responsive scatterplot showing minutes played and 3-point field goals made by the best 3-point shooters in NBA history

d3 data-visualization svelte

Last synced: 11 Nov 2024

https://github.com/82luli02/sakila_dvd_rental_database_analysis

Analysis of the Sakila DVD Rental database using SQL

data data-analysis data-science data-visualization sql

Last synced: 16 Nov 2024

https://github.com/gowhale/daily-spend-analysis

Python script to analyse spending habits.

data-visualization pandas python

Last synced: 17 Nov 2024

https://github.com/andreazoccatelli/europe_emissions_shinyapp

A shiny app which shows the gas emissions of european countries from 2011 to 2020

data-science data-visualization emissions-co2

Last synced: 17 Nov 2024

https://github.com/in-jun/commit-canvas

지루한 컨트리뷰션 그래프를 예술 작품으로! 당신의 GitHub 프로필을 캔버스로 변환하여 독특한 패턴을 만들어보세요. 더 이상 단순한 잔디가 아닌, 당신만의 창의적인 작품을 GitHub에서 선보일 수 있습니다.

commit-history contribution-graph data-visualization git github-api golang oauth pixel-art profile-customization web-app

Last synced: 11 Nov 2024

https://github.com/lisabensoussan/sampling-data-wrangling-and-visualization

This project focuses on simulating rollup profit strategies and analyzing data on notable female scientists using R. It includes tasks like simulation, data scraping from Wikipedia, and generating various visualizations.

data-visualization data-wrangling probability simulation statistical-analysis

Last synced: 11 Nov 2024

https://github.com/nirmit27/coursera-data-viz-dash

Some dashboards created using Dash and Plotly.

dash dashboard data-visualization plotly plotly-dash python3

Last synced: 11 Nov 2024

https://github.com/mrankitgupta/titanic-survival-prediction-93-xgboost

Titanic Survival Prediction Project (93% Accuracy)🛳️ In this notebook, The goal is to correctly predict if someone survived the Titanic shipwreck using different Machine Learning Model & Hyperparameter tunning.

classification data-analysis data-science data-visualization gradient-boosting kaggle-competition linear-regression logistic-regression machine-learning machine-learning-algorithms ml ml-models nlp prediction predictive-modeling random-forest titanic titanic-kaggle titanic-survival-prediction xgboost

Last synced: 17 Nov 2024

https://github.com/adamouization/python-machine-learning-data-science-notes

:orange_book: Jupyter notebooks containing useful Python code and notes for general Machine Learning and Data Science projects.

data data-science data-visualization guide jupyter jupyter-notebook machine-learning matplotlib notes numpy pandas pandas-dataframe python seaborn

Last synced: 09 Nov 2024

https://github.com/nikhilash45/power-bi-vsualisation-of-joins

In This Power Bi Report User Can Visualis Join By Themselves , and it is easy to understand joins now.

business-analytics business-intelligence data data-analysis data-visualization joins powerbi sql visualization

Last synced: 12 Nov 2024

https://github.com/nikhilash45/live_ipl_report

Explore real-time updates and insights into ongoing IPL matches with this interactive PowerBI dashboard. Featuring live scores, detailed batting and bowling statistics, and a dynamic points table, stay informed and engaged throughout the IPL season.

analysts cleaning-data cricket-data dashboard data data-analysis data-visualization dax powerbi

Last synced: 12 Nov 2024

https://github.com/atheeralzhrani/data-science-projects

This repository contains my data science projects, where I utilized tools and libraries such as Spark, Python, Pandas, NumPy, SQLite, Matplotlib, Seaborn, and performed Exploratory Data Analysis .

data-engineering data-preprocessing data-science data-visualization exploratory-data-analysis matplotlib pandas python python-lambda seaborn spark

Last synced: 16 Nov 2024

https://github.com/sayamalt/steel-energy-consumption-prediction-using-pyspark

Successfully established a machine learning model using PySpark which can precisely predict the energy consumption of the steel industry, up to an r2 score of approximately 99.5%.

apache-spark big-data-analytics big-data-processing cross-validation data-visualization exploratory-data-analysis hyperparameter-tuning machine-learning model-training-and-evaluation python regression spark sql

Last synced: 16 Nov 2024

https://github.com/robinmillford/strategic-insights-unveiling-the-dynamics-of-ipl-2022-auction

This project involves a comprehensive analysis of the IPL 2022 Auction. The goal was to gain insights into the auction dynamics, player characteristics, and spending patterns of different teams.

data-analysis data-visualization ipl powerbi sql

Last synced: 17 Nov 2024

https://github.com/robinmillford/avocado-shiny-app

A dashboard of California avocado insights.

avocado data-visualization shiny-apps shinydashboard

Last synced: 17 Nov 2024

https://github.com/mlund2k/project-1-baseball-performance-vs.-attendance

Project assets for my first exploratory data analysis: Baseball Performance vs. Attendance.

bigquery data-analysis data-cleaning data-visualization excel rstudio sql tableau tidyverse

Last synced: 12 Oct 2024

https://github.com/vidyadnina/cyclistic-sql-tableau-project

Trip data analysis for a bike-sharing service company using SQL and Tableau.

bigquery dashboard data-analysis data-analytics-sql data-cleaning data-visualization sql

Last synced: 12 Oct 2024

https://github.com/alinababer/data-science-and-insight-agent-rag-llama3-lava-llm

Data-Science-and-Insight-Agent-RAG-LLama3-Lava-LLM-Django-WebApplication is an advanced AI-driven chatbot designed to assist in data science, document analysis, and image interpretation. This repository contain the Datascience Agent of this project.

artificial-neural-networks classifcation data-analysis data-engineering data-visualization datascience large-language-models llama2 lstm machine-learning python random-forest regression

Last synced: 12 Oct 2024

https://github.com/suryakaranraja/listview-application-for-zoho-interview

This repository has the files of basic application for designed to retrieve details about the top 50 happiest countries in the world for the year 2022.

data-visualization desktop desktop-app happiness-score interview-with-zoho list-view surface-tablet winui3

Last synced: 12 Oct 2024

https://github.com/moeabbas6/dbt_analytics_engine

An end-to-end project using dbt to demonstrate data transformations, testing, and visualization with Google BigQuery, and Looker Studio. It showcases a complete data pipeline from extraction/generation to deployment.

analytics-engineering bigquery data data-pipeline data-transformation data-visualization dbt testing

Last synced: 12 Oct 2024

https://github.com/Sparsh7082/Data-Analysis-Portfolio

This repository is dedicated to showcasing my skills, sharing projects, and tracking my progress in Data Analytics and Data Science.

canva data-analytics data-manipulation data-visualization database-querying google-slides power-bi power-point presentation-tools programming-language python r spreadsheets sql tableau

Last synced: 23 Oct 2024

https://github.com/adadalshabab/data-engineering-gcp-project

An end-to-end modern data engineering project, including deployment of ETL pipeline on Google Cloud Platform, using BigQuery for data analysis and leveraging Looker to generate an insight dashboard.

bigquery data data-science data-visualization databases dataengineering-a engineering etl-pipeline looker-studio powerbi

Last synced: 31 Oct 2024

https://github.com/stynw7/data_mining

Provides Computer Science material especially at Database Major which is Data Mining ⛏️⛏️

classification clustering data-mining data-visualization database outlier-detection r-programming

Last synced: 13 Oct 2024

https://github.com/tanaybhadula/avacado-analytics

It is a dashboard built using Dash.It shows graphs of dataset about sales and prices of avocados in the United States between 2015 and 2018.

css dash data-analytics data-visualization python

Last synced: 12 Nov 2024

https://github.com/tynandebold/daylight

Amount of daylight in select locations around the world.

data-visualization data-viz daylight javascript react time

Last synced: 13 Oct 2024

https://github.com/dmarks84/ind_project_obesity-multi-class-classification--kaggle

Independent Project - Kaggle Competition -- I worked on the obesity classification data set as part of a Kaggle Competition of the same name, scoring (for accuracy) above 0.9

classification correlation-analysis cross-validation data-modeling data-visualization dataframes eda gridsearchcv matplotlib multiclass-classification numpy pandas python seaborn sklearn statistics supervised-ml

Last synced: 05 Nov 2024

https://github.com/christs8920/process-mining-py

A process mining project that analyzes an event log and discovers its process model.

data-science data-visualization datavisualization pm4py process-mining processmining python

Last synced: 11 Nov 2024

https://github.com/prsdthkr/viz-design-demo

📈 This repo houses my experiments with D3 (mostly inspired by other's work) for information visualization class project and demo.

d3 data-visualization lgbtq

Last synced: 16 Oct 2024

https://github.com/abeltavares/postql

Python library and command-line interface (CLI) tool for interacting with PostgreSQL databases, providing simplified database management, query execution, and result export functionalities.

cli command-line-interface data-analysis data-engineering data-export data-management data-processing data-visualization database database-administration database-tools etl oop postgres postgresql psycopg2 python sql sqlalchemy wrapper

Last synced: 31 Oct 2024

https://github.com/samkazan/fraud-detection-ml

Machine learning models for enhanced fraud detection in e-commerce transactions, exploring feature engineering, distance prediction, and clustering analysis.

clustering data-science data-visualization dataanalytics dbscan eda hierarchical-clustering kmeans-clustering knn-imputer matplotlib mlxtend python scikit-learn seaborn xgboost

Last synced: 05 Nov 2024

https://github.com/dmarks84/ind_project_data-science-london-scikit-learn--kaggle

Independent Project - Kaggle Competition -- I worked on the Data Science London data set for the Data Science London + Scikit-learn competition.

classification cross-validation data-modeling data-reporting data-visualization dataframes eda grid-search matplotlib numpy pandas python sklearn statistics supervised-ml

Last synced: 05 Nov 2024

https://github.com/cainmagi/dash-json-grid

Dash porting version of the react project React JSON Grid. Provide structured and nested grid table view of complicated JSON objects/arrays.

dash data-visualization json json-table json-viewer plotly-dash python python-dash python-libary python3

Last synced: 09 Oct 2024

https://github.com/prekshivyas/cis-595-big-data-analytics

Comprehensive real estate price prediction project, integrating socioeconomic indicators and property features.

data-analysis data-cleaning data-mining data-preprocessing data-science data-visualization data-wrangling exploratory-data-analysis web-scraping

Last synced: 09 Nov 2024

https://github.com/dmarks84/ind_project_superstore-sales-time-series-analysis--kaggle

Independent Project - Kaggle Dataset-- I worked on the Superstore Sales Dataset, performing (as Part 1) data cleaning and preparation and exploratory data analysis. The main task was to make predictions for future sales based on time-series analysis, which is found in Part 2.

chloropleth data-modeling data-visualization eda linear-regression matplotlib numpy pandas python seaborn sklearn statistics statsmodels supervised-ml time-series-analysis

Last synced: 05 Nov 2024

https://github.com/tynandebold/day-length-line-chart

A day-length line chart, charting the length of daylight from different locations around the world.

d3 d3-visualization d3js data-visualization data-viz

Last synced: 13 Oct 2024

https://github.com/dmarks84/ind_project_california-housing-data--kaggle

Independent Project - Kaggle Dataset-- I worked on the California Housing dataset, performing data cleaning and preparation; exploratory data analysis; feature engineering; regression model buildings; model evaluation.

cross-validation data-modeling data-reporting data-visualization eda folium grid-search matplotlib model-evaluation numpy pandas pca python seaborn sklearn statistics supervised-ml unsupervised-ml

Last synced: 05 Nov 2024

https://github.com/smahala02/materials-science-introduction

Introduction to Materials Science concepts using Python for array manipulation and visualization with NumPy and Matplotlib.

data-visualization materials-science matplotlib numpy python scientific-computing

Last synced: 31 Oct 2024

https://github.com/abhipatel35/diabetes_ml_classification

Predict diabetes using machine learning models. Experiment with logistic regression, decision trees, and random forests to achieve accurate predictions based on health indicators. Complete lifecycle of ML project included.

classification data-analysis data-science data-visualization descision-tree diabetes-prediction jupiter-notebook logistic-regression machine-learning model-evaluation open-source pandas pycharm-ide python random-forest scikit-learn

Last synced: 31 Oct 2024

https://github.com/mzprog/datatables

create a better tables with few lines of code.

data-visualization datatables laravel-package livewire

Last synced: 14 Oct 2024

https://github.com/zmyzheng/stack_overflow_qa_assistant

Big Data Analysis project with recommendation, cluster analysis and graph database

big-data-analytics cluster-analysis data-visualization graph-database hadoop mahout recommendation-system

Last synced: 23 Oct 2024

https://github.com/zmyzheng/browserassistant

Big Data & Cloud Computing project for recommendation, cluster analysis, data visualization with Hadoop and Spark deployed in auto- scaling cloud environment, youtube link:

angular big-data-analytics cloud cluster-analysis data-visualization elasticsearch flask hadoop recommendation-system spark spring-boot

Last synced: 23 Oct 2024

https://github.com/shwetapardhi/assignment-04-simple-linear-regression-2

Q2) Salary_hike -> Build a prediction model for Salary_hike Build a simple linear regression model by performing EDA and do necessary transformations and select the best model using R or Python. EDA and Data Visualization. Correlation Analysis. Model Building. Model Testing. Model Predictions.

correlation-analysis data-visualization distplot eda feature-engineering model-building model-predictions model-template numpy ols-regression p-value pandas python r-square-values regression-plot seaborn simple-linear-regression smf statsmodels t-score

Last synced: 11 Nov 2024

https://github.com/shwetapardhi/assignment-04-simple-linear-regression-1

Q1) Delivery_time -> Predict delivery time using sorting time. Build a simple linear regression model by performing EDA and do necessary transformations and select the best model using R or Python. EDA and Data Visualization, Feature Engineering, Correlation Analysis, Model Building, Model Testing and Model Predictions using simple linear regressi

correlation-analysis data-visualization distplot eda feature-engineering model-building model-prediction model-testing numpy ols-regression p-value pandas python regression-plot rsquare-values seaborn simple-linear-regression smf statsmodel t-score

Last synced: 11 Nov 2024

https://github.com/sudarsann27/basic_machine_learning_algorithms

Basic Machine learning algorithms using scikit-learn and other fundamental libraries

data-science data-visualization ensemble-model kaggle numpy pandas scikit-learn supervised-machine-learning

Last synced: 03 Nov 2024