Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/tynandebold/daylight

Amount of daylight in select locations around the world.

data-visualization data-viz daylight javascript react time

Last synced: 13 Oct 2024

https://github.com/mariarodr1136/atmosphereanalyzer

Atmosphere Analyzer is a smart environmental monitoring system that simulates real-time sensor data using Python, integrates AWS S3 for storage, and employs AWS Lambda for processing. Its Django API serves a React frontend, providing an interactive dashboard for visualizing key environmental metrics. 🌡️

aws data-visualization django environmental-monitoring iot lambda python react real-time-data s3-bucket serverless sustainable-resourse

Last synced: 03 Nov 2024

https://github.com/dmarks84/ind_project_obesity-multi-class-classification--kaggle

Independent Project - Kaggle Competition -- I worked on the obesity classification data set as part of a Kaggle Competition of the same name, scoring (for accuracy) above 0.9

classification correlation-analysis cross-validation data-modeling data-visualization dataframes eda gridsearchcv matplotlib multiclass-classification numpy pandas python seaborn sklearn statistics supervised-ml

Last synced: 05 Nov 2024

https://github.com/solygambas/d3-firebase

5 small projects to understand D3.js basics using Firebase and Materialize.

d3 d3js data-visualization firebase firestore javascript materialize materializecss

Last synced: 03 Nov 2024

https://github.com/dmarks84/ind_project_data-science-london-scikit-learn--kaggle

Independent Project - Kaggle Competition -- I worked on the Data Science London data set for the Data Science London + Scikit-learn competition.

classification cross-validation data-modeling data-reporting data-visualization dataframes eda grid-search matplotlib numpy pandas python sklearn statistics supervised-ml

Last synced: 05 Nov 2024

https://github.com/stanleynguyen/so.cube

World map visualisation of World's Cube Association data 🌏

cas cube data-visualization leaftlet map

Last synced: 30 Oct 2024

https://github.com/dmarks84/ind_project_superstore-sales-time-series-analysis--kaggle

Independent Project - Kaggle Dataset-- I worked on the Superstore Sales Dataset, performing (as Part 1) data cleaning and preparation and exploratory data analysis. The main task was to make predictions for future sales based on time-series analysis, which is found in Part 2.

chloropleth data-modeling data-visualization eda linear-regression matplotlib numpy pandas python seaborn sklearn statistics statsmodels supervised-ml time-series-analysis

Last synced: 05 Nov 2024

https://github.com/tynandebold/day-length-line-chart

A day-length line chart, charting the length of daylight from different locations around the world.

d3 d3-visualization d3js data-visualization data-viz

Last synced: 13 Oct 2024

https://github.com/37743/ml-starterkit

This project is designed to streamline the data science workflow by allowing users to input a file path and receive a comprehensive analysis tailored to their needs. The tool automates key stages in the data science pipeline, including Exploratory Data Analysis (EDA), data preprocessing, and model training.

data-preprocessing data-visualization exploratory-data-analysis machine-learning python

Last synced: 03 Nov 2024

https://github.com/mrwilbroad/react-data-visualizaion

Data visualization with reactjs , react-chartjs-2

chartjs data-visualization graphs react-chartjs-2 reactjs

Last synced: 07 Nov 2024

https://github.com/dmarks84/ind_project_california-housing-data--kaggle

Independent Project - Kaggle Dataset-- I worked on the California Housing dataset, performing data cleaning and preparation; exploratory data analysis; feature engineering; regression model buildings; model evaluation.

cross-validation data-modeling data-reporting data-visualization eda folium grid-search matplotlib model-evaluation numpy pandas pca python seaborn sklearn statistics supervised-ml unsupervised-ml

Last synced: 05 Nov 2024

https://github.com/tralahm/parliament-2017-dataset

Concise, Clean data sets of the 2017 Kenyan General Election results for the Members of the Senate and National Assembly Composition

csv-parsing data-analysis data-visualization datasets election-data ipynb-jupyter-notebook kaggle-dataset kenya-constituencies kenya-counties matplotlib python3 tralahtek

Last synced: 05 Nov 2024

https://github.com/karlyndiary/smartphone-price-analytics

A data pipeline for analyzing smartphone pricing by retrieving data from Flipkart using RapidAPI, transforming it, and visualizing insights using SQL Server and Excel.

beautifulsoup data-analysis data-pipeline data-visualization data-visualization-dashboard etl microsoft microsoft-excel microsoft-sql-server python smartphone-price-analysis

Last synced: 12 Oct 2024

https://github.com/jaguzmana/colombia-covid-analysis

A project proposed to enhance SQL proficiency and develop skills in data visualization using Tableau.

data-visualization mssql-database tableau

Last synced: 12 Oct 2024

https://github.com/shubhamdeepkeshav/visualization-on-tips

📊 Data visualization project analyzing tipping behavior in restaurants using Python. 🍽️ Explores insights based on ⏰ time, 👥 party size, 🧑‍🤝‍🧑 gender, and 🚬 smoker status with Matplotlib and Seaborn.

data-visualization dataanalysis eda matplotlib python seaborn

Last synced: 04 Nov 2024

https://github.com/kunalthakur204/visualization-on-flower

🌸 Flower Dataset Visualization Visualizing patterns and relationships in flower data through charts and plots. Perfect for exploring floral characteristics and trends! 📊

data data-visualization dataanalysis flowerdataset python

Last synced: 04 Nov 2024

https://github.com/vi/csvdimreduce

Command-line tool to run a dimensionality reduction algorithm on CSV files

cli command-line-tool csv csv-files data-science data-visualization dimension-reduction dimensionality-reduction

Last synced: 16 Oct 2024

https://github.com/mayurasandakalum/ipl-data-engineering-spark-databricks

Comprehensive data engineering and analytics project using IPL dataset with Amazon S3, Apache Spark, Databricks, and SQL. Includes data storage, transformation, analysis, and visualization.

amazon-s3 apache-spark aws big-data cricket-analytics data-analytics data-engineering data-visualization databricks etl-pipeline ipl-dataset machine-learning python sql

Last synced: 12 Oct 2024

https://github.com/sayamalt/steel-energy-consumption-prediction-using-pyspark

Successfully established a machine learning model using PySpark which can precisely predict the energy consumption of the steel industry, up to an r2 score of approximately 99.5%.

apache-spark big-data-analytics big-data-processing cross-validation data-visualization exploratory-data-analysis hyperparameter-tuning machine-learning model-training-and-evaluation python regression spark sql

Last synced: 11 Oct 2024

https://github.com/sitek94/react-interactive-scatter-plot

An interactive scatter plot made with React and D3

d3 data-visualization react

Last synced: 30 Oct 2024

https://github.com/sitek94/react-d3-bar-chart

A bar chart made with React and D3.

d3 data-visualization react

Last synced: 30 Oct 2024

https://github.com/bhiogade/customer-purchase-analysis

Comprehensive Customer Purchase Analysis Across Multiple Dimensions

data-analysis data-visualization tableau tableau-desktop

Last synced: 23 Oct 2024

https://github.com/allanreda/automated-k-means-clustering-engine

An interactive K-Means clustering tool built with Flask and Scikit-Learn, supporting Excel file uploads, cluster analysis, and data export, deployed on Google Cloud Run via Docker with CI/CD integration.

cicd css data-visualization deployment docker flask google-cloud-run html javascript k-means-clustering machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 31 Oct 2024

https://github.com/noturlee/iris-dataanalyis

This project aims to classify Iris flowers into three species—setosa, versicolor, and virginica—based on their sepal and petal measurements using machine learning techniques. The dataset comprises 150 samples evenly distributed among these species

data-analysis data-modeling data-science data-structures-and-algorithms data-visualization

Last synced: 04 Nov 2024

https://github.com/noturlee/sales-dataanalysis

This project aims to predict product sales based on advertising expenditures, focusing on 'TV advertising'. Machine learning techniques are employed to analyze and interpret data, enabling businesses to optimize advertising strategies and maximize sales potential.

data-modeling data-science data-structures-and-algorithms data-visualization linear-regression

Last synced: 04 Nov 2024

https://github.com/sted9000/gambit

Learn Pot Limit Omaha Starting Ranges

automation data-visualization poker python

Last synced: 16 Aug 2024

https://github.com/katiesaund/jpm_healthcare_2019

Analysis of healthcare stock changes over JPM week versus rest of the year.

biotech data-science data-visualization finance jpmorgan r rstats stock

Last synced: 14 Oct 2024

https://github.com/katiesaund/tidy_tuesday

A weekly data project in R from the R4DS online learning community

data-analysis data-visualization datascience plot r rstats tidytuesday

Last synced: 14 Oct 2024

https://github.com/aelluminate-hive/global-cost-index-analysis

This analysis explores the cost of living across various countries, aiming to provide insights into economic disparities and living standards on a global scale. Utilizing a dataset that includes indices for overall cost of living, groceries, restaurant prices, and rent, we investigate the top and least expensive countries worldwide.

data-science data-visualization exploratory-data-analysis jupyter-notebook python3

Last synced: 05 Nov 2024

https://github.com/aelluminate-hive/gym-members-exercise-analysis

Analyze fitness dataset to uncover insights into exercise patterns, calorie consumption, body composition, and overall fitness levels. Explore correlations between workout type, frequency, and physiological markers. Use data analysis techniques to provide a comprehensive understanding of fitness behaviors and factors influencing exercise outcomes.

data-science data-visualization exploratory-data-analysis jupyter-notebook python3

Last synced: 05 Nov 2024

https://github.com/aelluminate-hive/student-performance-analysis

This project analyzes student data to understand factors affecting final exam scores. Data includes study habits, extracurriculars, family background, school environment, and demographics. The goal is to identify key contributors to academic success.

data-science data-visualization exploratory-data-analysis jupyter-notebook python3

Last synced: 05 Nov 2024

https://github.com/mthaler/ourworldindata

Gnuplot files for plotting data from ourworldindata.org

data-visualization gnuplot ourworldindata

Last synced: 05 Nov 2024

https://github.com/bhushan148/finance-domain-bank-loan-report-tableau

I analyzed 🏦 bank loan data to reveal trends, KPIs, and insights. Using Tableau 📈 for dashboards and SQL 🗃️ for data extraction, I visualized loan applications, borrower profiles, and repayment behaviors 💡.

bussiness-intelligence dashboard-design data-analysis data-visualization excel figma sql sqlqueries tableau

Last synced: 05 Nov 2024

https://github.com/kirankumar-ak777/vaccine_usage_analysis_and_prediction

This Vaccine Prediction Project predicts individual likelihood of receiving H1N1 or seasonal flu vaccines using demographic, medical, and behavioral data. Through data processing, feature selection, and logistic regression, it builds a model to better understand vaccine uptake behaviors.

data-visualization machine-learning pandas python

Last synced: 05 Nov 2024

https://github.com/hayatiyrtgl/cryptocurrency_time_series_rnn

Python script for training a Simple RNN model on cryptocurrency price data to predict future prices, including data exploration and evaluation

data-analysis data-science data-visualization keras pandas pandas-python prediction predictive-modeling python python-script rnn rnn-tensorflow tensorflow time-series time-series-analysis

Last synced: 05 Nov 2024

https://github.com/superskyyy/stackoverseer

This is a StackOverflow monitor where you can easily access the most trending and up-to-date questions on a particular set of tags. This project can be modified to support wider range of tags and provide functionalities.

charts data-visualization stackoverflow-api stackoverflow-questions

Last synced: 07 Nov 2024

https://github.com/qtle3/decision-tree-regression

This project implements **Decision Tree Regression** to predict the salary of an employee based on their position level. By using a dataset containing position levels and their corresponding salaries, the project highlights how decision trees can capture complex relationships in data through non-linear regression.

data-visualization decision-tree-regression prediction-model

Last synced: 05 Nov 2024

https://github.com/qtle3/logistic-regression

A Python implementation of Logistic Regression to classify social network ads based on age and estimated salary, featuring data visualization and performance metrics such as confusion matrix and accuracy score.

data-visualization feature-scaling logistic-regression logistic-regression-algorithm model-evaluation

Last synced: 05 Nov 2024

https://github.com/allanreda/telco-customer-churn-predictor-app

A web-based machine learning application that predicts customer churn using a logistic regression model. Built with Scikit-Learn for model training, Gradio for the user interface, and deployed on Google Cloud App Engine. The app allows users to input customer data and receive predictions on churn risk to support business decision-making.

app-engine data-visualization deployment google-cloud gradio hyperparameter-tuning logistic-regression machine-learning numpy pandas scikit-learn

Last synced: 07 Nov 2024

https://github.com/allanreda/ga4-session-predictor-flask-app

Flask app that can predict future number of GA4 sessions, using the Prophet library.

data-visualization flask ga4-api matplotlib pandas prophet-library python time-series-forecasting

Last synced: 07 Nov 2024

https://github.com/cassiofb-dev/covid-grafico

Gráficos da COVID-19 (Mortes e Casos) com Chart.js.

chartjs covid-19 data-visualization

Last synced: 07 Nov 2024

https://github.com/kinolag/traffic

A geospatial visualisation app showing road traffic information for all areas of Inner London. Built in TypeScript, combining React with D3.

d3 data-visualization geojson geospatial-visualization mapping react responsive-design svg topojson typescript

Last synced: 27 Oct 2024

https://github.com/jay6430/python-for-data-science

This Repository includes all the Python concepts that are necessary for Data science.

csv data-visualization datascience-machinelearning html-css ipynb-jupyter-notebook matplotlib numpy pandas python3 xml

Last synced: 11 Oct 2024

https://github.com/abdelrahmanbayoumi/titanic-machine-learning-from-disasters

Knowing from a training set of samples listing passengers who survived or did not survive the Titanic disaster, can our model determine based on a given test dataset not containing the survival information, if these passengers in the test dataset survived or not.

data-analysis data-science data-visualization machine-learning pandas

Last synced: 05 Nov 2024

https://github.com/miteshgupta07/streamlit-machine-learning-app

A Streamlit application for interactive exploratory data analysis (EDA) and data visualization, offering dynamic tools to analyze and visualize machine learning datasets.

data-visualization python streamlit

Last synced: 07 Nov 2024

https://github.com/bhavinpatel4199/machine-learning-framework

This repository, showcases various projects that explore key concepts in both supervised and unsupervised learning, with a focus on real-world applications. The projects utilize a range of machine learning techniques, including data preprocessing, feature selection, exploratory data analysis (EDA), and model optimization.

classification clustering data-science data-structures data-visualization exploratory-data-analysis machine-learning machine-learning-algorithms machine-learning-models pandas-dataframe predictive-modeling preprocessing-data sklearn supervised-learning unsupervised-learning

Last synced: 03 Nov 2024

https://github.com/miserman/splot

An R package to ease data visualization

data-visualization r

Last synced: 05 Nov 2024

https://github.com/aneeshmurali-n/ann-diabetes-prediction

Predicting diabetes progression using an Artificial Neural Network (ANN). This project leverages the scikit-learn diabetes dataset for training and evaluation. Includes data preprocessing, model building, and performance visualization.

ann data-preprocessing data-visualization deep-learning diabetes-prediction exploratory-data-analysis keras machine-learning matplotlib neural-network numpy pandas regression scikit-learn seaborn tensorflow visualization

Last synced: 31 Oct 2024

https://github.com/gabrieldiem/data_visualization_lifespan_wealth

Little python script that shows a data visualization of life span and wealth worldwide

data-visualization pandas plotly python script

Last synced: 05 Nov 2024

https://github.com/gabrieldiem/iss_locator

Little python script that plots the ISS (International Space Station) location in a world map at a given time

data-visualization pandas plotly python script

Last synced: 05 Nov 2024

https://github.com/madhursinghbhadoriya/data_analysis_sales_insights_using_tableau

• Performed Data Cleaning using MySQL. • Data analysis and ETL in Tableau. • Created an Interactive Dashboard with significant information about the Sales Insights, Profit and Revenue Analysis.

data-analysis data-visualization dataanalysis etl mysql tableau-dashboards tableau-desktop

Last synced: 05 Nov 2024

https://github.com/madhursinghbhadoriya/cutomer_data_analysis1

Customer_Data Analysis - Tableau

chart data-visualization tableau

Last synced: 05 Nov 2024

https://github.com/kuranez/eu-energy-map

Dashboard with interactive map visualizing Eurostat data regarding renewable energy developments in the European Union.

data-visualization energy-data european-union geopandas pandas plotly python

Last synced: 05 Nov 2024

https://github.com/corndogit/dataspaceart

A generative art project which generates stylized patterns from weather data

data-visualization python weather

Last synced: 07 Nov 2024

https://github.com/sayamalt/credit-card-approval-prediction

Successfully developed a machine learning model which can accurately predict up to 100% accuracy whether a credit card application of a given applicant would be approved or not, based on several demographic features such as applicant age, total income, marital status, total years of work experience, etc.

binary-classification cicd-deployment cross-validation data-exploration-and-preprocessing data-visualization exploratory-data-analysis feature-engineering hyperparameter-optimization machine-learning model-deployment model-retraining model-selection model-testing model-training-and-evaluation

Last synced: 07 Nov 2024

https://github.com/sayamalt/taxi-trip-fare-prediction

Successfully created a machine learning model which can accurately predict the fare of a taxi trip based on several features such as trip duration, tip amount, etc.

cross-validation data-exploration-and-preprocessing data-visualization exploratory-data-analysis feature-engineering hyperparameter-optimization machine-learning model-deployment model-selection model-training-and-evaluation regression-modelling

Last synced: 07 Nov 2024

https://github.com/sayamalt/company-bankruptcy-prediction

Successfully developed a machine learning model which can accurately predict whether a firm will become bankrupt or not, depending on various features such as net value growth rate, borrowing dependency, cash/total assets, etc.

binary-classification cicd-deployment cross-validation data-exploration-and-preprocessing data-visualization docker-container exploratory-data-analysis feature-engineering github-actions hyperparameter-optimization machine-learning model-deployment model-training-and-evaluation

Last synced: 07 Nov 2024

https://github.com/sayamalt/employee-attrition-prediction

Successfully established a machine learning model which can accurately predict whether an employee of a given company will leave it in the impending future or not, based on several employee details and employment metrics.

binary-classification continuous-deployment continuous-integration cross-validation data-exploration-and-preprocessing data-visualization exploratory-data-analysis feature-engineering hyperparameter-optimization machine-learning model-deployment model-training-and-evaluation

Last synced: 07 Nov 2024

https://github.com/sayamalt/life-expectancy-prediction

Successfully established a machine learning model which can accurately predict the expected life duration of a human being based on several demographic features such as alcohol consumption per capita, average BMI of entire population, etc.

cross-validation data-cleaning-and-preprocessing data-visualization docker end-to-end-pipeline exploratory-data-analysis feature-engineering github-actions-workflow hyperparameter-tuning machine-learning model-deployment model-training-and-evaluation

Last synced: 07 Nov 2024

https://github.com/sayamalt/concrete-strength-prediction

Successfully developed a machine learning model which can accurately predict the strength of cement based on various features such as blast furnace slag, water, coarse aggregate, etc.

cross-validation data-visualization exploratory-data-analysis feature-engineering hyperparameter-optimization machine-learning model-deployment model-training-and-evaluation regression-models

Last synced: 07 Nov 2024

https://github.com/sayamalt/superstore-sales-prediction

Successfully established a machine learning model that can accurately predict the sales of a superstore based on various features such as quantity, profit, discount, postal code, etc. The features are mainly associated with order details and customer demographics.

azure-machine-learning azure-web-app-service cicd-deployment cross-validation data-cleaning-and-preprocessing data-visualization exploratory-data-analysis feature-engineering github-actions-ci-cd hyperparameter-tuning machine-learning model-deployment model-retraining model-testing model-training-and-evaluation regression-models

Last synced: 07 Nov 2024

https://github.com/sayamalt/flight-price-prediction

Successfully established a machine learning model to accurately predict the price of a flight in India based on several features such as duration, days left, arrival time, departure time and so on.

data-visualization exploratory-data-analysis feature-engineering hyperparameter-tuning machine-learning model-deployment model-training-and-evaluation regression-models

Last synced: 07 Nov 2024

https://github.com/sayamalt/twitter-sentiment-analysis

Successfully established a machine learning model which can accurately classify the sentiment of any particular tweet into either positive, negative or neutral category.

data-visualization exploratory-data-analysis nlp sentiment-analysis supervised-learning text-processing

Last synced: 07 Nov 2024

https://github.com/sayamalt/titanic-survival-prediction

Successfully developed a Logistic Regression model for predicting the survival of a passenger aboard the Titanic ship based on his/her various features such as gender, age, passenger class, no. of siblings, embarkation location, etc.

data-cleaning data-preprocessing data-visualization exploratory-data-analysis logistic-regression machine-learning sklearn

Last synced: 07 Nov 2024

https://github.com/sayamalt/quora-duplicate-question-pairs-identification

Successfully developed a machine learning model which can accurately detect whether any given pair of Quora questions are duplicate or not.

data-visualization machine-learning natural-language-processing nltk paraphrase-detection text-preprocessing

Last synced: 07 Nov 2024

https://github.com/sayamalt/house-price-prediction

Successfully created a regression model for predicting the price of any house, excluding enormous real estates and mansions, to a significant level of accuracy.

data-visualization exploratory-data-analysis feature-engineering feature-selection machine-learning regression-analysis regression-testing

Last synced: 07 Nov 2024

https://github.com/sayamalt/fraudulent-transactions-prediction

Successfully trained a machine learning model which can predict whether a given transaction is fraud or not.

data-visualization exploratory-data-analysis imblearn machine-learning model-based-testing model-building predictive-analytics sklearn

Last synced: 07 Nov 2024

https://github.com/sayamalt/customer-churn-prediction

Successfully established a machine learning model which can predict whether any given customer currently utilizing the products and services offered by a company will churn at anytime in the future or not, depending upon a set of unique features/characteristics pertaining to that specific individual, to a great level of accuracy.

classification data-visualization exploratory-data-analysis feature-engineering hyperparameter-tuning model-deployment model-evaluation model-optimization model-training supervised-machine-learning

Last synced: 07 Nov 2024

https://github.com/mcommer/emtools

A toolbox for geophysical EM-simulation data- and model-file processing, analysis, plotting, and other gimmicks

data-visualization electromagnetics geophysics plotting-scripts shell-scripts

Last synced: 16 Oct 2024

https://github.com/alexgenovese/react-charts-covid-19-data

Examples on COVID-19 data using different library charts: G2, G2Plot, Plotly, ApexCharts

data-analysis data-science data-visualization react reactjs

Last synced: 07 Nov 2024

https://github.com/papposilene/mappingthepompidou

Visualizing the Centre Pompidou's (Centre national d'art moderne, aka CNAM) collection data.

data-visualization laravel7 museum museum-collections nuxtjs

Last synced: 27 Sep 2024

https://github.com/camara94/data_analyse_series_temporelles

Dans ce tutoriel, nous allons répondre aux questions suivantes: 1. Lire les données Microsoft à l'aide du package **Pandas Data reader** 2. Obtenez le **prix maximum** de l'action de **2017 à 2022** 3. Quelle est la **date du cours le plus élevé** de l'action ? 4. Quelle est la **date du cours le plus bas** de l'action ?

data-analysis data-analysis-python data-science data-structures-and-algorithms data-visualization serie series-forecasting

Last synced: 05 Nov 2024

https://github.com/adi3042/data_science

📊🚀 Explore the Data Science Universe! Unlock insights and master data skills with hands-on assignments spanning machine learning, visualization, and more. Your journey to becoming a data expert starts here! 🎯💡 DataScienceJourney

anomaly-detection big-data-processing classification clustering computer-vision data-cleaning-and-preprocessing data-visualization deep-learning dimensionality-reduction ensemble-learning exploratory-data-analysis feature-engineering machine-learning model-deployment model-selection-and-evaluation natural-language-processing regression-analysis statistical-analysis time-series-analysis-and-forecasting

Last synced: 08 Nov 2024

https://github.com/robwiederstein/kytc_loc

Plot Kentucky licensing locations

data-visualization ggmap leaflet r xml2

Last synced: 06 Nov 2024

https://github.com/robwiederstein/covid-19-ky

Monitor US covid-19 cases w/ Johns Hopkins data

data data-visualization leaflet plotly r shell

Last synced: 06 Nov 2024