Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/spacebakery/making-a-visual-argument

Codecademy | Data Visualization with Matplotlib | Making a Visual Argument in Matplotlib

data-visualization matplotlib python

Last synced: 09 Nov 2024

https://github.com/qtle3/decision-tree-regression

This project implements **Decision Tree Regression** to predict the salary of an employee based on their position level. By using a dataset containing position levels and their corresponding salaries, the project highlights how decision trees can capture complex relationships in data through non-linear regression.

data-visualization decision-tree-regression prediction-model

Last synced: 05 Nov 2024

https://github.com/qtle3/logistic-regression

A Python implementation of Logistic Regression to classify social network ads based on age and estimated salary, featuring data visualization and performance metrics such as confusion matrix and accuracy score.

data-visualization feature-scaling logistic-regression logistic-regression-algorithm model-evaluation

Last synced: 05 Nov 2024

https://github.com/allanreda/telco-customer-churn-predictor-app

A web-based machine learning application that predicts customer churn using a logistic regression model. Built with Scikit-Learn for model training, Gradio for the user interface, and deployed on Google Cloud App Engine. The app allows users to input customer data and receive predictions on churn risk to support business decision-making.

app-engine data-visualization deployment google-cloud gradio hyperparameter-tuning logistic-regression machine-learning numpy pandas scikit-learn

Last synced: 07 Nov 2024

https://github.com/allanreda/ga4-session-predictor-flask-app

Flask app that can predict future number of GA4 sessions, using the Prophet library.

data-visualization flask ga4-api matplotlib pandas prophet-library python time-series-forecasting

Last synced: 07 Nov 2024

https://github.com/cassiofb-dev/covid-grafico

Gráficos da COVID-19 (Mortes e Casos) com Chart.js.

chartjs covid-19 data-visualization

Last synced: 07 Nov 2024

https://github.com/shwetapardhi/assignment-04-simple-linear-regression-2

Q2) Salary_hike -> Build a prediction model for Salary_hike Build a simple linear regression model by performing EDA and do necessary transformations and select the best model using R or Python. EDA and Data Visualization. Correlation Analysis. Model Building. Model Testing. Model Predictions.

correlation-analysis data-visualization distplot eda feature-engineering model-building model-predictions model-template numpy ols-regression p-value pandas python r-square-values regression-plot seaborn simple-linear-regression smf statsmodels t-score

Last synced: 11 Nov 2024

https://github.com/abdelrahmanbayoumi/titanic-machine-learning-from-disasters

Knowing from a training set of samples listing passengers who survived or did not survive the Titanic disaster, can our model determine based on a given test dataset not containing the survival information, if these passengers in the test dataset survived or not.

data-analysis data-science data-visualization machine-learning pandas

Last synced: 05 Nov 2024

https://github.com/miteshgupta07/streamlit-machine-learning-app

A Streamlit application for interactive exploratory data analysis (EDA) and data visualization, offering dynamic tools to analyze and visualize machine learning datasets.

data-visualization python streamlit

Last synced: 07 Nov 2024

https://github.com/zmyzheng/browserassistant

Big Data & Cloud Computing project for recommendation, cluster analysis, data visualization with Hadoop and Spark deployed in auto- scaling cloud environment, youtube link:

angular big-data-analytics cloud cluster-analysis data-visualization elasticsearch flask hadoop recommendation-system spark spring-boot

Last synced: 23 Oct 2024

https://github.com/spacebakery/make-the-other-charts

Codecademy | Data Visualization with Matplotlib | Matplotlib Fundamentals

ab-lines annotate bar-chart data-visualization histogram matplotlib pie-chart python scatter-plot subplots

Last synced: 09 Nov 2024

https://github.com/zmyzheng/stack_overflow_qa_assistant

Big Data Analysis project with recommendation, cluster analysis and graph database

big-data-analytics cluster-analysis data-visualization graph-database hadoop mahout recommendation-system

Last synced: 23 Oct 2024

https://github.com/spacebakery/making-a-visual-argument--compare-grammy-win-records-project

Codecademy | Data Visualization with Matplotlib | Making a Visual Argument in Matplotlib

data-visualization matplotlib python

Last synced: 09 Nov 2024

https://github.com/mzprog/datatables

create a better tables with few lines of code.

data-visualization datatables laravel-package livewire

Last synced: 14 Oct 2024

https://github.com/spacebakery/make-a-line-chart

Codecademy | Data Visualization with Matplotlib | Matplotlib Fundamentals

data-visualization matplotlib pandas-dataframe python

Last synced: 09 Nov 2024

https://github.com/abhipatel35/diabetes_ml_classification

Predict diabetes using machine learning models. Experiment with logistic regression, decision trees, and random forests to achieve accurate predictions based on health indicators. Complete lifecycle of ML project included.

classification data-analysis data-science data-visualization descision-tree diabetes-prediction jupiter-notebook logistic-regression machine-learning model-evaluation open-source pandas pycharm-ide python random-forest scikit-learn

Last synced: 31 Oct 2024

https://github.com/smahala02/materials-science-introduction

Introduction to Materials Science concepts using Python for array manipulation and visualization with NumPy and Matplotlib.

data-visualization materials-science matplotlib numpy python scientific-computing

Last synced: 31 Oct 2024

https://github.com/prekshivyas/cis-595-big-data-analytics

Comprehensive real estate price prediction project, integrating socioeconomic indicators and property features.

data-analysis data-cleaning data-mining data-preprocessing data-science data-visualization data-wrangling exploratory-data-analysis web-scraping

Last synced: 09 Nov 2024

https://github.com/spacebakery/make-a-line-chart-for-research

Codecademy | Data Visualization with Matplotlib | Matplotlib Fundamentals

data-visualization line-chart matplotlib python

Last synced: 09 Nov 2024

https://github.com/virajbhutada/hr-analytics-excel-sql-tableau-powerbi

Explore a comprehensive HR Analytics portfolio showcasing data analysis and visualization skills. Featuring dashboards in Power BI, Excel, and Tableau, along with SQL queries for deeper insights. A holistic view of expertise in HR analytics, data visualization, and database management. Let's dive into the game of data insights!

data-analysis data-management data-visualization excel hr-analytics interactive-dashboards portfolio-project postgresql powerbi powerbi-visuals sql sql-queries tableau tableau-public

Last synced: 11 Nov 2024

https://github.com/virajbhutada/diamond-price-estimator

This project develops a predictive model to estimate diamond prices based on characteristics like carat, cut, color, and clarity. It covers data preprocessing, feature engineering, model selection, training, and evaluation. The final product is a web app where users can input diamond attributes to get accurate and instant price predictions.

cross-validation css data-analysis data-science-projects data-visualization eda feature-engineering html hyperparameter-tuning jupyter-notebooks machine-learning ml-algorithms model-deployment model-selection performance-optimization predictive-modeling python python-app user-interface

Last synced: 11 Nov 2024

https://github.com/cainmagi/dash-json-grid

Dash porting version of the react project React JSON Grid. Provide structured and nested grid table view of complicated JSON objects/arrays.

dash data-visualization json json-table json-viewer plotly-dash python python-dash python-libary python3

Last synced: 09 Oct 2024

https://github.com/samkazan/fraud-detection-ml

Machine learning models for enhanced fraud detection in e-commerce transactions, exploring feature engineering, distance prediction, and clustering analysis.

clustering data-science data-visualization dataanalytics dbscan eda hierarchical-clustering kmeans-clustering knn-imputer matplotlib mlxtend python scikit-learn seaborn xgboost

Last synced: 05 Nov 2024

https://github.com/gabrieldiem/data_visualization_lifespan_wealth

Little python script that shows a data visualization of life span and wealth worldwide

data-visualization pandas plotly python script

Last synced: 05 Nov 2024

https://github.com/gabrieldiem/iss_locator

Little python script that plots the ISS (International Space Station) location in a world map at a given time

data-visualization pandas plotly python script

Last synced: 05 Nov 2024

https://github.com/abeltavares/postql

Python library and command-line interface (CLI) tool for interacting with PostgreSQL databases, providing simplified database management, query execution, and result export functionalities.

cli command-line-interface data-analysis data-engineering data-export data-management data-processing data-visualization database database-administration database-tools etl oop postgres postgresql psycopg2 python sql sqlalchemy wrapper

Last synced: 31 Oct 2024

https://github.com/madhursinghbhadoriya/data_analysis_sales_insights_using_tableau

• Performed Data Cleaning using MySQL. • Data analysis and ETL in Tableau. • Created an Interactive Dashboard with significant information about the Sales Insights, Profit and Revenue Analysis.

data-analysis data-visualization dataanalysis etl mysql tableau-dashboards tableau-desktop

Last synced: 05 Nov 2024

https://github.com/madhursinghbhadoriya/cutomer_data_analysis1

Customer_Data Analysis - Tableau

chart data-visualization tableau

Last synced: 05 Nov 2024

https://github.com/kuranez/eu-energy-map

Dashboard with interactive map visualizing Eurostat data regarding renewable energy developments in the European Union.

data-visualization energy-data european-union geopandas pandas plotly python

Last synced: 05 Nov 2024

https://github.com/gkar90/gdp-vs-life-expectancy

Statistical analysis on GDP vs Life Expectancy

data-science data-visualization statistical-analysis

Last synced: 12 Nov 2024

https://github.com/prsdthkr/viz-design-demo

📈 This repo houses my experiments with D3 (mostly inspired by other's work) for information visualization class project and demo.

d3 data-visualization lgbtq

Last synced: 16 Oct 2024

https://github.com/sayamalt/credit-card-approval-prediction

Successfully developed a machine learning model which can accurately predict up to 100% accuracy whether a credit card application of a given applicant would be approved or not, based on several demographic features such as applicant age, total income, marital status, total years of work experience, etc.

binary-classification cicd-deployment cross-validation data-exploration-and-preprocessing data-visualization exploratory-data-analysis feature-engineering hyperparameter-optimization machine-learning model-deployment model-retraining model-selection model-testing model-training-and-evaluation

Last synced: 07 Nov 2024

https://github.com/sayamalt/taxi-trip-fare-prediction

Successfully created a machine learning model which can accurately predict the fare of a taxi trip based on several features such as trip duration, tip amount, etc.

cross-validation data-exploration-and-preprocessing data-visualization exploratory-data-analysis feature-engineering hyperparameter-optimization machine-learning model-deployment model-selection model-training-and-evaluation regression-modelling

Last synced: 07 Nov 2024

https://github.com/sayamalt/company-bankruptcy-prediction

Successfully developed a machine learning model which can accurately predict whether a firm will become bankrupt or not, depending on various features such as net value growth rate, borrowing dependency, cash/total assets, etc.

binary-classification cicd-deployment cross-validation data-exploration-and-preprocessing data-visualization docker-container exploratory-data-analysis feature-engineering github-actions hyperparameter-optimization machine-learning model-deployment model-training-and-evaluation

Last synced: 07 Nov 2024

https://github.com/sayamalt/employee-attrition-prediction

Successfully established a machine learning model which can accurately predict whether an employee of a given company will leave it in the impending future or not, based on several employee details and employment metrics.

binary-classification continuous-deployment continuous-integration cross-validation data-exploration-and-preprocessing data-visualization exploratory-data-analysis feature-engineering hyperparameter-optimization machine-learning model-deployment model-training-and-evaluation

Last synced: 07 Nov 2024

https://github.com/sayamalt/life-expectancy-prediction

Successfully established a machine learning model which can accurately predict the expected life duration of a human being based on several demographic features such as alcohol consumption per capita, average BMI of entire population, etc.

cross-validation data-cleaning-and-preprocessing data-visualization docker end-to-end-pipeline exploratory-data-analysis feature-engineering github-actions-workflow hyperparameter-tuning machine-learning model-deployment model-training-and-evaluation

Last synced: 07 Nov 2024

https://github.com/sayamalt/concrete-strength-prediction

Successfully developed a machine learning model which can accurately predict the strength of cement based on various features such as blast furnace slag, water, coarse aggregate, etc.

cross-validation data-visualization exploratory-data-analysis feature-engineering hyperparameter-optimization machine-learning model-deployment model-training-and-evaluation regression-models

Last synced: 07 Nov 2024

https://github.com/sayamalt/superstore-sales-prediction

Successfully established a machine learning model that can accurately predict the sales of a superstore based on various features such as quantity, profit, discount, postal code, etc. The features are mainly associated with order details and customer demographics.

azure-machine-learning azure-web-app-service cicd-deployment cross-validation data-cleaning-and-preprocessing data-visualization exploratory-data-analysis feature-engineering github-actions-ci-cd hyperparameter-tuning machine-learning model-deployment model-retraining model-testing model-training-and-evaluation regression-models

Last synced: 07 Nov 2024

https://github.com/sayamalt/flight-price-prediction

Successfully established a machine learning model to accurately predict the price of a flight in India based on several features such as duration, days left, arrival time, departure time and so on.

data-visualization exploratory-data-analysis feature-engineering hyperparameter-tuning machine-learning model-deployment model-training-and-evaluation regression-models

Last synced: 07 Nov 2024

https://github.com/sayamalt/twitter-sentiment-analysis

Successfully established a machine learning model which can accurately classify the sentiment of any particular tweet into either positive, negative or neutral category.

data-visualization exploratory-data-analysis nlp sentiment-analysis supervised-learning text-processing

Last synced: 07 Nov 2024

https://github.com/sayamalt/titanic-survival-prediction

Successfully developed a Logistic Regression model for predicting the survival of a passenger aboard the Titanic ship based on his/her various features such as gender, age, passenger class, no. of siblings, embarkation location, etc.

data-cleaning data-preprocessing data-visualization exploratory-data-analysis logistic-regression machine-learning sklearn

Last synced: 07 Nov 2024

https://github.com/christs8920/process-mining-py

A process mining project that analyzes an event log and discovers its process model.

data-science data-visualization datavisualization pm4py process-mining processmining python

Last synced: 11 Nov 2024

https://github.com/sayamalt/quora-duplicate-question-pairs-identification

Successfully developed a machine learning model which can accurately detect whether any given pair of Quora questions are duplicate or not.

data-visualization machine-learning natural-language-processing nltk paraphrase-detection text-preprocessing

Last synced: 07 Nov 2024

https://github.com/sayamalt/house-price-prediction

Successfully created a regression model for predicting the price of any house, excluding enormous real estates and mansions, to a significant level of accuracy.

data-visualization exploratory-data-analysis feature-engineering feature-selection machine-learning regression-analysis regression-testing

Last synced: 07 Nov 2024

https://github.com/sayamalt/fraudulent-transactions-prediction

Successfully trained a machine learning model which can predict whether a given transaction is fraud or not.

data-visualization exploratory-data-analysis imblearn machine-learning model-based-testing model-building predictive-analytics sklearn

Last synced: 07 Nov 2024

https://github.com/sayamalt/customer-churn-prediction

Successfully established a machine learning model which can predict whether any given customer currently utilizing the products and services offered by a company will churn at anytime in the future or not, depending upon a set of unique features/characteristics pertaining to that specific individual, to a great level of accuracy.

classification data-visualization exploratory-data-analysis feature-engineering hyperparameter-tuning model-deployment model-evaluation model-optimization model-training supervised-machine-learning

Last synced: 07 Nov 2024

https://github.com/shreeparab1890/indian-elections-2019-analysis-eda

This ipython notebook is the Exploratory data analysis (EDA) of the Indian Lok Sabha Elections 2019.

data data-analysis data-science data-visualization eda exploratory-data-analysis matplotlib numpy pandas plotly python python3 visualization

Last synced: 08 Nov 2024

https://github.com/tanaybhadula/avacado-analytics

It is a dashboard built using Dash.It shows graphs of dataset about sales and prices of avocados in the United States between 2015 and 2018.

css dash data-analytics data-visualization python

Last synced: 12 Nov 2024

https://github.com/alexgenovese/react-charts-covid-19-data

Examples on COVID-19 data using different library charts: G2, G2Plot, Plotly, ApexCharts

data-analysis data-science data-visualization react reactjs

Last synced: 07 Nov 2024

https://github.com/stynw7/data_mining

Provides Computer Science material especially at Database Major which is Data Mining ⛏️⛏️

classification clustering data-mining data-visualization database outlier-detection r-programming

Last synced: 13 Oct 2024

https://github.com/adadalshabab/data-engineering-gcp-project

An end-to-end modern data engineering project, including deployment of ETL pipeline on Google Cloud Platform, using BigQuery for data analysis and leveraging Looker to generate an insight dashboard.

bigquery data data-science data-visualization databases dataengineering-a engineering etl-pipeline looker-studio powerbi

Last synced: 31 Oct 2024

https://github.com/moeabbas6/dbt_analytics_engine

An end-to-end project using dbt to demonstrate data transformations, testing, and visualization with Google BigQuery, and Looker Studio. It showcases a complete data pipeline from extraction/generation to deployment.

analytics-engineering bigquery data data-pipeline data-transformation data-visualization dbt testing

Last synced: 12 Oct 2024

https://github.com/robwiederstein/kytc_loc

Plot Kentucky licensing locations

data-visualization ggmap leaflet r xml2

Last synced: 06 Nov 2024

https://github.com/robwiederstein/covid-19-ky

Monitor US covid-19 cases w/ Johns Hopkins data

data data-visualization leaflet plotly r shell

Last synced: 06 Nov 2024

https://github.com/vbhatsaccnt/softdrinktrendsanalysis

A Tableau dashboard project providing comprehensive insights into soft drink sales trends, allowing for detailed analysis and informed decision-making within the beverage industry.

dashboard data-visualization food-products marketing tableau trend-analysis

Last synced: 10 Nov 2024

https://github.com/suryakaranraja/listview-application-for-zoho-interview

This repository has the files of basic application for designed to retrieve details about the top 50 happiest countries in the world for the year 2022.

data-visualization desktop desktop-app happiness-score interview-with-zoho list-view surface-tablet winui3

Last synced: 12 Oct 2024

https://github.com/faezeh-gholamrezaie/visual-google-scholar-search

A Python script that searches Google Scholar for specific keywords and visually presents the results in various chart formats, enabling researchers to analyze trends and insights in academic literature.

academic academic-research academic-trends ai ai-research bibliometrics data-analysis data-visualization google-scholar publication-analysis python research-trends scholarly scholarly-data word-cloud

Last synced: 07 Nov 2024

https://github.com/vidyadnina/cyclistic-sql-tableau-project

Trip data analysis for a bike-sharing service company using SQL and Tableau.

bigquery dashboard data-analysis data-analytics-sql data-cleaning data-visualization sql

Last synced: 12 Oct 2024

https://github.com/mlund2k/project-1-baseball-performance-vs.-attendance

Project assets for my first exploratory data analysis: Baseball Performance vs. Attendance.

bigquery data-analysis data-cleaning data-visualization excel rstudio sql tableau tidyverse

Last synced: 12 Oct 2024

https://github.com/ljadhav25/django-data-analyzer

Django Data Analyzer is a web application built using the Django framework, designed to streamline data analysis tasks. Users can upload CSV files containing data for analysis. The application utilizes the powerful data manipulation capabilities of Python libraries like pandas and numpy to perform various analyses on the uploaded data.

data-analysis data-visualization django-application matplotlib numpy pandas python seaborn

Last synced: 10 Nov 2024

https://github.com/leftcoastnerdgirl/visualizations_with_tableau

This projects gives the student an opportunity to demonstrate their skills in data visualization.

data-analytics data-visualization storytelling storytelling-with-data tableau

Last synced: 09 Nov 2024

https://github.com/alinababer/data-science-and-insight-agent-rag-llama3-lava-llm

Data-Science-and-Insight-Agent-RAG-LLama3-Lava-LLM-Django-WebApplication is an advanced AI-driven chatbot designed to assist in data science, document analysis, and image interpretation. This repository contain the Datascience Agent of this project.

artificial-neural-networks classifcation data-analysis data-engineering data-visualization datascience large-language-models llama2 lstm machine-learning python random-forest regression

Last synced: 19 Nov 2024

https://github.com/Sparsh7082/Data-Analysis-Portfolio

This repository is dedicated to showcasing my skills, sharing projects, and tracking my progress in Data Analytics and Data Science.

canva data-analytics data-manipulation data-visualization database-querying google-slides power-bi power-point presentation-tools programming-language python r spreadsheets sql tableau

Last synced: 23 Oct 2024

https://github.com/amitkaps/vaccines

India COVID Vaccines Status Visualisation

data-visualization

Last synced: 06 Nov 2024

https://github.com/tynandebold/daylight

Amount of daylight in select locations around the world.

data-visualization data-viz daylight javascript react time

Last synced: 13 Oct 2024

https://github.com/madeiradata/microsoft-data-analysts-club

Open-source Repository of Useful Scripts and Solutions for Microsoft Data Analysts

data-analysis data-visualization microsoft-data-analysis powerbi powerbi-report

Last synced: 06 Nov 2024

https://github.com/dmarks84/ind_project_obesity-multi-class-classification--kaggle

Independent Project - Kaggle Competition -- I worked on the obesity classification data set as part of a Kaggle Competition of the same name, scoring (for accuracy) above 0.9

classification correlation-analysis cross-validation data-modeling data-visualization dataframes eda gridsearchcv matplotlib multiclass-classification numpy pandas python seaborn sklearn statistics supervised-ml

Last synced: 05 Nov 2024

https://github.com/dmarks84/ind_project_data-science-london-scikit-learn--kaggle

Independent Project - Kaggle Competition -- I worked on the Data Science London data set for the Data Science London + Scikit-learn competition.

classification cross-validation data-modeling data-reporting data-visualization dataframes eda grid-search matplotlib numpy pandas python sklearn statistics supervised-ml

Last synced: 05 Nov 2024

https://github.com/dmarks84/ind_project_superstore-sales-time-series-analysis--kaggle

Independent Project - Kaggle Dataset-- I worked on the Superstore Sales Dataset, performing (as Part 1) data cleaning and preparation and exploratory data analysis. The main task was to make predictions for future sales based on time-series analysis, which is found in Part 2.

chloropleth data-modeling data-visualization eda linear-regression matplotlib numpy pandas python seaborn sklearn statistics statsmodels supervised-ml time-series-analysis

Last synced: 05 Nov 2024