Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/diegopino/publibdata_codexhackathon

Public Library Data processing/analysis codex hackathon attempt

data-analysis data-visualization libraries public

Last synced: 08 Nov 2024

https://github.com/sayamalt/credit-card-approval-prediction

Successfully developed a machine learning model which can accurately predict up to 100% accuracy whether a credit card application of a given applicant would be approved or not, based on several demographic features such as applicant age, total income, marital status, total years of work experience, etc.

binary-classification cicd-deployment cross-validation data-exploration-and-preprocessing data-visualization exploratory-data-analysis feature-engineering hyperparameter-optimization machine-learning model-deployment model-retraining model-selection model-testing model-training-and-evaluation

Last synced: 07 Nov 2024

https://github.com/sayamalt/taxi-trip-fare-prediction

Successfully created a machine learning model which can accurately predict the fare of a taxi trip based on several features such as trip duration, tip amount, etc.

cross-validation data-exploration-and-preprocessing data-visualization exploratory-data-analysis feature-engineering hyperparameter-optimization machine-learning model-deployment model-selection model-training-and-evaluation regression-modelling

Last synced: 07 Nov 2024

https://github.com/sayamalt/company-bankruptcy-prediction

Successfully developed a machine learning model which can accurately predict whether a firm will become bankrupt or not, depending on various features such as net value growth rate, borrowing dependency, cash/total assets, etc.

binary-classification cicd-deployment cross-validation data-exploration-and-preprocessing data-visualization docker-container exploratory-data-analysis feature-engineering github-actions hyperparameter-optimization machine-learning model-deployment model-training-and-evaluation

Last synced: 07 Nov 2024

https://github.com/sayamalt/employee-attrition-prediction

Successfully established a machine learning model which can accurately predict whether an employee of a given company will leave it in the impending future or not, based on several employee details and employment metrics.

binary-classification continuous-deployment continuous-integration cross-validation data-exploration-and-preprocessing data-visualization exploratory-data-analysis feature-engineering hyperparameter-optimization machine-learning model-deployment model-training-and-evaluation

Last synced: 07 Nov 2024

https://github.com/sayamalt/life-expectancy-prediction

Successfully established a machine learning model which can accurately predict the expected life duration of a human being based on several demographic features such as alcohol consumption per capita, average BMI of entire population, etc.

cross-validation data-cleaning-and-preprocessing data-visualization docker end-to-end-pipeline exploratory-data-analysis feature-engineering github-actions-workflow hyperparameter-tuning machine-learning model-deployment model-training-and-evaluation

Last synced: 07 Nov 2024

https://github.com/sayamalt/concrete-strength-prediction

Successfully developed a machine learning model which can accurately predict the strength of cement based on various features such as blast furnace slag, water, coarse aggregate, etc.

cross-validation data-visualization exploratory-data-analysis feature-engineering hyperparameter-optimization machine-learning model-deployment model-training-and-evaluation regression-models

Last synced: 07 Nov 2024

https://github.com/sayamalt/superstore-sales-prediction

Successfully established a machine learning model that can accurately predict the sales of a superstore based on various features such as quantity, profit, discount, postal code, etc. The features are mainly associated with order details and customer demographics.

azure-machine-learning azure-web-app-service cicd-deployment cross-validation data-cleaning-and-preprocessing data-visualization exploratory-data-analysis feature-engineering github-actions-ci-cd hyperparameter-tuning machine-learning model-deployment model-retraining model-testing model-training-and-evaluation regression-models

Last synced: 07 Nov 2024

https://github.com/sayamalt/flight-price-prediction

Successfully established a machine learning model to accurately predict the price of a flight in India based on several features such as duration, days left, arrival time, departure time and so on.

data-visualization exploratory-data-analysis feature-engineering hyperparameter-tuning machine-learning model-deployment model-training-and-evaluation regression-models

Last synced: 07 Nov 2024

https://github.com/smahala02/materials-science-image-analysis

Image analysis for materials science with a focus on particle diameter measurement and image scaling using Python.

data-visualization image-analysis materials-science particle-measurement python

Last synced: 08 Nov 2024

https://github.com/sayamalt/twitter-sentiment-analysis

Successfully established a machine learning model which can accurately classify the sentiment of any particular tweet into either positive, negative or neutral category.

data-visualization exploratory-data-analysis nlp sentiment-analysis supervised-learning text-processing

Last synced: 07 Nov 2024

https://github.com/smahala02/materials-science-data-analysis

Analysis of diffraction and spectrum data in materials science using Python for data visualization and interpretation.

data-visualization diffraction-analysis materials-science python spectrum-analysis

Last synced: 08 Nov 2024

https://github.com/sayamalt/titanic-survival-prediction

Successfully developed a Logistic Regression model for predicting the survival of a passenger aboard the Titanic ship based on his/her various features such as gender, age, passenger class, no. of siblings, embarkation location, etc.

data-cleaning data-preprocessing data-visualization exploratory-data-analysis logistic-regression machine-learning sklearn

Last synced: 07 Nov 2024

https://github.com/tom-draper/call-graph-viz

A tool to visualise Python function calls and code complexity.

call-graph call-graphs data-visualisation data-visualization function-calls graph python vizjs

Last synced: 11 Oct 2024

https://github.com/sayamalt/quora-duplicate-question-pairs-identification

Successfully developed a machine learning model which can accurately detect whether any given pair of Quora questions are duplicate or not.

data-visualization machine-learning natural-language-processing nltk paraphrase-detection text-preprocessing

Last synced: 07 Nov 2024

https://github.com/sayamalt/house-price-prediction

Successfully created a regression model for predicting the price of any house, excluding enormous real estates and mansions, to a significant level of accuracy.

data-visualization exploratory-data-analysis feature-engineering feature-selection machine-learning regression-analysis regression-testing

Last synced: 07 Nov 2024

https://github.com/sayamalt/fraudulent-transactions-prediction

Successfully trained a machine learning model which can predict whether a given transaction is fraud or not.

data-visualization exploratory-data-analysis imblearn machine-learning model-based-testing model-building predictive-analytics sklearn

Last synced: 07 Nov 2024

https://github.com/jofaval/sonar

Binary Classification of Sonar Signals of Rocks and Metal cylinders in 1987

data-analysis data-science data-visualization machine-learning python scikit-learn sonar uci

Last synced: 21 Oct 2024

https://github.com/sayamalt/customer-churn-prediction

Successfully established a machine learning model which can predict whether any given customer currently utilizing the products and services offered by a company will churn at anytime in the future or not, depending upon a set of unique features/characteristics pertaining to that specific individual, to a great level of accuracy.

classification data-visualization exploratory-data-analysis feature-engineering hyperparameter-tuning model-deployment model-evaluation model-optimization model-training supervised-machine-learning

Last synced: 07 Nov 2024

https://github.com/another-guy/use-d3

React hooks for D3.js data visualization library.

d3 d3js d3js-hook d3js-hooks data-visualization data-viz react react-hook react-hooks reactjs

Last synced: 27 Oct 2024

https://github.com/cluzier/crypto-price-dashboard

Shows current crypto prices and trade history

charts cryptocurrency data-visualization

Last synced: 07 Nov 2024

https://github.com/lucs1590/triathlon-dashboard

This is a repository that shows some graphics and makes a dashboard related to triathlon data.

angular dashboard data-visualization data-viz graphs plotly plotly-dash plotlyjs storytelling triathlon

Last synced: 11 Oct 2024

https://github.com/alexgenovese/react-charts-covid-19-data

Examples on COVID-19 data using different library charts: G2, G2Plot, Plotly, ApexCharts

data-analysis data-science data-visualization react reactjs

Last synced: 07 Nov 2024

https://github.com/adamspannbauer/twitch_packed_bar

Example using a packed barchart to visualize emote usage in a twitch.tv chat

chat data-visualization data-viz packed-barchart twitch

Last synced: 11 Oct 2024

https://github.com/corndogit/dataspaceart

A generative art project which generates stylized patterns from weather data

data-visualization python weather

Last synced: 07 Nov 2024

https://github.com/ianhaggerty/final-capstone

This represents the final capstone project in my HyperionDev Data Science (fundamentals) course. A dataset of Amazon customer reviews is analysed using natural language processing.

amazon data-analytics data-science data-visualization dataset matplotlib nlp nlp-machine-learning numpy pandas plotly reviews seaborn spacy tabulate textblob wordcloud

Last synced: 31 Oct 2024

https://github.com/jasoncobra3/whatsapp_chat_analyzer

WhatsApp Chat Analyzer is a powerful tool that provides insightful analytics from your WhatsApp conversations. Whether you're curious about your chatting habits, want to analyze group dynamics, or need to extract meaningful data from your conversations, this tool has got you covered!

data-analysis data-science data-visualization machine-learning streamlit streamlit-webapp whatsapp-chat whatsapp-chat-analyzer

Last synced: 31 Oct 2024

https://github.com/mishra-krishna/olympics-2024-analysis

Interactive dashboard for the Paris 2024 Olympics using Streamlit and Plotly. Explore medal counts, athlete stats, and event data. Containerized with Docker and deployed on Azure.

data-visualization olympics olympics-visualization streamlit streamlit-dashboard streamlit-webapp

Last synced: 10 Oct 2024

https://github.com/alexandrehiroyuki/bcc_endofcourseproject

End of course project for "Bases Computacionais da Ciência" class at UFABC.

data-science data-visualization jupyter-notebook python

Last synced: 17 Oct 2024

https://github.com/prakharagr2001/whatsappchatanalyzer

WhatsAppChat Analyzer is a Python-powered website leveraging the capabilities of NumPy, Pandas, and Streamlit. It provides comprehensive analysis of WhatsApp conversations, extracting valuable insights and statistics for a user-friendly interpretation of chat data.

chat-analysis data-visualization numpy pandas python streamlit whatsapp-chat

Last synced: 31 Oct 2024

https://github.com/shridhar1504/loan-clustering-datascience-project

This project uses Machine Learning to Cluster loan together based on their similarities. The project uses a dataset of loan application which includes information about the Loan amount and Balance. The project then use the clustering algorithm to group the loan together based on the similarities.

clustering-algorithm data-analysis data-science data-visualization datanalysis eda kmeans-clustering machine-learning python sql sql-server unsupervised-learning

Last synced: 05 Nov 2024

https://github.com/petitatelier/data-sets

A collection of data sets, to play with in visualization experiments

data-visualization dataset

Last synced: 08 Nov 2024

https://github.com/bdice/signac-micde-cnsccs-2018

Slides and demos for the MICDE CNSCCS Symposium, October 15, 2018

data-management data-visualization demo signac workflow-automation

Last synced: 03 Nov 2024

https://github.com/petitatelier/data-generators

A collection of data generators, to play with in visualization experiments

data-generator data-visualization

Last synced: 08 Nov 2024

https://github.com/robwiederstein/kytc_loc

Plot Kentucky licensing locations

data-visualization ggmap leaflet r xml2

Last synced: 06 Nov 2024

https://github.com/robwiederstein/covid-19-ky

Monitor US covid-19 cases w/ Johns Hopkins data

data data-visualization leaflet plotly r shell

Last synced: 06 Nov 2024

https://github.com/pejpero/neural_network_regression_and_classification

This repository contains neural network regression models built from scratch and using Keras for comparison. It visualizes training and testing performance, analyzing MSE, R², and decision boundaries. The project demonstrates learning techniques and optimization for regression tasks.

data-visualization keras machine-learning neural-network regression

Last synced: 03 Nov 2024

https://github.com/faezeh-gholamrezaie/visual-google-scholar-search

A Python script that searches Google Scholar for specific keywords and visually presents the results in various chart formats, enabling researchers to analyze trends and insights in academic literature.

academic academic-research academic-trends ai ai-research bibliometrics data-analysis data-visualization google-scholar publication-analysis python research-trends scholarly scholarly-data word-cloud

Last synced: 07 Nov 2024

https://github.com/chrispsang/customerchurnanalysis

Predicting customer churn using a RandomForestClassifier with detailed EDA, model evaluation, and visualization. Includes a Tableau dashboard for interactive insights.

customerchurn data-analysis data-visualization datapreprocessing machine-learning python scikit-learn tableau

Last synced: 10 Oct 2024

https://github.com/khanovico/python-stock-analyzer

This is a Webapp implemented by python and several data science frameworks, enabling online stock trend analyzing.

amcharts-js-charts data-analysis data-visualization flask javascript pandas python scikit-learn

Last synced: 03 Nov 2024

https://github.com/shridhar1504/foreign-exchange-rate-time-series-datascience-project

This project will use time series analysis to forecast the exchange rate between the euro and the US dollar. The project will use a variety of statistical techniques, such as ARIMA to model the data and forecast the exchange rate.

data-analysis data-preprocessing data-science data-transformation data-visualization eda exploratory-data-analysis foreign-exchange-rates machine-learning model-fitting predictive-modeling python3 time-series time-series-analysis

Last synced: 05 Nov 2024

https://github.com/shridhar1504/loan-classification-datascience-project

This project uses machine learning algorithms to predict the classification of loan status. The dataset is loaded and some transformation is done using SQL for getting a proper dataset with some valid informations.

classification data-analysis data-cleaning data-science data-visualization eda loan-prediction loan-status machine-learning predictive-modeling sql supervised-learning

Last synced: 05 Nov 2024

https://github.com/shridhar1504/power-bi-visualization-project

This repository contains Visualization Projects which is visualized through Power BI Software, by using the visualization we can gain multiple insights and strategies which helps to develop the business for gaining high profit margins and by the insights we can reduce the damages by accidents & calamities.

dashboard data-analysis data-cleaning data-science data-visualization exploratory-data-analysis microsoft-excel microsoft-power-bi microsoft-powerpoint powerbi powerbi-report powerbi-visuals powerpoint-slides

Last synced: 05 Nov 2024

https://github.com/shridhar1504/rafik-s-kitchen-data-analysis

The Project is about the Analysis of the Sales and Expenses Data of a Famous Fast-food Restaurant. This mainly focuses on gaining Insights that will boost the Future Sales and also Business Strategies it Improve the Profit Margins. Handled Tools are SQL, Python, Power BI, MS Office Tools.

business-analytics business-intelligence data-analysis data-analytics data-visualization eda ms-office powerbi-report powerpoint-presentations python sql-server

Last synced: 05 Nov 2024

https://github.com/zacoppotamus/cityshade

[WIP] Studies in GPU-based cartography

cartography data-visualization glsl

Last synced: 11 Oct 2024

https://github.com/wadeChriestenson/Main_Application

A Django application to host my personal resume.

data-analysis data-visualization django plotly python ui-design

Last synced: 23 Oct 2024

https://github.com/shridhar1504/milk-production-time-series-forecasting-datascience-project

This project uses time series forecasting to predict future milk production. The data used in this project is monthly milk production data from January 1962 to December 1975. The ARIMA (autoregressive integrated moving average) model is used to forecast the milk production. The model is evaluated using various metric.

adf arima-model augmented-dickey-fuller-test data-analysis data-analytics data-science data-visualization eda exploratory-data-analysis machine-learning machine-learning-algorithms python python3 residuals sarimax seasonality time-series time-series-forecasting trends

Last synced: 05 Nov 2024

https://github.com/amitkaps/vaccines

India COVID Vaccines Status Visualisation

data-visualization

Last synced: 06 Nov 2024

https://github.com/madeiradata/microsoft-data-analysts-club

Open-source Repository of Useful Scripts and Solutions for Microsoft Data Analysts

data-analysis data-visualization microsoft-data-analysis powerbi powerbi-report

Last synced: 06 Nov 2024

https://github.com/thesfinox/mltools

A collection of simple tools for data science and machine learning projects.

ai data-analysis data-science data-visualization logging machine-learning matplotlib neural-network python toolbox

Last synced: 06 Nov 2024

https://github.com/darkdk123/house-valuation-model

A Challenge Project in a Boot-Camp to create a ML Model to predict the prices of houses in Boston Massachusetts from multiple parameters Using Multivariable Regression.

data-analysis data-science data-visualization matplotlib-pyplot multivariate-regression predictive-modeling statistics

Last synced: 07 Nov 2024

https://github.com/darkdk123/handwashing-discovery-analysis

A Guided Project in a Boot camp to Analyse the Original Data used in the Discovery of Viruses & Hand Washing By Dr. Ignaz Semmelweis in Vienna General Hospital in the 1840s.

data-analysis data-science data-visualization matplotlib-pyplot numpy pandas plotly-python python seaborn-plots

Last synced: 07 Nov 2024

https://github.com/shridhar1504/sales-forecasting-datascience-project

Develop a data science project using historical sales data to build a regression model that accurately predicts future sales. Preprocess the dataset, conduct exploratory analysis, select relevant features, and employ regression algorithms for model development. Evaluate model performance, optimize hyperparameters, and provide actionable insights.

data-analytics data-cleaning data-science data-testing data-visualization forecasting-models machin model-evaluation model-fitting prediction predictive-modeling python3 regression-algorithms salesforecast sklearn-library supervised-learning

Last synced: 05 Nov 2024

https://github.com/athari22/investigating-netflix-movies-and-guest-stars-in-the-office

Apply basic Python skills in Introduction to Python and Intermediate Python by processing and visualizing film and television data.

data-analysis data-science data-visualization loop loops matplotlib matplotlib-pyplot netflix numpy office pandas python

Last synced: 08 Nov 2024

https://github.com/athari22/house_sales_in_king_count_usa

The idea of the project is to do a Data analysis in a Real Estate Investment Trust. The Trust would like to start investing in Residential real estate.

analysis data data-science data-visualization ibm ibm-watson linearregression machine-learning matplotlib numpy pandas sklearn-library

Last synced: 08 Nov 2024

https://github.com/lut-ful/e-commerce-sales-report

This dashboard provides a visual analysis of e-commerce sales data

data data-analytics data-science data-visualization power-bi statics

Last synced: 06 Nov 2024

https://github.com/lut-ful/credit-card-financial-report-dashboard

Visualizing and unlocking different KPI's of a Financials instution depending on credit card data using Power BI.

data-analytics data-visualization dax-query postgresql powe-bi python statics

Last synced: 06 Nov 2024

https://github.com/xre22zax/biodiversity---national-parks

National Parks Service about endangered species

data-analysis-python data-visualization ipynb python python3

Last synced: 11 Oct 2024

https://github.com/athari22/extracting-and-visualizing-stock-data

The idea of the project is to work for a new startup investment firm that helps customers invest their money in stocks.

amazon dashboard data-science data-visualization database ibm ibm-watson pandas python sql tesla visualization

Last synced: 08 Nov 2024

https://github.com/timjjting/nba-teams-pts

The NBA Teams’ Performance Tracking System is a project aims to help users to track the performance of NBA teams from 2000 to 2009.

bootstrap d3js data-visualization jquery nba nba-visualization

Last synced: 06 Nov 2024

https://github.com/timjjting/data-is-beautiful

Introductory slides to data visualization

data-visualization

Last synced: 06 Nov 2024

https://github.com/chrispsang/healthcare-dataanalysis

Analyze synthetic patient data to identify trends, improve healthcare delivery, and predict patient outcomes using machine learning models. Includes data exploration, preprocessing, model building, and visualizations.

data-analysis data-science data-visualization healthcare jupyter-notebook machine-learning python

Last synced: 06 Nov 2024

https://github.com/chigurakula-bs/forecasting-time-series-on-covid-19

COVID-19 is one of the most pandemic problems today, many countries suffer tohandle COVID-19 problems. Data mining techniques provide a good quality tool for improving manual analysis identification COVID19 cases per day, deaths per day and number of patients cured per day.

arima-model data-mining-algorithms data-visualization jupyter-notebook sarima-model

Last synced: 06 Nov 2024

https://github.com/mumtaz4118/nlp-course

Programming Assignments and Lectures for Stanford's CS 224: Natural Language Processing with Deep Learning

course data data-analysis data-analytics data-science data-visualization deep-learning education machine-learning natural-language-processing neural-network transfer-learning

Last synced: 07 Nov 2024

https://github.com/la-lo-go/whatsgraph

Easily visualize WhatsApp exported chats

data-visualization next nextjs react shadcn-ui whatsapp

Last synced: 10 Oct 2024

https://github.com/abdoomohamedd/python-data-analysis-projects

Welcome to the Python Data Analysis Projects repository! This repository contains a collection of data analysis projects that I have worked on. Each project involves cleaning data, performing exploratory data analysis (EDA), and creating visualizations to extract insights. The projects utilize various Python libraries, including pandas, numpy, matp

data-analysis data-analysis-python data-cleaning data-visualization matplotlib matplotlib-pyplot numpy pandas python

Last synced: 06 Nov 2024

https://github.com/xkomil/datasciencesummerstudy

I want to document in this repository all my studying data science topics

data-science data-visualization machine-learning meteostat numpy pandas seaborn sklearn streamlit-webapp

Last synced: 05 Nov 2024

https://github.com/ahmetzamanis/panelregressionbroadband

Panel regression analysis on the relationship between countries' broadband subscriptions, and GDP per capita, using time series data.

data-science data-visualization modeling panel-data panel-regression r rmarkdown time-series

Last synced: 07 Nov 2024

https://github.com/kruthiktr/crop-recommendation-system-using-machine-learning

A machine learning-based system recommending crops based on soil, climate, and environmental conditions to optimize agricultural yields.

ai-in-agriculture crop-recommendation data-visualization machine-learning prediction python python3 recommendation-system

Last synced: 18 Oct 2024

https://github.com/shuddha2021/stellar-candidate-selector

A sophisticated candidate selection algorithm leveraging multi-criteria analysis and machine learning to identify top software engineering candidates. This tool features flexible filtering, score adjustment, and detailed visualizations to streamline the recruitment process.

candidate-selection data-analysis data-visualization machine-learning pandas plotting-in-python python python-data-analysis recruitment scikit-learn

Last synced: 06 Nov 2024

https://github.com/shuddha2021/interactive-data-visualization-app

An interactive web application for visualizing data using Chart.js. Users can explore and analyze data through dynamic charts and customize their view

chart data-visualization event-handling interactive-ui javascript real-time-updates responsive-design web-development

Last synced: 06 Nov 2024

https://github.com/takk8is/datasetanalysiseda

A robust Python tool for comprehensive dataset analysis and machine learning model evaluation. This project automates the process of data preprocessing, exploratory data analysis (EDA), and predictive modeling, with a focus on handling common data inconsistencies.

analytics analyzer chart csv-files data-science data-visualization datascience dataset datasets davidccavalcante eda fjallstoppur graphics machine-learning python python3 takk-ag takk-design takk8is xlsx-files

Last synced: 18 Oct 2024