An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/mathyouf/kaggle-notebook-code

Code and images that I used in Kaggle notebooks, mostly for style and code clarity.

data-visualization kaggle

Last synced: 14 May 2026

https://github.com/balajimohan18/foreign-exchange-rate-time-series-datascience-project

This project uses time series analysis to forecast the exchange rate between the euro and the US dollar. It applies a variety of statistical techniques, such as ARIMA, to model the data and forecast the exchange rate.

data-analysis data-analytics data-preprocessing data-science data-transformation data-visualization eda exploratory-data-analysis foreign-exchange-rates machine-learning model-fitting predictive-modeling python3 time-series time-series-analysis

Last synced: 14 May 2026

https://github.com/ewels/contributor-graphs

Contributor timelines for any git or GitHub repo: a publication-ready SVG and an interactive HTML page

cli contributors data-visualization git github open-source rust svg timeline visualization

Last synced: 11 Jun 2026

https://github.com/mohamedmetwalli5/breastcancerdiagnosis

Breast cancer diagnosis using machine learning via the XGBoost Algorithm after visualizing the data set & exploring it.

cancer data-visualization machine-learning

Last synced: 11 Jun 2026

https://github.com/madebysan/timeline

A static film timeline for seeing when movies are set, from ancient history to imaginary futures.

cinema data-visualization film html-css movies static-site timeline tmdb vanilla-javascript

Last synced: 12 Jun 2026

https://github.com/adamspannbauer/twitch_packed_bar

Example using a packed barchart to visualize emote usage in a twitch.tv chat

chat data-visualization data-viz packed-barchart twitch

Last synced: 12 Jun 2026

https://github.com/shashwat9kumar/trends_in_a_country_on_twitter

Finding trending topics in each country on Twitter and visualizing them in a word cloud

data data-visualization trends tweepy twitter-api wordcloud

Last synced: 13 Jun 2026

https://github.com/stephenombuya/automation_scripts

A collection of Python scripts and tools designed to automate various tasks, improve productivity, and simplify repetitive actions. Each script is well-documented and serves a specific purpose, ranging from data visualization to smart home control.

automation-with-python data-visualization productivity python3 smart-home-automation webautomation

Last synced: 13 Jun 2026

https://github.com/luizassimoes/q5ga-latency-and-throughput

Quick 5G Analyser: PyQt5 software developed to help with simple graphical analysis and chart generation for ping and iperf3 tests.

data-analysis data-visualization pyqt5 python

Last synced: 13 Jun 2026

https://github.com/jatinnxn/diabetes-prediction

This repository showcases a machine learning model built to predict diabetes using a diabetes dataset. The project walks through data preprocessing, model training, and evaluation, offering a Decision Tree-based solution to classify individuals as diabetic or non-diabetic based on various health metrics. It also supports real-time predictions.

data-cleaning data-preprocessing data-visualization decision-tree-classifier machine-learning

Last synced: 13 Jun 2026

https://github.com/marielachirinosr/nyc-taxi-trip-exploration-2019-2020

Explores passenger behavior & impact of COVID-19 on NYC taxi industry (Q1 2019-2020).

bigquery data data-analysis data-visualization python sql tableau

Last synced: 15 Jun 2026

https://github.com/anderson-andre-p/uber-data-analysis

This repository contains a comprehensive data analysis project focused on Uber rides. The dataset used in this project is a spreadsheet obtained from Uber, containing data related to ride details, such as pick-up and drop-off locations, date and time of the ride, and the fare amount.

data-analysis data-science data-visualization python

Last synced: 15 Jun 2026

https://github.com/hanifheinrich/population-data-visualization

Implementation of data visualization for the population data of Nagari Tanjung Balik, Solok Regency, West Sumatra, using Streamlit

data-visualization python streamlit-dashboard

Last synced: 16 Jun 2026

https://github.com/joonarafael/ids-exercises

Repository to store the exercise submissions for the Introduction to Data Science course (University of Helsinki).

course-work data-science data-visualization jupyter-notebook university-assignment

Last synced: 16 Jun 2026

https://github.com/leftcoastnerdgirl/webscraping_and_beautifulsoup

This project uses Beautiful Soup to scrape data from a news website.

beautifulsoup data-visualization jupyter-notebook splinter webscraping

Last synced: 17 Jun 2026

https://github.com/tanmayborse/institionistic_fuzzy_approx_space

This model introduces a hybrid approach that utilizes rough sets on intuitionistic fuzzy approximation spaces for pre-processing and soft sets for post-processing, resulting in an effective decision-making solution.

data-cleaning-and-preprocessing data-science data-visualization decision-making fuzzy-logic

Last synced: 17 Jun 2026

https://github.com/mattsebastianh/Make-the-Other-Charts.-Silly-s-Ice-Cream-Shop-Project

Data Visualization with Matplotlib | Matplotlib Fundamentals | Silly's Ice Cream Shop Project

data-visualization matplotlib python

Last synced: 18 Jun 2026

https://github.com/mattsebastianh/Making-a-visual-argument--compare-grammy-win-records-Project

Data Visualization with Matplotlib | Making a Visual Argument in Matplotlib

data-visualization matplotlib python

Last synced: 18 Jun 2026

https://github.com/philippmeder/visdata

Useful python tools for data visualisation, e.g. 2D-profiles (also known as profile plots), comparison of measurements, or tables.

data-science data-visualization profile-plot python python-3 python3

Last synced: 18 Jun 2026

https://github.com/sebastianurdaneguibisalaya/enfermedades-fissal

Holistic analysis of care provided for rare diseases, orphan diseases, and transplants covered by FISSAL in Peru.

data-analysis data-visualization python

Last synced: 24 Feb 2025

https://github.com/pramodkondur/dataspark-end-to-end-dataanalytics

Cleaned the data, performed EDA, and stored it in MySQL. Queried and analyzed the data, uncovering opportunities to drive revenue growth and optimize operations, with a potential revenue growth of $30.03 million. Reported key insights using Power BI.

data-analysis data-visualization eda powerbi python sql

Last synced: 21 May 2026

https://github.com/arction/lcu-tutorial-areaseries-04

Tutorial for LightningChart .Net High-Performance Charting Library. Creating a 2D chart with multiple AreaSeries.

areaseries chart charting-library csharp data-visualization dotnet example lightning-chart performance tutorial visualization xy

Last synced: 12 Mar 2025

https://github.com/jigyasag18/ibm-power-bi-dashboard-project

IBM Power BI Dashboard Project is a data-driven analysis of employees using IBM's comprehensive dataset, providing insights into key factors contributing to employee turnover and enabling organizations to strategize effectively towards improved employee retention and satisfaction.

data data-visualization dataanalysis dataanalytics dataset datavisualisation datavisualization-project powerbi powerbi-dashboards powerbi-report powerbi-visuals powerbidashboard

Last synced: 07 Mar 2026

https://github.com/dinamohsin/toman-bikeshare-data-analysis-sql-power-bi

This project involves data analysis using SQL, Power BI, and CSV datasets to extract insights and visualize key business metrics.

csv-files data-analysis data-visualization database powerbi sql sql-server

Last synced: 22 Apr 2026

https://github.com/jainish-prajapati/solar-flare-prediction

This repository contains code and data for predicting solar flare energy ranges using machine learning, based on NASA's RHESSI mission data. It includes preprocessing of FITS files into a unified CSV dataset and implements models like Gradient Boosting, Random Forest, and Decision Tree classifiers, achieving accuracies up to 87%.

data-visualization machine-learning numpy pandas python scikit-learn solar-flare-prediction

Last synced: 30 Dec 2025

https://github.com/kimatudo3/atliq-hardware-dashboard

The AtliQ Hardware BI 360 Dashboard is a comprehensive business intelligence tool crafted to empower AtliQ Hardware with data-driven insights across various departments.

atliq dashboard data-engineering data-visualization database database-management datacleaning dax-query m mysql powerbi-desktop powerquery sql-server visualization

Last synced: 07 May 2026

https://github.com/femincan/d3-heat-map

My solution for the Visualize Data with a Heat Map project on FCC.

css3 d3js data-visualization html5 javascript

Last synced: 18 May 2026

https://github.com/arction/lcu-tutorial-barseries-05

Tutorial for LightningChart .Net High-Performance Charting Library. Creating a 2D chart with multiple BarSeries.

barseries charting-library csharp data-visualization dotnet example lightning-chart performance tutorial visualization xy

Last synced: 12 Mar 2025

https://github.com/yrohitha/customer-segmentation

Clean real online transaction data and apply the RFM technique to rank and group clusters, identify the best customers, and perform targeted marketing campaigns

data-cleaning data-science data-visualization datetime marketing-analytics pandas python3 user-segmentation

Last synced: 13 Mar 2025

https://github.com/arction/lcu-tutorial-3dsurfacegrid-09

Tutorial for LightningChart .Net High-Performance Charting Library. Creating a 3D chart with SurfaceGridSeries.

3d chart charting-library csharp data-visualization dotnet example lightning-chart performance surfacegrid tutorial visualization

Last synced: 12 Mar 2025

https://github.com/arction/lcu-tutorial-multipleaxes-03

Tutorial for LightningChart .Net High-Performance Charting Library. Adding Multiple Axes.

axes chart charting-library csharp data-visualization dotnet example layered-layout lightning-chart line performance tutorial visualization xy

Last synced: 12 Mar 2025

https://github.com/mindlessmuse666/iris-knn

This project demonstrates the k-nearest neighbors (KNN) algorithm for classifying the Iris dataset. It includes data loading, model training, performance evaluation, and visualization of the results using the Pandas, Scikit-learn, Matplotlib, Seaborn, and Plotly libraries.

algorithm classification data-analysis data-visualization iris-dataset knn lazy-learning machine-learning python scikit-learn

Last synced: 17 Aug 2025

https://github.com/ax-va/interactive-data-visualization-dale-2023

These examples of interactive data visualization in the browser using Flask and D3.js are compiled, with some modifications, from the book "Data Visualization with Python and JavaScript: Scrape, Clean, Explore, and Transform Your Data" by Kyran Dale, published by O'Reilly Media in 2023.

ax-va d3 d3-visualization d3js data-science data-visualization dataviz javascript python

Last synced: 13 Mar 2025

https://github.com/arction/lcu-tutorial-stockseries-06

Tutorial for LightningChart .Net High-Performance Charting Library. Displaying financial data with StockSeries.

chart charting-library csharp data-visualization dotnet example lightning-chart performance stockseries tutorial visualization xy

Last synced: 12 Mar 2025

https://github.com/mindlessmuse666/features-scaling

A project on feature scaling for the Iris dataset using Python, Pandas, Scikit-learn, Seaborn, and Plotly. It includes data visualization, the application of various scaling methods, and an evaluation of a logistic regression model's performance.

data-scaling data-visualization feature-engineering iris-dataset machine-learning pandas plotly python scikit-learn seaborn student-project

Last synced: 16 Jun 2025

https://github.com/cowboymrzamo2380/json-to-excel-converter

This repository provides a tool to convert JSON data to Excel format (.xlsx). It allows you to easily transform structured JSON data into a well-organized spreadsheet for better analysis and visualization.

automation-script automation-tools data-analysis data-converter data-export data-formatting data-tools data-visualization excel excel-automation excel-converter excel-tools json json-exporter json-parser json-processing json-to-csv json-to-excel programming-tools spreadsheet-tools

Last synced: 05 Apr 2025

https://github.com/myriamba/smart-grid-data-analysis-clustering

Exploratory Data Analysis of Smart Meter Data, Visualization, and Consumer Clustering for London Households.

clustering-algorithm data-analytics data-visualization eda unsupervised-learning

Last synced: 18 May 2026

https://github.com/memgonzales/lyrid-training

Compilation of the activities and projects given to probationary Lyrids of the Center for Complexity and Emerging Technologies (COMET) as partial requirement for promotion to cohort membership

data-analytics data-cleaning data-science data-visualization jupyter-notebook

Last synced: 05 Jul 2025

https://github.com/neelanjan-chakraborty/data-science-projects

This repository showcases all the notebooks and visualizations from my academic journey. 🎉 It's a treasure trove of projects covering diverse subjects. From 🧪 science experiments to 📊 data analysis, 🖥️ coding challenges to 📝 research papers, it's all here! Explore the code, dive into the visuals, and witness my academic growth. 🌱

dashboard dashboards data-science data-visualisation data-visualization matplotlib plotly powerbi powerbi-visuals pyplot python

Last synced: 21 Jul 2025

https://github.com/jatin-mehra119/insurance_dataset

The objective of this project is to predict insurance charges based on various factors.

data-visualization dataanalysis prediction-model python regression-models

Last synced: 15 May 2026

https://github.com/janashanaa/flightanalysis

This Jupyter Notebook presents an exploratory data analysis of data derived from a flight booking website.

data-analysis data-visualization exploratory-data-analysis jupyter-notebook python

Last synced: 15 May 2026

https://github.com/fatihilhan42/cyclistic_bike_share_data_analysis

This repo contains the Google Data Analytics Capstone - Case Study 1 project, which is the final stage of the Google Data Analytics course on Coursera. The description of the code and analysis is posted on my Kaggle account. I hope this repo will help everyone who wants to do this project. Thanks.

bike-share capstone-project cyclistic data-science data-visualization google rprogramming

Last synced: 25 Jun 2025

https://github.com/karthikarajagopal44/data-analysis-using-python-libraries-

The COVID-19 pandemic has significantly impacted India, necessitating a detailed analysis of the virus’s spread within the country. In this project, we explore an India-specific COVID-19 dataset, leveraging Python libraries such as Pandas, NumPy, Matplotlib, and Seaborn.

data-cleaning data-visualization matplotlib numpy pandas python python3 scikit-learn seaborn

Last synced: 07 Apr 2026

https://github.com/autumnchris/treemap-sales-diagram

A D3.js treemap built in React.js that presents the top 100 sold video games grouped by their associated gaming platform.

babel css3 d3 d3-js d3js data-visualization freecodecamp javascript react reactjs sass scss treemap treemap-diagram treemap-diagram-challenge treemap-sales-diagram webpack

Last synced: 07 Apr 2026

https://github.com/satyacoder29/comparison-of-region-based-sales-tableau

The region-based sales comparison analyzes sales performance across different regions. It identifies trends, top-performing regions, and areas needing improvement by comparing metrics like revenue, growth rate, and product demand. This analysis helps optimize sales strategies and resource allocation for better performance.

data-analysis data-cleaning data-collection data-visualization powerquerym relationships tableau tableau-desktop unions

Last synced: 02 Feb 2026

https://github.com/hudson-newey/ecoacoustic-analysis-pipeline

A generalised pre-processing, metadata extraction, and analysis pipeline

data-visualization environment-variables pipeline

Last synced: 29 Apr 2025

https://github.com/arction/lcjs-example-0804-meshcircle

A demo application showcasing LightningChart JS IntensityMesh series.

chart data-visualization heatmap intensity lcjs lightningchart-js

Last synced: 12 Mar 2025

https://github.com/arction/lcjs-example-0908-3drealtimepoints

A demo application showcasing LightningChart JS to display 3D Scatter chart in real-time.

3d-visualization chart data-visualization lcjs lightningchart-js scatter-plot

Last synced: 12 Mar 2025

https://github.com/ljadhav25/django-data-analyzer

Django Data Analyzer is a web application built using the Django framework, designed to streamline data analysis tasks. Users can upload CSV files containing data for analysis. The application utilizes the powerful data manipulation capabilities of Python libraries like pandas and numpy to perform various analyses on the uploaded data.

data-analysis data-visualization django-application matplotlib numpy pandas python seaborn

Last synced: 01 Mar 2026

https://github.com/lucycatherine/healthinsuranceproject

This repository contains a machine learning project that analyzes the factors influencing health insurance charges, such as age, smoking status, and medical conditions.

data-analysis data-science data-visualization jupyter-notebook machine-learning python

Last synced: 18 May 2026

https://github.com/abdul-aa/kickstarters

Predictive Modeling and Clustering Insights for Kickstarter Success

boosting-ensemble clustering clustering-analysis data-visualization gradient-boosting kprototypes python shap

Last synced: 15 May 2026

https://github.com/vavarm/data-analysis-french-electric-automobile-infrastructure

Data analysis carried out in R Shiny and Python on the French electric vehicle and charging station infrastructure

data-analysis data-science data-visualization factominer geojson ggplot2 plotly python r rshiny

Last synced: 15 May 2026

https://github.com/sreyashidey/scrape-analyze-visualize

A project for web scraping, data analysis, and visualization using Selenium, BeautifulSoup, and Python.

bs4 data-visualization selenium

Last synced: 03 May 2026

https://github.com/gui-sitton/y.music

In this project I compared the musical preferences of the citizens of Springfield and Shelbyville. I examined real Y.Music data to test hypotheses and compare the behavior of users in these two cities.

data data-analysis data-analysis-python data-science data-visualization python

Last synced: 18 May 2026

https://github.com/niniola-creator/niniola-creator

This is a repository that I have created to show my skills, share my projects and track my progress in my data science/web development journey.

bootstrap5 css3 data-analysis data-science data-visualization database html5 javascipt javascript matplotlib pandas powerbi python spreadsheets sql

Last synced: 07 Apr 2026

https://github.com/satyacoder29/crm-analytics

CRM Analytics Dashboard – An interactive dashboard using Tableau, SQL, and Salesforce CRM Analytics (CRMA) to analyze sales performance, customer segmentation, and churn prediction. Features automated ETL pipelines, predictive analytics, and real-time insights for data-driven decision-making. 🚀📊

advanced-excel data-analysis data-cleaning data-collection data-transformation data-visualization matplotlib numpy pandas powerbi python seaborn sql tableau

Last synced: 03 Mar 2025

https://github.com/gfav-cybergeek/prodigy_ml_01

A linear regression model to predict house prices based on square footage, number of bedrooms, and bathrooms. Includes feature engineering, preprocessing, and model evaluation.

ai airtificialintelligence algorithms algorithms-and-data-structures data-structures data-visualization jupyter jupyter-notebook jupyterlab machine-learning machine-learning-algorithms machine-learning-models python

Last synced: 05 Apr 2025

https://github.com/cyprianfusi/world-happiness-report-for-2015-2019

World Happiness Report for 2019 with strange and unexpected results for Sub-Sahara African Countries! But it's data speaking...

data-visualization pandas-python

Last synced: 21 Mar 2025

https://github.com/piras-s/tuningcurvesnestedbayesianinference

Bayesian inference of neural tuning curves using nested sampling (PyMultiNest), with theory, simulation, and diagnostic visualizations.

bayesian-inference data-visualization machine-learning model-evaluation nested-sampling neuroscience pymultinest python3 simulation

Last synced: 18 May 2026

https://github.com/cyprianfusi/uk-covid-19-data-via-opendata-api

With recommendation to the UK government to halt all mandatory testing! Tests should only be conducted on patients as part of diagnosis and treatment. This is because with low prevalence of the disease most positive test results are false positives. This is due to irreducible error in the test.

api covid-19 data-visualization pandas-python uk

Last synced: 21 Mar 2025

https://github.com/danzed1/health-ai-assistant

🩺 Deliver personalized health advice using AI, providing instant, accurate responses to wellness questions in a user-friendly application.

ai ai-assistant andriod app-backend apple-health cv data-visualization datascience doctor dspy-ai healthcare healthcare-application ios llm patient promptql react-native whatsapp

Last synced: 15 May 2026

https://github.com/cyprianfusi/new-york-city-public-schools-and-sat-scores

One of the most controversial issues in the U.S. educational system is the efficacy of standardized tests and whether they're unfair to certain groups. We could correlate SAT scores with factors like race, gender, income, and more.

data-analysis-python data-cleaning data-visualization data-wrangling

Last synced: 21 Mar 2025

https://github.com/melih0132/projects

This repository showcases projects from my computer science journey, covering technologies like web development and interactive applications.

csharp data-visualization database game-development html-css javascript python software-development unity web-development

Last synced: 27 Mar 2025

https://github.com/yujonglee/seoul-ultrafinedust-visualization

Seoul Ultrafine-dust Visualization using Open data.

data-visualization fine-dust perl5 processing

Last synced: 15 May 2026

https://github.com/oshinrathor/data-science-systems-and-analytics-projects

Dive into my Data Science Projects Repository, featuring a Spam SMS Classifier, NIA Dashboard, H1N1 Vaccine Prediction, and NYC Taxi Fare Prediction. Each project showcases my skills in data cleaning, exploratory analysis, modeling, and visualization, offering valuable insights and methodologies for data enthusiasts and practitioners.

dashboard data-analysis data-driven-decisions data-presentation data-science data-visualization dataexploration eda insights nia webanalytics

Last synced: 02 Mar 2025

https://github.com/cosmoduende/r-ggcats

StrangeR things: Adding… Cats? to your plots in R. How to analyze and visualize data with the help of funny cats with the "ggcat" package.

data-analysis data-analytics data-science data-visualisation data-visualization data-viz dataviz ggcats r-language r-library r-package r-programming r-scripts r-studio rstats rstudio

Last synced: 22 Jul 2025

https://github.com/radhikareddy-chintareddy/big-data-analysis-ny-weather-air-quality-2022

End-to-end workflow showcasing database setup, API development, and interactive data retrieval of large datasets. Includes integration and analysis of 2022 SURFACE HOURLY weather data (global, US, and NY) merged with NY air pollution data from the EPA to uncover actionable insights.

big-data-analytics data-integration data-visualization flask-restful jupyter-notebook pymysql python

Last synced: 18 May 2026

https://github.com/vit0r/trino-datavirtualization

Trino POC with several catalogs: MariaDB, PostgreSQL, MongoDB, and MinIO

data-visualization

Last synced: 07 Mar 2026

https://github.com/barrettotte/anilist-ml

Training a binary classifier model to predict if I would recommend an anime using my Anilist user data.

anilist binary-classification data-visualization machine-learning scikit-learn

Last synced: 15 May 2026

https://github.com/cyberoctane29/deutsche-bank-customer-churn-prediction-end-to-end-analysis-and-modeling

In this project, I aim to predict customer churn for Deutsche Bank using supervised machine learning. It involves data exploration, feature engineering, and building Naive Bayes, Decision Tree, Random Forest, and XGBoost models. Models are tuned, evaluated, and compared to identify the best approach for churn prediction.

bank-customer-churn churn-analysis churn-prediction customer-churn-analytics data-analysis data-analytics data-visualization decision-tree eda gaussian-naive-bayes machine-learning random-forest supervised-learning xgboost

Last synced: 11 Oct 2025

https://github.com/kylemit/livedataisbeautiful

A casual attempt at data visualizations

data-visualization highcharts

Last synced: 20 May 2026

https://github.com/andersoncrs/analisis-de-texto-tweets

In this project I explore text analysis of tweets to uncover trends, opinions, and relevant topics on social media. Using natural language processing tools, I turn large volumes of messages into clear, visually appealing information.

data-analysis data-visualization eda text-mining

Last synced: 21 Jul 2025

https://github.com/nadamarei/data-analyzer

The Qualitative Data Analysis Tool is a Streamlit application designed for researchers to analyze word frequencies in corporate documents. This tool processes PDF reports, identifies target words and their contextually relevant synonyms, and generates comprehensive reports with document statistics, summary analysis, and per-file breakdowns.

data-analysis data-visualization python-3 streamlit

Last synced: 18 May 2026

https://github.com/ddihora1604/iitk_task

A comprehensive financial data analysis system that collects, processes, and analyzes data from approximately 500 tickers in the S&P Global Index. It provides detailed financial information, ESG metrics, and various financial statements for comprehensive market analysis.

beautifulsoup4 data-analysis data-visualization datamodelling dataset esg machine-learning python yahoo-finance

Last synced: 29 Oct 2025

https://github.com/nadahamdy217/skincaresentinel

This project analyzes customer feedback for skincare products by predicting sentiment using an unsupervised model. It includes a web application for real-time sentiment analysis, an ETL pipeline built with Azure Data Factory, Azure Databricks, and Azure Synapse Analytics, and a Power BI dashboard for visualizing review trends.

azure customer-feedback data-engineering data-science data-visualization database databricks etl-pipeline flask machine-learning powerbi python sentiment-analysis synapse-analytics unsupervised-learning web-application

Last synced: 07 Apr 2026

https://github.com/j5py/py4e

Python for Everybody Specialization (from University of Michigan on Coursera).

api data-visualization json python sql sqlite xml

Last synced: 05 May 2026

https://github.com/pronzzz/diabetes-prediction

Diabetes prediction using a KNN model and Pima Indian Diabetes Dataset

data-analysis data-manipulation data-preprocessing data-visualization knn machine-learning outlier-detection seaborn

Last synced: 13 Apr 2025

https://github.com/cyberoctane29/salifort-motors-predicting-employee-turnover-and-improving-retention-analysis-and-modeling

In this project, I work as a data analytics professional at Salifort Motors, a fictional leader in alternative energy vehicles. I analyze employee survey data to identify turnover drivers and build predictive models, including multiple logistic regression, decision trees, and random forests, to forecast attrition and support retention strategies.

data-analytics data-visualization decision-trees eda employee-attrition ethical-artificial-intelligence feature-engineering logistic-regression machine-learning random-forest regression-analysis statistical-analysis supervised-learning turnover-analysis

Last synced: 09 Jul 2025