Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-27 00:07:21 UTC
- JSON Representation
https://github.com/hevalhazalkurt/word_analyser
A web app developed in Python and Django that analyzes given text mathematically and sentimentally.
analyzer analyzes content data-analysis django emotion python python3 sentiment sentiment-analyser sentiment-analysis text text-analysis
Last synced: 19 May 2026
https://github.com/samuelpillai/machine-learning-classification-regression-nlp
A curated collection of machine learning mini-projects covering classification, regression, and natural language processing (NLP). This project demonstrates model training, evaluation, feature engineering, and pipeline integration using real-world datasets and Python tools like Scikit-learn, pandas, and NLTK.
classification data-analysis data-science data-visualization feature-engineering jupyter-notebook machine-learning ml-pipeline model-evaluation nlp python regression-models scikit-learn supervised-learning text-mining
Last synced: 30 Apr 2026
https://github.com/farhad-here/id_validator
Iranian National ID Validator. This was one of my data analysis project for the course i had.
data-analysis identity idverification object-oriented-programming oop oops-in-python python streamlit
Last synced: 30 Apr 2026
https://github.com/celineboutinon/laplace-immo
ENSAE-ENSAI Formation Continue (Cepe)/OpenClassrooms Data Analyst 2022-2023 - Projet 3
data-analysis data-analytics data-structures database-design database-schema databases mysql-connector-python mysql-workbench python sql
Last synced: 30 Apr 2026
https://github.com/shishirshekhar/diabetes-prediction
This is a early diabetes prediction web app
data-analysis data-visualization decision-tree-classifier machine-learning streamlit streamlit-application streamlit-dashboard streamlit-web streamlit-webapp visualization
Last synced: 30 Apr 2026
https://github.com/vipulbunny/ml-learning_projects
A collection of machine learning projects implemented in Python, showcasing core concepts like regression, classification, clustering, and model evaluation techniques. Ideal for learners and data science enthusiasts.
classification clustering data-analysis data-science data-visualization decision-trees jupyter-notebook machine-learning model-evaluation random-forest regression supervised-learning unsupervised-learning
Last synced: 23 Jul 2025
https://github.com/mfakhriazhar/nlp-movie-recommender-system
This project is a content-based movie recommender system built using Natural Language Processing (NLP) techniques. By extracting and combining important text features from movie metadata, this system suggests movies that are similar to a user's selected title.
data-analysis data-science deep-learning machine-learning natural-language-processing python recommender-system
Last synced: 30 Apr 2026
https://github.com/yankh764/revenue-data-analysis
A take home assignment of improving a revenue data pipeline
data-analysis docker python sql take-home-assignment
Last synced: 30 Apr 2026
https://github.com/celineboutinon/little-lemon
Meta Database Engineer Professional Certificate - Capstone Project
data-analysis data-analytics data-structures data-visualisation database-design database-schema databases mysql-connector-python mysql-workbench python sql tableau-dashboards
Last synced: 30 Apr 2026
https://github.com/mitchellharrison/mitchellharrison.github.io
Welcome to my slice of the internet, where I share the knowledge that Duke gave me, so you don't have to spend the mortgage-sized amount to access it. Built with R, Python, Quarto, and love.
ai algorithms-and-data-structures blog data-analysis data-science data-visualization educational machine-learning portfolio portfolio-website quarto r r-language statistics tutorials
Last synced: 30 Apr 2026
https://github.com/ddsuhaimi/turkiye-student-evaluation-eda
A little bit of exploration of well-known Turkiye Student Evaluation dataset
data-analysis data-science data-visualization-project exploratory-data-analysis exploratory-data-visualizations
Last synced: 18 Jun 2026
https://github.com/josewebdev2000/space-mission-data-analysis
Exploring space mission data and creating graphs in base of it.
csv data-analysis data-science data-visualization matplotlib matplotlib-figures matplotlib-pyplot pandas pandas-dataframe python
Last synced: 30 Apr 2026
https://github.com/simranjeet97/netflix-analysis-top-rated-_visualization_plotly
Netflix Data Analysis based on Age Based Ratings and Top Genres of 2021 of Movies - TV Shows along side Data Visualization
data-analysis data-science data-visualization database datascience datastructures deep-learning google google-cloud-platform machine-learning machine-learning-algorithms netflix netflix-dataanlysis netflix-dataset netflix-prize python3
Last synced: 10 May 2026
https://github.com/ahmedtaher10/covid-19-cases
The data we are using contains the data on covid-19 cases and their impact on GDP from December 31, 2019, to October 10, 2020.
data-analysis python visualization
Last synced: 30 Apr 2026
https://github.com/aniketmondal/dataanalysis
Contains cleaning, transformation, and exploratory analysis of various data sets using Python Pandas, NumPy, re, random, etc.
analysis data-analysis data-science pandas python
Last synced: 30 Apr 2026
https://github.com/abhi227070/ipl-2024-sold-player-data-analysis
This project analyzes IPL 2024 auctioned players' data, including name, team, cricket type, nationality, and price. Users input a player's name to access team, style, nationality, and auction price, aiding research and fantasy leagues. It offers insights into player dynamics, serving cricket enthusiasts with comprehensive data exploration.
data-analysis data-visualization dataanalytics machine-learning machine-learning-algorithms python3
Last synced: 30 Apr 2026
https://github.com/revtpark/teamseas_scrapper
Scraping Team Seas for data analysis and visualization.
chartjs data-analysis python webscraping
Last synced: 28 Mar 2025
https://github.com/busra-deveci/kaggle-iris_data_analysis
Exploratory data analysis and visualization of the Iris dataset using Python.
data-analysis iris-dataset kaggle pandas python seaborn visualization
Last synced: 30 Apr 2026
https://github.com/fbarffmann/credit-risk-classification
Classified 19,000+ loans as high-risk or healthy using logistic regression. Achieved 100% precision for healthy loans and 84% precision for high-risk loans.
classification credit-risk data-analysis logistic-regression machine-learning model-evaluation pandas python scikit-learn
Last synced: 30 Apr 2026
https://github.com/gitchaell/computer-scrapping
Tool that extracts data from the pages of companies that sell computers in the city of Trujillo - Peru, exports them in an XLSX file according to a relational data model, and displays them on a Power BI dashboard.
data-analysis data-structures data-visualization database dbdiagram export-excel powerbi scrapper-script scrapping xlsx
Last synced: 01 May 2026
https://github.com/badranalyst/e-commerce-customer-analysis-data-science-foundations-case-study
This case study explores e-commerce customer data through data exploration, pre-processing, and splitting. It includes model building and training to analyze customer behavior. Python libraries like Pandas, NumPy, Matplotlib, and Seaborn are used for the analysis and model development.
data-analysis data-science dataset eda exploratory-data-analysis machine-learning matplotlib ml model-building model-training numpy pandas pre-processing python seaborn
Last synced: 01 May 2026
https://github.com/siddharthbadal/youtubeapi-dataanalysis
YoutubeAPI-Data Analysis
data-analysis jupyter-notebook matplotlib pandas python seaborn
Last synced: 10 May 2026
https://github.com/devexpress-examples/wpf-pivot-grid-provide-custom-summary-values
This example demonstrates how to determine the value type when you calculate custom summary values in Pivot Grid for WPF.
data-analysis dotnet dxpivotgrid pivot-grid pivot-grid-for-wpf wpf
Last synced: 01 May 2026
https://github.com/syarwinaaa09/investigating-netflix-movies
🎬 investigating netflix movie trends using python and pandas 📊
csv data-analysis matplotlib netflix pandas visualization
Last synced: 01 May 2026
https://github.com/vasishta03/econovisionai
A simple Python desktop app to search and explore OECD economic data (CSV) and report summaries (TXT/JSON) using a modern CustomTkinter GUI—no SQL or web frameworks needed.
csv customtkinter data-analysis desktop-app economic-data gui json local-app oecd pandas python search tkinter
Last synced: 10 May 2026
https://github.com/fazatholomew/marlboroplan
In order to contribute to a more inclusive sustainable energy program in Massachusetts, this project is part of my work for a nonprofit organization called All In Energy and undergraduate thesis for my degree.
data-analysis data-visualization energy jupyter-notebook massachusetts python
Last synced: 01 May 2026
https://github.com/falakrana/data-analysis-visualization
This repository showcases data analysis and visualization projects using Python and Tableau. It includes exploratory data analysis, interactive dashboards, and insightful visual stories derived from real-world datasets.
data-analysis data-visualization python tableau-public
Last synced: 01 May 2026
https://github.com/zeh237/superstore-data-analytics
This is a Flask based data analytics project based on the superstore dataset using flask, pandas, sql and python
analytics data data-analysis data-science data-visualization flask python superstore
Last synced: 04 May 2025
https://github.com/devag2004/electricity-analysis-using-spark
electricity analysis project made using spark
data-analysis spark spark-mllib
Last synced: 01 May 2026
https://github.com/shruti-h/netflix-eda
Exploratory Data Analysis on Netflix Movies & TV Shows dataset using Python, Pandas, Matplotlib, and Seaborn
data-analysis data-science eda matplotlib netflix pandas-library python seaborn
Last synced: 01 May 2026
https://github.com/mmfava/lonomia-host-plants-2024
This project investigates the relationship between Lonomia achelous and Lonomia obliqua caterpillars and their host plants. The project uses Docker for a consistent environment and R for statistical analysis, with detailed processes documented in Jupyter notebooks.
data-analysis host-plants lonomia lonomism r
Last synced: 01 May 2026
https://github.com/deliprofesor/customerseg-customer-segmentation-and-shopping-analysis
This project performs data exploration, segmentation, and modeling of wholesale customer data using clustering algorithms, PCA, and decision trees to analyze purchasing behavior and predict customer channel preferences.
clustering customer-segmentation data-analysis data-visualization dbscan decision-tree gmm kmeans machine-learning pca
Last synced: 24 Jun 2025
https://github.com/cdeweyx/bryce-harper-2016-analysis
Notebook analyzing Bryce Harper's disappointing 2016 campaign in historical context through data analytics.
data-analysis data-visualization python
Last synced: 01 May 2026
https://github.com/yeopster/datascience_notebook
Compilation of my Notebook based on Kaggle Dataset
data-analysis data-science kaggle notebook python
Last synced: 10 May 2026
https://github.com/virajbhutada/music-store-data-analysis-sql
Hands-on SQL data analysis project for music store. Enhance proficiency with database queries. Ideal for practitioners seeking real-world analytics experience. Gain insights into customer behavior, revenue trends, and genre preferences, empowering strategic decision-making in the music industry. Explore the project for a rich learning experience.
data-analysis data-insights data-science database genre-prediction music-industry music-store postgresql postgresql-database query-optimization revenue-trends sql sql-queries
Last synced: 01 May 2026
https://github.com/scailfin/rob-webapi-flask
Default RESTful Web API implementation for the Reproducible Open Benchmarks for Data Analysis Platform (ROB) using the Flask web framework.
benchmarks data-analysis reproducibility webapi
Last synced: 17 Mar 2026
https://github.com/fbarffmann/python-api-challenge
Accessed and analyzed real-world weather and location data using Python and public APIs. Automated data collection, cleaned API responses, and visualized geographic trends to support business-ready insights.
api automation data-analysis data-visualization google-places-api mapping openweathermap-api pandas python weather-analysis
Last synced: 10 May 2026
https://github.com/httpsnooow/graphs-analysis-neo4j
Challenges from the "Neo4J - Data Analysis with Graphs" course by Digital Innovation One (DIO).
challenge data-analysis data-engineering data-science graph neo4j neo4j-database neo4j-graph
Last synced: 18 Jun 2026
https://github.com/ariyaarka/result-analysis
A simple analysis of result based on different factors shown in figures
data-analysis jupyter-notebook matplotlib numpy-library pandas-dataframe python seaborn
Last synced: 01 May 2026
https://github.com/bpkaur/a-network-analysis-of-game-of-thrones
A Network analysis of Game of Thrones: To analyze the co-occurrence network of the characters in the Game of Thrones books
data-analysis data-science machine-learning networkx python3
Last synced: 01 May 2026
https://github.com/dnut/associations
Python 3 library to identify high-dimensional statistical relationships in any data set.
analytics arch-linux association-rules data data-analysis data-mining data-science machine-learning python-modules
Last synced: 01 May 2026
https://github.com/filip-kustura/data-warehouse-olympics
This project, part of the elective Advanced Database Systems course, involved building a data warehouse based on the already existing database in PostgreSQL. It focuses on analyzing Olympic Games data across time, covering athletes' performance by discipline, location, and other dimensions. Implemented in Spring 2022.
data-analysis data-warehouse database extract-transform-load olympic-games postgresql sql star-schema university-project
Last synced: 01 May 2026
https://github.com/myounesdev/authorgraphanalyzer
a web-based visualization tool for analyzing and exploring author collaboration networks
algorithms binary-tree bts d3js data-analysis dijkstra-algorithm django exception-handling pandas python scss
Last synced: 08 Jun 2026
https://github.com/caesaredia/la-cafe-market-analysis
A data-driven feasibility study exploring the potential of launching a robot-staffed café in Los Angeles, based on real F&B business data.
business-intelligence cafe data-analysis data-visualization food-industry franchise los-angeles market-research pandas python
Last synced: 01 May 2026
https://github.com/sairupeshl/leo-orbital-congestion-analysis
Geospatial data analysis of the UCS Satellite Database using Python to map active LEO space assets, validate orbital parameters, and isolate mega-constellation traffic bottlenecks.
aerospace-engineering data-analysis geospatial-analysis orbital-mechanics pandas python satellite-data seaborn
Last synced: 08 Jun 2026
https://github.com/happybravo/ss4202_project
Space Astronomy project
astro astronomy astrophysics classification data-analysis data-science data-visualization galaxies machine-learning python quasar stars
Last synced: 10 May 2026
https://github.com/manjit-baishya-datascience/flipkart-laptop-listing-eda
This project analyzes laptop price data from Flipkart using AutoScraper for web scraping. It includes data loading, EDA, cleaning, statistical analysis, and visualization. The goal is to derive insights for pricing strategies and market positioning. Explore the repository for detailed documentation and code.
data-analysis ecommerce-platform flipkart laptop python
Last synced: 08 Jun 2026
https://github.com/pablo1785/receipt-rs
Receipt processing backend built with Shuttle.rs, Axum and Azure Form Recognizer API
api-rest axum azure backend cognitive-services computer-vision data-analysis rust shuttle-rs sqlx
Last synced: 01 May 2026
https://github.com/monish-nallagondalla/sensor_fault_detection
This repo contains sensor data for analysis, focusing on sensor readings, their attributes, and classification (Good/Bad). It includes 500+ sensors with features for predictive modeling, anomaly detection, and sensor failure prediction.
anomaly-detection classification data-analysis data-science machine-learning predictive-modeling python sensor-data
Last synced: 01 May 2026
https://github.com/shridhar1504/rafik-s-kitchen-data-analysis
The Project is about the Analysis of the Sales and Expenses Data of a Famous Fast-food Restaurant. This mainly focuses on gaining Insights that will boost the Future Sales and also Business Strategies it Improve the Profit Margins. Handled Tools are SQL, Python, Power BI, MS Office Tools.
business-analytics business-intelligence data-analysis data-analytics data-visualization eda ms-office powerbi-report powerpoint-presentations python sql-server
Last synced: 10 May 2026
https://github.com/nel-zi/city_logistics
Built an automated, scalable Azure cloud data infrastructure for City Logistics, integrating market trends to optimize operations and enhance decision-making.
azure azure-cloud-services data-analysis data-automation data-cleaning data-engineering data-transformation
Last synced: 01 May 2026
https://github.com/pratanup/solar-power-generation-prediction
A solar power generation company wants to optimize solar power production and needs the prediction model to predict ‘Clearsky DHI’, ‘Clearsky DNI’, ‘Clearsky GHI’.
anaconda data-analysis data-science google-colab jupiter-notebook machine-learning machine-learning-algorithms machinelearning-python prediction prediction-model python
Last synced: 01 May 2026
https://github.com/dhruwsunita/customer-churn-analysis
Customer Churn Analysis using panda library
data-analysis data-cleaning data-manipulation data-science pandas python3
Last synced: 01 May 2026
https://github.com/fbarffmann/project1
Analyzed factors influencing movie profitability using Python. Cleaned and visualized film industry data to uncover trends in budgets, sales, genres, and ratings.
box-office-analysis data-analysis data-visualization matplotlib movie-industry pandas python regression seaborn
Last synced: 01 May 2026
https://github.com/shibbir24/amazon-product-sales-data-analysis-trends-and-insights
Amazon Product Sales Data Analysis: Trends and Insights
amazon-dataset data-analysis matplotlib numpy pandas seaborn
Last synced: 01 May 2026
https://github.com/codesaadumair/data-science-monorepo
Comprehensive Data Science monorepo featuring EDA, Machine Learning, Preprocessing, Feature Engineering, and Visualization projects with Jupyter notebooks and Python.
data-analysis data-science data-science-projects data-visualization eda jupyter-notebook jupyterlab machine-learning python
Last synced: 01 May 2026
https://github.com/kavicastelo/soil-fertilizer-analysis-colab
This repository includes a data analysis and model training practical Jupyter notebooks using a soil fertilizer dataset. (use 4th edition)
data-analysis jupyter-notebook python
Last synced: 01 May 2026
https://github.com/linguini1/edueval
The BorealisAI Let's Solve It mentorship project: summarizing student feedback submissions on their professor into one cohesive paragraph for faculty consideration during performance reviews.
ai data data-analysis data-science machine-learning machinelearning nlp python pytorch sentiment-analysis
Last synced: 01 May 2026
https://github.com/abdoomohamedd/python-data-analysis-projects
A collection of data analysis projects that I have worked on. Each project involves cleaning data, performing exploratory data analysis (EDA), and creating visualizations to extract insights. The projects utilize various Python libraries, including pandas, numpy, matp
data-analysis data-analysis-python data-cleaning data-visualization matplotlib matplotlib-pyplot numpy pandas python
Last synced: 01 May 2026
https://github.com/shahaf-f-s/feature-space
A modular framework for combining pandas series features
data-analysis data-science feature-engineering
Last synced: 19 Jun 2026
https://github.com/macdon112/credit-card-fraud-detection
Comparing ML models (Random Forest, KNN, Decision Tree) for credit card fraud detection using SMOTE and stratified cross-validation.
classification data-analysis fraud-detection imbalanced-data machine-learning python scikit-learn
Last synced: 10 May 2026
https://github.com/nurulashraf/hierarchical-clustering-customer-segmentation
A customer segmentation project using hierarchical clustering to group customers based on their spending behaviour and demographics. This helps businesses identify patterns and create targeted marketing strategies.
business-analytics clustering-algorithm customer-segmentation data-analysis hierarchical-clustering machine-learning python unsupervised-learning
Last synced: 18 May 2026
https://github.com/rafath0ssain/predihome
Data analysis using economic factors affecting living conditions across Canadian provinces.
data-analysis data-visualization dplyr ggplot2 graph kaggle linear-regression prediction-model r shiny tidyr
Last synced: 01 May 2026
https://github.com/fer-aguirre/cookiecutter-data-analysis-extensive
A cookiecutter template for data analysis projects using Python.
cookiecutter data-analysis project-template python
Last synced: 09 Apr 2025
https://github.com/leftcoastnerdgirl/introduction_to_pandas
This project introduces the use of Python in a JupyterNotebook.
analytics budget-analysis budget-planner-tool budget-planning data-analysis dataframes jupyter-notebook pandas pandas-python python
Last synced: 01 May 2026
https://github.com/vetrivel07/flight-price-prediction
Developed a flight price prediction model using Python, analyzing historical data to forecast airfare prices and help travelers make informed booking decisions
data-analysis data-visualization jupyter-notebook numpy pandas python
Last synced: 15 Jun 2025
https://github.com/kheriberto/pandas_and_seabron_project
In this project I showcase my ability using pandas and seaborn to mold, transform and plot data.
data-analysis pandas python seaborn
Last synced: 01 May 2026
https://github.com/buildwithlal/introduction-to-data-science-in-python-coursera
introduction to data science in python, part of Applied Data Science using Python Specialization from University of Michigan offered by Coursera
data-analysis matplotlib numpy pandas
Last synced: 03 May 2026
https://github.com/celineboutinon/client-segmentation
CentraleSupélec/OpenClassrooms Data Scientist 2024-2025 - Projet 5
aws client-segmentation cloud-architecture data-analysis data-science data-visualization database e-commerce marketing marketing-analytics marketplace-solution
Last synced: 01 May 2026
https://github.com/imrandil/sql_practice_with_analysis
SQL practice using postgres db and docker as a tool to setup postgres, loving the sql way
data-analysis docker markdown postgres sql
Last synced: 10 May 2026
https://github.com/vedantshi/stock-price-prediction-for-maang-companies
This project utilizes Long Short-Term Memory (LSTM) networks to forecast stock prices. It includes steps for data preprocessing, model training, and visualization of predictions using Python in Jupyter Notebook. The project demonstrates proficiency in machine learning, data analysis, and Python programming.
data-analysis data-visualization lstm machine-learning python stock-price-prediction
Last synced: 01 May 2026
https://github.com/dgraves4/cms-hospital-quality-analytics
Python analytics project using CMS hospital quality data to clean, summarize, and visualize hospital ratings, reporting patterns, and facility characteristics.
cms-data data-analysis eda healthcare-analytics matplotlib pandas python
Last synced: 19 Jun 2026
https://github.com/guptakushal03/whatsapp-chat-analyser
The WhatsApp Chat Analyzer is a Python-based tool built with Streamlit for analyzing WhatsApp chat data. It provides insights such as total messages, word count, media shared, links shared, monthly activity timeline, most active users, activity maps, and word clouds.
chat-analysis data-analysis data-visualization python streamlit text-processing whatsapp word-cloud
Last synced: 01 May 2026
https://github.com/cassiofb-dev/fide-rating-analysis
The plot speaks for itself
chess data-analysis fide hans rating
Last synced: 15 Jun 2025
https://github.com/bhoyarapurva23399/mini-erp-inventory-billing
Lightweight ERP inventory and billing web app built using Python Flask and SQLite — featuring product, customer, and dashboard management.
backend data-analysis erp flask inventory-billing mini-project python sqlite
Last synced: 01 May 2026
https://github.com/ujjwalll/get-that-flair
It is a repository for project detecting the flair of reddit post through their links. You can find the working model of it at - https://get-that-flair.herokuapp.com/
data-analysis data-visualization django-application herokuapp machine-learning naive-bayes-classifier praw-reddit python3 random-forest reddit-api sentiment-analysis topic-modeling
Last synced: 01 May 2026
https://github.com/mateusoliveira30/top-intelligent-people
This project performs an exploratory analysis of the top_intelligent_people_in_the_world_5000.csv dataset, featuring some of the world's most intelligent individuals. Using pandas and matplotlib, the analysis includes checking for missing values, describing variables, and visualizing data.
data-analysis graphics kaggle-dataset python3
Last synced: 03 May 2026
https://github.com/sanikamal/machine-learning-atoz
Beginner-friendly machine learning tutorials and mini-projects.
collaborative-filtering data-analysis data-visualization decision-trees kmeans-clustering knn machine-learning machine-learning-algorithms recommender-system regression svm
Last synced: 08 Jun 2026
https://github.com/devexpress-examples/winforms-pivot-create-user-folders-within-the-customization-form
This example demonstrates how to organize the Customization Form fields in folders.
data-analysis dotnet pivot-grid-for-winforms winforms xtrapivotgrid-suite
Last synced: 10 May 2026
https://github.com/kineticloom/plydb-fun-nfl-analyst
Analyze NFL data with your AI agent
data-analysis football-analytics nfl
Last synced: 15 May 2026
https://github.com/bheemisme/icc-t20-world-cup-dashboard
2024 icc t20 world cup dashboard
dashboard data-analysis data-analytics data-science data-visualization matplotlib pandas seaborn
Last synced: 02 May 2026
https://github.com/srummanf/elnino-anomaly-study
Study on El Niño’s impact on Chennai groundwater sustainability
data-analysis machine-learning python satellite-imagery-analysis
Last synced: 15 May 2026
https://github.com/maxwelllzh/linearizer
Linearizing parameters for linear regression
data-analysis machine-learning scikit-learn
Last synced: 02 May 2026
https://github.com/more-joao/color-distance-luminance
Data analysis project that aims to establish a relation between the Canberra distance between white and any given color in the RGB colorspace and its luminance.
canberra-distance data-analysis luminance python r rgb
Last synced: 02 May 2026
https://github.com/harshindcoder/salifort_motors_project
This people analytics project analyzes factors influencing employee turnover and predicts whether an employee is likely to leave. It aims to uncover patterns behind departures, helping Salifort improve retention, workplace culture, and professional growth strategies.
data-analysis data-science data-visualization hr-analytics machine-learning tree-models
Last synced: 02 May 2026
https://github.com/waseemofficial/ml-practice
ML Practice
data data-analysis jupyter-notebook machine-learning ml python
Last synced: 02 May 2026
https://github.com/nishnash54/sidba
CSV to MongoDB with type conversion
csv-converter data-analysis mongodb statistics
Last synced: 02 May 2026
https://github.com/vikktor93/datascience-spotify
Analysis of Spotify dataset containing the top songs currently trending for over 70 countries.
data-analysis data-science data-scientist jupyter-notebook kaggle matplotlib pandas seaborn
Last synced: 10 May 2026
https://github.com/faithererer/haokanvideo_spider
好看视频爬取与数据分析
data-analysis data-visualization python spider
Last synced: 02 May 2026
https://github.com/adithya17-star/ai-powered-fraud-detection
An AI-powered fraud detection system using machine learning algorithms to identify suspicious transactions and provide interactive visualizations for financial security.
dashboard-visualization data-analysis finance-technology fintech flask fraud-detection machine-learning python security transaction-monitoring
Last synced: 02 May 2026
https://github.com/wwgolay/hr1099-timelapse-vlbi
The repository for HR1099 timelapse VLBI.
astronomy astrophysics data-analysis website
Last synced: 03 Apr 2025
https://github.com/shridhar1504/milk-production-time-series-forecasting-datascience-project
This project uses time series forecasting to predict future milk production. The data used in this project is monthly milk production data from January 1962 to December 1975. The ARIMA (autoregressive integrated moving average) model is used to forecast the milk production. The model is evaluated using various metric.
adf arima-model augmented-dickey-fuller-test data-analysis data-analytics data-science data-visualization eda exploratory-data-analysis machine-learning machine-learning-algorithms python python3 residuals sarimax seasonality time-series time-series-forecasting trends
Last synced: 02 May 2026
https://github.com/teja-1403/ignosis-tech-ml-assignment
Analysis of transaction data to identify the most profitable products and key customer segments, providing insights for targeted marketing strategies.
customer-segmentation data-analysis data-visualization machine-learning marketing-strategy python
Last synced: 02 May 2026
https://github.com/lucas54neves/financial-organizer
Financial organizer using Streamlit
data-analysis data-science financial-organizer plotly python streamlit
Last synced: 02 May 2026
https://github.com/shreeparab1890/unicorns-of-india-till-sep-2022-analysis-eda
This ipython notebook is the Exploratory data analysis (EDA) of the Unicorns of India till Sep 2022.
analysis data-analysis eda exploratory-data-analysis matplotlib-pyplot numpy pandas plotly
Last synced: 02 May 2026
https://github.com/greenpau/esqrunner
Run Elasticsearh queries and create metrics based on the result of the queries in Elasticsearch database.
data-analysis elasticsearch query-builder querydsl
Last synced: 10 May 2026
https://github.com/asadiahmad/word-counter-spark
Word counter with spark
data-analysis nlp spark word-counter
Last synced: 02 May 2026