Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-07-02 00:07:33 UTC
- JSON Representation
https://github.com/marielachirinosr/hotel-data-analysis
Pandas & Matplotlib Learning Analysis. Repository featuring data analysis projects using Pandas and Matplotlib libraries
data data-analysis matplotlib pandas python
Last synced: 25 Apr 2026
https://github.com/devexpress-examples/winforms-create-a-custom-exporter-for-pivotgridcontrol-with-xtrareport
This example illustrates how to dynamically create a custom report based on PivotGridControl content in WinForms.
data-analysis dotnet pivot-grid pivot-grid-for-winforms winforms
Last synced: 26 Apr 2026
https://github.com/ibrahimhabibeg/national-university-of-singapore-sms-analysis
Analysis of SMS messages collected by the National University of Singapore
analytics data-analysis data-science nlp python
Last synced: 13 May 2026
https://github.com/rociobenitez/happiness-index-data-processing
Repository for Big Data Processing - Contains Jupyter Notebooks and Datasets for data analysis and processing tasks related to Big Data.
big-data big-data-processing data-analysis data-processing happiness-index happiness-report jupyter-notebook matplotlib pandas seaborn
Last synced: 15 May 2026
https://github.com/deliprofesor/cinematic-data-analytics-and-recommendation-platform
This project analyzes a movie dataset using machine learning algorithms to predict success, explore revenue-popularity relationships, and develop recommendation systems. It employs techniques like K-Means, DBSCAN, GMM, decision trees, PCA, and NLP for insights and personalized suggestions.
clustering content-based-recommendation data-analysis data-visualization decision-tree gmm k-means machine-learning natural-language-processing nlp pca predictive-modeling python recommendation-system scikit-learn user-based-recommendation
Last synced: 26 Apr 2026
https://github.com/moshora99/sql-data-warehouse-project
Build modern data warehouse with mysql, Including ETL processes, data modeling and analytics
data-analysis data-engineering data-science database datawarehouse datawarehousing etl scheme sql sql-query sql-server
Last synced: 27 Apr 2026
https://github.com/rupashi03/fitbit-user-eda-case-study
Performed Exploratory Data Analysis (EDA) on Fitbit users' data to uncover trends in activity and health metrics.
business-analysis case-study consumer-insights data-analysis exploratory-data-analysis health-data r user-behavior-analytics
Last synced: 25 Mar 2025
https://github.com/parsabordbar/ctx3docs
The Documentation for context Tree Project.
ai-tools context ctx3 ctx3-docs data-analysis documentation tree workflow
Last synced: 25 Jun 2026
https://github.com/deliprofesor/joblocationmapper
JobLocationMapper is a Python tool that visualizes job listings on an interactive map. It uses city and state data to place job markers accurately and color-codes them by occupation (Software, Marketing, Design). The map clusters markers for better organization, and users can click on them to view job details.
clustrered-markers data-analysis data-visualization folium geocoding geographical-visualization interactive-map job-listings map-visualization pandas python
Last synced: 14 May 2026
https://github.com/hfzdzakii/dicoding-airqualityanalysisdata
This repo is a master submission for my Dicoding Final Project. Air Quality Dataset is being used to fulfill the submission. Feel free to explore and I hope my work give you some insight!
data-analysis data-visualization streamlit
Last synced: 27 Apr 2026
https://github.com/yeonjaee/data-analytics
converts raw data into actionable insights
Last synced: 11 Jun 2026
https://github.com/satvikpraveen/matplotlibmasterpro
📷 MatplotlibMasterPro is a complete, portfolio-ready project to master data visualization using matplotlib. Includes 16 notebooks, real datasets, exportable plots, custom themes, Streamlit dashboard, and Docker support. Ideal for learners and data professionals.
charts custom-plots dashboarding data-analysis data-science data-visualization educational-project interactive-visualizations jupyter-notebook matplotlib notebooks open-source plotting portfolio-project python python-utilities reproducible-research subplots time-series-analysis visualization-tools
Last synced: 14 May 2026
https://github.com/sferez/simple_linear_regression
Simple Linear Regression using Python
data-analysis data-science linear-regression python regression
Last synced: 28 Apr 2026
https://github.com/1401dev/iowa-liquor-retail-sales-analysis
This repository contains the analysis of Iowa liquor retail sales data, aimed at uncovering sales trends and forecasting future sales patterns. The project involves data cleaning, preparation, and advanced time series analysis using Microsoft SQL Server and Google Colab.
customer-behavior data-analysis data-cleaning data-science data-visualization exploratory-data-analysis forecasting google-colab machine-learning microsoft-sql-server pandas prophet python retail-analytics retail-sales sales-forecasting sales-performance sql statsmodels time-series-analysis
Last synced: 16 Feb 2026
https://github.com/yashsingh43/cdc-sleep-duration-health-analysis
Analysis of CDC BRFSS 2022 data exploring how sleep duration relates to mental and physical health outcomes.
beautifulsoup brfss cdc data-analysis data-visualization matplotlib pandas plotly public-health python
Last synced: 11 Jun 2026
https://github.com/stefagnone/movies-dataset-analysis-project
Comprehensive analysis of the Movies dataset, exploring genre trends, comparisons, and qualitative insights using Python, Pandas, and visualizations. Designed to uncover actionable findings for stakeholders.
data-analysis data-visualization exploratory-data-analysis matplotlib movies-analysis pandas python seaborn storytelling-with-data
Last synced: 28 Apr 2026
https://github.com/prakhargpt/sql-data-warehouse-project
Building Data Warehouse project using SQL Server, including ETL processes, data modelling and analytics.
analytics data data-analysis data-cleaning data-engineering data-engineering-pipeline data-lakehouse data-science data-warehouse etl etl-job etl-pipeline medallion-architecture sql sql-server
Last synced: 12 Jun 2026
https://github.com/emmanuelletocs/steam-game-recommender
A powerful recommendation system for Steam games, combining Content-Based and Collaborative Filtering techniques. Built with Python, Scikit-learn, and Streamlit to deliver accurate, real-time game recommendations. Perfect for gamers and data scientists interested in building intelligent recommendation engines.
als-algorithm data-analysis gaming-industry knn machine-learning mds mysql ncf neural-network pyspark recommendation-engine recommendation-system scikit-learn spark
Last synced: 28 Apr 2026
https://github.com/sufyan14/weather-data-analysis
A Streamlit dashboard that forecasts 30-day weather trends using uploaded CSV data and Facebook Prophet.
data-analysis python streamlit
Last synced: 28 Apr 2026
https://github.com/delonnewman/relational
Relational programming for Ruby
csv csv-import data data-analysis database export json relational relational-algebra relational-database relational-model relational-programming reporting reports ruby yaml
Last synced: 28 Apr 2026
https://github.com/matheusafonseca/python-data-visualization-matplotlib-seaborn-masterclass-udemy
This repository is dedicated to storing the code developed during the "Python Data Visualization: Matplotlib & Seaborn Masterclass" course on Udemy.
charts data-analysis data-analysis-python data-science data-visualization database graphics graphics-programming jupyter-notebook matplotlib matplotlib-plots python python3 seaborn seaborn-plots
Last synced: 28 Apr 2026
https://github.com/prgermux/data-plotter
This Python application provides a graphical user interface (GUI) for analyzing and visualizing data from various sources. It uses the PyQt5 framework for the GUI and Matplotlib for plotting data. The application supports multiple file formats, allows users to select any columns for the X and Y axes, and provides dynamic plots.
automation data-analysis plott python
Last synced: 12 Jun 2026
https://github.com/techshot25/blackfriday
Shopping behavior data analysis
consumer data-analysis data-science gradient-descent machine-learning normal-equation regression-analysis shopping shopping-habits statistics
Last synced: 12 Jun 2026
https://github.com/neelimabonangi/defect-detection-hot-rolling
Defect Detection in Hot Rolling Using Machine Learning
classification data-analysis data-science defect-detection jupyter-notebook machine-learning manufacturing numpy pandas predictive-analytics python random-forest scikit-learn
Last synced: 12 Jun 2026
https://github.com/gaurav-van/optimizing-rate-of-penetration-in-geothermal-drilling-a-digital-twin-approach
Let’s explore something interesting together. In this project, we developed a machine learning digital twin using Intel-optimized XGBoost and daal4py to simulate and optimize the Rate of Penetration (ROP) in geothermal drilling. We leveraged SHAP for Explainable AI (XAI) to interpret model predictions.
data-analysis data-science digital-twin explainable-ai geothermal geothermal-energy jupyter-notebook machine-learning python shap xai xgboost
Last synced: 28 Apr 2026
https://github.com/leosimoes/alura-7daysofcode-dados
Desafios das Trilhas de Dados - Ciência de Dados, Machine Learning e Python Pandas.
data-analysis data-science jupyter-notebook machine-learning python
Last synced: 28 Apr 2026
https://github.com/kisaa-fatima/data-visualization-with-tableauleu
Conducted Exploratory Data Analysis (EDA) on the Berkeley Earth Dataset (large scale dataset), which features high-resolution land and ocean time series data. Created interactive dashboards using Tableau to effectively visualize and highlight trends and patterns within the data.
data-analysis data-science exploratory-data-analysis insights python tableau visualizations
Last synced: 29 Apr 2026
https://github.com/devexpress-examples/web-forms-pivot-grid-change-summary-display-mode
This example shows how to use different summary display modes in Pivot Grid for Web Forms.
asp-net-web-forms data-analysis dotnet pivot-grid pivot-grid-for-web-forms
Last synced: 29 Apr 2026
https://github.com/souza-vitor/stock-market
codecademy data data-analysis data-mining data-science sql sqlite
Last synced: 26 Jun 2026
https://github.com/devexpress-examples/winforms-visualize-pivot-grid-data-in-chart
The following example shows how to integrate the Pivot Grid with the Chart control.
charting data-analysis dotnet pivot-grid-for-winforms winforms
Last synced: 29 Apr 2026
https://github.com/nivasharmaa/spiderverse
A comprehensive Java program for analyzing and managing events and data points within a fictional spiderverse. Features event handling, anomaly detection, cluster management, and robust file I/O operations.
advanced-algorithms anomaly-detection clustering data-analysis file-io object-oriented-programming
Last synced: 29 Apr 2026
https://github.com/kasraskari/learn-r-codes
A learning repository for R programming, covering data manipulation, visualization, and statistical analysis. (Work in progress!) 🚧
data-analysis data-analysis-r data-visualization r r-examples r-graphics r-statistics statistics
Last synced: 08 Jun 2026
https://github.com/mdaffailhami/king_county_home_sales_analysis
This repository contains code and analysis for exploring home sales data in King County, featuring geospatial mapping to visualize trends and factors influencing housing prices, including location, size, and various property features, using Python and popular data analysis libraries.
data-analysis data-science folium-maps geospatial python
Last synced: 29 Apr 2026
https://github.com/jigyasag18/aircraft-data-management
This repository offers a comprehensive simulation of global military air deployments involving 10 countries, aircraft models, mission types, and strategic zones. It analyzes air power distribution, mission intent (offensive, defensive, support), and geopolitical positioning. The project provides structured insights into regional & zone level threat
aircraft-data aircraft-performance data data-analysis data-visualization database database-management dataset datavisualisation mysql powerbi powerbi-report powerbi-visuals sql
Last synced: 04 Feb 2026
https://github.com/marialuizaleitao/walmartsalesanalysis
This project explored data collection and preprocessing, advanced application of SQL queries, and feature engineering. Key calculations, such as COGS (Cost of Goods Sold) and VAT (Value Added Tax), were performed to assess the profitability and financial efficiency of the branches.
business-analytics data-analysis mysql-database sql
Last synced: 13 Jun 2026
https://github.com/mehedi-hassan81/mastercourse
Data analysis project analysing renewable energy production across 212 countries, visualizing trends with Tableau. Highlights China's dominance (2,894 TWh) and Paraguay's 100% renewable share.
data-analysis pandas python renewable-energy selenium tableau-dashboards tableau-public web-scraping
Last synced: 08 May 2026
https://github.com/cannt39t/wylsacom-analysis-reflinks-datamining
data data-analysis data-mining python3 sql
Last synced: 13 Jun 2026
https://github.com/carlos-edulira/mbabigdata-projeto
Entrega do projeto MBA Unipe Big Data BI
data-analysis delta minio python spark
Last synced: 29 Apr 2026
https://github.com/roland045/bike-share-dataset-analysis
User behaviour analysis on a public bike-share dataset
data-analysis data-visualization python time-series-analysis user-behavior-analytics
Last synced: 29 Apr 2026
https://github.com/hardikk-7/election-analysis-project
A data analytics project exploring the 2024 Indian General Election results using Python. Includes party-wise, state-wise, and vote share analysis with visualizations.
data-analysis data-science election-analysis jupyter-notebook python
Last synced: 29 Apr 2026
https://github.com/srinibas-masanta/yelp-business-reviews-analysis
This project analyzes Yelp business reviews using Python, Snowflake, and SQL, focusing on efficient data ingestion, transformation, and analysis. We preprocess JSON data, optimize ingestion via Amazon S3, classify sentiments with Python UDFs, and extract insights using SQL queries—showcasing a streamlined end-to-end workflow.
amazon-s3 data-analysis json python snowflake sql
Last synced: 29 Apr 2026
https://github.com/sdley/cas_pratique-del_annuel
Del-Annuel est logiciel de deliberation annuelle des ecoles superieures ou universités
data-analysis pandas python tkinter-gui
Last synced: 29 Apr 2026
https://github.com/nob101/lotto-analyzer
Ein Node.js & SQLite basiertes Tool zur Analyse und Auswertung der Euromillionenziehung und Joker. A Node.js and SQLite web application to analyze, track, and evaluate lottery (Euromillionen) and Joker results.
backend css data-analysis express html5 javascript nodejs sqlite statistical-analysis
Last synced: 14 Jun 2026
https://github.com/odessaz/portfolio-projects
This is a repository I have created to showcase skills, share projects and track my progress in Data Analytics and Data Science
applied-mathematics data-analysis data-science excel jupyter-notebook matplotlib-pyplot pandas portfolio python r r-studio seaborn sql statistics
Last synced: 12 Apr 2026
https://github.com/shimaa83/eda-repo
Exploratory data analysis for Police and retail dataset in kaggle
Last synced: 29 Apr 2026
https://github.com/jkazari/rollercoaster-eda
Repository of a small data-analysis project in R for Mathematical Software class on the 3rd semester of studying Mathematics at Gdańsk University of Technology
Last synced: 14 Jun 2026
https://github.com/tawfikhammad/sql-leetcode-solutions
The solutions of SQL 50 LeetCode problems
data-analysis data-engineering database leetcode leetcode-solutions sql
Last synced: 15 Jun 2026
https://github.com/brunomontezano/sleep-quality-cognition
💤 Analysis of the paper "Associations between general sleep quality and measures of functioning and cognition in subjects recently diagnosed with bipolar disorder".
bipolar-disorder cognition data-analysis sleep-analysis sleep-research
Last synced: 15 Jun 2026
https://github.com/akorablov/project_data_jobs
Looking into the data job market across Czechia, Slovakia, and the US.
data-analysis data-science git job-market jupyter learn-to-code matplotlib pandas python seaborn skills visual-studio-code visualisation
Last synced: 29 Apr 2026
https://github.com/farhad-here/student_performance_analyzer
Student Performance Analyzer with python, it is on of my data analysis course project. I teach you about filter(),lambda,map() in python
data-analysis data-visualization filter kaggle kaggle-dataset lambda map pandas python python-tutorial streamlit
Last synced: 29 Apr 2026
https://github.com/europanite/gundam-forest
Random Forest Data Analysis of Kill In Action rate for every personnel in GUNDAM world, like in Titanic.
data data-analysis data-science data-visualization death-rate death-rates gundam-model gundam-series gundom-forest jupyter jupyter-notebook kia kill-in-action python random-forest titanic titanic-kaggle titanic-survival titanic-survival-prediction
Last synced: 29 Jun 2026
https://github.com/fbarffmann/nosql-challenge
Analyzed 28,000+ UK restaurant records using MongoDB and PyMongo. Queried hygiene scores, location data, and customer ratings.
data-analysis data-cleaning database-analysis json mongodb nosql pymongo python restaurant-data
Last synced: 13 Apr 2026
https://github.com/shellynagar27/business-insights-360-project
A comprehensive Dashboard which provides better understanding of the business's market standing, key focus areas for optimization, underperforming customers, and year-wise financial insights, aiding in better inventory planning and performance tracking. Further it can be used in answering n number of why questions based on the situations.
dashboard data-analysis data-visualization dax-languague dax-studio excel performance-optimization power-bi reporting sql storage-manager
Last synced: 27 Jan 2026
https://github.com/fabriciocovalesci/webscraping
Web Scraping para obter os dados da página
automation beautiful-soup-4 data-analysis data-cleaning data-extraction data-mining data-scraping data-wrangling html-parsing python web-crawling web-data-scraping web-scraping webscraping
Last synced: 13 Jun 2025
https://github.com/edgarhtt/uber_freight_data_analysis
Uber Freight interview homework. It consisted of solving a 2 warehouse problem and an ETL task
data-analysis data-science data-visualization python
Last synced: 30 Apr 2026
https://github.com/aishwaryagade02/loan-funnel-optimization-analysis
Tracks how loan applications move through each stage, helps spot where people drop off, and gives clear insights to improve approval strategies and overall performance.
ab-testing data-analysis data-creation hypothesis-testing python reporting sql statistical-methods streamlit
Last synced: 30 Apr 2026
https://github.com/srinibas-masanta/ibm-applied-data-science-capstone
This repository contains the work completed for the Applied Data Science Capstone Project offered by IBM on Coursera. The capstone project is the final course in the IBM Data Science Professional Certificate series and serves as an opportunity to apply the skills and knowledge gained throughout the series to a real-world data science problem.
capstone-project data-analysis data-science data-visualization machine-learning python web-scraping
Last synced: 30 Apr 2026
https://github.com/celineboutinon/laplace-immo
ENSAE-ENSAI Formation Continue (Cepe)/OpenClassrooms Data Analyst 2022-2023 - Projet 3
data-analysis data-analytics data-structures database-design database-schema databases mysql-connector-python mysql-workbench python sql
Last synced: 30 Apr 2026
https://github.com/mfakhriazhar/nlp-movie-recommender-system
This project is a content-based movie recommender system built using Natural Language Processing (NLP) techniques. By extracting and combining important text features from movie metadata, this system suggests movies that are similar to a user's selected title.
data-analysis data-science deep-learning machine-learning natural-language-processing python recommender-system
Last synced: 30 Apr 2026
https://github.com/mitchellharrison/mitchellharrison.github.io
Welcome to my slice of the internet, where I share the knowledge that Duke gave me, so you don't have to spend the mortgage-sized amount to access it. Built with R, Python, Quarto, and love.
ai algorithms-and-data-structures blog data-analysis data-science data-visualization educational machine-learning portfolio portfolio-website quarto r r-language statistics tutorials
Last synced: 30 Apr 2026
https://github.com/josewebdev2000/space-mission-data-analysis
Exploring space mission data and creating graphs in base of it.
csv data-analysis data-science data-visualization matplotlib matplotlib-figures matplotlib-pyplot pandas pandas-dataframe python
Last synced: 30 Apr 2026
https://github.com/abhi227070/ipl-2024-sold-player-data-analysis
This project analyzes IPL 2024 auctioned players' data, including name, team, cricket type, nationality, and price. Users input a player's name to access team, style, nationality, and auction price, aiding research and fantasy leagues. It offers insights into player dynamics, serving cricket enthusiasts with comprehensive data exploration.
data-analysis data-visualization dataanalytics machine-learning machine-learning-algorithms python3
Last synced: 30 Apr 2026
https://github.com/fbarffmann/credit-risk-classification
Classified 19,000+ loans as high-risk or healthy using logistic regression. Achieved 100% precision for healthy loans and 84% precision for high-risk loans.
classification credit-risk data-analysis logistic-regression machine-learning model-evaluation pandas python scikit-learn
Last synced: 30 Apr 2026
https://github.com/devexpress-examples/wpf-pivot-grid-provide-custom-summary-values
This example demonstrates how to determine the value type when you calculate custom summary values in Pivot Grid for WPF.
data-analysis dotnet dxpivotgrid pivot-grid pivot-grid-for-wpf wpf
Last synced: 01 May 2026
https://github.com/falakrana/data-analysis-visualization
This repository showcases data analysis and visualization projects using Python and Tableau. It includes exploratory data analysis, interactive dashboards, and insightful visual stories derived from real-world datasets.
data-analysis data-visualization python tableau-public
Last synced: 01 May 2026
https://github.com/prathmesh2507/global-stock-intelligence-dashboard
Interactive Global Stock Market Analytics Dashboard built using Python, YFinance, Pandas, Streamlit, and Plotly. Analyze 20+ countries and 400+ top stocks with advanced visualizations and financial insights.
dashboard data-analysis data-visualization python stock-analysis streamlit
Last synced: 15 Jun 2026
https://github.com/dipeshgoyal013/crypto-currency-dashboard
This project analyzes historical cryptocurrency data and builds an interactive Power BI dashboard. It includes time-series forecasting of Bitcoin and Ethereum using ARIMA and Power BI’s forecasting model.
data-analysis excel power-bi python
Last synced: 15 Jun 2026
https://github.com/dnut/associations
Python 3 library to identify high-dimensional statistical relationships in any data set.
analytics arch-linux association-rules data data-analysis data-mining data-science machine-learning python-modules
Last synced: 01 May 2026
https://github.com/sairupeshl/leo-orbital-congestion-analysis
Geospatial data analysis of the UCS Satellite Database using Python to map active LEO space assets, validate orbital parameters, and isolate mega-constellation traffic bottlenecks.
aerospace-engineering data-analysis geospatial-analysis orbital-mechanics pandas python satellite-data seaborn
Last synced: 08 Jun 2026
https://github.com/monish-nallagondalla/sensor_fault_detection
This repo contains sensor data for analysis, focusing on sensor readings, their attributes, and classification (Good/Bad). It includes 500+ sensors with features for predictive modeling, anomaly detection, and sensor failure prediction.
anomaly-detection classification data-analysis data-science machine-learning predictive-modeling python sensor-data
Last synced: 01 May 2026
https://github.com/dhruwsunita/customer-churn-analysis
Customer Churn Analysis using panda library
data-analysis data-cleaning data-manipulation data-science pandas python3
Last synced: 01 May 2026
https://github.com/shibbir24/amazon-product-sales-data-analysis-trends-and-insights
Amazon Product Sales Data Analysis: Trends and Insights
amazon-dataset data-analysis matplotlib numpy pandas seaborn
Last synced: 01 May 2026
https://github.com/abdoomohamedd/python-data-analysis-projects
A collection of data analysis projects that I have worked on. Each project involves cleaning data, performing exploratory data analysis (EDA), and creating visualizations to extract insights. The projects utilize various Python libraries, including pandas, numpy, matp
data-analysis data-analysis-python data-cleaning data-visualization matplotlib matplotlib-pyplot numpy pandas python
Last synced: 01 May 2026
https://github.com/saifalibaig/covid-19-infection-rate-analysis-using-python
Analysis of Covid-19 Infection rate and the world happiness report to identify if there is any relationship between infection rate and happiness
data-analysis data-visualization jupyter-notebook numpy pandas python3 sns
Last synced: 18 Apr 2026
https://github.com/kheriberto/pandas_and_seabron_project
In this project I showcase my ability using pandas and seaborn to mold, transform and plot data.
data-analysis pandas python seaborn
Last synced: 01 May 2026
https://github.com/vedantshi/stock-price-prediction-for-maang-companies
This project utilizes Long Short-Term Memory (LSTM) networks to forecast stock prices. It includes steps for data preprocessing, model training, and visualization of predictions using Python in Jupyter Notebook. The project demonstrates proficiency in machine learning, data analysis, and Python programming.
data-analysis data-visualization lstm machine-learning python stock-price-prediction
Last synced: 01 May 2026
https://github.com/mateusoliveira30/top-intelligent-people
This project performs an exploratory analysis of the top_intelligent_people_in_the_world_5000.csv dataset, featuring some of the world's most intelligent individuals. Using pandas and matplotlib, the analysis includes checking for missing values, describing variables, and visualizing data.
data-analysis graphics kaggle-dataset python3
Last synced: 03 May 2026
https://github.com/victorlcastro-dsa/coping_struggles_prediction
Repositório para prever dificuldades de enfrentamento com base em dados de saúde mental. Inclui análise, visualização e modelagem usando aprendizado de máquina. Resultados alcançam 86.58% de acurácia com um Voting Classifier.
classification-algorithm data-analysis data-science data-visualization machine-learning-algorithms problem-solving project-based-learning python
Last synced: 19 Apr 2025
https://github.com/harshindcoder/salifort_motors_project
This people analytics project analyzes factors influencing employee turnover and predicts whether an employee is likely to leave. It aims to uncover patterns behind departures, helping Salifort improve retention, workplace culture, and professional growth strategies.
data-analysis data-science data-visualization hr-analytics machine-learning tree-models
Last synced: 02 May 2026
https://github.com/marielachirinosr/nyc-taxi-trip-exploration-2019-2020
Explores passenger behavior & impact of COVID-19 on NYC taxi industry (Q1 2019-2020).
bigquery data data-analysis data-visualization python sql tableau
Last synced: 15 Jun 2026
https://github.com/victoryfanfare/car-price-prediction
ML модель для определения рыночной стоимости автомобилей с пробегом. Проект включает анализ данных, feature engineering и сравнение различных алгоритмов машинного обучения.
catboost data-analysis jupyter-notebook lightgbm machine-learning pandas python regression
Last synced: 15 Jun 2026
https://github.com/lucas54neves/financial-organizer
Financial organizer using Streamlit
data-analysis data-science financial-organizer plotly python streamlit
Last synced: 02 May 2026
https://github.com/asadiahmad/word-counter-spark
Word counter with spark
data-analysis nlp spark word-counter
Last synced: 02 May 2026
https://github.com/rodrigojunqueiradev/curso-python-3-do-basico-ao-avancado
Curso de Python 3 do básico ao avançado - com projetos reais
data data-analysis data-science python python-3 python-library python-script python3
Last synced: 27 Jun 2026
https://github.com/jimohola/breast-cancer-detection
Breast Cancer Detection-Machine learning
data-analysis data-visualization exploratory-data-analysis machine-learning python3
Last synced: 02 May 2026
https://github.com/bhawnagoyal18/ai-doctor-a-symptom-checker-disease-predictor
AI Doctor is an intelligent healthcare application that utilizes machine learning (ML) and Python to predict potential diseases based on user-input symptoms. The project integrates data from multiple medical datasets and provides an interactive web-based UI for an intuitive user experience.
data-analysis data-engineering data-visualization dataset flask html5 machine-learning python sql stacking statistics
Last synced: 02 May 2026
https://github.com/rorrell/employmentdata
A Jupyter Notebook where I use group by to analyze the average unemployment rate by year
data-analysis data-visualization jupyter-notebook python3
Last synced: 02 May 2026
https://github.com/neuro-mechatronics-interfaces/matlab_analyses
Tools for analysis, statistics, and/or simulation in Matlab.
data-analysis data-visualization matlab matlab-codes matlab-functions matlab-gui matlab-scripts neuroscience weber-lab
Last synced: 09 Jun 2026
https://github.com/asergioscosta/portfolio-dados
Portfolio of dashboards and data analysis projects.
business-intelligence dashboard data-analysis data-visualization kpi looker-studio powerbi
Last synced: 22 Feb 2026
https://github.com/khushi-sabarad/8-week-sql-challenge
Case studies' solutions for the #8WeekSQLChallenge by Danny Ma
8weeksqlchallenge case-study data-analysis mysql sql
Last synced: 06 Sep 2025
https://github.com/se7en69/rna-seq-data-processing-and-analysis-pipeline
This pipeline automates essential steps for RNA-Seq data analysis, including quality control, read trimming, alignment to a reference genome, and coverage quantification. It leverages tools like FastQC, fastp, STAR, and bedtools to ensure high-quality results, with MultiQC reports providing an overview at each stage.
bioinformaitcs-scripting bioinformatics bioinformatics-pipeline data-analysis linux scripts shell
Last synced: 02 May 2026
https://github.com/bala-1409/tableau-visualization-viz.-project
This repository contains Visualization Projects which is visualized through Tableau Software, by using the visualization we can gain multiple insights and strategies which helps to develop the business for gaining high profit margins and also it provides social values in some cases to calculate damages and intensity by calamities.
dashboard data-analysis data-science data-visualization exploratory-data-analysis tableau tableau-dashboards tableau-public visualization
Last synced: 04 Feb 2026
https://github.com/inevolin/multivariate-data-analysis
Showcases of modern multivariate & multidimensional data analysis in industrial and high-tech settings.
analytics data-analysis data-science data-visualization javascript
Last synced: 09 Jun 2026
https://github.com/sarah-marion/sovereign-osint-toolkit
Sovereign OSINT Toolkit - Advanced, self-hosted intelligence platform for security researchers and investigators. Ethical, private and production-ready.
correlation-engine cybersecurity data-analysis docker fastapi infosec intelligence investigation open-source osint privacy python3 security-research security-tools threat-intelligence
Last synced: 02 May 2026
https://github.com/robertpaulp/expenseadvisor
HackITall 2023- Hackathon
chatgpt-api data-analysis data-processing python scrapping-python
Last synced: 03 May 2026
https://github.com/bhavna-kale/cars-eda-project
Project analyzing used car market data to identify high-impact price drivers and depreciation curves, presented through an interactive web application.
data-analysis excel matplotlib numpy pandas python3 searborn streamlit
Last synced: 03 May 2026
https://github.com/ahmedhosssam/lesser_pandas
Pandas-like Data Analysis library in C++
cpp data-analysis data-science pandas
Last synced: 03 May 2026
https://github.com/fatihilhan42/tourist_analysis_in_turkey_with_python
In this project, the number of tourists coming to Turkey between 2008-2021 was analyzed. The data from the data set you can find in the warehouse was first organized using data cleaning algorithms. These cleaned data were then output graphically using data visualization algorithms.
data-analysis data-cleaning data-science data-visualization jupyter-notebook python
Last synced: 03 May 2026
https://github.com/vedanty3/supermarket-sales-data-analysis
This project contains data visualization techniques (using pandas and matplotlib) to explore different aspects of supermarket sales data of 3 months.
data-analysis data-science jupyter-notebook matplotlib numpy pandas python
Last synced: 08 May 2026