Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-07-03 00:07:37 UTC
- JSON Representation
https://github.com/sandk21/detection_faux_billets
Algorithme de détection de faux billets selon leurs dimensions géométriques et application web pour générer les prédictions
data-analysis data-science data-visualization machine-learning pandas python scipy sklearn streamlit
Last synced: 03 Apr 2026
https://github.com/archie-cm/credit_risk_model_vix_id-x_partners
The objective project is to decrease the company's losses by up to 30% through bad loans by creating a machine learning system to assist in automating loan assessments
credit-risk data-analysis data-visualization machine-learning scorecard
Last synced: 01 May 2026
https://github.com/jmssnr/shuffle-kit
shuffle-kit: model and analyze playing card shuffles in Python
data-analysis playing-cards python shuffle statistics
Last synced: 19 Jun 2026
https://github.com/vyjayanthipolapragada/early_sepsis_detection_ml
An end-to-end project leveraging clinical datasets (PhysioNet, MIMIC-IV, MIMIC-IV-ED) to develop and compare ML and LSTM-based models for early sepsis prediction.
data-analysis data-visualization deep-learning healthcare jupyter-notebook keras-tensorflow lstm-neural-networks machine-learning neural-network python
Last synced: 28 Apr 2026
https://github.com/scarblase/homeless-animals-analysis
A data-driven exploration of homeless animal statistics 🐶🐱. Analyze age distribution, shelter dynamics, and adoption patterns using Python, Pandas, and Seaborn.
animals data-analysis data-mining data-science data-science-projects data-visualization matplotlib matplotlib-pyplot numpy pandas plotly python python3 ukraine
Last synced: 06 May 2026
https://github.com/sakan811/tekken-8-jun-kazama-data-analysis-showcase
Showcase visualizations about Jun Kazama from Tekken 8
data data-analysis data-visualization game games powerbi tekken tekken-8 video-game visualization web-scraping
Last synced: 28 Feb 2026
https://github.com/rakumar99/jp-morgan-chase-virtual-internship
This repository contains the various tasks assigned by JPMorgan Chase & Co. Virtual Internship on Microsoft Excel
conditional-formatting dashboard data-analysis data-visualization hlookup pivot-tables presentation vba-macros vlookup
Last synced: 02 Mar 2026
https://github.com/alexandrelamarre/fission
Data analytics & Structured streaming optimized for the Edge
data-analysis data-engineering rust structured-data unstructured-data
Last synced: 08 Jun 2026
https://github.com/arhcoder/base-hackathon-2022
💸 Sistema que analiza las facturas de compra-venta de una empresa de importaciones y exportaciones, y crea una base de conocimiento con la que crea sugerencias de abastecimiento para las empresas clientes de Banco BASE, con el fin de ahorrarles dinero.
algorithms bank companies data-analysis decision-making exportation hackaton importation javascript mysql python suggestions
Last synced: 16 Apr 2026
https://github.com/scarblase/portfolioprojects
A collection of data analysis and business intelligence projects using SQL, Python, and visualization tools to uncover insights from real-world datasets. 🚀📊
csv data-analysis data-engineering data-mining data-science data-visualization matplotlib matplotlib-pyplot pandas python python3 seaborn sql
Last synced: 06 May 2026
https://github.com/harshmule1/school-data-analysis-
School Data Analysis Using SQL
Last synced: 04 Apr 2026
https://github.com/monish-nallagondalla/diamondpriceprediction
Diamond Price Prediction is an end-to-end machine learning project that predicts diamond prices based on attributes like carat, cut, color, clarity, and dimensions. It features a Flask web application for real-time predictions and utilizes models such as Linear Regression, Lasso, and Ridge.
data-analysis data-science flask jupyter-notebooks machine-learning predictive-modeling python
Last synced: 06 May 2026
https://github.com/pheithar/socialdata_madridcentral
Social data and visualization course at DTU - 2022. Effectiveness of Madrid Central
data-analysis data-visualization jupyer-notebook madrid python
Last synced: 28 Apr 2026
https://github.com/eslamdyab21/imdb-data-analysis
This data set contains information about 10,000 movies collected from The Movie Database (TMDb), including user ratings and revenue
data-analysis pandas python udacity-data-analyst-nanodegree
Last synced: 06 May 2026
https://github.com/chiemekaifemegbulem/useful_tools
Advanced Web Scraping
automation beautifulsoup captcha-solving data-analysis data-extraction data-science proxy-rotation python scraping-bots selenium tor-network web-scraping web-scraping-python webscraping
Last synced: 28 Apr 2026
https://github.com/bhavik444/techistanbul_python_bootcamp
👨💻 Master Python programming through practical exercises in this 80-hour bootcamp, designed for beginners to advanced learners.
algorithms api-development automation coding-bootcamp data-analysis data-visualization django flask git machine-learning python software-engineering testing web-development web-scraping
Last synced: 28 Apr 2026
https://github.com/freebirdscrew/covid-19-data-analysis
Coronavirus Data-Analysis with Live Data Streaming from the Website and Made a DASH Web-App at Last.
coronavirus coronavirus-real-time coronavirus-tracking countryinfo covid-19 covid-19-india covid19 covid19-data dash dash-button dashboard-application data data-analysis data-cleaning data-science data-visualization github jupyter pycountry python
Last synced: 07 May 2026
https://github.com/shrawans007/google_cyclistic_2023
Google Data Analytics Capstone Case Study (SQL and Tableau)
big-query bigquery coursera-assignment cyclistic cyclistic-bike-share-analysis-case-study cyclistic-bikshare data-analysis data-analysis-project data-analytics data-cleaning data-combination data-exploration data-science google-data-analytics sql tableau tableau-dashboard tableau-public
Last synced: 19 Jun 2026
https://github.com/sayantanidalui/indian-government-budget-analysis
A complete end to end data analysis project using Python, SQL, and Power BI based on a Kaggle dataset. Built to explore trends, allocations, and insights from India’s Union Budget (2021–24) for practice purposes.
data-analysis mysql pandas powerbi storytelling
Last synced: 07 May 2026
https://github.com/affec-ds/netflix-recommender-system
Sistema de recomendación de títulos de Netflix basado en contenido. Incluye filtros por título, género y tipo de contenido (películas o series) con interfaz interactiva en Jupyter Notebook.
content-based-recommendation data-analysis eda ipywidgets jupyter-notebook machine-learning movies netflix portfolio-project python recommender-system
Last synced: 28 Apr 2026
https://github.com/mirokeimioniemi/classifying-software-pirates
Exploring the factors driving people into software piracy by training two machine learning models to predict whether a person with certain characteristics and sentiments is likely to possess any pirated software or not using a dataset collected via a survey targeting users of music production software.
data-analysis data-science decision-tree-classifier logistic-regression machine-learning piracy python software-piracy survey
Last synced: 06 May 2026
https://github.com/sarmadahmad8/ml-and-deeplearning-projects-for-beginners
Beginner ML/DL projects spanning core libraries and problem sets.
beginner-friendly data-analysis data-science deep-learning fastai machine-learning opencv pytorch scikit-learn transformer
Last synced: 17 Apr 2026
https://github.com/allanccwang/electronic_projects
implement the circuit with microcontroller
arduino circuit-analysis circuit-simulations circuits-and-electronics cpp data-analysis microcontroller physics python wemos
Last synced: 07 May 2026
https://github.com/mattdelaune/retail_rfm_analysis
Power BI multi-page report leveraging advanced data visualization for RFM analysis. Delivers deep analytical insights into customer behavior, engagement, and spending patterns, driving strategic business decisions.
data-analysis dax powerbi report rfm-analysis sales-data visualization
Last synced: 19 Mar 2026
https://github.com/sivas-2/coffee-sales-visualization
This repository contains data visualization scripts and notebooks analyzing coffee sales data from a vending machine, sourced from Kaggle. The visualizations explore sales trends, customer preferences, and product popularity over time.
data data-analysis data-science data-visualization python visualization
Last synced: 07 May 2026
https://github.com/pngo1997/axa-xl-insurance-bi-dashboard
Provides a comprehensive analysis of insurance submissions, approvals, compliance rates, and profitability for AXA XL Insurance.
bi-analytics bi-dashboard business-analytics data-analysis filtering performance-analysis powerbi segmentation visualization
Last synced: 08 Feb 2026
https://github.com/chahiriabderrahmane/carpricepredictor
🚗 Cars Exploration & Price Prediction | Analyzing Cars.com Listings
data-analysis data-science data-visualization machine-learning python streamlit web-scraping
Last synced: 08 Feb 2026
https://github.com/dsrodrigovieira/houserocketsales
Este repositório contém um projeto desenvolvido para praticar habilidades de análise de dados utilizando Python
data-analysis data-visualization heroku kaggle-dataset python
Last synced: 29 Apr 2026
https://github.com/eduardoedubox/health_data_analysis
Health data analysis using Jupyter Notebook
data-analysis data-science database jupyter-notebook pandas python
Last synced: 07 May 2026
https://github.com/alxrm/scent-of-literature
Russian literature sentiment analysis in terms of very small dataset
classification data-analysis sentiment-analysis sklearn tf-idf
Last synced: 28 Apr 2026
https://github.com/hamzacham/data_set_projet-5
data-analysis data-science database dataset jupyter jupyter-notebook paython training
Last synced: 07 May 2026
https://github.com/henrylin03/china-gdp
Analysis and visualisation of China GDP data using Python.
data data-analysis data-visualisation dataset kaggle pandas
Last synced: 01 May 2026
https://github.com/garcane/london-housing-price-dashboard
This Excel-based Housing Visual Dashboard provides a comprehensive view of average house prices across various boroughs in London from 1996 to 2013. The dashboard is designed to offer insights into housing market trends and price variations across different areas of London over time.
data data-analysis data-visualization excel visual
Last synced: 13 Feb 2026
https://github.com/lebrancconvas/how-much-love-in-thai-song
How much Love song among the Thai Songs?
data-analysis side-project web-scraping
Last synced: 19 Jun 2026
https://github.com/thevinh-ha-1710/diabetes-predictive-model
This project aims to train a predictive model to diagnose diabetes on women patients.
data-analysis data-science data-visualization model-training-and-evaluation python
Last synced: 13 Feb 2026
https://github.com/pxaris/expenditure-analyzer
Application for analyzing expenditure data over time
data-analysis data-visualization docker python statistics
Last synced: 29 Apr 2026
https://github.com/idaraabasiudoh/knn-customer-classification
Labels telecommunication customer base to respective groups to determine service type required for each customer.
data-analysis jupyter-notebook machine-learning pyhton3 scikit-learn
Last synced: 07 May 2026
https://github.com/mr-vozhyk/karpov.courses-study
Часть заданий, мини-проектов и финальный проект от karpov.courses
airflow data-analysis git python sql statistics
Last synced: 05 May 2026
https://github.com/kirkalyn13/open-signal-report-generator
Script used to generate results/summary, including the trends of flagged provinces, from the raw excel data file,
data-analysis data-science data-visualization matplotlib numpy pandas python
Last synced: 19 Jun 2026
https://github.com/sayedgamal99/data-science
This is a repository for Data Science Projects.
data-analysis data-science deep-learning machine-learning python regression supervised-learning
Last synced: 07 May 2026
https://github.com/hebaqaisar/movie-recommender-system
AI Recommender System - Recommends you similar movies based on Directors, Tags, Name, Type, Actors, Genre etc
artificial-intelligence data-analysis data-mining data-science jupyter-notebook machine-learning machine-learning-algorithms ml movies-rate pycharm python
Last synced: 17 Apr 2026
https://github.com/backdoorali/insider-threat-detection-project
Personal data analysis project combining insider threat detection, cybersecurity, and exploratory data analytics. Built for portfolio showcase and practical skills demonstration.
cybersecurity data-analysis data-analysis-excel data-analysis-project data-analyst data-analytics data-visualization eda excel insider-threat jupyter-lab jupyter-notebook matplotlib numbers pandas portfolio-project python python3 threat-detection threat-intelligence
Last synced: 07 May 2026
https://github.com/gorodroz/crypto-tracker
Realtime Bitcoin price tracker using Binance WebSocket and REST API. Logs prices to CSV and supports Pandas for data analysis.
binance bitcoin crypto csv-logger data-analysis pandas python rest-api websocket
Last synced: 07 May 2026
https://github.com/bala-1409/foreign-exchange-rate-time-series-data-science-project
This project will use time series analysis to forecast the exchange rate between the euro and the US dollar. The project will use a variety of statistical techniques, such as ARIMA to model the data and forecast the exchange rate.
data-analysis data-science data-visualization datapreprocessing eda exploratory-data-analysis forecasting machine-learning-algorithms model modelfitting predictive-modeling python3 scikit-learn statsmodels time-series time-series-analysis
Last synced: 07 May 2026
https://github.com/md-emon-hasan/data-science
Data science tutorials, including data preprocessing, analysis, visualization, project deployment, machine learning and deep learning algorithms.
artificial-intelligence data-analysis data-engineering data-science deep-learning machine-learning-algorithms python
Last synced: 07 May 2026
https://github.com/pradipece/weather_forecast_data_analysis
Using decision trees and random forest algorithms to solve real-world data analysis. "sklearn_decision_trees_random_forests"
data-analysis data-science data-visualization git github python python3
Last synced: 19 Apr 2026
https://github.com/mrsamsonn/monolithic-polylithic-crystal-segmentation
A grid segmentation algorithm for clustering crystal structures using diffraction patterns. Useful in material science and nanotechnology, this code enables detailed analysis of crystals for research and industrial applications.
clustering crystal-structure crystallography data-analysis diffraction-patterns grid-segmentation image-processing k-means machine-learning matertial-science nanotechnology python research-project research-tools scientific-computing
Last synced: 28 Apr 2026
https://github.com/dogan-the-analyst/web_scraping_job_vacancies
data-analysis python web-scraping
Last synced: 07 May 2026
https://github.com/y-india/retail-sales-analysis-project
Analysis and preprocessing of retail store sales data. Includes data loading, merging, and initial inspection. 📌 Recommended: See README.md for detailed project progress and dataset information.
ai dashboard data-analysis data-science data-visualization jupiter-notebook machine-learning matplotlib python real-world-problem-solving real-world-project retail-analytics sales-analysis seaborn sklearn-library streamlit
Last synced: 07 May 2026
https://github.com/thevinh-ha-1710/rstudio-statistics
This project deeply studies 2 datasets using applied statistics techniques.
applied-statistics data-analysis data-science data-visualization rmarkdown rstudio
Last synced: 31 Jan 2026
https://github.com/1ayanabil1/iris-visualization
This repository focuses on visualizing the Iris dataset using various data visualization techniques. It includes histograms, scatter plots, box plots, pie charts, bubble charts, and KDE plots to provide insights into the dataset’s structure. The project utilizes Matplotlib, Seaborn, Plotly, and Scikit-learn to generate insightful visualizations.
analytics clustering data-analysis data-science data-visualization datavisualization-project datavisualizations eda exploratory-data-analysis machine-learning machinelearning-python python
Last synced: 07 May 2026
https://github.com/tks18/pyquery
PyQuery is a local-first data operating system built on lazy execution that processes 100GB+ files while you doomscroll. No cap. 🧢
data-analysis data-science etl hdfs parquet pipeline polars python
Last synced: 14 Jan 2026
https://github.com/gdbecker/analyticsportfolio
Analytics Professional Project Work
data-analysis data-science decision-trees firebase google-looker-studio k-means-clustering k-nearest-neighbors kaggle linear-regression logistic-regression machine-learning microsoft-fabric powerbi principal-component-analysis python3 random-forest synapse-data-engineering
Last synced: 10 Apr 2026
https://github.com/europanite/client_side_ml
Client Side Browser-Based CART Time-Series Prediction.
cart data-analysis data-science docker docker-compose expo expo-go machine-learning metro react-native ti time-series time-series-analysis time-series-forecasting time-series-prediction typescript vite
Last synced: 10 Apr 2026
https://github.com/viztruth/google-play-store-data-analysis
This repository contains all the materials of my final project 'Google Play store Data Analysis' for the 'Telling Stories with Data' course at PES University.
data-analysis data-visualization
Last synced: 21 Aug 2025
https://github.com/aadityatamrakar/futures_spread_chart
Cash Market & Futures Daily Spread Chart - NSE Stocks
data data-analysis data-mining expressjs nodejs requests
Last synced: 10 Apr 2026
https://github.com/kyleprotho/analysistoolbox
Analysis Tool Box (i.e. "analysistoolbox") is a collection of tools in Python for data collection and processing, statisitics, analytics, and intelligence analysis.
analytics data-analysis open-source-intelligence python3 r research snippets statistics
Last synced: 22 Aug 2025
https://github.com/mohamed3nan/udacity
Udacity Data Analysis Nanodegree Program
data-analysis data-visualization numpy pandas python
Last synced: 10 Apr 2026
https://github.com/realorangeone/docker-cyberchef
A containerized deployment of CyberChef, with additional protections
cyberchef data-analysis data-manipulation docker encoding
Last synced: 24 Aug 2025
https://github.com/chen0040/pyspark-advanced-algorithms
Samples of Advanced Algorithms and Data Analysis implemented in pyspark
advanced-algorithms data-analysis map-reduce pyspark
Last synced: 12 Jan 2026
https://github.com/mathurutkarsh/zomato_bangalore_restaurants_kaggle
Exploratory data analysis
data-analysis data-visualization machine-learning
Last synced: 25 Aug 2025
https://github.com/wildanmujjahid29/books-sales-analytics-python
Books Sales Analytics With Pyhton
data data-analysis data-science data-visualization
Last synced: 12 Jun 2026
https://github.com/ahmad-ali-rafique/weather-prediction-fcnn
This project demonstrates a complete pipeline for weather prediction using a Fully Connected Neural Network (FCNN). The project is implemented in Python using Jupyter Notebook, and it covers data loading, preprocessing, model training, and performance evaluation.
ai artificial-intelligence data-analysis data-science deep-learning deep-neural-networks fully-connected-network machine-learning machine-learning-algorithms weather-information
Last synced: 28 Aug 2025
https://github.com/nafisalawalidris/sales-performance-dashboard
Sales Performance Dashboard: Analyze and visualize sales data using Power BI. Gain insights into trends, customer segments, product performance, and geographic distribution. Make data-driven decisions to optimize sales strategies and maximize revenue.
analytics-revenue dashboard-power-bi data data-analysis intelligence-sales optimization performance sales visualization-business
Last synced: 03 Feb 2026
https://github.com/prajakta1321/kaggle-ai-report-2023
A Report describing the trends in emergence of AI over the years !
data-analysis data-visualization python3
Last synced: 28 Jun 2025
https://github.com/Zen204/airbnb-availability
A machine learning model that predicts Airbnb listing availability, utilizing feature engineering and supervised learning techniques to improve guest experience and optimize host management.
binary-classification data-analysis data-preprocessing data-visualization feature-engineering machine-learning matplotlib model-evaluation nlp pandas predictive-modeling python scikit-learn seaborn supervised-learning
Last synced: 02 Apr 2025
https://github.com/luminati-io/airbnb-dataset-samples
A sample dataset of over 1000 Airbnb listings, extracted using the Bright Data API, ideal for competitor tracking, brand reputation, and market analysis.
airbnb airbnb-listings api data-analysis datasets web-scraper web-scraper-api web-scraping
Last synced: 04 Jan 2026
https://github.com/discdiver/new-belgium-ratings
Find the most popular New Belgium beers of all time!
beautifulsoup data-analysis pandas python seaborn webscraping
Last synced: 10 Apr 2026
https://github.com/nirmit27/book-recommender-system
This is a book recommendation system based on item-based Collaborative Filtering memory-based model created using Flask.
data-analysis data-science flask python python3 recommender-system render
Last synced: 05 May 2026
https://github.com/unnatmalik/dattavism-ai-powered-data-insight-generator-
Dattavism is an AI-powered data insight platform that transforms raw CSV files into comprehensive, contextualized reports—complete with visualizations, statistical summaries, and natural language insights. Dattavism is designed to handle datasets across diverse domains. it is Built using Python, Streamlit, Gemini API, Pandas, Matplotlib, NumPy,
data-analysis python streamlit
Last synced: 24 Jul 2025
https://github.com/jossimmar/ensa-ss25
Repositorio destinado al manejo de datos de consumo de los Clientes Mayores de ENSA del Grupo Distriluz.
data-analysis electrical-engineering python sqlite
Last synced: 30 Mar 2025
https://github.com/prernarohra/heart-disease-prediction
This project develops a machine learning model to predict heart disease risk based on symptoms and medical history. The model achieved the best accuracy with Logistic Regression, as it works well for binary classification problems.
artificial-intelligence data-analysis data-science dataset heartdisease-prediction machine-learning models
Last synced: 06 Nov 2025
https://github.com/cr-mao/machine-learning
机器学习笔记
data-analysis data-handling machine-learning math numpy pandas
Last synced: 12 Nov 2025
https://github.com/umutsevdi/hr-management
HR Management, Analytics and Salary Determination System
analytics data-analysis java java17 postgresql python spring spring-boot vaadin vaadin-flow
Last synced: 10 Apr 2026
https://github.com/rkirlew/workoutrecommendationsdataset
This repository contains a synthetic dataset designed for building personalized workout recommendation models. The data is generated for educational and experimental purposes, allowing users to practice machine learning techniques such as classification, k-NN, and clustering, as well as explore fitness-related data analysis.
classification data-analysis dataset k-nearest-neighbours machine-learning
Last synced: 08 May 2026
https://github.com/ipanalytics/vpn-provider-overlap-intelligence
Aggregate VPN provider infrastructure overlap analysis: exact-IP overlap, shared /24 prefixes, hosting dependency, and provider relationship clusters. No raw VPN IP lists.
anti-fraud asn cybersecurity data-analysis fraud-detection infrastructure ip ip-intelligence ip-reputation network-analysis network-intelligence osint proxy-detection threat-intelligence vpn vpn-detection
Last synced: 25 May 2026
https://github.com/virajbhutada/global-universities-success-analysis-powerbi-sql-excel
This capstone project conducts in-depth analysis using Power BI, SQL, and Excel to explore complex dynamics shaping global university success. Integrating data from diverse ranking systems and criteria, our aim is to unravel the factors influencing universities worldwide.
capstone capstoneproject data-analysis data-analytics data-insights data-science data-science-projects data-visualization excel exploratory-data-analysis mece mysql powerbi powerpoint sql
Last synced: 20 Jun 2025
https://github.com/ibnaleem/cyberchef-discord
A versatile Discord bot that implements CyberChef's features for encoding, decoding, encrypting, compressing, analysing data directly and more in your Discord server
compression cti cyberchef cybersecurity data-analysis data-manipulation discord-bot discord-js encoding encryption hashing infosec parsing redteam
Last synced: 28 Jan 2026
https://github.com/gappeah/global-shipping-analytics-dashboard
This Tableau project provides a comprehensive visual analysis of global sales, shipping costs, and quality metrics across different regions and countries.
data data-analysis data-analyst data-visualization metrics tableau
Last synced: 25 Feb 2025
https://github.com/aryansharma5/data-visualization-and-thorough-analysis
comprehensive guide for data analysis and visualization
data-analysis data-visualization
Last synced: 18 Mar 2025
https://github.com/allanotieno254/codsoft
This repository showcases a series of data science projects completed during an internship with CODESOFT. Each project utilizes Python and various machine learning techniques to solve specific problems in data analysis, classification, regression, and predictive modeling.
classification data-analysis data-science feature-engineering machine-learning model-evaluation predictive-modeling python-programming regression
Last synced: 15 May 2025
https://github.com/antonijn/polyfit
Fits a polygon to a given data input
c data-analysis linear-algebra toy
Last synced: 16 Jul 2025
https://github.com/hifza-khalid/book-management-system-sql
A Book Management System SQL project 📚 featuring tables for Authors ✍️, Books 📖, Customers 👤, and Orders 🛒. Includes sample queries for tracking book sales 💰, pricing by genre 🎭, and customer order history 📅.
book-management data-analysis database-management sql sql-queries
Last synced: 03 Feb 2026
https://github.com/gesiscss/wikipedia-language-olga-master
Measuring Gender Inequalities of German Professions on Wikipedia
bias crowdflower data-analysis data-science gender images python statistics wikipedia
Last synced: 10 Apr 2026
https://github.com/sauravbhattacharya001/biobots
A simple tool for checking 3D printer run statistics.
3d-bioprinting aspnet-web-api biobot bioinformatics bioprinting csharp data-analysis dotnet rest-api tissue-engineering
Last synced: 19 May 2026
https://github.com/satvikvirmani/engineering-graduate-salary-analysis
A Data Science/Machine Learning project to analyse and study salary patterns of a engineering graduate in India.
data-analysis data-science jupyter-notebook machine-learning prediction python regression
Last synced: 19 Jun 2026
https://github.com/husna-poyraz/titanic-machine-learning
Use machine learning to create a model that predicts which passengers survived the Titanic shipwreck.
data data-analysis data-science data-visualization deep-learning machine-learning missing-data outlier-detection python titanic
Last synced: 10 May 2026
https://github.com/paezha/isdas
Companion package for An Introduction to Spatial Data Analysis and Statistics with R
data-analysis gis rstats spatial-analysis spatial-statistics
Last synced: 04 Jan 2026
https://github.com/5ekastanx/data-analysis
Extracting data from parsing, for example, like hacking using Python using all sorts of function methods
Last synced: 14 Mar 2025
https://github.com/matthewgrosman/messenger-analytics
Project that ingests Facebook Messenger conversations and generates analytics.
analytics data-analysis excel facebook facebook-messenger java mongodb
Last synced: 15 Apr 2025
https://github.com/kishlayjeet/zomato-data-exploration
In this project, we will be exploring a dataset containing information on various restaurants and their ratings, location, and other attributes.
data-analysis eda matplotlib numpy pandas zomato-data-exploration
Last synced: 10 Apr 2026
https://github.com/gf712/abpytools-qt
Qt interface of AbPyTools
antibody-numbering antibody-sequences cpp11 data-analysis python3 qt5
Last synced: 10 Apr 2026
https://github.com/greed2411/ndl
Numbers Don't Lie, attempt on Data Analysis using pandas and matplotlib.
cities data-analysis data-science data-visualization india kaggle
Last synced: 19 Apr 2026
https://github.com/vineet416/airbnb_data_analysis
Ineuron.ai Internship Project
data-analysis exploratory-data-analysis matplotlib numpy pandas-python python seaborn statistical-analysis statistics
Last synced: 10 Apr 2026
https://github.com/nouman6093/advanced-statistical-models
in this repository i will upload everything i have learned about data science advanced statistical models. there are over 42 statistical models. each of them work on algorithms. and there are over 32 algorithms. each library has its own way of writing such statistical models. after learning i will try to upload as much statistical models as possibl
data data-analysis data-science data-visualization
Last synced: 11 Jun 2026
https://github.com/mardavsj/weather-prediction
Weather prediction model which mainly focuses on visualization.
data-analysis data-visualization matplotlib numpy pandas pandas-dataframe
Last synced: 10 Apr 2026