Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-07-03 00:07:37 UTC
- JSON Representation
https://github.com/chiemekaifemegbulem/useful_tools
Advanced Web Scraping
automation beautifulsoup captcha-solving data-analysis data-extraction data-science proxy-rotation python scraping-bots selenium tor-network web-scraping web-scraping-python webscraping
Last synced: 28 Apr 2026
https://github.com/faezeh-gholamrezaie/coronavirus-d3-chart
Tracing the spread of the coronavirus from the White House under the title of diagram D3
coronavirus coronavirus-analysis coronavirus-outbreak-tracking d3 data-analysis datavisualization white-house-coronavirus-outbreak-tracking
Last synced: 29 Apr 2026
https://github.com/adnanrahin/apache-spark-complete-reference
This repository reflects on all the necessary steps to take before jump in into Big Data.
big-data data-analysis data-science kaggle-dataset machine-learning rdd scala spark
Last synced: 29 Apr 2026
https://github.com/manwithacap/by-the-metric-match
🎲🃏 A game data tracker for your board/card/video games!
data-analysis data-visualization games jupyter-notebook python utility
Last synced: 29 Apr 2026
https://github.com/akash1070/data-analytics-virtual-experience-program-by-quantium
Data Analytics Virtual Experience Program by Quantium
data-analysis data-science machine-learning-algorithms python3 tableau
Last synced: 29 Apr 2026
https://github.com/azevedontc/datapulse
DataPulse
automation brazil cli data data-analysis matplotlib meteorology open-meteo pandas prevision pycharm python python3 reports venv weather
Last synced: 29 Apr 2026
https://github.com/alemalvarez/data-analysis-web-project
Web-app providing a simple interface for data storage,
data-analysis data-science javascript react webapp
Last synced: 29 Apr 2026
https://github.com/sejalmankar1012/yuvaco_data_analysis_assessment
This assignment involves writing a Python script to calculate the cost of package deliveries based on provided data and a cost grid. The script takes package details such as weight, distance, and delivery type, applies the cost calculation rules, and saves the results in an output file. You can also run the script in Google Colab for convenience.
csv-file-handling data-analysis google-colab package-delivery python python-scripting
Last synced: 29 Apr 2026
https://github.com/iamjuniorb/data_structures_and_algorithms
I'm working on Data Structures and Algorithms I C949 class in school and decided to write up all of these searching algorithms, sorting algorithms, strutures, and so on to get a better understanding. These can be used with large datasets to test their space and time complexities.
data data-analysis data-science data-structures datastructures datastructures-algorithms datastructuresandalgorithm math mathematics programming python python-app python-library python3
Last synced: 08 Jun 2026
https://github.com/ismailtekin05/caloriedetectingai
🍎🔍 Smart AI system that identifies food items in photos and calculates their calorie content automatically. Built with TensorFlow, YOLOv8, CUDA and computer vision for accurate nutrition tracking.
ai aimodel calorie-calculator computer-vision cuda data-analysis data-science data-segmentation data-visualization dataset dataset-generation image-processing image-recognition python segmentation-models tensorflow ultralytics yaml yolo yolov8
Last synced: 29 Apr 2026
https://github.com/arielle0222/data_analysis
📊 Data analysis projects for autonomous driving and smart mobility engineering using Python and SQL.
autonomous-driving composite data-analysis electric-vehicles environmental-data python visualizatoin
Last synced: 30 Apr 2026
https://github.com/mehulcode12/atliq-bank_creditcard_transaction_analysis
The credit card project at Atliq Bank comprises two key phases: market identification and trial. This initiative aims to leverage mathematical and statistical concepts to analyze data related to demographics, income, credit scores, and spending patterns in order to identify the target audience for the credit card.
codebasics data-analysis data-science data-visualization mathematics python python3 statistics
Last synced: 30 Apr 2026
https://github.com/phillbertnevinemmanuel/movieindustryanalysis-correlation
This project is a comprehensive data analysis endeavor within the Movie Industry, spanning from Data Cleaning to Exploratory Data Analysis, Correlation Analysis, and Temporal Analysis. The dataset was sourced from Kaggle, purportedly scraped using the IMDb API. Python was the primary tool utilized for analysis.
data-analysis data-cleaning python
Last synced: 30 Apr 2026
https://github.com/karthikudyawar/passwordometer
To predict the strength of the password
cybersecurity data-analysis data-visualization dataset docker exploratory-data-analysis-eda fastapi jupyter-notebook mongodb password-security password-strength-meter
Last synced: 30 Apr 2026
https://github.com/alcestide/scianalytics
Playground for Data Analysis and Visualization for Research and Scientifical Purposes with Pandas and Plotly.
csv data-analysis data-science data-visualization pandas plotly python science-research statistics
Last synced: 30 Apr 2026
https://github.com/affec-ds/dashboard-ventas-vinilos
Dashboard interactivo de ventas para tienda de vinilos. Análisis visual, KPIs clave y filtros dinámicos para decisiones comerciales.
business-intelligence data-analysis data-visualization ipywidgets jupyter-notebook kpis matplotlib music-industry notebook-project python retail-analytics sales-analysis seaborn vinyl-records
Last synced: 30 Apr 2026
https://github.com/shubham14p3/python-word-cloud
Simple python application to create word cloud.
data data-analysis data-science data-visualization nbextension python-3 upload-file
Last synced: 01 May 2026
https://github.com/scarblase/salary-comparison
Submission for the DataCamp Salary Competition(1 level). 🏆
data data-analysis data-science data-visualization engineering python sql structured-data
Last synced: 01 May 2026
https://github.com/pedestriandynamics/cloudfast-dl4pude
A Cloud-based Deep Learning System for Improving Crowd Safety at Event Entrances
anomaly-detection artificial-intelligence cloud-environment computer-vision convolutional-neural-network crowd-behavior-analysis data-analysis data-visualisation deep-learning live-camera machine-learning
Last synced: 01 May 2026
https://github.com/lisa-ho/breadit
Respository for scraping and analysing data from the Reddit/Sourdough community to explore lockdown baking trends.
data-analysis data-viz nltk python reddit-api sentiment-analysis web-scraping
Last synced: 01 May 2026
https://github.com/com-480-data-visualization/project-2023-choo-choo-data-darlings
This repository contains the source code for our data visualization project, an interactive platform designed to explore the intricate Swiss transportation network. Developed by the Choo Choo Data Darlings team at EPFL, the project provides an in-depth view into the vast array of Swiss transportation operations, including trains, buses, and trams.
boats buses data-analysis data-science data-visualisation data-visualization epfl metro public-transport public-transportation switzerland trains trams
Last synced: 01 May 2026
https://github.com/henrylin03/china-gdp
Analysis and visualisation of China GDP data using Python.
data data-analysis data-visualisation dataset kaggle pandas
Last synced: 01 May 2026
https://github.com/nafisalawalidris/data-analysis-with-python
This repo features Jupyter Notebook labs for learning data analysis with Python. Explore data acquisition, wrangling, visualization, modeling, and evaluation. Enhance your skills in Python data analysis.
data-acquisition data-analysis data-science data-wrangling exploratory-data-analysis feature-engineering machine-learning model-development model-evaluation-and-refinement pandas
Last synced: 02 May 2026
https://github.com/emso-exe/reclamacoes_de_consumidores_com_empresa_de_telecomunicacoes
Projeto de análise de reclamações de consumidores com empresa de telecomunicações no 1º semestre de 2021 com base nos dados do site consumidor.gov.br.
analise-de-dados ciencia-de-dados data-analysis data-science datascience python python-3 python3
Last synced: 02 May 2026
https://github.com/y-india/project-road-accident-severity-prediction-system
see README below , please.
application data data-analysis data-classification data-cleaning data-science data-visualization data-visualization-project machine-learning ml pandas project real-world-problem-solving real-world-project road-project streamlit-webapp
Last synced: 02 May 2026
https://github.com/gaurav-van/house_price_predictor_streamlit_web_app
Data Science Project to Predict House Prices in Bangalore using the concept of Regression. This Repository is used for Deployment of the Project
data-analysis data-science exploratory-data-analysis machine-learning prediction python regression streamlit
Last synced: 02 May 2026
https://github.com/seankwarren/water-quality-analysis
An examination of water quality in the Atlanta watershed with a focus on identifying neglected areas and potential strategies for improving water quality monitoring
analytics data-analysis jupyter-notebook python
Last synced: 03 May 2026
https://github.com/madhuresh2011/telco-customer-churn-analysis-using-python
The analysis primarily investigates factors influencing customer churn, particularly focusing on payment methods and contract types.
csv data-analysis matplotlib numpy pandas pyhton seaborn vizualisation
Last synced: 02 May 2026
https://github.com/gauthamnairvm/trex-app
Text Refinement EXplorer - An EDA tool for text based data.
data-analysis data-visualization groq-api large-language-models llama3 natural-language-processing text2sql
Last synced: 03 May 2026
https://github.com/ferrangarciarovira/premier-league-betting-analysis
Comprehensive Python analysis of Premier League betting market inefficiencies (2005–2024). Evaluates bookmaker biases, betting strategies, and market efficiency using statistical methods and Monte Carlo simulations.
betting-strategies bias-detection data-analysis market-efficiency monte-carlo-simulation premier-league python sports-analytics
Last synced: 03 May 2026
https://github.com/cs-joy/pandasv2.0.3
learn data analysis with pandas
data-analysis pandas pandas-learning
Last synced: 03 May 2026
https://github.com/manikantasanjay/time_series_data_analysis_on_stocks
Time Series Data Analysis project on Daily Stock Prices of the following companies(Apple, Microsoft, Google, Amazon) for a span of 5 years.
data-analysis pandas stock time-series time-series-analysis
Last synced: 03 May 2026
https://github.com/theairbend3r/mice-memory-response
Effect of memory on current response in mice using methods from computational neuroscience and machine learning.
computational-neuroscience data-analysis data-science machine-learning neuroscience python
Last synced: 09 Jun 2026
https://github.com/shelton-beep/predicting-gpa-using-lifestyle-factors
Predicting student GPA using lifestyle factors like study habits, sleep, and stress levels. A machine learning model built to help students and educators understand the impact of lifestyle choices on academic performance.
data-analysis data-preprocessing data-science feature-engineering gpa-prediction machine-learning model-interpretability predictive-modeling python regression-analysis student-performance xgboost
Last synced: 04 May 2026
https://github.com/ruchit0807/heart_disease_prediction
An interactive ML-powered web app that predicts the risk of heart disease based on clinical inputs like age, chest pain, cholesterol, ECG, and more. Built using Python, Streamlit, and scikit-learn, it offers early risk assessment in a simple and accessible way—just enter your health metrics and get instant feedback.
data-analysis data-science knn-regression pandas streamlit
Last synced: 04 May 2026
https://github.com/gowthamsundaresan/eigenscan
blockexplorer for eigenlayer
crypto data-analysis eigenlayer nextjs web3
Last synced: 04 May 2026
https://github.com/ehtisham-sadiq/building-an-ml-based-heart-disease-diagnosis-system-with-flask
It is an end-to-end project that combines machine learning to create a user-friendly Heart Disease Diagnosis System, powered by Flask.
data-analysis exploratory-data-analysis feature-engineering flask machine-learning model-building model-evaluation pipelines python3 rest-api
Last synced: 04 May 2026
https://github.com/gursv/autoworth
Used Car Price Prediction (India)
data-analysis data-analysis-python data-analytics data-cleaning data-preprocessing data-science-projects eda fine-tuning gridsearchcv machine-learning matplotlib-pyplot pandas python3 random-forest-regressor scikit-learn seaborn
Last synced: 05 May 2026
https://github.com/vara-co/python-api-challenge
Weather and Perfect Vacationing Spots Worldwide, by using APIs
api apis data-analysis data-visualization hvplot jupyter-notebook matplotlib pandas vacation weather
Last synced: 05 May 2026
https://github.com/shuddha2021/stellar-candidate-selector
A sophisticated candidate selection algorithm leveraging multi-criteria analysis and machine learning to identify top software engineering candidates. This tool features flexible filtering, score adjustment, and detailed visualizations to streamline the recruitment process.
candidate-selection data-analysis data-visualization machine-learning pandas plotting-in-python python python-data-analysis recruitment scikit-learn
Last synced: 05 May 2026
https://github.com/wizardoftrap/football-team-analytics
This Jupyter notebook, created on Kaggle, analyzes football player and team statistics for the 2024-2025 season. It provides insights into player performance, team metrics, and playing styles across major European leagues using data from the dataset players_data-2024_2025.csv.
data-analysis data-visualization jupyter-notebook pandas python
Last synced: 05 May 2026
https://github.com/kiranmayi5/python-projects
A collection of Python projects showcasing skills in data analysis and visualization.
data-analysis data-visualization machine-learning nlp python
Last synced: 05 May 2026
https://github.com/myounus-codes/saleprice-prediction-dataset-analysis-and-cleaning-advance-regression
In this project I have cleaned the data for the model. Project Google Colab Link: https://colab.research.google.com/drive/1vQY-XEFJSdEkW2PQOSf1j13Yk8L-XXNw?usp=sharing
algorithms data-analysis data-science eda google-colab machine-learning numpy pandas python scikit-learn scikit-learn-python
Last synced: 05 May 2026
https://github.com/githubuseraccountamazing/the-amari-project
a project in which I attempted to push some of the limits of stable-diffusion while taking some data along the way
ai ai-generated-images bash data-analysis machine-learning stable-diffusion textual-inversion
Last synced: 05 May 2026
https://github.com/mr-vozhyk/karpov.courses-study
Часть заданий, мини-проектов и финальный проект от karpov.courses
airflow data-analysis git python sql statistics
Last synced: 05 May 2026
https://github.com/mrankitgupta/titanic-survival-prediction-93-xgboost
Titanic Survival Prediction Project (93% Accuracy)🛳️ In this notebook, The goal is to correctly predict if someone survived the Titanic shipwreck using different Machine Learning Model & Hyperparameter tunning.
classification data-analysis data-science data-visualization gradient-boosting kaggle-competition linear-regression logistic-regression machine-learning machine-learning-algorithms ml ml-models nlp prediction predictive-modeling random-forest titanic titanic-kaggle titanic-survival-prediction xgboost
Last synced: 06 May 2026
https://github.com/scarblase/portfolioprojects
A collection of data analysis and business intelligence projects using SQL, Python, and visualization tools to uncover insights from real-world datasets. 🚀📊
csv data-analysis data-engineering data-mining data-science data-visualization matplotlib matplotlib-pyplot pandas python python3 seaborn sql
Last synced: 06 May 2026
https://github.com/monish-nallagondalla/diamondpriceprediction
Diamond Price Prediction is an end-to-end machine learning project that predicts diamond prices based on attributes like carat, cut, color, clarity, and dimensions. It features a Flask web application for real-time predictions and utilizes models such as Linear Regression, Lasso, and Ridge.
data-analysis data-science flask jupyter-notebooks machine-learning predictive-modeling python
Last synced: 06 May 2026
https://github.com/freebirdscrew/covid-19-data-analysis
Coronavirus Data-Analysis with Live Data Streaming from the Website and Made a DASH Web-App at Last.
coronavirus coronavirus-real-time coronavirus-tracking countryinfo covid-19 covid-19-india covid19 covid19-data dash dash-button dashboard-application data data-analysis data-cleaning data-science data-visualization github jupyter pycountry python
Last synced: 07 May 2026
https://github.com/allanccwang/electronic_projects
implement the circuit with microcontroller
arduino circuit-analysis circuit-simulations circuits-and-electronics cpp data-analysis microcontroller physics python wemos
Last synced: 07 May 2026
https://github.com/sivas-2/coffee-sales-visualization
This repository contains data visualization scripts and notebooks analyzing coffee sales data from a vending machine, sourced from Kaggle. The visualizations explore sales trends, customer preferences, and product popularity over time.
data data-analysis data-science data-visualization python visualization
Last synced: 07 May 2026
https://github.com/eduardoedubox/health_data_analysis
Health data analysis using Jupyter Notebook
data-analysis data-science database jupyter-notebook pandas python
Last synced: 07 May 2026
https://github.com/backdoorali/insider-threat-detection-project
Personal data analysis project combining insider threat detection, cybersecurity, and exploratory data analytics. Built for portfolio showcase and practical skills demonstration.
cybersecurity data-analysis data-analysis-excel data-analysis-project data-analyst data-analytics data-visualization eda excel insider-threat jupyter-lab jupyter-notebook matplotlib numbers pandas portfolio-project python python3 threat-detection threat-intelligence
Last synced: 07 May 2026
https://github.com/gorodroz/crypto-tracker
Realtime Bitcoin price tracker using Binance WebSocket and REST API. Logs prices to CSV and supports Pandas for data analysis.
binance bitcoin crypto csv-logger data-analysis pandas python rest-api websocket
Last synced: 07 May 2026
https://github.com/md-emon-hasan/data-science
Data science tutorials, including data preprocessing, analysis, visualization, project deployment, machine learning and deep learning algorithms.
artificial-intelligence data-analysis data-engineering data-science deep-learning machine-learning-algorithms python
Last synced: 07 May 2026
https://github.com/dogan-the-analyst/web_scraping_job_vacancies
data-analysis python web-scraping
Last synced: 07 May 2026
https://github.com/y-india/retail-sales-analysis-project
Analysis and preprocessing of retail store sales data. Includes data loading, merging, and initial inspection. 📌 Recommended: See README.md for detailed project progress and dataset information.
ai dashboard data-analysis data-science data-visualization jupiter-notebook machine-learning matplotlib python real-world-problem-solving real-world-project retail-analytics sales-analysis seaborn sklearn-library streamlit
Last synced: 07 May 2026
https://github.com/bassamn/titanic-data-analysis
Exploratory data analysis (EDA) of the Titanic dataset using Python. Analyzed survival patterns by age, gender, and class with visualizations (seaborn/matplotlib). Non-ML focus—highlighting insights with statistics and plots.
data-analysis eda pandas python seaborn titanic visualization
Last synced: 08 May 2026
https://github.com/geo-y20/loan-approval-automation-using-mongodb-and-pymongo
This project demonstrates the implementation of a loan approval system that utilizes MongoDB for distributed data storage and management, and PyMongo for database operations. The project aims to automate the assessment of loan eligibility using customer details from online applications.
crud-application data data-analysis data-science data-visualization deployment jupyter-notebook loan-default-prediction loan-prediction-analysis machine-learning machine-learning-algorithms matplotlib mongodb pymongo streamlit web
Last synced: 08 May 2026
https://github.com/robson-python/airplane-price-data-analysis
Airplane Price Data Analysis - Airplane Price Prediction
data-analysis data-science data-visualization jupyter-notebook linear-regression machine-learning matplotlib pandas python seaborn vscode
Last synced: 10 Jun 2026
https://github.com/guglielmo/datalab-notebooks
Data analysis at openpolis
data-analysis data-science jupyter-notebooks pandas python3
Last synced: 08 May 2026
https://github.com/iguptashubham/ott-churn-eda-ml
Understanding why customers discontinue their subscriptions will be crucial in optimizing the user experience, reducing churn, and maximizing customer lifetime value. By using Machine learning model to predict the Customer Churn.
data-analysis data-analysis-project data-science data-science-portfolio data-science-projects data-visualization machine-learning python
Last synced: 08 May 2026
https://github.com/miroslav-reiter/kurz_jazyk_sql_analytici_datovi_vedci
Materiály ku kurzu Jazyk SQL 1 pre Analytikov a Dátových Vedcov
analysis analytics data data-analysis data-science database mysql reiter sql
Last synced: 08 May 2026
https://github.com/framebuffers/mindhunter
Wrappers for Pandas DataFrames to add quicker access for common statistical values, utilities and functionality.
data-analysis data-science numpy pandas python utilities-python
Last synced: 08 May 2026
https://github.com/md-emon-hasan/data_analytics_project
Data analytics tasks and solutions, featuring hands-on exercises for data cleaning, visualization, and analysis using Python libraries.
cars-dataset census-data covid19-data data-analysis london-house-price police-data weather-data
Last synced: 08 May 2026
https://github.com/sermonzagoto/data_manipulation_with_pandas
Data Manipulation with Pandas - Part 1
data-analysis data-science jupyter-notebook pandas-python python
Last synced: 09 May 2026
https://github.com/tanu272004/-income-mortgage-housing-insights-a-state-city-analysis-
To analyze state & city housing trends and affordability using data analytics.”
bigquery business-intelligence data-analysis data-visualization dax googlecloud kpi numpy powerbi predective-modeling python sql
Last synced: 09 May 2026
https://github.com/rubinlake/rl-academy-data-analytics
Educational data analysis project demonstrating BMW sales data analysis with AI-powered code assistance using Cursor IDE and Jupyter notebooks
cursor-ide data-analysis educational-project jupyter langchain matplotlib numpy pandas python scipy seaborn
Last synced: 09 May 2026
https://github.com/mariam-badr-mb/gtc-ml-project2-diabetes-prediction
This project is part of the GTC Machine Learning Program. It demonstrates the end-to-end ML workflow by building a predictive model for diabetes detection
classification-algorithm data-analysis data-visualization diabetes-prediction gridsearchcv hyperparameter-tuning machine-learning python
Last synced: 09 May 2026
https://github.com/pratik-khose/data-analysis-with-pandasai
PandasAI with Llama3 for Interactive Data Analysis
data-analysis llama3 llma pandasai streamlit visualization
Last synced: 11 May 2026
https://github.com/chayandatta/got_script_manipulation
Game of Thrones Script - String & file manipulation
data-analysis data-science pandas python3
Last synced: 11 May 2026
https://github.com/ahmednasef3/heart-attack-full-eda
Simple EDA for Heart Attack Dataset.
data-analysis data-science data-visualization eda exploratory-data-analysis heartattack matplotlib pandas seaborn
Last synced: 11 May 2026
https://github.com/is-leeroy-jenkins/sherpa
A budget execution & data analysis tool based on Winforms, .NET 6, and written in C# for EPA analysts
budget-management data-analysis data-science data-visualization federal-government
Last synced: 13 May 2026
https://github.com/jo-phil/hopkins-statistic
A Python package for computing the Hopkins statistic to assess clustering tendency.
cluster-analysis cluster-tendency clustering data-analysis data-science hopkins-statistic machine-learning python statistical-analysis statistics
Last synced: 13 May 2026
https://github.com/zpreisler/modules
Python libraries and modules for processing simulation outputs
data-analysis python scripts tensorflow
Last synced: 13 May 2026
https://github.com/iguptashubham/pizzahut-analysis-sql
best dataset for data analysis. Pizzahut data analysis done by Shubham Gupta in MySql. This dataset is provided by friend of mine intern at pizzahut. In pizzahut, they used this dataset to train and ask question. This data does not reveal anything about the pizzahut. It is safe to share. data
data-analysis data-analytics database dataset datasets mysql mysql-database pizzahut
Last synced: 14 May 2026
https://github.com/pferreirafabricio/data-immersion
🏊🏻♂️ Activities and exercises from 'Imersão Dados' event
data data-analysis data-science dataset jupiter-notebook python
Last synced: 14 May 2026
https://github.com/reinmagine/eliminating-no-sensor
Contains my project that analyzes air quality sensor data to determine if the NO (Nitric Oxide) sensor in N. Mai, Los Angeles, CA can be removed without affecting data accuracy.
air-quality-sensor colab-notebook cost-optimization data-analysis data-optimization matplotlib-python nitric-oxide pyspark-python python sql
Last synced: 14 Jun 2026
https://github.com/soufianboukir/ecom-analytics-platform
End-to-end data science project on an Amazon sales dataset, including data preprocessing, analysis, modeling, and a Streamlit dashboard for insights and decision-making.
data-analysis data-science data-visualization data-visualization-dashboard forecasting-models timeseries
Last synced: 14 Jun 2026
https://github.com/kaz-yos/distributed
Comparison of Privacy-Protecting Analytic and Data-sharing Methods: a Simulation Study (Pharmacoepidemiol Drug Saf 2018)
data-analysis epidemiology statistics
Last synced: 15 Jun 2026
https://github.com/ensinho/data-analysis
My repository for data analysis studys in Python.
csv data-analysis graphs python python-documentation
Last synced: 15 Jun 2026
https://github.com/metalwarrior665/actor-results-checker
apify data-analysis json-schema-checker
Last synced: 16 Jun 2026
https://github.com/kaushik0911/jubilant-guide
A Streamlit application for advanced route planning and accessibility analysis using OpenRouteService (ORS). Explore optimal routes while avoiding roadblocks and discover points of interest (POIs) within travel time ranges.
data-analysis data-visualization geospatial-analysis python streamlit
Last synced: 16 Jun 2026
https://github.com/shrawans007/google_cyclistic_2023
Google Data Analytics Capstone Case Study (SQL and Tableau)
big-query bigquery coursera-assignment cyclistic cyclistic-bike-share-analysis-case-study cyclistic-bikshare data-analysis data-analysis-project data-analytics data-cleaning data-combination data-exploration data-science google-data-analytics sql tableau tableau-dashboard tableau-public
Last synced: 19 Jun 2026
https://github.com/lebrancconvas/how-much-love-in-thai-song
How much Love song among the Thai Songs?
data-analysis side-project web-scraping
Last synced: 19 Jun 2026
https://github.com/kirkalyn13/open-signal-report-generator
Script used to generate results/summary, including the trends of flagged provinces, from the raw excel data file,
data-analysis data-science data-visualization matplotlib numpy pandas python
Last synced: 19 Jun 2026
https://github.com/mohnoor94/datasciencefundementalsusingpython
My journey to learn Data Science with Python
data data-analysis data-science data-visualization learning learning-by-doing python python3
Last synced: 19 Jun 2026
https://github.com/souravsuvarna/whatsapp-chat-analyzer-api
The WhatsApp Chat Analyzer API is a public api specifically designed for frontend enthusiasts who are interested in building a WhatsApp Chat Data Visualizer project. Built on FastAPI, this API offers a seamless and efficient method to process chat data and returns the processed result data in JSON format.
api data-analysis data-science fastapi publicapi python
Last synced: 20 Jun 2026
https://github.com/markmusic27/data-statistics-calculator
💣 This method (made in JavaScript / Python) can find the mean, median, mode, range, and standard deviation.
data-analysis standard-deviation statistics statistics-calculator
Last synced: 20 Jun 2026
https://github.com/alicankaya192/world-happiness-report-2025
Comprehensive exploratory data analysis (EDA) and visualization of the World Happiness Report 2025. Analyzes global rankings, regional distributions, key happiness factors, and detects wealth-happiness paradox outliers using Python (Pandas, Matplotlib, SciPy).
correlation-analysis data-analysis data-science data-visualization eda exploratory-data-analysis global-happiness happiness-index matplotlib pandas python scipy statistics whr-2025 world-happiness-report
Last synced: 21 Jun 2026
https://github.com/datadotworld/dw-jupyter-contents
Jupyter ContentsManager implementation for data.world
data data-analysis data-science dwstruct-t50-public-projects jupyter jupyter-notebook jupyterlab reference-implementation
Last synced: 22 Jun 2026
https://github.com/ryanfranklin237/data-cleansing
A group of python scripts that clean large data sets by removing duplicate data, putting data in correct formats, and removing redundant cells
data-analysis data-cleaning data-science extract-transform-load pandas-dataframe python
Last synced: 23 Jun 2026
https://github.com/abhik1711/material-classification-and-energy-band-prediction---excavate-25
A Two-Stage Machine Learning Pipeline: A Binary Classifier to identify insulators with high accuracy and a Stacking Regressor to predict precise band gap values for insulators by leveraging advanced feature engineering techniques and ensemble learning methods
data-analysis machine-learning python
Last synced: 23 Jun 2026
https://github.com/rudra-g-23/find-my-joint
A utility to find potential join keys (matching columns) across multiple DataFrames.
data-analysis data-visualization join network-graph pandas pandas-dataframe
Last synced: 24 Jun 2026
https://github.com/anandanraju/youtube-data-api-model
The YouTube Analytics API enables you to generate custom reports containing YouTube Analytics data. The API supports reports for channels and for content owners. Report fields are characterized as either dimensions or metrics
analytics data-analysis data-science metrics model python telemetry youtube youtube-api
Last synced: 03 May 2026
https://github.com/jabhij/eda_experiments
In this repo I'll use different types of datasets to explore and implement various Exploratory Data Analysis (EDA) approaches.
ames-housing analysis battery-life blackfriday-analysis data-analysis data-science data-visualization eda matplotlib-pyplot numpy pandas python seaborn visualization zomato-data-analysis
Last synced: 14 Apr 2026
https://github.com/jossimmar/ensa-scripts_py
Repositorio destinado al manejo de datos de consumo de los Clientes Mayores de ENSA del Grupo Distriluz.
data-analysis electrical-engineering python sqlite
Last synced: 10 May 2026
https://github.com/neerajcodes888/whatsapp-chat-analyzer
A Python tool for effortless analysis of WhatsApp conversations. Gain insights with basic statistics, word cloud visualizations, and URL statistics. Powered by pandas, urlextract, wordcloud, seaborn, and Streamlit. 📊📱
analyzer chat data-analysis data-visualization pandas python3 seaborn urlextract whatsapp wordcloud
Last synced: 12 Apr 2026