Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-07-03 00:07:37 UTC
https://github.com/vimlesh-gupta/blinkit_data_analytics_project
End-to-end Blinkit data analytics project using Python, SQL Server & Power BI
blinkit data-analysis eda pandas powerbi python sql-server
Last synced: 06 May 2026
https://github.com/dhruwsunita/iphones-eda-analysis
Exploratory data analysis (EDA) of Apple products.
data-analysis data-visualization eda matplotlib numpy pandas plotly python seaborn
Last synced: 06 May 2026
https://github.com/aicorsair/python-case-study-ab-testing-for-lunartech-homepage-cta-button
This repository contains a detailed case study on an A/B test of LunarTech's homepage CTA button, using proxy data structured similarly to the company's real data.
ab-testing click-through-rate confidence-intervals data-analysis data-analytics data-exploration data-science data-visualization hypothesis-testing matplotlib normal-distribution numpy pandas practical-significance python statistical-analysis statistical-significance z-critical z-statistic z-test
Last synced: 07 May 2026
https://github.com/jjkay03/discord-call-extractor
Collects HTML data from Discord group chats and DMs to create a database of calls
data-analysis database discord discord-tool
Last synced: 07 May 2026
https://github.com/manel15279/datamining-project
A university project that aims to explore various data mining techniques like Data Exploration, Association Rule Mining, Supervised and Unsupervised Learning, applied to real-world datasets, focusing on soil fertility analysis and COVID-19 cases evolution over time.
covid-19 data-analysis data-mining data-visualization datascience gradio machine-learning python soil-properties
Last synced: 10 Jun 2025
https://github.com/ddihora1604/advanced_business_analytics_on_world_bank_global_financial_inclusion_data_2021
Bridging the Gaps in Financial Inclusion: Understanding the Cash-Credit Paradox, Divide between Cash and Digital Payments, and Financial Resilience.
advanced-excel business-analytics data-analysis data-engineering data-mining data-visualization database exploratory-data-analysis machine-learning preprocessing-data python
Last synced: 07 May 2026
https://github.com/anshulkansal121/music_store_analysis_sql
SQL project to analyze online music store data
beginner-friendly contributions-welcome data-analysis database mysql sql
Last synced: 07 May 2026
https://github.com/shinie19/sql-data-warehouse-project
Build a modern Data Warehouse from scratch with SQL Server, including ETL processes, data modeling and analytics.
data-analysis data-analytics data-cleaning data-engineering data-lake data-lakehouse data-modeling data-normalization data-science data-standardization data-warehouse etl-pipeline medallion-architecture sql-server
Last synced: 29 Jun 2026
https://github.com/aidan-zamfir/the-iliad
Data analysis & relationship network for the characters of Homer's Iliad
data data-analysis dataframes networks networkx python selenium spacy webscraping
Last synced: 08 May 2026
https://github.com/devexpress-examples/wpf-pivot-grid-group-date-time-values
This example shows how to group date-time values in Pivot Grid for WPF.
data-analysis dotnet dxpivotgrid pivot-grid pivot-grid-for-wpf wpf
Last synced: 08 May 2026
https://github.com/phanchenh/adventureworkdataset-rfm-analysis-sqlproject
RFM Analysis Using SQL on the AdventureWorks Dataset (2011-2014)
business-analytics business-intelligence data-analysis mssql rfm-analysis sql
Last synced: 10 Jun 2026
https://github.com/shelton-beep/trading-algorithm
A simple trading algorithm for SPY ETF using a moving average crossover strategy. This project analyzes SPY weekly price data, implements a buy/sell algorithm, and tracks performance metrics to evaluate profitability and risk. Ideal for learning algorithmic trading basics and financial data analysis.
data-analysis financial-analysis investment-strategy jupyter-notebook pandas python quantitative-finance technical-analysis time-series-analysis trading-strategies
Last synced: 08 May 2026
https://github.com/pedrosfaria2/analisandopostshn
Project to analyze posts from the Hacker News community
analise-de-dados data-analysis datetime jupyter-notebook matplotlib python python3
Last synced: 08 May 2026
https://github.com/mikeesto/ausvotes19
🐦 A collection of 67,284 public tweets published on the night of the 2019 Australian election
australia data-analysis data-visualization elections open-data twitter
Last synced: 06 Apr 2025
https://github.com/suhas-005/eda-indian-startup-funding
Exploratory Data Analysis on Indian Startup Funding (2015-2020)
data-analysis data-analytics data-science data-visualization exploratory-data-analysis matplotlib pandas python seaborn startup-funding
Last synced: 09 May 2026
https://github.com/tsbarr/toronto-open-data
Analysis of Toronto's open data initiatives. 🌆 Exploring Toronto's urban systems through data science 📊 Python-based analyses of public datasets 🔍 Focus on community impact and urban patterns 🎓 Academic rigour meets practical insights 🔄 Regularly updated with new analyses
api-integration civic-tech ckan-api data-analysis data-cleaning data-science data-visualization exploratory-data-analysis jupyter-notebook open-data pandas public-data python tableau toronto urban-analytics
Last synced: 09 May 2026
https://github.com/mehtadigisha/clean-visualize-analyze
Clean Visualize Analyze
cleaning-data data-analysis data-cleaning data-visualization eda juypter-notebook pandas python seaborn seaborn-plots visualization
Last synced: 09 May 2026
https://github.com/guptaachin/airline-sentiment-analysis-from-twitter-feeds
Sentiment analysis of airline service providers from Twitter feeds
classification data-analysis data-science jupyter-notebook machine-learning natural-language-processing pandas pca python sklearn-library tf-idf visualization
Last synced: 09 May 2026
https://github.com/europanite/data-analysis-stability-evaluator
data-analysis-stability-evaluator
data-analysis data-analysis-project data-analysis-reliability data-analysis-stability data-analytics small-data-change
Last synced: 29 Jun 2026
https://github.com/master-helix/ibm-data-analyst-certification-stock-analysis-project
This is a mini project repository of my IBM Certification involving stock analysis and plotting of Tesla and GameStop
analytics data data-analysis data-visualization ibm matplotlib pandas python web-scraping
Last synced: 09 May 2026
https://github.com/prakashjha1/new-analysis-using-llm-locally
An interactive news analysis tool built with Streamlit and local LLMs. This app allows users to analyze and gain insights from the latest news articles using advanced language models, all running locally. Explore trends, sentiment, and key topics with an intuitive interface.
artificial-intelligence data-analysis data-science llms ollama python streamlit
Last synced: 14 Mar 2025
https://github.com/mradovic38/cnn-dominant-color-recognition
Detecting a dominant color of Tulip images using various convolutional neural networks.
artificial-intelligence cnn computer-vision data-analysis deep-learning dominant-color kmeans-clustering machine-learning neural-network pytorch regression resnet resnet-18 squeeze-and-excitation
Last synced: 01 Jul 2026
https://github.com/tyriek-cloud/nyc-mobility-survey-analysis
An end-to-end data engineering project in which five NYC DOT datasets were modified in an ETL process and analyzed for insights.
aws aws-athena aws-glue aws-glue-crawler aws-quicksight aws-s3 data-analysis data-engineering etl-pipeline json python
Last synced: 09 May 2026
https://github.com/datalopes1/manufacturing_defects
EDA project using the Manufacturing Defects dataset, which can be found on Kaggle
data-analysis data-visualization eda exploratory-data-analysis python
Last synced: 09 May 2026
https://github.com/europanite/gundam-forest
Random forest analysis of the killed-in-action rate for personnel in the Gundam world, in the style of the Titanic dataset.
data data-analysis data-science data-visualization death-rate death-rates gundam-model gundam-series gundom-forest jupyter jupyter-notebook kia kill-in-action python random-forest titanic titanic-kaggle titanic-survival titanic-survival-prediction
Last synced: 29 Jun 2026
https://github.com/simranjeet97/netflix-analysis-top-rated-_visualization_plotly
Netflix data analysis of age-based ratings and top genres of 2021 for movies and TV shows, alongside data visualization
data-analysis data-science data-visualization database datascience datastructures deep-learning google google-cloud-platform machine-learning machine-learning-algorithms netflix netflix-dataanlysis netflix-dataset netflix-prize python3
Last synced: 10 May 2026
https://github.com/fbarffmann/python-api-challenge
Accessed and analyzed real-world weather and location data using Python and public APIs. Automated data collection, cleaned API responses, and visualized geographic trends to support business-ready insights.
api automation data-analysis data-visualization google-places-api mapping openweathermap-api pandas python weather-analysis
Last synced: 10 May 2026
https://github.com/devexpress-examples/winforms-pivot-create-user-folders-within-the-customization-form
This example demonstrates how to organize the Customization Form fields in folders.
data-analysis dotnet pivot-grid-for-winforms winforms xtrapivotgrid-suite
Last synced: 10 May 2026
https://github.com/vatshayan/research-papers-
Research papers as blueprints, drafts, preprints, and published papers
artificial-intelligence big-data blockchain blockchain-technology data-analysis data-mining data-science data-visualization deep-learning deep-neural-networks image-classification image-manipulation image-processing machine-learning machine-learning-algorithms machine-learning-library python supervised-learning unsupervised-machine-learning webdevelopment
Last synced: 10 Jun 2026
https://github.com/szuzick/us-immigration-presidential-analysis
Power BI dashboard analyzing 40 years of U.S. immigration data across presidential administrations (1981-2020)
dashboard data-analysis data-visualization government-data immigration powerbi powerbi-dashboards powerbi-visuals presidential-analysis
Last synced: 10 Jun 2026
https://github.com/codingvangogh/data-science
Data Science, Machine Learning, Data Exploration, Big Data etc
data-analysis datascience decision-tree-classifier decision-tree-regression heatmap jupyter-notebook machinelearning python python3 ridge-regression seaborn sklearn svm-classifier
Last synced: 11 May 2026
https://github.com/shridhar1504/titanic-survivor-prediction-datascience-classification-project
This project predicts whether a passenger on the Titanic survived, using machine learning algorithms on the given passenger data.
classification-algorithm data-analysis data-analytics data-cleaning data-preprocessing data-science data-visualization eda exploratory-data-analysis gradient-boosting jupyter-notebook machine-learning machine-learning-algorithms matplotlib naive-bayes-classifier predictive-modeling python3 scikit-learn seaborn supervised-learning
Last synced: 11 May 2026
https://github.com/farhad-here/weatherprediction_webapplication
Weather prediction web application with Streamlit
chart data-analysis datavisualization geopy pandas pydeck python requests streamlit weather-app webapplication
Last synced: 11 May 2026
https://github.com/syed-amjad-ali/-bank-churn-ml
Predicting bank customer churn using machine learning. This project includes exploratory data analysis (EDA), feature engineering, classification models (Logistic Regression, Random Forest), and customer segmentation using K-Means clustering.
classification data-analysis data-science eda jupyter-notebook k-means-clustering machine-learning ml python segmentation
Last synced: 09 Mar 2025
https://github.com/sferez/gradient_descent
Multiple Linear Regression, Gradient Descent with Python
data-analysis data-science gradient-descent linear-regression python
Last synced: 12 May 2026
https://github.com/jayita11/customer-engagement-insights-for-yelp-restaurant-business-success
This project analyzes Yelp restaurant data using SQLite, Python, and Tableau to explore user engagement, reviews, and ratings. It provides insights into restaurant success across cities, regions, and user behavior.
customer-engagement data-analysis interactive-visualizations json python ratings review sqlite3 tableau-dashboards-for-data-visualization yelp-restaurants
Last synced: 12 May 2026
https://github.com/krypten/playingcardsstatisticalanalysis
Statistical Analysis of Playing Cards (Descriptive Statistics: Final Project)
data-analysis machine-learning machinelearning python statistics udacity
Last synced: 12 May 2026
https://github.com/elishah-john/happiness-report-2019
Analysis of "Happiness Report 2019" using Python.
data-analysis data-visualization educational jupyter-notebook python
Last synced: 12 May 2026
https://github.com/johannaschmidle/amazon-cat-couch
Customer product reviews + ratings analysis and visualization [Python, Excel, Tableau, R]
data-analysis data-visualization jupyter-notebook python-notebook r-markdown sentiment-analysis text-analysis web-scraping
Last synced: 11 Jun 2026
https://github.com/shridhar1504/peerloankart-loan-fraud-detection-datascience-project
This project uses machine learning to predict whether a loan applicant will repay their loan. The project uses a dataset of historical loan data from PeerLoanKart, a peer-to-peer lending platform.
classification-model data-analysis data-analytics data-cleaning data-science data-visualization dimensional-analysis eda exploratory-data-analysis feature-engineering gradient-boosting-classifier hyperparameter-tuning jupyter-notebook machine-learning machine-learning-algorithms predictive-modeling python supervised-learning
Last synced: 30 Apr 2026
https://github.com/devanshsahu47/prime-content-analytics
Prime Data Explorer analyzes Amazon Prime's content and credits data to uncover trends in release years, genres, and ratings. It cleans, merges, and visualizes the data to provide actionable insights for optimizing content strategy and boosting audience engagement.
data-analysis data-visualization exploratory-data-analysis jupyter-notebook python3
Last synced: 13 May 2026
https://github.com/ani717/pneumonia_detection_effecientnet_b7
Pneumonia detection in chest X-ray images with EfficientNet-B7. Accuracy = 87.98%, Precision = 100%, Recall = 83.87%, F1 Score = 91.23%.
cnn computer-vision data-analysis data-augmentation efficientnet image-classification image-processing machine-learning
Last synced: 13 May 2026
https://github.com/rybakov-ks/particleanalyzer
A Computer Vision-based tool for automatic segmentation and size analysis of particles in Scanning Electron Microscope (SEM) images.
computer-vision data-analysis deep-learning detectron2 electron-microscopy image-segmentation materials-characterization microscopy-images nanotechnology object-detection particle-analysis scanning-electron-microscopy scientific-research sem sem-image-analysis yolo
Last synced: 13 May 2026
https://github.com/danmadeira/algoritmos-estatistica-pl-sql
Demonstration of statistics algorithms in PL/SQL
algorithms data-analysis data-science database oracle oracle-database pl-sql statistics
Last synced: 11 Jun 2026
https://github.com/ibrahimhabibeg/national-university-of-singapore-sms-analysis
Analysis of SMS messages collected by the National University of Singapore
analytics data-analysis data-science nlp python
Last synced: 13 May 2026
https://github.com/eslamdyab21/weratedogs-twitter-data-analysis
In this challenging project, I carry out the data wrangling process
csv data-analysis data-wrangling data-wrangling-twitter json-data pandas python twitter udacity-data-analyst-nanodegree
Last synced: 14 May 2026
https://github.com/bhaveshbhakta/emotions-analysis-using-gru
Emotions Analysis
data-analysis data-preprocessing data-visualization deep-learning emotion-analysis gru rnn
Last synced: 14 May 2026
https://github.com/deliprofesor/joblocationmapper
JobLocationMapper is a Python tool that visualizes job listings on an interactive map. It uses city and state data to place job markers accurately and color-codes them by occupation (Software, Marketing, Design). The map clusters markers for better organization, and users can click on them to view job details.
clustrered-markers data-analysis data-visualization folium geocoding geographical-visualization interactive-map job-listings map-visualization pandas python
Last synced: 14 May 2026
https://github.com/balajimohan18/foreign-exchange-rate-time-series-datascience-project
This project uses time series analysis to forecast the exchange rate between the euro and the US dollar. It applies a variety of statistical techniques, such as ARIMA, to model the data and forecast the exchange rate.
data-analysis data-analytics data-preprocessing data-science data-transformation data-visualization eda exploratory-data-analysis foreign-exchange-rates machine-learning model-fitting predictive-modeling python3 time-series time-series-analysis
Last synced: 14 May 2026
https://github.com/yashsingh43/cdc-sleep-duration-health-analysis
Analysis of CDC BRFSS 2022 data exploring how sleep duration relates to mental and physical health outcomes.
beautifulsoup brfss cdc data-analysis data-visualization matplotlib pandas plotly public-health python
Last synced: 11 Jun 2026
https://github.com/prakhargpt/sql-data-warehouse-project
A data warehouse project built with SQL Server, including ETL processes, data modeling, and analytics.
analytics data data-analysis data-cleaning data-engineering data-engineering-pipeline data-lakehouse data-science data-warehouse etl etl-job etl-pipeline medallion-architecture sql sql-server
Last synced: 12 Jun 2026
https://github.com/rachel-xmr/data-analysis-of-hand-drawn-symbols
Data analysis of hand-drawn symbols
cross-validation data-analysis decision-trees k-means-clustering machine-learning model-evaluation r
Last synced: 12 Jun 2026
https://github.com/sambit-mondal/stockx
StockX is a full-stack application designed to help store owners efficiently manage their inventory, track purchases, and analyze stock levels. The system integrates MongoDB, Express, React, and Flask (Python) to provide a seamless experience.
artificial-intelligence data-analysis inventory-management-system machine-learning mern-stack
Last synced: 12 Jun 2026
https://github.com/madrury/commute-times
Simulated Commute Times Data
data-analysis data-science data-visualization dataset
Last synced: 12 Jun 2026
https://github.com/saksham-jain177/cryptodataanalysis
A Python-powered project that fetches live cryptocurrency data from the CoinMarketCap API, analyzes it, and updates a live Excel sheet every 5 minutes.
api-integration coinmarketcap cryptocurrency data-analysis excel live-data python
Last synced: 12 Jun 2026
https://github.com/marialuizaleitao/walmartsalesanalysis
This project explored data collection and preprocessing, advanced application of SQL queries, and feature engineering. Key calculations, such as COGS (Cost of Goods Sold) and VAT (Value Added Tax), were performed to assess the profitability and financial efficiency of the branches.
business-analytics data-analysis mysql-database sql
Last synced: 13 Jun 2026
https://github.com/cannt39t/wylsacom-analysis-reflinks-datamining
data data-analysis data-mining python3 sql
Last synced: 13 Jun 2026
https://github.com/nob101/lotto-analyzer
A Node.js and SQLite web application to analyze, track, and evaluate lottery (Euromillionen) and Joker results.
backend css data-analysis express html5 javascript nodejs sqlite statistical-analysis
Last synced: 14 Jun 2026
https://github.com/mohsinraza2999/generous-tipper
A production-level, modular data science project that aims to predict generous tippers for taxi drivers.
backend-development ci-pipeline data-analysis data-cleaning-and-preprocessing docker exploratory-data-analysis fastapi feature-engineering front-end hypothesis-testing logistic-regression randon-forest understanding-business-problem xgboost-classifier
Last synced: 14 Jun 2026
https://github.com/jkazari/rollercoaster-eda
Repository of a small data-analysis project in R for the Mathematical Software class in the 3rd semester of the Mathematics program at Gdańsk University of Technology
Last synced: 14 Jun 2026
https://github.com/tawfikhammad/sql-leetcode-solutions
Solutions to the LeetCode SQL 50 problems
data-analysis data-engineering database leetcode leetcode-solutions sql
Last synced: 15 Jun 2026
https://github.com/prathmesh2507/global-stock-intelligence-dashboard
Interactive Global Stock Market Analytics Dashboard built using Python, YFinance, Pandas, Streamlit, and Plotly. Analyze 20+ countries and 400+ top stocks with advanced visualizations and financial insights.
dashboard data-analysis data-visualization python stock-analysis streamlit
Last synced: 15 Jun 2026
https://github.com/manojrathod0777/loan-prediction
Predict loan approval status using machine learning techniques. This project demonstrates data preprocessing, feature engineering, model training, and evaluation, along with an interactive Streamlit app for real-time predictions. Ideal for financial decision-making.
classification-models data-analysis data-science financial-analytics jupyter-notebook loan-prediction machine-learning predictive-modeling python streamlit-app
Last synced: 13 Apr 2026
https://github.com/luminati-io/Shopee-dataset-samples
A sample dataset of over 1000 Shopee products, extracted using the Bright Data API, ideal for pricing optimization, gap analysis, and market strategy refinement.
api data-analysis data-mining datasets products shopee web-scraping
Last synced: 09 Apr 2025
https://github.com/leandrocollares/nyc-film-permits
NYC film permits: an exploratory data analysis
data-analysis data-visualization pandas plotly
Last synced: 05 Jul 2025
https://github.com/marielachirinosr/nyc-taxi-trip-exploration-2019-2020
Explores passenger behavior & impact of COVID-19 on NYC taxi industry (Q1 2019-2020).
bigquery data data-analysis data-visualization python sql tableau
Last synced: 15 Jun 2026
https://github.com/shellynagar27/candy-market-share-analysis
Candy Market Share Analysis explores confectionery sales data using Power BI, Python, and Power Query. It uncovers key market trends, top-selling candies, manufacturer performance, and packaging preferences to support data-driven decision-making for industry researchers.
critical-thinking data-analysis data-visualization exploratory-data-analysis powerbi powerquery problem-solving sales-analysis
Last synced: 03 Feb 2026
https://github.com/mrprajapati18/100-days-of-code-data-science
100 Days of Code Challenge to learn Data Science from scratch! 📊🔍
anaconda-navigator data-analysis data-science data-visualization machine-learning-algorithms pyhton-library python-3
Last synced: 18 Apr 2026
https://github.com/pkjjoshi/behind-the-menu-uncovering-insights-from-restaurant-data
Discover hidden patterns in dining data — from popular cuisine pairings to geographic restaurant clusters
data-analysis data-visualization insights jupyter-notebook pandas python restaurant-data
Last synced: 05 Jul 2025
https://github.com/devexpress-examples/wpf-pivot-grid-connect-to-an-olap-datasource
This example shows how to specify connection settings to the server and create fields that relate to specific measures and dimensions of the cube for the Pivot Grid for WPF.
data-analysis dotnet dxpivotgrid pivot-grid pivot-grid-for-wpf wpf xpf
Last synced: 06 May 2026
https://github.com/steviecurran/dashboards
Compilation of Links to the dashboards in the other repositories
dashboard data-analysis data-science data-visualization pandas powerbi python-dash tableau
Last synced: 21 Feb 2026
https://github.com/rupashi03/fitbit-user-eda-case-study
Performed Exploratory Data Analysis (EDA) on Fitbit users' data to uncover trends in activity and health metrics.
business-analysis case-study consumer-insights data-analysis exploratory-data-analysis health-data r user-behavior-analytics
Last synced: 25 Mar 2025
https://github.com/ravi-prakash1907/covid-19-china
A data science research project to understand the growth rate of the novel coronavirus.
china coronavirus covid-19 data-analysis data-mining data-science mathematical-modelling project r research research-paper
Last synced: 06 Sep 2025
https://github.com/chiragkumargohil/co2-emissions-data-analysis
A Python programme that analyses CO2 emission data from 1997 to 2010. The programme prints data, provides a brief summary of a given year, displays and compares Year vs. Emission graphs for chosen countries, and generates a separate data file for chosen countries. It was a self-paced project provided by Guru99.
co2-emission data-analysis matplotlib python
Last synced: 28 Aug 2025
https://github.com/busradeveci/student-performance-prediction
A machine learning project to predict student exam performance based on academic, social, and personal features. Built with Python and scikit-learn.
data-analysis kaggle linear-regression machine-learning predictive-modeling python scikit-learn student-performance
Last synced: 25 Apr 2025
https://github.com/anushkundu/student-performance-analysis
Exploring Student Performance Factors
classification-algorithm clustering-algorithm data-analysis data-science exploratory-data-analysis machine-learning matplotlib numpy pandas python seaborn
Last synced: 13 Apr 2026
https://github.com/giyanellow/time-series-analysis-on-philippine-debt-and-inflation
A Time Series Analysis on the Philippine Inflation Rate with some predictions using RandomForest.
data-analysis data-analysis-python machine-learning python random-forest
Last synced: 18 Mar 2026
https://github.com/ray-chew/pycsam
pyCSAM is a robust approach for approximating geodesic subgrid-scale orographic spectra with applications to weather forecasting and broader data analysis
data-analysis gmted icon-model merit-dem orographic spectral-analysis topography weather-forecast
Last synced: 28 Feb 2025
https://github.com/pinedah/sleep-data-analysis-exercise
Analysis of a medical dataset on sleep, exploring duration, quality, and related factors. Includes data cleaning, EDA, and visualizations with Python (pandas, numpy, matplotlib, seaborn, scipy).
data-analysis data-science escom numpy pandas python school-project scipy
Last synced: 13 Apr 2026
https://github.com/fer-aguirre/covid19-venezuela
Data analysis of COVID-19 deaths in Venezuela
covid-19 data-analysis dataviz line-chart
Last synced: 09 Apr 2025
https://github.com/vatshayan/pokemon-analysis
Visualization, analysis, and machine-learning prediction of Pokemon power, attack, and speed, with an assessment of prediction accuracy
artificial-intelligence data data-analysis data-science data-visualization dataset machine-learning machine-learning-algorithms pokemon scikit-learn
Last synced: 30 May 2026
https://github.com/damiieibikun/web-scrapping-and-python-data-visualization-on-top-500-movies-imdb
Web scraping and Python data visualization of the top 500 movies on IMDb
beautifulsoup4 data-analysis data-visualization matplotlib-pyplot numpy pandas plotly-express python requests seaborn web-scraping
Last synced: 13 Apr 2026
https://github.com/abhisek-13/whatsapp-chat-analyzer
The WhatsApp Chat Analyzer is a data analysis project that provides insights into WhatsApp chats. It analyzes chat data to show metrics like the number of lines, most used letter, chatting duration, media files shared, most used emojis, and group member activity. The results are displayed on a user-friendly dashboard built with Streamlit.
data-analysis data-mining data-visualization eda machine-learning machine-learning-algorithms matplotlib numpy pandas python seaborn sklearn
Last synced: 13 Apr 2026
https://github.com/isaqueiros/newspapersales-predictions-linearregression_and_regularisation
This notebook is a study of newspaper sales at a local stand, with the intention of predicting sales performance based on the available features. For this, 4 sklearn models are applied: Linear Regression, Lasso Regression, Ridge Regression and Elastic Net Regression.
data-analysis data-science linear-regression machine-learning python regularization-methods sklearn-library sklearn-linear-regression
Last synced: 02 May 2026
https://github.com/1adityakadam/carnegie_classifications_website
A comprehensive data analytics platform analyzing 50+ years of U.S. higher education trends through interactive visualizations and historical institution tracking.
css data-analysis html javascript python ui-design web-development
Last synced: 13 Apr 2026
https://github.com/grandechowhiskey/fcc-data_analysis-projects
A collection of projects completed as part of the FreeCodeCamp "Data Analysis with Python" certification. These projects cover statistical calculations, data visualization, and trend analysis using real-world datasets.
data-analysis data-visualization matplotlib pandas python3 scikit-learn seaborn
Last synced: 01 May 2026
https://github.com/nature40/casestudies
Case studies for testing the functionality of database systems, sensors, etc
casestudies data-analysis data-visualization database
Last synced: 02 May 2026
https://github.com/amoghkori/working-with-apache-spark-mllib
Implemented Apache Spark MLLib to analyze a large car dataset, predict car selling prices, and gain insights into the car market.
amazon-web-services data-analysis data-visualization exploratory-data-analysis linear-regression machine-learning model-selection pyspark python random-forest sagemaker spark
Last synced: 13 Apr 2026
https://github.com/srinibas-masanta/olympics-data-analysis
The Olympics Analysis project explores Olympic data to uncover trends in athlete performance, medal distribution, and participation across countries and demographics. By leveraging detailed datasets, it provides insights into the evolution of the Games, highlighting key patterns and disparities over time.
data-analysis data-science data-visualization olympics olympics-visualization
Last synced: 02 Apr 2025
https://github.com/dcs-training/data-wrangling-and-vis-pandas
Introduction to analyzing structured data with the Python libraries pandas, for CSV and TSV data, and ElementTree, for XML data.
data-analysis data-visualisation data-wrangling python
Last synced: 16 Jun 2026
https://github.com/ashleydavis/brisjs-data-analysis-talk
Code for my talk to BrisJS on data analysis in JavaScript
charting data-analysis data-visualization data-viz javascript node node-js nodejs visualization
Last synced: 25 Mar 2025
https://github.com/aimin-nur/visualisasi_bikestore
Data Analyst - Dashboard Bike Store
data-analysis sql visualization
Last synced: 29 Jan 2026
https://github.com/deypadma2020/sql_project
✏️ A collection of practical SQL case studies and solutions exploring real-world business scenarios: car showroom analysis, esports tournament, customer insights, finance analysis, pricing strategy, and marketing analytics.
business-intelligence case-study data-analysis database mysql queries sql
Last synced: 30 May 2026
https://github.com/abhijeet107/final-project
Final project summary: internship projects (2 weeks)
data-analysis data-cleaning-and-preprocessing excel mysql-database python tableau-public
Last synced: 23 Feb 2026
https://github.com/darrenjolson/pba-analysis-app
Data analysis and visualization tool for professional bowling tournaments, predicting performance across different oil patterns and venues.
bowling data-analysis data-visualization flask pba predictive-analytics python reactjs sports-analytics
Last synced: 13 Apr 2026
https://github.com/abeltavares/online_retail_pyspark_analysis
PySpark data analysis of the Online Retail Data Set
business-intelligence churn-analysis customer-segmentation data-analysis data-visualization jupyter-notebook machine-learning market-basket-analysis online-retail product-affinity-analysis pyspark
Last synced: 12 May 2026