Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-07-03 00:07:37 UTC
https://github.com/pronzzz/diabetes-prediction
Diabetes prediction using a KNN model and Pima Indian Diabetes Dataset
data-analysis data-manipulation data-preprocessing data-visualization knn machine-learning outlier-detection seaborn
Last synced: 13 Apr 2025
https://github.com/jelhamm/model-ensembles-boosting-in-machine-learning
"This repository contains implementations of boosting methods, a popular technique in model ensembles, aimed at improving predictive performance by combining multiple models, using the Titanic dataset."
boosting boosting-algorithms boosting-ensemble boosting-machine data-analysis database-analysis datamining datamining-algorithms jupyter-notebook machine-learning machine-learning-models machine-learning-projects matplotlib-python model-ensemble module numpy-library pandas-library python sklearn-library
Last synced: 16 May 2026
https://github.com/hadarsharon/grizzlys
User-friendly Python DataFrames 🔵🟡 powered by Julia 🔴🟢🟣
big-data data data-analysis data-engineering data-frame data-frames data-science dataframe dataframe-library dataframes dataframes-jl julia python
Last synced: 18 May 2026
https://github.com/ireneflorez/exploration_r
Data exploration on the 'White Wine Quality' dataset using R
data-analysis data-visualization r
Last synced: 16 Jun 2026
https://github.com/jelhamm/singular-value-decomposition-data-mining
"This repository hosts an implementation of the Singular Value Decomposition (SVD) algorithm tailored for data mining tasks. SVD is utilized for efficient dimensionality reduction, aiding in the extraction of key patterns and features from large and complex datasets."
data-analysis dimension-reduction jyputer-notebook machine-learning matplotlib numpy-library pandas-library preprocessing python scipy-library singular-value-decomposition sklearn-library standardscaler svd svd-matrix-factorisation
Last synced: 18 May 2026
https://github.com/dineshdhamodharan24/singapore_flat_resale_
This project focuses on developing a machine learning model to predict the resale values of apartments in Singapore. The goal is to create a user-friendly online application that enables users to obtain accurate predictions for the resale values of specific properties.
data-analysis flat json numpy pandas pickle project python streamlit
Last synced: 07 Apr 2026
https://github.com/chandkund/loan-eligibility-prediction
This project is designed to predict the eligibility of loan applicants based on various factors such as income, credit history, and marital status. By analyzing historical loan application data, the model helps to determine whether a loan application should be approved or not.
data-analysis data-science data-visualization machine-learning-algorithms matplotlib numpy pandas python seaborn
Last synced: 09 Apr 2026
https://github.com/eesunmoon/spam_review_detection
[Project] Capstone Design - Spam Detection
crawler-python data-analysis konlpy natural-language-processing python sorting-algorithms spam-detection
Last synced: 12 Oct 2025
https://github.com/dinamohsin/ai-job-market-analysis-using-sql-excel
This project explores a dataset of AI-related jobs to uncover insights about salary trends, in-demand skills, education levels, and remote work preferences. The analysis was done using SQL for querying and Excel for data cleaning and preparation.
data-analysis data-preprocessing excel functions query sql sql-server
Last synced: 25 Jun 2025
https://github.com/gappeah/credit-card-transactions-fraud-detection-project
The Credit Card Transactions Fraud Detection Project repository is designed to analyse and detect fraudulent transactions in credit card data.
Last synced: 12 Jul 2025
https://github.com/vbhvsingh0/nflteam_corr_population
The goal of this project is to find the correlation between NFL teams' win-loss records and the population of their cities.
data-analysis data-cleaning-and-preprocessing data-manipulation-with-pandas numpy-library pandas-python pearson-correlation python3
Last synced: 29 Jun 2026
https://github.com/farzeennimran/fashion-mnist-dataset-classification-using-neural-network
Implementation of a Multi-layer Perceptron classifier with hyperparameter tuning and k-fold cross-validation employing GridSearchCV for classifying images on the Fashion MNIST dataset 👗👚👖
artificial-intelligence data-analysis data-mining data-science dataset deep-learning fashion-mnist-dataset gridsearchcv hyperparameter-tuning kfold-cross-validation machine-learning multilayer-perceptron-network neural-network numpy pandas python sklearn
Last synced: 03 Apr 2026
https://github.com/tathithienthanh/womenfashionproductrecommendationsystem
Build a recommendation system for women's fashion products on e-commerce platforms
content-based data-analysis data-collection data-processing data-visualization dfd e-commerce erd jupyter-notebook lazada nlp python recommender-system scraping-websites sql system-design tiki vietnamese visualization
Last synced: 16 May 2026
https://github.com/nuccitheboss/jespipe-plugin
Your go-to spot for creating and using Jespipe plugins.
adversarial-attacks data-analysis data-manipulation data-visualization machine-learning machine-learning-algorithms
Last synced: 23 Jun 2025
https://github.com/shubhammittal-data/sales-customer_dashboard_tableau
An interactive Tableau project showcasing advanced data visualization techniques for sales performance and customer analytics. This dashboard provides key business insights using KPIs, trend analysis, and customer segmentation. Designed for executives, sales managers, and marketing teams to drive data-driven decision-making.
customer-behavior-analysis customer-segmentation data-analysis data-visualization product-analytics sales-analysis tableau tableau-dashboards tableau-public
Last synced: 07 Mar 2026
https://github.com/neeraj08823/bellabeat_case-study
HOW CAN A WELLNESS COMPANY PLAY IT SMART?
data-analysis data-cleaning data-visualization r rmarkdown rstudio tableau tableau-public
Last synced: 25 Jun 2025
https://github.com/jlee9503/defense-risk-prediction
Build a machine learning pipeline that ingests defense procurement data, identifies high-risk contracts, and visualizes the results in an interactive dashboard.
data-analysis data-visualization exploratory-data-analysis python
Last synced: 25 Jan 2026
https://github.com/jwt218/sinc
MATLAB Standardization and Isotope Normalization for CSIA (with integrated correction and uncertainty quantification)
data-analysis geochemistry isotopes matlab
Last synced: 23 Jun 2025
https://github.com/jofaval/boston-housing
Regression analysis of in-demand Boston housing prices in 1978
boston-housing data-analysis data-science data-visualization machine-learning python regression
Last synced: 16 May 2026
https://github.com/harmanveer-2546/motor-vehicle-accidents-in-india
As per the report, a total of 4,61,312 road accidents have been reported by States and Union Territories (UTs) during the calendar year 2022, which claimed 1,68,491 lives and caused injuries to 4,43,366 persons.
accidents accidents-analysis darkgrid data-analysis eda exploratory-data-analysis indian-roads inline matplotlib motor-vehicles numpy pandas review seaborn visualization
Last synced: 19 Jan 2026
https://github.com/mrendiks/analyst-data-survey-monkey
Learn how to analyze data from a SurveyMonkey dataset using Excel and Python
data-analysis ipynb-jupyter-notebook python
Last synced: 07 Mar 2026
https://github.com/mituskillologies/aiml-dypiemr-sep24
Programs conducted at DYPIEMR, Pune in training on AIML during September 2024.
artificial-intelligence data-analysis data-science machine-learning matplotlib neural-network numpy pandas python3
Last synced: 05 Apr 2025
https://github.com/chahelgupta/fitness-data-analysis-r-project
This project focuses on analyzing fitness data collected from various tracking devices to gain insights into users' activity levels, sleep patterns, calorie expenditure, and heart rate. The dataset used in this project consists of multiple CSV files, each containing different aspects of fitness-related data.
data-analysis data-cleaning data-exploration data-science data-visualization r r-language r-programming r-studio
Last synced: 18 May 2026
https://github.com/nikbarb810/motif_detection_in_r
Motif Detection for TFBS in Glycolysis and Glyconeogenesis pathways
bioinformatics data-analysis null-hypothesis pwm r
Last synced: 23 Jun 2025
https://github.com/jonathancaleb/adap
📊🌱 Agricultural Data Analysis Platform 🌍🚜 A personal initiative to analyze coffee growth trends in Uganda using Python, data science, and machine learning. This project supports sustainable farming with predictive models and interactive visualizations. 🍃📈
data-analysis data-science python
Last synced: 18 May 2026
https://github.com/rociobenitez/airbnb-data-mining
Detailed analysis and predictive modeling of accommodations in Madrid using Big Data techniques and statistics in R, focused on data optimization and prediction of property characteristics.
airbnb data-analysis data-mining estadistica prediction-model predictive-analytics predictive-modeling qmd r rstudio
Last synced: 23 Jun 2025
https://github.com/Fisseha-Estifanos/telecom
A showcase repository for a specific telecommunication company. Used to analyze several telecommunication data set features and generate useful insights accordingly. The insights generated can be seen at https://github.com/Fisseha-Estifanos/telecom-visualizer or at https://fisseha-estifanos-telecom-visualizer-home-huxgy0.streamlitapp.com/
data-analysis notebooks-jupyter python visual-studio-code visualization
Last synced: 11 Mar 2025
https://github.com/majajuri/text-classification-using-string-kernels
Project for the course Introduction to Data Science
Last synced: 05 Apr 2025
https://github.com/ashvinhandoo/bionic-lab-projects
Computational neurophysiology pipelines for analyzing astrocyte and vascular dynamics. Includes Python- and MATLAB-based analysis frameworks for modeling calcium, vasomotion, and pupil-linked activity, demonstrating advanced signal processing, transfer entropy estimation, and data visualization skills used in biomedical research.
biocomputation bioinformatics biomedical-engineering computational-biology data-analysis matlab neuroscience python signal-processing time-series
Last synced: 18 May 2026
https://github.com/alvarezekiel19/movie-data-analysis
A Data Science elective activity
data-analysis data-science data-visualization jupyter-notebook python python3
Last synced: 18 May 2026
https://github.com/phanchenh/datacosupplychain_sqlproject
Supply Chain Optimization – Tackling Delivery Delays and Profitability Challenges (2015-2017)
business-analytics business-intelligence data-analysis insights jupyter-notebook mssql mssqlserver python supply-chain supply-chain-analytics supply-chain-optimization
Last synced: 09 Mar 2026
https://github.com/martachesnova/python-apis
A weather analysis that randomly selects more than 500 cities across the globe and pulls data from the OpenWeatherMap API for each city. The analysis of the weather and the ideal vacation spot is viewable in my Jupyter Notebook.
Last synced: 24 Feb 2025
https://github.com/martachesnova/python
Created a Python script to calculate and analyze financial records of a company. Created another Python script to do calculations and analysis of the voting process in a small town.
Last synced: 24 Apr 2026
https://github.com/shrutiijoshi/corporate-campus-hiring-analysis
This project analyzes corporate campus hiring trends for fresh graduates in India.
dashboard data-analysis data-visualization excel powerbi
Last synced: 09 Mar 2026
https://github.com/jpcadena/pharmacy-prices-prediction
Prices prediction project for Pharmacy products.
artificial-intelligence data-analysis data-science deep-learning keras machine-learning machine-learning-models neural-network numpy pandas pharmacy prediction price-prediction pylint python scikit-learn supervised-learning tensorflow
Last synced: 07 Apr 2026
https://github.com/data-edd/e-commercestore_analysis
This project analyzes e-commerce data to provide insights into sales performance, profitability, and customer behavior using Power BI.
data-analysis powerbi powerbidashboard
Last synced: 02 Feb 2026
https://github.com/lparham2/factors-driving-ev-adoption-charging-station-deployment
This project explores factors driving EV adoption and charging station deployment using Python-based data analysis. It examines sales trends, infrastructure growth, and socioeconomic influences to uncover key insights. The goal is to aid policymakers and businesses in optimizing EV infrastructure and accelerating sustainable transportation.
data-analysis data-visualization electric-vehicle-charging-station electric-vehicles powerpoint-presentations python
Last synced: 18 May 2026
https://github.com/SebastianUrdaneguiBisalaya/diseases-fissal-peru
Holistic analysis of care provided for rare and orphan diseases and transplants covered by FISSAL in Peru.
data-analysis data-visualization python
Last synced: 04 Jul 2026
https://github.com/antononcube/wl-mosaicplot-paclet
Wolfram Language (aka Mathematica) paclet for mosaic plots over datasets or lists of records.
data-analysis machine-learning mosaic mosaic-plots
Last synced: 16 Jan 2026
https://github.com/shubh-bharadwaj/zomato-dataset-analysis
Zomato Dataset Analysis
data-analysis data-science data-visualization numpy pandas python sklearn
Last synced: 07 Apr 2026
https://github.com/jayita11/eda-student-exam-performance
This project performs Exploratory Data Analysis (EDA) and hypothesis testing on student performance data. It explores trends based on attributes like gender, race/ethnicity, parental education, lunch type, and test preparation course completion.
data-analysis eda hypothesis-testing matplotlib pandas python seaborn statsmodels student-performance-analysis
Last synced: 11 Jul 2025
https://github.com/karaniwachira/baby_names_analysis
Data Analysis: Baby Names Exploration
data data-analysis quarto quartopub r rstats tidyverse-ggplot2
Last synced: 22 Jun 2025
https://github.com/gui-sitton/games
Identify patterns that determine whether a game is successful or not. This will allow you to identify potential big winners and plan advertising campaigns.
data data-analysis data-analysis-python data-science data-visualization python
Last synced: 18 May 2026
https://github.com/wikidata/purdue-data-mine-2024
Program materials for WMDE's 2024 Purdue Data Mine project
analytics data-analysis data-quality data-science etl open-data python wikidata wikimedia
Last synced: 12 May 2025
https://github.com/elissorokin/data-analyst-portfolio
This is a repository where I showcase my skills, share projects, and track my progress in data analysis and Data Science.
ab-testing data data-analysis datalense matplotlib numpy pandas plotly portfolio postgresql python scipy seaborn sql statistical-analysis
Last synced: 09 Apr 2026
https://github.com/xjwllmsx/profitable-app-profiles
Analyzes Google Play & App Store data to recommend profitable profiles for free, ad-supported mobile apps
data data-analysis data-cleaning jupyter pandas python
Last synced: 18 May 2026
https://github.com/pyramidheadshark/ai-mirea-sem1p
Completed set of all MIREA AI and DA practices (1 sem.)
beginner-friendly data-analysis data-science jupyter mirea
Last synced: 05 Apr 2025
https://github.com/balajimohan18/loan-clustering-datascience-project
This project uses machine learning to cluster loans based on their similarities. It uses a dataset of loan applications that includes the loan amount and balance, then applies a clustering algorithm to group similar loans together.
clustering-algorithm data-analysis data-science data-visualization eda kmeans-clustering machine-learning sql unsupervised-learning
Last synced: 27 Jul 2025
https://github.com/acerbilab/svbmc
Stacking Variational Bayesian Monte Carlo (S-VBMC) algorithm for combining Variational Bayesian Monte Carlo (VBMC) posteriors to boost inference performance.
bayesian-inference data-analysis machine-learning model-fitting python stacking variational-inference
Last synced: 20 Jan 2026
https://github.com/quocduyenanhnguyen/yelp-analysis
Yelp data analysis of business rating, categories, any trends/patterns, correlation, etc.
csv-to-database data-analysis data-analytics data-visualization database json json-parsing json-to-csv mysql mysql-database mysql-workbench pycharm python python3 restaurant sql tableau tableau-dashboards tableau-public yelp-dataset
Last synced: 27 Jan 2026
https://github.com/vedantshi/tableau-bike-data-dashboard
London Bike Rides Analysis explores bike usage patterns using data visualization and machine learning. It identifies trends through a dynamic moving average, analyzes weather impact with heatmaps, and provides actionable insights via an interactive Tableau dashboard. Tools: Python, Tableau.
data-analysis data-visualization python tableau weather-data
Last synced: 16 May 2026
https://github.com/shubhamprajapati7748/end-to-end-house-price-prediction
A machine learning model that accurately predicts housing prices using the Boston Housing dataset by analyzing various house features, and it utilizes a CatBoost model to assist potential buyers or sellers in estimating housing prices.
boston-housing-price-prediction data-analysis data-science-projects machine-learning regression regression-models
Last synced: 30 Oct 2025
https://github.com/ddihora1604/social_media_analysis
A powerful, interactive dashboard for analyzing social media conversations, trends, and network dynamics. This tool allows researchers and analysts to explore patterns in social media data, identify key trends, and detect coordinated behavior.
aiml css data-analysis data-visualization html javascript python
Last synced: 01 Jul 2026
https://github.com/maxbiostat/diehl_ebola_cell_2016
Supplementary code and data for Diehl et al., 2016 (Cell)
data-analysis data-visualization disease-spread ebola mutation
Last synced: 11 Jul 2025
https://github.com/purushothamadluru/atlantic-gdp-job-demand-analysis
data-analysis data-visualization powerbi
Last synced: 17 Feb 2026
https://github.com/i-e-b/dynamictimewarp
A quick C# implementation of https://jeremykun.com/2012/07/25/dynamic-time-warping/
data-analysis pattern-matching working
Last synced: 17 Aug 2025
https://github.com/alex-petrov-git/petrowiki
My wiki-pages
acoustics aeroacoustics aeromechanics algebra analysis data-analysis fourier-analysis hydrodynamics linear-algebra math ml obsidian physics probability-theory statistics wiki wikipedia
Last synced: 02 Mar 2025
https://github.com/olympus-terminal/data-processing
Data analysis and processing tools
automation data-analysis data-processing data-science etl machine-learning pdf-extraction python r research statistics web-scraping
Last synced: 16 May 2026
https://github.com/adriangalvanzamora/ecommerce-analytics-olist
Data analysis project based on the Olist Brazilian E-Commerce dataset. Includes data cleaning, exploratory analysis, delivery performance metrics, customer satisfaction modeling, and geospatial insights. Built entirely in Python (Jupyter Notebook) using real-world data from Kaggle.
brazil customer-satisfaction data-analysis data-visualization ecommerce folium geospatial-analysis machine-learning matplotlib notebook pandas plotly python seaborn
Last synced: 06 May 2026
https://github.com/sakan811/gachascope
Evaluate the cost-effectiveness of various in-app purchase bundles available in gacha games.
data data-analysis data-visualization game honkai honkai-star-rail honkai-starrail hoyoverse javascript nextjs tableau tableau-public typescript wutheringwaves
Last synced: 04 May 2026
https://github.com/adnanrahin/nlp-with-disaster-tweets
Kaggle Competition: Predict which Tweets are about real disasters and which ones are not. Natural Language Processing.
data-analysis data-science data-visualization kaggle-competition machine-learning natural-language-processing regular-expression tweets
Last synced: 21 Jun 2025
https://github.com/drisskhattabi6/meteo-data-mining
This repo uses data mining techniques to analyze meteorological (meteo) data. The objective is to extract meaningful insights and patterns from the data that can aid in understanding weather phenomena and predicting future weather conditions.
cart data-analysis data-mining data-visualization decision-making decision-tree extract-data extract-insights insights-analytics insights-data k-means knn machine-learning svm
Last synced: 21 Mar 2025
https://github.com/andremenezesds/pa004_health_insurance
Health Insurance Cross-Sell (Learning to Rank Machine Learning Project)
backend backend-api data-analysis data-science data-visualization dataviz lgbm machine-learning matplotlib numpy optuna pandas python scikit-learn shell-script sql webapi xgboost
Last synced: 09 Apr 2026
https://github.com/pkjjoshi/restaurants-analysis
Performed beginner-level EDA on a restaurant dataset using Python. Analyzed top cuisines, city-wise ratings, price ranges, and online delivery impact using Pandas and Matplotlib. Includes 4 well-structured notebooks with visual insights.
beginner-project data-analysis data-visualization exploratory-data-analysis jupyter-notebook pandas python restaurant-data seaborn
Last synced: 21 Jun 2025
https://github.com/teditae/data-analysis-with-pandas
Mini data science projects focused on Pandas-powered analysis.
data-analysis data-manipulation pandas python
Last synced: 30 Apr 2026
https://github.com/atharvkadammm/suicide-prediction-system
A machine learning project predicting suicide risk based on multiple socio-economic and environmental factors using data mining techniques.
csv data-analysis data-science data-visualization datamining exploratory-data-analysis feature-engineering machine-learnin matplotlib mental-health numpy pandas riskassesment seaborn sklearn suicide-prediction supervised-
Last synced: 01 Jul 2025
https://github.com/bhaveshbhakta/wine-quality-prediction-using-ml
Wine Quality Prediction
data-analysis data-visualization machine-learning ml random-forest wine-quality-prediction
Last synced: 07 Aug 2025
https://github.com/atharvkadammm/calmlytic
An end-to-end machine learning project that predicts anxiety severity using classification models (Naive Bayes, Decision Tree, SVM, Logistic Regression, XGBoost), based on lifestyle, health, and behavioral features.
anxiety-prediction classification csv data-analysis data-preprocessing-and-cleaning data-science data-visualization ensemble-learning logistic-regression machine-learning-algorithms matplotlib mental-health numpy pandas python sci-kit-learn seaborn supervised-learning svm xgboost
Last synced: 21 Jun 2025
https://github.com/rezowanrahat/netflix_analysis
Data analysis of Netflix content using Python, Pandas, and Seaborn
data-analysis data-visualization netflix pandas python
Last synced: 07 May 2026
https://github.com/liebsen/overlemon
Overlemon institutional application
data-analysis design devops sysadmin webdev
Last synced: 21 Jul 2025
https://github.com/capjamesg/personal-notebooks
Notebooks for personal experiments with machine learning and computer vision.
data-analysis machine-learning notebooks
Last synced: 03 Apr 2025
https://github.com/bamresearch/utah-saxs-tools
The Utah SAXS Tools (USToo), adapted for Python 3, originally by David P. Goldenberg, 2009-2012
data-analysis saxs small-angle-scattering small-angle-xray-scattering
Last synced: 17 Jan 2026
https://github.com/theashishmavii/job-trends-analyzer-automation
End-to-end automation: job scraping, data analysis, and trends reporting for job seekers and researchers.
automation beautifulsoup data-analysis open-source pandas python selenium webscraping
Last synced: 07 Aug 2025
https://github.com/kushagrakumar04/visual-age-distribution
A Bar chart or histogram to visually depict the distribution of a categorical or continuous variable, such as the age distribution or gender composition within a population. This graphical representation provides a clear and insightful overview of the data's patterns and trends.
data-analysis data-science google-colab
Last synced: 21 Jun 2025
https://github.com/jpcadena/malware-analysis
Analysis of malware signatures and their associated Common Vulnerabilities and Exposures (CVEs)
black common-vulnerabilities-and-exposures cve-search data-analysis data-engineering data-reporting data-visualization isort malware-analysis matplotlib mypy numpy pandas plotly poetry pre-commit pydantic python ruff seaborn
Last synced: 03 Mar 2026
https://github.com/alpkanoz/ibm_data_science_professional_certificate
The repository contains projects and training materials carried out throughout the IBM data science professional course.
classification clustering data-analysis data-science data-visualization dataframe ibm ibm-watson machine-learning mathplotlib pandas predictive-modeling python scikit-learn
Last synced: 07 Mar 2026
https://github.com/lavkalsi/tableau-project-stock-market-analysis
The Tableau Project: Stock Market Analysis features a dashboard that combines Descriptive, Diagnostic, Predictive, and Prescriptive analytics to provide insights into stock market trends. Using Python for data processing and an LSTM model for forecasting, this project visualizes historical and predicted stock prices, helping users make informed decisions.
dashboard data-analysis deep-learning lstm-model python tableau
Last synced: 18 May 2026
https://github.com/estherslabbert/data-exploration
Data analysis and data visualizations for different data sets
data data-analysis data-science data-visualization jupyter-notebook titanic-dataset usa-arrests-dataset
Last synced: 06 Apr 2025
https://github.com/caprogs/paris-events-analyzer
A project to analyze events in Paris using open source data provided by the city.
data data-analysis data-platform dbt docker ingestion python streamlit transformation vizualisation
Last synced: 04 May 2026
https://github.com/riciokzz/mental-health-in-tech-analysis
Analysis of the Mental Health in the Tech Industry.
data-analysis data-engineering data-science exploratory-data-analysis
Last synced: 21 Jul 2025
https://github.com/maprihoda/learning-spark
apache-spark data-analysis data-science data-wrangling machine-learning pyspark python
Last synced: 19 May 2026
https://github.com/rathod-shubham/google-data-analytics
Learning a wide range of skills that are useful in everyday life as well as being a data analyst.
data-analysis data-analysis-in-r data-analyst data-analyst-nanodegree data-analytics data-visualization google
Last synced: 03 Feb 2026
https://github.com/adikahnf/Data-analysis-with-Python
data-analysis numpy pandas python streamlit
Last synced: 31 Dec 2025
https://github.com/dsrodrigovieira/rossmannsales
This repository contains a project developed to practice data analysis and the application of regression models (supervised learning)
data-analysis data-science machine-learning python telegram-bot xgboost-regression
Last synced: 19 May 2026
https://github.com/jgohel9902/toronto-airbnb-snowflake
This project analyzes Airbnb listings in Toronto using Snowflake’s cloud data platform. It follows a Bronze → Silver → Gold medallion architecture and leverages Snowflake Cortex to generate AI-driven executive insights.
data-analysis python snowflake sql
Last synced: 07 Mar 2026
https://github.com/kevin-rsj/the-substance-sentiment-analysis
Analyzes Reddit users' comments on the film The Substance (2024) using NLP techniques. The average sentiment score obtained was 0.19, and keywords such as "horror" and "like" stand out among the opinions.
data-analysis notebook python sentiment-analysis tableau visualization
Last synced: 19 May 2026
https://github.com/kianaasd93/faostat
Build a multilayer perceptron model that can be used for forecasting the export value of crop products for a geographical region three years into the future
agriculture data-analysis data-science faostat machine-learning ml multiplayer python rnn
Last synced: 19 May 2026
https://github.com/fortunewalla/birdstrikes
Birdstrikes database created for PostgreSQL with simple sample queries
birdstrikes csv data-analysis data-science database dataset pgsql postgresql practice sample sql sql-query workshop
Last synced: 02 Oct 2025
https://github.com/marcogdepinto/olympichistoryanalysis
Python visual analysis of the Olympic Games history. Kaggle gold medal with 15000+ views, 200+ upvotes and 100+ comments.
data-analysis data-science jupyter-notebook olympic-games python seaborn
Last synced: 29 Apr 2026
https://github.com/marlysson/craw
A system to show the data collected from various sources using chartjs - ⚡️
chartsjs data-analysis data-science web-scraping
Last synced: 21 Jun 2025
https://github.com/ujjwalll/econometrics_analysis_of_india_gdp_misestimation
An econometric analysis of India's GDP to determine whether there is any flaw in India's GDP estimates, as argued by Dr. Arvind Subramanian.
coefficient-estimates data-analysis econometrics economics gdp india r statistics
Last synced: 04 Jul 2026
https://github.com/shrunga92/5g_qos_data_transformation_python
Resource Allocation in 5G Network Service
Last synced: 19 May 2026
https://github.com/bho0920/crime-data-analysis-eu
Crime Data Analysis for Self-Defense Tool Market Entry in the EU.
data data-analysis sql sqlite tableau
Last synced: 21 Jun 2025
https://github.com/jesusgomez-data/retail-sales-data-analysis
End-to-end retail sales data analysis project using SQL, SQLite and Python (Pandas). Includes data generation, KPIs and business insights.
data-analysis junior-data-analyst pandas portfolio-project python retail-analysis sql sqlite sqlite3
Last synced: 11 Apr 2026
https://github.com/saidabderrahmane/bus_line_supervision
Performance evaluation of the Saint-Sébastien bus line using real data to predict the number of passengers.
beautifulsoup4 data-analysis data-science deep-learning machine-learning python scraper sklearn
Last synced: 11 Apr 2026
https://github.com/jatin-s16/netflix_analysis
This project involves a comprehensive analysis of Netflix's movies and TV shows data using SQL. The goal is to extract valuable insights and answer various business questions based on the dataset. The following README provides a detailed account of the project's objectives, business problems, solutions, findings, and conclusions.
data-analysis excel postgresql sql
Last synced: 19 May 2026
https://github.com/rosa-lpz/data-analysis-handbook
Data Analysis base knowledge and practical applications
data data-analysis data-visualization database dax documentation power-bi python r sql tableau tableau-public
Last synced: 06 Apr 2026