Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-23 00:07:29 UTC
- JSON Representation
https://github.com/mr-chang95/datascience_airbnb
Data Science Project for Udacity's Data Scientist Program. Using Python in Jupyter Notebook.
airbnb data-analysis data-science data-visualization jupyter-notebook numpy pandas python sklearn
Last synced: 08 Apr 2026
https://github.com/devexpress-examples/winforms-pivot-change-the-field-value-header-appearance-backcolor
This example handles the CustomDrawFieldValue event to fill the header's color.
data-analysis dotnet pivot-grid-for-winforms winforms xtrapivotgrid-suite
Last synced: 07 May 2026
https://github.com/auliannee/new-york-uber-pickups-analysis
This repository contains the projects related to data collecting, quality check, manipulation, analyzing, and visualizations.
data-analysis data-science ipython-notebook jupyter-notebook python
Last synced: 07 Feb 2026
https://github.com/amishidesai04/flipkart-mobile-sales-analysis
Flipkart Mobile Sales Analysis is a Tableau project that visualizes mobile sales data from Flipkart. It highlights trends in brand performance, pricing, ratings, and customer preferences. The interactive dashboard helps users explore key insights for data-driven decisions in e-commerce and retail.
dashboard data-analysis data-visualization storyboard tableau
Last synced: 31 Jan 2026
https://github.com/karan-malik/r-basicda-2
Basic Data Analysis in R
data-analysis data-science r-language rprogramming
Last synced: 19 Jun 2026
https://github.com/malthejorgensen/repx
Python regular expression file transformer
command-line-tool data-analysis text-processing
Last synced: 31 Jan 2026
https://github.com/alex-pierron/ekip-enedis-genai
Repository for the team "Ekip" during the H-GenAI Hackathon 2025 organized at SIA Partners, Paris, France
amazon-nova artificial-intelligence aws aws-lambda data-analysis database generative-ai mistral nlp
Last synced: 15 Apr 2026
https://github.com/axsk/geekgraph
parse, cluster and visualize boardgamegeek.com user profiles
Last synced: 01 Feb 2026
https://github.com/chaedoll/teamproject-foreignerreport
국내 외국인 대상 인프라 개선을 위한 보고서 (Report on improving infrastructure for foreigners)
Last synced: 25 Feb 2025
https://github.com/tashi-2004/data-visualization-tableau-traffic-collision-insights
Analysis of traffic collision data using Tableau, featuring interactive visualizations that highlight trends in injuries and fatalities, contributing factors, and geographic distributions. It includes various sheets and dashboards, with recommendations for enhancing road safety. The dataset is available for further exploration.
data-analysis data-visualization eda geospatial-analysis machine-learning predictive-modeling statistics tableau traffic-analysis
Last synced: 19 Mar 2026
https://github.com/nagar2nd/jenson-usa-mysql-analysis
We are analyzing Jenson USA's dataset to gain valuable insights into customer behavior, staff performance, inventory management, and store operations. By crafting advanced SQL queries, the analysis explores key metrics such as product sales, customer spending, and order patterns, ultimately guiding strategic decision-making and operations.
data-analysis problem-solving sql
Last synced: 01 Feb 2026
https://github.com/ginga1402/car_price_prediction
Predict the price of a car using MS Excel.
college-project data-analysis excel linear-regression
Last synced: 30 Mar 2025
https://github.com/mmzong/gee_lifestyleeffectsonhypertension
Generalized Estimating Equations (GEE), Quasi-likelihood under the Independence Model Criterion (QIC), Longitudinal data, Embedded box plots within violin plots with hypertension risk categories, spaghetti plots, aggregate line plots, histograms, faceted-area plots, box and jitter plots. Investigating the impact of lifestyle on health.
aggregate-line-plot area-faceted-plots box-plots data-analysis data-manipulation data-science data-visualization generalized-estimating-equations histograms jitter-plots longitudinal-data qic quasi-likelihoods r spaghetti-plots violin-plots
Last synced: 29 Jul 2025
https://github.com/faizantkhan/automated-eda
This repository showcases tools for automatic Exploratory Data Analysis (EDA) in Python. These tools help you quickly understand your datasets and generate insightful reports.
automatic automation autoviz data-analysis data-analysis-python data-science data-visualization dtale dtale-library eda exploratory-data-analysis ml pandas pandas-profiling python python-library sweetviz
Last synced: 18 Apr 2026
https://github.com/saidulalimallick04/smart-traffic-violation-pattern-detector-dashboard
This project is a Streamlit web application designed to analyze traffic violation data. It provides a user-friendly interface to explore, visualize, and gain insights from traffic violation datasets. Users can upload their own data, perform analysis, and view summaries and trends.
dashboard data-analysis data-visualization internship-project pandas python smart-traffic streamlit
Last synced: 18 Apr 2026
https://github.com/devbigboy/excel-power-query-get-transform
Power Query is a feature in Excel that allows you to quickly import data from multiple sources and easily clean, transform, and reshape it to suit your needs.
data-analysis data-science excel
Last synced: 08 Feb 2026
https://github.com/sroman0/data-analytics
Data Analytics Exercises is a collection of comprehensive university-level exercises aimed at enhancing skills in data analytics. The repository includes practical notebooks covering data manipulation, exploratory data analysis (EDA), statistical analysis, data visualization, and machine learning fundamentals.
data-analysis data-analytics data-science data-visualization education exercises exploratory-data-analysis hands-on-practice jupyter-notebook machine-learning python statistics
Last synced: 15 Apr 2026
https://github.com/grindelfp/datasets-analysis
The Machine Learning and Data Analysis course task dedicated to training skills of data normalizing and preprocessing.
data-analysis datasets ipynb mlda
Last synced: 05 Mar 2026
https://github.com/siddhant2105s/airline-performance-analysis-dashboard
Enhancing Airline Performance Analysis for the Department of Transport
data-analysis data-visualization tableau
Last synced: 08 Feb 2026
https://github.com/djm158/learning-microsoft-r
Working through https://www.gitbook.com/book/smott/introduction-to-microsoft-r-server/details and creating samples
data-analysis data-science microsoft microsoft-sql-server r
Last synced: 15 Apr 2026
https://github.com/ddihora1604/advanced_business_analytics_on_world_bank_global_financial_inclusion_data_2021
Bridging the Gaps in Financial Inclusion: Understanding the Cash-Credit Paradox, Divide between Cash and Digital Payments, and Financial Resilience.
advanced-excel business-analytics data-analysis data-engineering data-mining data-visualization database exploratory-data-analysis machine-learning preprocessing-data python
Last synced: 07 May 2026
https://github.com/l0rd-inquisit0r/data-analytics
A repository of data analytics implementations in Python
ai data-analysis data-analysis-python data-analytics
Last synced: 18 Jun 2025
https://github.com/themihirmathur/uber-data-analytics
The goal of this project is to perform comprehensive data analytics on Uber trip data using a modern data engineering stack on Google Cloud Platform (GCP).
bigquery data-analysis data-engineering etl-pipeline google-cloud-platform looker python
Last synced: 09 Feb 2026
https://github.com/teja-1403/forage-tata-data-visualisation-empowering-business-with-effective-insights
This repository contains solutions to the 4 different tasks that must be performed during the Data Visualisation: Empowering Business with Effective Insights virtual internship provided by TATA via Forage.
analysis-and-reporting analytics analytics-and-decision-science charts communications dashboards data-analysis data-cleanup data-interpretation data-storytelling data-visualizations graph insights power-bi visual-basic visualizations
Last synced: 18 Feb 2026
https://github.com/27ahmad/amazon-sales-analysis
This repository contains an exploratory data analysis (EDA) and visualization project of Amazon sales data. The goal is to uncover insights and present key metrics through a Tableau dashboard.
data-analysis eda pandas python seaborn tableau
Last synced: 15 Apr 2026
https://github.com/habiburrahman-mu/data-wrangling
Data Wrangling is the process of converting data from the initial format to a format that may be better for analysis.
data-analysis data-mining data-science
Last synced: 21 May 2026
https://github.com/rajeev2806/netflix-data-analysis
In this project i have implemented ETL . I used netflix dataset to clean and analyze using postgresql and python
data-analysis data-cleaning postgresql python
Last synced: 15 Apr 2026
https://github.com/mathusanm6/critics-vs-players-analysis
This data analysis examines the relationship between critic scores, sales (owners), player engagement, and pricing to determine the ROI of critic reviews.
data-analysis data-science data-visualization game-reviews games-sales jupyter-notebook python-3 steam-games
Last synced: 16 Apr 2026
https://github.com/shruti23-ui/blinkit-powerbi-dashboard
A comprehensive Power BI dashboard analyzing Blinkit's sales performance, outlet metrics, and multi-tier market analytics with interactive visualizations and business intelligence insights.
data-analysis data-visualization microsoft-excel microsoft-power-bi powerbi sales-analysis sql
Last synced: 09 Feb 2026
https://github.com/kelvintechnical/web-scraper
Tableau Book Price Analysis
data data-analysis data-science tableau tableau-public
Last synced: 25 Jan 2026
https://github.com/kathisnehith/medicare-ip-hospital-analysis
In-depth Data analysis and visualization of Medicare inpatient hospital data.
data-analysis data-cleaning-and-preprocessing data-merging excel exploratory-data-analysis medicare-claim-costs-prediction powerquery sql tableau-dashboards
Last synced: 10 Feb 2026
https://github.com/jpgiant/gujaratrainfallanalysis_2021
Analysis about the rainfall that occurred in the districts of Gujarat state in 2021
data-analysis exploratory-data-analysis exploratory-data-visualizations matplotlib numpy pandas-python python
Last synced: 07 May 2026
https://github.com/sreekar0101/bank-financial-loan-performance-trend-analysis
About This project analyzes the performance trends of financial loans using SQL for data extraction and Tableau for visualization. The goal was to perform exploratory data analysis (EDA) to understand key metrics like loan applications, funded amounts, interest rates, and debt-to-income ratios using sql and tableau for visualization
data-analysis data-visualization sql tableau
Last synced: 27 Feb 2026
https://github.com/edanur-y/abalone-age-prediction-with-regression-models
Comparing the performances of simple linear, multiple linear, multi-layer perceptron and k-nearest neighbors regressions on abalone data to predict the age.
data-analysis hyperparameter-tuning missing-values-analysis outlier-analysis python recursive-feature-elimination
Last synced: 20 May 2026
https://github.com/prateekbisht23/inventory_management
This project is an Inventory Management System built using Python (Pandas, NumPy, SciPy) and Jupyter Notebook. It allows efficient tracking of stock, performing data analysis, and generating useful statistical insights (mean, standard error, confidence intervals) to support better decision-making.
data-analysis jupyter-notebook management python3
Last synced: 11 Feb 2026
https://github.com/georgehanymilad/mobile-usage-behavior-analysis
Excel Project for Data Analysis
data-analysis data-visualization dataanalyst dataanalytics excel-dashboard pivot-tables powerquery storytelling
Last synced: 11 Feb 2026
https://github.com/nickenshidqia/startup-venture-funding-dashboard-data-analysis
The Startup Venture Funding Dashboard is a comprehensive visual representation of the dynamic landscape of startup funding, providing valuable insights into the top startups, funding round types, markets, startup statuses, and investor details.
dashboard data-analysis tableau tableau-dashboards
Last synced: 11 Feb 2026
https://github.com/shrutiijoshi/crm-sales-analysis
The dataset contained records exported from MavenTech's CRM from October 2016 to December 2017. It held details of opportunities with associated information such as product, account, and whether the sale was won or lost.
data-analysis data-visualization dax-functions powerbi powerquery
Last synced: 11 Feb 2026
https://github.com/ohimoiza1205/mastercard-cybersecurity-simulation
Served as an analyst on Mastercard’s Security Awareness Team to identify and report security threats
cybersecurity data-analysis data-presentation security-awareness-training technical-security-awareness
Last synced: 11 Feb 2026
https://github.com/joemull/pyjade
A data curation script for the Jane Addams Digital Edition
data-analysis digital-humanities
Last synced: 11 Feb 2026
https://github.com/virajbhutada/telecom-customer-churn-prediction
Predict and prevent customer churn in the telecom industry with this project. Harness the power of advanced analytics and Machine Learning on a diverse dataset to develop a robust classification model. Gain deep insights into customer behavior and identify critical factors influencing churn using interactive Power BI visualizations.
churn-prediction classification-models customer-attrition-analysis customer-churn-prediction data-analysis data-science decision-tree-classifier eda logistic-regression machine-learning machine-learning-algorithms machine-learning-models pandas powerbi powerbi-desktop python random-forest-classifier roc-curve xgboost-classifier
Last synced: 09 Apr 2026
https://github.com/sharmas1ddharth/mode_of_transport_analysis
This project requires you to understand what mode of transport employees prefers to commute to their office. The data includes employee information about their mode of transport as well as their personal and professional details like age, salary, and work exp. We need to predict whether or not an employee will use private transport. Also, which variables are a significant predictor behind this decision.
Last synced: 11 Feb 2026
https://github.com/jakobzmrzlikar/pca-on-genomes
An analysis of human genome mutations from different populations.
data-analysis genome-analysis pca-analysis
Last synced: 16 May 2025
https://github.com/an0n1mity/spamclassifiereval
A repository for evaluating the misclassification rate of spam classification models using a threshold-based approach.
data-analysis machine-learning natural-language-processing python-programming spam-classification text-classification
Last synced: 02 Nov 2025
https://github.com/mlund2k/project-1-baseball-performance-vs.-attendance
Project assets for my first exploratory data analysis: Baseball Performance vs. Attendance.
bigquery data-analysis data-cleaning data-visualization excel rstudio sql tableau tidyverse
Last synced: 12 Feb 2026
https://github.com/mrendiks/analyst-data-survey-monkey
Learn how to analyst data from dataset surver monkey using Excel and Python
data-analysis ipynb-jupyter-notebook python
Last synced: 07 Mar 2026
https://github.com/biginformatics/git-basics
Hands-on Git and GitHub lessons for analysts and statisticians
data-analysis git github public-health training
Last synced: 10 Jun 2026
https://github.com/anshulkansal121/music_store_analysis_sql
SQL project to analyze online music store data
beginner-friendly contributions-welcome data-analysis database mysql sql
Last synced: 07 May 2026
https://github.com/bnvulpe/regression-and-time-series
This work centers on assessing and comparing predictive models for regression and time series prediction using specific datasets, with the goal of selecting the most effective methodology for unseen test data.
colab data-analysis data-analysis-python data-science data-visualization forecasting jupyter-notebook machine-learning model-evaluation predictive-modeling python regression sarima sarimax time-series-analysis time-series-analysis-and-forecasting
Last synced: 08 May 2026
https://github.com/deborangueira/campeonado_kaggle_2025
Desenvolvimento de um modelo de machine learning para prever o sucesso de startups. O objetivo é identificar quais empresas têm maior probabilidade de se tornarem casos de sucesso no mercado.
computacao data-analysis desafio kaggle modulo3 ponderada
Last synced: 16 May 2026
https://github.com/projects-developer/ransomware-prediction-using-machine-learning-project
The project aims to develop a machine learning-based system to predict and detect ransomware attacks on computer systems. Ransomware is a type of malware that encrypts a victim's files and demands a ransom in exchange for the decryption key. Project Includes Source Code, PPT, Synopsis, Report, Documents, Base Research Paper & Video tutorials
artificial-intelligence btechproject computerscienceproject cybersecurity-malware data-analysis data-mining deep-learning machinelearning mtechproject neural-networks ransomware-machine-learning
Last synced: 12 Feb 2026
https://github.com/blladerunner/customer-churn-dashboard
Customer Churn Dashboard — SQL + Python analytics project exploring customer retention patterns, churn rate by demographics and services, and key insights for telecom business strategy.
business-intelligence churn-analysis customer-retention dashboard data-analysis data-analytics data-science pandas powerbi python sql sqlite telecom
Last synced: 08 May 2026
https://github.com/prakashjha1/stock-investment-analysis
Stock Investment Analysis Project can help investor to select the better performing stocks.
data-analysis data-science numpy pandas pandas-datareader parallel-programming python
Last synced: 08 May 2026
https://github.com/kariemseiam/geoegy
An innovative and responsive dashboard to discover, filter, and analyze places across Egypt. Featuring advanced search, interactive maps with Leaflet.js, real-time analytics, dark mode, and seamless data export—all wrapped in a sleek, modern design with RTL support.
accessibility data-analysis data-visualization es6-modules geojson javascript leaflet mapping openstreetmap places-data responsive-design web-development
Last synced: 13 Feb 2026
https://github.com/samruddhi3012/health-care-analytics
Hi! This repo involves analyzing the Healthcare analytics using Advanced Microsoft Excel.
dashboard data-analysis data-visualization healthcare microsoft-excel pivot-chart pivot-tables vlookup
Last synced: 29 Mar 2025
https://github.com/mananabbasi/dashboard-power-bi
This repository showcases **Power BI projects** focused on data visualization and business intelligence. Each project transforms raw data into interactive dashboards and reports, providing actionable insights for decision-making. The repository includes Power BI files, datasets, and documentation for each project.
data-analysis data-science data-visualization powerbi
Last synced: 13 Feb 2026
https://github.com/farhad-here/tegenx
TeGenX: Multilingual Text Generation App.TeGenX is a lightweight, interactive text generation application built with Streamlit. It leverages multiple pre-trained transformer models to generate text in both English and Persian.
data-analysis data-science deep-learning happytransformer huggingface nlp python stream text-generation text-generator textgeneration transformer web-application
Last synced: 25 Jan 2026
https://github.com/lavkalsi/tableau-project-stock-market-analysis
The Tableau Project: Stock Market Analysis features a dashboard that combines Descriptive, Diagnostic, Predictive, and Prescriptive analytics to provide insights into stock market trends. Using Python for data processing and an LSTM model for forecasting, this project visualizes historical and predicted stock prices, helping make informed decision.
dashboard data-analysis deep-learning lstm-model python tableau
Last synced: 18 May 2026
https://github.com/sumit-sinha9/sales-analysis
Analyzing 12 months worth fo Sales data
data-analysis pandas python visualization
Last synced: 08 May 2026
https://github.com/chanmeng666/mnist-handwritten-digit-recognition-project
【Sprinkle some star dust on this repo! ⭐️ It's good karma!】A comprehensive implementation and analysis of handwritten digit recognition using multiple neural network architectures on the MNIST dataset. Features basic MLP, optimized feature-selected model, and deep CNN approaches with detailed performance comparisons and visualizations.
cnn computer-vision data-analysis data-visualization deep-learning feature-analysis handwritten-digit-recognition keras machine-learning mlp mnist model-optimization neural-networks python scikit-learn tensorflow
Last synced: 02 Apr 2026
https://github.com/faizantkhan/python_matplotlib
Matplotlib is a powerful Python library for creating visualizations and plots. It’s widely used for data representation, making complex information more accessible and interpretable. It offers various types of plots, including line graphs, scatter plots, bar charts, histograms, and more
data-analysis data-analytics data-engineering data-science data-visualization deep-learning graphs line machine-learning machine-learning-algorithms matplotlib matplotlib-pyplot matplotlib-python python
Last synced: 20 May 2026
https://github.com/dhruwsunita/zomato-data-analysis-project
Zomato data analysis project using Python.
data-analysis data-visualization jupyter-notebook matplotlib numpy pandas-dataframe python
Last synced: 08 May 2026
https://github.com/fhdsl/seattlestatsummer_r
A 4-day introduction to R programming, focused on Fred Hutch Research Interns
beginner beginner-friendly course data-analysis data-science introduction-to-programming r-programming tidyverse
Last synced: 19 Mar 2026
https://github.com/projects-developer/full-stack-network-intrusion-detection-system-using-machine-learning
The project aims to design and develop a full-stack network intrusion detection system using machine learning techniques. Project Includes Source Code, PPT, Synopsis, Report, Documents, Base Research Paper & Video tutorials
algorithms computerscienceproject cybersecurity data-analysis full-stack-development intrusion-detection-system machine-learning network-intrusion-detection network-security web-development
Last synced: 14 Feb 2026
https://github.com/hlexnc/project-arepo
Data-driven stroke risk assessment & personalized recommendations, powered by machine-learning and an NLU-driven chatbot.
chatbot data-analysis docker docker-compose machine-learning nlu-chatbot python rasa scikit-learn sklearn streamlit
Last synced: 15 Feb 2026
https://github.com/prathmesh2507/global-stock-intelligence-dashboard
Interactive Global Stock Market Analytics Dashboard built using Python, YFinance, Pandas, Streamlit, and Plotly. Analyze 20+ countries and 400+ top stocks with advanced visualizations and financial insights.
dashboard data-analysis data-visualization python stock-analysis streamlit
Last synced: 15 Jun 2026
https://github.com/nmelgar/marathons_data_viz
Data visualization project to analyze finishing times and other data.
csv csv-files data data-analysis data-insight data-visualization data-viz dataset tableau
Last synced: 15 Feb 2026
https://github.com/samruddhi3012/rfm-analysis
Hi there! In this project I have performed Sales Analysis (RFM Analysis) using SQL and Tableau.
data-analysis data-visualization mssqlserver rfm-analysis segmentation tableau
Last synced: 27 Jun 2025
https://github.com/nemat-al/multivariate_data_analysis
Tasks for Multivariate Data Analysis Course @ ITMO University
data-analysis multivariate-analysis python
Last synced: 20 May 2026
https://github.com/kunalkumar2001/data-analyst-power-bi
Data Analyst Power BI Project for Portfolio
data-analysis data-analyst data-analyst-power-bi powerbi
Last synced: 16 Feb 2026
https://github.com/lunarwhite/lake-george-viz
Geroge Lake data analysis and visualization, ANU COMP1730/6730
Last synced: 01 Nov 2025
https://github.com/prekshivyas/cis-595-big-data-analytics
Comprehensive real estate price prediction project, integrating socioeconomic indicators and property features.
data-analysis data-cleaning data-mining data-preprocessing data-science data-visualization data-wrangling exploratory-data-analysis web-scraping
Last synced: 16 Feb 2026
https://github.com/k-bloch/car-theft-analysis
A dashboard created to inform the public about car theft, providing insights extracted from real-world police stats.
data-analysis maven-analytics tableau
Last synced: 19 Mar 2026
https://github.com/devexpress-examples/aspxpivotgrid-group-date-time-values
This example shows how to group date-time values in Pivot Grid for Web Forms.
asp-net-web-forms data-analysis dotnet pivot-grid pivot-grid-for-web-forms
Last synced: 01 Mar 2026
https://github.com/arnoudbuzing/iowa-caucus
Data Analysis on 2020 Iowa Caucus results
caucus data-analysis iowa iowa-caucus mathematica primaries primary-election wolfram-language
Last synced: 01 Mar 2026
https://github.com/deepanshkhurana/udacityproject-prediciting-boston-housing-prices
This is a Udacity Project for the Machine Learning Nanodegree. Here, we are trying to predict Boston Housing Prices using sklearn.
data-analysis data-science machine-learning python scikit-learn udacity
Last synced: 08 May 2026
https://github.com/an4pdm/relatorio-de-vendas
O presente projeto foi feito através das ferramentas oferecidas pelo Power BI afim de aprimorar meus conhecimentos sobre ETL. Os dados utilizados foram de origem do site "Kaggle".
data-analysis data-visualization database etl powerbi
Last synced: 20 Jun 2026
https://github.com/oscarmtr/metrov
Interactive viewer for tropospheric meteorological soundings
climate data-analysis meteorology skew-t soundings temperature tropospheric web
Last synced: 01 Mar 2026
https://github.com/liebsen/overlemon
Overlemon institutional application
data-analysis design devops sysadmin webdev
Last synced: 21 Jul 2025
https://github.com/aleks-andrs/bigdataanalytics
Public repository for CM3111: Big Data Analytics Coursework (Meteorite landings analysis)
data-analysis data-science machine-learning
Last synced: 02 Mar 2026
https://github.com/cnoret/ibm-data-analyst-professional
Final project & Courses Notebooks
analyzing-data data-analysis data-analyst data-manipulation data-science data-visualization ibm ibm-certificate ibm-data-analyst-professional ibm-datascience-certification pandas python
Last synced: 09 May 2026
https://github.com/mayankyadav23/amazon-sales-data-analysis
Diving into Amazon sales data to uncover hidden gems! 📈 Analyzing iNeuron's dataset to optimize sales strategies and boost performance 💡 Driving business growth with data-driven decisions! 💻
amazon data-analysis data-visualization ineuron-ai internship-project
Last synced: 02 Mar 2026
https://github.com/mbarbetti/bachelor-thesis-public
:book: My bachelor thesis at the University of Firenze
bachelor-degree bachelor-degree-thesis bachelor-thesis data-analysis lhcb-experiment particle-physics thesis
Last synced: 02 Mar 2026
https://github.com/drisskhattabi6/meteo-data-mining
This repo contains using Data Mining Techniques to analyze meteorological (meteo) data. The objective is to extract meaningful insights and patterns from the data that can aid in understanding weather phenomena and predicting future weather conditions.
cart data-analysis data-mining data-visualization decision-making decision-tree extract-data extract-insights insights-analytics insights-data k-means knn machine-learning svm
Last synced: 21 Mar 2025
https://github.com/chaitanyaprasad60/sql-queries
This is a list of complex SQL Queries I have practiced.
data-analysis sql window-functions
Last synced: 03 Mar 2026
https://github.com/ranagaballah/true-fake-news
True Fake News Detector NLP model
data-analysis data-science data-visualization deployment machine-learning matplotlib nlp numpy pandas python
Last synced: 09 May 2026
https://github.com/elrf3lipes/ramon-s_portfolio
I'm passionate about Cloud and DevOps, and for the moment I'm posting some of my work and personal projects here to showcase that. If its useful for you, feel free to integrate or contribute!
api-integration biopython clinical-trials data-analysis data-extraction data-parsing django docker entrez ipython medline-xml pandas pubmed-parser requests rest-api
Last synced: 27 Mar 2026
https://github.com/rizkipragustono/data_analysis_spark
Exploration: Data Analysis using Spark
apache-spark data-analysis pyspark python spark-sql sql
Last synced: 09 May 2026
https://github.com/alex-petrov-git/petrowiki
My wiki-pages
acoustics aeroacoustics aeromechanics algebra analysis data-analysis fourier-analysis hydrodynamics linear-algebra math ml obsidian physics probability-theory statistics wiki wikipedia
Last synced: 02 Mar 2025
https://github.com/ibrahimceyisakar/hotel-finder-streamlit-dashboard
Streamlit dashboard of hotel-finder
data-analysis data-science data-visualization pandas plotly python streamlit
Last synced: 16 Apr 2026
https://github.com/estherslabbert/final-capstone-unsupervised-ml
Exploration of USArrests data using unsupervised machine learning
arrests correction data data-analysis data-clustering data-visualization jupyter-notebook machine-learning pca-analysis standardised-data usa
Last synced: 26 Jun 2025
https://github.com/dvaser/world-happiness-expanatory-data-analysis
DATA ANALYSIS
data-analysis data-visualization dataset jupyter jupyter-notebook kaggle python
Last synced: 03 Mar 2026
https://github.com/victoryfanfare/car-price-prediction
ML модель для определения рыночной стоимости автомобилей с пробегом. Проект включает анализ данных, feature engineering и сравнение различных алгоритмов машинного обучения.
catboost data-analysis jupyter-notebook lightgbm machine-learning pandas python regression
Last synced: 15 Jun 2026
https://github.com/banner-19/extraction-and-analysis-of-text
The objective is to analyze text content from a list of URLs. This involves extracting article titles and text, then performing natural language processing to generate metrics like sentiment, readability, and word usage. Finally, the results are stored for further analysis or visualization.
data-analysis data-analytics data-science nlp nltk python3 text-analysis text-extraction
Last synced: 03 May 2026
https://github.com/adrianlardies/feelms_predict_by_emotion
Feelms is a mood-based movie recommendation app that uses collaborative filtering and machine learning to suggest films based on your emotions. Built with Streamlit and powered by AWS, Feelms personalizes each user's experience through simulated interactions and tailored predictions.
aws-ec2 aws-rds data-analysis data-science machine-learning python streamlit
Last synced: 16 Apr 2026
https://github.com/abhipatel35/gym-performance-analysis
Analyzing gym performance and user engagement in Arizona using Spark SQL, PySpark, and visualization techniques on the Yelp dataset.
apache-spark asu business-insights data-analysis data-processing-at-scale data-visualization dps gym-analysis rating-patterns sql trend-analysis user-insights yelp-dataset
Last synced: 16 Apr 2026
https://github.com/kosuri-indu/allaboutolympics
All About Olympics is an interactive dashboard presenting comprehensive data and insights on Olympic Games from 1896 to 2020.
data-analysis pandas plotly python streamlit
Last synced: 16 Apr 2026