Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-29 00:07:38 UTC
https://github.com/doughtnerd/pod-old
Read and write Excel data
data data-analysis excel poi-library workbook
Last synced: 21 Jan 2026
https://github.com/nsandoya/python_scrp_project
This is a tool made specifically for the Dipaso e-commerce website. It extracts data from the site and analyzes it to show keyword, brand, and category frequencies, price distributions, and other market trends, all in a set of friendly statistical tables and charts (exported from a Jupyter notebook) :)
beautifulsoup4 data data-analysis jupyter-notebook pandas python3
Last synced: 28 Apr 2026
https://github.com/fbarffmann/python-challenge
Automated financial and election data analysis using Python. Cleaned and transformed large CSV datasets, calculated key business metrics, and generated automated reports for stakeholders.
automation csv data-analysis data-cleaning election-analysis financial-analysis python reporting
Last synced: 24 Apr 2025
https://github.com/tralahm/kaggle-titanic-competition
Predicting Titanic Passenger Survival Using Machine Learning
data-analysis jupyter-notebook kaggle-competition kaggle-dataset machine-learning matplotlib numpy pandas predictive-modeling python3 sklearn tralahm tralahtek
Last synced: 13 Apr 2026
https://github.com/sumit0ubey/internship
This repository showcases the tasks and projects I completed during various internships. It includes work across diverse domains such as: Data Analysis: Exploratory data analysis, data visualization, and insights generation using Python and libraries like Pandas, Matplotlib, and Seaborn. Backend Development: Designing and implementing RESTful API
backend-development data-analysis python-developer
Last synced: 05 Sep 2025
https://github.com/avratanubiswas/fluorpenplugin
A MATLAB user interface for analysing OJIP curve datasets from the FluorPen instrument. It serves as an additional plug-in for "quick categorical analysis".
data-analysis fluorpen ojip-curve
Last synced: 18 Mar 2026
https://github.com/hazim-hf/data-science
This course covers basic data science principles, Python programming, and the concept of big data and its types. It explores algorithms, methods, and analyses in data science with practical Python examples. Additionally, it highlights current data technologies for storing and archiving.
data-analysis data-wrangling time-series
Last synced: 04 Jul 2025
https://github.com/darrenjolson/pba-analysis-app
Data analysis and visualization tool for professional bowling tournaments, predicting performance across different oil patterns and venues.
bowling data-analysis data-visualization flask pba predictive-analytics python reactjs sports-analytics
Last synced: 13 Apr 2026
https://github.com/analysisbyvivek/Road-Accident
Analyzes road accident patterns, exploring factors like lighting, weather, speed limits, time of day, and road conditions to uncover trends in severity and frequency.
data-analysis data-visualization eda jupyter-notebook kaggle tableau-public
Last synced: 29 Jan 2026
https://github.com/ireneflorez/nypd-mvc
Analysis of NYPD Motor Vehicle Collisions
basemap data-analysis folium jupyter-notebook matplot pandas python
Last synced: 08 May 2026
https://github.com/sco1/xbmini-py
Python Toolkit for the GCDC HAM
data-analysis data-visualization python python3
Last synced: 07 May 2025
https://github.com/cezlul/analyse-ventes-immobilier
ML solution for Parisian real-estate analysis: automatic classification of apartments vs. commercial premises (K-means, 91%) and price prediction (regression, R²=0.98) on 26K transactions. Values a €169M portfolio with data-driven strategic recommendations.
data-analysis jupyter-notebook machine-learning matplotlib numpy pandas python sklearn
Last synced: 13 Apr 2026
https://github.com/namratha2301/python-dashboard-streamlit
Experimenting with Streamlit. The app provides an interactive visualization of best-selling books, showcasing trends, top-selling books, top authors, genre distributions, and sales by decade.
css dashboard data-analysis pandas plotly python seaborn streamlit
Last synced: 05 May 2026
https://github.com/samruddhi3012/tata-data-visualization
Hi! This repo contains the dashboard I created using Tableau for TATA Data Visualization Training!
data-analysis data-visualization tableau tata
Last synced: 07 Jan 2026
https://github.com/evan-dg31/data-science
Exploratory Data Analysis (EDA), Predictive Modeling (Supervised and Unsupervised), Regression, Classification, Clustering
classification clustering data-analysis data-science data-visualization machine-learning matplotlib numpy pandas python regression-analysis seaborn
Last synced: 13 Apr 2026
https://github.com/ray-chew/pycsam
pyCSAM is a robust approach for approximating geodesic subgrid-scale orographic spectra with applications to weather forecasting and broader data analysis
data-analysis gmted icon-model merit-dem orographic spectral-analysis topography weather-forecast
Last synced: 28 Feb 2025
https://github.com/busradeveci/student-performance-prediction
A machine learning project to predict student exam performance based on academic, social, and personal features. Built with Python and scikit-learn.
data-analysis kaggle linear-regression machine-learning predictive-modeling python scikit-learn student-performance
Last synced: 25 Apr 2025
https://github.com/mxagar/data_science_udacity
My personal notes, code and projects of the Udacity Data Science Nanodegree.
dashboard data-analysis data-engineering data-science machine-learning-pipelines
Last synced: 09 Apr 2025
https://github.com/hemangsharma/streamingcontentanalyzer
This Streamlit application provides an interactive dashboard for analyzing streaming content data. It allows users to explore movie and TV show ratings, distributions, temporal trends, and genre breakdowns through various visualizations and filters.
dashboard data-analysis data-science data-visualization python streamlit-dashboard streamlit-webapp
Last synced: 02 Apr 2025
https://github.com/mishaa931/amazon-sales-dashboard-power-bi
This project features a dynamic Power BI dashboard built on dummy Amazon sales data. It visualizes key business metrics such as revenue trends, top-selling categories, discount impact, and geographic performance. The dashboard is designed to help stakeholders make data-driven decisions through clear, interactive visuals.
data-analysis data-quality data-visualization microsoftpowerbi
Last synced: 05 Feb 2026
https://github.com/joaquinmoron/airbnb-eda-python
Airbnb EDA: cleaning, exploration, and visualization in Python (pandas, matplotlib, seaborn).
airbnb data-analysis eda matplotlib pandas python seaborn
Last synced: 13 Apr 2026
https://github.com/marianamartiyns/rfm-cluster-analysis
Customer behavior and sales analysis, including data cleaning, RFM calculation, churn analysis and customer clustering.
cluster-analysis data-analysis data-cleaning data-visualization pyhton
Last synced: 16 Mar 2025
https://github.com/luminati-io/Shopee-dataset-samples
A sample dataset of over 1000 Shopee products, extracted using the Bright Data API, ideal for pricing optimization, gap analysis, and market strategy refinement.
api data-analysis data-mining datasets products shopee web-scraping
Last synced: 09 Apr 2025
https://github.com/chaganti-reddy/weather-prediction-australia
Creating a fully-automated system that can use today's weather data for a given location to predict whether it will rain at the location tomorrow.
data-analysis logistic-regression machine-learning prediction-model python3
Last synced: 13 Apr 2026
https://github.com/hari7261/playwithdata-python
This is one of the repositories where I have put a lot of data science and machine learning questions and their solutions. I hope you will find it better than some other platforms. Thank you, and happy exploring.
data-analysis data-science data-science-learning machienlearning matplotlib matplotlib-python ml numpy numpy-arrays numpy-library pandas pandas-dataframe pandas-library python python-script sklearn
Last synced: 13 Apr 2026
https://github.com/tj2904/lfb-callout-analysis
An investigation into London Fire Brigade's callout data.
data-analysis decsion-tree kmeans lfb-incidents london-fire-brigade pandas python seaborn
Last synced: 13 Apr 2026
https://github.com/lopes51789/salaryanalysis
This salary dataset is a good candidate for descriptive analysis, and we can identify which demographics experience reduced or increased salaries. For example, we could explore the salary variations by gender, age, industry, and even years of prior work.
data-analysis json mysql python3 sql tableau
Last synced: 13 Apr 2026
https://github.com/pratanup/bank-customer-churn
Prediction models based on ML as well as DL, with a comparison of their performance, to identify churned customers.
adaboost-classifier ann churn-prediction data-analysis data-visualization decision-tree-classifier deep-learning deep-learning-algorithms gaussian-naive-bayes-classification gradient-boosting-classifier k-nearest-neighbours logistic-regression machine-learning machine-learning-algorithms random-forest-classifier svc svm-classifier xgboost-classifier
Last synced: 10 Mar 2026
https://github.com/jcm-ai/Standard-Bank-Data-Science-Virtual-Experience-Programme
This repository has all of the assignments I had to do for the Standard Bank Data Science Virtual Experience Program. 📉👨💻📊📈
automl business-analysis business-solutions client-communication data-analysis data-mining data-science data-visualization machine-learning machine-learning-algorithms matplotlib-pyplot model-evaluation model-interpretation power-point presentation-slides programming-language python3 seaborn sql statical-analysis
Last synced: 19 Aug 2025
https://github.com/dbriane208/python-for-data-science
Machine Learning and Data Science repository. Love crafting Machine Learning models.
data-analysis data-science data-visualization machine-learning numpy pandas python seaborn
Last synced: 13 Apr 2026
https://github.com/fer-aguirre/cookiecutter-data-analysis-lite
A cookiecutter template for data journalism projects that offers a simplified and beginner-friendly structure.
cookiecutter data-analysis data-journalism project-template python
Last synced: 14 Jun 2025
https://github.com/jakubteichman/bullbozer_price_prediction_ml_project
A bulldozer price estimator built from a Kaggle competition dataset
data-analysis data-science estimation machine-learning prediction
Last synced: 06 Sep 2025
https://github.com/grandechowhiskey/fcc-data_analysis-projects
A collection of projects completed as part of the FreeCodeCamp "Data Analysis with Python" certification. These projects cover statistical calculations, data visualization, and trend analysis using real-world datasets.
data-analysis data-visualization matplotlib pandas python3 scikit-learn seaborn
Last synced: 01 May 2026
https://github.com/srinibas-masanta/zomato-customer-and-restaurant-analysis
This repository contains a comprehensive analysis of Zomato's platform, focusing on various aspects of customer behavior, restaurant performance, and market trends. The analysis leverages data-driven insights to answer key questions that can guide business strategies, enhance customer satisfaction, and optimize operational efficiency.
business-analytics data-analysis data-science data-visualization
Last synced: 02 Apr 2025
https://github.com/sehgal-vishal/ev-vehicle-market-analysis-dashboard
This dashboard covers electric vehicle (EV) adoption
clean-energy data-analysis data-visualization electricvehicles future-technologies
Last synced: 04 Mar 2026
https://github.com/deypadma2020/dataanalysis-mlalgo
Practice repository for data analysis, feature engineering, statistics, web scraping, and building ML model pipelines in Python.
data-analysis eda feature-engineering machine-learning-algorithms ml-pipeline statistics web-scraping
Last synced: 30 May 2026
https://github.com/spacebakery/nba-trends-project
Data Science Foundations I | Exploratory Data Analysis in Python | Summarizing Relationship Between Two Features
categorical-variables data-analysis data-visualization matplotlib nba-dataset quantitative-variables scipy seaborn subset summary-statistics
Last synced: 11 Mar 2025
https://github.com/satyam4229/prediction-of-cement-compressive-strength
Prediction of cement compressive strength using a regression model. The model predicts the compressive strength of a given cement for a variety of mixtures of its components.
data-analysis data-science data-visualization jupyter-notebook kaggle python
Last synced: 13 Apr 2026
https://github.com/pratik-khose/realtime-sales-simulation
Power BI: Real-time Sales Simulation using SQL Server and DirectQuery
data-analysis data-analytics data-visualization dax-query powerbi sql sql-server sqlserver
Last synced: 10 Jun 2026
https://github.com/xre22zax/airline-analysis
Analysis for a travel agency that needs to know the ins and outs of airline prices for its clients
data-analysis data-visualization python python3 visualization
Last synced: 13 Apr 2026
https://github.com/fbarffmann/mycitibike
Built an interactive Leaflet.js map visualizing over 750 Citi Bike station locations in NYC. Analyzed usage patterns, station density, and user navigation across the network.
citibike data-analysis data-visualization geojson geospatial interactive-map javascript leaflet nyc web-mapping
Last synced: 07 Jul 2025
https://github.com/chen0040/spark-tabular-analytics
Spark statistical inference framework for performing column pair-wise data analytics on large data tables
anova chi-square-test confidence-intervals data-analysis hypothesis-testing spark statistical-inference tabular-data
Last synced: 07 Jul 2025
https://github.com/masamallow/jupyterlab-my-local
Configuration to run my personal JupyterLab on my local machine.
data-analysis jupyter jupyter-notebook jupyterlab
Last synced: 26 Mar 2025
https://github.com/deliprofesor/k-means-clustering-for-retail-data-analysis
This project uses K-Means clustering to segment wholesale customers based on their spending habits. The data is preprocessed, scaled, and clustered into four groups. The Elbow and Silhouette methods determine the optimal number of clusters, and results are visualized using boxplots and scatter plots to uncover spending patterns.
clustering-visualisation data-analysis elbow-method k-means k-means-clustering r silhouette-score
Last synced: 10 Apr 2025
https://github.com/luciocolonna/cyclistic-bikesharing-2023
Case study on public data from Chicago's Divvy bikeshare, using R
bikesharing capstone-project cyclistic cyclistic-bikshare data-analysis data-visualization geojson ggplot2 google-data-analytics google-data-analytics-capstone-project google-data-analytics-professional leaflet r sf tidyverse
Last synced: 02 Apr 2025
https://github.com/bitcoin-apps-suite/bitcoin-spreadsheet
Open source Bitcoin-powered spreadsheet application with blockchain data integration, smart contract calculations, and collaborative financial modeling | By THE BITCOIN CORPORATION LTD
bitcoin bitcoin-sv blockchain bsv cryptocurrency dapp data-analysis decentralized excel-alternative nextjs spreadsheet typescript web3-spreadsheet
Last synced: 05 May 2026
https://github.com/jianxi-erin/bigdata-machinelearning-lab
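This project is a comprehensive big data and machine learning lab platform comprising two main tasks, each covering three key technical modules: big data processing, data analysis, and machine learning. The project is based on real competition designs and provides complete data processing simulation and modeling practice.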
data-analysis data-visualization hadoop machine-learning python spark sql
Last synced: 03 May 2026
https://github.com/joyceannie/sql-data-with-danny-case-studies
Case study solutions for #8WeekSQLChallenge at https://8weeksqlchallenge.com
8-week-sql-challenge 8weeksqlchallenge case-study data-analysis data-analytics postgresql sql
Last synced: 05 Oct 2025
https://github.com/affan005-ai/tesla-stock-prediction
This project analyzes Tesla stock data and builds machine learning models to predict and classify stock movements. The analysis includes EDA, feature correlation, moving averages, and two models
data data-analysis data-science data-visualization-project eda machine-learning matplotlib pandas predictive-analytics predictive-modeling python scikit-learn
Last synced: 05 Oct 2025
https://github.com/trismald/eurosoccer1023
Data Analyst - European Soccer 2010-2023
data-analysis data-visualization jupyter-notebook pandas powerbi python
Last synced: 06 May 2026
https://github.com/sora468/best-of-ml-python
🏆 Discover top-ranked Python libraries for machine learning, updated weekly to help you find the best tools for your projects.
airport airport-simulation chatgpt configuration data-analysis data-science data-visualization data-visualizations gpt keras machine-learning nlp python scikit-learn tensorflow transformer usg-ai-training-data usg-artificial-intelligence
Last synced: 09 May 2026
https://github.com/swatisinghit/treadmill-customer-profiling-for-aerofit
Create comprehensive customer profiles for each AeroFit treadmill product through descriptive analytics. Develop two-way contingency tables and analyze conditional and marginal probabilities to discern customer characteristics, facilitating improved product recommendations and informed business decisions.
analytics conditional-probability data-analysis data-science data-visualization eda numpy pandas probability statistics
Last synced: 08 May 2026
https://github.com/ilaxi/lomicontadores
Data management tool for tracking the number of actions per day over a year
data-analysis gdscript godot godot4 python
Last synced: 19 Apr 2026
https://github.com/surbhi242singh/pizza_sales_project
Used SQL to analyze pizza sales data
data-analysis mysql pizza-sales sql
Last synced: 07 Oct 2025
https://github.com/prajjwol09/sql_retail_analysis_project
This project demonstrates SQL-based data cleaning, exploration, and business analysis on a retail sales dataset. It involves setting up a database, removing null values, performing EDA, and using SQL queries to extract key insights such as top customers, best-selling categories, and monthly sales trends.
data data-analysis datacleaning dataexploration pgadmin4 sql
Last synced: 15 Feb 2026
https://github.com/gabboraron/biostatisztika_es_alkalmazasai
"Statistics is the branch of mathematics whose task is to give politicians a tool with which any claim, and its opposite, can be justified on scientific grounds"
biostatistics data-analysis data-visualization r statistics statistics-course
Last synced: 24 Oct 2025
https://github.com/rusiru-erandaka/pupil-dilation-signal-classification-pipeline-with-noise-filtering-feature-extraction
In this repository I worked on a pupil diameter time-series dataset. The work covers data sampling, blink detection and noise handling, stimulus onset alignment and ensemble averaging, baseline correction, feature extraction, and finally a patient classification ML pipeline.
anomaly-detection classification-pipeline data-analysis data-preprocessing data-science time-series
Last synced: 08 Oct 2025
https://github.com/ak-abhilash/super-market-sales-data-analysis-and-forecasting-using-power-bi
Power BI project to visualize sales data of a supermarket.
dashboard data-analysis powerbi salesdata visualization
Last synced: 05 Feb 2026
https://github.com/dcs-training/exploratory-data-analysis-and-visualisation-with-observable-plot
This two-hour workshop will teach you how to follow an exploratory data analysis pipeline with Observable Plot, a new JavaScript library based on the Grammar of Graphics, that proposes a simple yet expressive interface to create powerful graphics easily shareable on the web. Go to the Readme file
d3 data-analysis data-visualisation javascript observable-notebook
Last synced: 17 May 2026
https://github.com/sarvesh2304/stellarator_simulation
A comprehensive Julia package for stellarator fusion reactor physics analysis featuring 3D magnetic field calculations, neoclassical transport modelling, quasi-isodynamic optimisation algorithms, and interactive 3D visualisations. Includes tokamak comparison framework and high-resolution plotting capabilities for fusion research.
3d-visualisation data-analysis field-line-tracing fusion-physics fusion-research interactive-3d julia magnetic-confinement magnetic-field-calculations magnetic-surfaces matplotlib neoclassical-transport numerical-methods optimisations physics-simulation plasma-physics plotly quasi-isodynamic stellarator stellarator-optimization
Last synced: 09 Oct 2025
https://github.com/l1ght14/tradersentiment_primetrade
Analyzes Bitcoin market sentiment's impact on Hyperliquid trader PnL & behavior. Uncovers patterns using Python (Pandas, Seaborn) to derive actionable trading insights. Junior Data Scientist assignment for PrimeTrade
bitcoin crypto-trading cryptocurrency data-analysis financial-data-analysis jupyter-notebook market-sentiment pandas python trader-behavior web3
Last synced: 20 Oct 2025
https://github.com/marianamartiyns/api-logisticregression
Data analysis, modeling, and deployment of a logistic regression model for churn prediction, integrating a FastAPI backend and a Streamlit frontend.
data-analysis data-science fastapi logistic-regression pyhton streamlit
Last synced: 29 Apr 2026
https://github.com/ninadpatil09/hospital_emergency_room_analysis
This comprehensive analysis delves into the performance and characteristics of the hospital's emergency room over the past year. By scrutinizing key metrics and patient demographics, this study aims to provide valuable insights for optimizing patient care, resource allocation, and overall operational efficiency.
data-analysis tableau-public visualization
Last synced: 15 Feb 2026
https://github.com/loaiwalid07/automation_data_overviwe
This is a Streamlit app that gives an overview of a dataset you upload
automation data data-analysis data-exploration data-science data-transformation data-visualization
Last synced: 19 May 2026
https://github.com/anandu-jpg/coffee-shop-sales-analysis
This project analyzes coffee shop sales data to identify trends, patterns, and insights that can help improve operations, boost revenue, and enhance the customer experience.
business-intelligence data-analysis data-visualization exploratory-data-analysis jupyter-notebook pandas phyton
Last synced: 18 May 2026
https://github.com/badranalyst/time-series-analysis-of-global-trends-in-diet-gym-and-finance
This project analyzes global trends in diet, gym, and finance over time using time series data. The analysis is performed using Python libraries like Pandas, Matplotlib, and Seaborn to visualize trends and identify patterns in these sectors across various countries.
data-analysis dataset matplotlib-pyplot numpy pandas python seaborn time-series
Last synced: 14 Apr 2026
https://github.com/amirreza81/kaggle-pandas-course-solutions
Kaggle Pandas Course - Exercises solved in a different way from the sample solutions
data data-analysis data-cleaning data-manipulation data-science dataframe jupyter-notebook kaggle machine-learning open-source pandas
Last synced: 14 Apr 2026
https://github.com/imrandil/excel_learning_dir
Excel learning practice with some data, the doing
Last synced: 27 Jan 2026
https://github.com/scarlet-enlight/ml_project
Comparison of different classifiers (KNN, Naive Bayes, Decision Tree) on Sleep Health and Lifestyle Dataset
data-analysis machine-learning
Last synced: 13 Mar 2026
https://github.com/mouadtaoussi/capmpingi-employee-reviews
Analysis of Capmpingi employee reviews using Python/Pandas and Power BI
data-analysis data-science kaggle pandas powerbi python python3
Last synced: 14 Apr 2026
https://github.com/thinzarhninyu/dap
Notes and Projects for Data Analysis with Python course from FreeCodeCamp.org
data-analysis data-analysis-python ipynb jupyter-notebook python
Last synced: 18 Feb 2026
https://github.com/dzakwanalifi/stadata-x
Terminal UI for interactively exploring and downloading BPS Indonesia data
bps-api cli-app data-analysis data-visualization indonesia-statistics indonesian-data open-data python statistics terminal-ui textual tui
Last synced: 20 Jan 2026
https://github.com/jeffbrennan/analysis-templates
Templates of commonly used graphics/functions/settings to help focus on the bigger picture
Last synced: 12 Oct 2025
https://github.com/ntaraujo/cleo
Contact data processor for Cléo
contacts-manager data-analysis data-visualization whatsapp whatsapp-web
Last synced: 15 May 2026
https://github.com/veronsheva/hr_dashboards
Interactive HR dashboard using Tableau & MySQL – explore employee trends, performance, attrition, and salary insights.
calculated-fields charts cte dashboards data-analysis data-cleaning design eda mysql queries tableau window-functions
Last synced: 24 Jan 2026
https://github.com/sunsided/esc2024
Exploratory Data Analysis on the ESC 2024 results
csv data-analysis eurovision-song-contest scraping
Last synced: 18 Feb 2026
https://github.com/alefrp/properties_dbt
A DBT project for analyzing city property data.
data-analysis data-warehouse dbt python sql
Last synced: 13 Oct 2025
https://github.com/gmalbert/rugby
Rugby Data Analysis and Sports Betting
data-analysis rugby sports-betting
Last synced: 31 May 2026
https://github.com/giseletoledo/case-study-wellness-smart
Project from the Coursera course Google Data Analytics
data-analysis kaggle-dataset r
Last synced: 14 Oct 2025
https://github.com/ayorick23/python-data-science-cheat-sheet
Quick, practical guide to essential Python syntax, commands, and functions for data science. Perfect for remembering how to use the most common libraries, such as NumPy, Pandas, Matplotlib, and Scikit-learn, in your day-to-day analyses.
cheat-sheet data-analysis data-science data-visualization deep-learning jupyter-notebook machine-learning matplotlib ml numpy pandas python scikit-learn scipy seaborn statistics sympy tensorflow
Last synced: 07 Apr 2026
https://github.com/saisurajmatta/healthcare-data-analytics
Power BI project analyzing Emergency Department data, demonstrating skills in data transformation, DAX, and visualization. It focuses on patient flow, wait times, demographics, and satisfaction, providing actionable insights for healthcare improvement. Includes documentation, data dictionary, and code samples.
data-analysis data-modeling data-visualization dax power-bi powerbi-visuals powerquery
Last synced: 22 Jan 2026
https://github.com/pedrosfaria2/analisetitulosnetflix
Study of the popularity of Netflix films on IMDb.
analise-de-dados data-analysis jupyter-notebook matplotlib numpy pandas python
Last synced: 14 Apr 2026
https://github.com/virajbhutada/hollywood-insights-tableau
Strategic cinematic insights through Hollywood's data landscape. Tableau-driven analytics for genre, studio profitability, and audience dynamics. Uncover trends, assess audience reception, and navigate through years of film data, elevating your understanding of the cinematic world.
analystics business-intelligence dashboard data-analysis data-visualization entertainment hollywood storytelling tableau tableau-desktop visualization
Last synced: 05 Feb 2026
https://github.com/balajimohan18/sql-projects
The repository contains Structured Query Language (SQL) scripts for various projects, covering data cleaning, data pre-processing, data processing, data transformation, and gaining insights through queries
data-analysis data-mining data-science eta microsoft-sql-server query-language sql sql-server sql-server-management-studio sqlqueries
Last synced: 14 Mar 2026
https://github.com/mindlessmuse666/iris-ml-based-on-decision-trees
The project demonstrates the use of machine learning models based on decision trees and random forests to classify the Iris dataset. It includes data loading, model training, performance evaluation, and visualization of results. Intended for learning the basics of machine learning and data analysis.
classification data-analysis data-visualization decision-trees iris-dataset machine-learning model-evaluation python random-forest scikit-learn
Last synced: 17 Oct 2025
https://github.com/bhaveshbhakta/diamond-price-prediction-using-xgboost
Diamond Price Prediction
data-analysis data-visualization diamond-prices-predictions ensemble-learning machine-learning xgboost
Last synced: 27 Oct 2025
https://github.com/antodata/hate_crimes_spain_2014_2017
Analysis of hate crimes in Spain between 2014 and 2017 using official data
chi-square chi-square-test data-analysis data-visualization datascience folium hatecrime json lgtbiq linear-regression maps matplotlib numpy pandas python python3 scipy selenium selenium-webdriver sklearn
Last synced: 14 Apr 2026
https://github.com/lucashomuniz/Project-02
Data Analysis and Machine Learning Techniques for Liver Disease Prediction
classification-model data-analysis decision-tree healthcare-application knn-algorithm liver-disease-prediction logistic-regression machine-learning python-language python-script random-forest supervised-learning svm-model
Last synced: 20 Oct 2025
https://github.com/mothraa/etl-marketanalysis-webscraping-poo
OC project 2 refactoring (OOP version not yet completed)
data-analysis etl poo python web-scraping
Last synced: 20 Oct 2025
https://github.com/omr5221/bi-scripts
Example of DI BI tool scripting
automation configuration-files data-analysis data-warehousing dimensional-analysis diver etl-pipeline lookup modeling perl-script python shell-script sql summarization
Last synced: 15 Mar 2026
https://github.com/changyeop-yang/study-datasciencefoundation
Big data science and its analytics play a major role in this decade. Cleaning and preparing your data for analysis is still a challenge, as are performing basic visualization of your data, modeling your data, curve-fitting your data, and finally presenting your findings and wowing the audience
data-analysis ios kyungpook-national-university swift
Last synced: 23 Oct 2025
https://github.com/nikkvd/ipl-data-analysis-for-2024-special-edition-magazine
This project analyzes IPL data (2021-2023) using SQL to extract insights on player performances, team strategies, and trends for a special IPL 2024 edition magazine.
Last synced: 24 Feb 2026
https://github.com/browndwarf/contracosta
Wavelength dependent starspot contrast with Kepler/K2 and TESS
Last synced: 23 Jan 2026
https://github.com/sugumarsrinivasan/sql-datawarehouse-project
Building a modern data warehouse with SQL Server, including ETL processes, data modeling, and data analytics.
data-analysis data-analytics data-engineering data-lake data-science data-warehouse datawarehousing etl etl-pipeline medallion-architecture sql sql-query sql-server
Last synced: 19 Jun 2026
https://github.com/a26nine/kortext-usage-dashboard
An interactive data visualisation dashboard built using Tableau software to understand the value of digital resources issued on the Kortext platform at Middlesex University, London.
data-analysis data-science data-visualization knime tableau
Last synced: 01 Feb 2026