Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-07-02 00:07:33 UTC
https://github.com/jiwookseo/natural_language_analysis
API sample for Google Natural Language and ECOS (the Bank of Korea's Economic Statistics System)
data-analysis google-natural-language-api text-analysis
Last synced: 11 Oct 2025
https://github.com/pyrypp/koivunen-vastaanottoanalyysi
An analysis of warehouse goods receiving
business-intelligence data-analysis interactive-visualizations
Last synced: 11 Oct 2025
https://github.com/vineet416/eda-hr-analytics
EDA on HR-Analytics by PW Skills Data Analytics course
data-analysis data-analysis-python data-analytics data-preprocessing data-processing data-visualization exploratory-data-analysis jupyter-notebook matplotlib-pyplot numpy pandas python seaborn statistical-analysis
Last synced: 14 Apr 2026
https://github.com/sumit0ubey/internship
This repository showcases the tasks and projects I completed during various internships. It includes work across diverse domains such as: Data Analysis: Exploratory data analysis, data visualization, and insights generation using Python and libraries like Pandas, Matplotlib, and Seaborn. Backend Development: Designing and implementing RESTful API
backend-development data-analysis python-developer
Last synced: 05 Sep 2025
https://github.com/fbarffmann/python-challenge
Automated financial and election data analysis using Python. Cleaned and transformed large CSV datasets, calculated key business metrics, and generated automated reports for stakeholders.
automation csv data-analysis data-cleaning election-analysis financial-analysis python reporting
Last synced: 24 Apr 2025
https://github.com/kishorep26/school-recommendation-system
Intelligent school recommendation system that matches students with suitable educational institutions based on preferences and performance metrics
bootstrap data-analysis decision-support edtech education education-technology flask matching-algorithm python recommendation-system school-finder school-search student-portal web-application
Last synced: 06 May 2026
https://github.com/anudeepkaddala/bankds
This repository contains a Python-based solution for cleaning, matching, and formatting bank data. The primary goal is to match banks from two datasets based on their names and associate each bank with its respective asset size. The final output is a cleaned dataset with asset sizes in Indian-style currency format.
data-analysis data-science fuzzy-matching pandas python
Last synced: 12 Apr 2026
https://github.com/sarthakagg29/sql-share-trading-analysis
Analysis of share trading transactions using SQL. Includes table setup, sample data, and a variety of queries to answer typical business questions about stocks and trading.
data-analysis dbeaver portfolio postgresql share-market sql
Last synced: 04 Jul 2025
https://github.com/wtmcgrew/sql-credit-risk-analysis
Credit Risk Analysis using SQL & Excel – Approval trends by FICO, DTI, PTI, LTV, and delinquencies.
case-study credit-risk data-analysis financial-analysis loan-applications portfolio-project sql sqlite underwriting
Last synced: 04 Jul 2025
https://github.com/prakshal0809/power-bi-analytics-dashboard
I have developed a dashboard in Power BI utilizing data from an Excel file. The dashboard effectively visualizes and analyzes the given data.
Last synced: 22 Feb 2026
https://github.com/NurFakhri/scraping-and-analysis-skincare
Scraping and data analysis of Indonesian skincare reviews.
beutifulsoup data-analysis data-scraping python requests review scraping-websites
Last synced: 12 Oct 2025
https://github.com/josepablodmg/python--linear-regression-advertising
A linear regression analysis to predict sales based on advertising spending across TV, radio, and newspaper channels. The project includes exploratory data analysis, model training, coefficient visualization, and residual analysis.
advertising data-analysis exploratory-data-analysis linear-regression machine-learning python regression scikit-learn visualization
Last synced: 06 May 2026
https://github.com/jeffbrennan/analysis-templates
Templates of commonly used graphics/functions/settings to help focus on the bigger picture
Last synced: 12 Oct 2025
https://github.com/akash1070/project--uber-data-analysis
Analyzing Uber data from a dataset using Python
data-analysis data-science python
Last synced: 09 May 2026
https://github.com/leosimoes/digitalinnovationone-analise-covid
Practical project "Creating models with Python and Machine Learning to predict the evolution of COVID-19 in Brazil" from Digital Innovation One.
arima-models data-analysis data-science python time-series
Last synced: 09 May 2026
https://github.com/faysalalmahmud/bd-med-professional-analysis
Analysis of healthcare professionals in Bangladesh through web scraping, data processing, and interactive visualization.
data-analysis data-visualization jupyter-notebook python scraper selenium selenium-webdriver tableau
Last synced: 04 Sep 2025
https://github.com/rachit1084/sql-practice-ankit-bansal
Personal SQL problem-solving practice based on Ankit Bansal's YouTube series, with logic-driven solutions for analyst prep.
analytics data-analysis data-analyst interview-preparation logical-reasoning postgresql sql sql-practice
Last synced: 04 Jul 2025
https://github.com/rohitblaze10/netflix_analysis_using_tableau
The Netflix dashboard in Tableau provides a professional and visually captivating interface for users to explore a vast collection of TV shows and series. With seamless navigation and interactive filters, users can easily personalize their recommendations based on release year, genre, duration, and rating.
data data-analysis data-science data-visualization netflix tableau
Last synced: 04 Feb 2026
https://github.com/saroshfarhan/dublin_pedestrian_data_analysis
Pedestrian footfall data analysis for the city of Dublin
data-analysis data-visualization r-programming
Last synced: 07 Jan 2026
https://github.com/krypten/nycsubwayturnstileweatheranalysis
Analyzing the NYC Subway Dataset
data-analysis machine-learning machinelearning python
Last synced: 01 Sep 2025
https://github.com/kseniatyschuk/excel-data-matcher
Compare and match Excel files via a simple Python GUI
automation data-analysis etl excel gui pandas python3 tkinter
Last synced: 23 Apr 2025
https://github.com/alefrp/properties_dbt
A DBT project for analyzing city property data.
data-analysis data-warehouse dbt python sql
Last synced: 13 Oct 2025
https://github.com/angelalim88/jakarta-air-quality-index-classification
This project classifies Jakarta's Air Quality Index (AQI) from 2010 to 2023 using machine learning models (Random Forest, MLP, SVM) based on pollutant concentrations.
data-analysis data-visua machine-learning scikit-learn tensorflow
Last synced: 13 Oct 2025
https://github.com/edanur-y/variable-analysis-of-banks-ratio-data
Testing variables for multicollinearity, multivariate normality and analyzing outliers and missing values. ⭕SPSS 🔵R
data-analysis log-transformation missing-values-analysis multicollinearity normality-test r spss
Last synced: 10 Jun 2026
https://github.com/jackieocham/rest-metrics-data-analysis
Data analysis on sleep and health tracking data collected over many years
data-analysis data-cleaning data-manipulation data-preparation data-project exploratory-data-analysis initial-data-analysis mysql mysql-database sql
Last synced: 01 Apr 2025
https://github.com/montanaz0r/kaggle-titanic-disaster-ml-project
Full workflow of building a classification model that scored 0.80382 (top 8%)
classification data-analysis data-science data-visualization jupyter-notebook kaggle-competition kaggle-titanic machine-learning matplotlib pandas python random-forest seaborn sklearn
Last synced: 29 Apr 2026
https://github.com/korniichuk/pydatan-homework
Python Data Analysis course homework
course data-analysis data-analysis-python python python3
Last synced: 06 May 2026
https://github.com/anderson-andre-p/uber-data-analysis
This repository contains a comprehensive data analysis project focused on Uber rides. The dataset used in this project is a spreadsheet obtained from Uber, containing data related to ride details, such as pick-up and drop-off locations, date and time of the ride, and the fare amount.
data-analysis data-science data-visualization python
Last synced: 15 Jun 2026
https://github.com/callmezoe/neo4j-supplychainmanagement
cypher data-analysis data-visualization graphdatabase neo4j
Last synced: 08 Apr 2025
https://github.com/erick957/saleprice-prediction-dataset-analysis-and-cleaning-advance-regression
🏠 Predict house prices using advanced regression techniques with this comprehensive analysis and cleaning project, from data loading to model deployment.
data-analysis data-science eda google-colab machine-learning numpy pandas python scikit-learn scikit-learn-python
Last synced: 06 May 2026
https://github.com/montanaz0r/testing-if-mma-math-deduction-works-using-ufc-fighters-data
Probabilistic reasoning about the phenomenon called MMA math, using UFC fighter data and Python.
bayesian-inference data-analysis data-science graphviz jupyter-notebook pandas python scipy statistics
Last synced: 14 Apr 2026
https://github.com/mouadtaoussi/capmpingi-employee-reviews
Analysis of Capmpingi employee reviews using Python/Pandas and Power BI
data-analysis data-science kaggle pandas powerbi python python3
Last synced: 14 Apr 2026
https://github.com/dhruvil-26/tableau-projects
This repository contains Tableau visualization projects focused on data analysis across different domains. Projects include: 1. IPL Visualization - Insights into IPL match, Team and player statistics. 2. EV Analysis - Visualizations exploring the adoption of electric vehicles. 3. Road Accident Analysis - Analysis of road accident patterns
analysis data data-analysis data-analytics electric-vehicles ipl road-accident-analysis tableau tableau-public
Last synced: 19 Jan 2026
https://github.com/aran203/fluxease
Python package for eddy flux data post processing
data-analysis data-science eddy-covariance python
Last synced: 03 Apr 2025
https://github.com/mikma03/datascience_python_datacamp
DataScience with Python. Code and examples. Python libraries, including pandas, NumPy, Matplotlib, and many more.
data-analysis data-science datacamp datascience numpy pandas python
Last synced: 06 May 2026
https://github.com/bakulwani/data-mart-weekly-sales
Cleaned and analyzed weekly sales data using SQL to build a business-focused data mart with KPIs, customer segmentation, and platform insights.
customer-segmentation data-analysis data-cleaning etl kpi-analysis mysql sales-analysis sql
Last synced: 21 Feb 2026
https://github.com/thinzarhninyu/dap
Notes and Projects for Data Analysis with Python course from FreeCodeCamp.org
data-analysis data-analysis-python ipynb jupyter-notebook python
Last synced: 18 Feb 2026
https://github.com/chaitanyac22/investment-analysis-for-an-asset-management-company
Data analysis to identify the best sectors, countries, and a suitable investment type for making investments.
business-analytics business-intelligence data-analysis data-cleaning data-insights data-manipulation data-preparation data-visualization decision-making finance python3 risk-management statistics
Last synced: 06 May 2026
https://github.com/bhushan148/finance-domain-bank-loan-report-tableau
I analyzed 🏦 bank loan data to reveal trends, KPIs, and insights. Using Tableau 📈 for dashboards and SQL 🗃️ for data extraction, I visualized loan applications, borrower profiles, and repayment behaviors 💡.
bussiness-intelligence dashboard-design data-analysis data-visualization excel figma sql sqlqueries tableau
Last synced: 08 Apr 2025
https://github.com/abeltavares/postql
Python library and command-line interface (CLI) tool for interacting with PostgreSQL databases, providing simplified database management, query execution, and result export functionalities.
cli command-line-interface data-analysis data-engineering data-export data-management data-processing data-visualization database database-administration database-tools etl oop postgres postgresql psycopg2 python sql sqlalchemy wrapper
Last synced: 19 Jan 2026
https://github.com/treasarose/us_candy_distribution_analysis_project
This project focuses on advanced data analysis and optimization using SQL. It includes queries for analyzing sales, product margins, and shipping efficiency for a US candy distributor.
data-analysis entity-relationship mssql optimization query sql-server sqlproject us-candy-distributor
Last synced: 12 Oct 2025
https://github.com/bhaveshbhakta/flight-price-prediction-using-ml
Flight Price Prediction
data-analysis data-visualization flight-price-prediction machne-learning random-forest
Last synced: 12 Oct 2025
https://github.com/harryrlk/data_analysis_showcase
This repository showcases my data analysis and visualization projects using Excel, Python, R, and Tableau. Some projects are under NDA, so key figures and specific numbers are not included, but brief overviews and methodologies are provided. Feel free to explore and contact me for further details.
data-analysis data-science data-visualization excel portfolio python r tableau
Last synced: 06 May 2026
https://github.com/bala-1409/power-bi-visualization-project
This repository contains visualization projects built in Power BI. The visualizations yield insights and strategies that help grow the business and raise profit margins, and that help reduce damage from accidents and calamities.
dashboard data-analysis data-science data-visualization exploratory-data-analysis microsoft-excel microsoft-power-bi microsoft-powerpoint power-bi powerbi powerbi-reports powerbi-visuals visualization
Last synced: 04 Jan 2026
https://github.com/anniefib/my_projects
Powering Data Dreams: From Orchestration to Analytics with Cloud Precision
airflow-etl-orchestration aws cloud-native-data-solutions data-analysis data-visualization eda end-to-end-data-pipelines machine-learning-models spark-analytics sql-modeling
Last synced: 25 Mar 2025
https://github.com/zulhaditya/web-scraping-python
A repository that stores various source code and web scraping methods using Python.
data-analysis python3 webscraping
Last synced: 12 Oct 2025
https://github.com/chirlmin-joo-lab/papylio
Single-molecule fluorescence trace extraction and analysis
biophysics data-analysis fluorescence fret single-molecule sparxs
Last synced: 12 Oct 2025
https://github.com/agb2k/twitter-analyzer
Project to extract tweets based on searches, analyze their data, and autocorrect potentially incorrect words
data-analysis python tweepy twitter
Last synced: 13 Oct 2025
https://github.com/sunsided/esc2024
Exploratory Data Analysis on the ESC 2024 results
csv data-analysis eurovision-song-contest scraping
Last synced: 18 Feb 2026
https://github.com/saiteja-talluri/data-analytics-assignement
Report on World Happiness Data (Data Analysis and Visualisation of the data)
data-analysis data-visualization ipynb-jupyter-notebook
Last synced: 20 Jan 2026
https://github.com/seif-elkateb/dataset-analysis-r
cu-boulder data data-analysis datamodeling datascience ms-ds msds434 r
Last synced: 01 Apr 2025
https://github.com/jsimell/sleepanalysis
A Python data analysis project examining the factors that affect sleep quality and the temporal patterns in a single subject's sleep data.
data-analysis matplotlib numpy pandas python scikit-learn seaborn
Last synced: 14 Apr 2026
https://github.com/zenithclown/finfolio
A Personal Finance Management Tool for the Developers, by the Developer
data-analysis data-science finance finance-application finance-management good-habits personal-finance portfolio
Last synced: 04 Feb 2026
https://github.com/gaboelc/analysis-of-the-employment-situation-in-costa-rica-2018-2022
This is an analysis with data extracted from the INEC in order to identify the changes that occurred in the Costa Rican labor market before, during and after the COVID-19 pandemic.
costa-rica data-analysis empleo employment
Last synced: 24 Mar 2025
https://github.com/drill-n-bass/dealavo-project
Cartesian product from dictionary to list of dictionaries and faster methods for finding index than the `index` method.
data-analysis data-analysis-python matplotlib pandas python python3 random timeit
Last synced: 06 May 2026
https://github.com/soyuid/bakery-data-analyst
This Bakery Data Analysis project was created to help bakery owners understand their sales patterns. Through in-depth data analysis, it aims to provide useful insights to improve sales and operational strategies.
bakery data-analysis python sales visualization
Last synced: 24 Mar 2025
https://github.com/mateib20/proiect-achizi-ia-i-prelucrarea-datelor
Signal processing, data analysis, and spectral analysis for an audio signal
c c-language c-programming c-programming-language data-analysis data-engineering data-science data-visualization datascience python python-lambda python-library signal-analysis signal-processing
Last synced: 11 Jun 2025
https://github.com/leosimoes/digitalinnovationone-analise-datasets
Practical project "Data analysis with Python and Pandas" from the "Banco Carrefour Data Engineer" bootcamp by Digital Innovation One.
data-analysis data-science python
Last synced: 24 Mar 2025
https://github.com/jasontan22/aefes-time-series-forecasting
This project uses deep learning methods (LSTM, BiLSTM, CNN+LSTM) to forecast future closing prices from market data for Anadolu Efes Biracılık ve Malt Sanayii A.Ş. (AEFES). It details the data preprocessing, model training, and evaluation steps.
bilstm cnn-lstm data-analysis deep-learning financial-forecasting lstm machine-learning python stock-price-prediction tensorflow
Last synced: 09 Aug 2025
https://github.com/mindlessmuse666/train-test-splitter
Analysis of Titanic passenger data and splitting it into training and test sets. A practical assignment for the course "Fundamentals of Applying Artificial Intelligence Methods in Programming".
data-analysis data-preprocessing data-visualization machine-learning pandas python scikit-learn seaborn titanic train-test-split
Last synced: 12 Apr 2026
https://github.com/anushkundu/london-housing-market-analysis
London Housing Market Analysis: An Insightful Power BI Dashboard
data-analysis data-visualization powerbi transformation
Last synced: 27 Jan 2026
https://github.com/ayorick23/python-data-science-cheat-sheet
Quick, practical guide to essential Python syntax, commands, and functions for data science. Ideal for recalling how to use the most common libraries, such as NumPy, Pandas, Matplotlib, and Scikit-learn, in your day-to-day analyses.
cheat-sheet data-analysis data-science data-visualization deep-learning jupyter-notebook machine-learning matplotlib ml numpy pandas python scikit-learn scipy seaborn statistics sympy tensorflow
Last synced: 07 Apr 2026
https://github.com/adarshpheonix2810/fake-job-post-detection
This project focuses on detecting fake job posts using machine learning. Fake job advertisements are often created to scam individuals by stealing personal information or money.
data-analysis deep-learning joblib machine-learning nlp-machine-learning numpy pandas python scikit-learn tkinter
Last synced: 12 Apr 2026
https://github.com/supernyv/data_science_projects
Personal Data Science Projects
data-analysis data-science data-visualization exploratory-data-analysis machine-learning
Last synced: 14 Oct 2025
https://github.com/hilalguleryuz/powerbi_hotelbooking_data_analysis_project
Hotel Booking Data Analysis with Power BI
dashboard data-analysis data-visualization dax hotel-booking powerbi
Last synced: 06 Jan 2026
https://github.com/syarwinaaa09/analyzing-students-mental-health
Data-driven exploration of student mental health trends using survey data
csv-dataset data-analysis education jupyter-notebook mental-health-awareness pandas psychology student-mental-health visualization
Last synced: 29 Jun 2026
https://github.com/saisurajmatta/e-commerce-sales-advanced-data-analysis
Excel-based e-commerce analytics for FNP, a gift company. It covers data extraction, modeling, and visualization, providing actionable insights on revenue, customer behavior, and operations. Key skills include Excel, Power Query, Power Pivot, and DAX. The analysis culminates in data-driven business recommendations.
data-analysis data-visualization dax excel power-pivot power-query
Last synced: 22 Jan 2026
https://github.com/adrianlardies/from-data-to-insight
This project creates and manages a MySQL database to analyze the performance of Bitcoin, Gold, and the S&P 500 in response to economic factors. It integrates historical data, executes advanced SQL queries, and visualizes key insights, showcasing the power of SQL and Python in financial analysis.
data-analysis data-science matplotlib pandas python seaborn sql
Last synced: 12 Apr 2026
https://github.com/mh-pedro/data-science-notes
Notes about Data Science
data-analysis data-science machine-learning pandas python scipy
Last synced: 14 Apr 2026
https://github.com/mattsebastianh/Analyze-Data-with-Python-Portfolio-Project
Analyze Data with Python
barplot categories chi-square-test conservation contingency-table crosstab data-analysis data-cleaning-and-preprocessing eda endangered-species matplotlib national-parks pandas-dataframe species species-conservation
Last synced: 18 Jun 2026
https://github.com/codesaadumair/exploratory-data-analysis
A centralized repository showcasing various Exploratory Data Analysis (EDA) projects using Jupyter notebooks, visualizations, and accompanying documentation.
data-analysis data-science data-visualization eda jupyter-notebook jupyterlab python
Last synced: 24 Mar 2025
https://github.com/jpcadena/ventas-facturas
Sales with invoices
data data-analysis data-exploration data-extraction data-science excel feature-engineering matplotlib microsoft numpy pandas powerbi product-sales pylint python receipts sales
Last synced: 12 Apr 2026
https://github.com/bhaveshbhakta/fish-weight-prediction-using-ml
Fish Weight Prediction
data-analysis data-visualization fish-weight-prediction gradient-boosting machine-learning
Last synced: 16 Oct 2025
https://github.com/fatihilhan42/nba-players-data-1950-to-2021
This project examines data on NBA players from 1950 to 2021. Player seasons, heights, performance, scoring averages, teams, and positions were loaded from CSV files, then cleaned and visualized to produce key tables and graphs.
data data-analysis data-engineering data-science data-visualization
Last synced: 16 Oct 2025
https://github.com/analysisbyvivek/road-accident
Analyzes road accident patterns, exploring factors like lighting, weather, speed limits, time of day, and road conditions to uncover trends in severity and frequency.
data-analysis data-visualization eda jupyter-notebook kaggle tableau-public
Last synced: 19 Jun 2026
https://github.com/bhiogade/hpi-analysis
House price index (HPI) Analysis
data-analysis data-cleaning data-visualization gathering-data numpy pandas-python
Last synced: 30 Apr 2026
https://github.com/alchemine/analysis-tools
Analysis tools for machine learning projects
data-analysis explanatory-data-analysis machine-learning python
Last synced: 06 Aug 2025
https://github.com/mindlessmuse666/iris-ml-based-on-decision-trees
The project demonstrates the use of machine learning models based on decision trees and random forests to classify the Iris dataset. It covers data loading, model training, performance evaluation, and visualization of results. Intended for learning the fundamentals of machine learning and data analysis.
classification data-analysis data-visualization decision-trees iris-dataset machine-learning model-evaluation python random-forest scikit-learn
Last synced: 17 Oct 2025
https://github.com/ibttf/bayborhood
Interactive map to find the ideal neighborhood in San Francisco based on data.
data data-analysis data-visualization gis mapbox react
Last synced: 18 Jun 2026
https://github.com/khulnasoft/data-science-materials
data-analysis data-engineering data-science data-visualization
Last synced: 17 Oct 2025
https://github.com/helosantosdesousa/analise-dados-titanic
Data analysis with the 'Titanic - Machine Learning from Disaster' dataset
analise-de-dados analise-exploratoria bootcamp bootcamp-project data-analysis data-girls data-science matplotlib numpy pandas python
Last synced: 07 May 2026
https://github.com/abhijeet107/task-4
Design an interactive dashboard for business stakeholders.
data-analysis excel-csv tableau-dashboards tableau-public
Last synced: 22 Jan 2026
https://github.com/casassg/ms_thesis
Social Media Analysis for Crisis Informatics in the Cloud
casassg-thesis data-analysis google-cloud kubernetes
Last synced: 19 Oct 2025
https://github.com/devexpress-examples/winforms-pivot-change-the-field-value-header-appearance-backcolor
This example handles the CustomDrawFieldValue event to fill the header's color.
data-analysis dotnet pivot-grid-for-winforms winforms xtrapivotgrid-suite
Last synced: 07 May 2026
https://github.com/codeslash21/data_analyst_nanodegree
Notes and projects on Data Analyst Nanodegree.
data-analysis data-analyst-nanodegree data-cleaning data-visualization data-wrangling numpy pandas python3
Last synced: 06 May 2026
https://github.com/duoan/ds-nbs
Data analysis and machine learning notebook.
data-analysis data-scientists deep-learning kaggle-competition machine-learning
Last synced: 18 Jun 2026
https://github.com/lucashomuniz/Project-02
Data Analysis and Machine Learning Techniques for Liver Disease Prediction
classification-model data-analysis decision-tree healthcare-application knn-algorithm liver-disease-prediction logistic-regression machine-learning python-language python-script random-forest supervised-learning svm-model
Last synced: 20 Oct 2025
https://github.com/ashwin331133/sql-pizza-outlet-sales-analysis
This project analyzes pizza sales data to gain insights into customer behavior and revenue patterns. Key analyses include customer insights, popular pizza types and sizes, revenue generation, and order trends. The findings help optimize menu offerings, staffing, and marketing strategies to boost overall business performance.
Last synced: 24 Feb 2026
https://github.com/saisurajmatta/nashville-housing-data-cleaning-project
Clean and standardize Nashville Housing dataset using SQL queries for improved data quality and structure.
azure-data-studio data-analysis mssql mysql sql sql-data-cleaning sql-queries sql-server-management-studio
Last synced: 23 Jan 2026
https://github.com/windjammer6/9.-employee-exit-data-analysis-python
A personal project to analyse data from an employee exit survey from DETE and TAFE. Python libraries used: NumPy, Pandas, Matplotlib
Last synced: 24 Mar 2025
https://github.com/danielrosehill/data-projects-index
Data apps and datasets deployed to Streamlit Community Cloud, Hugging Face, and elsewhere.
data-analysis data-science data-visualization
Last synced: 16 Mar 2026
https://github.com/victorlcastro-dsa/coping_struggles_prediction
Repository for predicting coping struggles based on mental health data. Includes analysis, visualization, and modeling with machine learning. Results reach 86.58% accuracy with a Voting Classifier.
classification-algorithm data-analysis data-science data-visualization machine-learning-algorithms problem-solving project-based-learning python
Last synced: 19 Apr 2025
https://github.com/katiesaund/tidy_tuesday
A weekly data project in R from the R4DS online learning community
data-analysis data-visualization datascience plot r rstats tidytuesday
Last synced: 24 Mar 2025
https://github.com/ghurault/huraultmisc
Personal R package
bayesian-statistics data-analysis r-package statistical-models
Last synced: 22 Oct 2025
https://github.com/chrispsang/customerchurnanalysis
Predicting customer churn using a RandomForestClassifier with detailed EDA, model evaluation, and visualization. Includes a Tableau dashboard for interactive insights.
customerchurn data-analysis data-visualization datapreprocessing machine-learning python scikit-learn tableau
Last synced: 31 Jan 2026
https://github.com/neerajcodes888/data-science
This repository is a hub for data science enthusiasts, offering a diverse collection of projects, notebooks, and resources covering topics such as data analysis, machine learning, deep learning, and generative AI. Explore innovative ideas, contribute to cutting-edge research, and enhance your skills in the dynamic field of data science
data-analysis data-science data-visualization deep-learning deep-learning-algorithms eda genai jupyter-notebook machine-learning machine-learning-algorithms openai-api pandas plotting python3 sklearn-library streamlit
Last synced: 01 May 2026
https://github.com/jigyasag18/bird-strikes-in-aviation-project
This project analyzes over a decade of U.S. bird strike data (2000–2011) to evaluate safety risks, damage trends, and cost implications in aviation. Using PostgreSQL for database management and Power BI for dashboard visualization, it uncovers critical insights into when, where, and how wildlife impacts aircraft. Key findings inform strategically.
bird-strike-prevention bird-strike-prevention-in-real-airport data data-analysis data-analysis-project data-visualisation data-visualization data-visualization-project data-visualizations database dataset dax-query postgresql postgresql-database powerbi powerbi-desktop powerbi-report powerbi-visuals sql sql-database
Last synced: 09 May 2026
https://github.com/thenazar9/user-behavior-email-campaign-analysis-sql
Analysis of user behavior and email campaign performance using BigQuery and Looker Studio, focusing on account creation trends, email engagement, and user segmentation.
analytics bigquery data-analysis data-visualization etl looker-studio sql structured-query-language
Last synced: 16 Oct 2025