Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-30 00:07:38 UTC
- JSON Representation
https://github.com/waheed24-03/-ipl-stats-compare
Comparing Stats of Cricketers in IPL
cricket dashboard data-analysis data-visualization ipl python sports-analytics streamlit
Last synced: 28 Jun 2025
https://github.com/annnieglez/computer-vision-parking-lot
This project leverages computer vision techniques to analyze parking lot occupancy. The goal is to detect available parking spaces in real-time using image and video input.
computer-vision data-analysis data-science data-visualization google-colab image-classification image-processing machine-learning python transfer-learning
Last synced: 15 May 2026
https://github.com/kathisnehith/realestate-sales-analysis
Investigating real estate sales trends to understand market dynamics and inform investment decisions.
data-analysis excel realestate sales sql stastical-analysis-tools tableau
Last synced: 12 Feb 2026
https://github.com/mmfava/analises-papers
Script base de alguns papers publicados entre 2019 e 2021.
Last synced: 22 May 2026
https://github.com/sunnyrao07/data-analysis-dashboard-in-excel
I implemented a comprehensive data analysis solution using Excel, developing multiple dashboards and tables to visualize and interpret the data. This involved a rigorous data cleaning and preprocessing pipeline followed by data visualization.
dashboard data-analysis excel visualization
Last synced: 03 Feb 2026
https://github.com/al-ghaly/hotel-revenue-excel-analysis
Excel Dashboard to analyze data of a hotel over the past three years.
dashboard data-analysis data-visualization excel excel-analysis
Last synced: 02 Jan 2026
https://github.com/analyticslover/salifort-motors-turnover-project
The Salifort Motors H.R. Project serves as the capstone for the Google Advanced Analytics Program on Coursera. This project presents a business scenario and a problem on the scnario context, employee turnover. In this project, essential techniques as EDA and Data Modeling are used to analyze and predict the employee turnover rates in the company.
data data-analysis datamodeling eda machine-learning pandas python sklearn
Last synced: 10 Apr 2026
https://github.com/poglolopez/nesarc_research
Analyzing the relationship between Social Anxiety Disorder (SAD) and family history of behavioral problems using NESARC data. Includes statistical hypothesis testing (ANOVA, Chi-Square, Pearson Correlation, Moderation Analysis). Developed as part of the Data Analysis and Interpretation Specialization from Wesleyan University (Coursera).
anova chi-square coursera-assignment data-analysis hypothesis-testing mental-health moderation-analysis nesarc pandas pearson-correlation python social-anxiety statistical-analysis
Last synced: 14 Apr 2026
https://github.com/yrohitha/titanic-data-analysis
Predict Survival Outcomes from the 1912 Titanic disaster based on each passenger's features, such as sex and age.
data-analysis machine-learning matplotlib pandas scipy-stats statistical-models
Last synced: 13 Mar 2025
https://github.com/madhursinghbhadoriya/data_analysis_sales_insights_using_tableau
• Performed Data Cleaning using MySQL. • Data analysis and ETL in Tableau. • Created an Interactive Dashboard with significant information about the Sales Insights, Profit and Revenue Analysis.
data-analysis data-visualization dataanalysis etl mysql tableau-dashboards tableau-desktop
Last synced: 09 Apr 2025
https://github.com/bkataru/math-ia
Data and analysis for IB Math IA
data-analysis data-science data-visualization matplotlib modeling plotting regression-analysis regression-models
Last synced: 09 Apr 2025
https://github.com/fazej99/u.s-climate-and-temperature-analysis
This project analyzes historical temperature trends in the U.S., explores their economic impacts, predicts future changes using machine learning, visualizes regional anomalies with GIS, and presents findings through a secure and interactive Streamlit dashboard.
data-analysis data-science data-visualization gis machine-learning streamlit
Last synced: 22 May 2026
https://github.com/vipulbunny/web-tech-scanner
A Python-based web scraping tool that detects technologies used on a website by analyzing its scripts, meta tags, and HTML content.
beautifulsoup beautifulsoup4 data-analysis data-science python requests technology-detection web-scraping
Last synced: 22 May 2026
https://github.com/nishumehta/retail-sales-analysis
Retail sales performance analysis using Python and Power BI.
data-analysis ipynb-notebook jupyter-notebook powerbi python
Last synced: 15 May 2026
https://github.com/data-edd/california_population_projection
This project demonstrates a population projection analysis for the state of California using MySQL
Last synced: 30 Mar 2025
https://github.com/prakashjha1/whatsapp-chat-analyzer
WhatsApp Analyzer means we are analyzing our WhatsApp group activities. It tracks our conversation and analyses how much time we are spending or saying it as “wasting” on WhatsApp.
data-analysis data-science natural-language-processing pandas pyhton regular-expression
Last synced: 15 May 2026
https://github.com/nymarya/analise-correlacao-sifilis
Código da análise de correlação entre notificações de casos de sífilis e disponibilidade de testes e medicamentos
data-analysis healthcare pandas
Last synced: 03 Jan 2026
https://github.com/merrill007/sql-data-warehouse-project
The Data Warehouse and Analytics Project is a comprehensive initiative designed to demonstrate the end-to-end process of building a modern data warehouse and deriving actionable insights through SQL-based analytics.
architecture business-intelligence crm data data-analysis database database-management datawarehouse erp etl etl-pipeline model sql sqlserver
Last synced: 22 Mar 2025
https://github.com/kaushik-puttaswamy/airline-passenger-referral-prediction-using-machine-learning
This project uses a machine learning model to predict if passengers referred by existing customers will book a flight, helping airlines target likely customers. Key factors like service ratings and value for money drive predictions, achieving over 90% accuracy.
airline-marketing customer-referral-prediction customer-satisfaction data-analysis feature-engineering hyperparameter-tuning machine-learning model-evaluation predictive-analytics
Last synced: 22 Mar 2025
https://github.com/bala-1409/peerloankart-loan-fraud-detection-datascience-project
This project uses machine learning to predict whether a loan applicant will repay their loan. The project uses a dataset of historical loan data from PeerLoanKart, a peer-to-peer lending platform.
correlation data-analysis data-cleaning data-science data-visualization dimensional-analysis eda exploratory-data-analysis feature-engineering gradient-boosting-classifier hyperparameter-tuning juypter-notebook machine-learning machine-learning-algorithms numpy pandas predictive-modeling python3 scikitlearn-machine-learning supervised-learning
Last synced: 08 Apr 2026
https://github.com/macnianios/salifort-motors_retention
Google Advanced Data Analytics Capstone: Analyzing customer retention at Salifort Motors.
data-analysis machine-learning pandas python seaborn sklearn
Last synced: 08 Apr 2026
https://github.com/farzeennimran/cryptocurrency-price-prediction
Cryptocurrency Price Prediction using machine learning models 💲💰💸📈📉📊₿
artificial-intelligence cryptocurrency data-analysis data-preprocessing data-science data-visualization deep-learning feature-selection machine-learning matplotlib numpy pandas plotly prediction python regression-models scipy seaborn sklearn
Last synced: 07 Jan 2026
https://github.com/l337x911/simulations
data analysis via in silico simulations
data-analysis machine-learning python3
Last synced: 06 Apr 2025
https://github.com/sanjana-bongale/cancer_survival_data_analysis_and_prediction_using_logistic_regression
This project performs data analysis using Python to predict cancer patient survival outcomes. It involves data cleaning, exploratory analysis, and visualizations to explore factors like cancer type, stage, and treatments. A logistic regression model is built to predict patient survival based on demographic and medical data.
data-analysis data-cleaning data-science data-visualization eda jupyter-notebook kaggle logistic-regression machine-learning matplotlib numpy pandas predictive-modeling python scikit-learn seaborn
Last synced: 08 Apr 2026
https://github.com/valna/mercado-play
Mercado Play (streaming service of @mercadolibre) redesign with Astro.
astro data-analysis data-manipulation data-science data-visualization husky javascript justd mercado-play mit-license prettier react rocketseat typescript
Last synced: 08 Apr 2026
https://github.com/rugwiroparfait/alx_sql
This repo is where I save my queries and learning materials in Data Science program from ALX
anaconda data data-analysis jupyter-notebook sql
Last synced: 19 Aug 2025
https://github.com/vara-co/solar-eclipse-2024
Group Project on the 2024 Solar Eclipse's Path over the US with an interactive map and a couple of visualizations on the data gathered.
data-analysis data-visualizations html-css-javascript interactive-map javascript map solar-eclipse
Last synced: 15 May 2026
https://github.com/jofaval/daily-california-births
Data Analysis of the Daily AFAB (Assigned Female At Birth) Births in California, 1959
california data-analysis data-science data-visualization deep-learning google-colab machine-learning python tensorflow timeseries timeseries-analysis
Last synced: 28 Jun 2025
https://github.com/k31ner/inmopipeline
Proyecto integral de análisis y modelado predictivo de datos inmobiliarios, que abarca recolección, transformación, visualización y machine learning utilizando Python y herramientas modernas de ingeniería y ciencia de datos.
data-analysis data-engineering data-science fastapi python streamlit
Last synced: 08 May 2026
https://github.com/anas436/data-science-projects
Explore my diverse collection of projects showcasing machine learning, data analysis, and more. Organized by project, each directory contains code, datasets, documentation, and resources. Dive in to discover insights and techniques in data science. Reach out for collaborations and feedback.
data-analysis data-science machine-learning
Last synced: 27 Mar 2025
https://github.com/namratagulati/fraud_detection
This fulfills all the requirements of a fraud detection model developed on linear regression using feature scaling, engineering and testing model with the help of auc-roc curve and others.
data-analysis data-visualization machine-learning machine-learning-algorithms machinelearning-python
Last synced: 04 Jun 2026
https://github.com/dimits-ts/college_analysis
A statistical study about US college admissions, featuring a full report in LaTeX.
anova data-analysis exploratory-data-analysis linear-regression statistics
Last synced: 25 Jan 2026
https://github.com/aalkiyumi/predicting-hospital-readmission-risk
This project aims to create a predictive model that forecasts the likelihood of a patient being readmitted to the hospital within 30 days of discharge.
big-data-analytics cs5165 data-analysis data-cleaning data-engineering data-science introduction-to-cloud-computing jupyter-notebook machine-learning precipitation-analysis predictive-modeling pyspark statistical-analysis uc uc2026 university-of-cincinnati
Last synced: 11 Oct 2025
https://github.com/onome-joseph/flexisaf
Generative AI & Data Science
data-analysis data-science machine-learning
Last synced: 16 Sep 2025
https://github.com/tushar2704/sql-query
Repository is designed to help you strengthen your SQL query skills by providing a collection of common and interview-based SQL queries for practice.
artificial-intelligence data-analysis data-engineering data-science database database-management database-schema relational-databases sql sql-database sql-query tushar2704
Last synced: 04 Nov 2025
https://github.com/tushar2704/loan-limits-by-country
This project aims to leverage a diverse dataset encompassing economic indicators, demographic factors, and credit history to establish a predictive model. By establishing appropriate loan limits, financial institutions can enhance risk management, ensure responsible lending, and promote financial inclusivity.
artificial-intelligence data-analysis data-science loan project tushar2704
Last synced: 30 Oct 2025
https://github.com/aalkiyumi/project-3-docker-container-for-data-processing-script
This Dockerized Python application analyzes two text files (IF.txt and AlwaysRememberUsThisWay.txt). It counts total words, identifies the largest file, and finds the top three most frequent words in each. Results are saved to an output file and printed to the console.
cs5165 data-analysis data-engineering data-science docker introduction-to-cloud-computing statistical-analysis text-processing uc uc2026 university-of-cincinnati
Last synced: 17 May 2026
https://github.com/lawwrites/uncovering_fintech_user_insights
Linear Regression and K-Means to obtain user behavior insights for a fintech company
data-analysis data-science kmeans-clustering linear-regression python unsupervised-machine-learning
Last synced: 22 May 2026
https://github.com/thesfinox/fit-the-data
Data analysis using Wolfram Mathematica
analysis data data-analysis lab mathematica wolfram wolfram-mathematica
Last synced: 24 Jan 2026
https://github.com/shreshthvashisht/xyz-ads-airing-report_analysis
Ad data analysis using Advanced Excel
ad-airing-analysis advanced-excel data-analysis data-visualization pivot-tables
Last synced: 18 Feb 2026
https://github.com/anjalikumari021/sports_data_analysis_using_excel
Analyzed Sports data and prepared advanced dashboard using MS Excel.
data-analysis data-cleaning excel-dashboard ms-excel pivot-tables reporting
Last synced: 08 Mar 2026
https://github.com/sayamalt/fake-news-classification-using-fine-tuned-bert
Successfully developed a text classification model to predict whether a given news text is fake or not by fine-tuning a pretrained BERT transformed model imported from Hugging Face.
bert-embeddings bert-model data-analysis data-visualization deep-learning fine-tuning-bert model-evaluation model-training-and-evaluation text-classification text-preprocessing text-tokenization tokenizer-nlp wordcloud-visualization
Last synced: 05 Apr 2025
https://github.com/mmfava/significados-aulas-biologia-quasiexp-2019
Repositório das análises realizadas para o paper "Construção de significados em aulas práticas de laboratório de biologia: uma avaliação por delineamento quase-experimental".
Last synced: 28 Jun 2025
https://github.com/williamjardim/analise
A data analysis package made from scratch in JavaScript
computer-science data-analysis data-analytics data-cleaning data-engineering data-science data-structures database-search datascience datasets feature-engineering feature-selection mathematical-functions matrix-search numerical-computation sample-search search-algorithm statistics vector-search
Last synced: 03 Apr 2025
https://github.com/bala-1409/milk-production-time-series-forecasting-datascience-project
This project uses time series forecasting to predict future milk production. The data used in this project is monthly milk production data from January 1962 to December 1975. The ARIMA (autoregressive integrated moving average) model is used to forecast the milk production. The model is evaluated using various metric.
acf adf arima-model data-analysis data-science data-visualization datapreprocessing eda exploratory-data-analysis forecasting machine-learning-algorithms pacf python python3 sarimax-model seasonality seasonality-analysis time-series time-series-forecasting trends
Last synced: 27 Apr 2026
https://github.com/bala-1409/sales-forecasting-datascience-project
Develop a data science project using historical sales data to build a regression model that accurately predicts future sales. Preprocess the dataset, conduct exploratory analysis, select relevant features, and employ regression algorithms for model development. Evaluate model performance, optimize hyperparameters, and provide actionable insights.
data data-analysis data-science data-visualization datacleaning exploratory-data-analysis machine-learning-algorithms modelfitting prediction predictive-analytics predictive-modeling python3 regression-models salesforecast supervised-learning
Last synced: 26 Apr 2026
https://github.com/bala-1409/loan-classification-data-science-projects
This project uses machine learning algorithms to predict the classification of loan status. The dataset is loaded and some transformation is done using SQL for getting a proper dataset with some valid informations.
data data-analysis datacleaning datascience datavisualization exploratory-data-analysis loan machine-learning machine-learning-algorithms modelfitting sql supervised-learning visualization
Last synced: 22 Mar 2025
https://github.com/bala-1409/loan-clustering-datascience-projects
This project uses Machine Learning to Cluster loan together based on their similarities. The project uses a dataeset of loan application which includes information about the Loan amount and Balance. The project then use the clustering algorithm to group the loan together based on the similarities.
clustering clustering-algorithm data-analysis data-science data-visualization kmeans-clustering machine-learning machine-learning-algorithms sql unsupervised-learning unsupervised-machine-learning
Last synced: 22 Mar 2025
https://github.com/bala-1409/rafik-s-kitchen-data-analysis
The Project is about the Analysis of the Sales and Expenses Data of a Famous Fast-food Restaurant. This mainly focuses on gaining Insights that will boost the Future Sales and also Business Strategies it Improve the Profit Margins. Handled Tools are SQL, Python, Power BI, MS Office Tools.
business-analytics business-intelligence data-analysis data-analytics data-visualization eda exploratory-data-analysis ms-office powerbi-report powerpoint-presentations python sql-server
Last synced: 06 May 2026
https://github.com/aleskandro/r-hadoop-madreduce-examples
A lot of examples about using R with hadoop for MapReduce with and without libraries as rhadoop/rhipe - DIEEI@unict.it - Advanced Programming Languages
data-analysis hadoop mapreduce r
Last synced: 04 Nov 2025
https://github.com/beolawork-art/novabank-churn-analysis
NovaBank has noticed that customers are closing accounts or going inactive, and they want to understand why.
data-analysis data-science-projects data-visualization eda machine-learning numpy pandas python scikit-learn sql
Last synced: 08 Apr 2026
https://github.com/ryan-wong1/min-wage-by-state-1968-to-2020-data-analysis
US Minimum Wage by State (1968 - 2020)
data-analysis data-visualization exploratory-data-analysis sql
Last synced: 25 Jan 2026
https://github.com/smohanta23/ev-trendanalytics-24
This Tableau project analyzes EV adoption trends using data up to May 2024. Visualizations cover growth, geography, market share, CAFV eligibility, and consumer preferences, supporting data-driven decisions with detailed drill-downs. Data is meticulously cleaned, offering stakeholders valuable insights into EV market dynamics and trends for future.
business-intelligence data-analysis data-engineering electric-vehicles feature-engineering kpianalysis predictive-analytics tableau trendanalysis
Last synced: 27 Mar 2026
https://github.com/vladstudennikov/diabetes-prediction-app
ML-powered web app built with Laravel and Vue.js to predict diabetes risk based on users' daily habits and behavior
cypress data-analysis diabetes-prediction fastapi inertiajs laravel matplotlib medicine ml pandas php scikit-learn seaborn vuejs
Last synced: 08 Apr 2026
https://github.com/wb-az/sql-for-data-analysis-udacity
This repository contains SQL queries for the SQL for Data Analysis given by Udacity. The queries include commands to define, select, manipulate, control access, aggregate, and join data and data tables.
aggregation data-analysis data-cleaning erd joints postgresql sql subqueries-and-joins window-functions-in-sql
Last synced: 23 May 2026
https://github.com/mjshubham21/ny_yellow_taxi_python_da_project
A data analysis project of New York Yellow Taxi (Feb of 2025) using Python and its libraries for analytics like : NumPy, MatPlotLib, Pandas and Seaborn.
data-analysis jupyter-notebook matplotlib numpy pandas python seaborn
Last synced: 04 May 2026
https://github.com/cyberoctane29/epa-air-quality-aqi-analysis
This project involved analyzing air quality data from the EPA, focusing on the Air Quality Index (AQI). I used Python data structures like dictionaries and sets to manage and process the data, simulating real-world data analysis to assess pollution levels and their health implications.
data-analysis numpy pandas python statistics
Last synced: 10 Apr 2026
https://github.com/yousef-jaber-abdelaziz/electrical-vehicles-data-analysis-project
A full stack Data Engineering f\project from Getting the data to the Data warehousing and then the Dashboard using Power BI
data-analysis data-engineering data-modeling data-visualization data-warehouse data-warehousing fabric microsoft-azure microsoft-fabric-data-engineer powerbi sql-server
Last synced: 23 Jun 2026
https://github.com/esr-style/stylegrid
A free alternative to AG grid built by me for personal use case.
aggrid data-analysis grid pivot-chart pivot-grid table
Last synced: 16 Sep 2025
https://github.com/jpgiant/training_project
Analyzing whether there is a difference between the average death ages of left handers and right handers using Bayesian Conditional Probability Theorem.
bayesian-statistics data-analysis data-visualization numpy pandas-dataframe python
Last synced: 30 Apr 2026
https://github.com/nikashj/pizza-sales-dashboard-analysis
Pizza sales analysis using Power Bi
data data-analysis data-visualization dax-expression excel powerbi
Last synced: 06 Apr 2026
https://github.com/faris771/identify_customer_segments
This project is part of the Palestine Launchpad by Spark, and Udacity with Google. It uses unsupervised learning to identify customer segments for a mail-order company in Germany. The goal is to direct marketing campaigns towards the most promising audiences. The data is provided by Bertelsmann Arvato Analytics.
clustering data-analysis decomposition feature-engineering machine-learning unsupervised-learning
Last synced: 08 Aug 2025
https://github.com/kingflow-23/association-matching
Recherche et Structuration d'Opportunités de Financement pour les Associations
association data-analysis data-engineering excel fondation pyqt5 python webscraping
Last synced: 07 Apr 2025
https://github.com/hatamiarash7/ir-system
IR System for Reuters DB
data-analysis data-mining ir python
Last synced: 29 Mar 2025
https://github.com/mindlessmuse666/eda-explorer
Инструмент на Python для разведочного анализа данных (EDA) и визуализации, поддерживающий загрузку данных CSV и JSON, с модульной архитектурой ООП. Практическая работа по теме: "Обнаружение и визуализация данных для понимания их сущности" дисциплины "МДК 13.01: Основы применения методов искусственного интеллекта в программировании".
csv-visualization data-analysis data-science data-visualization exploratory-data-analysis json-visualization matplotlib oop pandas python seaborn
Last synced: 13 Apr 2026
https://github.com/jedrzej-wydra/competition-cooperation
Competition, cooperation, and parental effects in larval aggregations formed on carrion by communally breeding beetles Necrodes littoralis (Staphylinidae: Silphinae)
data-analysis non-linear-regression r
Last synced: 20 Aug 2025
https://github.com/kailenroa/sleep-efficiency-project
This project focuses on analyzing sleep efficiency using wearable technology data. It explores patterns in sleep behavior and key factors impacting sleep quality. A dashboard was created using phyton and data visualization tools to provide actionable insights and recommendations for improving sleep health.
dashboard data-analysis html phyton sleep-efficiency
Last synced: 06 Jan 2026
https://github.com/dimits-ts/visualization-assignments
Visualizing and analyzing results from the PISA-2018 competitions with regards to Greek performance and gender gap.
data-analysis data-visualization interactive-graphs presentation-slides r-language tableau
Last synced: 06 Nov 2025
https://github.com/jofaval/iris-flowers
Multilabel Classification of the famous Iris Flowers Dataset from Ronald Aylmer Fisher in 1936
classification data-analysis data-science data-visualization google-colab iris-flowers kaggle machine-learning python scikit-learn xgboost
Last synced: 05 Apr 2026
https://github.com/noor188/preswald-data-app
A data app to visualize and manipulate the graduate admission dataset
data-analysis data-visualization open-source
Last synced: 04 Jul 2025
https://github.com/kefilweditse/awesome-matchem-datasets
Awesome-matchem-datasets is a curated collection of high-quality datasets for machine learning and data analysis in the field of chemistry. This repository includes various datasets, ranging from molecular structures to experimental results, suitable for both research and educational purposes.
awesome awesome-dataset awesome-dataset-collection awesome-match-data awesome-matchem data-analysis data-matching dataset dataset-collection dataset-research dataset-samples match match-data match-dataset-analysis match-examples
Last synced: 07 Apr 2025
https://github.com/tejaswirupa/data-analysis-of-departure-delays-at-united-airlines
Explored how weather and time factors influence delays in 58,000+ UA flights. Used permutation testing and visual analytics to show how temperature, visibility, and time of day affect departure punctuality.
Last synced: 25 Jan 2026
https://github.com/leosimoes/datascienceacademy-python-analisededados
Atividades do curso Análise de Dados com Linguagem Python da DataScienceAcademy.
data-analysis data-science jupyter-notebook python sql
Last synced: 29 Apr 2026
https://github.com/hevalhazalkurt/word_analyser
A web app developed in Python and Django that analyzes given text mathematically and sentimentally.
analyzer analyzes content data-analysis django emotion python python3 sentiment sentiment-analyser sentiment-analysis text text-analysis
Last synced: 19 May 2026
https://github.com/abhirajp595/python
Data Science Project using Python
data-analysis data-science data-visualization eda jyputer-notebook numpy pandas statistics
Last synced: 08 May 2026
https://github.com/shz-code/diwali_sales_data_analysis
Customer Product Purchase Behavior Analysis
behavior-analysis data-analysis matplotlib ml sales seaborn
Last synced: 14 Mar 2025
https://github.com/vipulbunny/ml-learning_projects
A collection of machine learning projects implemented in Python, showcasing core concepts like regression, classification, clustering, and model evaluation techniques. Ideal for learners and data science enthusiasts.
classification clustering data-analysis data-science data-visualization decision-trees jupyter-notebook machine-learning model-evaluation random-forest regression supervised-learning unsupervised-learning
Last synced: 23 Jul 2025
https://github.com/wwgolay/hr1099-timelapse-vlbi
The repository for HR1099 timelapse VLBI.
astronomy astrophysics data-analysis website
Last synced: 03 Apr 2025
https://github.com/muthukumar0908/cardekho_used_car_price_prediction
The project aim is to build a machine learning model that offers users to find current valuations for used cars.
data-analysis data-visualization datacleaning eda machine-learning python streamlit
Last synced: 30 Mar 2025
https://github.com/galal-pic/advanced_regression
A project to predict house prices through machine learning different techniques
data-analysis data-science deep-learning feature-engineering flask machine-learning python regression
Last synced: 08 Jul 2025
https://github.com/sivkri/shiny-scatter-plot-app
This repository contains a Shiny app that allows users to create interactive scatter plots by selecting the X and Y axes and customizing the point color. The app utilizes the shiny package in R to provide a user-friendly interface and the ggplot2 package for creating visually appealing plots.
data-analysis data-visualization ggplot2 interactive-web-application r rprogramming scatter-plot shiny
Last synced: 22 Mar 2025
https://github.com/sivkri/rnaseq-analysis-junctionseq-qorts
This repository provides scripts for RNA-Seq data analysis using JunctionSeq and QoRTs, enabling quality control, differential splicing analysis, and generation of browser tracks.
bioinformatics data-analysis differential-splicing genomics junctionseq qorts quality-control rna-seq rna-seq-analysis splice-junctions splice-variants spliced-alignment transcriptomics
Last synced: 22 Mar 2025
https://github.com/habiburrahman-mu/exploratory-data-analysis
Methods to see if certain characteristics or features can be used to predict.
data-analysis data-mining data-science data-visualization
Last synced: 20 Jan 2026
https://github.com/rohitha-tata/bike-sales
This project focuses on data cleaning, transformation, and dashboard creation using a bike buyers dataset. It includes Pivot Tables, slicers, visualizations, and statistical insights to analyze trends based on income, age, occupation, and other key factors. Insights help understand customer behavior, purchasing patterns, and decision-making trends.
data-analysis data-cleaning excel-dashboards interactive-slicers pivot-charts pivot-tables
Last synced: 08 Mar 2026
https://github.com/hasnathjami/data-analysis-of-covid-19
An Oracle PL/SQL-based project on COVID-19 data analysis. It is my CSE 4.1 project of Distributive Database Management System LAB.
data-analysis naive-bayes-classifier oracle-database probability-statistics sqlplus
Last synced: 08 Mar 2026
https://github.com/nimomach/amazon-sales-data
This is a small dataset containing Amazon sales data analysis for few regions.
dashboards data data-analysis data-visualization
Last synced: 08 Mar 2026
https://github.com/cosmoduende/r-earthquakes
Análisis y visualización de datos de actividad sísmica en México con R. Cómo analizar y visualizar la historia sísmica de México con datos del SSN (Servicio Sismológico Nacional)
data-analysis data-analytics data-science dataviz earthquakes r-code r-programming r-studio rstudio sismo sismologia sismos ssn ssnmx terremoto terremotos
Last synced: 24 Jan 2026
https://github.com/tolumie/web-scraping-rest-api-stock-data-operations
Web Scraping, REST API & Stock Data Operations is a data-driven project that explores the power of web scraping, API interactions, and stock market analysis using Python. From extracting stock data and public records to analyzing real-world financial trends, this repository is a one-stop resource for data enthusiasts, traders, and analysts.
api-integration data-analysis data-cleaning data-visualization financial-data python rest-api sql-databases stock-data web-scraping
Last synced: 19 May 2026
https://github.com/mrham17/spotify_streaming_analytics
Project is stable & documentation will be completed soon. Thank you for your understanding and patience.
big-data-analytics data-analysis google-colab music-data r-programming spotify streaming-analytics
Last synced: 24 Jul 2025
https://github.com/puspacempaka/superstore-analysis-with-sql
This repository showcases various data analyses on the popular Superstore dataset using SQL queries. The analyses cover a range of business insights, including sales performance, customer segmentation, and product profitability. Each analysis is documented with the SQL queries used and explanations of the steps involved.
business-intelligence data-analysis sales-analysis sql superstore-dataset
Last synced: 09 Mar 2026
https://github.com/bris0yzbekaye/json-to-excel-converter
This repository provides a tool to convert JSON data to Excel format (.xlsx). It allows you to easily transform structured JSON data into a well-organized spreadsheet for better analysis and visualization.
automation-script automation-tools data-analysis data-converter data-export data-formatting data-tools data-visualization excel excel-automation excel-converter excel-tools json json-exporter json-parser json-processing json-to-csv json-to-excel programming-tools spreadsheet-tools
Last synced: 25 Jul 2025
https://github.com/leandrocollares/street-cherry-trees-in-vancouver
Street cherry trees in Vancouver: an exploratory data analysis
data-analysis data-visualization folium pandas plotly-express
Last synced: 17 Sep 2025
https://github.com/samuelsoaress/predict-future-sales
Machine Learning applied to sales forecast
data-analysis data-mining data-science data-visualization forecasting-models
Last synced: 24 Jul 2025
https://github.com/netesf13d/expt-sequence-analysis
Data processing, analysis and visualization package for atomic physics experiments in the single-atom regime.
cold-atoms data-analysis data-visualization optical-tweezers
Last synced: 24 Jul 2025
https://github.com/matte34/auto-insurance-analysis
Conducted a comprehensive exploratory data analysis (EDA) on an auto insurance dataset that I found from Kaggle. I performed a permutation test and generated data visualizations.
data-analysis data-visualization permutation-test python3 scipy seaborn
Last synced: 06 May 2026
https://github.com/muhammadhussain-2009/stock-price-prediction-using-stacked-lstm
Predicting Google Stock Prices using Deep Learning Techniques.
data-analysis data-science data-visualization deep-learning jupyter-notebook keras lstm-neural-networks machine-learning-algorithms python stock-data stock-price-prediction tensorflow
Last synced: 16 Apr 2026
https://github.com/omr5221/sqlexamples
SQL Example Data Corrections
big-data data-analysis data-correction data-modeling pl-sql sql
Last synced: 16 Feb 2026
https://github.com/swethajoseph/netflix-powerbi-interactive-dashboard
Created an interactive Netflix Power BI dashboard to analyze and visualize Netflix's content library, uncovering trends in content type, genre distribution, and global reach
data-analysis data-visualization interactive-visualizations powerbi powerbi-dashboards powerbi-report
Last synced: 03 Jan 2026
https://github.com/tomy-jr98/air-quality-sql-project
Air pollution analysis using BigQuery and Tableau, with data cleaning, aggregation, and visualization.
air-pollution bigquery data-analysis portfolio sql tableau
Last synced: 25 Jul 2025