Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-30 00:07:38 UTC
https://github.com/soypete/example-go-dataframes-parser
example of https://godoc.org/github.com/kniren/gota/dataframe
data-analysis data-science datastructures golang-examples ml
Last synced: 12 Sep 2025
https://github.com/shrikantnaidu/greyatom-projects
GreyAtom Projects.
data-analysis data-science greyatom machine-learning portfolio
Last synced: 24 Jul 2025
https://github.com/luminati-io/target-dataset-samples
A sample dataset of over 1,000 Target products, extracted using the Bright Data API, suited to monitoring brand reputation, tracking inventory, and optimizing prices.
api data-analysis data-mining datasets target web-scraper web-scraping
Last synced: 04 Jan 2026
https://github.com/mehrab-kalantari/olympics-data-analysis
A Streamlit application for analyzing the Olympics dataset from several views.
data-analysis streamlit-dashboard streamlit-webapp
Last synced: 20 Apr 2026
https://github.com/celineboutinon/la-faim-dans-le-monde
OpenClassrooms Data Analyst 2022-2023 - Project 4
data-analysis data-analytics data-visualisation dataframes matplotlib-pyplot numpy pandas python seaborn
Last synced: 20 Jul 2025
https://github.com/nmelgar/birthday_sports_dataviz
We analyze how the Matthew Effect has influenced professional sports players.
analysis csv data data-analysis data-science data-visualization datavisualization dataviz probability research tableau
Last synced: 08 Jan 2026
https://github.com/scailfin/benchmark-templates
Workflow Templates are parameterized workflow specifications for the Reproducible Open Benchmarks for Data Analysis Platform (ROB)
benchmarks data-analysis reproducibility
Last synced: 16 Jan 2026
https://github.com/iness000/online-retail-customer-segmentation
This project performs comprehensive customer segmentation analysis on an online retail dataset using machine learning clustering techniques and RFM (Recency, Frequency, Monetary) analysis. The goal is to identify distinct customer segments to drive better customer relationship management strategies and business insights.
customer-segmentation data-analysis k-means
Last synced: 31 Aug 2025
https://github.com/akashprak/socialnetworkads
Predicting customer purchase behavior from the Social Network Ads dataset.
data-analysis machine-learning mlflow pandas python scikit-learn seaborn xgboost
Last synced: 30 Mar 2025
https://github.com/jayqi/data-analysis-tools
Presentation on Data Analysis Tools
data-analysis presentation-slides
Last synced: 06 Jan 2026
https://github.com/pranjalya/hand-washing-data-visualisation
A small data visualization project analyzing the effect of hand washing after Dr. Semmelweis introduced it to the nurses and midwives attending births.
data-analysis data-visualization jupyter-notebook pandas python3
Last synced: 06 May 2026
https://github.com/evanwporter/sloth
Faster Pandas Dataframe
cython data-analysis dataframe pandas
Last synced: 14 Mar 2025
https://github.com/virajbhutada/hr-analytics-excel-sql-tableau-powerbi
Explore a comprehensive HR Analytics portfolio showcasing data analysis and visualization skills. Featuring dashboards in Power BI, Excel, and Tableau, along with SQL queries for deeper insights. A holistic view of expertise in HR analytics, data visualization, and database management. Let's dive into the game of data insights!
data-analysis data-management data-visualization excel hr-analytics interactive-dashboards portfolio-project postgresql powerbi powerbi-visuals sql sql-queries tableau tableau-public
Last synced: 02 Aug 2025
https://github.com/lanzafame/polycarp
[WIP] Subset operations on latlon data read from CSVs
Last synced: 12 Jan 2026
https://github.com/satvikpraveen/pandasplayground
📊 A comprehensive pandas mastery project with 10 modular Jupyter notebooks covering data loading, cleaning, grouping, merging, time series, visualization, and performance profiling. Includes real-world workflows, Docker, Streamlit, and reusable utils. Ideal for data scientists and analysts to learn, practice, and refer. Practice-ready and modular.
analytics cheatsheet data-analysis data-cleaning data-pipeline data-science data-visualization docker etl exploratory-data-analysis jupyter-notebook jupyterlab learning-resource memory-profiling open-source pandas performance-tuning python streamlit time-series
Last synced: 10 Apr 2026
https://github.com/barraharrison/spotify-listening-trends
Using EDA to look at song longevity, regional preferences, and streaming behavior in the charts and on Spotify.
data-analysis data-visualization jupyter-notebook kaggle-dataset
Last synced: 03 Feb 2026
https://github.com/idb-devs/dataanalyticsairbnb
Build a price prediction model that lets an ordinary property owner find out how much to charge per night for their property.
data-analysis data-science jupyter python
Last synced: 18 Apr 2026
https://github.com/akmj1011/hill-and-valley-prediction-using-logistic-regression
Created a prediction system using logistic regression to identify hills and valleys in the given datasets.
cloud-computing data-analysis data-manipulation data-preprocessing data-transformation data-visualization google-colab
Last synced: 13 May 2026
https://github.com/ronylpatil/whatsapp-group-chat-analysis
A data analysis project that extracts useful information from our college's official WhatsApp group chat. Extracted features include the most active members of the group, the most active day of the week, the top 10 media contributors in the group, and many more.
data-analysis data-preprocessing data-wrangling feature-engineering
Last synced: 14 Jun 2025
https://github.com/sisolieri/prova_ds_saloocupacio2024
Admission challenge for the Hackató Saló Ocupació by Barcelona Activa.
arima barcelona catboost data-analysis data-visualizations forecasting machine-learning pandas public-funding python scikit-learn time-series xgboost
Last synced: 10 Apr 2026
https://github.com/satyam4229/omnify-dataanalysis
Our assessment of Omnify focused on data-driven strategies to maximize profitability. We identified "Product X" as the most profitable product and recommended leveraging the "Wellness Solutions" keyword category for optimal keyword strategy.
data-analysis data-science data-visualization excel omnify
Last synced: 04 Jan 2026
https://github.com/skysign/dat
A study group for learning data analysis together.
data data-analysis data-science
Last synced: 02 Jan 2026
https://github.com/rahul-jha98/restauranttrends.stats-backend
Application that scrapes the Zomato Dataset and enables the user to visualise the results.
data-analysis data-extraction firebase-storage web-scraping zomato-api
Last synced: 16 Mar 2026
https://github.com/gattupalli-saketh/sentiment-analysis-on-products-
Product reviews sentiment analysis.
data-analysis machine-learning nlp review-analysis sentiment-analysis sentiment-classification
Last synced: 18 Apr 2026
https://github.com/badranalyst/titanic-survival-prediction-full-data-science-project-classification
This project predicts Titanic survivors using classification models. It includes data cleaning, pre-processing, exploratory data analysis (EDA), categorical feature conversion, model building, and evaluation. Python libraries like Pandas, NumPy, Matplotlib, and Seaborn are used to analyze and predict survival outcomes.
classification data-analysis data-science eda exploratory-data-analysis machine-learning matplo matplotlib-pyplot ml model numpy pandas predictive-modeling python seaborn
Last synced: 06 May 2026
https://github.com/azaz9026/data_cleaning
Welcome to the Data Cleaning repository! This collection is dedicated to showcasing techniques and methods for cleaning and preparing datasets for analysis.
data-analysis data-engineering data-structures data-visualization eda feature-engineering machine-learning numpy outliers pandas python seaborn
Last synced: 13 Apr 2026
https://github.com/bibymaths/python_snippets
A collection of Python scripts for bioinformatics data analysis, including tools for transcription counts, nucleotide composition, and protein sequence evaluation.
amino-acid-scoring bioinformatics data-analysis fasta-generation mathematical-evaluation nucleotide-analysis protein-sequence-analysis transcription-counts
Last synced: 29 Jul 2025
https://github.com/nevermendel/revolut-analysis
Python script to analyse Revolut transactions
data-analysis revolut revolut-analysis
Last synced: 12 Apr 2025
https://github.com/asghar-rizvi/youtube-statistics-project
This project analyzes a dataset of global YouTube statistics to uncover insights about YouTube channels, their ranks, and other attributes. The dataset used for this analysis was obtained from Kaggle.
data-analysis data-analysis-python data-science data-science-projects matplotlib numpy pandas pycharm-ide python seaborn
Last synced: 13 Jun 2026
https://github.com/lopez86/rust-mlearn
Machine Learning Tools in Rust
data-analysis data-science machine-learning rust
Last synced: 15 May 2025
https://github.com/dogan-the-analyst/data_analysis_in_the_office
Data analysis with R in the Office.
data-analysis ggplot r theoffice tidyverse
Last synced: 14 Mar 2025
https://github.com/antoniszks/music-category-identifier
A data science and machine learning project that trains a neural network to identify the kind of music it is given. Based on a university project.
ai artificial-intelligence data-analysis data-science jupyter-notebook machine-learning ml notebook python
Last synced: 25 Feb 2025
https://github.com/shivam5509/power-bi-project
Expert in creating interactive dashboards and reports using Power BI, utilizing 10+ visual tools like cards, slicers, and charts. Skilled in cleaning and transforming large datasets with Power Query Editor. Proficient in advanced DAX functions (SUMX, FILTER, CALCULATE) to derive insights and drive data-driven decisions.
advanced-excel computer-science data-analysis data-mining data-visualization engineering mysql numpy pandas powerbi pyhton3 sql sql-server
Last synced: 11 Apr 2026
https://github.com/shubham200137/cyclistic-case-study
This repository contains a case study for Google's Data Analytics Professional Certificate, focusing on Cyclistic, a fictional bike sharing company in Chicago. The case study aims to drive growth by converting casual riders into members through a marketing strategy.
data-analysis data-visualization numpy-python pandas-python presentation-slides sql tableau
Last synced: 11 Jun 2026
https://github.com/kath92/my_data_projects
My data projects.
data-analysis data-vizualisation nlp-machine-learning poewrbi python tableau
Last synced: 23 Mar 2025
https://github.com/zborovskaanna/dou-salary-analysis
Python data analysis project focused on improving data manipulation skills using Pandas
Last synced: 26 Feb 2025
https://github.com/elakkiya-u/digital-marketing-campaign
A machine learning project to predict whether a customer will convert based on digital marketing campaign data.
campaigns data-analysis deployment digital-marketing machine-learning predictive-modeling python
Last synced: 30 Jun 2025
https://github.com/jayita11/healthcare-management-optimization-analysis-and-visualization
This project analyzes healthcare data from 2019 to May 2024, optimizing patient care, resource allocation, and financial management. Insights include billing trends, blood bank management, doctor performance, and medication demand, supported by Excel, interactive Tableau dashboards, and SQL analysis.
data-analysis excel healthcare interactive-dashboards mysql sql tableau-dashboards
Last synced: 23 Mar 2025
https://github.com/diem0n/100daysofdatascience
This repository is a collection of things I do each day as a data scientist hired at a fictional company called Keko Corp.
data-analysis data-engineering data-science data-science-from-scratch data-warehousing machine-learning python
Last synced: 09 Apr 2026
https://github.com/tolumie/loan-approval-prediction
Loan Approval Prediction using Machine Learning | EDA + Decision Tree, Random Forest & Logistic Regression | Automating loan eligibility for Dream Housing Finance by analyzing customer data and predicting loan approvals.
classification credit-risk-analysis data-analysis decision-tree-classifier finance-analytics loan-approval logistic-regression-algorithm machine-learning predictive-modeling-techniques random-forest
Last synced: 30 Jun 2025
https://github.com/bationoa/how_does_a_bike_share_navigate_speedy_success
Bike renting case study
analytics business-intelligence cleaning-data data-analysis data-collection data-visualization r
Last synced: 26 May 2026
https://github.com/dug22/jjournal
Jupyter-like notebook software for Java
data data-analysis data-science java jshell jshell-repl notebook swing swing-application
Last synced: 11 Apr 2026
https://github.com/neha-adnani/sql_music-store-analysis
SQL-based data analysis of a digital music store's sales and customer data.
business-analysis data data-analysis database follow-along-projects pgadmin4 portfolio-project postgres queries sql
Last synced: 18 Jun 2025
https://github.com/danpoynor/python-number-guessing-game-with-stats
A number guessing game written in Python 3 that presents median, mode, and mean statistics
console-game data-analysis number-guessing-game python3 statistics
Last synced: 26 May 2026
https://github.com/regmibijay/opencarp-analyzer
Reads Trace Files created by OpenCARP Models and exports data for easy plotting with inbuilt plotter script.
bioinformatics data-analysis opencarp
Last synced: 16 Jan 2026
https://github.com/felinjob/ibm-applied-data-science-capstone
This project, part of the IBM Data Science Professional Certificate specialization, predicts the landing success of SpaceX's Falcon 9. Using data from the SpaceX API and web scraping, the project includes data analysis and machine learning to generate insights about the launches.
data-analysis data-science data-visualization ibm jupyter-notebook machine-learning numpy pandas python scikit-learn seaborn sql
Last synced: 11 Apr 2026
https://github.com/rock12231/weather-analysis-backend
Weather analysis, visualization & Data science
data-analysis data-science data-visualisation django-rest-framework jyputer-notebook prediction python
Last synced: 15 Mar 2025
https://github.com/stas1f1/methods-and-models-for-multivariate-data-analysis
Completed tasks for the course on methods of multivariate data analysis, 1st year of master's, FDT ITMO
data-analysis multivariate-analysis python
Last synced: 10 Mar 2026
https://github.com/deva-246/datacleaning-excel-powerqueryeditor
data-analysis data-science excel powerquery
Last synced: 04 Jan 2026
https://github.com/amanyadav-07/customer-churn-prediction
Machine Learning project to predict customer churn using Logistic Regression, Random Forest, and XGBoost. Includes data preprocessing, feature engineering, SMOTE balancing, model training, evaluation, and business insights.
accuracy-metrics data-analysis data-visualization logistic-regression machine-learning matplotlib numpy pandas python3 random-forest-classifier seaborn sklearn xgboost-classifier
Last synced: 11 Apr 2026
https://github.com/mudassir-a/vendor-performance-analysis
Vendor performance data analysis project using SQL, Python, and Power BI
data-analysis powerbi python sql
Last synced: 18 May 2026
https://github.com/bhavanachitragar/data-analysis-using-pyspark
Uses the PySpark module in Python within a Google Colab environment to run queries against a dataset. The dataset consists of two CSV files, listening.csv and genre.csv. Query results are visualized with Matplotlib.
data-analysis google-colab pyspark-sql
Last synced: 30 Jun 2025
https://github.com/zulfachafidz/titanic_explorer_predicting_survival_with_classification_using_knn_algorithm
Tracking Life Safety with the KNN Predictive Analysis Approach. Leveraging the Titanic Dataset, we apply classification analysis to predict the fate of passengers based on a variety of features.
algorithm algorithms data data-analysis data-mining data-science datamodeling datapreprocessing dataset knn-algorithm knn-classification machine-learning machine-learning-algorithms prediction-model
Last synced: 01 Sep 2025
https://github.com/27ahmad/heart-disease-diagnostic-eda
This project conducts exploratory data analysis on a heart disease diagnostic dataset, aiming to derive valuable insights from the analysis.
data-analysis data-visualization pandas python
Last synced: 06 May 2026
https://github.com/andersoncrs/arboles_de_decision_calidad_del_vino
Contains a detailed analysis of wine quality using a classification model based on decision trees. Includes data exploration, outlier detection and handling, univariate and bivariate analysis, and the creation and evaluation of a predictive model. The main goal is to predict wine quality.
data-analysis data-science data-visualization machine-learning matplotlib seaborn sklearn tree-decision
Last synced: 20 May 2026
https://github.com/dpbm/diabetes-analysis
Simple diabetes analysis with Python
analysis data-analysis data-science data-science-projects data-set diabetes-detection diabetes-prediction machine-learning pandas python
Last synced: 11 Apr 2026
https://github.com/rupeshtr78/machine_learning
Machine Learning TensorFlow Neural Networks Deep Learning
classification data-analysis deep-learning deep-neural-networks flink jupyter-notebook keras machine-learning machinelearning-python perceptron python3 spark tensorflow
Last synced: 11 Apr 2026
https://github.com/omnipotence-eth/manufacturing-quality-analytics
SQL + Python pipeline for semiconductor NCR analysis — supplier performance, defect Pareto, yield trends
analytics data-analysis etl manufacturing matplotlib pandas postgresql python quality sql
Last synced: 11 Apr 2026
https://github.com/jaseel342/pizza_sales_report
These pizza sales dashboards provide valuable insights, including sales trends, pizza category breakdown, size distribution, and top-selling and least-selling pizzas, enabling data-driven decisions to boost sales and business performance.
data-analysis dax-query power-query powerbi sql sql-server-management-studio visualization
Last synced: 05 Jan 2026
https://github.com/jbizzlefoshizzle/crowdfunding-trends-excel
Excel project examining funding trends for Kickstarter projects
category-breakdown data-analysis excel kickstarter kickstarter-campaigns line-graph pivot-charts pivot-tables trends
Last synced: 05 Jan 2026
https://github.com/csoren66/customer-personality-analysis
Predict how different customer segments will respond to a particular product or service.
data-analysis data-visualization python
Last synced: 03 Mar 2025
https://github.com/siddharthbadal/kpmgdataanalysisproject
Data Analytics Consulting Virtual Internship
data-analysis data-cleaning data-visualization googlestudio msexcel powerpoint
Last synced: 05 Jan 2026
https://github.com/arianarmw/da01-bike-sharing-analysis
🚴♀️ Data analysis project on bike-sharing systems. Includes data wrangling, exploratory data analysis (EDA), visualization, and interactive dashboards built with Streamlit. Explore patterns in bike usage and rental data!
bike-sharing-analysis data-analysis exploratory-data-analysis python streamlit visualization
Last synced: 11 Apr 2026
https://github.com/mohit01chugh/edu_sql_analysis
SQL queries used to analyze student data.
data-analysis database education plpgsql postgresql sql
Last synced: 17 May 2026
https://github.com/chanmeng666/douban-review-scraper
【One star = One happy developer doing a little dance 💃⭐️】A robust Python scraper for collecting and analyzing movie reviews from Douban.com, featuring comprehensive data processing and analysis capabilities.
beautifulsoup4 data-analysis data-processing douban movie-reviews pandas python sentiment-analysis text-mining web-scraping
Last synced: 02 May 2026
https://github.com/khushi-sabarad/adinsights_dashboard
AdInsights Dashboard: An interactive web dashboard built with Python (Flask, Pandas, Plotly) to visualize and analyze digital advertising performance. Allows filtering by gender, ad type, and location for detailed insights
ad-performance advertising dashboard data-analysis data-visualization flask pandas plotly python web-application
Last synced: 01 May 2026
https://github.com/vedantshi/coffee-sales-dashboard
This project analyzes coffee sales data using Excel, featuring data cleaning, trend analysis, and an interactive dashboard. Key insights highlight top-performing products, regional sales trends, and seasonal patterns. Recommendations focus on marketing strategies and inventory optimization. Future plans include Power BI integration for visuals.
business-insights data-analysis data-visualization excel-dashboard pivot-tables sales-trends
Last synced: 05 Jan 2026
https://github.com/bhaskaracharjee/student-results-analysis
Analyzing student results to uncover insights
Last synced: 16 May 2025
https://github.com/ayushsiloiya619/spotify-song-analysis
Data Analytics with Python
data-analysis matplotlib-pyplot pandas-dataframe python3 seaborn
Last synced: 08 May 2026
https://github.com/upes-open/open-cryptocurrency-analysis
A web app to visualise and predict cryptocurrency's impact using web scraping, data exploration, EDA, and data visualization.
analysis cryptocurrency data-analysis data-science data-visualization jupyter-notebook streamlite visualization
Last synced: 15 Apr 2025
https://github.com/yaser-123/energy-consumption-dashboard
A Power BI dashboard to analyze energy consumption for water, gas, and electricity across cities and buildings. Features include interactive charts, drill-down insights, and dynamic filters for easy monitoring and optimization.
dashboard data-analysis data-analytics data-visualization energy-consumption energy-efficiency powerbi
Last synced: 05 Jan 2026
https://github.com/akansharajput280799/strategic-analysis-of-retail-brand-in-south-america-using-sql
Leveraged BigQuery and MySQL to analyze 100K records for sales optimization, trend identification, and customer satisfaction for a retail brand in South America, and to provide insights and recommendations for growing its user base and improving its services.
bigquery data-analysis data-science database database-schema google-bigquery mysql-server sql
Last synced: 19 May 2026
https://github.com/lvmalware/lsm-module
A simple statistics module, which provides 4 basic types of regressions using the Least Squares Method (LSM)
data-analysis least-square-regression regression regression-analysis statistics
Last synced: 31 Mar 2025
https://github.com/mizzy/tweetduck
Twitter Archive to DuckDB Importer - Extract and import Twitter archive data (2025 format) into DuckDB for analysis
archive cli data-analysis duckdb golang twitter
Last synced: 28 Jun 2026
https://github.com/dacrol/filterdataset
Filters a dataset based on attributes
data-analysis dataset deep-learning machine-learning python python3
Last synced: 25 Jul 2025
https://github.com/masum184e/exploratory_data_analysis_projects
This is a space to showcase my journey in exploring various datasets, uncovering patterns, and extracting meaningful insights. Each project highlights different aspects of EDA, demonstrating techniques and tools that are essential for making sense of data.
data-analysis data-analysis-projects data-science data-science-projects eda eda-projects exploratory-data-analysis exploratory-data-analysis-projects
Last synced: 31 Mar 2025
https://github.com/rcv911/cluster_generation
Generation of cluster test data
cluster cluster-analysis cluster-generation clustering clustering-algorithm clusters data-analysis machine-learning
Last synced: 18 Jan 2026
https://github.com/totonga/ods-exd-api-box
Helper package to build ASAM ODS EXD API grpc plugins.
asam data-analysis grpc grpc-server ods plugin python
Last synced: 03 Feb 2026
https://github.com/jiyanshgarg/delhivery-logistics-data-analysis
This project analyzes Delhivery's logistics delivery dataset to understand delivery performance, route efficiency, and operational patterns using data analytics techniques. The analysis focuses on transforming raw segment-level logistics data into meaningful trip-level insights that can help improve delivery efficiency and route planning.
business-insights-and-recommendations data-analysis data-cleaning-and-preprocessing data-visualization exploratory-data-analysis feature-engineering feature-extraction feature-selection hypothesis-testing outlier-detection outlier-treatment
Last synced: 12 Jun 2026
https://github.com/farhannirzhor/vrinda_store_excel_project
This project covers Excel analysis and visualization. I analyzed Vrinda Store's sales and produced an annual sales report.
data-analysis data-cleaning data-preprocessing data-visualization microsoft-excel reporting
Last synced: 05 Jan 2026
https://github.com/amlanmohanty1/genai-data-analysis-report-generator
Generating data analysis and EDA reports from CSV files using Generative AI - Langchain, Llama, Groq.
ai data-analysis data-science flask generative-ai groq langchain llama3 llm prompt-engineering python
Last synced: 28 Jan 2026
https://github.com/eslamdyab21/data-visualization-using-matplotlib-and-seaborn
This is the last project in the Udacity Nanodegree program. It's about data visualization.
data data-analysis data-visualization matplotlib pandas python seaborn udacity udacity-data-analyst-nanodegree
Last synced: 09 May 2026
https://github.com/manumoolimani/data-analysis
Data Analysis Projects
data-analysis data-visualization excel
Last synced: 21 Feb 2026
https://github.com/tolumie/exploratory-data-analytics-projects
Exploratory Data Analytics – A collection of projects covering data exploration, feature engineering, hypothesis testing, and predictive modeling across diverse datasets, including insurance, real estate, laptops, cars, COVID-19, and the Olympics.
data-analysis data-visualization data-wrangling exploratory-data-analysis-eda feature-engineering hypothesis-testing machine-learning matplotlib numpy pandas predictive-modeling python seaborn statistical-analysis
Last synced: 11 Apr 2026
https://github.com/vanshuchaudhary/retail-sale
This project uses MySQL to analyze retail sales data, focusing on customer behavior, sales trends, and product performance. The dataset includes transactions, customer demographics, and purchase details, helping businesses optimize strategies. Key Insights: 📊 Revenue Analysis – Total sales, top-spending customers 📅 Sales Trends
business-intelligence customer-behavior customer-behavior-analysis data-analysis mysql predictive-analytics retail-analytics sales-analysis sql-queries
Last synced: 23 Mar 2025
https://github.com/vishal-verma-96/pre-owned-car-price-prediction-using-streamlit-app
Capstone project by Skill Academy: exploratory analysis, visualization, and prediction of used car prices. Deploys the highest-scoring model with a Streamlit web app.
data-analysis data-science jupyter-notebook machine-learning machine-learning-algorithms matplotlib numpy pandas python3 regression-algorithms scikit-learn seaborn streamlit
Last synced: 11 Apr 2026
https://github.com/thenorthkun/movies-dataset-analysis
Analysis & categorizing of Movies based on Actors, Genres, Gross covered etc 🦸🏼🧜🏼♀️🎧
data-analysis data-visualization filtering
Last synced: 23 Mar 2025
https://github.com/chardyb/prob-and-stats-bmi6106
A repository for Spring 2025 BMI 6106: Statistics and Probability. This repository contains coursework, code examples, and projects exploring statistical methods and probabilistic models in biomedical informatics.
biomedical-informatics data-analysis data-science probability r statistical-modeling
Last synced: 02 Sep 2025
https://github.com/chitranjan806/predicting-on-time-premium-deposits
A predictive analysis project to predict the success rate of on-time premium deposits by policyholders.
analytics-vidhya analytics-vidhya-competition catboostregressor data-analysis data-science linear-regression logistic-regression python3
Last synced: 16 May 2026
https://github.com/rosanafss/r-journey
Diving into the wonderful sea of data
Last synced: 19 Nov 2025
https://github.com/salma-mamdoh/investigating-netflix-movies-and-guest-stars-in-the-office
My Project to learn the Basics of Analysis & Visualization on DataCamp
data-analysis data-visualization datacamp matplotlib pandas python
Last synced: 11 Apr 2026
https://github.com/codewithmayank-py/covid19-data-analysis-using-python
COVID-19 and Happiness Analysis
data-analysis data-analysis-python data-visualization dataset jupyter-notebooks numpy pandas python3 seaborn
Last synced: 11 Apr 2026
https://github.com/swapnil-jain/tailored-tomes
A web application that shows the top 50 books of all time and recommends similar books when a book name is provided.
book bookrecommendsystem books bootstrap3 cosine-similarity data-analysis html machine-learning python
Last synced: 20 Jan 2026
https://github.com/erickchacon/day2day
Functions that can be useful in day-to-day data analysis. It includes functions to find paths for projects, summarize databases inside a folder, and so on.
data-analysis exploratory-data-analysis simulation spatial-analysis
Last synced: 02 Sep 2025
https://github.com/aya-jafar/python
Practice files & exercises from the journey of learning Python 🐍
Last synced: 16 May 2025
https://github.com/agrdatasci/climmob-analysis
Workflow for data analysis applied on ClimMob.net
citizen-science data-analysis workflow
Last synced: 24 Jun 2025
https://github.com/mylena13s/dio-python-data-analytics-bootcamp
bootcamp-project data-analysis data-science learning python
Last synced: 15 Mar 2025