Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-23 00:07:29 UTC
https://github.com/pcanadas/weather_scraper
This project automates the collection and processing of historical and forecast weather data. It uses Selenium to extract information from weather websites, processes the data with Pandas, and stores it in clean CSV files. It is ideal for climate analysis, data visualization, or integration into other systems.
beautifulsoup data-analysis pandas python selenium
Last synced: 05 May 2026
https://github.com/akash-47-tank/personalized-e-commerce-review-summarizer
Personalized E-commerce Product Review Summarizer: A Streamlit app that summarizes product reviews (e.g., from a CSV) using T5-small and tailors summaries to user preferences (price, durability, etc.) with NLP and lightweight ML.
data-analysis e-commerce machine-learning nlp personalization portfolio python scikit-learn sentiment-analysis streamlit t5 transformers web-app
Last synced: 05 May 2026
https://github.com/aryar-06/linear-regression
A Python project demonstrating basic linear regression with gradient descent and matrix operations, alongside scikit-learn comparison.
data-analysis data-preprocessing educational-project gradient-descent linear-regression machine-learning python regression-algorithms scikit-learn
Last synced: 05 May 2026
https://github.com/nimbostratos/titanic-survival-prediction
Machine learning project predicting Titanic survival using AdaBoost with feature engineering and hyperparameter optimization
data-analysis data-science data-science-projects kaggle machine-learning machine-learning-models python scikit-learn
Last synced: 05 May 2026
https://github.com/hms75/movie_rating_analysis
A movie rating analysis which identifies trends amongst a dataset of 5000 movies.
data-analysis data-visualization matplotlib-pyplot numpy pandas python
Last synced: 05 May 2026
https://github.com/ryuzen6/bangalore-real-estate-price-prediction
This is a data science project that predicts the cost of real estate in Bangalore. Requirements: Jupyter Notebook (for data cleaning and building the linear regression model with various Python libraries), PyCharm (Python IDE for creating the Python Flask server), and Visual Studio Code (to create the UI with HTML, CSS, and JavaScript).
css3 data-analysis data-science html5 javascript jupyter-notebook machine-learning python3
Last synced: 06 May 2026
https://github.com/shreshthvashisht/instgram-user-analytics
SQL Fundamentals
data data-analysis data-science mysql social-network-analysis
Last synced: 09 Jun 2026
https://github.com/alinababer/covid19-timeseries-cases-and-deaths-forecasting-
This study is based on confirmed cases and deaths collected from Pakistan. We apply AI-based forecasting models such as ARIMA, LSTM, Prophet, and VAR. Results demonstrate the promising potential of time series models in forecasting COVID-19 cases and highlight the superior performance of the time series models compared to the LSTM.
arima covid-19 data-analysis data-science data-visualization fbprophet forecasting lstm rnn time-series var vectorautoregression
Last synced: 19 Jun 2026
https://github.com/abhinav330/customer-behavior-analysis-linear-regression
This repository explores customer behavior data for an NYC clothing company with both a mobile app and website. They want to understand which platform drives higher sales.
data-analysis data-science data-visualization eda exploratory-data-analysis jupyter jupyter-notebook linear-regression machine-learning machine-learning-algorithms machinelearning-python numpy pandas python regression-analysis
Last synced: 06 May 2026
https://github.com/superpandas-ai/superpandas
Adding LLM integration to Pandas library
Last synced: 06 May 2026
https://github.com/marknature/oibsip
AICTE Oasis Infobyte Data Science Internship
data-analysis data-science data-visualization github google-sheets jupyter-notebook linkedin machine-learning project-management python
Last synced: 06 May 2026
https://github.com/edanur-y/variable-analysis-of-banks-ratio-data
Testing variables for multicollinearity, multivariate normality and analyzing outliers and missing values. ⭕SPSS 🔵R
data-analysis log-transformation missing-values-analysis multicollinearity normality-test r spss
Last synced: 10 Jun 2026
https://github.com/korniichuk/pydatan-homework
Python Data Analysis course homework
course data-analysis data-analysis-python python python3
Last synced: 06 May 2026
https://github.com/urbanekda/upwork_dashboard
A data analysis project examining trends and patterns in the data science job market on Upwork. This project analyzes job postings, requirements, and market demands to provide insights into the freelance data science ecosystem.
data-analysis data-science data-science-projects data-visualization freelance jupyter-notebook python streamlit
Last synced: 07 May 2026
https://github.com/suhas-005/jovian-data-analysis-course-assignment
These are my assignments for the Data Analysis: Zero to Pandas course by Jovian.ai.
data-analysis data-analytics numpy pandas python
Last synced: 07 May 2026
https://github.com/helosantosdesousa/analise-dados-titanic
Data analysis with the 'Titanic - Machine Learning from Disaster' dataset
analise-de-dados analise-exploratoria bootcamp bootcamp-project data-analysis data-girls data-science matplotlib numpy pandas python
Last synced: 07 May 2026
https://github.com/bryanhe24/data_analysis_app
A full-stack web application that allows users to upload CSV datasets, analyze the data with statistical summaries and visualizations, and interact with an AI-powered assistant for querying the dataset.
ai data data-analysis data-visualization fullstack-development javascript math python reactjs
Last synced: 07 May 2026
https://github.com/joseph-pabian/life-expectancy-
Statistical analysis of life expectancy in developed vs developing countries using SQL and Python
data-analysis duckdb public-health python sql statistics
Last synced: 07 May 2026
https://github.com/biginformatics/git-basics
Hands-on Git and GitHub lessons for analysts and statisticians
data-analysis git github public-health training
Last synced: 10 Jun 2026
https://github.com/vyjayanthipolapragada/genai_smart_retail_recommendation
GenAI Smart Retail is a recommendation system designed for retail environments. It provides personalized product recommendations to users based on product descriptions using a content-based filtering approach. The system leverages FastAPI for backend integration, allowing users to interact with the recommendation engine via an API. This project aim
content-based-recommendation data-analysis data-science data-visualization fastapi gen-ai instacart-data jupyter-notebook open-ai python3 retail scikitlearn-machine-learning stream
Last synced: 07 May 2026
https://github.com/riborings/python_projects
Python projects and other programming experiences
data-analysis machine-learning project python regression-analysis
Last synced: 08 May 2026
https://github.com/rileynwong/forecasting-coffee-prices
Predict coffee prices in Kenya
data data-analysis data-scraping data-visualization forecasting forecasting-models forecasting-prices jupyter-notebook prophet prophet-model
Last synced: 20 Jun 2026
https://github.com/aidan-zamfir/the-iliad
Data analysis & relationship network for the characters of Homer's Iliad
data data-analysis dataframes networks networkx python selenium spacy webscraping
Last synced: 08 May 2026
https://github.com/devexpress-examples/wpf-pivot-grid-group-date-time-values
This example shows how to group date-time values in Pivot Grid for WPF.
data-analysis dotnet dxpivotgrid pivot-grid pivot-grid-for-wpf wpf
Last synced: 08 May 2026
https://github.com/phanchenh/adventureworkdataset-rfm-analysis-sqlproject
RFM Analysis Using SQL on the AdventureWorks Dataset (2011-2014)
business-analytics business-intelligence data-analysis mssql rfm-analysis sql
Last synced: 10 Jun 2026
https://github.com/shelton-beep/trading-algorithm
A simple trading algorithm for SPY ETF using a moving average crossover strategy. This project analyzes SPY weekly price data, implements a buy/sell algorithm, and tracks performance metrics to evaluate profitability and risk. Ideal for learning algorithmic trading basics and financial data analysis.
data-analysis financial-analysis investment-strategy jupyter-notebook pandas python quantitative-finance technical-analysis time-series-analysis trading-strategies
Last synced: 08 May 2026
https://github.com/dhruwsunita/zomato-data-analysis-project
Zomato data analysis project using Python.
data-analysis data-visualization jupyter-notebook matplotlib numpy pandas-dataframe python
Last synced: 08 May 2026
https://github.com/satvikpraveen/numpymasterpro
A hands-on, production-ready toolkit to master NumPy — from first principles to real-world applications. Includes modular Jupyter notebooks, reusable utility scripts, cheatsheets, and advanced projects like K-Means clustering from scratch.
broadcasting data-analysis data-science data-source data-visualization jupyter-notebook kmeans-clustering linear-algebra machine-learning matrix-algebra numerical-computation numpy numpy-broadcasting numpy-examples numpy-tutorial open-source python scientific-computing standardization vectorization
Last synced: 08 May 2026
https://github.com/deepanshkhurana/udacityproject-prediciting-boston-housing-prices
This is a Udacity Project for the Machine Learning Nanodegree. Here, we are trying to predict Boston Housing Prices using sklearn.
data-analysis data-science machine-learning python scikit-learn udacity
Last synced: 08 May 2026
https://github.com/dcs-training/intro-to-statistics
Intro to Statistics workshop. In this repo, you are going to find the code and files we are going to use for the practical part of the workshop, together with the ppt associated with this training. Go to the readme file
data-analysis data-visualisation data-wrangling r statistics
Last synced: 20 Jun 2026
https://github.com/drod75/burger_king_analysis
A simple analysis of a Burger King dataset.
data-analysis data-visualization jupyter-notebook pandas python seaborn
Last synced: 09 May 2026
https://github.com/rahulsm20/trackbyte
A full-stack web application that helps users keep track of their playlist and provides analytics based on their music taste. Built using React, Node.js, Express.js, MySQL and Bootstrap.
bootstrap data-analysis expressjs mysql nodejs reactjs sql
Last synced: 07 Apr 2026
https://github.com/itsharshparmar/uber_data_analysis
Data analysis on Uber ride data using Python and visualization libraries.
analytics-projects business-analysis colab-notebook data-analysis data-cleaning data-science data-visualization eda exploratory-data-analysis matplotlib pandas python python-project real-world-data seaborn time-series-analysis transportation uber uber-data-analysis
Last synced: 09 May 2026
https://github.com/urmesthamondal/data_analysis_projects
Portfolio of data analysis projects built using Excel, Python, and SQL, with Power BI used for visualization.
data-analysis pivot-tables powerbi python sql sql-server visualisation
Last synced: 09 May 2026
https://github.com/lopez86/rust-mlearn
Machine Learning Tools in Rust
data-analysis data-science machine-learning rust
Last synced: 15 May 2025
https://github.com/dogan-the-analyst/data_analysis_in_the_office
Data analysis with R in the Office.
data-analysis ggplot r theoffice tidyverse
Last synced: 14 Mar 2025
https://github.com/dsrodrigovieira/favoritasales
This repository contains the project developed for the Kaggle challenge "Store Sales - Time Series Forecasting. Use machine learning to predict grocery sales".
data-analysis data-science kaggle-competition machine-learning python telegram-bot xgboost-regression
Last synced: 05 May 2026
https://github.com/shivam5509/power-bi-project
Expert in creating interactive dashboards and reports using Power BI, utilizing 10+ visual tools like cards, slicers, and charts. Skilled in cleaning and transforming large datasets with Power Query Editor. Proficient in advanced DAX functions (SUMX, FILTER, CALCULATE) to derive insights and drive data-driven decisions.
advanced-excel computer-science data-analysis data-mining data-visualization engineering mysql numpy pandas powerbi pyhton3 sql sql-server
Last synced: 11 Apr 2026
https://github.com/shubham200137/cyclistic-case-study
This repository contains a case study for Google's Data Analytics Professional Certificate, focusing on Cyclistic, a fictional bike sharing company in Chicago. The case study aims to drive growth by converting casual riders into members through a marketing strategy.
data-analysis data-visualization numpy-python pandas-python presentation-slides sql tableau
Last synced: 11 Jun 2026
https://github.com/engusseus/warframe-market-set-profit-analyzer
Python tool that analyzes Warframe Market data to find profitable item sets to trade
api data-analysis python trading waframe
Last synced: 23 Mar 2025
https://github.com/ssoehdata/sql_for_data_science_specialization_course
Materials and Certifications from the SQL for DataScience Course
data-analysis data-science database databricks postgresql sql sqlite
Last synced: 10 Apr 2026
https://github.com/stribedi-94/comcast-telecom-consumer-complaints-data-analysis
Comcast Telecom Consumer Complaints Analysis
comcast-telcom-complaints data-analysis data-science data-visualization python
Last synced: 26 May 2026
https://github.com/elakkiya-u/digital-marketing-campaign
A machine learning project to predict whether a customer will convert based on digital marketing campaign data.
campaigns data-analysis deployment digital-marketing machine-learning predictive-modeling python
Last synced: 30 Jun 2025
https://github.com/shimaa83/eda_v2
Automatic EDA library
data-analysis data-science python
Last synced: 20 Apr 2026
https://github.com/diem0n/100daysofdatascience
This repository is a collection of things I do each day as a data scientist hired at a fictional company called Keko Corp.
data-analysis data-engineering data-science data-science-from-scratch data-warehousing machine-learning python
Last synced: 09 Apr 2026
https://github.com/aphp/jupyter-eds-notebooks
jupyter-eds-notebooks provides Docker images with preconfigured Jupyter environments for clinical and health data analysis, tailored for AP‑HP Datalabs and the HELIX platform.
data-analysis data-science data-visualization healthcare lab
Last synced: 13 Jan 2026
https://github.com/neeraj08823/cyclisticbike-case-study-with-r-tableau
Cyclistic Bike-Share Analysis
data-analysis data-cleaning data-visualization r tableau
Last synced: 19 Jun 2025
https://github.com/tenifayo/analysis-of-fordgobike-trip-data
Data Visualization using Ford GoBike Trip Data
data-analysis matplotlib pandas
Last synced: 11 Jul 2025
https://github.com/ved-coder-king/wheat_ai_project
This project, Smart Wheat Farming AI System, was developed as part of the coursework for the Artificial Intelligence program at Esprit School of Engineering.
agriculture data-analysis data-visualization deep-learning image-classification machine-learning object-detection python wheat
Last synced: 15 Apr 2025
https://github.com/sanchittechnogeek/overscripted-analysis
Geolocation and user language extraction analysis from Mozilla Overscripted dataset
analysis data data-analysis mozilla
Last synced: 23 Mar 2025
https://github.com/regmibijay/opencarp-analyzer
Reads Trace Files created by OpenCARP Models and exports data for easy plotting with inbuilt plotter script.
bioinformatics data-analysis opencarp
Last synced: 16 Jan 2026
https://github.com/rodrigojunqueiradev/data-exploration-and-cleaning
Credit Analysis Data: Foundations for Cleaning and Exploration
data-analysis data-engineering data-science data-visualization datascience matplotlib matplotlib-pyplot numpy pandas python python-3 python3
Last synced: 13 Apr 2026
https://github.com/stas1f1/methods-and-models-for-multivariate-data-analysis
Completed tasks for the course on methods of multivariate data analysis, 1st year of master's, FDT ITMO
data-analysis multivariate-analysis python
Last synced: 10 Mar 2026
https://github.com/deva-246/datacleaning-excel-powerqueryeditor
data-analysis data-science excel powerquery
Last synced: 04 Jan 2026
https://github.com/badranalyst/student-tests-data-analysis-application
Python-based analysis of student test scores in math, reading, and writing, examining correlations with parental education, lunch type, and test preparation. Includes data cleaning, visualization, and statistical insights into factors influencing academic performance.
data-analysis data-visualization dataset matplotlib numpy pandas python sklearn
Last synced: 05 May 2026
https://github.com/bhavanachitragar/data-analysis-using-pyspark
Working with the PySpark module in Python in a Google Colab environment to run queries against the dataset. The dataset consists of two CSV files, listening.csv and genre.csv. Query results are visualized using Matplotlib.
data-analysis google-colab pyspark-sql
Last synced: 30 Jun 2025
https://github.com/serlo/data-pipeline-interactive-exercises
processing pipeline for exercise dashboards
Last synced: 26 Feb 2025
https://github.com/mstovarh/analisis-de-bebidas-de-starbucks
This repository contains charts based on various characteristics of Starbucks beverages. I used technologies such as ChatGPT's Data Analysis tool, Excel, and PowerQuery.
chatgpt data-analysis excel powerquery
Last synced: 15 Apr 2025
https://github.com/zxjahid/matplotlib
A comprehensive guide to mastering data visualization with Matplotlib through hands-on examples and advanced techniques. 🚀📊
candlestick candlestick-chart cheatsheet data-analysis data-visualization gtk jupyter-notebook maps matplotlib-python pandas thesis-template tk tutorial wx
Last synced: 09 May 2026
https://github.com/javedali99/machine-learning-hw-solution-notebooks
Machine Learning Homework Solution Notebooks (UCF CAP5610)
data-analysis data-preprocessing data-science decision-trees machine-learning python random-forest recommender-system supervised-learning support-vector-machines titanic-kaggle unsupervised-learning
Last synced: 05 Jan 2026
https://github.com/farhad-here/median-performance-comparison
Benchmarking the performance of median calculation using vanilla Python vs NumPy.
data-analysis matplotlib numpy python
Last synced: 18 Apr 2026
https://github.com/jaseel342/pizza_sales_report
These Pizza Sales dashboards provide valuable insights, including sales trends, pizza category breakdown, size distribution, and top-selling and least-selling pizzas, enabling data-driven decisions to boost sales and business performance.
data-analysis dax-query power-query powerbi sql sql-server-management-studio visualization
Last synced: 05 Jan 2026
https://github.com/jbizzlefoshizzle/crowdfunding-trends-excel
Excel project examining funding trends for Kickstarter projects
category-breakdown data-analysis excel kickstarter kickstarter-campaigns line-graph pivot-charts pivot-tables trends
Last synced: 05 Jan 2026
https://github.com/csoren66/customer-personality-analysis
Predict how different customer segments will respond for a particular product or service.
data-analysis data-visualization python
Last synced: 03 Mar 2025
https://github.com/arianarmw/da01-bike-sharing-analysis
🚴♀️ Data analysis project on bike-sharing systems. Includes data wrangling, exploratory data analysis (EDA), visualization, and interactive dashboards built with Streamlit. Explore patterns in bike usage and rental data!
bike-sharing-analysis data-analysis exploratory-data-analysis python streamlit visualization
Last synced: 11 Apr 2026
https://github.com/anuppm9917/super-store-sales-analysis-power-bi-project
Driven to find out which products, regions, categories, and customer segments a company should target or avoid, I searched for and selected an appropriate Kaggle dataset that matches a standard superstore's requirements.
data data-analysis data-visualization datacleansing excel exploratory-data-analysis jupyter-notebook numpy pandas plotly powerbi python3
Last synced: 10 Apr 2026
https://github.com/lanzafame/polycarp
[WIP] Subset operations on latlon data read from CSVs
Last synced: 12 Jan 2026
https://github.com/khushi-sabarad/adinsights_dashboard
AdInsights Dashboard: An interactive web dashboard built with Python (Flask, Pandas, Plotly) to visualize and analyze digital advertising performance. Allows filtering by gender, ad type, and location for detailed insights
ad-performance advertising dashboard data-analysis data-visualization flask pandas plotly python web-application
Last synced: 01 May 2026
https://github.com/benami171/ml_knn_decision-trees
An ML implementation comparing Decision Trees and k-Nearest Neighbors (k-NN) algorithms for Iris flower classification. Features comprehensive analysis of different approaches including brute-force and entropy-based decision trees, along with k-NN using multiple distance metrics.
classification cross-validation data-analysis decision-trees iris-dataset k-nearest-neighbours machine-learning nearest-neighbors python
Last synced: 30 Jun 2025
https://github.com/first-coding/smart_analysis
Smart Analysis is an AI-powered data analysis tool that leverages large language models (LLMs) to generate SQL queries from natural language prompts. Upload CSV files, explore the data schema, and retrieve insights with ease. The system ensures error correction in SQL queries, delivering detailed reports and visualizations in a streamlined workflow
data-analysis llm openai prompt-engineering python
Last synced: 08 Mar 2025
https://github.com/sun-lab-nbb/sl-shared-assets
A Python library that stores assets shared between multiple Sun (NeuroAI) lab data acquisition and processing repositories.
data-analysis data-collection data-processing experiment sunlab
Last synced: 10 Mar 2026
https://github.com/upes-open/open-cryptocurrency-analysis
A web app to visualise and predict the cryptocurrency’s impact by using Web scraping, data exploration, EDA and Data Visualization.
analysis cryptocurrency data-analysis data-science data-visualization jupyter-notebook streamlite visualization
Last synced: 15 Apr 2025
https://github.com/yaser-123/energy-consumption-dashboard
A Power BI dashboard to analyze energy consumption for water, gas, and electricity across cities and buildings. Features include interactive charts, drill-down insights, and dynamic filters for easy monitoring and optimization.
dashboard data-analysis data-analytics data-visualization energy-consumption energy-efficiency powerbi
Last synced: 05 Jan 2026
https://github.com/akansharajput280799/strategic-analysis-of-retail-brand-in-south-america-using-sql
Leveraged BigQuery and MySQL to analyze 100K records for a retail brand in South America, covering sales optimization, trend identification, and customer satisfaction, and to provide insights and recommendations for growing its user base and improving its services.
bigquery data-analysis data-science database database-schema google-bigquery mysql-server sql
Last synced: 19 May 2026
https://github.com/mizzy/tweetduck
Twitter Archive to DuckDB Importer - Extract and import Twitter archive data (2025 format) into DuckDB for analysis
archive cli data-analysis duckdb golang twitter
Last synced: 02 Sep 2025
https://github.com/hanzopgp/lolanalysis
League Of Legends game data engineering, analysis, visualization and machine learning. Business intelligence project.
data-analysis data-cleaning data-engineering data-visualization dataiku deep-learning etl machine-learning scraping university
Last synced: 27 May 2026
https://github.com/jayqi/data-analysis-tools
Presentation on Data Analysis Tools
data-analysis presentation-slides
Last synced: 06 Jan 2026
https://github.com/masum184e/exploratory_data_analysis_projects
This is a space to showcase my journey in exploring various datasets, uncovering patterns, and extracting meaningful insights. Each project highlights different aspects of EDA, demonstrating techniques and tools that are essential for making sense of data.
data-analysis data-analysis-projects data-science data-science-projects eda eda-projects exploratory-data-analysis exploratory-data-analysis-projects
Last synced: 31 Mar 2025
https://github.com/scailfin/benchmark-templates
Workflow Templates are parameterized workflow specifications for the Reproducible Open Benchmarks for Data Analysis Platform (ROB)
benchmarks data-analysis reproducibility
Last synced: 16 Jan 2026
https://github.com/rcv911/cluster_generation
Generation of cluster test data
cluster cluster-analysis cluster-generation clustering clustering-algorithm clusters data-analysis machine-learning
Last synced: 18 Jan 2026
https://github.com/jiyanshgarg/delhivery-logistics-data-analysis
This project analyzes Delhivery's logistics delivery dataset to understand delivery performance, route efficiency, and operational patterns using data analytics techniques. The analysis focuses on transforming raw segment-level logistics data into meaningful trip-level insights that can help improve delivery efficiency and route planning.
business-insights-and-recommendations data-analysis data-cleaning-and-preprocessing data-visualization exploratory-data-analysis feature-engineering feature-extraction feature-selection hypothesis-testing outlier-detection outlier-treatment
Last synced: 12 Jun 2026
https://github.com/farhannirzhor/vrinda_store_excel_project
This project is about Excel analysis and visualization. In this project, I analyzed Vrinda Store's sales and produced an annual sales report.
data-analysis data-cleaning data-preprocessing data-visualization microsoft-excel reporting
Last synced: 05 Jan 2026
https://github.com/amlanmohanty1/genai-data-analysis-report-generator
Generating data analysis and EDA reports from CSV files using Generative AI - Langchain, Llama, Groq.
ai data-analysis data-science flask generative-ai groq langchain llama3 llm prompt-engineering python
Last synced: 28 Jan 2026
https://github.com/eslamdyab21/data-visualization-using-matplotlib-and-seaborn
This is the last project in the Udacity Nanodegree program. It's about data visualization.
data data-analysis data-visualization matplotlib pandas python seaborn udacity udacity-data-analyst-nanodegree
Last synced: 09 May 2026
https://github.com/kalyan4636/chocos-sales-analysis-report-and-dashboard.-
📊 Built using Power BI, this dashboard delivers actionable insights to boost strategic decision-making.
bussiness-analyst data-analysis data-visualization dataanalyst microsoft-power-bi powerbi
Last synced: 26 Jan 2026
https://github.com/vishal-verma-96/pre-owned-car-price-prediction-using-streamlit-app
Capstone project by Skill Academy: exploratory analysis, visualization, and prediction of used car prices. Deploys the highest-scoring model with a Streamlit web app.
data-analysis data-science jupyter-notebook machine-learning machine-learning-algorithms matplotlib numpy pandas python3 regression-algorithms scikit-learn seaborn streamlit
Last synced: 11 Apr 2026
https://github.com/leandrocollares/home-team-advantage-in-epl
Home team advantage in the English Premier League: an exploratory data analysis
data-analysis matplotlib pandas plotly
Last synced: 11 Jun 2026
https://github.com/luminati-io/target-dataset-samples
A sample dataset of over 1000 Target products, extracted using the Bright Data API, ideal for brand reputation, tracking inventory, and optimizing prices.
api data-analysis data-mining datasets target web-scraper web-scraping
Last synced: 04 Jan 2026
https://github.com/ilovenooodles/probstat-water-potability
Major Assignment for Probability and Statistics 1
csv data-analysis jupyter-notebooks python
Last synced: 03 May 2026
https://github.com/shrikantnaidu/greyatom-projects
GreyAtom Projects.
data-analysis data-science greyatom machine-learning portfolio
Last synced: 24 Jul 2025
https://github.com/devanshsahu47/talentscape-glassdoor-analysis
TalentScape is an end-to-end Python project that cleans and analyzes a comprehensive Glassdoor Jobs dataset. It features robust data wrangling and 20 insightful visualizations to uncover trends in job titles, salary ranges, company ratings, and more—providing actionable recommendations to optimize recruitment and compensation strategies.
business-intelligence data-analysis data-vizualisation jupyter-notebook python3
Last synced: 15 May 2026
https://github.com/shubhammittal-data/hr_dashboard_tableau
An interactive HR Analytics Dashboard built using Tableau. Provides insights into workforce demographics, hiring trends, salary analysis, and employee records for data-driven decision-making.
chatgpt4 data data-analysis data-visualization drawio-tools faker-generator hr-analytics hr-analytics-dashboard human-resources numpy python tableau tableau-public
Last synced: 17 May 2026
https://github.com/shellynagar27/mobile-sales-analysis
Analyzed 2024 mobile sales data to uncover product trends, customer behavior, and regional insights using Power BI dashboards and structured data modeling.
cleaning-data data-analysis data-visualization dax eda figma modelling powerbi powerquery storytelling wireframe
Last synced: 16 May 2025
https://github.com/salma-mamdoh/investigating-netflix-movies-and-guest-stars-in-the-office
My Project to learn the Basics of Analysis & Visualization on DataCamp
data-analysis data-visualization datacamp matplotlib pandas python
Last synced: 11 Apr 2026
https://github.com/dcs-training/pca-2023
PCA workshop. In this repo, you are going to find the code and files we are going to use for the practical part of the workshop, together with the ppt associated with this training
data-analysis data-visualisation data-wrangling r statistics
Last synced: 20 Jun 2026
https://github.com/swapnil-jain/tailored-tomes
Web application that shows the top 50 books of all time and recommends similar books when a book name is provided.
book bookrecommendsystem books bootstrap3 cosine-similarity data-analysis html machine-learning python
Last synced: 20 Jan 2026
https://github.com/agdturner/ccg-data
A modularised Java library for processing data sets with classes for: data records; collections of data records; and identifiers.
Last synced: 12 Jan 2026