Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-23 00:07:29 UTC
- JSON Representation
https://github.com/jihoonerd/national_health_insurance_sharing_service_project
국민건강보험 데이터를 활용한 EDA
data-analysis exploratory-data-analysis health insurance
Last synced: 18 Jul 2025
https://github.com/teja-1403/forage-tata-data-visualisation-empowering-business-with-effective-insights
This repository contains solutions to the 4 different tasks that must be performed during the Data Visualisation: Empowering Business with Effective Insights virtual internship provided by TATA via Forage.
analysis-and-reporting analytics analytics-and-decision-science charts communications dashboards data-analysis data-cleanup data-interpretation data-storytelling data-visualizations graph insights power-bi visual-basic visualizations
Last synced: 18 Feb 2026
https://github.com/daniel-jcvv/daniel-jcvv
👨💻 Data Engineer | 3+ years enterprise experience with Telcel & Citi Banamex Develop ETL pipelines, data governance, and cloud solutions. Building scalable data architectures and automated workflows for Fortune 500 clients. Tech Stack: Python, SQL Server, Oracle, Apache Airflow, PySpark
agentic-ai apache-airflow apache-kafka apache-spark automation business-intelligence citi-bank-apis data-analysis data-engineering data-lake data-warehouse etl-pipeline medallion-architecture mlops n8n-workflow python rag sql-server
Last synced: 15 Apr 2026
https://github.com/stkisengese/numpy-data-fundamentals
A comprehensive collection of NumPy exercises covering array manipulation, slicing, broadcasting, random data generation, and real-world data analysis applications.
data data-analysis numpy pre-processing
Last synced: 16 May 2026
https://github.com/bibymaths/python_snippets
A collection of Python scripts for bioinformatics data analysis, including tools for transcription counts, nucleotide composition, and protein sequence evaluation.
amino-acid-scoring bioinformatics data-analysis fasta-generation mathematical-evaluation nucleotide-analysis protein-sequence-analysis transcription-counts
Last synced: 29 Jul 2025
https://github.com/mvharsh/blinkit-sales-dashboard
An interactive Power BI dashboard visualizing Blinkit's sales performance across outlets, item types, and customer ratings for strategic insights.
blinkitdashboard data-analysis data-visualization powerbi
Last synced: 25 Jan 2026
https://github.com/shafaq-aslam/data-gathering
A hands on collection of notebooks exploring multiple techniques of data gathering, from reading CSV, Excel, JSON, and SQL files to exporting data in various formats and fetching real time data through APIs. This repository documents my complete learning journey of data ingestion, preparation, and extraction for data analysis workflows.
api data-analysis data-export data-gathering data-import data-science jupyter-notebook machine-learning pandas python python3
Last synced: 21 May 2026
https://github.com/carlosvinimsouza/jupyter-notebook-basic
Armazenado todos os trabalhos referentes a Ciência de Dados.
data-analysis data-science programas-jupyter-notebook python
Last synced: 11 May 2026
https://github.com/as16082023/global-electronics-retailer
Analyzed Maven Electronics' performance data to identify factors driving revenue decline since 2020.
advanced-excel data-analysis data-visualization
Last synced: 03 Feb 2026
https://github.com/anonymo2239/big-data-churn-analyzer
Scalable customer churn prediction using PySpark. Includes EDA, feature engineering, modeling, and real-time inference on new data.
big-data churn-analysis churn-prediction classification-algorithm data-analysis data-science data-visualization modeling pyspark
Last synced: 21 May 2026
https://github.com/mboula/mboula.github.io
GitHub portfolio + interactive resume | Showcasing data projects in civil rights (housing), cannabis, and analytics
cannabis case-study civil-rights compliance dashboards data-analysis data-cleaning data-vizualization excel google-data-analytics housing open-data pattern-analysis portfolio pro-se public-data r sql tableau
Last synced: 10 Jul 2025
https://github.com/clchinkc/zombie
Personal project, Python, NumPy, Matplotlib, Pygame, Scikit-learn, TensorFlow, Docker
algorithms data-analysis docker machine-learning matplotlib numpy pygame python sklearn tensorflow zombie-simulation
Last synced: 05 Apr 2026
https://github.com/zen204/accenture-tech-news-summarization-engine
A tool developed to analyze knowledge graphs from technology news articles, uncovering insights and trends about technology products, platforms, services, and their industry impact. Built during an internship at Accenture to inform decision-making in the tech landscape.
data-analysis decision-making graph-visualization industry-insights jupyter-notebook knowledge-graph machine-learning python tech-news tech-trends
Last synced: 29 Apr 2026
https://github.com/gui-sitton/bank-loans
In this project I will prepare a report for a bank's loan division. I find out whether a customer's marital status and number of children have an impact on loan default, as well as other factors
data data-analysis data-analysis-python data-science data-visualization python
Last synced: 21 May 2026
https://github.com/abhipatel35/moviematcher-movie-recommender-system
A robust movie recommendation system using the MovieLens dataset, employing Collaborative Filtering, Matrix Factorization, and Hybrid Models to enhance recommendation accuracy and diversity.
collaborative-filtering content-based-filtering data-analysis eda hybrid-models machine-learning matrix-factorization movie-recommendations movielens-dataset python recommender-system surprise-library
Last synced: 21 May 2026
https://github.com/cyberoctane29/noaa-lightning-analysis
This project explores lightning strike data from the National Oceanic and Atmospheric Administration (NOAA) to identify seasonal trends and analyze strike frequency across months. It demonstrates data manipulation, aggregation, and visualization using Python, providing insights into lightning activity patterns.
data-analysis data-science data-visualization eda python
Last synced: 20 Apr 2026
https://github.com/katarinatmb/serbia-protest-analysis
This project analyzes the frequency, regional distribution, and group characteristics of protests that emerged across Serbia following the fatal collapse of the Novi Sad train station roof in November 2024. The analysis explores how different communities responded in the aftermath of the disaster, using data visualization in RStudio
data-analysis data-visualization r r-mark rstudio
Last synced: 10 Jul 2025
https://github.com/colindean/allegheny_voter_reg_analysis
Allegheny County Voter Registration Analysis Tools
data-analysis data-science elections pandas polars python voting
Last synced: 16 May 2026
https://github.com/gaurav-van/data-analysis-projects
Collections of Projects that involves Data Analysis and Informed Decision Making
data-analysis database powerbi sql
Last synced: 06 Sep 2025
https://github.com/tapas-gope/pizza-sales
This project analyzes Pizza Sales Data to provide insights into customer preferences and sales performance. Key metrics include total revenue, orders, and average order value, with a breakdown by pizza category and size. The dashboard identifies peak sales periods and top-selling items, supporting data-driven business decisions.
business-intelligence dashboard data-analysis data-visualization dax powerbi sales-analysis
Last synced: 02 Jan 2026
https://github.com/kaushik-puttaswamy/food-delivery-time-prediction-using-machine-learning
The Food Delivery Time Prediction Model estimates delivery times using regression algorithms, with XGBoost as the best performer, and is deployed as a real-time application via Streamlit.
data-analysis data-science delivery food-delivery geolocation machine-learning modeldeployment predictive-modeling python realtimeproject regression-models streamlit xgboost
Last synced: 16 Apr 2026
https://github.com/ManuMoolimani/Data-Analysis
Data Analysis Projects
data-analysis data-visualization excel
Last synced: 10 Jul 2025
https://github.com/shivani8136/bellabeat-smart-device-data-analysis
This project analyzes smart device fitness data to uncover insights into user behavior, engagement, and wellness patterns. Conducted for Bellabeat, a high-tech company specializing in health-focused smart products for women, this analysis supports strategic decisions around product development and feature prioritization.
data-analysis data-visualization r-programming-language
Last synced: 08 Feb 2026
https://github.com/lotfiferaga/sig_explore
3d-graphics api data-analysis data-visualization openstreetmap python
Last synced: 06 Mar 2026
https://github.com/asghar-rizvi/youtube-statistics-project
This project analyzes a dataset of global YouTube statistics to uncover insights about YouTube channels, their ranks, and other attributes. The dataset used for this analysis was obtained from Kaggle.
data-analysis data-analysis-python data-science data-science-projects matplotlib numpy pandas pycharm-ide python seaborn
Last synced: 13 Jun 2026
https://github.com/engraulleite/local-data-warehousing-with-docker
Creating a DW from 0 to hero. Starting with logical and physical modeling to valuable reports.
airbyte data-analysis datawarehouse docker etl-pipeline metabase pgadmin4 postgresql
Last synced: 01 May 2026
https://github.com/pooja-manjunatha/nyc_parking_violations_dbt
This project uses dbt to transform NYC parking violations data through a layered architecture: Bronze: Raw ingested data Silver: Cleaned and enriched data Gold: Aggregated tables for analytics Using DuckDB as the warehouse backend, it ensures data quality with tests and documentation. The project enables reliable analysis of parking violations
data data-analysis data-engineering dbt duckdb python sql
Last synced: 14 May 2026
https://github.com/touradbaba/multi-page_dash_application
This repository contains a Multi-Page Dash Application designed to provide interactive visualizations of geo-spatial data, focusing on population and GDP. The app offers insights into demographic and economic trends through interactive maps and various types of charts. It is built with Python, using Plotly and Dash, and is deployed on Heroku.
dash dashboard data-analysis data-visualization exploratory-data-analysis heroku-deployment plotly pythonanywhere
Last synced: 27 Jul 2025
https://github.com/balajimohan18/power-bi-visualization-project
This repository contains Visualization Projects which is visualized through Power BI Software, by using the visualization we can gain multiple insights and strategies which helps to develop the business for gaining high profit margins and by the insights we can reduce damages by accidents & calamities.
data-analysis data-cleaning data-science data-visualization exploratory-data-analysis microsoft-excel microsoft-power-bi microsoft-powerpoint powerbi powerbi-visuals powerpoint-slides
Last synced: 08 Mar 2026
https://github.com/datalopes1/fifa21_datacleaning
Neste projeto será feito o processo de limpeza e manipulação a partir do dataset FIFA 21 messy, raw dataset for cleaning/ exploring, que pode ser encontrado no Kaggle, com licensa CC0: Public Domain e enviado por Rachit Toshniwal.
data-analysis data-cleaning python
Last synced: 30 Apr 2026
https://github.com/kheriberto/logistic_regression_project
A project that analyses dummie data from an advertising company using logistic regression
data-analysis logistic-regression pandas python scikit-learn seaborn
Last synced: 08 Apr 2026
https://github.com/cnoret/retail-data-analysis
Let's analyze historical sales data from a large retail chain and predict weekly sales using machine learning on a Streamlit web app
data-analysis data-analyst data-science data-vizualisation pandas python streamlit streamlit-webapp
Last synced: 10 Apr 2026
https://github.com/kunalkumar2001/data-analytics-python-project
Data Analyst Python Project for Portfolio
data-analysis data-anaytics matplotlib numpy pandas python seaborn
Last synced: 19 Apr 2026
https://github.com/hayatiyrtgl/cryptocurrency_time_series_rnn
Python script for training a Simple RNN model on cryptocurrency price data to predict future prices, including data exploration and evaluation
data-analysis data-science data-visualization keras pandas pandas-python prediction predictive-modeling python python-script rnn rnn-tensorflow tensorflow time-series time-series-analysis
Last synced: 08 Apr 2026
https://github.com/martachesnova/sql
Performing data modeling (ERD) and data engineering. Then, writing series of SQL queries to analyze Employee Database of a company.
data-analysis data-engineering data-modeling erd postgresql sql
Last synced: 16 May 2026
https://github.com/l0rd-inquisit0r/data-analytics
A repository of data analytics implementations in Python
ai data-analysis data-analysis-python data-analytics
Last synced: 18 Jun 2025
https://github.com/rupeshtr78/machine_learning
Machine Learning TensorFlow Neural Networks Deep Learning
classification data-analysis deep-learning deep-neural-networks flink jupyter-notebook keras machine-learning machinelearning-python perceptron python3 spark tensorflow
Last synced: 11 Apr 2026
https://github.com/ejw-data/tableau-drug-study
Brief analysis of drug treatments that were also analyzed with pandas
Last synced: 02 Jan 2026
https://github.com/saidulalimallick04/smart-traffic-violation-pattern-detector-dashboard
This project is a Streamlit web application designed to analyze traffic violation data. It provides a user-friendly interface to explore, visualize, and gain insights from traffic violation datasets. Users can upload their own data, perform analysis, and view summaries and trends.
dashboard data-analysis data-visualization internship-project pandas python smart-traffic streamlit
Last synced: 18 Apr 2026
https://github.com/gkn-tech/brisecheck_website
Web Crawler, Visualizations and Game
choropleth-map contact-form data-analysis data-visualization game-development pygame python-flask scatter-plot web-crawler web-scraping
Last synced: 25 Feb 2025
https://github.com/jofaval/iris-flowers
Multilabel Classification of the famous Iris Flowers Dataset from Ronald Aylmer Fisher in 1936
classification data-analysis data-science data-visualization google-colab iris-flowers kaggle machine-learning python scikit-learn xgboost
Last synced: 05 Apr 2026
https://github.com/ssoehdata/sql_for_data_science_specialization_course
Materials and Certifications from the SQL for DataScience Course
data-analysis data-science database databricks postgresql sql sqlite
Last synced: 10 Apr 2026
https://github.com/faizantkhan/automated-eda
This repository showcases tools for automatic Exploratory Data Analysis (EDA) in Python. These tools help you quickly understand your datasets and generate insightful reports.
automatic automation autoviz data-analysis data-analysis-python data-science data-visualization dtale dtale-library eda exploratory-data-analysis ml pandas pandas-profiling python python-library sweetviz
Last synced: 18 Apr 2026
https://github.com/anamakarevich/suicide_rates_factors
Female suicide rates analysis for Udacity Hacathon
data-analysis data-cleaning linear-regression suicide
Last synced: 21 May 2026
https://github.com/javorraca/unsupervised-ml
A short exercise using R to perform unsupervised machine learning (clustering) on a sample data set.
ade4 clustering clustering-algorithm clustering-analysis data-analysis data-analytics data-science dplyr jupyter k-means-clustering machine-learning machinelearning ml r r-programming sse unsupervised-machine-learning
Last synced: 05 Apr 2025
https://github.com/hassanislam463/british-airways-data-science
Analyze Skytrax reviews to uncover customer sentiments and key themes while predicting booking behavior using machine learning. This repository includes data collection, analysis, and modeling scripts alongside concise, visualized insights to improve customer experience and operational efficiency.
data-analysis data-science data-visualization
Last synced: 28 Mar 2025
https://github.com/puspacempaka/hackerrank-sql-challenges-intermediate
This repository features solutions to various intermediate-level SQL challenges from HackerRank. It includes efficient SQL queries, problem-solving techniques, and well-documented scripts. Explore these solutions to understand different SQL problems and enhance your skills.
challenges data-analysis database hackerrank-solutions queries sql sql-intermediate-level
Last synced: 02 Jan 2026
https://github.com/simranshaikh20/credit-card-dashboard
A Data Visualization Project using Microsoft Power bi
data-analysis data-visualization powerbi
Last synced: 02 Jan 2026
https://github.com/mmzong/gee_lifestyleeffectsonhypertension
Generalized Estimating Equations (GEE), Quasi-likelihood under the Independence Model Criterion (QIC), Longitudinal data, Embedded box plots within violin plots with hypertension risk categories, spaghetti plots, aggregate line plots, histograms, faceted-area plots, box and jitter plots. Investigating the impact of lifestyle on health.
aggregate-line-plot area-faceted-plots box-plots data-analysis data-manipulation data-science data-visualization generalized-estimating-equations histograms jitter-plots longitudinal-data qic quasi-likelihoods r spaghetti-plots violin-plots
Last synced: 29 Jul 2025
https://github.com/aakk23/netflix_sql_project
This SQL project provides an analytical overview of Netflix's movies and TV shows dataset, uncovering key insights related to content types, ratings, release trends, and geographic distribution. It helps explore patterns in content availability, audience targeting, and regional preferences to support data-driven decisions.
data-analysis netflix-data-analysis postgresql sql
Last synced: 10 Apr 2025
https://github.com/iliyasalve/cyclistic_case_study
Analysis of the Bike-Sharing System for the following question: "How do annual members and casual riders use Cyclistic bikes differently?"
bike-sharing data data-analysis data-visualisation r
Last synced: 06 Apr 2025
https://github.com/josericodata/josericodata
Adding a cool README file
big-data data-analysis data-science dublin hadoop hadoop-mapreduce hadoop-spark ireland jobsearch jobseeker portfolio portfolio-data-science portfolio-website python sql
Last synced: 26 Aug 2025
https://github.com/kevingastelum/mydataanalysis
My DataAnalyst Projects | Python, SQL, Excel, PowerBI & Tableau
data-analysis python sql visualization
Last synced: 20 May 2026
https://github.com/ginga1402/demand-supply-analysis
Demand- Supply Analysis using Python
data-analysis data-science demand-supply-management driver-rider-relationship
Last synced: 30 Mar 2025
https://github.com/ginga1402/car_price_prediction
Predict the price of a car using MS Excel.
college-project data-analysis excel linear-regression
Last synced: 30 Mar 2025
https://github.com/abhishekyadav915/diwali_sales_analysis
This project aims to analyze sales data during the Diwali festival using Python. The analysis focuses on identifying key trends, customer purchasing behavior, and sales performance across different segments. By leveraging data visualization and statistical analysis, we uncover insights.
data-analysis data-visualization matplotlib-pyplot numpy-library pandas-dataframe seaborn-python
Last synced: 05 Apr 2025
https://github.com/tashi-2004/data-visualization-tableau-traffic-collision-insights
Analysis of traffic collision data using Tableau, featuring interactive visualizations that highlight trends in injuries and fatalities, contributing factors, and geographic distributions. It includes various sheets and dashboards, with recommendations for enhancing road safety. The dataset is available for further exploration.
data-analysis data-visualization eda geospatial-analysis machine-learning predictive-modeling statistics tableau traffic-analysis
Last synced: 19 Mar 2026
https://github.com/hassanislam463/sentiment_analysis_of_financial_news_headlines_and_affect_on_stock_price_prediction
This project analyzes financial news sentiment using a fine-tuned RoBERTa model and integrates it with stock data to predict price movements using LSTM and GRU. It highlights the role of sentiment in enhancing stock market forecasting.
data-analysis data-science data-visualization deep-learning lstm-neural-networks nlp-machine-learning
Last synced: 28 Mar 2025
https://github.com/laudebugs/fec-data-analysis-2020
The project aimed to determine the total sum of contributions to the candidate committees as well as the number of contributions made by individuals.
data-analysis fec presidential-candidates
Last synced: 16 May 2026
https://github.com/dongdong7048/newtaipei-housing-trend
新北市房價趨勢分析專案
data-analysis housing new-taipei python real-estate
Last synced: 28 Mar 2025
https://github.com/koushikphy/kfutils
A common file operation utility
data-analysis data-files data-operations file-operations interpolation numerical-analysis python python-library python-package
Last synced: 03 Mar 2025
https://github.com/azizbekavazov/eda-uci-retail-dataset
Exploratory Data Analysis (EDA) on UCI Online Retail Dataset. Customer insights, product trends, sales patterns and product recommendations.
customer-insights data-analysis data-visualization eda exploratory-data-analysis jupyter-notebook matplotlib pandas personalized-recommendations product-recommendation python recommendation-system retail-analytics seaborn uci-online-retail
Last synced: 23 Jul 2025
https://github.com/jabulente/t-test-python-implementation
A Python-based implementation of one-sample, two-sample, and paired t-tests for statistical analysis and hypothesis testing.
automation data-analysis data-science eda exploratory-data-analysis hypothesis-testing independent-ttest one-sample-t-test python reporting statistics ttest two-sample-t-test
Last synced: 27 Jun 2025
https://github.com/thesfinox/mltools
A collection of simple tools for data science and machine learning projects.
ai data-analysis data-science data-visualization logging machine-learning matplotlib neural-network python toolbox
Last synced: 14 May 2025
https://github.com/chaedoll/teamproject-foreignerreport
국내 외국인 대상 인프라 개선을 위한 보고서 (Report on improving infrastructure for foreigners)
Last synced: 25 Feb 2025
https://github.com/bhiogade/tlc-trip-analysis
NYC Taxi and Limousine Commission (TLC) Trip Analysis
data-analysis data-cleaning data-collection data-visualization pandas-python tableau tableau-desktop
Last synced: 30 Mar 2025
https://github.com/chrisrobertsjr/chrisrobertsjr
Welcome to my Github Profile!
data data-analysis java r sql statistics
Last synced: 03 May 2026
https://github.com/satyacoder29/smartfinance-dynamic-financial-dashboard
SmartFinance: Dynamic Financial Dashboard is an interactive tool designed to visualize key financial metrics like revenue, expenses, and profit. It features real-time data updates, charts, slicers, and navigation for easy analysis. This dashboard helps businesses make data-driven decisions and optimize financial performance.
data-analysis data-cleaning data-modeling data-visualization powerbi powerbi-desktop powerbi-visuals powerquerym
Last synced: 13 Feb 2026
https://github.com/hossein-rahmati/credit-card-fraud-detection
This repository contains the implementation of a machine learning pipeline for detecting fraudulent credit card transactions. The project leverages common data science libraries to preprocess data, train multiple models, and evaluate their performance using appropriate classification metrics.
data-analysis fraud-detection k-fold-cross-validation machine-learning random-forest-classifier
Last synced: 15 Sep 2025
https://github.com/leticiamilan/dashboard-analitico-de-vendas-globais
Dashboard Analítico de Vendas Globais - DSA - Desenvolvido com Power BI
dashboard dashboard-power-bi data-analysis power-bi powerbi
Last synced: 03 Feb 2026
https://github.com/mr-chang95/datascience_airbnb
Data Science Project for Udacity's Data Scientist Program. Using Python in Jupyter Notebook.
airbnb data-analysis data-science data-visualization jupyter-notebook numpy pandas python sklearn
Last synced: 08 Apr 2026
https://github.com/nick-peter-marcus/chocolate-bar-analysis
Analyzing Chocolate Bar Features and Ratings - Data Visualization, Decision Trees, Random Forest
data-analysis data-visualization decision-trees python random-forest seaborn sklearn
Last synced: 10 May 2026
https://github.com/david-palma/python-for-engineers
A curated collection of exercises that focus on applying Python programming to solve real-world engineering problems.
control-systems data-analysis data-science data-visualisation education engineering hands-on jupyter-notebook learning-by-doing mathematical-modelling numerical-simulations practice programming python signal-processing
Last synced: 21 May 2026
https://github.com/lopez86/rust-mlearn
Machine Learning Tools in Rust
data-analysis data-science machine-learning rust
Last synced: 15 May 2025
https://github.com/karishmagupta05/e-commerce-sales-dashboard
This project is an interactive E-Commerce Sales Dashboard built using Power BI. It provides key insights into sales, profit, and customer behavior through visually engaging charts and graphs.
data-analysis data-visualization powerbi
Last synced: 09 Feb 2026
https://github.com/ishansurdi/data-visualisation-empowering-business-with-effective-insights
The following tasks are completed for Data Visualization: Empowering Business with Effective Insights on Forage in October 2024. It is important to note that this should not be interpreted as an endorsement.
chart communicating-insights-and-analysis dashboard data data-analysis forage powerbi powerbi-visuals tableau tata tata-group virtual-internship visual visualization
Last synced: 17 Feb 2026
https://github.com/mahmoudwal27/brazilian_ecommerce
This project explores and cleans the Olist Brazilian E-Commerce dataset using Python (Pandas) to prepare it for Power BI visualization. The process includes loading data, performing exploratory analysis, handling missing values and duplicates, formatting key columns, and exporting clean datasets.
analytics data-analysis data-analysis-python google-cloud python
Last synced: 16 May 2026
https://github.com/natnaelhhaile/Text-Similarity-Analysis
bag-of-words cosine-similarity data-analysis machine-learning natural-language-processing nltk-python one-hot-encoding python stemming stop-word-removal stop-words text-mining text-processing text-similarity-analysis tf tf-idf tokenization
Last synced: 11 Apr 2025
https://github.com/kunalkumar2001/sales-project-using-excel-and-sql
Comprehensive sales analysis using SQL, Excel, and PowerPoint to uncover insights on top-sellers, peak times, and branch performance.
data-analysis data-analytics excel mssql sql
Last synced: 03 Nov 2025
https://github.com/netcodez/analysing-unicorn-companies---sql
Analysing Unicorn Companies using SQL
data-analysis data-structures database postresql sql
Last synced: 16 May 2026
https://github.com/tathithienthanh/majorproject_womenfashionproductrecommendationsystem
Build a recommendation system for recommending woman fashion's products on e-commerce platforms
content-based data-analysis data-collection data-processing data-visualization dfd e-commerce erd jupyter-notebook lazada nlp python recommender-system scraping-websites sql system-design tiki vietnamese visualization
Last synced: 20 Mar 2025
https://github.com/grindelfp/two-data-manipulative-tasks
Two simple tasks on data analysis and processing.
Last synced: 17 Feb 2026
https://github.com/borjamome/visualization_supermarkets_with_r
Visualization using R and OpenStreetMaps
data-analysis datavisualization openstreetmap r
Last synced: 02 Jan 2026
https://github.com/michael-angelo-mootoo/quanta-app
Quanta is an open source statistical package app / toolkit for neuroscience and general computational descriptive and inferential statistics.
computational-statistics customtkinter data-analysis descriptive-statistics gui-application inferential-statistics neuroscience python r statistical-analysis statistics tkinter-python
Last synced: 16 May 2026
https://github.com/myke003/data-analysis-projects
This repository serves as a collection of all my projects.
data-analysis jupyter-notebook powerbi
Last synced: 14 Mar 2025
https://github.com/pdiegel/currencytracker
A Python application that fetches real-time currency exchange rates from an API, securely stores the data in an SQLite database, and includes error handling, logging, and good programming practices for reliable and periodic data capturing.
analysis api currency data-analysis data-capture logging python python3 sqlite3 tracker
Last synced: 09 Sep 2025
https://github.com/farhad-here/data-visualization-analysis-dva
This is my data analysis project. Users can use this project to clean and preprocessing the date or data visualization. Individuals can impute or ecnode ther dataset.
altair bokeh data-analysis data-analysis-python io matplotlib numpy pandas plotly python sklearn streamlit
Last synced: 11 Apr 2026
https://github.com/vishal-038/real_estate_price_prediction
The Real Estate Price Prediction project aims to develop a machine learning model to predict house prices based on various features
data-analysis data-science data-visualization machine-learning python
Last synced: 21 May 2026
https://github.com/ggarciajavier/udacity-dalf-project3-test-perceptual-phenomenom
Work performed for the 3rd project of Udacity Data Analyst Nanodegree: statistical testing of a perceptual phenomenom (Stroop task).
data-analysis python statistical-inference udacity-data-analyst-nanodegree
Last synced: 18 May 2026
https://github.com/easonsyc/kc-house-price-prediction
Prediction for House Price in King County.
data-analysis jupyter-notebook machine-learning python
Last synced: 21 May 2026
https://github.com/cano1998/data-visualization-project
A project focused on data visualization to explore various aspects of a car dataset. The visualizations provide insights into car performance, efficiency, and characteristics based on different manufacturers and features.
bar-pl bar-plot data-analysis data-visualization histogram jupyter-notebook line-plot
Last synced: 17 Jul 2025
https://github.com/valyaevgeorgiy/r_basic
Работа с основами среды R и тем самым изучения нового языка программирования, связанного непосредственно с анализом данных и построением графиков и диаграмм.
coding data data-analysis r rstudio
Last synced: 12 Dec 2025
https://github.com/garcane/credit-card-transactions-fraud-detection-project
The Credit Card Transactions Fraud Detection Project repository is designed to analyse and detect fraudulent transactions in credit card data.
Last synced: 03 Feb 2026
https://github.com/yasir-arafah/nyc-trip-fare-prediction-using-tcn
"NYC Trip Fare Prediction Using Temporal Convolutional Networks (TCN)" is a Data Analytics Project where the trip and fare data of NYC taxi are combined and then analyzed using Pyspark and visualized using Matplotlib library. The project predicts the fare by using Temporal Convolutional Neural Network.
colab data-analysis matplotlib nyc-taxi-dataset pyspark python
Last synced: 29 Apr 2026
https://github.com/ishmal793/basic-python-
Beginner-friendly Python code examples and exercises – a strong foundation for aspiring data analysts.
data-analysis data-analytics learning-python-code problem-solving python-basics python-for-beginners
Last synced: 23 Jul 2025
https://github.com/shrutiii1109/diwali-sales-analysis-through-python
Data analysis project on Diwali sales using Python (Pandas, NumPy, Matplotlib, Seaborn). The goal is to analyze customer behavior, identify sales trends, and provide insights to improve marketing and business strategies.
data-analysis jupyer-notebook matplotlib numpy pandas python seaborn
Last synced: 30 Apr 2026
https://github.com/sahilmaurya28/youtube-data-analysis
YouTube Data Analysis using Python — uncovering trends, engagement patterns, and correlations between likes, comments, views, and categories to understand what drives content success.
analysis data-analysis data-visualization matplotlib-pyplot numpy pandas portfolio-project python seaborn youtube
Last synced: 13 Apr 2026
https://github.com/nishumehta/british-airways-reviews-analysis
This project analyzes British Airways reviews using Tableau to create an interactive dashboard. The dashboard visualizes average ratings across multiple metrics and trends over time.
dashboard data-analysis data-visualization tableau tableau-public
Last synced: 12 Jan 2026