Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-07-01 00:07:23 UTC
- JSON Representation
https://github.com/jcaperella29/financial-data-scraper
Financial Data Scraper is a Python-based web scraping tool using Selenium to extract financial data from Stock Analysis. It scrapes Income Statement, Balance Sheet, Cash Flow, and Ratios for multiple companies and saves them as CSV files.
automation data-analysis finance financial-statements investment python selenium stock-market web-scraping
Last synced: 28 Jul 2025
https://github.com/sen2pi/bloquinho
Uma WebApp do genero de notion para correr on permises
agenda data-analysis data-visualization database education markdown note-taking notebook notebooks notepad notes notes-app notion obsidian password password-generator password-manager password-safety passwords personal-blog
Last synced: 09 Apr 2026
https://github.com/andystmc/nextflownyc
Developed a machine learning model (Bidirectional LSTM) to forecast NYC traffic volumes using 10 years of automated traffic count data. Achieved strong predictive accuracy, demonstrating the power of deep learning for urban traffic analysis.
data-analysis data-cleaning data-science data-visualization exploratory-data-analysis feature-engineering hyperparameter-tuning jupyter-notebook lstm-neural-networks machine-learning numpy pandas predictive-modeling python3 scikit-learn tensorflow-keras traffic-flow-forecasting
Last synced: 07 Apr 2026
https://github.com/numbersprotocol/dyda
Dynamic data pipeline framework
ai artificial-neural-networks data-analysis data-science
Last synced: 07 Nov 2025
https://github.com/yashodatta15/maven_unicorn_company_challenge
An analysis on Unicorn companies.
data-analysis data-cleaning data-visualization powerbi unicorn-companies
Last synced: 19 Feb 2026
https://github.com/archived-blueprints/amazonathena-blueprints
Simplified blueprints for building data pipelines with Amazon Athena.
amazon-athena athena cli data-analysis data-engineering data-science elt etl
Last synced: 29 Jul 2025
https://github.com/rayyan9477/youtube-spam-detection-with-flask-and-machine-learning
This is a web application built using Flask that detects spam comments on YouTube using a Naive Bayes classifier. It leverages techniques such as CountVectorizer for feature extraction and scikit-learn for machine learning. The application reads data from a CSV file and predicts whether a comment is spam or not.
data-analysis data-science machine-learning nlp-machine-learning spam-detection
Last synced: 21 Sep 2025
https://github.com/atharvbyadav/expensemate
A simple, lightweight personal finance tracker built with Streamlit and SQLite. Log expenses, visualize spending habits, manage budgets, and download reports โ all through an interactive web interface.
budgeting data-analysis data-visualization expense-tracker finance-app open-source pandas personal-finance plotly python sqlite streamlit streamlit-webapp
Last synced: 28 Apr 2026
https://github.com/gappeah/london-housing-price-dashboard
This Excel-based Housing Visual Dashboard provides a comprehensive view of average house prices across various boroughs in London from 1996 to 2013. The dashboard is designed to offer insights into housing market trends and price variations across different areas of London over time.
data data-analysis data-visualization excel visual
Last synced: 31 Jul 2025
https://github.com/dannyben/datamix
DSL for manipulating tabular data
csv data data-analysis data-engineering gem ruby tabular-data
Last synced: 31 Jul 2025
https://github.com/banyc/dfsql
SQL REPL/lib for Data Frames
cli csv data-analysis jsonl ndjson repl sql
Last synced: 31 Jul 2025
https://github.com/myself-aas/quantium_data_analytics_forage
This project analyzes retail customer chip purchasing behavior using Python, focusing on customer segmentation and key spending drivers to provide data-driven insights for strategic category management recommendations.
data-analysis data-engineering data-science data-visualization feature-engineering forage internship-project matplotlib-pyplot numpy-library pandas-dataframe pearson-correlation python quantium-virtual-experience scipy-stats seaborn
Last synced: 31 Jul 2025
https://github.com/simranjeet97/google-cloud-access-using-python
Google Drive Access using Python, Interact Programmatically and Manipulate accordingly
data-analysis data-science data-structures data-visualization gcp gcp-cloud-functions gcp-compute gcp-compute-engine gcp-projects gcp-storage google googlecloud googlecloudplatform python python3 visualization
Last synced: 23 May 2026
https://github.com/robson-python/customer-cancellation
Data science and analytics project to reduce customer cancellations.
data-analysis data-science data-visualization jupyter-notebook machine-learning matplotlib pandas python scikit-learn seaborn streamlit vscode
Last synced: 09 Apr 2026
https://github.com/jxareas/de-zoomcamp-2024
Solutions for @datatalksclub's Data Engineering Zoomcamp 2024.
data-analysis data-engineering data-science database datascience de-zoomcamp docker docker-compose etl etl-pipeline mage-ai orchestration python workflow
Last synced: 09 Apr 2026
https://github.com/shubhamgoyal575/diwali-sankranti-promotion-sales
This Power BI dashboard analyzes sales performance during Diwali and Sankranti festivals. It provides insights into revenue trends, top-selling products, regional sales distribution, and customer purchasing behavior to help optimize festive season sales strategies. ๐
buisness-intelligence dashboard data-analysis data-visualization diwali-sankranti-sales-analysis excel fast-moving-consumers-goods fmcg microsoft-power-bi mysql power-query powerbi revenue-insights sales-dashboard sales-insights sql
Last synced: 02 Mar 2026
https://github.com/athul64/exploratory-data-analysis
To preprocess and analyze the given employee dataset, present the findings graphically, and derive meaningful insights to help better understand the companyโs workforce.
colab-notebook data-analysis data-visualization matplotlib numpy pandas python seaborn statistical-analysis
Last synced: 25 Feb 2026
https://github.com/v6ntage/sql-sales_data-analytics-project
This repository contains a SQL scripts demonstration analytical techniques.
analytics business-analytics data data-analysis database query sql sql-server
Last synced: 12 Apr 2026
https://github.com/jahnavigupta06/zepto-delivery-customer-analytics
Real-time SQL + Power BI Analytics Project replicating Zepto's customer & delivery insights.
business-intelligence churn-analysis customer-segmentation data-analysis data-visualization powerbi sql-server
Last synced: 02 Aug 2025
https://github.com/thomasshikalepo/sql-data-warehouse-project
Building a modern data warehouse with SQL Server, including ETL processes, data modeling, and analytics
data-analysis data-cleaning data-engineering data-lakehouse data-science data-warehouse data-warehousing datascience datawarehousing etl-pipeline medallion-architecture sql sql-query sql-server
Last synced: 02 Aug 2025
https://github.com/ituvtu/Data-Science-AB-Testing
This project focuses on conducting A/B testing to evaluate the effectiveness of two marketing campaigns. Using statistical analysis and hypothesis testing, we determine which campaign is more effective in improving conversion rates.
a-b-testing data-analysis data-analysis-python data-mining ipynb jupyter jupyter-notebook python
Last synced: 26 Sep 2025
https://github.com/shriram-vibhute/data-analysis
This repository offers a comprehensive collection of data analysis techniques using NumPy Pandas, Matplotlib and Seaborn.
data-aggregation data-analysis data-visualization data-wrangling matplotlib numpy pandas seaborn
Last synced: 02 Aug 2025
https://github.com/sarathchandranpm/vehicle_theft_analysis
This project is a comprehensive data analysis of vehicle theft patterns, utilizing advanced SQL techniques to explore when, which, and where vehicles are most likely to be stolen. The analysis provides deep insights into vehicle theft characteristics through systematic, multi-dimensional exploration.
Last synced: 02 Aug 2025
https://github.com/t-mohamed-shafeek/data-analysis-on-tamil-nadu-road-accidents
The "Data Analysis on Tamil Nadu Road Accidents" is a project deals with analysis of data on Road Accidents encountered by Tamil Nadu ( one of the states of India ) in the year of 2020 and 2021. But the dataset is most recently created (created on February 15, 2023 with source form TN Police).
dashboard data-analysis data-science data-visualization jupyter-notebook tableau
Last synced: 03 Aug 2025
https://github.com/juliusmarkwei/iris-dataset-analysis
Data analysis, data visualization and model training using the popular Iris Dataset
data-analysis data-visualisation linear-regression machine-learning
Last synced: 03 Aug 2025
https://github.com/pinedah/loan-approval-predictor-excercise
Proyecto de Machine Learning para predecir la aprobaciรณn de tarjetas de crรฉdito utilizando dos datasets. Incluye limpieza, anรกlisis exploratorio, imputaciรณn de datos sintรฉticos y modelado con algoritmos como Random Forest, Gradient Boosting y รrboles de Decisiรณn.
data-analysis data-science decision-tree escom gradient-boosting machine-learning predictor random-forest school-project
Last synced: 11 Oct 2025
https://github.com/vimal0156/ruaroa-ai
๐งโโ๏ธ Zero-Code Machine Learning Wizard - Transform ideas into intelligent solutions without writing code. AI-powered ML pipeline automation with interactive web interface.
ai-agents ai-assistant artificial-intelligence automated-machine-learning code-generation data-analysis data-science deep-learning jupyter machine-learning machine-learning-pipeline neural-networks no-code openai python scikit-learn streamlit visualization
Last synced: 09 Apr 2026
https://github.com/ganesh2409/cricket-player-performance
This repository contains a comprehensive project focused on analyzing cricket player performance using various datasets, including batting, bowling, and match results. The project involves data preprocessing, feature engineering, and model training to predict and evaluate player performance scores. It includes detailed scripts for data analysis
cricket-performance-analysis data-analysis machine-learning sports-analytics
Last synced: 05 Aug 2025
https://github.com/cuadernin/coffeeanalysis
Anรกlisis de datos correspondiente a la tercera etapa de la certificaciรณn de Datacamp.
coffee data-analysis datacamp python
Last synced: 07 Aug 2025
https://github.com/nafisalawalidris/northwind-traders-sales-analysis
Northwind Traders Sales Analysis project, which analyses sales data for a fictitious company. It utilises the Northwind Database and includes SQL queries to provide insights on employees, products, suppliers and revenue. The project aims to help the company gain valuable information for business decision-making.
business-insights data-analysis database northwind-traders sales sql
Last synced: 07 Aug 2025
https://github.com/turquetti/projeto5-vamoai
Projeto final da Resilia + iFood <3
Last synced: 14 May 2026
https://github.com/the-tech-idea/beepdm
A Library for Managing your Connection to Different DataSources . Still in Alpha.please be patient
data-analysis data-management data-management-platform data-science database dataset information
Last synced: 08 Aug 2025
https://github.com/simranjeet97/ipl-dataanalysis
Data Analysis performed on IPL Dataset with Data Profiling, Data Pre-Processing, Data Manipulation, and Data Visualization.
artificial-intelligence data-analysis data-manipulation data-mining data-preprocessing data-science data-visualization indian-premier-league-2008-2018 ipl ipl-dataset iplayer python
Last synced: 08 May 2026
https://github.com/jayita11/atliqo-bank-credit-card-launch-eda
This project involves exploratory data analysis and statistical testing for AtliQo Bank's new credit card launch. Key insights include targeting high-income occupations and the 18-25 age group. Recommendations focus on tailored marketing campaigns, education, and incentives to enhance credit card adoption and usage among young adults.
data-analysis hypothesis-testing matplotlib p-value pandas python seaborn statistics z-test
Last synced: 09 Apr 2026
https://github.com/rafaumeu/super-trunfo
๐ฎ Super Trunfo Card Game in C | ๐ Country Comparison | ๐ Data Analysis | ๐ Educational Project
beginner-friendly brazilian-dev c-programming card-game cli-app command-line comparison-game computer-science data-analysis educational educational-project estacio game-development learning-c learning-to-code open-source programming-learning super-trunfo
Last synced: 10 Aug 2025
https://github.com/leocornus/leocornus-visualdata
JavaScript libraries to make data visualization simpler and easier.
data-analysis data-mining data-visualization data-visualization-simpler javascript-library
Last synced: 10 Aug 2025
https://github.com/filiplangiewicz/businessintelligence
๐ญ Data warehouses and business intelligence project
airbnb business-intelligence data-analysis data-warehouse
Last synced: 09 Mar 2026
https://github.com/r12habh/datacamp.com-micro_projects
data data-analysis data-science datascience python python3
Last synced: 23 May 2026
https://github.com/gabriela1dc/dashboard-de-analise-de-salarios-na-area-de-dados
Dashboard de anรกlise de salรกrios na รกrea da tecnologia
country data-analysis data-science data-visualization graphics jobs payments python streamlit
Last synced: 09 Apr 2026
https://github.com/virajbhutada/walmart-retail-analyzer
Gain valuable insights into retail sales with the "Walmart Retail Performance Dashboard" in MS Excel. This user-friendly tool facilitates an in-depth analysis of key sales metrics, providing a comprehensive view of Walmart's performance. Make data-driven decisions for informed and strategic business outcomes.
analytics data-analysis data-science data-visualization excel insights interactive-visualizations performance-analysis retail-sales walmart
Last synced: 04 Mar 2026
https://github.com/helosantosdesousa/analise-previsao-de-rotatividade-ml
Projeto final do Bootcamp Data Girls 2025 que analisa a rotatividade de funcionรกrios usando Machine Learning. Com base no dataset IBM HR Analytics Attrition, o projeto identifica os principais fatores de risco e cria modelos preditivos (SVC e Random Forest) com atรฉ 89% de acurรกcia para antecipar saรญdas e apoiar decisรตes estratรฉgicas de RH.
analise-de-dados analise-exploratoria bootcamp ciencia-de-dados colab-notebook dados data data-analysis data-science dataanalytics dataframe eda machine-learning machine-learning-algorithms pandas python random-forest svc
Last synced: 16 Apr 2026
https://github.com/nhsdigital/sde_summary_notebooks
Notebooks provided by the Wranglers for users to quickly gain insights on datasets inside the Secure Data Environment (SDE)
data-analysis data-linkage data-quality data-summary metrics statistics
Last synced: 12 Aug 2025
https://github.com/Narius2030/Hive-DataWarehouse-Analysis
Implement a Hive data warehouse to store meaningful data, apply Machine Learning like Clustering or Regression for dealing with business problems
apache-hadoop apache-hive data-analysis etl-pipeline hiveql machine-learning statistics
Last synced: 12 Aug 2025
https://github.com/bcko/ud-da-eda-whitewinequality
Udacity Data Analyst Nanodegree Project : Exploratory Data Analysis : White Wine Quality dataset
data-analysis exploratory-data-analysis rmarkdown rstudio udacity udacity-data-analyst-nanodegree
Last synced: 03 Jan 2026
https://github.com/shibam120302/heart-disease-data-analysis-by-shibam
You can read more on the heart disease statistics and causes for self-understanding. This project covers manual exploratory data analysis
analysis data-analysis scraper
Last synced: 13 Aug 2025
https://github.com/musbi8788/free_python_book_for_gambian_dev
Free, beginner-friendly Python books for Gambian learners and devs ๐๐
algorithms automation data-analysis data-science django flask machine-learning oops-in-python programming-language python python27 python3 testing-automation web-development
Last synced: 14 Aug 2025
https://github.com/nelsonkariuki/dataanalysis
This project involves data analysis of vido game sales from https://www.kaggle.com/gregorut/videogamesales/download
data-analysis data-visualization python
Last synced: 11 Jun 2026
https://github.com/mariam-badr-mb/student-score-prediction
This project predicts students' exam scores based on study-related and demographic factors using machine learning models.
data-analysis data-visualization explore linear-regression machine-learning mean-square-error student-project supervised-learning
Last synced: 16 Aug 2025
https://github.com/reusjimenez/sqlserver-data-analysis
Laboratorios prรกcticos sobre el anรกlisis de datos utilizando SQL Server. ๐
data-analysis database-scripts joins queries sql sql-server stored-procedures transactions views
Last synced: 16 Aug 2025
https://github.com/prankshaw/election-analytica
Analyzing previous election results for Haryana Vidhan Sabha and other factors and to compare them with various parameter to conclude results.
anaconda collection data-analysis data-science data-visualization elections jupyter-notebook python python-3 wrangling
Last synced: 16 May 2026
https://github.com/floressek/data_analysis_and_visualization
This repository contains a collection of statistical data analysis laboratories using R. Each lab focuses on different aspects of data exploration, visualization, and analysis techniques.
data-analysis data-visualization
Last synced: 05 Oct 2025
https://github.com/pranavarora1895/proteintypeprediction
Data Analysis on Protein Type Prediction
bioinformatics data-analysis supervised-learning
Last synced: 19 Apr 2026
https://github.com/yuvraj0412s/proactive-fraud-detection-using-machine-learning
An end-to-end machine learning project for detecting financial fraud using LightGBM, featuring in-depth EDA, advanced feature engineering, and a focus on actionable business insights.
class-imbalance classification-model data-analysis data-science data-visualization exploratory-data-analysis feature-engineering fintech fraud-detection jupyter-notebook lightgbm machine-learning pandas python scikit-learn smote
Last synced: 02 May 2026
https://github.com/ilchen/eu_economic_data_analysis
Jupyter notebooks for analysis of Eurozone GDP, yields on government bonds, inflation expectations, unemployment and participation rates, money supply, personal consumption and savings, stock market. Using APIs from Eurostat, ECB, OECD and Yahoo-Finance.
data-analysis disposable-income finance gdp hicp inflation interest-rates jupyter-notebook money-supply participation-rate risk-free-interest-rate savings stock-market unemployment-rate
Last synced: 10 Oct 2025
https://github.com/pavelgrigoryevds/olist-deep-dive
๐ Deep Sales Analysis of Olist E-Commerce: EDA | Time Series| Viz | RFM | NLP | Geospatial | Segmentation & Actionable Business Recommendations.
business-recommendations clusterization data-analysis data-analytics data-science deep-analysis e-commerce eda feature-engineering geospatial jupyter-notebook nlp pandas plotly preprocessing python rfm statistics time-series visualization
Last synced: 07 May 2026
https://github.com/airscholar/data_analysis_with_ai
A repository showing how to use AI and ChatGPT for Data Analysis with Pandas and Python
chatgpt data-analysis gpt4 openai pandas pandasai python
Last synced: 10 Apr 2026
https://github.com/devexpress-examples/web-forms-pivot-grid-implement-editable-aspxpivotgrid
This example demonstrates how to allow end-users to modify data cell values in Pivot Grid for Web Forms.
asp-net-web-forms data-analysis dotnet pivot-grid pivot-grid-for-web-forms
Last synced: 09 Mar 2026
https://github.com/rayyan9477/multiple-disease-prediction-system
This repository contains a Multiple Disease Prediction System leveraging machine learning techniques for accurate predictions. It utilizes Python, Pandas, Scikit-learn, and Flask for data preprocessing, model building, and web deployment. Explore the project and connect on LinkedIn for collaborations.
data-analysis data-science machine-learning python streamlit
Last synced: 10 Apr 2026
https://github.com/gdbecker/analyticsportfolio
Analytics Professional Project Work
data-analysis data-science decision-trees firebase google-looker-studio k-means-clustering k-nearest-neighbors kaggle linear-regression logistic-regression machine-learning microsoft-fabric powerbi principal-component-analysis python3 random-forest synapse-data-engineering
Last synced: 10 Apr 2026
https://github.com/1adityakadam/Carnegie_classifications_website
A comprehensive data analytics platform analyzing 50+ years of U.S. higher education trends through interactive visualizations and historical institution tracking.
css data-analysis html javascript python ui-design web-development
Last synced: 25 Jun 2025
https://github.com/nadamarei/data-analyzer
The Qualitative Data Analysis Tool is a powerful Streamlit application designed for researchers to analyze word frequencies in corporate documents. This tool processes PDF reports, identifies target words and their contextually relevant synonyms, and generates comprehensive reports with document statistics, summary analysis, and per-file breakdowns
data-analysis data-visualization python-3 streamlit
Last synced: 18 May 2026
https://github.com/rohansoni45/movie-recommendation-system
This project is a Content-Based Recommender System that suggests movies to users based on their preferences and watched history. The system leverages cosine similarity to find and recommend movies similar to a selected title. It is built using Python and libraries like Pandas, NumPy, and Scikit-learn.
content-based-filtering cosine-similarity data-analysis data-science machine-learning numpy pandas python recommender-system render scikit-learn
Last synced: 17 Apr 2026
https://github.com/ddihora1604/iitk_task
A comprehensive financial data analysis system that collects, processes, and analyzes data from approximately 500 tickers in the S&P Global Index. It provides detailed financial information, ESG metrics, and various financial statements for comprehensive market analysis.
beautifulsoup4 data-analysis data-visualization datamodelling dataset esg machine-learning python yahoo-finance
Last synced: 30 Jun 2026
https://github.com/v41bh4vr4jput/data-analysis-with-python
This repository is a comprehensive collection of data analysis projects and tutorials using Python's most powerful libraries: NumPy, Pandas, Seaborn, and Matplotlib. It is designed to help you explore, clean, visualize, and analyze data efficiently.
api data data-analysis data-visualization matplotlib numpy pandas python sakila-db seaborn
Last synced: 09 Apr 2026
https://github.com/pronzzz/diabetes-prediction
Diabetes prediction using a KNN model and Pima Indian Diabetes Dataset
data-analysis data-manipulation data-preprocessing data-visualization knn machine-learning outlier-detection seaborn
Last synced: 13 Apr 2025
https://github.com/jelhamm/model-ensembles-boosting-in-machine-learning
"This repository contains implementations of Boosting method, popular techniques in Model Ensembles, aimed at improving predictive performance by combining multiple models. by using titanic database."
boosting boosting-algorithms boosting-ensemble boosting-machine data-analysis database-analysis datamining datamining-algorithms jupyter-notebook machine-learning machine-learning-models machine-learning-projects matplotlib-python model-ensemble module numpy-library pandas-library python sklearn-library
Last synced: 16 May 2026
https://github.com/hadarsharon/grizzlys
User-friendly Python DataFrames ๐ต๐ก powered by Julia ๐ด๐ข๐ฃ
big-data data data-analysis data-engineering data-frame data-frames data-science dataframe dataframe-library dataframes dataframes-jl julia python
Last synced: 18 May 2026
https://github.com/ireneflorez/exploration_r
Data exploration on the 'White Wine Quality' dataset using R
data-analysis data-visualization r
Last synced: 16 Jun 2026
https://github.com/jelhamm/singular-value-decomposition-data-mining
"This repository hosts an implementation of the Singular Value Decomposition (SVD) algorithm tailored for data mining tasks. SVD is utilized for efficient dimensionality reduction, aiding in the extraction of key patterns and features from large and complex datasets."
data-analysis dimension-reduction jyputer-notebook machine-learning matplotlib numpy-library pandas-library preprocessing python scipy-library singular-value-decomposition sklearn-library standardscaler svd svd-matrix-factorisation
Last synced: 18 May 2026
https://github.com/dineshdhamodharan24/singapore_flat_resale_
This project focuses on developing a machine learning model to predict the resale values of apartments in Singapore. The goal is to create a user-friendly online application that enables users to obtain accurate predictions for the resale values of specific properties.
data-analysis flat json numpy pandas pickle project python streamlit
Last synced: 07 Apr 2026
https://github.com/gmasson/datadash
DataDash รฉ uma biblioteca JavaScript e CSS para criar dashboards interativos, para visualizaรงรฃo de dados dinรขmicos em pรกginas web.
dashboard dashboard-application dashboards data-analysis data-science data-visualization javascript
Last synced: 08 Aug 2025
https://github.com/eesunmoon/spam_review_detection
[Project] Capstone Design - Spam Detection
crawler-python data-analysis konlpy natural-language-processing python sorting-algorithms spam-detection
Last synced: 12 Oct 2025
https://github.com/dinamohsin/ai-job-market-analysis-using-sql-excel
This project explores a dataset of AI-related jobs to uncover insights about salary trends, in-demand skills, education levels, and remote work preferences. The analysis was done using SQL for querying and Excel for data cleaning and preparation.
data-analysis data-preprocessing excel functions query sql sql-server
Last synced: 25 Jun 2025
https://github.com/percival33/machine-learning-engineering
Uni project about enhancing fictional music streaming service, by developing machine learning models to generate popular playlists
data-analysis data-science machine-learning python
Last synced: 14 Jul 2025
https://github.com/vbhvsingh0/nflteam_corr_population
The goal of this project is to find the correlation in between NFL teams' win and loss with the population of the city.
data-analysis data-cleaning-and-preprocessing data-manipulation-with-pandas numpy-library pandas-python pearson-correlation python3
Last synced: 29 Jun 2026
https://github.com/simranrayait51/internshala-ds-projects
Projects from the Internshala Data Science course, showcasing my skills in Excel, SQL, Python, and Tableau for data manipulation, analysis, and visualization.
data-analysis data-science data-visualization excel internshala-project pgc postgresql python sql tableau
Last synced: 17 May 2026
https://github.com/guilherme-marcello/r-data-analysis-piechart
Reading RDS files, processing and presentation in pie charts
data-analysis data-visualization pie-chart r
Last synced: 13 Jul 2025
https://github.com/jhermienpaul/google-data-analytics-program
Hands-on learning materials from the 8-course Google Data Analytics Professional Certificate program, covering foundational data skills, tools, and real-world business problem-solving
bigquery dashboard data-analysis data-analytics data-modeling data-storytelling data-visualization data-wrangling descriptive-analytics diagnostic-analytics etl-pipeline r-programming rstudio sql tableau
Last synced: 13 Jul 2025
https://github.com/shubhammittal-data/sales-customer_dashboard_tableau
An interactive Tableau project showcasing advanced data visualization techniques for sales performance and customer analytics. This dashboard provides key business insights using KPIs, trend analysis, and customer segmentation. Designed for executives, sales managers, and marketing teams to drive data-driven decision-making.
customer-behavior-analysis customer-segmentation data-analysis data-visualization product-analytics sales-analysis tableau tableau-dashboards tableau-public
Last synced: 07 Mar 2026
https://github.com/neeraj08823/bellabeat_case-study
HOW CAN A WELLNESS COMPANY PLAY IT SMART?
data-analysis data-cleaning data-visualization r rmarkdown rstudio tableau tableau-public
Last synced: 25 Jun 2025
https://github.com/jlee9503/defense-risk-prediction
Build a machine learning pipeline that ingests defense procurement data, identifies high-risk contracts, and visualizes the results in an interactive dashboard.
data-analysis data-visualization exploratory-data-analysis python
Last synced: 25 Jan 2026
https://github.com/myktorijus/retention-cohort
Extracted cohort data using SQL in BigQuery focusing on weekly retention from week 0 to week 6
bigquery data-analysis data-visualization powerbi sql
Last synced: 13 Jul 2025
https://github.com/gappeah/credit-card-transactions-fraud-detection-project
The Credit Card Transactions Fraud Detection Project repository is designed to analyse and detect fraudulent transactions in credit card data.
Last synced: 12 Jul 2025
https://github.com/harmanveer-2546/motor-vehicle-accidents-in-india
As per the report, a total of 4,61,312 road accidents have been reported by States and Union Territories (UTs) during the calendar year 2022, which claimed 1,68,491 lives and caused injuries to 4,43,366 persons.
accidents accidents-analysis darkgrid data-analysis eda exploratory-data-analysis indian-roads inline matplotlib motor-vehicles numpy pandas review seaborn visualization
Last synced: 19 Jan 2026
https://github.com/mrendiks/analyst-data-survey-monkey
Learn how to analyst data from dataset surver monkey using Excel and Python
data-analysis ipynb-jupyter-notebook python
Last synced: 07 Mar 2026
https://github.com/mituskillologies/aiml-dypiemr-sep24
Programs conducted at DYPIEMR, Pune in training on AIML during September 2024.
artificial-intelligence data-analysis data-science machine-learning matplotlib neural-network numpy pandas python3
Last synced: 05 Apr 2025
https://github.com/chahelgupta/fitness-data-analysis-r-project
This project focuses on analyzing fitness data collected from various tracking devices to gain insights into users' activity levels, sleep patterns, calorie expenditure, and heart rate. The dataset used in this project consists of multiple CSV files, each containing different aspects of fitness-related data.
data-analysis data-cleaning data-exploration data-science data-visualization r r-language r-programming r-studio
Last synced: 18 May 2026
https://github.com/farzeennimran/fashion-mnist-dataset-classification-using-neural-network
Implementation of a Multi-layer Perceptron classifier with hyperparameter tuning and k-fold cross-validation employing GridSearchCV for classifying images on the Fashion MNIST dataset ๐๐๐
artificial-intelligence data-analysis data-mining data-science dataset deep-learning fashion-mnist-dataset gridsearchcv hyperparameter-tuning kfold-cross-validation machine-learning multilayer-perceptron-network neural-network numpy pandas python sklearn
Last synced: 03 Apr 2026
https://github.com/jonathancaleb/adap
๐๐ฑ Agricultural Data Analysis Platform ๐๐ A personal initiative to analyze coffee growth trends in Uganda using Python, data science, and machine learning. This project supports sustainable farming with predictive models and interactive visualizations. ๐๐
data-analysis data-science python
Last synced: 18 May 2026
https://github.com/tathithienthanh/womenfashionproductrecommendationsystem
Build a recommendation system for recommending woman fashion's products on e-commerce platforms
content-based data-analysis data-collection data-processing data-visualization dfd e-commerce erd jupyter-notebook lazada nlp python recommender-system scraping-websites sql system-design tiki vietnamese visualization
Last synced: 16 May 2026
https://github.com/Fisseha-Estifanos/telecom
A showcase repository for a specific telecommunication company. Used to analyze several telecommunication data set features and generate useful insights accordingly. Insights generated could be seen at https://github.com/Fisseha-Estifanos/telecom-visualizer or at https://fisseha-estifanos-telecom-visualizer-home-huxgy0.streamlitapp.com/
data-analysis notebooks-jupyter python visual-studio-code visualization
Last synced: 11 Mar 2025
https://github.com/majajuri/text-classification-using-string-kernels
Projekt u sklopu predmeta Uvod u znanost o podacima
Last synced: 05 Apr 2025
https://github.com/ashvinhandoo/bionic-lab-projects
Computational neurophysiology pipelines for analyzing astrocyte and vascular dynamics. Includes Python- and MATLAB-based analysis frameworks for modeling calcium, vasomotion, and pupil-linked activity, demonstrating advanced signal processing, transfer entropy estimation, and data visualization skills used in biomedical research.
biocomputation bioinformatics biomedical-engineering computational-biology data-analysis matlab neuroscience python signal-processing time-series
Last synced: 18 May 2026
https://github.com/alvarezekiel19/movie-data-analysis
A Data Science elective activity
data-analysis data-science data-visualization jupyter-notebook python python3
Last synced: 18 May 2026
https://github.com/sourceduty/data_metrics
๐ Analyzing, sorting and visualizing data.
data data-analysis data-metrics data-sci data-science data-science-projects data-sorting data-visualization database dataset metrics sorting statistics visualization
Last synced: 08 Aug 2025
https://github.com/martachesnova/python-apis
A weather analysis that randomly selects more than 500 cities across the globe, pulls data from the OpenWeatherMap API for each city. Analysis of the weather and perfect vacation spot is viewable on my Jupyter Notebook.
Last synced: 24 Feb 2025
https://github.com/martachesnova/python
Created a Python script to calculate and analyze financial records of a company. Created another Python script to do calculations and analysis of the voting process in a small town.
Last synced: 24 Apr 2026
https://github.com/jackmnob/python-tableau-eda-stockdash
Data cleaning, preparation, and manipulation (EDA) for an interactive stock market dashboard with Tableau - using pandas (Python) via JupyterLab
cleaning-data dashboard data-analysis data-preparation eda jupyter-notebook jupyterlab python tableau-public
Last synced: 14 May 2026