Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-07-03 00:07:37 UTC
- JSON Representation
https://github.com/prakhar-code/british_airways_review_analysis
Analysis of the British Airways Reviews by Customers, filtered by several different factors such as food, entertainment, services, etc.
data-analysis data-cleaning excel tableau-dashboards tableau-public tableau-visualization
Last synced: 15 Jan 2026
https://github.com/ezmiller/esd-viz
Visualization of European Social Survey (http://www.europeansocialsurvey.org/data/)
clojure data-analysis visualization
Last synced: 28 May 2026
https://github.com/rosanafss/r-ladies-bh-workshop-metricas
Como Plotar Métricas e Entregar Valor para Times Ágeis
data-analysis data-visualization r
Last synced: 25 Aug 2025
https://github.com/mindlessmuse666/iris-knn
Проект демонстрирует применение алгоритма k-ближайших соседей (KNN) для классификации набора данных Iris. Включает загрузку данных, обучение модели, оценку производительности и визуализацию результатов с использованием библиотек Pandas, Scikit-learn, Matplotlib, Seaborn и Plotly.
algorithm classification data-analysis data-visualization iris-dataset knn lazy-learning machine-learning python scikit-learn
Last synced: 17 Aug 2025
https://github.com/ziaeemehr/neuro_toolbox
Single Header File C++ library for analysis of neurophysiological and simulated data.
data-analysis data-science signal-processing synchronization
Last synced: 21 Jul 2025
https://github.com/rafinha0rafinha/web-analyzer-backend
(Legacy) This is the backend for Mazaoro SARLU's lead magnet "Web Analyzer". This project analyzes websites using Google Lighthouse and returns a detailed report consumed by the frontend.
azure-app-service azure-devops chartjs cicd data-analysis data-science data-visualization express flask hacktoberfest lighthouse numpy sentiment-analysis vader-sentiment-analyzer
Last synced: 10 Apr 2026
https://github.com/mfakhriazhar/stock-price-prediction
Stock prices are highly volatile and influenced by various factors, making accurate prediction a major challenge in investment decisions.
data-analysis data-science deep-learning python recurrent-neural-networks
Last synced: 18 May 2026
https://github.com/harshnevse/performance_analysis_of_solar_plants_in_india
A Data Analysis project using Tableau
Last synced: 03 Jan 2026
https://github.com/alexjackson1/commons-indicative-votes
A cluster analysis of the House of Commons' Indicative Brexit Voting Process on 27 Match 2019
Last synced: 19 Jul 2025
https://github.com/spring-0/netflix-media-data-analysis
Exploring and analyzing Netflix data to uncover trends through data visualization and statistical analysis.
Last synced: 27 Mar 2025
https://github.com/jasonsu131/cps188-term-project
A data analysis program developed in C to extract information about diabetic patients across Canada from a governmental spreadsheet available online. The program showcases summaries and averages based on the extracted data.
c data-analysis data-statictics file-reading
Last synced: 28 Mar 2025
https://github.com/shrutiijoshi/restaurant-order-analysis
Analyze order data to identify the most and least popular menu items and types of cuisine
analytics data-analysis mysql sql
Last synced: 26 Aug 2025
https://github.com/kwonnayeon/urban-parks-childrens-happiness
Grad thesis on urban parks’ impact on children’s happiness – data, results, and code
causal-inference data-analysis environmental-psychology latex matching propensity-score public-health r social-science statistical-analysis thesis-project urban-studies weighting
Last synced: 17 Feb 2026
https://github.com/vishal-verma-96/pre-owned-car-price-prediction-using-streamlit-app
Capstone Project by skill Academy- Exploratory Analysis, Visualization and Prediction of Used Car Prices. Deploying the highest-scoring model with Streamlit web app
data-analysis data-science jupyter-notebook machine-learning machine-learning-algorithms matplotlib numpy pandas python3 regression-algorithms scikit-learn seaborn streamlit
Last synced: 11 Apr 2026
https://github.com/velut/thesis-sw
Software and datasets used in the "Cost-effective and Scalable Activity Matching using Crowdsourcing" thesis
bpmn cost crowdflower crowdsourcing data-analysis dataset performance-analysis plotting-algorithms r thesis
Last synced: 19 Jun 2025
https://github.com/mae776569/weratedogs-wrangling
Wrangling WeRateDogs Twitter data to create interesting and trustworthy analyses and visualizations
data-analysis data-science data-visualization tweets twitter-api
Last synced: 25 Jan 2026
https://github.com/simranjeet97/covid-19
Covid-19 Data Analysis and Important Topics to be Covered to get the Impact and Solution.
coronavirus coronavirus-analysis coronavirus-dataset coronavirus-prediction coronavirus-tracking covid-19-data-analysis covid19 covid19-data covid19-india dash dash-app dash-plotly data data-analysis data-science data-science-projects data-visualization python3
Last synced: 18 May 2026
https://github.com/mkk-1817/cvip-ds-exploratory_data_analysis-terrorism
This repository deals with exploring global terrorism trends analyzing the Global Terrorism Database to uncover temporal patterns, identify top terrorist groups, examine attack types, and gain insights into geographical and success/failure dynamics.
coderscave data-analysis data-science data-visualization eda exploratory-data-analysis python terrorism-analysis
Last synced: 19 Jun 2025
https://github.com/mindlessmuse666/apartment-price-predictor
Python-проект по прогнозированию стоимости аренды квартир с помощью линейной регрессии. Практическая работа по теме: "Основы машинного обучения" дисциплины "МДК 13.01: Основы применения методов искусственного интеллекта в программировании".
apartment-price-prediction data-analysis data-science linear-regression linear-regression-models machine-learning matplotlib python regression sklearn unit-testing
Last synced: 11 Apr 2026
https://github.com/cyberoctane29/noaa-lightning-analysis
This project explores lightning strike data from the National Oceanic and Atmospheric Administration (NOAA) to identify seasonal trends and analyze strike frequency across months. It demonstrates data manipulation, aggregation, and visualization using Python, providing insights into lightning activity patterns.
data-analysis data-science data-visualization eda python
Last synced: 20 Apr 2026
https://github.com/tathithienthanh/majorproject_womenfashionproductrecommendationsystem
Build a recommendation system for recommending woman fashion's products on e-commerce platforms
content-based data-analysis data-collection data-processing data-visualization dfd e-commerce erd jupyter-notebook lazada nlp python recommender-system scraping-websites sql system-design tiki vietnamese visualization
Last synced: 20 Mar 2025
https://github.com/mfakhriazhar/ecom-qtt-prediction
In e-commerce, understanding seasonal sales trends and best-selling products is critical to business strategy. However, companies often struggle with predicting sales, determining factors that influence sales (discounts, product categories, locations), and optimizing stock and marketing.
data-analysis data-science data-visualization e-commerce-project eda machine-learning python
Last synced: 19 May 2026
https://github.com/hadjiprocopis/histocurse
A Java implementation of a multidimensional histogram backed on dense/conventional OR sparse array. Extremely efficient when number of dimensions is large and back-store is sparse array. This module depends on other projects which can be found on my repo here. See README below to see what you need to download.
data-analysis data-structures histogram multidimensional
Last synced: 03 Jul 2026
https://github.com/kenwuqianghao/scotiabank-datathon-2023
Code and data analysis done for 2023 Scotiabank Datathon
data-analysis fraud-detection jupyter-notebook python
Last synced: 18 May 2026
https://github.com/lauratrigo/dias_geomagneticamente_calmos
📡Script MATLAB que analisa parâmetros ionosféricos (hF, f0F2, hmF2) via FFT, gerando espectros unilaterais/bilaterais para identificar padrões temporais em resolução, crucial para estudos de variações ionosféricas.
data-analysis geophysics matlab scientific
Last synced: 29 Aug 2025
https://github.com/emmarhoffmann/analysis-of-student-debt-among-first-generation-college-students
Explores the financial landscape of first-generation college students, analyzing patterns in student debt based on factors like median income, net price of attendance, and enrollment size.
data-analysis first-generation-college-students r statistical-models
Last synced: 17 Mar 2025
https://github.com/yash22222/british-airways-data-science-internship
All 2 Task Assigned By British Airways Data Science Virtual Internship Programme
csv data-analysis data-science data-visualization google-colaboratory jyputer-notebook machine-learning microsoft-excel microsoft-powerpoint python
Last synced: 16 May 2026
https://github.com/yash22222/web-scraping-for-data-analysis-predictive-model-on-customer-data
Utilized web scraping for customer feedback at Air India, conducting robust data analysis, and applying machine learning for predictive modeling. Drove data-driven decisions, enhancing services, and elevating customer satisfaction. Expertise in web scraping, analysis, and predictive modeling for actionable insights.
data-analysis data-preprocessing data-science data-visualization exploratory-data-analysis machine-learning powerbi random-forest-classifier sentiment-analysis tableau web-scraping
Last synced: 30 May 2026
https://github.com/sabdikay/telco-customer-churn-analysis-ibm-dataset
This project explores customer churn trends for a company in California using an IBM dataset. Built in a Jupyter Notebook, it employs pandas, NumPy, matplotlib, seaborn, plotly, and scipy to clean, analyze, and visualize data. Through statistical tests and interactive maps, it uncovers key drivers behind customer cancellations
business-intelligence customer-churn data-analysis data-analysis-python data-visualization exploratory-data-analysis jupyter-noteboook matplotlib numpy pandas plotly predictive-modeling python scipy seaborn statistical-analysis
Last synced: 07 Apr 2026
https://github.com/annaanastasy/classification-project-student-grades
A machine learning project to predict students' academic performance using features like demographics, study habits, and parental involvement, achieving 74% accuracy with the CatBoost model.
catboost-classifier classification data-analysis data-visualization machine-learning-algorithms predictive-modeling
Last synced: 29 Mar 2025
https://github.com/emmarhoffmann/analysis-of-california-real-estate-market-factors-influencing-home-prices
Investigates how home size, number of bedrooms, and bathrooms influence home prices, with comparisons across California, New York, New Jersey, and Pennsylvania.
data-analysis r real-estate statistical-models
Last synced: 17 Mar 2025
https://github.com/manuelgil/vscode-data-pack
This extension pack includes the essential extensions for data analysts.
data-analysis data-science data-structures data-visualization vscode-extension
Last synced: 07 Apr 2026
https://github.com/janashanaa/flightanalysis
This Jupyter Notebook presents an exploratory data analysis of data derived from a flight booking website.
data-analysis data-visualization exploratory-data-analysis jupyter-notebook python
Last synced: 15 May 2026
https://github.com/prarthana-singh/heart-attack-prediction-model
A Machine Learning model that predicts the risk of a heart attack based on health parameters like cholesterol levels, blood pressure, BMI, smoking habits, and age. Built using Classification models, Scikit-Learn, Pandas, and Python.
classification data-analysis data-science heart-attack-prediction logistic-regression machine-learning numpy pandas python scikit-learn
Last synced: 25 Jun 2025
https://github.com/sparkerdata/hockeyshotmap
Interactive Streamlit app for NHL shot maps & player analysis. Pulls live (or demo) play-by-play data, normalizes rink coordinates, and visualizes shots with context filters (strength, period, player).
data-analysis data-visualization duckdb hockey hockey-analytics ice-hockey nhl nhl-data python sports sports-analytics
Last synced: 18 May 2026
https://github.com/robinmillford/analyzing-e-commerce-transactions---data-cleaning-cohort-analysis-and-sql
In this project, I aimed to analyze the profitability of products in an e-commerce dataset. I performed various SQL queries to extract valuable insights about product profitability, including the identification of the top 5 products with the highest profit margin, and unique combinations of brands and product lines with the highest profitability.
cohort-analysis data-analysis data-visualization excel jupyter-notebook powerbi python3 sql
Last synced: 18 May 2026
https://github.com/ivanayala96/end-to-end-business-intelligence-solution-logistics-financial-performance-dashboard
Project Overview: This project features a comprehensive Power BI solution developed for Ayala's Consultancy. It transforms raw operational data (generated via Python) into a strategic decision-making tool, managing a dataset of $7.71M in total sales and over 2,500 transactions.
anlytics bussines-report bussiness-intelligence data-analysis dax power-bi powerbi python
Last synced: 22 Apr 2026
https://github.com/rosa-lpz/data-analysis-handbook
Data Analysis base knowledge and practical applications
data data-analysis data-visualization database dax documentation power-bi python r sql tableau tableau-public
Last synced: 06 Apr 2026
https://github.com/dacosmicgiant/marketing-sms-analyser
Mini project for R language SEM - V
Last synced: 21 Mar 2025
https://github.com/kammarah/studentdata
I created & deployed a Streamlit app to store, manage & analyze student data. 📊🎓
connection data data-analysis data-visualization deploy deployments libraries python streamlit streamlit-webapp webapp
Last synced: 18 May 2026
https://github.com/stefagnone/unsupervised-analysis-project
This project investigates the impact of video content on social media engagement using advanced analytics techniques like PCA, k-means clustering, and logistic regression. It provides actionable insights for optimizing social media strategies for Thai fashion and cosmetics retailers.
data-analysis data-visualization engagement-metrics facebook-live-sellers k-means-clustering logistic-regression marketing-insights pca-analysis python social-media-analytics
Last synced: 05 Apr 2025
https://github.com/stefagnone/data_storyboarding_visualization
Data Storyboarding and Visualization Techniques for Effective Communication
data-analysis data-visualization ggplot2-analysis r tableau-dashboards
Last synced: 05 Apr 2025
https://github.com/stefagnone/-wedding-vendor-pricing-and-customer-satisfaction-analysis
Data-driven analysis of wedding vendor pricing and customer satisfaction, with database design, SQL optimization, and cost breakdown generation.
business-intelligence cost-optimization customer-satisfaction data-analysis database-design python sql vision-board-analysis wedding-planning wedding-vendor-pricing
Last synced: 03 May 2026
https://github.com/stefagnone/moneyball_project
Data-driven analysis inspired by the Moneyball approach, identifying affordable replacements for key Oakland A's players using R and sabermetrics to support cost-effective recruitment.
baseball-statistics data-analysis data-driven-decision-making player-replacement-strategy r-programming sabermetrics sports-analytics
Last synced: 05 Apr 2025
https://github.com/rorrell/rightwhaledata
A Jupyter Notebook where I wrangle some data on right whale sightings and create a visualization
data-analysis data-visualization jupyter-notebook python3
Last synced: 11 May 2026
https://github.com/nour-zayed/shopping-trends-analytics-sql-python-power-bi
"End-to-end Shopping Trends analytics project using SQL, Python, Excel & Power BI — data cleaning, EDA, KPI generation, and interactive dashboards with DAX for actionable business insights."
business-intelligence data-analysis data-visualization dax powerbi python sql
Last synced: 18 May 2026
https://github.com/jatin-mehra119/car_price_prediction
Predicting price of the cars using small dataset.
data-analysis data-visualization jupyter-notebook machine-learning python regression-models sklearn sklearn-pipeline
Last synced: 07 Apr 2026
https://github.com/tarasbln/big-quant
Official public repository of the Berlin Investment Group (BIG) Quant Team, featuring quantitative finance research, algorithmic trading strategies, market analyses, educational materials, and open-source projects.
data-analysis education finance investment investment-club python3 quantative-finance quantative-trading quantitative-research research
Last synced: 21 Mar 2025
https://github.com/misszeferino/netflix-exploratory-analysis
Netflix exploratory analysis using python
data-analysis data-visualization pandas plotly python
Last synced: 07 Apr 2026
https://github.com/jayita11/exploring-most-streamed-songs-for-last-four-decades-eda
Perform EDA to uncover trends in streaming patterns, likes, and artists over the last four decades.
data-analysis eda hypothesis-testing matplotlib most-streamed-songs pandas python seaborn
Last synced: 07 Apr 2026
https://github.com/ahadly/sql-data-analytics-project
This repository contains a collection of SQL scripts demonstrating various analytical techniques, such as changes over time, cumulative, performance, data segmentation, part-to-whole analysis.
analytics business-analytics business-intelligence data data-analysis data-analyst data-analytics data-engineering data-science data-scientist database datascience query reporting sql sql-queries sql-query sql-server window-functions window-functions-in-sql
Last synced: 18 May 2026
https://github.com/netcodez/analysing-unicorn-companies---sql
Analysing Unicorn Companies using SQL
data-analysis data-structures database postresql sql
Last synced: 16 May 2026
https://github.com/enayar478/nomad_machine_learning_dash_app
An interactive Machine Learning app built with Dash and Plotly, developed as part of the Data Analytics Bootcamp at Le Wagon Bordeaux. It allows users to visualize data, make real-time predictions, and explore various model insights.
analytics cachetools dash dashboard-application data-analysis data-science deployment gunicorn interactive-visualization machine-learning pandas plotly plotly-dash prediction-model python python3 render scikit-learn web-application
Last synced: 02 Jan 2026
https://github.com/ahnaf19/rokomari_price_analysis
This was a job hiring assignment given my rokomari.com. The data was small, obviously a generated one for test purpose. I tried to describe myself while diving deep as much as possible.
data-analysis data-cleaning data-visualization etl
Last synced: 30 Aug 2025
https://github.com/oshinrathor/Data-Science-Systems-and-Analytics-Projects
Dive into my Data Science Projects Repository, featuring a Spam SMS Classifier, NIA Dashboard, H1N1 Vaccine Prediction, and NYC Taxi Fare Prediction. Each project showcases my skills in data cleaning, exploratory analysis, modeling, and visualization, offering valuable insights and methodologies for data enthusiasts and practitioners.
dashboard data-analysis data-driven-decisions data-presentation data-science data-visualization dataexploration eda insights nia webanalytics
Last synced: 12 Sep 2025
https://github.com/chingu-voyages/v47-tier3-team-30
An easily accessible tool for calculating electricity-related carbon emissions, along with insights for reducing environmental impact. | Voyage-47 | https://chingu.io/ | Twitter: https://twitter.com/ChinguCollabs
carbon-emissions carbon-footprint data-analysis data-engineering data-science
Last synced: 10 May 2026
https://github.com/karlyndiary/adidas-sales-analysis
Analyzed Adidas' product sales performance, top retailers, monthly trends, yearly growth, regional distribution, and pricing insights. Performed ETL from Python (Pandas) to SQL Server, extracted data with SQL, and visualized key insights in Excel.
adidas-sales-analysis adidas-sales-dashboard dashboard data-analysis data-cleaning data-pipeline data-visualization etl excel-dashboard microsoft-excel microsoft-sql-server python
Last synced: 10 Feb 2026
https://github.com/satyacoder29/comparison-of-region-based-sales-tableau
The region-based sales comparison analyzes sales performance across different regions. It identifies trends, top-performing regions, and areas needing improvement by comparing metrics like revenue, growth rate, and product demand. This analysis helps optimize sales strategies and resource allocation for better performance.
data-analysis data-cleaning data-collection data-visualization powerquerym relationships tableau tableau-desktop unions
Last synced: 02 Feb 2026
https://github.com/derogative404/google_data_analytics_capstone
Capstone project part of the Google Data Analytics Certificate Program
Last synced: 26 Mar 2025
https://github.com/hoxo-m/blog
HOXO-M Blog
data-analysis data-science r-package
Last synced: 30 Oct 2025
https://github.com/dina-hosny/sparkify---data-modeling-with-cassandra
Sparkify - Data Modeling with Cassandra - Udacity Data Engineering Expert Track.
cassandra cql data-analysis data-engineering data-modeling data-warehousing etl python
Last synced: 11 Apr 2026
https://github.com/akash1070/predicting-zomato-restaurant-ratings
Perform extensive Exploratory Data Analysis(EDA) on the Zomato Dataset. Building an appropriate Machine Learning Model that will help various Zomato Restaurants to predict their respective Ratings based on certain features deploy the Machine learning model via Flask
data-analysis extratreesregressor flask linear-regression machine-learning random-forest zomato-bangalore zomato-data-analysis
Last synced: 18 May 2026
https://github.com/huynhtanphatt/diagnosing-uk-railway-performances
This project analyzes UK railway ticket and operation data to show how revenue, passenger demand, and on-time performance are connected.
data-analysis data-visualization datastorytelling python railway sql ticketing transportation
Last synced: 24 Apr 2026
https://github.com/sbera01/credit-card-approval-predictor
End-to-end Machine Learning project to predict credit card approval decisions using real-world financial features. Includes EDA, model training, and deployment-ready architecture
credit-card-approval-prediction data-analysis machine-learning python scikit-learn streamlit
Last synced: 24 Dec 2025
https://github.com/sebastianurdaneguibisalaya/enfermedades-fissal
Análisis holístico de atenciones por enfermedades raras, huérfanas y transplantes coberturados por FISSAL en el Perú.
data-analysis data-visualization python
Last synced: 24 Feb 2025
https://github.com/arjunraj77/analysis-hub
International Fraud Group Hackathon
data-analysis data-visualization hackathon-project kibana-cluster kibana-dashboard
Last synced: 30 Mar 2025
https://github.com/sciencesar-labs/py485-final-project
ROOT-based muon data analysis using Python & Jupyter – final project for PY485E @ CERN
cern computational-physics data-analysis jupyter-notebook muons python root uproot
Last synced: 15 May 2026
https://github.com/dinamohsin/toman-bikeshare-data-analysis-sql-power-bi
This project involves data analysis using SQL, Power BI, and CSV datasets to extract insights and visualize key business metrics.
csv-files data-analysis data-visualization database powerbi sql sql-server
Last synced: 22 Apr 2026
https://github.com/jerinpious/house-price-prediction
This project is a machine learning-based application to predict house prices. A frontend interface has been developed using Streamlit to make the prediction process user-friendly for regular customers. The project is structured
data-analysis data-engineering data-science eda machine-learning pandas python random-forest scikit-learn streamlit
Last synced: 05 Apr 2026
https://github.com/arkww/chinesenewspaperwordcount
Analysis the word count of Chinese characters in Simplified and Traditional Chinese characters and comparing the results
chinese-language data-analysis data-science python
Last synced: 16 May 2026
https://github.com/iwasakiyuuki/data-analysis-platform-etl
A collection of Airflow DAGs for automating data collection into our on-premises data analysis platform.
airflow airflow-dags data-analysis data-collection
Last synced: 01 Jul 2025
https://github.com/jonek/pv-city-mastr
Extract and analyze data about photovoltaic systems in Germany
data-analysis germany jupyter-notebook pandas photovolatic-power photovoltaic
Last synced: 11 May 2026
https://github.com/saisurajmatta/data-warehousing-and-advanced-data-analytics
Data Analytics Project: Analyzed Promotions and Provided Tangible Insights to Sales Director
data data-analysis data-architecture data-flow-analysis data-modeling data-pipeline data-segmentation data-visualization data-warehousing docker etl etl-pipeline mssql sql tableau
Last synced: 17 May 2026
https://github.com/sreejabethu/smart-report-analyzer
An AI-powered app to analyze and summarize Excel, CSV, and PDF reports using Hugging Face language models. Built with Streamlit.
data-analysis huggingface llm nlp pdf-analysis python question-answering streamlit summarization
Last synced: 18 May 2026
https://github.com/bhaveshbhakta/mobile-price-prediction-using-xgboost
Mobile Price Prediction
data-analysis data-visualization machine-learning mobile-price-prediction xgboost
Last synced: 19 Jul 2025
https://github.com/lord3008/instances-of-data-analysis
This repository of mine shows my work on data analysis of various projects that I made. I feel data analysis is the very key to investigate a solution. Further more it enlightens the direction towards model building.
Last synced: 03 Mar 2025
https://github.com/cowboymrzamo2380/json-to-excel-converter
This repository provides a tool to convert JSON data to Excel format (.xlsx). It allows you to easily transform structured JSON data into a well-organized spreadsheet for better analysis and visualization.
automation-script automation-tools data-analysis data-converter data-export data-formatting data-tools data-visualization excel excel-automation excel-converter excel-tools json json-exporter json-parser json-processing json-to-csv json-to-excel programming-tools spreadsheet-tools
Last synced: 05 Apr 2025
https://github.com/ilovenooodles/probstat-water-potability
Tugas Besar Probabilitas dan Statistika 1
csv data-analysis jupyter-notebooks python
Last synced: 03 May 2026
https://github.com/coditheck/data_analysis
Data analysis is the process of inspecting, cleaning, transforming, and modeling data in order to discover useful information, draw conclusions, and support decision making.
Last synced: 17 Jun 2025
https://github.com/debjyotisaha/power-bi-projects-phase-1
Portfolio projects related to data visualisation in Power BI
data-analysis data-visualization dax-expression powerbi powerquery
Last synced: 18 Jan 2026
https://github.com/luminati-io/walmart-dataset-samples
A sample dataset of over 1000 Walmart products, extracted using the Bright Data API, ideal for consumer market insights and competitor analysis.
api data-analysis dataset walmart walmart-scraper web-scraping
Last synced: 04 Jan 2026
https://github.com/calebtheman116/hotel_customers_sentiments
Sentiment Analysis for a Hotel Based on Customer's Reviews
2018-2019 data-analysis data-analysis-in-excel data-cleaning data-cleaning-and-preprocessing data-visualization excel excel-pivot-tables github hotel-review-sentiments pivot-tables sentiment-analysis tableau-public text-reviews
Last synced: 21 Jul 2025
https://github.com/celineboutinon/lafleche-et-associes
OpenClassrooms Data Analyst 2022-2023 - Projet 7 using KNIME Analytics Platform
data-analysis data-analytics data-visualisation knime-analytics-platform no-code rgpd
Last synced: 08 Feb 2026
https://github.com/vavarm/data-analysis-french-electric-automobile-infrastructure
Data analysis realized in R Shiny and Python about the French electric vehicle and charging station infrastructure
data-analysis data-science data-visualization factominer geojson ggplot2 plotly python r rshiny
Last synced: 15 May 2026
https://github.com/jofaval/california-housing-pricing
Data Analysis about the California Housing Pricing in 1997
data-analysis data-science data-visualization deep deep-learning deep-neural-networks google-colab keras machine-learning matplotlib python regression scikit-learn seaborn tensorflow
Last synced: 05 Apr 2026
https://github.com/xenon1919/credit-card-fraud-detection
Credit Card Fraud Detection is a machine learning project to predict fraudulent credit card transactions. It handles imbalanced data using undersampling and applies Logistic Regression and XGBoost models. With an AUC of 0.98, it offers robust fraud detection. Includes a Streamlit app for real-time predictions.
data-analysis machine-learning python
Last synced: 14 May 2026
https://github.com/onemoredavid/python-like-a-boss
This is where I stash my Python study material.
data data-analysis data-engineering data-science data-visualization datascience ipynb ipynb-jupyter-notebook ipynb-notebook numpy pandas python python3
Last synced: 04 Apr 2025
https://github.com/clarajacintho/ig4-ds
The final project for the Multidimensional Data Analysis and Data Mining courses, where we analyze data from motorcyclists to determine what causes accidents
data-analysis data-science shiny-apps
Last synced: 11 May 2025
https://github.com/saadhaniftaj/logistic--lasso-regression-data-analysis
Iris dataset analysis with logistic and Lasso regression, using coordinate descent for feature selection and binary classification. Includes preprocessing and data visualizations
data-analysis lasso-regression-model logistic-regression python statistics
Last synced: 18 May 2026
https://github.com/natnaelhhaile/Text-Similarity-Analysis
bag-of-words cosine-similarity data-analysis machine-learning natural-language-processing nltk-python one-hot-encoding python stemming stop-word-removal stop-words text-mining text-processing text-similarity-analysis tf tf-idf tokenization
Last synced: 11 Apr 2025
https://github.com/satyacoder29/crm-analytics
CRM Analytics Dashboard – An interactive dashboard using Tableau, SQL, and Salesforce CRM Analytics (CRMA) to analyze sales performance, customer segmentation, and churn prediction. Features automated ETL pipelines, predictive analytics, and real-time insights for data-driven decision-making. 🚀📊
advanced-excel data-analysis data-cleaning data-collection data-transformation data-visualization matplotlib numpy pandas powerbi python seaborn sql tableau
Last synced: 03 Mar 2025
https://github.com/andremenezesds/house_rocket_sales_insights
Data analysis for House Rocket Sales Company database, including insights for sales optimization.
data-analysis data-visualization geopandas git github jupyter-notebook linux numpy pandas powerbi python seaborn seaborn-python streamlit streamlit-webapp ubuntu vscode windows10
Last synced: 21 Jan 2026
https://github.com/bhaveshbhakta/amazon-sales-data-visualization
Amazon Sales Data Visualization
amazon-sales-data-analysis data-analysis data-preprocessing data-visualization machine-learning
Last synced: 18 May 2026
https://github.com/thoratstuti/power-bi-dashboards-for-finance-analysis
Power BI can group and gather information from multiple systems to present the whole picture of business data analytics in one “single view”. It made the staff of the financial institution work in a collective digital platform, where they can compute and share relevant data.
data-analysis data-visualizations excel graph pie-chart powerbi
Last synced: 07 Mar 2026
https://github.com/steviecurran/multi-dish
Scripts to reduce data from large radio telescopes (GMRT, VLA)
data-analysis interferometer pipeline radio-astronomy telescopes
Last synced: 09 May 2026
https://github.com/poglolopez/prueba_tecnica_inlaze
Este repositorio muestra mis habilidades en análisis de datos a través de una prueba técnica para Inlaze. Incluye flujos de trabajo con Python, SQLite y Power BI para analizar el comportamiento de jugadores, depósitos y rendimiento de fuentes de tráfico, destacando eficiencia operativa e información estratégica.
data-analysis data-v etl jupyter powerbi python sqlite
Last synced: 26 Feb 2025
https://github.com/hcrlau/cyclistic-bike-share-analysis
Google Data Analytics Capstone Project
bigquery cyclistic-bike-share-analysis-case-study data-analysis data-visualization sql tableau
Last synced: 05 Apr 2025
https://github.com/mahmoudwal27/brazilian_ecommerce
This project explores and cleans the Olist Brazilian E-Commerce dataset using Python (Pandas) to prepare it for Power BI visualization. The process includes loading data, performing exploratory analysis, handling missing values and duplicates, formatting key columns, and exporting clean datasets.
analytics data-analysis data-analysis-python google-cloud python
Last synced: 16 May 2026