Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-07-01 00:07:23 UTC
- JSON Representation
https://github.com/darkdk123/house-valuation-model
A Challenge Project in a Boot-Camp to create a ML Model to predict the prices of houses in Boston Massachusetts from multiple parameters Using Multivariable Regression.
data-analysis data-science data-visualization matplotlib-pyplot multivariate-regression predictive-modeling statistics
Last synced: 07 Jul 2025
https://github.com/simranrayait51/internshala-ds-projects
Projects from the Internshala Data Science course, showcasing my skills in Excel, SQL, Python, and Tableau for data manipulation, analysis, and visualization.
data-analysis data-science data-visualization excel internshala-project pgc postgresql python sql tableau
Last synced: 17 May 2026
https://github.com/iamsainikhil/data-visualization
Visualization of Web data using Python
data-analysis data-visualization python webscraping
Last synced: 13 Jun 2026
https://github.com/srvcl/lung-cancer-survival-analysis
Data Cleaning of a dataset and Survival Analysis in R Language
data-analysis data-science data-visualization r survival-analysis
Last synced: 11 May 2026
https://github.com/vikpires/ds_tips-dataset
Projeto individual do bootcamp de ciência de dados avanti 2024.2, com o objetivo de analisar e observar padrões no conjunto de dados "Tips".
data-analysis data-science data-visualization exploratory-data-analysis jupyter-notebook matplotlib numpy pandas python seaborn tips
Last synced: 17 Sep 2025
https://github.com/guilherme-marcello/r-data-analysis-piechart
Reading RDS files, processing and presentation in pie charts
data-analysis data-visualization pie-chart r
Last synced: 13 Jul 2025
https://github.com/manishbisht/machine-learning
Machine Learning
data-analysis data-mining machine-learning machine-learning-algorithms machinelearning numpy pandas python
Last synced: 13 Apr 2026
https://github.com/jhermienpaul/google-data-analytics-program
Hands-on learning materials from the 8-course Google Data Analytics Professional Certificate program, covering foundational data skills, tools, and real-world business problem-solving
bigquery dashboard data-analysis data-analytics data-modeling data-storytelling data-visualization data-wrangling descriptive-analytics diagnostic-analytics etl-pipeline r-programming rstudio sql tableau
Last synced: 13 Jul 2025
https://github.com/myktorijus/retention-cohort
Extracted cohort data using SQL in BigQuery focusing on weekly retention from week 0 to week 6
bigquery data-analysis data-visualization powerbi sql
Last synced: 13 Jul 2025
https://github.com/madi-s/tennispredictor
Program to predict outcomes of major tennis matches.
data-analysis prediction-algorithm python scraper tennis webdriver
Last synced: 06 Jul 2025
https://github.com/gappeah/credit-card-transactions-fraud-detection-project
The Credit Card Transactions Fraud Detection Project repository is designed to analyse and detect fraudulent transactions in credit card data.
Last synced: 12 Jul 2025
https://github.com/jabulente/kruskall-wallis-test
This repository contain project that provides a reusable Python function to perform the Kruskal-Wallis H-test across multiple continuous variables, grouped by a categorical feature
data-analysis data-science eda hypothesis-tests kruskal-wallis kruskals-algorithm scipy-stats statistics
Last synced: 22 Jul 2025
https://github.com/debjyotisaha/hands-on-sql
My Learning Path towards SQL
cte data data-analysis insert joins select sql subqueries update
Last synced: 04 Apr 2025
https://github.com/logan722/employee-management-system
An Employee Management System
data-analysis problem-solving pycharm-ide python-library
Last synced: 06 Apr 2025
https://github.com/ireneflorez/e_commerce_a_b_test_analysis
website A/B test data analysis
data-analysis jupyter-notebook matplotlib numpy pandas python statsmodels
Last synced: 14 Apr 2026
https://github.com/farzeennimran/fashion-mnist-dataset-classification-using-neural-network
Implementation of a Multi-layer Perceptron classifier with hyperparameter tuning and k-fold cross-validation employing GridSearchCV for classifying images on the Fashion MNIST dataset 👗👚👖
artificial-intelligence data-analysis data-mining data-science dataset deep-learning fashion-mnist-dataset gridsearchcv hyperparameter-tuning kfold-cross-validation machine-learning multilayer-perceptron-network neural-network numpy pandas python sklearn
Last synced: 03 Apr 2026
https://github.com/tathithienthanh/womenfashionproductrecommendationsystem
Build a recommendation system for recommending woman fashion's products on e-commerce platforms
content-based data-analysis data-collection data-processing data-visualization dfd e-commerce erd jupyter-notebook lazada nlp python recommender-system scraping-websites sql system-design tiki vietnamese visualization
Last synced: 16 May 2026
https://github.com/carvalhoandre/coletor-tweets
Criado para coletar e armazenar tweets utilizando a API do Twitter. Inicialmente inspirado no caso de uso do livro Um Voluntário na Campanha de Obama, este projeto tem como objetivo demonstrar a importância do monitoramento no X. O coletor permite buscar tweets sobre qualquer termo desejado
data-analysis mongodb python twiter-analysis twitter
Last synced: 19 May 2026
https://github.com/prasad-chavan1/bank_data_analysis_r
Bank data analysis in R language
data data-analysis data-science r
Last synced: 24 Feb 2025
https://github.com/nuccitheboss/jespipe-plugin
Your go to spot for creating and using Jespipe plugins.
adversarial-attacks data-analysis data-manipulation data-visualization machine-learning machine-learning-algorithms
Last synced: 23 Jun 2025
https://github.com/datastalker/survival-cox
This repository contains an R script for performing survival analysis on breast cancer surgery data from the University of Chicago's Billings Hospital. The analysis includes Kaplan-Meier estimation and Cox Proportional Hazards modeling to assess patient survival.
breast-cancer-prediction cox-model data-analysis data-science data-visualization epidemiology kaplan-meier r survival-analysis
Last synced: 02 Apr 2025
https://github.com/chaganti-reddy/ai-prototype-customer-segmentation
Artificial Intelligence Prototype product based model for Customer Segmentation in E-Commerce Industry.
artificial-intelligence cluster-analysis customer-segmentation data-analysis machine-learning product-based prototype
Last synced: 13 Mar 2025
https://github.com/ujjwalll/econometrics_analysis_of_india_gdp_misestimation
A Econometric Analysis of the India's GDP to determine whether their is any flaw in India's GDP, as quoted by Dr. Arvind Subhramanium.
coefficient-estimates data-analysis econometrics economics gdp india r statistics
Last synced: 31 Oct 2025
https://github.com/jwt218/sinc
MATLAB Standardization and Isotope Normalization for CSIA (with integrated correction and uncertainty quantification)
data-analysis geochemistry isotopes matlab
Last synced: 23 Jun 2025
https://github.com/victoorv/risques_financiers
Classification pour l'attribution ou non de prêts financiers et prédiction de scores de risque.
classification data-analysis data-science data-visualization hyperparameter-tuning loan-prediction loan-prediction-analysis loanapproval machine-learning machine-learning-algorithms python regression regression-models risk-analysis risk-score scoring scoring-models statistical-analysis statistical-tests statistics
Last synced: 19 May 2026
https://github.com/sweta-kaundilya/911-calls-capstone-project
For this capstone project we will be analyzing some 911 call data from Kaggle.
data data-analysis data-visualization jupyter-notebook matplotlib numpy pandas python seaborn
Last synced: 28 Apr 2026
https://github.com/sweta-kaundilya/sql_projects_data_analytics
This repository contains SQL porfolio projects
data-analysis mysql-database mysql-workbench
Last synced: 10 Sep 2025
https://github.com/al-ogr/sf_pr2_job_analysis_hh_sql
SkillFactory DataScience PROJECT-2. Анализ вакансий из HeadHunter
data-analysis data-science ipynb plotly python sql
Last synced: 19 May 2026
https://github.com/hari7261/data-visualization
Python-based application built using CustomTkinter for the graphical user interface (GUI) and Matplotlib for data visualization. It allows users to import datasets, perform real-time data visualization, and analyze data using various chart types and machine learning techniques.
data-analysis data-visualization export hari7261 import python realtime-visualization
Last synced: 17 Jun 2025
https://github.com/lmuffato/jiboia
Jiboia is a Python package for automatically normalizing and optimizing DataFrames efficiently.
data-analysis data-science dataframe normalization pandas python
Last synced: 19 May 2026
https://github.com/jofaval/boston-housing
Regression Analysis into the Boston Housing in-demand pricing in 1978
boston-housing data-analysis data-science data-visualization machine-learning python regression
Last synced: 16 May 2026
https://github.com/prawy126/data-analysis
ai data-analysis data-visualization python python3 tinker
Last synced: 14 Jun 2025
https://github.com/codeonthespectrum/web-scrap
Este projeto realiza o web scraping da Wikipédia para obter dados sobre os municípios mais populosos do estado do Rio de Janeiro.
data-analysis data-visualization webscraping
Last synced: 16 Feb 2026
https://github.com/nikbarb810/motif_detection_in_r
Motif Detection for TFBS in Glycolysis and Glyconeogenesis pathways
bioinformatics data-analysis null-hypothesis pwm r
Last synced: 23 Jun 2025
https://github.com/rociobenitez/airbnb-data-mining
Análisis detallado y modelado predictivo de alojamientos en Madrid utilizando técnicas de Big Data y estadística en R, enfocado en optimización de datos y predicción de características de propiedades.
airbnb data-analysis data-mining estadistica prediction-model predictive-analytics predictive-modeling qmd r rstudio
Last synced: 23 Jun 2025
https://github.com/joe-stifler/llm-sig-playground
This repository is a collaborative space for MSc Earth Science students at Imperial College London to experiment with and apply Large Language Models (LLMs) to real-world Earth Science problems. Follows below the persona playground link.
data-analysis earth-science llms machine-learning research-automation
Last synced: 29 Mar 2025
https://github.com/mansiikumarii/mysql
A curated collection of MySQL scripts covering DDL, DML, and DRL operations. Ideal for beginners to practice and understand core SQL concepts.
backend data-analysis data-modeling database database-integration database-management database-performance database-schema mysql mysql-admin mysql-database orm php-mysql query-optimization rdbms sql sql-query sql-script stored-procedure
Last synced: 19 May 2026
https://github.com/jpcadena/pharmacy-prices-prediction
Prices prediction project for Pharmacy products.
artificial-intelligence data-analysis data-science deep-learning keras machine-learning machine-learning-models neural-network numpy pandas pharmacy prediction price-prediction pylint python scikit-learn supervised-learning tensorflow
Last synced: 07 Apr 2026
https://github.com/the-pinbo/dimensionalityredux-pca-vs-autoencoders
Comparative study of PCA and Autoencoders for effective dimensionality reduction, assessed through PSNR and SSIM metrics.
autoencoder-mnist autoencoders data-analysis dimensionality-reduction image-compression mnist neural-networks pca psnr ssim
Last synced: 13 May 2025
https://github.com/julie-fliorko/rockbuster-insights-sql-project
Data analysis using PostgreSQL to help Rockbuster Stealth LLC identify revenue trends, customer insights, and rental behavior patterns.
Last synced: 22 Jul 2025
https://github.com/aniruddha-biswas/jpmorgan-chase-excel-internship
JPMorgan Chase & Co.'s Excel Skills on Forage Virtual Internship
conditional-formatting data-analysis data-cleaning data-visualization excel excel-dashboard macos pivot-tables power-query shortcuts storytelling vba-excel
Last synced: 01 Apr 2025
https://github.com/shubh-bharadwaj/zomato-dataset-analysis
Zomato Dataset Analysis
data-analysis data-science data-visualization numpy pandas python sklearn
Last synced: 07 Apr 2026
https://github.com/jayita11/eda-student-exam-performance
This project performs Exploratory Data Analysis (EDA) and hypothesis testing on student performance data. It explores trends based on attributes like gender, race/ethnicity, parental education, lunch type, and test preparation course completion.
data-analysis eda hypothesis-testing matplotlib pandas python seaborn statsmodels student-performance-analysis
Last synced: 11 Jul 2025
https://github.com/vigneshrocky262/powersub-demo-1434
🔧 Streamline your workflow with powersub-demo-1434, a simple tool for managing and automating tasks efficiently.
api automation coding-sandbox collaborative-tools data-analysis demo dynamic-programming machine-learning neural-networks performance-testing powersub project-management python software-development visualization
Last synced: 05 May 2026
https://github.com/jhrcook/protein-language-models
Experimenting with protein language model predictions
data-analysis protein-language-model variant-effect-prediction
Last synced: 28 May 2026
https://github.com/amishidesai04/interactive-data-visualisation-tool
A Java-based application leveraging JavaFX to create dynamic and interactive charts, including pie charts, bar charts, and line graphs. Ideal for visualizing various datasets, this tool offers customizable features and a user-friendly interface. Easily input and manage data, customize chart styles, and observe trends and patterns effectively.
charts data-analysis data-visualisation data-visualization-project gui java javafx visualization-tools
Last synced: 17 Apr 2026
https://github.com/karaniwachira/baby_names_analysis
Data Analysis: Baby Names Exploration
data data-analysis quarto quartopub r rstats tidyverse-ggplot2
Last synced: 22 Jun 2025
https://github.com/mumtaz4118/employee-satisfaction-and-attrition
Analysis of attrition based on environmental satisfaction from a Kaggle dataset.
data data-analysis data-science data-visualization ipynb jupyter-notebook machine-learning python statistical-analysis statistical-models
Last synced: 19 May 2026
https://github.com/balajimohan18/loan-clustering-datascience-project
This project uses Machine Learning to Cluster loan together based on their similarities. The project uses a dataeset of loan application which includes information about the Loan amount and Balance. The project then use the clustering algorithm to group the loan together based on the similarities.
clustering-algorithm data-analysis data-science data-visualization eda kmeans-clustering machine-learning sql unsupervised-learning
Last synced: 27 Jul 2025
https://github.com/vedantshi/tableau-bike-data-dashboard
London Bike Rides Analysis explores bike usage patterns using data visualization and machine learning. It identifies trends through a dynamic moving average, analyzes weather impact with heatmaps, and provides actionable insights via an interactive Tableau dashboard. Tools: Python, Tableau.
data-analysis data-visualization python tableau weather-data
Last synced: 16 May 2026
https://github.com/andrewzgheib/football-database-analysis
Football database utilizing PostgreSQL and Pandas for data management, with PowerBI for intuitive KPI visualization
data-analysis data-visualization database pandas pgsql postgr powerbi sql
Last synced: 04 Apr 2025
https://github.com/nerooc/device-downtime-detection
Repozytorium dotyczące projektu z przedmiotu "Sztuczne Sieci Neuronowe"
data-analysis detection-model recurrent-neural-networks
Last synced: 22 Mar 2025
https://github.com/timkong21/siemens-mobility-operations-industrial-engineer-simulation
Operations Industrial Engineer job simulation with Siemens Mobility. Includes time study analysis to identify assembly bottlenecks (Task 1) and a proposed layout redesign to improve efficiency without automation (Task 2).
data-analysis forage industrial-engineering job-simulation manufacturing process-improvement production-engineering python siemens time-analysis
Last synced: 19 May 2026
https://github.com/m4tice/qm_project
Bicycle project crowd evaluation.
data-analysis data-engineering data-visualization
Last synced: 16 Mar 2025
https://github.com/lopez86/datascienceexamples
Examples of various data science & data analysis topics using various sources of data.
data-analysis data-science pandas scikit-learn tutorial visualization
Last synced: 13 Apr 2026
https://github.com/sharduljunagade/human-activity-recognition
This repository contains the code for the Assignment-1 of the course ES 335: Machine Learning 2024 at IIT Gandhinagar taught by Prof. Nipun Batra.
data-analysis data-collection decision-trees groq-api human-activity-recognition jupyter langchain-python machine-learning pandas prompt-engineering python sklearn tsfel
Last synced: 08 Apr 2026
https://github.com/parthds02/pizza_sales_sql
SQL project analyzing pizza sales data. Includes creating tables, executing queries, and solving basic to advanced analytical questions to derive insights from sales data.
analytics data-analysis data-science pizza-sales sql sql-query
Last synced: 04 Mar 2026
https://github.com/drisskhattabi6/exploratory-data-analysis-projects
This Repo contains My Exploratory Data Analysis Projects for many datasets
data-analysis data-preprocessing data-visualization datasets diabetes-prediction eda exploratory-data-analysis iris-dataset
Last synced: 26 Jun 2025
https://github.com/maxbiostat/diehl_ebola_cell_2016
supplementary code and data to Diehl et al, 2016 (Cell)
data-analysis data-visualization disease-spread ebola mutation
Last synced: 11 Jul 2025
https://github.com/shubhamgoyal575/credit-card-fraud-detection
📌 Credit Card Fraud Detection using Machine Learning This project focuses on detecting fraudulent credit card transactions using machine learning models like Random Forest, XGBoost, and Deep Learning. The dataset is preprocessed to handle class imbalance, and multiple models are evaluated based on ROC AUC Score and F1 Score.
adaboost-classifier artificial-neural-networks credit-card-fraud data-analysis data-cleaning data-preprocessing data-science data-visualization deep-learning exploratory-data-analysis lightgbm machine-learning machine-learning-algorithms random-forest-classifer scikit-learn tensorflow xgboost
Last synced: 08 Feb 2026
https://github.com/purushothamadluru/atlantic-gdp-job-demand-analysis
data-analysis data-visualization powerbi
Last synced: 17 Feb 2026
https://github.com/swatisinghit/e-commerce-trend-analysis-for-target
An exploratory and in-depth study of the E-Commerce sales data for a Brazilian store using SQL.
bigquery data-analysis mysql sql
Last synced: 19 May 2026
https://github.com/amarlearning/exploring-the-evolution-of-linux
Data Analysis about the development of the Linux operating system by exploring its Git repository history.
cleaning-data data data-analysis data-wrangling datacamp first-commit git-history linux
Last synced: 12 May 2026
https://github.com/imnotamr/datasets-used
A comprehensive collection of datasets for machine learning and data science projects, covering topics from advertising and sales to health and sports analytics
ai classification data-analysis data-science data-visualization deep-learning jupyter-notebook machine-learning models python regression-models
Last synced: 19 May 2026
https://github.com/mulukensholaye/spark_kafka_streaming_csv
Real-time streaming data analysis pipeline with integrating apache spark's streaming library to read records from kafka topic
apache-kafka apache-spark data-analysis python3 realtime-messaging
Last synced: 19 May 2026
https://github.com/hawmex/aut_data_and_information_analysis_project
This repository contains the files of my project for the "Data & Information Analysis" course at AUT (Tehran Polytechnic).
data-analysis data-science k-means outlier-detection python
Last synced: 19 May 2026
https://github.com/olympus-terminal/data-processing
Data analysis and processing tools
automation data-analysis data-processing data-science etl machine-learning pdf-extraction python r research statistics web-scraping
Last synced: 16 May 2026
https://github.com/devexpress-examples/wpf-pivotgrid-how-to-display-underlying-data
This example demonstrates how to obtain the records from the control's underlying data source for a selected cell or multiple selected cells.
data-analysis dotnet dxpivotgrid pivot-grid pivot-grid-for-wpf wpf
Last synced: 19 May 2026
https://github.com/ashwin331133/hospital_allpatients_waitinglist_data
This Power BI project analyzes patient waiting lists across various medical specialties and case types (Day Case, Inpatient, Outpatient). The goal is to gain insights to improve healthcare management and resource allocation.
data-analysis data-visualization powerbi
Last synced: 03 Jan 2026
https://github.com/sakan811/gachascope
Evaluate the cost-effectiveness of various in-app purchase bundles available in gacha games.
data data-analysis data-visualization game honkai honkai-star-rail honkai-starrail hoyoverse javascript nextjs tableau tableau-public typescript wutheringwaves
Last synced: 04 May 2026
https://github.com/adnanrahin/nlp-with-disaster-tweets
Kaggle Competition: Predict which Tweets are about real disasters and which ones are not. Natural Language Processing.
data-analysis data-science data-visualization kaggle-competition machine-learning natural-language-processing regular-expression tweets
Last synced: 21 Jun 2025
https://github.com/sukhitashvili/pca_tutorial
PCA algorithm from scrach, using only matrix-vector multiplications
data-analysis data-science data-visualization machine-learning-algorithms pca
Last synced: 29 Mar 2025
https://github.com/prady2309/stock-analysis
Analysis on the stock prices of Apple, Google, Microsoft and Amazon
data-analysis data-science data-visualization python stock-market
Last synced: 19 May 2026
https://github.com/pkjjoshi/restaurants-analysis
Performed beginner-level EDA on a restaurant dataset using Python. Analyzed top cuisines, city-wise ratings, price ranges, and online delivery impact using Pandas and Matplotlib. Includes 4 well-structured notebooks with visual insights.
beginner-project data-analysis data-visualization exploratory-data-analysis jupyter-notebook pandas python restaurant-data seaborn
Last synced: 21 Jun 2025
https://github.com/teditae/data-analysis-with-pandas
Mini data science projects focused on Pandas-powered analysis.
data-analysis data-manipulation pandas python
Last synced: 30 Apr 2026
https://github.com/sebastianurdaneguibisalaya/colocaciones-de-credito-fondo-mivivienda-peru
Exploro las Colocaciones de Crédito del Fondo MIVIVIENDA S.A. entre 2018 y 2022, con un conjunto de datos descargado del Portal Nacional de Datos Abiertos del Perú. 🏠
data-analysis jupyter-notebook python
Last synced: 24 Feb 2025
https://github.com/atharvkadammm/suicide-prediction-system
A machine learning project predicting suicide risk based on multiple socio-economic and environmental factors using data mining techniques.
csv data-analysis data-science data-visualization datamining exploratory-data-analysis feature-engineering machine-learnin matplotlib mental-health numpy pandas riskassesment seaborn sklearn suicide-prediction supervised-
Last synced: 01 Jul 2025
https://github.com/k8hertweck/intro_r
data-analysis data-analysis-in-r r tidyverse training
Last synced: 29 May 2026
https://github.com/atharvkadammm/calmlytic
An end-to-end machine learning project that predicts anxiety severity using classification models (Naive Bayes, Decision Tree, SVM, Logistic Regression, XGBoost), based on lifestyle, health, and behavioral features.
anxiety-prediction classification csv data-analysis data-preprocessing-and-cleaning data-science data-visualization ensemble-learning logistic-regression machine-learning-algorithms matplotlib mental-health numpy pandas python sci-kit-learn seaborn supervised-learning svm xgboost
Last synced: 21 Jun 2025
https://github.com/vatshayan/ip-address-data-analysis-
Extraction of 100's of IP Address and using Machine Learning algorithm for detecting threats
data data-analysis data-science data-visualization dataset ip ipconfig ipv4-address jupyter-notebook machine-learning machine-learning-algorithms supervised-machine-learning unsupervised-learning
Last synced: 15 Jul 2025
https://github.com/nivasharmaa/friskwatch
A Java program for analyzing stop-and-frisk data from the NYPD. Features data import, organization, and statistical analysis to compare occurrences during and after policy implementation.
data-analysis data-visualization dataprocessing datascience file-io java java-oop nypd-data
Last synced: 19 May 2026
https://github.com/shellynagar27/marketing-content-performance-analysis
Analyzed 2024 social media campaign data from TikTok, Instagram, LinkedIn, and X.com using Power BI to uncover performance trends across platforms, content types, and regions. Built an interactive dashboard to drive insights on engagement, optimal posting times, and content strategy.
data-analysis data-modelling data-visualization excel figma marketing-analytics powerbi powerquery wireframing
Last synced: 26 Jun 2025
https://github.com/prakshal0809/power-bi-analytics-dashboard
I have developed a dashboard in Power BI utilizing data from an Excel file. The dashboard effectively visualizes and analyzes the given data.
Last synced: 22 Feb 2026
https://github.com/kevin-rsj/sectores_economicos_covid-19
Análisis Exploratorio de Datos (EDA): Comportamiento de Sectores Económicos antes, durante y después de la Pandemia de COVID-19 (2019-2022)
data-analysis financial-analysis pandemic-analysis python stock-market time-series visualization yahoo-finance
Last synced: 20 May 2026
https://github.com/ryuzen6/kaggle-series
This is a series of Machine Learning/Deep Learning Models made for practice.
artificial-intelligence data-analysis data-science deep-learning machine-learning python3
Last synced: 20 May 2026
https://github.com/evamaerey/ma206distributions
data-analysis data-science ggplot2 statistics
Last synced: 22 Jul 2025
https://github.com/astrojarhead/irafscripts
IRAF cl scripts
astronomy data-analysis image-processing iraf scripts
Last synced: 12 Jan 2026
https://github.com/badranalyst/restaurant-reviews-sentiment-analysis-nlp-case-study
This project analyzes restaurant reviews using Natural Language Processing (NLP) for sentiment analysis. It covers data exploration, pre-processing (NLTK text cleaning), model building, prediction, and deployment. The goal is to predict sentiment from reviews using Python libraries such as Pandas, NumPy, Matplotlib, and Seaborn.
data-analysis data-science eda exploratory-data-analysis matplotlib-pyplot model model-building numpy pandas pre-processing predictive-modeling python seaborn
Last synced: 13 Apr 2026
https://github.com/sharoonjoseph321/samsung_stock_prediction
Predicting future price of Samsung stock, using machine learning , scikit learn and pandas
algorithms data-analysis data-science jupyter-notebook machine-learning machine-learning-algorithms prediction predictive-analytics predictive-modeling python stock-price-prediction supervised-learning
Last synced: 06 Apr 2025
https://github.com/rezowanrahat/netflix_analysis
Data analysis of Netflix content using Python, Pandas, and Seaborn
data-analysis data-visualization netflix pandas python
Last synced: 07 May 2026
https://github.com/srinibas-masanta/hotel-revenue-analysis-dashboard
This project focuses on analyzing hotel booking data to uncover key metrics and insights that drive revenue management decisions. By creating an interactive Power BI dashboard, the project aims to improve strategic decision-making, optimize occupancy rates, and enhance overall financial performance within the hospitality industry.
business-analytics data-analysis data-science data-visualization dax-functions hospitality powerbi
Last synced: 12 Jan 2026
https://github.com/yuvrajsaraogi/uber-data-analysis-using-machine-learning
This repository contains Uber Data Analysis using various Machine learning Algorithms
data-analysis data-science exploratory-data-analysis linear-regression logistic-regression machine-learning random-forest uber-data-analysis
Last synced: 24 Aug 2025
https://github.com/kiran-kumar-k3/sales-performance-dashboard
The Sales Performance Dashboard is an interactive Python-based web application that visualizes and analyzes sales data, providing actionable insights through dynamic charts and metrics.
data-analysis python streamlit
Last synced: 20 May 2026
https://github.com/kushagrakumar04/visual-age-distribution
A Bar chart or histogram to visually depict the distribution of a categorical or continuous variable, such as the age distribution or gender composition within a population. This graphical representation provides a clear and insightful overview of the data's patterns and trends.
data-analysis data-science google-colab
Last synced: 21 Jun 2025
https://github.com/jpcadena/malware-analysis
Analysis of malware signatures and their associated Common Vulnerabilities and Exposures (CVEs)
black common-vulnerabilities-and-exposures cve-search data-analysis data-engineering data-reporting data-visualization isort malware-analysis matplotlib mypy numpy pandas plotly poetry pre-commit pydantic python ruff seaborn
Last synced: 03 Mar 2026