Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-07-02 00:07:33 UTC
https://github.com/florence-nyokabi/house-power-consumption
Machine Learning: Exploring Regression Analysis
data-analysis data-cleaning data-science data-visualization feature-engineering jupyter-notebook jupyterlab machine-learning pandas-python regression-analysis regression-models
Last synced: 05 Jun 2026
https://github.com/nikhilfuke1/a-b-testing-and-regression-analysis-python
A Python statistics project involving data analysis, visualization, A/B testing, and regression analysis to determine the best-performing platform.
ab-testing data-analysis hypothesis-testing libraries python regression-analysis statistics visualization
Last synced: 21 Apr 2026
https://github.com/danpoynor/data-analysis-spotify-songs-2010-2019
Spotify data analysis for songs between 2010 and 2019 using Jupyter Notebooks including pandas and Seaborn plots.
data-analysis jupyter-notebook matplotlib pandas-dataframe python3 seaborn-plots spotify
Last synced: 22 Apr 2026
https://github.com/mhuwaimel/data-analysis-of-students-results-in-qiyas
Analysis of student performance data from Qiyas (قياس), the Saudi Arabian National Center for Assessment
data-analysis jupyter-notebook python
Last synced: 22 Apr 2026
https://github.com/kgelli/apple-data-analysis---apache-spark
Modular ETL pipeline for analyzing Apple product purchase patterns using Apache Spark on Databricks with factory design patterns.
apache-spark data-analysis databricks delta-lake etl-pipeline factory-pattern pyspark
Last synced: 22 Apr 2026
https://github.com/prgermux/yield-reporter
This Python application provides a graphical user interface (GUI) for analyzing and visualizing production data from various machines. It uses the PyQt5 framework for the GUI and Matplotlib for plotting data.
automation data-analysis python reporting
Last synced: 22 Apr 2026
https://github.com/nurulashraf/polynomial-regression-manufacturing
A Python project implementing polynomial regression to analyse and predict manufacturing-related data. Features include data preprocessing, model training, and visualisation of results. Ideal for exploring machine learning applications in manufacturing process optimisation.
data-analysis data-visualization machine-learning manufacturing polynomial-regression predictive-modeling process-optimization python regression-models scikit-learn
Last synced: 16 Apr 2026
https://github.com/rorrell/lifeexpectancy
A Jupyter Notebook where I create a chart with two line plots on it to check out the life expectancy of men vs. women from 1900-2018
data-analysis data-visualization jupyter-notebook python3
Last synced: 22 Apr 2026
https://github.com/codesaadumair/data-science-monorepo
Comprehensive Data Science monorepo featuring EDA, Machine Learning, Preprocessing, Feature Engineering, and Visualization projects with Jupyter notebooks and Python.
data-analysis data-science data-science-projects data-visualization eda jupyter-notebook jupyterlab machine-learning python
Last synced: 01 May 2026
https://github.com/abdoomohamedd/python-data-analysis-projects
A collection of data analysis projects that I have worked on. Each project involves cleaning data, performing exploratory data analysis (EDA), and creating visualizations to extract insights. The projects utilize various Python libraries, including pandas, numpy, matp
data-analysis data-analysis-python data-cleaning data-visualization matplotlib matplotlib-pyplot numpy pandas python
Last synced: 01 May 2026
https://github.com/rafath0ssain/predihome
Data analysis using economic factors affecting living conditions across Canadian provinces.
data-analysis data-visualization dplyr ggplot2 graph kaggle linear-regression prediction-model r shiny tidyr
Last synced: 01 May 2026
https://github.com/leftcoastnerdgirl/introduction_to_pandas
This project introduces the use of Python in a Jupyter Notebook.
analytics budget-analysis budget-planner-tool budget-planning data-analysis dataframes jupyter-notebook pandas pandas-python python
Last synced: 01 May 2026
https://github.com/vedantshi/stock-price-prediction-for-maang-companies
This project utilizes Long Short-Term Memory (LSTM) networks to forecast stock prices. It includes steps for data preprocessing, model training, and visualization of predictions using Python in Jupyter Notebook. The project demonstrates proficiency in machine learning, data analysis, and Python programming.
data-analysis data-visualization lstm machine-learning python stock-price-prediction
Last synced: 01 May 2026
https://github.com/guptakushal03/whatsapp-chat-analyser
The WhatsApp Chat Analyzer is a Python-based tool built with Streamlit for analyzing WhatsApp chat data. It provides insights such as total messages, word count, media shared, links shared, monthly activity timeline, most active users, activity maps, and word clouds.
chat-analysis data-analysis data-visualization python streamlit text-processing whatsapp word-cloud
Last synced: 01 May 2026
https://github.com/ujjwalll/get-that-flair
A repository for a project that detects the flair of Reddit posts from their links. A working model is available at https://get-that-flair.herokuapp.com/
data-analysis data-visualization django-application herokuapp machine-learning naive-bayes-classifier praw-reddit python3 random-forest reddit-api sentiment-analysis topic-modeling
Last synced: 01 May 2026
https://github.com/sanikamal/machine-learning-atoz
Beginner-friendly machine learning tutorials and mini-projects.
collaborative-filtering data-analysis data-visualization decision-trees kmeans-clustering knn machine-learning machine-learning-algorithms recommender-system regression svm
Last synced: 08 Jun 2026
https://github.com/more-joao/color-distance-luminance
Data analysis project that aims to establish a relation between the Canberra distance between white and any given color in the RGB colorspace and its luminance.
canberra-distance data-analysis luminance python r rgb
Last synced: 02 May 2026
https://github.com/waseemofficial/ml-practice
ML Practice
data data-analysis jupyter-notebook machine-learning ml python
Last synced: 02 May 2026
https://github.com/teja-1403/ignosis-tech-ml-assignment
Analysis of transaction data to identify the most profitable products and key customer segments, providing insights for targeted marketing strategies.
customer-segmentation data-analysis data-visualization machine-learning marketing-strategy python
Last synced: 02 May 2026
https://github.com/suma-aljudaia/my-portfolio
Suma Aljudaia | Portfolio – AI & Data Analysis Enthusiast
ai css data-analysis html machine-learning portfolio
Last synced: 02 May 2026
https://github.com/hamzazafar10/movie-recommendation-system
Content based movie recommendation system using cosine similarity.
cosine-similarity data-analysis data-preprocessing data-science data-structures data-visualization jupyter-notebook machine-learning movie-recommendation python
Last synced: 02 May 2026
https://github.com/amishidesai04/emergency-calls-data-analysis-project
Welcome to the Emergency Calls Data Analysis project repository. This project is dedicated to extracting, processing, and visualizing data from the "Emergency – 911 Calls, Montgomery County" dataset, sourced from Kaggle. The main objective is to analyze trends in emergency calls in Montgomery County, Pennsylvania, spanning multiple years.
analysis data-analysis data-extraction data-processing data-science data-visualization numpy pandas python seaborn
Last synced: 02 May 2026
https://github.com/rorrell/employmentdata
A Jupyter Notebook where I use group by to analyze the average unemployment rate by year
data-analysis data-visualization jupyter-notebook python3
Last synced: 02 May 2026
https://github.com/fatihilhan42/spotify-songs-recommendations-system_with_python
We developed a song recommendation system using data from our Spotify song dataset. The dataset and other applications are given in the description. Have a nice day.
data-analysis data-science data-visualization jupyter-notebook python recommendation-engine recommendation-system
Last synced: 02 May 2026
https://github.com/leticiamilan/santander-tech-data-science
This repository contains the projects developed during the Data Science course, a partnership between Ada Tech and Santander Open Academy. The course is divided into several modules, each focused on a fundamental aspect of data science.
ada-tech calculus data-analysis data-science letscode python santander
Last synced: 09 Jun 2026
https://github.com/bhaveshbhakta/gold-price-prediction-using-ml
Gold Price Prediction
data-analysis data-visualization gold-price-prediction machine-learning python
Last synced: 02 May 2026
https://github.com/dissorial/prx21_erikz
Analysis of self-tracked data: interactive visualizations & predictive algorithms
analytics data-analysis data-science data-visualization machine-learning matplotlib pandas python python3 visualization
Last synced: 02 May 2026
https://github.com/mehanix/dhrw
🎢 IaaS visual editor to create & deploy data processing pipelines - python, rmq, react, meteorjs
computational-graph computational-graphs data-analysis data-engineering data-pipeline data-pipelines data-processing data-processing-and-analysis data-processing-pipelines data-processing-system data-science data-visualization docker-compose good-first-issue help-wanted meteorjs-application rabbitmq react-flow
Last synced: 02 May 2026
https://github.com/sarah-marion/sovereign-osint-toolkit
Sovereign OSINT Toolkit - Advanced, self-hosted intelligence platform for security researchers and investigators. Ethical, private and production-ready.
correlation-engine cybersecurity data-analysis docker fastapi infosec intelligence investigation open-source osint privacy python3 security-research security-tools threat-intelligence
Last synced: 02 May 2026
https://github.com/maddieemihle/pandas-challenge
Python analysis to create and manipulate school and standardized test data. Scores are calculated, grouped, aggregated, summarized, and organized using pandas.
Last synced: 09 Jun 2026
https://github.com/robertpaulp/expenseadvisor
HackITall 2023 Hackathon
chatgpt-api data-analysis data-processing python scrapping-python
Last synced: 03 May 2026
https://github.com/jofaval/red-wine-quality
Data analysis of Portuguese red wine quality in 2009
classification data-analysis data-science data-visualization google-colab kaggle logistic-regression machine-learning python scikit-learn wine-quality xgboost
Last synced: 03 May 2026
https://github.com/bhavna-kale/cars-eda-project
Project analyzing used car market data to identify high-impact price drivers and depreciation curves, presented through an interactive web application.
data-analysis excel matplotlib numpy pandas python3 searborn streamlit
Last synced: 03 May 2026
https://github.com/ahmedhosssam/lesser_pandas
Pandas-like Data Analysis library in C++
cpp data-analysis data-science pandas
Last synced: 03 May 2026
https://github.com/stepankuzmin/machine-learning-data-analysis
My homework for the Coursera Machine Learning and Data Analysis specialization
coursera data-analysis jupiter machine-learning python
Last synced: 03 May 2026
https://github.com/fatihilhan42/tourist_analysis_in_turkey_with_python
In this project, the number of tourists visiting Turkey between 2008 and 2021 was analyzed. The dataset, which you can find in the repository, was first organized using data cleaning algorithms. The cleaned data was then plotted using data visualization algorithms.
data-analysis data-cleaning data-science data-visualization jupyter-notebook python
Last synced: 03 May 2026
https://github.com/chaedoll/analysis-python-foreignerinfra
Report on improving infrastructure for foreigners
data-analysis python team-project
Last synced: 03 May 2026
https://github.com/cassiofb-dev/projetos-intensivao-python
Projects from the Python intensive event (Intensivão de Python) by Hashtag Treinamentos.
automation data-analysis data-science data-visualization jupyter-notebook machine-learning python webscraping
Last synced: 03 May 2026
https://github.com/emredemirbas/movie-ratings-analysis
A data analysis project investigating potential bias in movie ratings from 2015, comparing them with ratings from other platforms using Python, pandas, and visualization libraries.
data-analysis matplotlib pandas python seaborn
Last synced: 03 May 2026
https://github.com/maddieemihle/python-challenge
Creating a Python script that analyzes financial records and election results
Last synced: 09 Jun 2026
https://github.com/mohnish88/e-commerce-data-analysis
I analyzed sales data to identify trends and patterns, which significantly enhanced decision-making processes. Additionally, I created interactive visualizations to present these insights clearly and effectively, facilitating better understanding and communication of the data's implications.
data-analysis data-cleaning jupyter-notebook pandas plotly python python-library sales sales-analysis visulaization
Last synced: 03 May 2026
https://github.com/abhijais4896/cardiovascular-disease-prediction
Cardiovascular Disease Prediction using machine learning and Python
data-analysis datascience deep-learning machine-learning matplotlib numpy pandas python seaborn visualization
Last synced: 03 May 2026
https://github.com/nathadriele/diabetes-clinical-etl-pipeline
This Public Health Data Engineering project implements a complete pipeline to collect, clean, standardize, validate, integrate, and visualize public SUS data related to Diabetes Mellitus in Brazil, filtering by ICD-10 codes E10 to E14.
cid data-analysis data-extraction data-pipeline data-science data-structures data-visualization datasus diabetes-detection diabetes-prediction epidemiology-analysis etl-pipeline healthcare-analytics ibge logger pytest sih streamlit sus
Last synced: 09 Jun 2026
https://github.com/codeslash21/tmdb_data_analysis
We analyzed the TMDB dataset, which contains details of around 11,000 movies, to find interesting facts about it.
data-analysis data-visualization matplotlib nanodegree-project numpy pandas python tmdb-movie
Last synced: 03 May 2026
https://github.com/muskanmi/data_analysis_python
Data analysis of a student results dataset using Python libraries.
boxplot countplots data-analysis numpy pandas pie-chart python3 seaborn
Last synced: 03 May 2026
https://github.com/syed-m-nofel/python-data-science-fundamentals
Python notebooks for data manipulation (Pandas/NumPy) and API workflows – from basics to practical examples.
api beginner-friendly data-analysis data-science http-requests jupyter-notebook numpy pandas pandas-dataframe python tutorial
Last synced: 03 May 2026
https://github.com/obinnaokoye89/fraud-detection-monitoring
ML model monitoring for fraud detection using NannyML
analytics automation data-analysis fraud-detection jupyter-notebook machine-learning monitoring nannyml pandas python sciki-learn
Last synced: 03 May 2026
https://github.com/nurulashraf/logistic-regression-loan-prediction
Loan approval prediction using logistic regression based on applicant data, including income, credit history, and property details, after data preparation and feature engineering.
data-analysis data-science loan-prediction logistic-regression machine-learning predictive-modeling python sklearn
Last synced: 03 May 2026
https://github.com/ljadhav25/swiggy-restaurant-analysis
This repository contains data and analysis related to restaurants listed on Swiggy, one of India's largest online food ordering and delivery platforms. The objective is to explore restaurant trends, customer reviews, pricing strategies, and delivery metrics to gain insights into the food delivery industry.
data-analysis data-visualization matplotlib-pyplot numpy-library pandas-library python seaborn-plots
Last synced: 03 May 2026
https://github.com/iguptashubham/ev-market-exploration
Market size analysis is a crucial aspect of market research that determines the potential sales volume within a given market.
data-analysis data-analysis-projects data-science-project forecast projects python
Last synced: 03 May 2026
https://github.com/devesh8423/machine_learning
Machine Learning practice projects, Jupyter notebooks, and datasets for learning regression, classification, and data analysis.
classification data-analysis data-science data-visualization jupyter-notebook machine-learning matplotlib ml-project numpy-library pandas python regression sckit-learn seaborn
Last synced: 03 May 2026
https://github.com/donmaruko/flask-data-analysis
Flask API for statistical calculations. Data analysis, cleansing, visualization, and manipulation. Documented by Swagger.
api api-rest data-analysis data-science data-visualization datascience flasgger matplotlib pandas seaborn sqlite wordcloud
Last synced: 03 May 2026
https://github.com/nathadriele/world-marathon-run-majors-analytics-challenge
This project presents a complete data engineering, analytics, machine learning, and Streamlit dashboard pipeline focused on the Abbott World Marathon Majors: Tokyo, Boston, London, Berlin, Chicago, and New York City. Covering the 2018 to 2025 seasons, it analyzes more than 628,000 runner records and 86 verified winner entries.
challenge data-analysis data-pipeline gradient-boosting lasso-regression linear-regression machine-learning models predictive-modeling python random-forest ridge-regression run-analytics world-marathon
Last synced: 09 Jun 2026
https://github.com/bpkaur/whats-in-a-name
Exploring a dataset of first names of babies born in the US to uncover interesting stories
data-analysis datacamp numpy pandas python3
Last synced: 04 May 2026
https://github.com/mindlessmuse666/titanic-data-visualization
A project visualizing Titanic passenger data using the Python libraries Matplotlib, Seaborn, and Plotly.
data-analysis data-visualization matplotlib pandas plotly python seaborn titanic
Last synced: 04 May 2026
https://github.com/nickenshidqia/uber-new-york-data-analysis
Analyze Uber pickups in New York to gain insights from the data
data-analysis data-analyst exploratory-data-analysis python
Last synced: 04 May 2026
https://github.com/xiaohan2012/myunisport
Visualize your Unisport annual training records
data-analysis data-visualization pandas pygal sports-stats tikzposter
Last synced: 04 May 2026
https://github.com/sanchittechnogeek/rental-data-visualization_python
Statistics and visualization of rental data with python
data-analysis data-science data-visualization statistics
Last synced: 04 May 2026
https://github.com/soham7998/data-analysis-projects
Data analysis projects I have completed, gaining hands-on experience from each. The projects showcase different concepts, visualizations, and more.
data data-analysis data-science machine-learning nlp python soham visualization
Last synced: 04 May 2026
https://github.com/damisparks/become_data_analyst
Are you new to data analysis? Here you will find simple notebooks that will help you through your journey. These are personal projects I am still working on.
data data-analysis data-visualization matplotlib numpy pandas-tutorial
Last synced: 04 May 2026
https://github.com/ibrahimm7004/supermarket-sales-analysis
This project focuses on data mining techniques to gather insights about customer behaviour regarding supermarket sales. Includes: Association Rule Mining, Temporal Patterns in customer behaviour, Sequential Pattern Mining, Classification, Regression, and Outlier Detection.
apriori association-rules data-analysis data-mining data-science data-visualization fpgrowth python sales-analysis supermarket-sales
Last synced: 04 May 2026
https://github.com/mchenryspagg/investigate_a_dataset
This is a data analysis project that demonstrates the student's ability to use python data analysis libraries such as pandas, numpy and pyplot in matplotlib to investigate a dataset and answer specific questions from the dataset, thus demonstrating skills in data cleaning, data wrangling, and exploratory data analysis.
data-analysis datetime descriptive-analysis descriptive-statistics exploratory-data-analysis numpy pandas pyplot python visualization
Last synced: 04 May 2026
https://github.com/abhijais4896/belarus-car-price-prediction
Belarus car price prediction
data-analysis datacleaning macine-learning numpy pandas python
Last synced: 04 May 2026
https://github.com/sweta-kaundilya/python_for_data_analysis
Learning Python and the relevant Python libraries for the data field.
cufflinks data-analysis data-science matplotlib numpy pandas plotly python seaborn
Last synced: 04 May 2026
https://github.com/analitico-771/etf_analyzer
An application that pulls and analyzes ETF data from a database
conda-environment data-analysis data-structures data-visualization database etf-investments fintech hvplot pandas-dataframe python quantitative-finance sqlalchemy
Last synced: 04 May 2026
https://github.com/hilalguleryuz/northwind_data_analysis_capstone_project
Northwind Capstone Project
capstone-project dashboard data-analysis data-visualization dax jupyter-notebook matplotlib northwind northwind-database pandas postgresql powerbi python seaborn sql
Last synced: 04 May 2026
https://github.com/hyperplasma/olympic-visualization-analysis
Multidimensional analysis and visualization of Olympic medals, economy, and happiness index.
data-analysis data-visualization matplotlib numpy pandas python wordcloud
Last synced: 04 May 2026
https://github.com/abhinav330/911-emergency-calls-analysis
This Python Notebook analyzes emergency call data from the '911.csv' dataset. It uses various data visualization techniques to explore and gain insights into the emergency call data, including the types of calls, reasons for calls, and call patterns over time.
data-analysis data-science data-visualization eda exploratory-data-analysis exploratory-data-visualizations numpy pandas python
Last synced: 09 Jun 2026
https://github.com/ljadhav25/logistic-regression-data-science-
Logistic regression estimates the probability of an event occurring, such as voted or didn’t vote, based on a given data set of independent variables.
data-analysis data-science data-visualization logestic-regression machine-learning
Last synced: 04 May 2026
https://github.com/mugilan1309/csv_analyzer
📊 A simple Streamlit-based CSV Analysis & Preprocessing Tool for quick data insights.
csv-processing data-analysis data-visualization machine-learning python streamlit
Last synced: 04 May 2026
https://github.com/saitoxu/data-analysis-workspace
Docker image for data analysis
Last synced: 04 May 2026
https://github.com/bishopce16/pyber_analysis
The purpose of this project was to complete an exploratory analysis and create visualizations of the 2019 ride sharing data from PyBer.
data-analysis data-visualization jupyter-notebook matplotlib pandas python
Last synced: 04 May 2026
https://github.com/drod75/nyc-arrests-analysis
This is a simple Data Science Project made to analyze and display data and trends found within the NYC Arrests Year to Date Dataset.
data-analysis data-visualization folium jupyter-notebook matplotlib-pyplot nyc-opendata nypd python scikit-learn seaborn
Last synced: 04 May 2026
https://github.com/vara-co/crowdfunding_etl
ETL Mini Project based on a Crowdfunding Database, using CRUD operations. SQL, Postgres, and an ERD.
data-analysis database datacleaning erd erdiagram etl jupyter-notebook postgres postgresql regex schema sql
Last synced: 04 May 2026
https://github.com/yokawaiik/data_science
Time series forecasting with future prediction.
data-analysis keras lstm neural-network predict-future python python-3 rnn time-series-forecast visualization
Last synced: 05 May 2026
https://github.com/shishirshekhar/census-web-app
This web app allows a user to explore and visualise census data
data-analysis data-science data-visualization machine-learning python python3 streamlit streamlit-application streamlit-dashboard streamlit-web streamlit-webapp visualization
Last synced: 05 May 2026
https://github.com/dhruvsrikanth/basic-data-science
A short Data Science Project I took up for fun! This is a data analysis based on a dataset I created to predict the distribution of wealth within an economy as well as several characteristics of each class within society!
analysis data-analysis data-pipeline data-science data-visualization machine-learning matplotlib pandas python seaborn sklearn
Last synced: 05 May 2026
https://github.com/demon-2-angel/product-customer-acquisition-analysis-using-behaviour
The database encompasses eight tables with varied attributes and rows. Key analyses include product restocking needs, top VIP customers' contributions, and an average customer profit of $39,039.59. Recommendations emphasize strategic marketing to new customers and incentives for existing VIP clients based on acquisition costs and profit insights.
customer-products customer-segmentation data-analysis database sqlite
Last synced: 05 May 2026
https://github.com/badranalyst/residential-unit-prices-data-analysis-application
Python-based analysis of residential unit prices, focusing on data cleaning, visualization, and exploratory data analysis (EDA). Key features include price distribution, and correlation analysis between factors like size, location, and pricing.
data-analysis data-visualization dataset matplotlib numpy pandas python seaborn
Last synced: 05 May 2026
https://github.com/beaprogrammer02345/python_data_analysis
Sales Analysis using Python
data-analysis data-visualization python
Last synced: 05 May 2026
https://github.com/13anush/python-libraries-
A collection of essential Python libraries—NumPy, Pandas, Matplotlib, and Seaborn—perfect for anyone starting out in data analysis.
data-analysis matplotlib numpy pandas python seaborn
Last synced: 05 May 2026
https://github.com/zafir100100/cancer-stage-prediction
This code predicts cancer data using various regression models, calculates their average R-squared scores, and prints the best model.
cross-validation data-analysis data-preprocessing decision-trees gradient-boosting linear-regression machine-learning-algorithms numpy pandas random-forest regression scikit-learn
Last synced: 05 May 2026
https://github.com/codewithmayank-py/box-office-analysis-with-seaborn-and-python
This repository contains Python code and datasets for analyzing box office data. Explore trends, patterns, and factors influencing movie performance.
analysis box-office-data-analysis data-analysis data-visualization dataset jupyter-notebook matplotlib pandas python3 seaborn
Last synced: 05 May 2026
https://github.com/kuranez/eu-energy-map
Dashboard visualizing renewable energy trends in the European Union.
dashboard dashboards data-analysis data-visualization energy-data european-union geopandas green-energy interactive-map map pandas plotly python renewable-energy renewables web-app
Last synced: 05 May 2026
https://github.com/mohitsai/boston-housing-data-analysis
Data analysis project for the City of Boston government, providing insights into the effect of property renovations and remodeling on housing availability in the city
data-analysis data-science matplotlib numpy pandas python
Last synced: 05 May 2026
https://github.com/akash-47-tank/personalized-e-commerce-review-summarizer
Personalized E-commerce Product Review Summarizer: A Streamlit app that summarizes product reviews (e.g., from a CSV) using T5-small and tailors summaries to user preferences (price, durability, etc.) with NLP and lightweight ML.
data-analysis e-commerce machine-learning nlp personalization portfolio python scikit-learn sentiment-analysis streamlit t5 transformers web-app
Last synced: 05 May 2026
https://github.com/akotronis/qualitycontrol
HRH Quality Control app
data-analysis gui-application latex newton-method oop pandas progress-bar pyinstaller pysimplegui python quality-control sqlite3
Last synced: 05 May 2026
https://github.com/kammarah/data-sample
I designed a database website 🌐 that can be uploaded easily for use 📤. You can check my website 👀.
data-analysis data-visualization database deploy deployment library-management-system panaversity streamlit webapp
Last synced: 05 May 2026
https://github.com/nimbostratos/titanic-survival-prediction
Machine learning project predicting Titanic survival using AdaBoost with feature engineering and hyperparameter optimization
data-analysis data-science data-science-projects kaggle machine-learning machine-learning-models python scikit-learn
Last synced: 05 May 2026
https://github.com/hms75/movie_rating_analysis
A movie rating analysis which identifies trends amongst a dataset of 5000 movies.
data-analysis data-visualization matplotlib-pyplot numpy pandas python
Last synced: 05 May 2026
https://github.com/ryuzen6/bangalore-real-estate-price-prediction
This is a data science project that predicts the cost of real estate in Bangalore. Requirements: Jupyter Notebook (for data cleaning and creating the linear regression model using various Python libraries), PyCharm (Python IDE for creating the Python Flask server), Visual Studio Code (to create the UI with HTML, CSS, and JavaScript).
css3 data-analysis data-science html5 javascript jupyter-notebook machine-learning python3
Last synced: 06 May 2026
https://github.com/yashpaneliya/bank-loan-default-analysis
Analyze and understand the driving factors (or driver variables) behind loan default, i.e. the variables which are strong indicators of default.
data-analysis loan-default-analysis matplotlib numpy pandas python
Last synced: 06 May 2026
https://github.com/mikma03/datascience_python_datacamp
DataScience with Python. Code and examples. Python libraries, including pandas, NumPy, Matplotlib, and many more.
data-analysis data-science datacamp datascience numpy pandas python
Last synced: 06 May 2026
https://github.com/friendotjava/air-quality-analysis
Data analysis of air quality in different districts of Beijing.
data-analysis explanatory-data-analysis exploratory-data-analysis jupyter-notebook pytho streamlit
Last synced: 06 May 2026
https://github.com/abdelmajidlh/cours
Data engineering and data analysis course.
apache-spark big-data data-analysis data-engineering docker jupyter-notebook pyspark
Last synced: 06 May 2026
https://github.com/vimlesh-gupta/blinkit_data_analytics_project
End-to-end Blinkit data analytics project using Python, SQL Server & Power BI
blinkit data-analysis eda pandas powerbi python sql-server
Last synced: 06 May 2026
https://github.com/rlalpha49/anisearch-model
AniSearchModel leverages Sentence-BERT (SBERT) models to generate embeddings for synopses, enabling the calculation of semantic similarities between descriptions. This allows users to find the most similar anime or manga based on a given description.
anime api data-analysis data-merging embeddings flask hugging-face-datasets kaggle-datasets machine-learning manga natural-language-processing nlp python sentence-bert similarity-search
Last synced: 06 May 2026
https://github.com/siddhantprateek/machine-learning-resources
Machine Learning Resources
best-practices clustering-algorithm data-analysis deep-learning in-progress journey linear-regression machine-learning machine-learning-algorithms neural-language-modelling neural-language-processing neural-network numpy python3 read reinforcement-learning-algorithms tensorflow visualisation
Last synced: 07 May 2026
https://github.com/badranalyst/exploratory-data-analysis-on-salaries-dataset
Performing EDA on a dataset related to salaries, exploring relationships between factors like job titles, industries, and locations. Insights are visualized with plots to identify trends and disparities in salary data.
data-analysis dataset eda exploratory-data-analysis pandas python
Last synced: 07 May 2026