Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-27 00:07:21 UTC
- JSON Representation
https://github.com/lucycatherine/healthinsuranceproject
This repository contains a machine learning project that analyzes the factors influencing health insurance charges, such as age, smoking status, and medical conditions.
data-analysis data-science data-visualization jupyter-notebook machine-learning python
Last synced: 18 May 2026
https://github.com/maheera421/pandas
Implementation of essential Pandas functions.
data-analysis data-manipulation pandas-dataframes pandas-datareader pandas-python
Last synced: 17 Jul 2025
https://github.com/gui-sitton/y.music
In this project I compared the musical preferences of the citizens of Springfild and Shelbyville. I examined real Y.Music data to test hypotheses and compare the behavior of users in these two cities.
data data-analysis data-analysis-python data-science data-visualization python
Last synced: 18 May 2026
https://github.com/niniola-creator/niniola-creator
This is a repository that I have created to show my skills, share my projects and track my progress in my data science/web development journey.
bootstrap5 css3 data-analysis data-science data-visualization database html5 javascipt javascript matplotlib pandas powerbi python spreadsheets sql
Last synced: 07 Apr 2026
https://github.com/amr-yasser226/interactive-sales-analytics-dashboard
An interactive web-based dashboard for visualizing multinational electronics sales data. This project for the DSAI 203 course integrates a Python/Flask backend with an amCharts frontend to provide dynamic insights into product revenues, sales distribution, and employee statistics across different countries.
am5charts amcharts business-intelligence css dashboard data-analysis data-analytics data-visualization flask html javascript python sqlalchemy sqlite web-application
Last synced: 13 Apr 2026
https://github.com/namratagulati/tweets_analysis
This repository focuses on sentiment analysis of Twitter data using Python, Natural Language Processing (NLP), and the Natural Language Toolkit (NLTK). The goal is to extract valuable insights from social media discussions, such as word frequency, hashtag trends, and sentiment patterns.
analysis data-analysis natural-language-processing nlp-machine-learning nltk-corpus nltk-python sentiment-analysis twitter-sentiment-analysis
Last synced: 07 Aug 2025
https://github.com/priyadarshinijain/air-quality-data-analysis-and-visualization
# 🌍 Air Quality Data Analysis and Visualization
data-analysis jupyter-notebook python visualization
Last synced: 06 Feb 2026
https://github.com/gagan8605/zepto_sql_analysis
This project explores and analyzes the inventory data of Zepto, a rapidly growing 10-minute grocery delivery platform in India. The dataset contains over 3,000+ SKUs across key product categories such as Fruits & Vegetables, Dairy, Beverages, Packaged Foods, and more. The analysis was performed using PostgreSQL, covering both data cleaning and bus
cleaning-data data-analysis database-management postgresql sql
Last synced: 16 Jul 2025
https://github.com/progati00/marketing-mix-modeling-mmm-for-marketing-budget-optimization
A Marketing Mix Modeling (MMM) project using Python to analyze channel performance, calculate ROI, and simulate marketing budget changes for better business decisions. Includes a trained Linear Regression model, ROI analytics, and a Flask API for revenue prediction.
api budget-optimization data data-analysis data-science ecommerce eda flask jupyter-notebook linear-regression machine-learning marketing-analytics marketing-mix-modeling python roi-analysis vscode
Last synced: 14 Apr 2026
https://github.com/shrutiijoshi/airbnb-listing-reviews
Airbnb is an online marketplace that connects people who want to rent out their homes with travelers seeking accommodations.
data-analysis matplotlib-pyplot pandas-python python seaborn
Last synced: 17 May 2026
https://github.com/v41bh4vr4jput/data-analysis-with-python
This repository is a comprehensive collection of data analysis projects and tutorials using Python's most powerful libraries: NumPy, Pandas, Seaborn, and Matplotlib. It is designed to help you explore, clean, visualize, and analyze data efficiently.
api data data-analysis data-visualization matplotlib numpy pandas python sakila-db seaborn
Last synced: 09 Apr 2026
https://github.com/aakashjhawar/twitter-sentiment-analysis
Sentiment analysis of tweets to detect negative tweets.
bagofwords data-analysis data-science doc2vec logistic-regression machine-learning nlp-machine-learning nltk regex sentiment-analyser sentiment-analysis support-vector-machine textblob tf-idf-features twitter twitter-sentiment-analysis word2vec xgboost
Last synced: 29 Mar 2025
https://github.com/cosmoduende/r-ggcats
StrangeR things: Adding… Cats? to your plots on R. How to analyze and visualize data with the help of funny cats with the ”ggcat” package.
data-analysis data-analytics data-science data-visualisation data-visualization data-viz dataviz ggcats r-language r-library r-package r-programming r-scripts r-studio rstats rstudio
Last synced: 22 Jul 2025
https://github.com/gmasson/datadash
DataDash é uma biblioteca JavaScript e CSS para criar dashboards interativos, para visualização de dados dinâmicos em páginas web.
dashboard dashboard-application dashboards data-analysis data-science data-visualization javascript
Last synced: 08 Aug 2025
https://github.com/a19xys/dm-csgo_analysis
Analysis to address the most important aspects of the knowledge discovery process from data.
data-analysis data-mining data-science dataset jupyter-notebook python
Last synced: 18 May 2026
https://github.com/datalopes1/bankabc_churn
Neste projeto será realizado o processo de EDA (Exploratory Data Analysis) com foco na análise de Churn a partir do datas ser Bank Customer Churn Dataset, que pode ser encontrado no Kaggle e disponibilizado por Gaurav Topre.
churn-analysis data-analysis data-science eda python
Last synced: 18 May 2026
https://github.com/viper373/lol-dataanalytics
腾讯游戏-英雄联盟赛事20/21/22年数据综合分析预测
crawler-python data-analysis jupyter-notebook lol python spider
Last synced: 15 Jul 2025
https://github.com/kwonnayeon/medium-post-projects
Code & projects from my Medium posts
data-analysis data-science data-visualization medium-articles python r-language sql
Last synced: 18 May 2026
https://github.com/vishnu-vamshii/layoffs-data-analysis-in-sql
This project focuses on the cleaning and exploratory analysis of a dataset containing layoff information. It includes data deduplication, standardization of columns, handling null and blank values, and analyzing layoffs by company, industry, country, and date. Various SQL queries are used to explore trends and patterns in layoffs over time.
Last synced: 15 Jul 2025
https://github.com/hosseinkarimi128/zed-one
An AI-powered assistant that analyzes CSV data using natural language queries to generate pandas code and visualizations.
ai-data-analysis automated-pandas automated-pandas-queries csv data-analysis fastapi langchain machine-learning matplotlib nlp openai pandas restful-api summarization visualization-tools
Last synced: 07 Apr 2026
https://github.com/jarrarshahid/nutrition-calculator
Simple python app to calculate nutritions in everyday meals.
data-analysis health json jupyter-notebook logic-programming python
Last synced: 15 Jul 2025
https://github.com/sourceduty/data_metrics
📈 Analyzing, sorting and visualizing data.
data data-analysis data-metrics data-sci data-science data-science-projects data-sorting data-visualization database dataset metrics sorting statistics visualization
Last synced: 08 Aug 2025
https://github.com/1adityakadam/carnegie-classifications-ancestry-grid
A concise, interactive tool for exploring the historical lineage of U.S. higher education institutions using Carnegie Classification data from 1973–2021.
dash data-analysis html javascript pandas python
Last synced: 25 Jun 2025
https://github.com/miusarname2/proyectos-final-analitica-de-datos
Welcome to the repository where the magic of data analytics comes to life! This is the result of our effort and creativity in the subject of data analysis at the Universidad Cooperativa de Colombia (UCC). Here we keep everything we did to analyse data, draw cool conclusions and solve the workshop we were given. 🎯📊
data-analysis data-science data-visualization pip python
Last synced: 15 Jul 2025
https://github.com/andersoncrs/analisis-de-texto-tweets
En este proyecto exploro el análisis de texto de tweets para descubrir tendencias, opiniones y temas relevantes en redes sociales. Usando herramientas de procesamiento de lenguaje natural, convierto grandes volúmenes de mensajes en información clara y visualmente atractiva.
data-analysis data-visualization eda text-mining
Last synced: 21 Jul 2025
https://github.com/artemzarubin/xml-document-processor
XML processing tool using the Strategy design pattern.
csharp data-analysis data-transformation design-patterns strategy xml
Last synced: 21 Jul 2025
https://github.com/BingyanStudio/github-analyzer
锐评一下你都在 GitHub 写了什么
data-analysis github llm reports selfhosted typescript
Last synced: 12 May 2025
https://github.com/1adityakadam/Carnegie_classifications_website
A comprehensive data analytics platform analyzing 50+ years of U.S. higher education trends through interactive visualizations and historical institution tracking.
css data-analysis html javascript python ui-design web-development
Last synced: 25 Jun 2025
https://github.com/nadamarei/data-analyzer
The Qualitative Data Analysis Tool is a powerful Streamlit application designed for researchers to analyze word frequencies in corporate documents. This tool processes PDF reports, identifies target words and their contextually relevant synonyms, and generates comprehensive reports with document statistics, summary analysis, and per-file breakdowns
data-analysis data-visualization python-3 streamlit
Last synced: 18 May 2026
https://github.com/rohansoni45/movie-recommendation-system
This project is a Content-Based Recommender System that suggests movies to users based on their preferences and watched history. The system leverages cosine similarity to find and recommend movies similar to a selected title. It is built using Python and libraries like Pandas, NumPy, and Scikit-learn.
content-based-filtering cosine-similarity data-analysis data-science machine-learning numpy pandas python recommender-system render scikit-learn
Last synced: 17 Apr 2026
https://github.com/teja-1403/ignosis-tech-ml-assignment
Analysis of transaction data to identify the most profitable products and key customer segments, providing insights for targeted marketing strategies.
customer-segmentation data-analysis data-visualization machine-learning marketing-strategy python
Last synced: 02 May 2026
https://github.com/lucas54neves/financial-organizer
Financial organizer using Streamlit
data-analysis data-science financial-organizer plotly python streamlit
Last synced: 02 May 2026
https://github.com/asadiahmad/word-counter-spark
Word counter with spark
data-analysis nlp spark word-counter
Last synced: 02 May 2026
https://github.com/jimohola/breast-cancer-detection
Breast Cancer Detection-Machine learning
data-analysis data-visualization exploratory-data-analysis machine-learning python3
Last synced: 02 May 2026
https://github.com/hamzazafar10/movie-recommendation-system
Content based movie recommendation system using cosine similarity.
cosine-similarity data-analysis data-preprocessing data-science data-structures data-visualization jupyter-notebook machine-learning movie-recommendation python
Last synced: 02 May 2026
https://github.com/amishidesai04/emergency-calls-data-analysis-project
Welcome to the Emergency Calls Data Analysis project repository. This project is dedicated to extracting, processing, and visualizing data from the "Emergency – 911 Calls, Montgomery County" dataset, sourced from Kaggle. The main objective is to analyze trends in emergency calls in Montgomery County, Pennsylvania, spanning multiple years.
analysis data-analysis data-extraction data-processing data-science data-visualization numpy pandas python seaborn
Last synced: 02 May 2026
https://github.com/fatihilhan42/spotify-songs-recommendations-system_with_python
We developed a song recommendation system for the user with the data we received from our Spotify song dataset. Data set and other applications are given in the description. Have a nice day.
data-analysis data-science data-visualization jupyter-notebook python recommendation-engine recommendation-system
Last synced: 02 May 2026
https://github.com/dissorial/prx21_erikz
Analysis of self-tracked data: interactive visualizations & predictive algorithms
analytics data-analysis data-science data-visualization machine-learning matplotlib pandas python python3 visualization
Last synced: 02 May 2026
https://github.com/mehanix/dhrw
🎢 IaaS visual editor to create & deploy data processing pipelines - python, rmq, react, meteorjs
computational-graph computational-graphs data-analysis data-engineering data-pipeline data-pipelines data-processing data-processing-and-analysis data-processing-pipelines data-processing-system data-science data-visualization docker-compose good-first-issue help-wanted meteorjs-application rabbitmq react-flow
Last synced: 02 May 2026
https://github.com/benzerinsio/breastcancer-eda
📊 Análise Exploratória de Dados (EDA) - Câncer de Mama | Exploração de características clínicas para identificar padrões e relações no diagnóstico de câncer de mama.
analise-de-dados analise-exploratoria analise-exploratoria-de-dados data-analysis data-visualization diagnosis eda exploratory-data-analysis health-care medical-data python seaborn
Last synced: 02 May 2026
https://github.com/inevolin/multivariate-data-analysis
Showcases of modern multivariate & multidimensional data analysis in industrial and high-tech settings.
analytics data-analysis data-science data-visualization javascript
Last synced: 09 Jun 2026
https://github.com/faiyaz-zaman/used-car-market-trends-on-bikroy.com
Used Car Market Trends on Bikroy.com
data-analysis python scraping-websites selenium tableau
Last synced: 02 May 2026
https://github.com/robertpaulp/expenseadvisor
HackITall 2023- Hackathon
chatgpt-api data-analysis data-processing python scrapping-python
Last synced: 03 May 2026
https://github.com/jofaval/red-wine-quality
Data Analysis of the Red Portuguese's Wine's Quality in 2009
classification data-analysis data-science data-visualization google-colab kaggle logistic-regression machine-learning python scikit-learn wine-quality xgboost
Last synced: 03 May 2026
https://github.com/mehtadigisha/iris-flower-classification
Iris Flower Classification
accuracy-score classification-report data-analysis data-visualization eda iris-classification machine-learning matplotlib pandas prediction python scikit-learn seaborn svc-model svm-model visualization
Last synced: 03 May 2026
https://github.com/ahmedhosssam/lesser_pandas
Pandas-like Data Analysis library in C++
cpp data-analysis data-science pandas
Last synced: 03 May 2026
https://github.com/monteirooscar98/tarifas-publicas-sp-dieese
Extração de dados através de WebScraping no site do Dieese e Analise em relação as Tarifas Públicas do Município de São Paulo.
data-analysis data-visualization python webscraping
Last synced: 03 May 2026
https://github.com/aicorsair/python-case-study-imdb-movie-reviews-sentiment-analysis-with-nlp
This repository contains a comprehensive case study on sentiment analysis using the IMDb dataset of movie reviews.
ada-boost artificial-intelligence classification data-analysis data-analytics data-cleaning data-preprocessing data-science data-visualization feature-engineering feature-extraction hyperparameter-tuning logistic-regression machine-learning naive-bayes natural-language-processing nltk python random-forest shap
Last synced: 03 May 2026
https://github.com/rohitinu6/tesla-price-prediction
A machine learning project that predicts future stock price movements using Logistic Regression, SVC, and XGBoost with engineered financial features.
data-analysis data-visualization feature-engineering financial-analysis logistic-regression machine-learning matplotlib python scikit-learn seaborn stock-market stock-price-prediction support-vector-machine time-series xgboost
Last synced: 03 May 2026
https://github.com/maddieemihle/python-challenge
Creating a Python script that analyzes financial records and election results
Last synced: 09 Jun 2026
https://github.com/vipulbunny/restaurant-insight-analysis
A comprehensive data analysis project exploring restaurant ratings, locations, and customer sentiments. This project includes data preprocessing, descriptive analysis, geospatial mapping, sentiment analysis, and price-rating correlations using Python and visualization tools.
data-analysis data-preprocessing data-visualization folium geospatial geospatial-analysis geospatial-visualization machine-learning nlp pandas python restaurant-insights seaborn sentiment-analysis
Last synced: 03 May 2026
https://github.com/abhijais4896/cardiovascular-disease-prediction
Cardiovascular Disease Prediction using machine learning and Python
data-analysis datascience deep-learning machine-learning matplotlib numpy pandas python seaborn visualization
Last synced: 03 May 2026
https://github.com/codeslash21/tmdb_data_analysis
We analysed TMDB dataset which contains around 11000 movies details. We analyzed to find some interesting facts about the dataset.
data-analysis data-visualization matplotlib nanodegree-project numpy pandas python tmdb-movie
Last synced: 03 May 2026
https://github.com/salma-mamdoh/project-writing-functions-for-product-analysis
My Project to learn the Basics of Analysis on DataCamp
data-analysis data-camp pandas python
Last synced: 03 May 2026
https://github.com/syed-m-nofel/python-data-science-fundamentals
Python notebooks for data manipulation (Pandas/NumPy) and API workflows – from basics to practical examples.
api beginner-friendly data-analysis data-science http-requests jupyter-notebook numpy pandas pandas-dataframe python tutorial
Last synced: 03 May 2026
https://github.com/ggarciajavier/udacity-dalf-project4-identify-fraud-enron-email
Work performed for the 4th project of the Udacity Data Analyst Nanodegree: machine learning classifier for identifying fraud in Enron email corpus.
data-analysis data-science machine-learning nlp-machine-learning python python27
Last synced: 03 May 2026
https://github.com/nurulashraf/logistic-regression-loan-prediction
Loan approval prediction using logistic regression based on applicant data, including income, credit history, and property details, after data preparation and feature engineering.
data-analysis data-science loan-prediction logistic-regression machine-learning predictive-modeling python sklearn
Last synced: 03 May 2026
https://github.com/ljadhav25/swiggy-restaurant-analysis
This repository contains data and analysis related to restaurants listed on Swiggy, one of India's largest online food ordering and delivery platforms. The objective is to explore restaurant trends, customer reviews, pricing strategies, and delivery metrics to gain insights into the food delivery industry.
data-analysis data-visualization matplotlib-pyplot numpy-library pandas-library python seaborn-plots
Last synced: 03 May 2026
https://github.com/iguptashubham/ev-market-exploration
So, market size analysis is a crucial aspect of market research that determines the potential sales volume within a given market
data-analysis data-analysis-projects data-science-project forecast projects python
Last synced: 03 May 2026
https://github.com/syarwinaaa09/analyzing-crime-in-los-angeles
Exploratory data analysis of Los Angeles crime data with insights on temporal patterns, locations, and age demographics.
crime-data data-analysis eda los-angeles pandas public-safety python visualization
Last synced: 03 May 2026
https://github.com/bpkaur/whats-in-a-name
Exploring dataset of first names of babies born in the US in order to uncover interesting stories
data-analysis datacamp numpy pandas python3
Last synced: 04 May 2026
https://github.com/samruddhi3012/screen-time-analysis
Hi! This repo demonstrates a python project on Screen Time Analysis.
data-analysis data-visualization python
Last synced: 04 May 2026
https://github.com/mindlessmuse666/titanic-data-visualization
Проект по визуализации данных о пассажирах Титаника с использованием библиотек Python Matplotlib, Seaborn и Plotly.
data-analysis data-visualization matplotlib pandas plotly python seaborn titanic
Last synced: 04 May 2026
https://github.com/nickenshidqia/uber-new-york-data-analysis
Analyze Uber pickups on New York to get insight from this data
data-analysis data-analyst exploratory-data-analysis python
Last synced: 04 May 2026
https://github.com/fatihilhan42/the-office-eda
Data analysis study of my favorite sitcom, The Office (US).
data-analysis data-science data-visualization fatihilhan office python sitcom
Last synced: 04 May 2026
https://github.com/soham7998/data-analysis-projects
My Data Analysis Projects which are completed by me and gain a hands on Experience from each project. the project showcase different Concepts , Visualization and many things.
data data-analysis data-science machine-learning nlp python soham visualization
Last synced: 04 May 2026
https://github.com/damisparks/become_data_analyst
Are you new to Data Analysis ? Here you will find simple notebook that will help through your journey. These are personal projects I work on and still working.
data data-analysis data-visualization matplotlib numpy pandas-tutorial
Last synced: 04 May 2026
https://github.com/ibrahimm7004/supermarket-sales-analysis
This project focuses on Data Mining techniques to gather inisights about customer behaviour regarding Supermarket Sales. Includes: Association Rule Mining, Temporal Patterns in customer behavior, Sequential Pattern Mining, Classification, Regression, and Outlier Detection.
apriori association-rules data-analysis data-mining data-science data-visualization fpgrowth python sales-analysis supermarket-sales
Last synced: 04 May 2026
https://github.com/marionchaff/real-estate-price-prediction-france
Real estate price prediction using French public database DVF
data-analysis dvf-data machine-learning price-prediction python real-estate scikit-learn
Last synced: 04 May 2026
https://github.com/sweta-kaundilya/python_for_data_analysis
Learning Python and all the relevant libraries in python for Data field.
cufflinks data-analysis data-science matplotlib numpy pandas plotly python seaborn
Last synced: 04 May 2026
https://github.com/fatihilhan42/book-recommendation-system-with-python
In this project, we are making a book recommendation system that recommends similar books according to the genres or ratings that the user enters, using a large book dataset. The link of the dataset is given below. Happy reading...
books data-analysis data-science data-visualization kaggle python recommendation-engine recommendation-system
Last synced: 04 May 2026
https://github.com/sagarprajapat2004/data-analysis-visualization
Downloaded and analyzed a dataset from Kaggle using NumPy and Pandas created visualizations with Matplotlib and Seaborn developed a Flask web application to showcase data insights and conclusions.
data-analysis data-modeling data-visualization exploratory-data-analysis flask python statical-analysis
Last synced: 04 May 2026
https://github.com/jatin-mehra119/flight-price-prediction
This study aims to analyze flight booking data from "Ease My Trip" website, using statistical tests and linear regression to extract insights. By understanding this data, valuable information can be gained to benefit passengers using the platform.
data-analysis datacleaning datavisualization machine-learning preprocessing-data python sklearn-pipeline sklearn-regression-algorithm streamlit-webapp
Last synced: 04 May 2026
https://github.com/ljadhav25/logistic-regression-data-science-
Logistic regression estimates the probability of an event occurring, such as voted or didn’t vote, based on a given data set of independent variables.
data-analysis data-science data-visualization logestic-regression machine-learning
Last synced: 04 May 2026
https://github.com/saitoxu/data-analysis-workspace
Docker image for data analysis
Last synced: 04 May 2026
https://github.com/jendives2000/regressions
Performing of a Linear Regression analysis to determine the strength of the relationship between the number of reviews and sales for a retail company.
data-analysis linear-regression pearson-correlation-coefficient regression
Last synced: 04 May 2026
https://github.com/surayasumona/test_bowlers_analysis
Data Analysis with Python
data-analysis data-manipulation data-preprocessing numpy pandas
Last synced: 04 May 2026
https://github.com/georgehanymilad/plantycare-app
Graduation Project - Fayoum Center
ai backend cnn-classification colab-notebook data-analysis deep-learning diagrams front-end java kaggle machine-learning native ui-design
Last synced: 04 May 2026
https://github.com/vara-co/crowdfunding_etl
ETL Mini Project based on a Crowdfunding Database, using CRUD operations. SQL, Postgres, and an ERD.
data-analysis database datacleaning erd erdiagram etl jupyter-notebook postgres postgresql regex schema sql
Last synced: 04 May 2026
https://github.com/zobayerakib/credit-card-fraud-analysis__data-analysis-project
credit-card data-analysis decision-trees fraud-detection gradient-descent knn-classification logistic-regression machine-learning machine-learning-algorithms naive-bayes-classifier random-forest-classifier
Last synced: 05 May 2026
https://github.com/matt-ags/jornada-python
Repositório com os projetos realizados durante a semana "Jornada Python" - 01/2025
artificial-intelligence automation data-analysis jupyter-notebook machine-learning python
Last synced: 05 May 2026
https://github.com/jcm-ai/personal-data-science-projects
This page contains all of my personal data science projects. 📊📈📉👨💻
data-analysis data-visualization exploratory-data-analysis jupyter-notebooks machine-learning-algorithms matplotlib-pyplot numpy-library pandas-python personal-project predictive-modeling programming python3 scikit-learn scipy seaborn statistical-analysis
Last synced: 05 May 2026
https://github.com/rtlich/sap-sustainable-management
Project for the ERP & BI course at Esprit School of Engineering. It optimizes resource and operations management in an agri-food company using SAP MM & PM, focusing on sustainability, CO₂ reduction, and predictive maintenance.
angular business-intelligence data-analysis flask machine-learning ocr powerbi python sql-server talend
Last synced: 05 May 2026
https://github.com/13anush/python-libraries-
A collection of essential Python libraries—NumPy, Pandas, Matplotlib, and Seaborn—perfect for anyone starting out in data analysis.
data-analysis matplotlib numpy pandas python seaborn
Last synced: 05 May 2026
https://github.com/zafir100100/cancer-stage-prediction
This code predicts cancer data using various regression models, calculates their average R-squared scores, and prints the best model.
cross-validation data-analysis data-preprocessing decision-trees gradient-boosting linear-regression machine-learning-algorithms numpy pandas random-forest regression scikit-learn
Last synced: 05 May 2026
https://github.com/cicku/en.650.672
HW of EN.650.672
analytics data-analysis numpy pandas
Last synced: 05 May 2026
https://github.com/pcanadas/weather_scraper
Este proyecto automatiza la recopilación y el procesamiento de datos meteorológicos históricos y previsionales. Utiliza Selenium para extraer información de sitios web de clima, procesa los datos con Pandas y los almacena en archivos CSV limpios. Es ideal para análisis climáticos, visualización de datos o integración en otros sistemas.
beautifulsoup data-analysis pandas python selenium
Last synced: 05 May 2026
https://github.com/kuranez/eu-energy-map
Dashboard visualizing renewable energy trends in the European Union.
dashboard dashboards data-analysis data-visualization energy-data european-union geopandas green-energy interactive-map map pandas plotly python renewable-energy renewables web-app
Last synced: 05 May 2026
https://github.com/akash-47-tank/personalized-e-commerce-review-summarizer
Personalized E-commerce Product Review Summarizer: A Streamlit app that summarizes product reviews (e.g., from a CSV) using T5-small and tailors summaries to user preferences (price, durability, etc.) with NLP and lightweight ML.
data-analysis e-commerce machine-learning nlp personalization portfolio python scikit-learn sentiment-analysis streamlit t5 transformers web-app
Last synced: 05 May 2026
https://github.com/ayaatmohammed/amazon-sales-analysis-pyspark
In-depth analysis of the Olist E-commerce dataset from Kaggle using PySpark for customer segmentation (RFM) and market basket analysis.
big-data big-data-analytics customer-segmentation data-analysis data-science ecommerce jupyter-notebook kaggle pyspark python rfm-analysis
Last synced: 05 May 2026
https://github.com/kammarah/data-sample
I designed a database website 🌐 that can be uploaded easily for use 📤. You can check my website 👀.
data-analysis data-visualization database deploy deployment library-management-system panaversity streamlit webapp
Last synced: 05 May 2026
https://github.com/zyna-b/insurance-cost-analysis-and-prediction
Medical insurance EDA and prediction: feature engineering, correlation analysis & Chi-square tests
adjusted-r-squared chisquare-test data-analysis data-science data-visualization eda exploratory-data-analysis linear-regression pandas r2-score sklearn statistical-analysis
Last synced: 05 May 2026
https://github.com/nimbostratos/titanic-survival-prediction
Machine learning project predicting Titanic survival using AdaBoost with feature engineering and hyperparameter optimization
data-analysis data-science data-science-projects kaggle machine-learning machine-learning-models python scikit-learn
Last synced: 05 May 2026
https://github.com/meinhere/dicoding-analisis-data
Submission Analisis Data dengan tema E-Commerce Streamlit App
data-analysis data-mining e-commerce python streamlit
Last synced: 05 May 2026
https://github.com/benjaminrose/data-analysis-book
A Jupyter Book for my Spring 2025 PHY 5381 class on Data Analysis
book data-analysis data-science data-visualization jupyter-book open-book python r statistics-course
Last synced: 06 May 2026
https://github.com/ryuzen6/bangalore-real-estate-price-prediction
This is a Data Science Project which predicts the cost of Real Estate in Bangalore. Requirements: Jupyter Notebook (for Data Cleaning and creating the Linear Regression using various python libraries) , Pycharm (python IDE for creating Python Flask Server), Visual Studio Code (to create the UI with HTML, CSS and Javascript).
css3 data-analysis data-science html5 javascript jupyter-notebook machine-learning python3
Last synced: 06 May 2026
https://github.com/shreshthvashisht/instgram-user-analytics
SQL Fundamentals
data data-analysis data-science mysql social-network-analysis
Last synced: 09 Jun 2026