Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-07-01 00:07:23 UTC
- JSON Representation
https://github.com/gappeah/nike_web_crawler
This project involves web scraping Nike's product pages to extract product names, prices and links. The project showcases three different implementations of the web crawler using Selenium and BeautifulSoup. It also includes visualisation of the scraped data using Matplotlib and Seaborn.
beautifulsoup data-analysis data-visualization python selenium web-crawler web-scraper webcrawler webscraper webscraping webscraping-beautifulsoup
Last synced: 04 Jul 2025
https://github.com/hebaqaisar/movie-recommender-system
AI Recommender System - Recommends you similar movies based on Directors, Tags, Name, Type, Actors, Genre etc
artificial-intelligence data-analysis data-mining data-science jupyter-notebook machine-learning machine-learning-algorithms ml movies-rate pycharm python
Last synced: 17 Apr 2026
https://github.com/sarmadahmad8/ml-and-deeplearning-projects-for-beginners
Beginner ML/DL projects spanning core libraries and problem sets.
beginner-friendly data-analysis data-science deep-learning fastai machine-learning opencv pytorch scikit-learn transformer
Last synced: 17 Apr 2026
https://github.com/patriloto/intro_r_para_reinventartec_2021
Material del taller Primeros pasos en R para el análisis de datos
Last synced: 12 Feb 2026
https://github.com/mariam-badr-mb/gtc-ml-project2-diabetes-prediction
This project is part of the GTC Machine Learning Program. It demonstrates the end-to-end ML workflow by building a predictive model for diabetes detection
classification-algorithm data-analysis data-visualization diabetes-prediction gridsearchcv hyperparameter-tuning machine-learning python
Last synced: 09 May 2026
https://github.com/flexmonster/svelte-flexmonster
Svelte wrapper for Flexmonster Pivot Table & Charts
data-analysis data-visualization frontend pivot-tables svelte sveltekit
Last synced: 27 Feb 2026
https://github.com/zeinhasan/eksploration-and-data-visualization-course-material
Exploratory Data Analysis (EDA) Laboratory Assistant Teaching Materials
data-analysis data-visualization statistics
Last synced: 24 Jun 2026
https://github.com/ivanildobarauna-dev/currency-quote
Complete solution for extracting currency pair quotes data with comprehensive testing, parameter validation, flexible configuration management, Hexagonal Architecture, CI/CD pipelines, code quality tools, and detailed documentation.
data-analysis data-analytics data-engineering library pypi-packages python
Last synced: 27 Oct 2025
https://github.com/manvendra747/customer-segmentation
Customer segmentation using Python and PowerBI
customer-segmentation dashboard data-analysis data-science data-visualization powerbi python rfm-analysis
Last synced: 28 Apr 2025
https://github.com/sandk21/detection_faux_billets
Algorithme de détection de faux billets selon leurs dimensions géométriques et application web pour générer les prédictions
data-analysis data-science data-visualization machine-learning pandas python scipy sklearn streamlit
Last synced: 03 Apr 2026
https://github.com/devexpress-examples/aspnet-pivot-grid-custom-aggregates
This example shows how to aggregate data by the field's first value.
asp-net-web-forms data-analysis dotnet pivot-grid pivot-grid-for-web-forms
Last synced: 06 Jul 2025
https://github.com/shadowk29/cusumtools
An eclectic collection of python scripts I have found to be useful in processing nanopore data
data-analysis data-visualization time-series-analysis
Last synced: 16 Mar 2026
https://github.com/christos99/scraping-project
This project is a Python-based tool for web scraping with a user-friendly GUI. Built with PyQt5 and Selenium, it allows users to scrape online listings by specifying keywords, price ranges, and exclusions. Results are displayed in a table and can be exported to an Excel file.
automation data-analysis excel gui openpyxl pandas pyqt5 python selenium web-scraping
Last synced: 10 May 2026
https://github.com/carmoreno/analisisaccidentalidadbogota
Data Analysis about traffic accidents at Bogotá, Colombia.
data-analysis data-science jupyer-notebook matplotlib numpy pandas scikit-learn
Last synced: 17 Apr 2026
https://github.com/gabrielmpinho/cs50-sql
Solutions and notes from CS50’s Introduction to Databases with SQL. Covers CRUD operations, data modeling, normalization, joins, views, indexes, and connecting SQL with Python and Java. Begins with SQLite for portability and introduces PostgreSQL and MySQL for scalability.
data-analysis data-structures data-visualization database databases javascript python sql
Last synced: 10 May 2026
https://github.com/smahala02/calorimtery
A calorimetry lab project involving Python and Excel for computing heat transfer from experimental data.
calorimetry chemistry data-analysis excel jupyter-notebook python thermodynamics
Last synced: 05 Feb 2026
https://github.com/suhas-005/power-bi-dashboard
Power BI Dashboard Projects
data-analysis data-visualization dataset power-bi-project powerbi
Last synced: 01 Apr 2025
https://github.com/vijayjoshi16/credit-card-fraud-detection-using-ml-in-python
Credit Card Fraud Detection Using ML in Python
data-analysis jupyter-notebook logistic-regression machine-learning matplotlib-pyplot numpy pandas python regression seaborn
Last synced: 17 Apr 2026
https://github.com/asifdotexe/air-quality-analysis-aqa
AQA is a data-driven project focused on analyzing air quality data sourced from data.gov.in. The project encompasses data preprocessing, analysis, and visualization to gain insights into air pollution levels across various locations in India. By examining six key pollutants, the project aims to raise awareness about the environmental issues
aqi-analysis data-analysis data-preprocessing data-science data-visualization presentation
Last synced: 07 Jun 2026
https://github.com/abdelhakim-gh/machine-learning_data-analysis_project
recognizing handwritten numbers & comparing the Life Expectancy vs Fertility in 1960 & 2013 of regions
data-analysis jupyter-notebook machine-learning python r r-studio
Last synced: 12 Apr 2026
https://github.com/snehilk1312/data_science
This Repository contains the Data Science things I have done in recent times along with visualization , cleaning , models, statistics, Courses, Datasets. :=)
data-analysis data-science glove natural-language-processing nlp nltk statistics word2vec
Last synced: 02 Apr 2026
https://github.com/tomasoak/datahopper
Python package for data engineering and data wrangling
data data-analysis data-engineering data-mining data-science data-structures data-wrangling datascience pandas python
Last synced: 12 Mar 2026
https://github.com/rudra-g-23/find-my-joint
A utility to find potential join keys (matching columns) across multiple DataFrames.
data-analysis data-visualization join network-graph pandas pandas-dataframe
Last synced: 24 Jun 2026
https://github.com/hordiales/redpanal-db-analysis
Analysis of the RedPanal.org music database
creative-commons data-analysis dataset etl machine-learning music music-information-retrieval statistical-analysis
Last synced: 10 Mar 2025
https://github.com/gher-uliege/bluecloud-plankton
Spatial interpolation of plankton data using a neural network
data data-analysis data-visualization neural-network oceanography
Last synced: 30 Mar 2025
https://github.com/ifibla/adsdb-project
Algorithms, Data Structures and Databases Project
data-analysis data-engineering python
Last synced: 12 Apr 2026
https://github.com/mgimond/meteo_waterville
Waterville (Maine) meteorological data
data-analysis data-science exploratory-data-analysis meteorology r
Last synced: 24 Jan 2026
https://github.com/mateibejan1/ai-masters
A repository for all the projects I have done during my AI MSc.
ai-masters bayesian-inference big-data computer-vision data-analysis data-mining data-visualization deep-learning machine-learning-algorithms natural-language-processing
Last synced: 05 Sep 2025
https://github.com/chayandatta/got_script_manipulation
Game of Thrones Script - String & file manipulation
data-analysis data-science pandas python3
Last synced: 11 May 2026
https://github.com/srinivasrm/graphics_cards_analysis_and_application
In the current project I have extracted graphics card current prices from an authorizer retailer in India and performed analysis
beautifulsoup data-analysis data-science data-visualization etl graphic-card-price-prediction graphics-card graphics-card-analysis heroku-database machine-learning matplotlib pgsql python regression scikit-learn seaborn sql streamlit webapplication
Last synced: 04 Mar 2026
https://github.com/easycris-software/easycris
Professional statistical analysis and RNA-seq for researchers — no coding required
anova bioinformatics data-analysis desktop-app genomics pharmacology research-tools rna-seq statistics tauri
Last synced: 11 May 2026
https://github.com/vhawk19/ambaan
just wants the average analyst to be happi
data-analysis duckdb-wasm sql vue
Last synced: 01 Mar 2026
https://github.com/steno-aarhus/legliv
Substitution of red meat with legumes and risk of primary liver cancer in UK Biobank participants: A prospective cohort study
cancer-research data-analysis epidemiology nutritional-epidemiology nutritional-science open-science reproducibility reproducible-research rstats ukbiobank
Last synced: 03 Mar 2026
https://github.com/ahmednasef3/heart-attack-full-eda
Simple EDA for Heart Attack Dataset.
data-analysis data-science data-visualization eda exploratory-data-analysis heartattack matplotlib pandas seaborn
Last synced: 11 May 2026
https://github.com/cintia0528/data_cleaning_and_analytics-python
Evaluate if aggressive discounting benefits Eniac long-term, considering differing views on customer acquisition and brand positioning. Focus on data cleaning for informed decision-making.
colab-notebook data data-analysis datacleaning dataquality jupyter-notebook matplotlib pandas python seaborn
Last synced: 08 Jan 2026
https://github.com/shubham5027/kisanai--the-ultimate-ai-ml-powered-platform-smart-farming-platform
KisanAI – The Ultimate AI/ML-Powered Smart Farming Platform KisanAI leverages AI/ML to optimize farming practices, enhance crop yields, and empower small-scale farmers with data-driven insights.
ai api aws chatbot crm data-analysis deep-learning deplyment farming llm mapping ml nodejs predictive-modeling reactjs supabase sustainability
Last synced: 30 May 2026
https://github.com/antonioscardace/mri-brainage
Showing Accelerated Brain Ageing in Alzheimer's Patients.
alzheimers-disease brain-age classification data-analysis medical-analysis predictive-modeling regression
Last synced: 18 Jan 2026
https://github.com/john-science/data_science_by_example
Examples of Data Science Tools & Libraries
data-analysis data-science ipython pandas
Last synced: 12 May 2025
https://github.com/olob0/badwords-pt-br
💬 Wordlist com palavrões em pt-BR para análise de dados, filtros, ou texto considerado "evitável"
badword-filter badwords brasil data-analysis filter filter-lists filterlist portugues portuguese text-analysis wordlist
Last synced: 06 Jan 2026
https://github.com/mafda/seattle_airbnb_data_analysis
This repository contains a comprehensive analysis of the Seattle Airbnb dataset, conducted using the CRISP-DM (Cross Industry Standard Process for Data Mining) methodology.
crisp-dm data-analysis data-science jupyter-notebook pandas-python seattle-data
Last synced: 29 May 2026
https://github.com/mayankyadav23/air-bnb-data-analysis
Data analysis and insights from NYC Airbnb listings, focusing on key metrics such as host performance, neighborhood trends, pricing, and customer reviews. Comprehensive documentation of ETL processes and analytical methodologies is provided. Perfect for understanding Airbnb dynamics and decision-making in the NYC market.
advanced-excel business-intelligence data-analysis data-analytics data-visualization power-bi ppt
Last synced: 19 Mar 2026
https://github.com/titanscouting/tra-analysis
Titan Robotics 2022 Strategy Team Analysis Repository
data-analysis frc frc-scouting hacktoberfest python
Last synced: 29 Jan 2026
https://github.com/babak2/synthea-data-analysis
Synthea Data Analysis
data-analysis data-visualization jupyter-notebook jupytext matplotlib numpy pandas python3 seaborn synthea
Last synced: 11 Apr 2026
https://github.com/vasishth/lecturesintrobayes
Please go to the website for these online lectures:
bayesian-inference brms data-analysis stan
Last synced: 06 Feb 2026
https://github.com/denisecase/nlp-03-text-exploration
Exploratory analysis of text corpora using tokenization, frequency, co-occurrence, and bigrams to reveal structure in text.
bigrams co-occurence corpus-analysis data-analysis nlp python text-analysis text-exploration tokenization
Last synced: 02 Jun 2026
https://github.com/ronaldkanyepi/python-streamlit-covid-19-dashboard
This is a responsive streamlit covid 19 Dashboard
analytics data data-analysis data-visualization datascience python streamlit
Last synced: 18 May 2026
https://github.com/anushadatta/airbnb-in-seattle
🏨 Understanding the Airbnb rental landscape in Seattle using data science.
airbnb data-analysis data-exploration data-visualization datascience sentiment-analysis
Last synced: 13 Jun 2025
https://github.com/gustavo-zamai/shop_data_analisys
Analysis diferents shopping mall sells
data-analysis openpyxl pandas python3 pywin32
Last synced: 01 Mar 2025
https://github.com/ultrasage-danz/weather-data-analysis
Weather Data Analysis notebook project. Created using Google collab
collaboration data-analysis data-science dataset google google-colab-notebook project
Last synced: 24 Mar 2025
https://github.com/jamiemagee/rhi
Collating the data on the Renewable Heat Incentive scheme, and presenting it in a more readable format.
data-analysis open-data open-government rhi
Last synced: 25 Feb 2026
https://github.com/rakumar99/jp-morgan-chase-virtual-internship
This repository contains the various tasks assigned by JPMorgan Chase & Co. Virtual Internship on Microsoft Excel
conditional-formatting dashboard data-analysis data-visualization hlookup pivot-tables presentation vba-macros vlookup
Last synced: 02 Mar 2026
https://github.com/tnleite/projeto_king_lift
Este projeto apresenta uma análise detalhada dos dados financeiros da King Lift, uma empresa de locação de empilhadeiras. Utilizando Microsoft Excel, Power Query e Power Pivot, desenvolvi um dashboard interativo, também em Excel, que ajuda a empresa a obter insights valiosos para melhorar a eficiência operacional e aumentar o faturamento.
data-analysis data-science data-visualization excel
Last synced: 19 Mar 2026
https://github.com/antononcube/wl-datareshapers-paclet
Wolfram Language (aka Mathematica) paclet for data reshaping functions, like, long- and wide form, cross tabulation, etc.
contingency-table cross-tabulation data-analysis data-transformation long-form wide-form
Last synced: 20 Mar 2026
https://github.com/shadan100/sales-prediction-analysis
The aim is to build a predictive model and find out the sales of each product at a particular store. Using this model, BigMart will try to understand the properties of products and stores which play a key role in increasing sales.
artificial-intelligence data-analysis data-science django django-framework jupyter-notebook machine-learning matplotlib pandas predictive-modeling python sales-prediction
Last synced: 01 Mar 2026
https://github.com/huseyincenik/tableau
This repository contains Tableau visualizations and related resources for my project.
analytics api bianalyst business-analytics business-intelligence business-solutions dashboard data data-analysis data-science data-structures dataanalysis dataset datavisualization drilldown interactive-visualizations tableau tableau-dashboards viz
Last synced: 19 Mar 2026
https://github.com/is-leeroy-jenkins/sherpa
A budget execution & data analysis tool based on Winforms, .NET 6, and written in C# for EPA analysts
budget-management data-analysis data-science data-visualization federal-government
Last synced: 13 May 2026
https://github.com/jinkogule/multi-analyst
O Multi Analyst é uma ferramenta de análise de dados com uma usabilidade simples, que utiliza inteligência artificial para interpretar os resultados das análises realizadas, retornando insights úteis aos usuários.
apriori-algorithm bootstrap css data-analysis django html numpy open-ai pandas python web-application
Last synced: 12 Apr 2026
https://github.com/guruakaashjn/te_project_microsoft_ai
AI based statistical analysis of land-use plastic pollution in India using AI/ML techniques.
artificial-intelligence data-analysis data-analytics data-science data-visualization machine-learning powerbi
Last synced: 27 Feb 2025
https://github.com/harmanveer-2546/supply-chain
Supply chain analytics is a valuable part of data-driven decision-making in various industries such as manufacturing, retail, healthcare, and logistics. It is the process of collecting, analyzing and interpreting data related to the movement of products and services from suppliers to customers.
customer-segmentation-analysis data data-analysis data-cleaning data-insights ggplot2 numpy pandas performance-evaluation predictive-analytics-for-business python risk-assessment sales-analysis statistical-analysis supply-chain tidyverse trend-analysis
Last synced: 10 Apr 2026
https://github.com/uchida16104/healthanalysis
It abstracts the health status of each device from its operational time calculated from RescueTime and analyzes the data.
data-analysis portfolio portfolio-website security security-tool
Last synced: 02 Feb 2026
https://github.com/zpreisler/modules
Python libraries and modules for processing simulation outputs
data-analysis python scripts tensorflow
Last synced: 13 May 2026
https://github.com/nakshjainsonigara/vba-canteenmanagementsystem
The Canteen Management System is a comprehensive software solution designed to modernize and optimize canteen operations. It aims to simplify the complexities of managing a canteen by automating key processes such as order management, payment processing, and report generation.
canteen canteen-mangement-system charts data-analysis email excel microsoft payment-gateway vba vba-excel vba-macros word
Last synced: 30 Jan 2026
https://github.com/meetup-python-grenoble/datasette-workshop
Exploration de données avec Datasette
data-analysis data-science data-visualization datasette exploratory-data-analysis python sql workshop
Last synced: 13 May 2026
https://github.com/iguptashubham/pizzahut-analysis-sql
best dataset for data analysis. Pizzahut data analysis done by Shubham Gupta in MySql. This dataset is provided by friend of mine intern at pizzahut. In pizzahut, they used this dataset to train and ask question. This data does not reveal anything about the pizzahut. It is safe to share. data
data-analysis data-analytics database dataset datasets mysql mysql-database pizzahut
Last synced: 14 May 2026
https://github.com/infinitode/duplipy
DupliPy is a quick and easy-to-use package that can handle text formatting and data augmentation tasks for NLP in Python. It now offers support for image augmentation tasks as well.
ai augmentation data-analysis data-preprocessing data-science images language-models nlp preprocessing text-data text-datasets text-formatting
Last synced: 28 Jun 2026
https://github.com/edisedis777/duckdb-analyzer
A powerful tool for analyzing large CSV datasets using DuckDB.
csv data-analysis database duckdb
Last synced: 16 Apr 2026
https://github.com/averma205/national-power-outages-severity-analysis
DSC 80 final project at UCSD
data-analysis data-science geospatial-data pandas predictive-modeling python sklearn
Last synced: 09 Feb 2026
https://github.com/nafisalawalidris/international-breweries
This GitHub readme provides an overview of data analysis using SQL on the International Breweries dataset, including dataset description, analysis questions, example SQL queries, and key insights derived from the analysis.
data-analysis insights international-breweries-dataset queries sql
Last synced: 31 Jan 2026
https://github.com/nafisalawalidris/hici-african-foods
HiCi African Foods: Excel dashboard & pivot table analysis of EU food rejection data to identify risks & recommend focus areas for market expansion.
data-analysis data-cleaning data-visualization eu-food-rejection excel-dashboard hici-african-foods market-expansion pivot-tables
Last synced: 19 Mar 2026
https://github.com/prernarohra/quakeguard
QuakeGuard is an innovative project for reducing earthquake intensity and structural damage. It takes a proactive approach to seismic activity, by using complex algorithms and real-time data to improve safety and resilience for people in earthquake-prone areas.
artificial-intelligence backend data-analysis data-science earthquake-intensity final-year-project front-end geology machine-learning open-source python visualization
Last synced: 21 May 2026
https://github.com/akash1070/project---applied-statistics-
To dive deep into this data & find some valuable insights.
data-analysis data-science python statistics
Last synced: 30 Apr 2026
https://github.com/pradeepchegur/seamantic_web_design
We designed a semantic web for Instagram in Wix platform.
data-analysis framework instagram semantic-web website-design wix
Last synced: 19 Mar 2026
https://github.com/brunomontezano/benzocovid
💊 Data Analysis Project of Benzodiazepines during COVID-19 Pandemic.
benzodiazepines covid-19 data-analysis
Last synced: 28 Feb 2025
https://github.com/abhi18av/innovation-competition
Submission for a programming challenge
clojure clojurescript data-analysis
Last synced: 13 Jun 2026
https://github.com/aliciagilmatute/analisis-multinivel-bayesiano
Este estudio explora el análisis multinivel desde un enfoque bayesiano para evaluar la variabilidad del rendimiento en matemáticas entre 10 centros educativos
bayesian-statistics cmdstanr data-analysis hierarchical-models multilevel-models rstats rstudio stan
Last synced: 29 Jun 2026
https://github.com/swarnim1812/crime_project
AI-Driven Crime Forecasting Across Indian States — A pioneering machine learning project that harnesses time series modeling (SARIMAX, Ridge Regression) to uncover patterns and forecast crime trends using real-world multi-state temporal and socio-economic data.
analytics crime-locator crime-prediction data-analysis deep-learning machine-learning prophet-facebook sarimax-model time-series-forecasting
Last synced: 31 Jan 2026
https://github.com/jiachengwang-punch/predictive-analytics-skill
A reusable, multi-model, language-adaptive methodology for end-to-end machine learning analysis of tabular data.
claude-skill codex-skill data-analysis data-science deepseek feature-engineering lightgbm llm machine-learning methodology prompt-engineering tabular-data
Last synced: 30 May 2026
https://github.com/chiemekaifemegbulem/make.com
A curated portfolio of Make.com automation workflows engineered to streamline operations and ensure precision. Featuring solutions for e-commerce, data integration, marketing, and bespoke business processes, it exemplifies expertise in designing scalable, efficient, and dependable automated systems.
api automate automated automation business data-analysis data-science dataengineering integration integromat make scenario software-engineering upwork workflows
Last synced: 15 Feb 2026
https://github.com/reinmagine/eliminating-no-sensor
Contains my project that analyzes air quality sensor data to determine if the NO (Nitric Oxide) sensor in N. Mai, Los Angeles, CA can be removed without affecting data accuracy.
air-quality-sensor colab-notebook cost-optimization data-analysis data-optimization matplotlib-python nitric-oxide pyspark-python python sql
Last synced: 14 Jun 2026
https://github.com/soufianboukir/ecom-analytics-platform
End-to-end data science project on an Amazon sales dataset, including data preprocessing, analysis, modeling, and a Streamlit dashboard for insights and decision-making.
data-analysis data-science data-visualization data-visualization-dashboard forecasting-models timeseries
Last synced: 14 Jun 2026
https://github.com/garcane/british-airways-analysis
This project focuses on analyzing and visualising travel data from British Airways using Tableau. The goal is to extract insights and present them in an interactive and visually appealing manner.
data data-analysis data-visualization tableau
Last synced: 19 Mar 2026
https://github.com/chahiriabderrahmane/carpricepredictor
🚗 Cars Exploration & Price Prediction | Analyzing Cars.com Listings
data-analysis data-science data-visualization machine-learning python streamlit web-scraping
Last synced: 08 Feb 2026
https://github.com/edikedik/lxtractor
Library for analysing protein structures and sequences
bioinfomatics computational-biology data-analysis data-mining feature-extraction python structural-biology
Last synced: 14 Feb 2026
https://github.com/kaz-yos/distributed
Comparison of Privacy-Protecting Analytic and Data-sharing Methods: a Simulation Study (Pharmacoepidemiol Drug Saf 2018)
data-analysis epidemiology statistics
Last synced: 15 Jun 2026
https://github.com/chetanmalviya513/Firm-Financial-Transaction-Analysis
📊 Financial Analysis & Forecasting Processed large-scale financial data using Python for trend analysis and insights. Developed interactive Tableau dashboards to improve forecasting accuracy and reduce costs by 25%.
data-analysis financial-data forecasting insights msexcel pandas python reporting tableau-dashboards
Last synced: 15 Jun 2026
https://github.com/thevinh-ha-1710/rstudio-statistics
This project deeply studies 2 datasets using applied statistics techniques.
applied-statistics data-analysis data-science data-visualization rmarkdown rstudio
Last synced: 31 Jan 2026
https://github.com/daniel1kp/openrtb-dashboard
This is a demo project designed to illustrate using Rill to analyze programmatic bid logs using the canonical open RTB framework.
data-analysis openrtb real-time-bidding rill
Last synced: 19 Mar 2026
https://github.com/metalwarrior665/actor-results-checker
apify data-analysis json-schema-checker
Last synced: 16 Jun 2026
https://github.com/techshot25/baltimore-911-calls
Analysis of 911 calls provided by the city of Baltimore.
data-analysis data-science decision-tree-classifier logistic-regression machine-learning machine-learning-algorithms statistics
Last synced: 16 Jun 2026
https://github.com/jakobzmrzlikar/fake-news-analysis
An analysis of the FakeNewsNet dataset using NLP techniques.
data-analysis fake-news ipynb-jupyter-notebook nlp-machine-learning
Last synced: 05 Mar 2026
https://github.com/avijit-jana/redbus-data_scraping_and_filtering_with_streamlit_app
A Streamlit-based application leveraging Selenium to automate data scraping from Redbus, enabling efficient collection, analysis, and visualization of bus travel data for improved operational efficiency and strategic planning in the transportation industry.
automation dashboard data-analysis data-visualization datadrivendecisions python3 redbus selenium-python streamlit-application webscrapping
Last synced: 15 Mar 2025
https://github.com/thevinh-ha-1710/diabetes-predictive-model
This project aims to train a predictive model to diagnose diabetes on women patients.
data-analysis data-science data-visualization model-training-and-evaluation python
Last synced: 13 Feb 2026
https://github.com/ndohvich/ibm-data-science-professional-certificate
Kickstart your career in data science & ML. Build data science skills, learn Python & SQL, analyze & visualize data, build machine learning models. No degree or prior experience required.
coursera dash data-analysis data-science html5 ibm ibm-professional-certificate javascript machine-learnng python sql
Last synced: 16 Nov 2025
https://github.com/nomadsdev/financial-trend-analyzer
FinancialTrendAnalyzer helps analyze and visualize sales data to uncover financial trends. It uses Python to calculate total sales, track changes, and generate insightful charts for better decision-making.
business-intelligence data-analysis data-visualization financial-analysis matplotlib numpy pandas python revenue-trends sales-data seaborn time-series-analysis
Last synced: 19 Jan 2026
https://github.com/apache/cloudberry-devops-release
DevOps and Release for Apache Cloudberry (Incubating)
ai big-data cloudberry data-analysis data-warehouse database devops distributed-database greenplum mpp olap postgres postgresql
Last synced: 04 Sep 2025
https://github.com/com-480-data-visualization/project-2023-choo-choo-data-darlings
This repository contains the source code for our data visualization project, an interactive platform designed to explore the intricate Swiss transportation network. Developed by the Choo Choo Data Darlings team at EPFL, the project provides an in-depth view into the vast array of Swiss transportation operations, including trains, buses, and trams.
boats buses data-analysis data-science data-visualisation data-visualization epfl metro public-transport public-transportation switzerland trains trams
Last synced: 01 May 2026
https://github.com/sivkri/perseus-ms-proteomics-venn
Mass spectrometry Perseus Data analysis
data-analysis mass-spectrometry perseus proteomics proteomics-data proteomics-data-analysis proteomics-data-integration
Last synced: 14 Apr 2026
https://github.com/riddhis2226/titanic-survival-data-analysis
Titanic-Survival-Data-Analysis : Analyze passenger data from the Titanic to predict survival based on features like age, gender, class, and fare.
data-analysis data-mining data-science data-visualization database jupyter-notebook machine-learning-models machinelearning-python plotlyjs python3
Last synced: 01 May 2026
https://github.com/billgewrgoulas/hypothesis-testing-on-37-seasons-of-nba
First assignment for the course Data Mining @CSE.UOI
data-analysis data-science numpy scipy seaborn statistics
Last synced: 01 May 2026
https://github.com/listiangr/ecommerce_sales_data_analysis
Proyek ini menganalisis data penjualan e-commerce untuk membantu bisnis memahami tren penjualan, performa produk, dan segmen pelanggan. Tujuan utamanya adalah memberikan wawasan yang dapat meningkatkan strategi pemasaran dan pengelolaan produk.
dashboard data-analysis data-cleaning data-collection data-penjualan data-visualization exploratory-data-analysis microsoft-excel
Last synced: 19 Jan 2026