Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-07-02 00:07:33 UTC
- JSON Representation
https://github.com/khulnasoft/data-science-materials
data-analysis data-engineering data-science data-visualization
Last synced: 17 Oct 2025
https://github.com/codeslash21/communicate_data_findings
Analyze and visualize Bay Wheel system data which contains 2.5M individual trips data. And communicate the data findings from the dataset in the form notebook slide.
bay-wheel data-analysis data-visualization explanatory-data-visualization exploratory-data-analysis
Last synced: 22 Jan 2026
https://github.com/yulia-momotyuk/dla-data-analysis-practice
This repository contains my homework assignments completed during the "Data Analyst in IT" course at Data Loves Academy.
analytics data-analysis data-visualization excel mysql numpy pandas postgres powerbi python seaborn sql tableau
Last synced: 14 Apr 2026
https://github.com/lucashomuniz/Project-03
Data-Driven Decision Making: Selecting the Best Regression Model for E-commerce Sales
benchmark-framework data-analysis data-driven data-visualization e-commerce-project language-python lasso-regression linear-regression-models machine-learning python ridge-regression
Last synced: 20 Oct 2025
https://github.com/jimohola/zomato-restaurant-ratings-ml
Flask Deployment Machine Learning
css data-analysis flask html machine-learning python3
Last synced: 04 May 2026
https://github.com/mothraa/etl-marketanalysis-webscraping-poo
OC project 2 refactoring (POO version not yet completed)
data-analysis etl poo python web-scraping
Last synced: 20 Oct 2025
https://github.com/saisurajmatta/nashville-housing-data-cleaning-project
Clean and standardize Nashville Housing dataset using SQL queries for improved data quality and structure.
azure-data-studio data-analysis mssql mysql sql sql-data-cleaning sql-queries sql-server-management-studio
Last synced: 23 Jan 2026
https://github.com/dcs-training/spatial_dynamics
Use of QGIS and R to analyse first and second order geospatial effects. Go to the Readme file
data-analysis geographical-data gis qgis r statistics
Last synced: 23 Oct 2025
https://github.com/alessandroryo/bike-rental-data-analysis
A data analysis project focused on understanding and predicting bike rental patterns. This project utilizes data processing, visualization, and predictive modeling techniques to gain insights into bike rental usage, fulfilling the final submission requirement for Dicoding Indonesia's Data Analysis course.
bike-rental data-analysis data-visualization jupyter-notebook machine-learning python streamlit
Last synced: 09 Apr 2026
https://github.com/a26nine/kortext-usage-dashboard
An interactive data visualisation dashboard built using Tableau software to understand the value of digital resources issued on Kortext platform at Middlesex University, London.
data-analysis data-science data-visualization knime tableau
Last synced: 01 Feb 2026
https://github.com/psychelzh/cogstruct-old
Data Analysis on Cognitive Structure
cognition data-analysis intelligence psychology
Last synced: 25 Oct 2025
https://github.com/gjjvdburg/veld
Easy command line analytics
cli command-line-tool data-analysis data-science data-visualization statistics
Last synced: 26 Oct 2025
https://github.com/aakk23/professional-survey-powerbi
This Power BI dashboard analyzes survey data from data professionals, highlighting salary trends, job roles, and career satisfaction. It provides insights into work-life balance, programming language preferences, and industry demographics.
data-analysis data-visualization dax excel powerbi powerquery
Last synced: 23 Feb 2026
https://github.com/codewithjazmine/bookbot
Python command-line tool that analyzes text files for word count and character statistics
command-line-tool data-analysis learning-project python text-analysis
Last synced: 23 Jan 2026
https://github.com/9dl/usbfalcon
Automatically copies files from plugged USB drives to a specified location, enabling quick data retrieval for analysis.
automation data-analysis data-retrieval ethical-hacking file-copying usb
Last synced: 27 Oct 2025
https://github.com/OneMoreDavid/python-like-a-boss
This is where I stash my Python study material.
data data-analysis data-engineering data-science data-visualization datascience ipynb ipynb-jupyter-notebook ipynb-notebook numpy pandas python python3
Last synced: 28 Oct 2025
https://github.com/valentinoli/swiss-foodprint
Project in Applied Data Analysis, EPFL 2019
carbon-emissions data-analysis diet foodprint swiss switzerland
Last synced: 24 Jan 2026
https://github.com/mysto-007/cyclistic-bike-share-analysis
Analyzed the dataset of Cyclistic Rental Service as the Capstone project for Google Data Analytics SpecializationAnalyzed the dataset of Cyclistic bike-share (Capstone project for Google Data Analytics Specialization)
bigquery data-analysis excel ms-sql-server sql tableau tableau-public
Last synced: 16 Mar 2026
https://github.com/wassimhd/pwc-switzerland-power-bi-in-data-analytics-virtual-case-experience
The Project helps to build a foundation in data analysis and Power BI software which is provided by PWC virtual internship
data-analysis data-visualization datastorytelling powerbi
Last synced: 28 Jan 2026
https://github.com/andreicirciumaru/best-of-breed
CSV fundamentals screener: schema validation + market-cap weights
csv data-analysis finance pandas python screener
Last synced: 15 Apr 2026
https://github.com/engineertolulope/us_states_living_ranking_analysis
Python script for analyzing and ranking U.S. states based on factors like cost of living, tax burden, diversity, crime rates, and climate. Uses weighted criteria to identify the best states to live in according to these metrics. Ideal for decision-making on relocation.
data-analysis data-science linear-regression machine-learning python scikit-learn
Last synced: 29 Jan 2026
https://github.com/data-forge-notebook/ohlc-aggregation-example
An example of aggregating OHLC stock data using Data-Forge Notebook
algorithmic-trading data data-aggregation data-analysis ohlc quantitative-finance share-market stock-market trading
Last synced: 30 Jan 2026
https://github.com/edumoraes1/comissao-reduzida
Criação de segmentação de publico via SQL para nova feature do enjoei de comissão reduzida
bq data-analysis salesforce sql
Last synced: 06 Feb 2026
https://github.com/ljadhav25/decision-tree-random-forest-algorithm-data-science-
This repository contains an implementation of decision tree and random forest algorithms from scratch in Python. Decision trees and random forests are popular machine learning algorithms used for classification and regression tasks. The goal of this project is to provide a clear and understandable implementation of these algorithms
data-analysis data-science decision-trees machine-learning-algorithms matplotlib numpy pandas python random-forest-classifier
Last synced: 15 Apr 2026
https://github.com/jcaperella29/jc_bioinformatics_hub
A personal hub to showcase my bioinformatics applications including RNA-Seq, ATAC-Seq, and miRNA-Seq analysis tools. Powered by simple HTML, CSS, and JavaScript with a biotech-themed design.
atac-seq bioinformatics biotech data-analysis github-pages portal rna-seq webapp
Last synced: 25 Feb 2026
https://github.com/aavishkarmahajan/sql
SQL code assignments and practice questions from SQL courses, SQL data analysis
Last synced: 07 Feb 2026
https://github.com/ginanti-riski/streamlit_datapenyewaansepeda
Analisis Bike Sharing adalah proyek yang bertujuan untuk memahami pola penyewaan sepeda berdasarkan berbagai faktor seperti cuaca, musim, dan hari. Proyek ini menggunakan teknik analisis data untuk mendapatkan wawasan yang lebih dalam mengenai tren peminjaman sepeda.
data-analysis data-analysis-python data-science data-visualization python streamlit
Last synced: 15 Apr 2026
https://github.com/nikitalpopov/evotor_champ
solution for evotor data challenge
data-analysis data-science python scikit-learn
Last synced: 15 Apr 2026
https://github.com/axsk/geekgraph
parse, cluster and visualize boardgamegeek.com user profiles
Last synced: 01 Feb 2026
https://github.com/rissh/titanicsurvivalpredictionusingml
Predicting Titanic passenger survival through machine learning. This project includes data preprocessing, exploratory data analysis, feature engineering, and model training using Python. 🚢
data data-analysis data-science data-visualization dataanalysis jupiter-notebook machine-learning machine-learning-algorithms machinelearning matplotlib numpy pandas prediction prediction-model python python3 seaborn tenserflow tflearn titanic
Last synced: 01 Feb 2026
https://github.com/vishnu-vamshii/data-science-jobs-salaries
Created an interactive dashboard to analyze data science jobs salaries in different regions of the world, experience levels, average salaries in USD and type of employment along with a geographical visual.
data-analysis data-science data-visualization tableau tableau-dashboard
Last synced: 01 Feb 2026
https://github.com/vladimiracunadev-create/python-data-science-program
Python Data Science Program — 197 clases en 9 partes. Pauta avanzada derivada de Géron, VanderPlas, Huyen, ISLP y Barocas/Hardt/Narayanan. Recurso personal de aprendizaje, enseñanza y mejora continua.
bootcamp data-analysis data-science education jupyter machine-learning matplotlib numpy pandas python scikit-learn
Last synced: 01 Jun 2026
https://github.com/amanraghuvanshi/adidas-western-zone-sales
Adidas United States Sales Report Analysis
data-analysis datatable pandas plotly statsmodels time-series
Last synced: 08 Feb 2026
https://github.com/suhail25/hotel-booking-analysis
Analyzed the cancelling of booking of hotels and summarized insights to the Hotel Manager to increase profit by 30%. Demonstrated data exploration, cleaning, analysis using Python and its libraries: pandas, seaborn, matplot. Documented the results in PDF report: reduced cancellation by 30% and releasing discounts for 10 days in a month.
data-analysis ipynb-notebook matplotlib pandas python seaborn
Last synced: 08 Feb 2026
https://github.com/grindelfp/datasets-analysis
The Machine Learning and Data Analysis course task dedicated to training skills of data normalizing and preprocessing.
data-analysis datasets ipynb mlda
Last synced: 05 Mar 2026
https://github.com/josericodata/statisticsapp
Interactive statistics analysis app using Python and Streamlit. Perform key statistical tests, visualise distributions, and explore data with ease.
alpha-value chi-square-test confidence-intervals data-analysis dublin dublin-ireland europe hyphotesis-tests ireland normal-distribution null-hypothesis p-value portfolio python statistics streamlit t-test tech ubuntu z-test
Last synced: 26 Feb 2026
https://github.com/barraharrison/airbnb-price-trends
Looking at how Airbnbs differ in price when it comes to location, room type and host activity
data-analysis data-science pandas plotly python streamlit
Last synced: 09 Feb 2026
https://github.com/27ahmad/amazon-sales-analysis
This repository contains an exploratory data analysis (EDA) and visualization project of Amazon sales data. The goal is to uncover insights and present key metrics through a Tableau dashboard.
data-analysis eda pandas python seaborn tableau
Last synced: 15 Apr 2026
https://github.com/mathusanm6/critics-vs-players-analysis
This data analysis examines the relationship between critic scores, sales (owners), player engagement, and pricing to determine the ROI of critic reviews.
data-analysis data-science data-visualization game-reviews games-sales jupyter-notebook python-3 steam-games
Last synced: 16 Apr 2026
https://github.com/purushothamadluru/kpi-driven-insights-dashboard-customer-churn-analysis
This repository features a Power BI project designed to deliver KPI-driven insights into customer churn patterns. Leveraging a robust dataset and advanced data modeling techniques, this project uncovers trends, identifies key drivers of churn, and enables businesses to make data-driven decisions.
customer-churn-analysis data-analysis insights-dashboard kpi powerbi
Last synced: 09 Feb 2026
https://github.com/kathisnehith/medicare-ip-hospital-analysis
In-depth Data analysis and visualization of Medicare inpatient hospital data.
data-analysis data-cleaning-and-preprocessing data-merging excel exploratory-data-analysis medicare-claim-costs-prediction powerquery sql tableau-dashboards
Last synced: 10 Feb 2026
https://github.com/prateekbisht23/inventory_management
This project is an Inventory Management System built using Python (Pandas, NumPy, SciPy) and Jupyter Notebook. It allows efficient tracking of stock, performing data analysis, and generating useful statistical insights (mean, standard error, confidence intervals) to support better decision-making.
data-analysis jupyter-notebook management python3
Last synced: 11 Feb 2026
https://github.com/chinmayee4/sales-analysis-for-ferns-n-petals
Analyzed Data By Creating Interactive Dashboard Using MS Excel
data-analysis data-cleaning data-visualization excel pivot-tables powerquery
Last synced: 11 Feb 2026
https://github.com/shrutiijoshi/crm-sales-analysis
The dataset contained records exported from MavenTech's CRM from October 2016 to December 2017. It held details of opportunities with associated information such as product, account, and whether the sale was won or lost.
data-analysis data-visualization dax-functions powerbi powerquery
Last synced: 11 Feb 2026
https://github.com/dhruwsunita/car-sales-dashboard
Car sales dashboard using Tableau visualization tool.
car-sales data-analysis data-visualization excel kpis tableau
Last synced: 27 Feb 2026
https://github.com/joemull/pyjade
A data curation script for the Jane Addams Digital Edition
data-analysis digital-humanities
Last synced: 11 Feb 2026
https://github.com/virajbhutada/telecom-customer-churn-prediction
Predict and prevent customer churn in the telecom industry with this project. Harness the power of advanced analytics and Machine Learning on a diverse dataset to develop a robust classification model. Gain deep insights into customer behavior and identify critical factors influencing churn using interactive Power BI visualizations.
churn-prediction classification-models customer-attrition-analysis customer-churn-prediction data-analysis data-science decision-tree-classifier eda logistic-regression machine-learning machine-learning-algorithms machine-learning-models pandas powerbi powerbi-desktop python random-forest-classifier roc-curve xgboost-classifier
Last synced: 09 Apr 2026
https://github.com/sharmas1ddharth/mode_of_transport_analysis
This project requires you to understand what mode of transport employees prefers to commute to their office. The data includes employee information about their mode of transport as well as their personal and professional details like age, salary, and work exp. We need to predict whether or not an employee will use private transport. Also, which variables are a significant predictor behind this decision.
Last synced: 11 Feb 2026
https://github.com/bala-1409/sql-projects
The repository contains Structured Query Language (SQL) Scripts. The Multiple SQL scripts for various projects which includes data cleaning, data pre-processing, data processing, data transformation and insights gaining through Query Language.
data-analysis data-mining data-science data-transformation database eda etl-framework exploratory-data-analysis microsoft-sql-server query-language sql sql-server sql-server-database sql-server-management-studio
Last synced: 27 Feb 2026
https://github.com/projects-developer/ransomware-prediction-using-machine-learning-project
The project aims to develop a machine learning-based system to predict and detect ransomware attacks on computer systems. Ransomware is a type of malware that encrypts a victim's files and demands a ransom in exchange for the decryption key. Project Includes Source Code, PPT, Synopsis, Report, Documents, Base Research Paper & Video tutorials
artificial-intelligence btechproject computerscienceproject cybersecurity-malware data-analysis data-mining deep-learning machinelearning mtechproject neural-networks ransomware-machine-learning
Last synced: 12 Feb 2026
https://github.com/rahulsm20/storedata
A data analysis project aimed at analyzing the sales data of the super store and providing useful insight into customer preferences.
data-analysis matplotlib numpy pandas python streamlit
Last synced: 16 Apr 2026
https://github.com/ryan-wong1/nyc-job-postings-data-analysis
City of New York Current Job Postings 2024
data-analysis data-cleaning exploratory-data-analysis sql
Last synced: 13 Feb 2026
https://github.com/secureauditx/ecommerce-user-behavior-analysis
E-commerce User Behavior Analysis with Streamlit Dashboard
customer-segmentation data-analysis ecommerce python streamlit
Last synced: 28 Feb 2026
https://github.com/kambleakash0/mubi_eda
Mini Project #1 for EAS503 course at SUNY Buffalo
data-analysis data-visualization eda
Last synced: 16 Apr 2026
https://github.com/balajimohan18/tableau-visualization-project
This repository contains Visualization Projects which is visualized through Tableau Software, by using the visualization we can gain multiple insights and strategies which helps to develop the business for gaining high profit margins and also it provides social values in some cases to reduce damages by calamities.
data-analysis data-science data-visualization exploratory-data-analysis tableau tableau-public
Last synced: 19 Mar 2026
https://github.com/misszeferino/nashville-housing-data-cleaning
Data cleaning using SQL
data-analysis data-cleaning sql
Last synced: 19 Mar 2026
https://github.com/suhail25/pizza-sales-analysis
Delved into detailed analysis of sales data presented in Excel by Pizza sales manager; implemented strategic pricing adjustments resulting in a 25% revenue surge and enhanced profit margins. Explore and cleaned the data set using SQL and then performed data analysis by filtering the 12% of data using SQL commands in MySQL.
data-analysis excel powerpoint-presentations sql
Last synced: 15 Feb 2026
https://github.com/achique-luisdan/tops-songs-db
Base de datos de Tops Semanales de Canciones🎵 más reproducidas en Spotify🎶. Prácticas de SQL enfocadas en el Análisis de Datos (Data Analysis).
Last synced: 15 Feb 2026
https://github.com/siddhant2105s/bring-your-own-device-boyd-system
This repository contains the design and implementation of the Bring Your Own Device (BYOD) System for managing personal devices at Life Insurance Company. It includes an ERD diagram, MySQL scripts for database creation, data insertion, and queries, as well as detailed data definitions and system requirements documentation.
data-analysis database-design database-normalization entity-relationship-diagram entity-relationship-models my-sql relational-databases relational-model sql-queries
Last synced: 15 Feb 2026
https://github.com/k-bloch/car-theft-analysis
A dashboard created to inform the public about car theft, providing insights extracted from real-world police stats.
data-analysis maven-analytics tableau
Last synced: 19 Mar 2026
https://github.com/grishmahat/discord-data-cli
A terminal UI tool to analyze your Discord data exportbuilt in Rust
cli data-analysis discord discord-data ratatui rust terminal tui
Last synced: 01 Mar 2026
https://github.com/yash22222/pwc-power-bi-virtual-case-experience
The Power BI PwC Virtual Case Experience is an exciting and educational program designed to provide participants with hands-on exposure to Power BI, a prominent business intelligence and data visualization tool, within the context of consulting at PwC.
business-analyst business-analytics business-intelligence dashboard data-analysis data-analyst data-analytics dax microsoft-power-bi powerbi powerbi-dashboards powerbi-visuals pwc
Last synced: 02 Mar 2026
https://github.com/dmatking/dtlab
Date Time Lab
csv data-analysis data-quality datetime python timezone
Last synced: 02 Jun 2026
https://github.com/madusales/powerbi-etl-elt
Venho estudando, através do Bootcamp da DIO sobre Data Analytics & Power BI, acerca do uso de SQL para criar soluções em BI. Esse repositório é dedicado a registrar os meus conhecimentos adquiridos até então sobre o que é BI, Tipos de análises, ETL e ELT.
big-data business-intelligence data-analysis powerbi
Last synced: 19 Mar 2026
https://github.com/elrf3lipes/ramon-s_portfolio
I'm passionate about Cloud and DevOps, and for the moment I'm posting some of my work and personal projects here to showcase that. If its useful for you, feel free to integrate or contribute!
api-integration biopython clinical-trials data-analysis data-extraction data-parsing django docker entrez ipython medline-xml pandas pubmed-parser requests rest-api
Last synced: 27 Mar 2026
https://github.com/ibrahimceyisakar/hotel-finder-streamlit-dashboard
Streamlit dashboard of hotel-finder
data-analysis data-science data-visualization pandas plotly python streamlit
Last synced: 16 Apr 2026
https://github.com/dvaser/world-happiness-expanatory-data-analysis
DATA ANALYSIS
data-analysis data-visualization dataset jupyter jupyter-notebook kaggle python
Last synced: 03 Mar 2026
https://github.com/samalyarov/practicum_projects
Various data analysis projects displaying tools and instruments that I am proficient with
data-analysis datetime folium geojson matplotlib numpy pandas plotly postgresql powerpoint python regular-expressions requests-library-python scipy seaborn sql sqlalchemy tableau tqdm
Last synced: 02 Apr 2026
https://github.com/lintangwisesa/ujian_analyticsvisualization_jcds07
Panduan Soal Ujian Data Analytics & Visualization Job Connector Data Science batch 7
data-analysis data-science data-visualisation exam
Last synced: 04 Mar 2026
https://github.com/edanur-y/bank-customer-churn-prediction-with-classification-models
Comparing the performances of multi-layer perceptron, decision tree, random forest, gradient boosting and extreme gradient boosting classifications on customer data to predict their status of exiting the bank.
data-analysis data-transformation hyperparameter-tuning python
Last synced: 16 Apr 2026
https://github.com/bishopce16/school_district_analysis
The school board requested an analysis on the various performance metrics for the school district.
data-analysis jupyter-notebook numpy pandas python visual-studio-code
Last synced: 16 Apr 2026
https://github.com/kosuri-indu/allaboutolympics
All About Olympics is an interactive dashboard presenting comprehensive data and insights on Olympic Games from 1896 to 2020.
data-analysis pandas plotly python streamlit
Last synced: 16 Apr 2026
https://github.com/themihirmathur/qlik-intern-project
Qlik Analysis of Road Safety & Accident Patterns in India 📈 Analyzed & visualized road safety data for 20.85k+ accident cases with 9+ accident data patterns in India using Qlik 📉 Reduced inefficiencies by 25% by developing design of an avant-garde data tracking dashboard that monitored injuries.
data-analysis data-visualization presentation qlik qlik-cloud qlik-sense qlikview
Last synced: 04 Mar 2026
https://github.com/ribin-baby/the-sparks-foundation-data-science-internship
This repository contains tasks and solutions assigned as part of internship program. This repository contains workbooks on data analysis and model building parts.
Last synced: 16 Apr 2026
https://github.com/kheriberto/knn_project
This is a simple project that uses dummie data to practice and demonstrate my knowledge of the KNN algorithm.
data-analysis knn-classifier numpy python scikit-learn seaborn
Last synced: 02 Apr 2026
https://github.com/ngangawairimu/linear-regression-
This project builds a linear regression model in Python to predict outcomes and derive insights from feature data. It covers data cleaning, feature analysis, and model evaluation, showcasing predictive modeling techniques using scikit-learn, pandas, and visualization libraries.
data-analysis linear-regression machine-learning predictive-modeling python scikit-learn
Last synced: 17 Apr 2026
https://github.com/jhrcook/checkplease
Analysis of an immune checkpoint-blockade screen.
bayesian-statistics data-analysis pymc3 python python3 r
Last synced: 17 Apr 2026
https://github.com/atlassandx90/cryptocurrency-volatility-prediction
Cryptocurrency volatility prediction ML pipeline
cryptocurrency data-analysis data-science data-visualization machine-learning
Last synced: 17 Apr 2026
https://github.com/dharininadkar/movies-data-dashboard
Data Analysis of Movies data
data-analysis data-mining data-science data-visualization ms-excel ms-sql-server tableau
Last synced: 04 Apr 2026
https://github.com/sharmas1ddharth/data-analysis-with-python
Freecodecamp's Data Analysis with Python Projects Code
data-analysis data-analysis-with-python freecodecamp-project
Last synced: 03 Jun 2026
https://github.com/rishisolanke/pdf_query_langchain
PDF Query LangChain is a tool that extracts and queries information from PDF documents using advanced language processing. Leveraging LangChain, OpenAI, and Cassandra, this app enables efficient, interactive querying of PDF content. Ideal for data analysis, research, and automated reporting, it simplifies detailed document analysis with ease.
artificial-intelligence data-analysis document-query langchain natural-language-processing nlp openai pdf-analysis pdf-extraction python research-tool
Last synced: 17 Apr 2026
https://github.com/victoorv/prediction_covid19
Prédire si un invidu est positif au COVID19 ou non.
classification covid-19-classifier covid-19-data-analysis covid19-data data-analysis data-science data-visualization exploratory-data-analysis hyperparameter-tuning machine-learning machine-learning-algorithms neural-networks oversampling-algorithms python statistical-tests statistics
Last synced: 04 Apr 2026
https://github.com/victoorv/criminalite_us
Une analyse de la criminalité en fonction de variables socio-économiques a été menée, incluant la sélection et la comparaison de modèles de régression multiple ainsi que des tests d'hypothèses sur les coefficients et la significativité des modèles.
data-analysis data-science r regression regression-analysis regression-models statistical-analysis statistical-tests statistics
Last synced: 04 Apr 2026
https://github.com/vitornegromonte/eda_stroke
Exploratory data analysis in the stroke prediction dataset
data-analysis data-science exploratory-data-analysis kaggle-dataset visualization
Last synced: 17 Apr 2026
https://github.com/ahmad-ali-rafique/decision-tree-regressor-modeling
Comprehensive exploration of decision tree regressors, including data cleaning, model building, and performance evaluation on various datasets.
artificial-intelligence data data-analysis dataanalytics decision-trees decisiontreeregressor modeling models regression-models
Last synced: 17 Apr 2026
https://github.com/santos-k/fashion-recommender-dashboard
The project is a neural network-based fashion recommendation system built using Python. The model used for this system is Resnet50, which is a deep learning model used for image recognition. The data used for training the model is scraped from Flipkart, with a total of 65,000 images.
ann cnn dash dashboard data-analysis data-science deep-learning eda gcp heroku kera machine-learning nueral-networks plolty python tensorflow
Last synced: 04 Apr 2026
https://github.com/prangonghose/wikipedia-blocking-policies
This study investigates the relationship between editors’ disruptive behavior and regulation policies on English Wikipedia, focusing on the Blocking Policy page. The study collects and analyzes data from 2004 to 2022 using the Wikipedia API, page statistics, and keyword extraction.
data-analysis data-visualization matplotlib open-source pandas python3 seaborn
Last synced: 18 Apr 2026
https://github.com/awanraskall/retail-demand-analysis
Data analysis of retail meal orders, fulfillment centers, and product demand using Python
data-analysis data-visualization jupyter-notebook numpy pandas python
Last synced: 18 Apr 2026
https://github.com/vansh-py04/data-analysis-questions-pandas-numpy-sql
Solution to 450+ Data Science Tech Stack questions essential for Data Analysts and Scientists!
data-analysis data-science deepnote machine-learning numpy pandas python sql
Last synced: 18 Apr 2026
https://github.com/mtimma001/clinical-trial-data-tool
Clinical Trial Data Analysis Tool is a Flask-based web app for healthcare professionals to manage and analyze clinical trial data. It features full CRUD functionality, interactive visualizations (Plotly/Matplotlib), a responsive Bootstrap UI, MySQL database integration, and Heroku deployment for accessible, scalable use.
bootstrap5 clinical-trials crud data-analysis data-visualization flask healthcare heroku mysql pandas plotly python
Last synced: 05 Apr 2026
https://github.com/jordanconallluthaiswright/purchase-behaviour-data-analysis
This project analyzes Black Friday purchase behavior for Company XYZ, uncovering trends by gender, age, and location. Using data cleaning, statistical analysis, and visualization, it evaluates spending patterns, confidence intervals, and category preferences to provide actionable insights for optimizing marketing strategies and targeting.
business-analytics data-analysis jupyter-notebook python
Last synced: 18 Apr 2026
https://github.com/mi7773/advanced_sql_data_analytics_project
A hands-on SQL project simulating data analysis using fact and dimension tables, covering trends over time, cumulative metrics, performance breakdowns, segmentation, and reporting via SQL.
analytics business-analytics business-intelligence data data-analysis data-analyst data-analytics database query reporting sql sql-queries sql-query sql-server window-functions window-functions-in-sql
Last synced: 18 Apr 2026
https://github.com/stimulsoft/samples-dashboards.web-for-blazor-webassembly
Blazor WebAssembly (Wasm) samples for Reports.BLAZOR embedded components, Visual Studio C# projects, .NET 6, .NET 7, .NET 8 dashboards tool
blazor client-side converter dashboard data data-analysis data-sources database datagrid designer diagram dimension json net presentation print runtime viewer wasm webassembly
Last synced: 18 Apr 2026
https://github.com/shubh-bharadwaj/income-dataset-analysis
data-analysis data-science pandas python
Last synced: 18 Apr 2026
https://github.com/arv-anshul/notebooks
My Jupyter notebooks in which I practice data science.
data-analysis data-science jupyter-notebook llm machine-learning marimo matplotlib regression transformers
Last synced: 19 Apr 2026
https://github.com/yuvrajsaraogi/unemployment-analysis-with-python
Unemployment is measured by the unemployment rate which is the number of people who are unemployed as a percentage of the total labour force. We have seen a sharp increase in the unemployment rate during Covid-19, so analyzing the unemployment rate can be a good data science project.
big-data big-data-analytics data-analysis data-science data-visualization engineering excel jupyter-notebook machine-learning mini-project natural-language-processing nlp project python3 sql
Last synced: 19 Apr 2026
https://github.com/souraevshing/data-science-01
Data analysis using jupyter notebook.
data-analysis data-science data-visualization jupyter-notebook python
Last synced: 19 Apr 2026
https://github.com/diegoglezsu/bulletin-fetcher
bulletin-py is python package to easily fetch bulletins and legal acts from a wide variety of sources of Eurpean Union.
data-analysis european-union legal-documents python sparql
Last synced: 19 Apr 2026
https://github.com/samwhaaa/superfoodsmax
A customer demographic & spending trend analysis on the fictional SuperFoodsMax grocery chain
data-analysis data-analytics data-visualization jupyter jupyter-notebook python
Last synced: 20 Apr 2026