Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-07-02 00:07:33 UTC
- JSON Representation
https://github.com/wrighang/shipping-data-analysis
Independent Project: Transit time trends analysis following a major shipping process change.
data-analysis matplotlib numpy pandas python
Last synced: 18 Apr 2026
https://github.com/sunnybibyan/marketing_campaign_analysis_power_bi_dashboard
Campaign Performance Analysis This project analyzes the performance of Spring, Summer, and Fall marketing campaigns, revealing key insights and actionable recommendations.
data-analysis data-visualization dax marketing-campaign powerbi
Last synced: 19 Mar 2026
https://github.com/ipanalytics/vpn-provider-overlap-intelligence
Aggregate VPN provider infrastructure overlap analysis: exact-IP overlap, shared /24 prefixes, hosting dependency, and provider relationship clusters. No raw VPN IP lists.
anti-fraud asn cybersecurity data-analysis fraud-detection infrastructure ip ip-intelligence ip-reputation network-analysis network-intelligence osint proxy-detection threat-intelligence vpn vpn-detection
Last synced: 25 May 2026
https://github.com/affec-ds/dashboard-ventas-vinilos
Dashboard interactivo de ventas para tienda de vinilos. Análisis visual, KPIs clave y filtros dinámicos para decisiones comerciales.
business-intelligence data-analysis data-visualization ipywidgets jupyter-notebook kpis matplotlib music-industry notebook-project python retail-analytics sales-analysis seaborn vinyl-records
Last synced: 30 Apr 2026
https://github.com/agungbudiwirawan/sales-analysis-using-excel-formulas
The objective of this project is to analyze supermarket sales data using formulas in Microsoft Excel.
data-analysis excel excel-formulas microsoft-excel spreadsheet
Last synced: 08 Jan 2026
https://github.com/rkirlew/workoutrecommendationsdataset
This repository contains a synthetic dataset designed for building personalized workout recommendation models. The data is generated for educational and experimental purposes, allowing users to practice machine learning techniques such as classification, k-NN, and clustering, as well as explore fitness-related data analysis.
classification data-analysis dataset k-nearest-neighbours machine-learning
Last synced: 08 May 2026
https://github.com/rani-sikdar/pwc-virtual-internship-powerbi
Comprehensive Power BI dashboards showcasing insights on Call Centre Trends, Customer Retention, and Diversity & Inclusion to drive business impact.
business-analytics business-intelligence data-analysis data-cleaning data-visualization interactive interactive-visualizations powerbi
Last synced: 07 Jan 2026
https://github.com/renanmoliveir/analise_de_dados_bikestore_power-bi_atualizan-o
Projeto de análise de dados do banco de dados Bike Store com Power BI.
data-analysis dax-languague powerbi query
Last synced: 15 Mar 2026
https://github.com/shubham14p3/python-word-cloud
Simple python application to create word cloud.
data data-analysis data-science data-visualization nbextension python-3 upload-file
Last synced: 01 May 2026
https://github.com/zpreisler/modules
Python libraries and modules for processing simulation outputs
data-analysis python scripts tensorflow
Last synced: 13 May 2026
https://github.com/hetuvpatel/research-chatgpt
Research and data analysis project evaluating the social, ethical, and educational impacts of ChatGPT using survey-driven insights and Python-powered data analysis. 📚🤖
data-analysis matplotlib pandas python seaborn
Last synced: 01 May 2026
https://github.com/prajakta1321/kaggle-ai-report-2023
A Report describing the trends in emergence of AI over the years !
data-analysis data-visualization python3
Last synced: 28 Jun 2025
https://github.com/anthonybench/datapeek
Peek summary of datafile in a succinct, opinionated manner.
Last synced: 02 Mar 2026
https://github.com/aritrakar/statpy
A simple package containing some functions for analysing Gaussian and Binomial distributions. Created for the Udacity AWS MLE Foundations 2021 course.
data-analysis python statistics
Last synced: 24 Oct 2025
https://github.com/mayankyadav23/air-bnb-data-analysis
Data analysis and insights from NYC Airbnb listings, focusing on key metrics such as host performance, neighborhood trends, pricing, and customer reviews. Comprehensive documentation of ETL processes and analytical methodologies is provided. Perfect for understanding Airbnb dynamics and decision-making in the NYC market.
advanced-excel business-intelligence data-analysis data-analytics data-visualization power-bi ppt
Last synced: 19 Mar 2026
https://github.com/scarblase/salary-comparison
Submission for the DataCamp Salary Competition(1 level). 🏆
data data-analysis data-science data-visualization engineering python sql structured-data
Last synced: 01 May 2026
https://github.com/fdtomasi/regain-applications
Containers for notebooks and data where REGAIN has been used.
algorithms data-analysis latent-variable-models machine-learning minimization network-inference regain sklearn time-series
Last synced: 16 Apr 2026
https://github.com/santiagortiiz/snowflake-data-warehousing
Snowflake University. Snowflake Data Warehousing. Foundamentals
big-data data-analysis data-warehouse olap snowflake
Last synced: 19 Mar 2026
https://github.com/satvikvirmani/engineering-graduate-salary-analysis
A Data Science/Machine Learning project to analyse and study salary patterns of a engineering graduate in India.
data-analysis data-science jupyter-notebook machine-learning prediction python regression
Last synced: 19 Jun 2026
https://github.com/rayyan9477/household-transactions-analysis-and-clustering
This project involves analyzing household transaction data to gain insights into spending patterns and behaviors. The analysis includes data cleaning, exploratory data analysis (EDA), clustering using K-Means, and visualization of customer segments.
customer-segmentation data-analysis data-cleaning data-science exploratory-data-analysis kmeans-clustering machine-learning
Last synced: 27 Feb 2025
https://github.com/priyanshubiswas-tech/aws-mwaa-elt-airflow-sql-dbt-superset-project
This project was created as part of an assessment for DigitalXC AI. It demonstrates a cloud-based ELT pipeline using AWS MWAA, Airflow, dbt, PostgreSQL, and Superset. The pipeline automates data ingestion from S3, transformation with dbt, and visualization through Superset, following modern data engineering practices on a scalable AWS architecture.
apache-airflow apache-superset aws-s3 dag data-analysis data-engineering-pipeline data-visualization dbt elt-pipeline python rds-postgres
Last synced: 03 Jul 2025
https://github.com/billgewrgoulas/hypothesis-testing-on-37-seasons-of-nba
First assignment for the course Data Mining @CSE.UOI
data-analysis data-science numpy scipy seaborn statistics
Last synced: 01 May 2026
https://github.com/mokeddembillel/student-performance-prediction
Using Machine learning to predict a student final grade
data-analysis data-exploration feature-extraction feature-importance feature-selection linear-regression machine-learning power-bi principal-component-analysis regression spyder student-performance-prediction svm-regressor
Last synced: 15 Mar 2025
https://github.com/shrawans007/hotel_customers_sentiments
Sentiment Analysis for a Hotel Based on Customer's Reviews
2018-2019 data-analysis data-analysis-in-excel data-cleaning data-cleaning-and-preprocessing data-visualization excel excel-pivot-tables github hotel-review-sentiments hotel-service ms-excel ms-excel-data-analytics pivot-tables sentiment-analysis tableau tableau-public text-reviews treemap
Last synced: 22 Mar 2025
https://github.com/riddhis2226/titanic-survival-data-analysis
Titanic-Survival-Data-Analysis : Analyze passenger data from the Titanic to predict survival based on features like age, gender, class, and fare.
data-analysis data-mining data-science data-visualization database jupyter-notebook machine-learning-models machinelearning-python plotlyjs python3
Last synced: 01 May 2026
https://github.com/nafisalawalidris/sales-performance-dashboard
Sales Performance Dashboard: Analyze and visualize sales data using Power BI. Gain insights into trends, customer segments, product performance, and geographic distribution. Make data-driven decisions to optimize sales strategies and maximize revenue.
analytics-revenue dashboard-power-bi data data-analysis intelligence-sales optimization performance sales visualization-business
Last synced: 03 Feb 2026
https://github.com/atymri/linqsimulator
LINQ Simulator is an interactive C# console application designed to let you experiment with LINQ queries in real time.
console csharp data data-analysis linq query sql
Last synced: 23 Oct 2025
https://github.com/steno-aarhus/legliv
Substitution of red meat with legumes and risk of primary liver cancer in UK Biobank participants: A prospective cohort study
cancer-research data-analysis epidemiology nutritional-epidemiology nutritional-science open-science reproducibility reproducible-research rstats ukbiobank
Last synced: 03 Mar 2026
https://github.com/sanam2405/chatinfo
Analysing the WhatsApp Chat with my crush over a 6M period
data-analysis data-visualization python
Last synced: 27 Apr 2026
https://github.com/com-480-data-visualization/project-2023-choo-choo-data-darlings
This repository contains the source code for our data visualization project, an interactive platform designed to explore the intricate Swiss transportation network. Developed by the Choo Choo Data Darlings team at EPFL, the project provides an in-depth view into the vast array of Swiss transportation operations, including trains, buses, and trams.
boats buses data-analysis data-science data-visualisation data-visualization epfl metro public-transport public-transportation switzerland trains trams
Last synced: 01 May 2026
https://github.com/sunnybibyan/call_centre_power_bi_dashboard
Create a dashboard in Power BI to visualize relevant KPIs and metrics that will help the call center manager understand trends.
call-centre-analysis dashboard data-analysis data-visualization powerbi
Last synced: 19 Mar 2026
https://github.com/srinivasrm/graphics_cards_analysis_and_application
In the current project I have extracted graphics card current prices from an authorizer retailer in India and performed analysis
beautifulsoup data-analysis data-science data-visualization etl graphic-card-price-prediction graphics-card graphics-card-analysis heroku-database machine-learning matplotlib pgsql python regression scikit-learn seaborn sql streamlit webapplication
Last synced: 04 Mar 2026
https://github.com/asifdotexe/air-quality-analysis-aqa
AQA is a data-driven project focused on analyzing air quality data sourced from data.gov.in. The project encompasses data preprocessing, analysis, and visualization to gain insights into air pollution levels across various locations in India. By examining six key pollutants, the project aims to raise awareness about the environmental issues
aqi-analysis data-analysis data-preprocessing data-science data-visualization presentation
Last synced: 07 Jun 2026
https://github.com/willie-conway/datavista
DataVista is a comprehensive, production-grade data analysis and machine learning platform that combines real-time data ingestion from live APIs, interactive visualizations, statistical analysis, hypothesis testing, and machine learning model training — all in a unified, professional-grade interface. Built with React and Recharts.
analytics-platform api-integration classification coingecko-api csv-import data-analysis data-cleaning-and-preprocessing data-pipeline data-science data-visualizations etl hypothesis-testing json-export machine-learning-models open-meteo react recharts regression statistics world-bank
Last synced: 30 May 2026
https://github.com/film2549/data-analysis-of-a-simulated-marketing-business-case-using-python-sql-and-power-bi
Data Analysis of a Simulated Marketing Business Case Using Python, SQL and Power BI
chulalongkorn computer-engineering computer-science data-analysis data-visualization database marketing nltk-library pandas powerbi pyodbc python simulation sql sqlserver
Last synced: 01 May 2026
https://github.com/henrylin03/china-gdp
Analysis and visualisation of China GDP data using Python.
data data-analysis data-visualisation dataset kaggle pandas
Last synced: 01 May 2026
https://github.com/ankitml/underscore
collections data-analysis json python3 underscore
Last synced: 14 Apr 2026
https://github.com/quantitext/quantitext
Official repository for QuantiText applications in the .NET ecosystem.
api aspnet-core csharp data-analysis dotnet-core mvc-architecture
Last synced: 30 Mar 2025
https://github.com/sauravbhattacharya001/biobots
A simple tool for checking 3D printer run statistics.
3d-bioprinting aspnet-web-api biobot bioinformatics bioprinting csharp data-analysis dotnet rest-api tissue-engineering
Last synced: 19 May 2026
https://github.com/mohamed3nan/udacity
Udacity Data Analysis Nanodegree Program
data-analysis data-visualization numpy pandas python
Last synced: 10 Apr 2026
https://github.com/odeyiany2/flit-apprenticeship-data-science-projects
This repo contains all my projects for my FLiT Apprenticeship
data-analysis data-science data-visualization machine-learning sql
Last synced: 17 May 2026
https://github.com/umutsevdi/hr-management
HR Management, Analytics and Salary Determination System
analytics data-analysis java java17 postgresql python spring spring-boot vaadin vaadin-flow
Last synced: 10 Apr 2026
https://github.com/faisal-khann/diwali-sales-analysis
The "Diwali Sales Analysis" project aims to analyze the sales data during the Diwali festival period to uncover insights and trends that can help improve marketing strategies and sales performance in the future
csv data-analysis eda jupyter-notebook matplotlib numpy pandas python seaborn
Last synced: 11 Apr 2026
https://github.com/ryanfranklin237/data-visualization-python
A tool that allows you to visualize data from a csv or excel file in a graph or charts form
data-analysis data-science data-visualization matplotlib pandas-dataframe python
Last synced: 11 Jun 2026
https://github.com/rajnish93/jpandas
A lightweight JavaScript library for working with tabular data, inspired by Pandas in Python. Built with TypeScript, it provides an intuitive API for data manipulation and analysis.
data-analysis data-analytics data-manipulation data-science dataframe javascript pandas stream-processing table typescript
Last synced: 11 Jun 2025
https://github.com/ayenpure/stockmeup
This is a class project for 'CIS 610 : Data Science' where I try and validate Stock Market recommendations.
data-analysis data-mining data-science java mapreduce mapreduce-java
Last synced: 24 Oct 2025
https://github.com/solrikk/pictrace-web
PicTraceV2 is a highly efficient image matching platform that leverages computer vision using OpenCV, deep learning with TensorFlow and the ResNet50 model, asynchronous processing with aiohttp, and Selenium for browser automation. PicTraceV2 allows users to upload images directly or provide URLs, quickly scanning a vast database to find image
automation computer-vision data-analysis data-extraction deep-learning image-processing image-search machine-learning natural-language-processing opencv openpyxl pandas python selenium tensorflow web-scraping yandex yandex-api
Last synced: 12 Apr 2026
https://github.com/cr-mao/machine-learning
机器学习笔记
data-analysis data-handling machine-learning math numpy pandas
Last synced: 12 Nov 2025
https://github.com/melogabriel/nubank-expenses-analysis
This project consolidates monthly credit card statement data from Nubank into a single CSV file using Python, enabling data visualization through a Google Sheets dashboard in Looker Studio.
data-analysis data-visualization googlesheets lookerstudio pandas python
Last synced: 02 May 2026
https://github.com/walkerdustin/vergleich-von-messmethoden-fuer-punktwolken
Bei der Vermessung eines physischen Raumes ist das Ergebnis eine Punktwolke. Diese Punktwolke beschreibt dann ausgewählte Punkte im Raum, zum Beispiel auf den Wänden und der Decke. Wenn diese Punkte in zwei seperaten Messungen gemessen werden, vielleicht sogar von unterschiedlichen Geräten, soll hinterher herausgefunden werden wie genau diese Punktwolken übereinstimmen. Dafür gibt es zwei grundsätzlich verschiedene Methoden. Diese sollen hier verglichen werden.
3d-models accuracy-metrics data-analysis data-visualization kaggle measure-distance numpy point-cloud pointcloudprocessing punkte python science-research simulation statistics
Last synced: 11 Apr 2026
https://github.com/gesiscss/wikipedia-language-olga-master
Measuring Gender Inequalities of German Professions on Wikipedia
bias crowdflower data-analysis data-science gender images python statistics wikipedia
Last synced: 10 Apr 2026
https://github.com/alphonsg/bimana
Package for performing automated bio-image analysis tasks.
bioimage-analysis bioinformatics data-analysis deep-learning edge-detection image-analysis image-processing
Last synced: 25 Oct 2025
https://github.com/samruddhi3012/customer-behavior-analysis
Hello there! This repo contains python project based on E-Commerce Customer Behavior analysis.
customer-segmentation customerbehavior data-analysis ecommerce python
Last synced: 02 May 2026
https://github.com/gaurav-van/house_price_predictor_streamlit_web_app
Data Science Project to Predict House Prices in Bangalore using the concept of Regression. This Repository is used for Deployment of the Project
data-analysis data-science exploratory-data-analysis machine-learning prediction python regression streamlit
Last synced: 02 May 2026
https://github.com/wewoc/garmin_local_archive
Secure, local-first archive for Garmin Connect health data (HRV, sleep, activities). Private & offline. Structured for local analysis (Excel, HTML-Dashboard, Ollama, Open WebUI, AnythingLLM). Your data stays on your machine.
backup dashboard data-analysis fitness-tracker garmin garmin-connect ollama open-webui privacy privacy-enhancing-technologies privacy-first privacy-focused python self-hosted
Last synced: 16 Apr 2026
https://github.com/msthamizh/phonepe-pulse-data-visualization-and-exploration
Developing a Streamlit application that allows users to explore and analyze transaction data from the PhonePe Pulse dataset. The project aims to provide insights into digital payment trends across India.
data-analysis data-visualization dataframe mysql pandas plotly python streamlit
Last synced: 02 May 2026
https://github.com/madhuresh2011/career-aspiration-of-gen-z-project-using-excel
Career Aspiration Of Gen-Z ,To explore the industries, roles, and pathways using Excel .
dashboards data-analysis data-visualization datacleaning dataset designing excel functional-dashboard gen-z kpi pivot-charts pivot-tables project-using-excel
Last synced: 27 Jan 2026
https://github.com/fybex/chatgpt-conversations-analysis
Analysis of 89,000 ChatGPT conversations to understand interaction patterns and response behaviors.
chatgpt conversation-analysis data-analysis data-visualization language-analysis prompt-patterns sentiment-analysis
Last synced: 02 May 2026
https://github.com/gholamrezadar/favourite-youtube-channels
this program goes through your youtube watch history and sorts channels based how many of their videos you have watched!
data-analysis data-visualization python
Last synced: 16 Jan 2026
https://github.com/mkk-1817/adhd-prediction
This project focuses on leveraging machine learning techniques to predict Attention-Deficit/Hyperactivity Disorder (ADHD) in children. Accurate and early diagnosis is crucial for effective intervention and support.
adhd data-analysis data-science jupyter-notebook machine-learning machine-learning-algorithms prediction python
Last synced: 09 May 2026
https://github.com/mg380/ibm-applied-data-science-capstone
This Capstone is the 10th (final) course in IBM Data Science Professional Certificate specialization, and it actually summarises in the form of project all materials that have been learned during this specialization
capstone data data-analysis data-science datascience ibm machine-learning plotly python scikit-learn sql
Last synced: 05 Mar 2026
https://github.com/priyanshubiswas-tech/pwc-power-bi-task-1-2
Power BI dashboards analyzing Phonenow's call center performance and customer retention. Task 1 focuses on KPIs like satisfaction rating, call count, and agent efficiency. Task 2 analyzes retention trends and customer behavior to enhance loyalty. Built using Power BI, DAX, and Excel.
dashboard data data-analysis dax-measures excel powerbi powerbidashboard
Last synced: 23 Jan 2026
https://github.com/gustavo-zamai/shop-sell-data-analisys
Analysis of selling
data-analysis pandas plotly python3
Last synced: 16 May 2026
https://github.com/madhuresh2011/telco-customer-churn-analysis-using-python
The analysis primarily investigates factors influencing customer churn, particularly focusing on payment methods and contract types.
csv data-analysis matplotlib numpy pandas pyhton seaborn vizualisation
Last synced: 02 May 2026
https://github.com/vijayjoshi16/credit-card-fraud-detection-using-ml-in-python
Credit Card Fraud Detection Using ML in Python
data-analysis jupyter-notebook logistic-regression machine-learning matplotlib-pyplot numpy pandas python regression seaborn
Last synced: 17 Apr 2026
https://github.com/supersjgk/data-analysis-dns-over-https
A Data Analytics + ML project to classify Benign and Malicious DNS-over-HTTPS traffic
classification-model data-analysis data-analysis-python data-analytics datamining decision-trees dns dns-over-https doh gradient-boosting knn machine-learning random-forest
Last synced: 19 Mar 2025
https://github.com/nikita-data/unit_economics_projects
unit economics & cohort analysis projects
cac churn-rate conversion create-function data-analysis data-visualization eda hypothesis-testing ltv math matplotlib numpy python retention-rate roi scipy seaborn segmentation statistics unit-economics
Last synced: 06 Jan 2026
https://github.com/carusel02/sequential-data-processing-and-analysis
Sequential data processing and analysis using linked-list in C
data-analysis data-processing linked-list
Last synced: 26 Oct 2025
https://github.com/rudra-g-23/find-my-joint
A utility to find potential join keys (matching columns) across multiple DataFrames.
data-analysis data-visualization join network-graph pandas pandas-dataframe
Last synced: 24 Jun 2026
https://github.com/datasciencelovers/ai-financial-market-data-analysis
Analyse Financial Market Data of AI companies with Python
ai artificial-intelligence big-data-analytics chatgpt data-analysis data-analytics data-science data-visualization financial-analysis gemini google llama machine-learning market-data-analysis matplotlib-python meta openai pandas-python python
Last synced: 05 May 2026
https://github.com/azmainadel/twitter-data-neo4j
Playing with graph database on a large dataset of twitter data.
data-analysis data-visualization neo4j-database snap
Last synced: 06 Apr 2025
https://github.com/shuklayash02/data_analysis_using_r
Covid19 analysis and cleaning of data where the death age and deaths of specific gender is cleaned and analysed
analysis cleaning-data data-analysis data-visualization rprogramming
Last synced: 09 Oct 2025
https://github.com/carmoreno/analisisaccidentalidadbogota
Data Analysis about traffic accidents at Bogotá, Colombia.
data-analysis data-science jupyer-notebook matplotlib numpy pandas scikit-learn
Last synced: 17 Apr 2026
https://github.com/jmssnr/shuffle-kit
shuffle-kit: model and analyze playing card shuffles in Python
data-analysis playing-cards python shuffle statistics
Last synced: 19 Jun 2026
https://github.com/reza-saeedi-coding/netflix-data-analysis
A complete end-to-end Netflix dataset analysis using Python, SQL, and Matplotlib. Explores genres, content ratings, and trends using exploratory data analysis and visualizations.
data-analysis data-cleaning eda matplotlib netflix pandas portfolio-project python sql sqlite
Last synced: 17 Apr 2026
https://github.com/sing-group/bew
Public repository for Biofilmfs Experiment Workbench (BEW).
aibench data-analysis data-management java jfreechart workbench
Last synced: 03 Jul 2025
https://github.com/gauranshgoel123/predictive-demand-analysis
Demand Forecasting Project A web application for predicting future demand for part numbers based on historical data. Built with React for the frontend and FastAPI with Python for the backend, this application visualizes demand trends and allows users to input additional data for improved accuracy. In render analyzer is frontend analysis is backend
chartjs data-analysis data-science data-visualization dataset deployment full-stack machine-learning numpy pandas predictive-analysis prophet-model python reactjs render
Last synced: 13 Apr 2026
https://github.com/theairbend3r/mice-memory-response
Effect of memory on current response in mice using methods from computational neuroscience and machine learning.
computational-neuroscience data-analysis data-science machine-learning neuroscience python
Last synced: 09 Jun 2026
https://github.com/sandk21/detection_faux_billets
Algorithme de détection de faux billets selon leurs dimensions géométriques et application web pour générer les prédictions
data-analysis data-science data-visualization machine-learning pandas python scipy sklearn streamlit
Last synced: 03 Apr 2026
https://github.com/geetisha/sales_insight_data_analysis_using_sql_and_tableau-etl-
Sales Insights - A Data Analysis Project performed on Tableau & SQL Topics
dashboard data-analysis data-visualization mysql project sales-analysis sql tableau
Last synced: 07 Jan 2026
https://github.com/gher-uliege/bluecloud-plankton
Spatial interpolation of plankton data using a neural network
data data-analysis data-visualization neural-network oceanography
Last synced: 30 Mar 2025
https://github.com/nomadsdev/sys-moninsight
System Monitoring and Analysis Tool is a utility for real-time performance tracking. It logs CPU, memory, and disk usage, provides visual graphs, and offers performance recommendations. Perfect for optimizing system efficiency.
automation cpu-usage data-analysis data-visualization disk-usage matplotlib memory-usage performance-analysis performance-optimization psutil python real-time-monitoring resource-management sys-moninsight system-metrics
Last synced: 19 Jun 2026
https://github.com/foggy-projects/foggy-data-mcp-bridge
MCP Data Bridge for Java. Enabling safe Text-to-Query via a semantic layer, making enterprise data accessible to AI Agents.
agent data-analysis java llm mcp semantic-layer spring-boot text-to-sql
Last synced: 16 Mar 2026
https://github.com/sarmadahmad8/ml-and-deeplearning-projects-for-beginners
Beginner ML/DL projects spanning core libraries and problem sets.
beginner-friendly data-analysis data-science deep-learning fastai machine-learning opencv pytorch scikit-learn transformer
Last synced: 17 Apr 2026
https://github.com/0xjeremy/me-18-final
Data collection and Analysis tools for IMUs
data-analysis imu raspberry-pi
Last synced: 03 May 2026
https://github.com/mardavsj/weather-prediction
Weather prediction model which mainly focuses on visualization.
data-analysis data-visualization matplotlib numpy pandas pandas-dataframe
Last synced: 10 Apr 2026
https://github.com/sustentarea/gs-data-analysis-report-3
📓 Exploring potential associations between childhood undernutrition and the Standardized Precipitation Evapotranspiration Index (SPEI) in Brazilian municipalities (2008–2019)
brazil climate-change data-analysis data-science food-systems global-syndemic ibge malnutrition nutrition obesity r rstats sisvan spei sustainable-eating wasting worldclim
Last synced: 27 Oct 2025
https://github.com/shadowk29/cusumtools
An eclectic collection of python scripts I have found to be useful in processing nanopore data
data-analysis data-visualization time-series-analysis
Last synced: 16 Mar 2026
https://github.com/shrawans007/google_cyclistic_2023
Google Data Analytics Capstone Case Study (SQL and Tableau)
big-query bigquery coursera-assignment cyclistic cyclistic-bike-share-analysis-case-study cyclistic-bikshare data-analysis data-analysis-project data-analytics data-cleaning data-combination data-exploration data-science google-data-analytics sql tableau tableau-dashboard tableau-public
Last synced: 19 Jun 2026
https://github.com/chouaib-629/customersegmentation
Hadoop-based Customer Segmentation project using the Online Retail Dataset. Implements MapReduce for processing and Python for preprocessing to uncover customer purchasing patterns for targeted marketing.
big-data customer-segmentation data-analysis data-science distributed-computing hadoop hadoop-mapreduce java mapreduce marketing-analytics python
Last synced: 04 May 2026
https://github.com/pradipece/weather_forecast_data_analysis
Using decision trees and random forest algorithms to solve real-world data analysis. "sklearn_decision_trees_random_forests"
data-analysis data-science data-visualization git github python python3
Last synced: 19 Apr 2026
https://github.com/shelton-beep/predicting-gpa-using-lifestyle-factors
Predicting student GPA using lifestyle factors like study habits, sleep, and stress levels. A machine learning model built to help students and educators understand the impact of lifestyle choices on academic performance.
data-analysis data-preprocessing data-science feature-engineering gpa-prediction machine-learning model-interpretability predictive-modeling python regression-analysis student-performance xgboost
Last synced: 04 May 2026
https://github.com/elkronos/stat_py
Statistics functions for python
assumption-check data-analysis data-visualization python regression statistical-analysis statistical-inference statistical-models statistical-tests statistics
Last synced: 10 Oct 2025
https://github.com/yonatanadam/film-success-prediction
Analyzing Hollywood movie success based on genre, target audience, and runtime using machine learning
data-analysis ipynb machine-learning
Last synced: 25 Jan 2026
https://github.com/ahmad-ali-rafique/weather-prediction-fcnn
This project demonstrates a complete pipeline for weather prediction using a Fully Connected Neural Network (FCNN). The project is implemented in Python using Jupyter Notebook, and it covers data loading, preprocessing, model training, and performance evaluation.
ai artificial-intelligence data-analysis data-science deep-learning deep-neural-networks fully-connected-network machine-learning machine-learning-algorithms weather-information
Last synced: 28 Aug 2025
https://github.com/sevdanurgenc/python-for-data-science-lecture-notes
In this repo, I have the course contents of Python for Data Science training, which will be given to Siemens by the cooperation of Academy Peak Information Technologies Training and Consultancy between 28 June - 1 July 2022.
data-analysis data-mining data-modeling data-science data-structure data-visualization matplotlib-tutorial numpy-tutorial pandas-tutorial
Last synced: 23 Mar 2025
https://github.com/manvendra747/customer-segmentation
Customer segmentation using Python and PowerBI
customer-segmentation dashboard data-analysis data-science data-visualization powerbi python rfm-analysis
Last synced: 28 Apr 2025
https://github.com/nafisalawalidris/tools-for-data-science
It covers popular languages (Python, R, SQL) and libraries (NumPy, Pandas) used in the field. The author shares their objectives of teaching data analysis, web development, and critical thinking skills. The repository also includes code examples, explanations of arithmetic expressions, and contact information for the author.
arithmetic-expressions data-analysis data-science data-visualization languages libraries matplotlib numpy pandas programming python r sql tools web-development
Last synced: 11 Apr 2026
https://github.com/smusab9152/pokemon_data_analysis
This repo that explores and analyzes a dataset of Pokémon attributes. The analysis includes data cleaning, exploratory data analysis (EDA), and visualizations .
analytics data-analysis data-visualization exploratory-data-analysis jupyter-notebook matplotlib numpy pandas pokemon python seaborn statistical-analysis
Last synced: 02 May 2026
https://github.com/wildanmujjahid29/books-sales-analytics-python
Books Sales Analytics With Pyhton
data data-analysis data-science data-visualization
Last synced: 12 Jun 2026