Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-23 00:07:29 UTC
https://github.com/rodrigojunqueiradev/curso-python-3-do-basico-ao-avancado
Python 3 course from basics to advanced, with real projects
data data-analysis data-science python python-3 python-library python-script python3
Last synced: 22 May 2026
https://github.com/onome-joseph/flexisaf
Generative AI & Data Science
data-analysis data-science machine-learning
Last synced: 16 Sep 2025
https://github.com/tushar2704/sql-query
This repository is designed to help you strengthen your SQL query skills by providing a collection of common and interview-based SQL queries for practice.
artificial-intelligence data-analysis data-engineering data-science database database-management database-schema relational-databases sql sql-database sql-query tushar2704
Last synced: 04 Nov 2025
https://github.com/tushar2704/loan-limits-by-country
This project aims to leverage a diverse dataset encompassing economic indicators, demographic factors, and credit history to establish a predictive model. By establishing appropriate loan limits, financial institutions can enhance risk management, ensure responsible lending, and promote financial inclusivity.
artificial-intelligence data-analysis data-science loan project tushar2704
Last synced: 30 Oct 2025
https://github.com/omr5221/sqlexamples
SQL Example Data Corrections
big-data data-analysis data-correction data-modeling pl-sql sql
Last synced: 16 Feb 2026
https://github.com/lawwrites/uncovering_fintech_user_insights
Linear Regression and K-Means to obtain user behavior insights for a fintech company
data-analysis data-science kmeans-clustering linear-regression python unsupervised-machine-learning
Last synced: 22 May 2026
https://github.com/thesfinox/fit-the-data
Data analysis using Wolfram Mathematica
analysis data data-analysis lab mathematica wolfram wolfram-mathematica
Last synced: 24 Jan 2026
https://github.com/shreshthvashisht/xyz-ads-airing-report_analysis
Ad data analysis using Advanced Excel
ad-airing-analysis advanced-excel data-analysis data-visualization pivot-tables
Last synced: 18 Feb 2026
https://github.com/anjalikumari021/sports_data_analysis_using_excel
Analyzed Sports data and prepared advanced dashboard using MS Excel.
data-analysis data-cleaning excel-dashboard ms-excel pivot-tables reporting
Last synced: 08 Mar 2026
https://github.com/sayamalt/fake-news-classification-using-fine-tuned-bert
Successfully developed a text classification model to predict whether a given news text is fake or not by fine-tuning a pretrained BERT transformer model imported from Hugging Face.
bert-embeddings bert-model data-analysis data-visualization deep-learning fine-tuning-bert model-evaluation model-training-and-evaluation text-classification text-preprocessing text-tokenization tokenizer-nlp wordcloud-visualization
Last synced: 05 Apr 2025
https://github.com/mmfava/significados-aulas-biologia-quasiexp-2019
Repository of the analyses carried out for the paper "Construção de significados em aulas práticas de laboratório de biologia: uma avaliação por delineamento quase-experimental".
Last synced: 28 Jun 2025
https://github.com/tharun2806/end-to-end-internship-data-analysis
Internship Dataset Analysis is an end-to-end project analyzing an internship dataset obtained from Kaggle. The project involves cleaning and preprocessing the data using Excel and SQL, followed by exploratory data analysis (EDA). The analysis includes statistical, sectoral and geospatial insights, visualized through an interactive Tableau dashboard
bigquery data-analysis data-cleaning data-preprocessing data-visualization exploratory-data-analysis geospatial-analysis microsoft-excel reporting sectoral-analysis statistical-analysis tableau-public
Last synced: 01 Apr 2025
https://github.com/bala-1409/milk-production-time-series-forecasting-datascience-project
This project uses time series forecasting to predict future milk production. The data used in this project is monthly milk production data from January 1962 to December 1975. The ARIMA (autoregressive integrated moving average) model is used to forecast milk production. The model is evaluated using various metrics.
acf adf arima-model data-analysis data-science data-visualization datapreprocessing eda exploratory-data-analysis forecasting machine-learning-algorithms pacf python python3 sarimax-model seasonality seasonality-analysis time-series time-series-forecasting trends
Last synced: 27 Apr 2026
https://github.com/bala-1409/sales-forecasting-datascience-project
Develop a data science project using historical sales data to build a regression model that accurately predicts future sales. Preprocess the dataset, conduct exploratory analysis, select relevant features, and employ regression algorithms for model development. Evaluate model performance, optimize hyperparameters, and provide actionable insights.
data data-analysis data-science data-visualization datacleaning exploratory-data-analysis machine-learning-algorithms modelfitting prediction predictive-analytics predictive-modeling python3 regression-models salesforecast supervised-learning
Last synced: 26 Apr 2026
https://github.com/bala-1409/loan-classification-data-science-projects
This project uses machine learning algorithms to predict the classification of loan status. The dataset is loaded and transformed using SQL to obtain a proper dataset with valid information.
data data-analysis datacleaning datascience datavisualization exploratory-data-analysis loan machine-learning machine-learning-algorithms modelfitting sql supervised-learning visualization
Last synced: 22 Mar 2025
https://github.com/bala-1409/loan-clustering-datascience-projects
This project uses machine learning to cluster loans together based on their similarities. The project uses a dataset of loan applications that includes information about the loan amount and balance. The project then uses a clustering algorithm to group the loans together based on their similarities.
clustering clustering-algorithm data-analysis data-science data-visualization kmeans-clustering machine-learning machine-learning-algorithms sql unsupervised-learning unsupervised-machine-learning
Last synced: 22 Mar 2025
https://github.com/bala-1409/rafik-s-kitchen-data-analysis
This project analyzes the sales and expenses data of a famous fast-food restaurant. It focuses on gaining insights that will boost future sales and on business strategies to improve profit margins. Tools used: SQL, Python, Power BI, and MS Office tools.
business-analytics business-intelligence data-analysis data-analytics data-visualization eda exploratory-data-analysis ms-office powerbi-report powerpoint-presentations python sql-server
Last synced: 06 May 2026
https://github.com/aleskandro/r-hadoop-madreduce-examples
Many examples of using R with Hadoop for MapReduce, with and without libraries such as rhadoop/rhipe - DIEEI@unict.it - Advanced Programming Languages
data-analysis hadoop mapreduce r
Last synced: 04 Nov 2025
https://github.com/beolawork-art/novabank-churn-analysis
NovaBank has noticed that customers are closing accounts or going inactive, and they want to understand why.
data-analysis data-science-projects data-visualization eda machine-learning numpy pandas python scikit-learn sql
Last synced: 08 Apr 2026
https://github.com/smohanta23/ev-trendanalytics-24
This Tableau project analyzes EV adoption trends using data up to May 2024. Visualizations cover growth, geography, market share, CAFV eligibility, and consumer preferences, supporting data-driven decisions with detailed drill-downs. The data is meticulously cleaned, offering stakeholders valuable insights into EV market dynamics and future trends.
business-intelligence data-analysis data-engineering electric-vehicles feature-engineering kpianalysis predictive-analytics tableau trendanalysis
Last synced: 27 Mar 2026
https://github.com/vladstudennikov/diabetes-prediction-app
ML-powered web app built with Laravel and Vue.js to predict diabetes risk based on users' daily habits and behavior
cypress data-analysis diabetes-prediction fastapi inertiajs laravel matplotlib medicine ml pandas php scikit-learn seaborn vuejs
Last synced: 08 Apr 2026
https://github.com/wb-az/sql-for-data-analysis-udacity
This repository contains SQL queries for the SQL for Data Analysis course offered by Udacity. The queries include commands to define, select, manipulate, control access to, aggregate, and join data and data tables.
aggregation data-analysis data-cleaning erd joints postgresql sql subqueries-and-joins window-functions-in-sql
Last synced: 23 May 2026
https://github.com/yousef-jaber-abdelaziz/electrical-vehicles-data-analysis-project
A full-stack data engineering project, from getting the data to data warehousing and then a dashboard using Power BI
data-analysis data-engineering data-modeling data-visualization data-warehouse data-warehousing fabric microsoft-azure microsoft-fabric-data-engineer powerbi sql-server
Last synced: 23 Jun 2026
https://github.com/esr-style/stylegrid
A free alternative to AG Grid, built by me for a personal use case.
aggrid data-analysis grid pivot-chart pivot-grid table
Last synced: 16 Sep 2025
https://github.com/alexzalox/us_stocks
Read US stock tickers and their costs from a CSV and display the formatted DataFrame in the terminal using pandas.
data-analysis finance pandas python python3 stocks yfinance
Last synced: 15 May 2026
https://github.com/faris771/identify_customer_segments
This project is part of the Palestine Launchpad by Spark, and Udacity with Google. It uses unsupervised learning to identify customer segments for a mail-order company in Germany. The goal is to direct marketing campaigns towards the most promising audiences. The data is provided by Bertelsmann Arvato Analytics.
clustering data-analysis decomposition feature-engineering machine-learning unsupervised-learning
Last synced: 08 Aug 2025
https://github.com/kingflow-23/association-matching
Research and structuring of funding opportunities for associations
association data-analysis data-engineering excel fondation pyqt5 python webscraping
Last synced: 07 Apr 2025
https://github.com/hatamiarash7/ir-system
IR System for Reuters DB
data-analysis data-mining ir python
Last synced: 29 Mar 2025
https://github.com/mindlessmuse666/eda-explorer
A Python tool for exploratory data analysis (EDA) and visualization that supports loading CSV and JSON data, with a modular OOP architecture. Practical assignment on the topic "Discovering and visualizing data to understand its nature" for the course "MDK 13.01: Fundamentals of Applying Artificial Intelligence Methods in Programming".
csv-visualization data-analysis data-science data-visualization exploratory-data-analysis json-visualization matplotlib oop pandas python seaborn
Last synced: 13 Apr 2026
https://github.com/juliargubolin/sql-for-data-analysis
This repository was created to hold all the documents, files, and notes I took while learning SQL and data analysis through "SQL for Data Analysis: Advanced Techniques for Transforming Data Into Insights" by Cathy Tanimura (O'Reilly).
advanced data-analysis data-science sql
Last synced: 11 Jan 2026
https://github.com/dimits-ts/visualization-assignments
Visualizing and analyzing results from the PISA-2018 competitions with regard to Greek performance and the gender gap.
data-analysis data-visualization interactive-graphs presentation-slides r-language tableau
Last synced: 06 Nov 2025
https://github.com/noor188/preswald-data-app
A data app to visualize and manipulate the graduate admission dataset
data-analysis data-visualization open-source
Last synced: 04 Jul 2025
https://github.com/kefilweditse/awesome-matchem-datasets
Awesome-matchem-datasets is a curated collection of high-quality datasets for machine learning and data analysis in the field of chemistry. This repository includes various datasets, ranging from molecular structures to experimental results, suitable for both research and educational purposes.
awesome awesome-dataset awesome-dataset-collection awesome-match-data awesome-matchem data-analysis data-matching dataset dataset-collection dataset-research dataset-samples match match-data match-dataset-analysis match-examples
Last synced: 07 Apr 2025
https://github.com/tejaswirupa/data-analysis-of-departure-delays-at-united-airlines
Explored how weather and time factors influence delays in 58,000+ UA flights. Used permutation testing and visual analytics to show how temperature, visibility, and time of day affect departure punctuality.
Last synced: 25 Jan 2026
https://github.com/leosimoes/datascienceacademy-python-analisededados
Activities from the DataScienceAcademy course "Análise de Dados com Linguagem Python" (Data Analysis with the Python Language).
data-analysis data-science jupyter-notebook python sql
Last synced: 29 Apr 2026
https://github.com/shz-code/diwali_sales_data_analysis
Customer Product Purchase Behavior Analysis
behavior-analysis data-analysis matplotlib ml sales seaborn
Last synced: 14 Mar 2025
https://github.com/s1m0n38/cr-analysis
An exercise in data collection/analysis
clash-royale data-analysis data-collection data-science
Last synced: 08 Jul 2025
https://github.com/muneeb706/r-programming
R-Programming examples for data analysis.
Last synced: 26 Mar 2025
https://github.com/muthukumar0908/cardekho_used_car_price_prediction
The project aims to build a machine learning model that lets users find current valuations for used cars.
data-analysis data-visualization datacleaning eda machine-learning python streamlit
Last synced: 30 Mar 2025
https://github.com/alikhalajii/text-classification-life-sciences
Text classification of Life Science apps
data-analysis data-science datasets feature-importance jupyter-notebook pandas sbert scikit-learn word2vec
Last synced: 05 May 2026
https://github.com/sivkri/shiny-scatter-plot-app
This repository contains a Shiny app that allows users to create interactive scatter plots by selecting the X and Y axes and customizing the point color. The app utilizes the shiny package in R to provide a user-friendly interface and the ggplot2 package for creating visually appealing plots.
data-analysis data-visualization ggplot2 interactive-web-application r rprogramming scatter-plot shiny
Last synced: 22 Mar 2025
https://github.com/sivkri/rnaseq-analysis-junctionseq-qorts
This repository provides scripts for RNA-Seq data analysis using JunctionSeq and QoRTs, enabling quality control, differential splicing analysis, and generation of browser tracks.
bioinformatics data-analysis differential-splicing genomics junctionseq qorts quality-control rna-seq rna-seq-analysis splice-junctions splice-variants spliced-alignment transcriptomics
Last synced: 22 Mar 2025
https://github.com/habiburrahman-mu/exploratory-data-analysis
Methods to see if certain characteristics or features can be used to predict.
data-analysis data-mining data-science data-visualization
Last synced: 20 Jan 2026
https://github.com/rohitha-tata/bike-sales
This project focuses on data cleaning, transformation, and dashboard creation using a bike buyers dataset. It includes Pivot Tables, slicers, visualizations, and statistical insights to analyze trends based on income, age, occupation, and other key factors. Insights help understand customer behavior, purchasing patterns, and decision-making trends.
data-analysis data-cleaning excel-dashboards interactive-slicers pivot-charts pivot-tables
Last synced: 08 Mar 2026
https://github.com/hasnathjami/data-analysis-of-covid-19
An Oracle PL/SQL-based project on COVID-19 data analysis. It is my CSE 4.1 project of Distributive Database Management System LAB.
data-analysis naive-bayes-classifier oracle-database probability-statistics sqlplus
Last synced: 08 Mar 2026
https://github.com/nimomach/amazon-sales-data
This is a small dataset containing Amazon sales data analysis for a few regions.
dashboards data data-analysis data-visualization
Last synced: 08 Mar 2026
https://github.com/mjshubham21/ny_yellow_taxi_python_da_project
A data analysis project on New York Yellow Taxi data (February 2025) using Python and its analytics libraries, such as NumPy, Matplotlib, pandas, and Seaborn.
data-analysis jupyter-notebook matplotlib numpy pandas python seaborn
Last synced: 04 May 2026
https://github.com/cosmoduende/r-earthquakes
Analysis and visualization of seismic activity data in Mexico with R. How to analyze and visualize Mexico's seismic history with data from the SSN (Servicio Sismológico Nacional)
data-analysis data-analytics data-science dataviz earthquakes r-code r-programming r-studio rstudio sismo sismologia sismos ssn ssnmx terremoto terremotos
Last synced: 24 Jan 2026
https://github.com/jibbs1703/airline-data-analysis
This repository contains an exploratory data analysis of flight delays and cancellations for airline flights in the United States in 2015. Based on this EDA, insights and solutions are suggested for business owners and airport managers.
business-insights business-solution data-analysis data-visualization
Last synced: 20 Mar 2025
https://github.com/tolumie/web-scraping-rest-api-stock-data-operations
Web Scraping, REST API & Stock Data Operations is a data-driven project that explores the power of web scraping, API interactions, and stock market analysis using Python. From extracting stock data and public records to analyzing real-world financial trends, this repository is a one-stop resource for data enthusiasts, traders, and analysts.
api-integration data-analysis data-cleaning data-visualization financial-data python rest-api sql-databases stock-data web-scraping
Last synced: 19 May 2026
https://github.com/mrham17/spotify_streaming_analytics
Project is stable & documentation will be completed soon. Thank you for your understanding and patience.
big-data-analytics data-analysis google-colab music-data r-programming spotify streaming-analytics
Last synced: 24 Jul 2025
https://github.com/bris0yzbekaye/json-to-excel-converter
This repository provides a tool to convert JSON data to Excel format (.xlsx). It allows you to easily transform structured JSON data into a well-organized spreadsheet for better analysis and visualization.
automation-script automation-tools data-analysis data-converter data-export data-formatting data-tools data-visualization excel excel-automation excel-converter excel-tools json json-exporter json-parser json-processing json-to-csv json-to-excel programming-tools spreadsheet-tools
Last synced: 25 Jul 2025
https://github.com/kavicastelo/colab
This repository includes practical Jupyter notebooks for data analysis and model training using a soil fertilizer dataset. (use 4th edition)
data-analysis jupyter-notebook python
Last synced: 26 Mar 2025
https://github.com/leandrocollares/street-cherry-trees-in-vancouver
Street cherry trees in Vancouver: an exploratory data analysis
data-analysis data-visualization folium pandas plotly-express
Last synced: 17 Sep 2025
https://github.com/samuelsoaress/predict-future-sales
Machine Learning applied to sales forecast
data-analysis data-mining data-science data-visualization forecasting-models
Last synced: 24 Jul 2025
https://github.com/netesf13d/expt-sequence-analysis
Data processing, analysis and visualization package for atomic physics experiments in the single-atom regime.
cold-atoms data-analysis data-visualization optical-tweezers
Last synced: 24 Jul 2025
https://github.com/matte34/auto-insurance-analysis
Conducted a comprehensive exploratory data analysis (EDA) on an auto insurance dataset found on Kaggle. I performed a permutation test and generated data visualizations.
data-analysis data-visualization permutation-test python3 scipy seaborn
Last synced: 06 May 2026
https://github.com/muhammadhussain-2009/stock-price-prediction-using-stacked-lstm
Predicting Google Stock Prices using Deep Learning Techniques.
data-analysis data-science data-visualization deep-learning jupyter-notebook keras lstm-neural-networks machine-learning-algorithms python stock-data stock-price-prediction tensorflow
Last synced: 16 Apr 2026
https://github.com/codeonthespectrum/web-scrap
This project scrapes Wikipedia to obtain data on the most populous municipalities in the state of Rio de Janeiro.
data-analysis data-visualization webscraping
Last synced: 16 Feb 2026
https://github.com/swethajoseph/netflix-powerbi-interactive-dashboard
Created an interactive Netflix Power BI dashboard to analyze and visualize Netflix's content library, uncovering trends in content type, genre distribution, and global reach
data-analysis data-visualization interactive-visualizations powerbi powerbi-dashboards powerbi-report
Last synced: 03 Jan 2026
https://github.com/tomy-jr98/air-quality-sql-project
Air pollution analysis using BigQuery and Tableau, with data cleaning, aggregation, and visualization.
air-pollution bigquery data-analysis portfolio sql tableau
Last synced: 25 Jul 2025
https://github.com/dimits-ts/sport-repression-repl-study
A replication study for the recent paper "International Sports Events and Repression in Autocracies: Evidence from the 1978 FIFA World Cup".
data-analysis jupyter regression-models replication-study statistical-analysis
Last synced: 25 Jul 2025
https://github.com/cescedes/medical-insurance-costs-with-python
Investigate how different factors affect the prediction of medical insurance costs by practicing many Python concepts.
codecademy data-analysis python python-dictionaries python-functions python-lists python-loops python-strings
Last synced: 19 May 2026
https://github.com/maazie-khan/olympics-data-enigeering
Worked with Azure Data Factory, Databricks, Data Lake Storage, and Synapse Analytics to build an ETL pipeline for processing and analyzing Olympic Games data from Kaggle.
azure big-data data-analysis dataengineering devops pipeline
Last synced: 13 May 2026
https://github.com/andersoncrs/clasificacion-propina-restaurante
This report presents, in a clear and practical way, a complete analysis of the well-known tips dataset, showing step by step how to turn raw information into useful predictive models.
clasification data-analysis data-visualization tips
Last synced: 26 Jul 2025
https://github.com/jpcadena/tweets-classification-frontend
Frontend project for the Classification Tweets project
api axios css data-analysis data-science data-visualization ecuador eslint frontend html insecurity json machine-learning node npm openapi-typescript-generator react tweets-classification twitter typescript
Last synced: 06 Apr 2026
https://github.com/dadvaiahpavan/ai-data-scientist-
AI-powered tool for dataset analysis, featuring data preprocessing, classification, regression, anomaly detection, and text analysis. Built with scikit-learn, pandas, and Plotly for visualization. Includes an interactive Streamlit web interface for real-time data analysis.
ai anomaly-detection classification data-analysis data-science machine-learning panda plotu regression scikit-learn sentiment-analysis streamlit
Last synced: 03 May 2026
https://github.com/riyajain255/customer-segmentation-for-e-commerce
This project analyzes online retail data to segment customers using K-Means clustering and build classification models to predict those segments based on purchasing behavior.
customer-segmentation data-analysis kmeans-clustering logistic-regression machine-learning matplotlib numpy pandas python random-forest scikit-learn seaborn-plots
Last synced: 02 Apr 2026
https://github.com/zakintaliban/indonesia-train-passenger-forecasting
A project to analyze and forecast train passenger numbers in Indonesia using Python, Pandas, and Scikit-learn.
bps data-analysis data-science dataanalysis datascience forecasting indonesia kereta-api machine-learning machinelearning numpy pandas python scikit-learn scikitlearn seaborn time-series
Last synced: 29 Apr 2026
https://github.com/kushalagarwalla/netflix-movie-data-analysis
🚀 Netflix Data Analytics Project 🎬📊 | Analyzed 9K+ movies to uncover insights on genres, popularity, votes & release trends. Includes EDA, KPIs & visualizations using Python (Pandas, NumPy, Matplotlib, Seaborn). Supports data-driven content & engagement strategy.
data-analysis data-visualization jupyter-notebook numpy pandas python seaborn
Last synced: 06 May 2026
https://github.com/samiksha29-patil/flipkart-mobiles-data-analysis-visualization-in-python
This project analyzes the Flipkart Mobiles Dataset to extract useful insights about mobile phones, their pricing, ratings, discounts, and customer reviews. The analysis and visualization are done using Python to understand market trends and customer preferences.
data-analysis data-visualization matplotlib numpy pandas python seaborn
Last synced: 04 May 2026
https://github.com/labex-labs/numpy-for-beginners
This comprehensive course covers the fundamental concepts and practical techniques of NumPy, the essential library for numerical computing in Python. Learn to create, manipulate, and analyze arrays efficiently.
array-manipulation array-slicing beginner-friendly course data-analysis data-science data-structures fast-computation hands-on labex labs linear-algebra matrix-operations numerical-computing numpy programming python python-programming scientific-computing vectorized-operations
Last synced: 20 Jun 2026
https://github.com/vitor-ace/sunspots-data-analysis
This is a Jupyter Notebook that applies data analysis logic and libraries in Python.
data-analysis data-visualization debbuging error-handling file-handling matplotlib-pyplot numpy pandas python
Last synced: 06 May 2026
https://github.com/hecatops/ad_libs
A real-time advertisement data analytics platform that displays important metrics in easy-to-understand language.
dashboard data-analysis data-visualization kpi plotly-dash python
Last synced: 07 Nov 2025
https://github.com/sandergi/ekichabi
A digital phonebook to connect subsistence farmers in Tanzania. Works via USSD so farmers without an internet connection can use it (via their telecom provider). Built with Django in Python and a MySQL database. This is a public copy of the private repo with user information stripped.
android data-analysis ict4d research ussd
Last synced: 14 May 2026
https://github.com/zwelz3/unofficial-survivor-knowledge-graph
A comprehensive RDF knowledge graph covering all 50 seasons of Survivor (US), with 23,000+ triples across 749 named graphs.
Last synced: 23 May 2026
https://github.com/ddjain/jsonl-visualizer
A beautiful web tool for visualizing JSONL files with syntax highlighting and multiple view modes
data-analysis json jsonl viusal
Last synced: 18 Sep 2025
https://github.com/karsterr/repeated-measurement
An R-based workflow for conducting repeated measures ANOVA using the ez package, with data wrangling via tidyverse and visualization through ggplot2. Includes data import, transformation to long format, statistical analysis, and graphical summary.
anove data-analysis experimental-design ezanove ggplot2 r repeated-measurements rstats statistics tidyverse
Last synced: 18 Sep 2025
https://github.com/ariyaarka/sales-analysis
A simple analysis of a random pizza sales dataset using SQL
data-analysis presentation-slides sql
Last synced: 17 Jan 2026
https://github.com/rh01/data-analysis-with-r
Duke University - Data Analysis With R
data-analysis r r-language r-studio rmarkdown
Last synced: 23 May 2026
https://github.com/jpcadena/cancer-classification
Breast cancer classification project.
cancer-detection classification data-analysis data-science deep-learning imblearn machine-learning neuronal-network numpy pandas pylint python scikit-learn supervised-learning tensorflow
Last synced: 09 Apr 2026
https://github.com/nagasai123-k/twitter-sentiment-analysis
Twitter sentiment analysis
csv data-analysis data-analysis-python data-analytics data-science data-visualization ipynb ipynb-jupyter-notebook jyputer-notebook jypyternotebook jyutping md python raw-data
Last synced: 02 Mar 2025
https://github.com/ljadhav25/healthcare-data-collection-and-analysis
This repository contains a project focused on collecting healthcare data from the web, storing it in a structured format, and performing comprehensive analysis. The objective is to gather valuable health-related information, process and clean the data, and derive insights to support healthcare research and decision-making.
data-analysis data-visualization flask-application flask-backend html-css-javascript pycharm-ide python
Last synced: 09 Apr 2026
https://github.com/ramapinnimty/udacity-mlfoundation-nanodegree
This is a repository containing solutions to the assignments that are a part of the Udacity Machine Learning Foundation Nanodegree program.
assignments data-analysis python3 statistics udacity-machine-learning-nanodegree
Last synced: 26 Jul 2025
https://github.com/sharoonjoseph321/indian-liver-diseases
Indian Liver Disease Analysis and Prediction. This project leverages the Indian Liver Patient Dataset (ILPD) to analyze liver disease trends and develop predictive models for early diagnosis. Through data preprocessing, exploratory analysis, and machine learning, it identifies key risk factors and builds classification models.
data-analysis data-science data-visualization logistic-regression machine-learning pandas python seaborn
Last synced: 27 Jul 2025
https://github.com/hfzdzakii/dicoding-shipclusteringanalysisdataandmodelling
This repo is a master submission for my Dicoding Final Project. The Ship Performance Clustering Dataset was used for the submission. Feel free to explore; I hope my work gives you some insight!
clustering data-analysis machine-learning
Last synced: 27 Jul 2025
https://github.com/hemangsharma/bookingdataanalysisreport
The report helps understand key trends and insights around customer bookings, pricing, and other related attributes.
analysis data data-analysis data-analytics data-visualization streamlit streamlit-dashboard
Last synced: 14 May 2026
https://github.com/danymukesha/bioga
Apply multi-objective genetic algorithms to genomic data for biologically informed feature selection and pattern discovery.
data-analysis gene-expression genetic-algorithms genomics optimization-algorithms
Last synced: 18 Sep 2025
https://github.com/dmvianna/python-nix
Trivial Nix environment with pandas and postgresql
Last synced: 27 Jul 2025
https://github.com/grindelfp/data-analysis-example
One of the projects from my university Artificial Intelligence Systems course.
data-analysis data-preprocessing ipynb
Last synced: 19 Sep 2025
https://github.com/tbep-tech/tbeploads
R Package for estimating nutrient loading to Tampa Bay
data-analysis loads package tampa-bay tbep tbnmc water-quality
Last synced: 19 Feb 2026
https://github.com/jofaval/ionosphere
Binary Classification of Ionosphere signals at Goose Bay, Labrador in 1988
data-analysis data-science data-visualization deep-learning google-colab keras machine-learning python scikit-learn tensorflow uci xgboost
Last synced: 09 Apr 2026
https://github.com/zulfachafidz/green_horizon_forecasting_peak_organic_avocado_sales_with_the_prophet_algorithm
The Green Horizon Project leverages the Prophet algorithm to predict peak sales of organic avocados, supporting the campaign "APEAM GO ORGANIC." Using Python and Looker Studio, this analysis aims to provide deep insight into sales trends and potential, forming the basis of smarter marketing strategies.
algorithm algorithms analytics data data-analysis data-engineering data-mining data-science data-visualization forecasting machine-learning machine-learning-algorithms prophet-model python python-script
Last synced: 17 May 2026
https://github.com/benmar2406/rent-in-germany
Interactive visualizations and maps depicting topics around rent prices and income in Germany built with Svelte.
charts d3 d3-visualization d3js data-analysis data-visualization gis gis-data infographic infographics map mapbox mapbox-gl mapbox-gl-js mapboxgl svelte
Last synced: 26 Mar 2025
https://github.com/tbep-tech/tbep-r-training
Repository for miscellaneous R training materials
data-analysis open-science workshop
Last synced: 19 Feb 2026
https://github.com/tbep-tech/pep-graphics
Materials for generating PEP graphics
data-analysis pep water-quality
Last synced: 19 Feb 2026
https://github.com/tbep-tech/tberf-oyster
Materials for evaluating TBERF oyster restoration success
ccmp-bh4 ccmp-bh6 data-analysis tampa-bay tbep tberf
Last synced: 19 Feb 2026
https://github.com/tbep-tech/pep-r-training
Materials for PEP R training
data-analysis open-science workshop
Last synced: 19 Feb 2026
https://github.com/shafaq-aslam/predicting-heart-disease-risk-with-logistic-regression-techniques
Develop a predictive model using logistic regression techniques to assess heart disease risk based on patient health metrics and data analysis.
data-analysis heart-disease logistic-regression machine-learning machine-learning-models matplotlib numpy pandas python scikit-learn seaborn
Last synced: 09 Apr 2026