Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-23 00:07:29 UTC
- JSON Representation
https://github.com/mindful-ai-assistants/credit-card-prediction
💳 This repository focuses on building a predictive model to assess the likelihood of credit card defaults. The project includes data analysis, feature engineering, and machine learning to provide accurate default predictions.
artificial-intelligence data-analysis data-science jupyter logistic-regression machine-learning oneness-consciousness predictive-modeling python3 scikit-learn
Last synced: 12 Apr 2025
https://github.com/nunesma/Health-analytics
Data analysis focusing on health problems
data-analysis epidemiology-analysis health-analytics python r-programming
Last synced: 30 Jul 2025
https://github.com/jimbrig/EDA
Exploratory Data Analysis R Package and Shiny App
data-analysis data-visualization eda r shiny
Last synced: 30 Jul 2025
https://github.com/avinesh-masih/data-analytics-assignment
Complete PW Skills Data Analytics Assignments: This repository contains all PW Skills Data Analytics assignments, covering topics like Python, SQL, Statistics, Data Visualization, and more. It includes well-structured solutions with notebooks and queries, ideal for learners seeking clarity and hands-on practice.
ai api data-analysis data-science data-visualization eda flask jupyter-notebook machine-learning matplotlib numpy pandas pw pw-assignment pw-skills-assignment pwskills python seaborn sql statistics
Last synced: 13 Jun 2025
https://github.com/ivangrigorov/neutrino-search-engine
Creating Java search engine both for HTML or document type of files
data data-analysis data-knowledge information-extraction information-retrieval java-language search-engine
Last synced: 31 Mar 2025
https://github.com/giatraskon/clustering-countries-socioeconomic-health-analysis
Exploration and analysis of socio-economic and health data from 167 countries using MATLAB. Application of clustering algorithms to identify development patterns, visualize disparities, and understand global trends.
calinski-harabasz-index clustering country-data data-analysis data-visualization davies-bouldin-index elbow-method feature-selection health-indicators human-development-index k-means-clustering k-median-clustering k-medoids-clustering machine-learning matlab pca pearson-correlation silhouette-score socio-economic-indicators unsupervised-learning
Last synced: 29 Jan 2026
https://github.com/wiseaidev/corona-virus-data-analysis-modeling-and-visualization
Data analysis of covid-19 and SEIRD model implementation.
coronavirus coronavirus-tracking covid-19 data-analysis data-analysis-python data-visualization folium-maps modeling-dynamic-systems numpy ploty population python3 science science-research seird-model seird-simulator simulation
Last synced: 14 Apr 2025
https://github.com/DCS-training/intromachinelearning
This course is aimed at providing an introduction to machine learning for those with some beginner level python/Rstudio skills. Go to the readme file
data-analysis data-wrangling machine-learning python statistics
Last synced: 25 Apr 2025
https://github.com/lafayettegabe/g2m-insight-for-cab-investment-firm
📊 Exploratory Data Analysis (EDA) on multiple datasets related to the cab industry in the US, to provide actionable insights and recommendations to a private firm looking to invest in the market. The analysis includes data cleaning, transformation, visualization, and hypothesis testing.
big-data data data-analysis data-science data-visualization eda gotomarket
Last synced: 13 Jun 2025
https://github.com/saltiola7/data-analysis-portfolio
Data engineering & analysis portfolio, which showcases my use of Python & SQL
airflow airtable-block anaconda automation back4app chatgpt csv-parser data-analysis data-engineering docker-compose gcp graphql-api jupyter-notebook nosql prefect python rest-api sql streamlit web-scraping
Last synced: 21 Jan 2026
https://github.com/jhrcook/100daysofpython
100 days, at least 1 hour a day, of learning the Python programming language.
100-days-of-code 100daysofcode continued-learning data-analysis data-science decision-trees deep-learning keras keras-tensorflow machine-learning neural-network neural-networks plots python python3 scikit-learn tensorflow
Last synced: 09 Apr 2025
https://github.com/asifdotexe/timeseriesanalysis
This repository serves as a central hub for all of my projects related to time series analysis. Here, you'll find a collection of projects, code samples, and resources that explore various aspects of time series data and its analysis.
data-analysis feature-engineering jupyter-notebook pandas python time-series-analysis visualization
Last synced: 14 Apr 2026
https://github.com/aravind-selvam/covid_dashboard
With Covid death and vaccine data. I have created a dashboard.
covid-19 data-analysis data-science data-visualization tableau tableau-public visualization
Last synced: 08 Mar 2026
https://github.com/luminati-io/Amazon-dataset-samples
A sample dataset of over 1,000 Amazon product listings, extracted using the Bright Data API, perfect for competitive analysis, market trends, and eCommerce insights.
amazon api data-analysis data-science dataset ecommerce products web-scraping
Last synced: 09 Apr 2025
https://github.com/code-jl/nfl-point-kicker-data-scraper
A Python-based web scraping toolkit that extracts and processes NFL kicking statistics from Pro-Football-Reference. This project automates the collection of comprehensive game data, with a particular focus on field goal attempts and environmental conditions.
automation beautifulsoup csv data-analysis data-collection field-goals football-statistics kicking-stats nfl python selenium sports-analysis statistics weather-data web-scraping
Last synced: 06 Sep 2025
https://github.com/camille-maslin/simulfcimage
🔍 SimulFCImage: A professional multispectral image processing application developed for ImViA Laboratory.
academic-project computer-vision data-analysis gui-application image-processing image-viewer multispectral-images pyqt5 python scientific-visualization spectral-analysis
Last synced: 05 Feb 2026
https://github.com/mustafah/dream-my-plots
Create visual plots in Python with the help of text prompting popular LLMs through langchain
ai artificial-intelligence automation data-analysis data-visualization langchain llms machine-learning plotting python
Last synced: 13 Apr 2026
https://github.com/scailfin/flowserv-core
Reproducible and Reusable Data Analysis Workflow Server
benchmarks data-analysis reproducibility reusability workflows
Last synced: 14 Jan 2026
https://github.com/tathithienthanh/dataanalysis_diagnosis-of-diabetes-based-on-data-set-of-blood-test-result
Implement all learned knowledge about data analysis and data mining to make a complete project about Diagnosis of diabetes based on data set of blood test result
blood-test classification clustering data-analysis data-processing decision-tree diabetes-prediction diagnosis exercise google-colab health hierarchical ipynb kmeans knn knns py python smote-sampling visualization
Last synced: 08 Jan 2026
https://github.com/tillbiskup/cwepr
A Python package based on the ASpecD framework for handling cwEPR data.
continuous-wave data-analysis data-processing electron-paramagnetic-resonance reproducible-research reproducible-science
Last synced: 06 Sep 2025
https://github.com/agungbudiwirawan/supermarket_sales_dashboard-sales-analysis-using-pivot-table-and-chart
The objective of this project is to analyze supermarket sales using pivot table and chart in Microsoft Excel. After doing the analysis, I created an interactive dashboard that aims to make it easier for the audience to explore data.
dashboard data-analysis data-science data-visualization excel sales-dashboard spreadsheet
Last synced: 08 Jan 2026
https://github.com/is-leeroy-jenkins/badger
A data science & analysis toolkit for federal analysts with the environmental protection agency based on WPF, Net 8, and is written in C#.
ai budget budget-management data-analysis data-science data-visualization federal-government large-language-models machine-learning
Last synced: 14 Oct 2025
https://github.com/quantumudit/python-projects
Consists of various projects that are primarily powered by Python
data-analysis data-science data-visualization jupyter-notebook projects python pythonapplication pythonprojects
Last synced: 09 Apr 2025
https://github.com/ernestaroozoo/memestocks.net
MemeStocks.net is a Python web app that tracks the historical popularity of specific stocks by monitoring Reddit mentions. Users can search for a stock symbol and view information such as the stock's name, price, and historical popularity data. The data is gathered using the Pushshift API and stored in a PostgreSQL database.
dashboard data-analysis financial-data meme-stock python reddit-scraper scraping sql stock streamlit
Last synced: 06 Mar 2026
https://github.com/devexpress-examples/web-forms-pivot-grid-bind-to-sql-data-source
This example demonstrates how to create an ASPxPivotGrid and bind it to data via code.
asp-net-web-forms data-analysis dotnet pivot-grid pivot-grid-for-web-forms
Last synced: 06 Jul 2025
https://github.com/ashwinpn/visualization
Data Visualization using Matplotlib, Pandas Visualization, Seaborn, ggplot, and Plotly.
analysis data data-analysis data-science data-visualization graphs plots python python3 visualization
Last synced: 13 Apr 2026
https://github.com/wanglaoshi/wanglaoshi-pypi
Useful tools for DA ML DL
data-analysis deep-learning machine-learning unitility
Last synced: 14 Jan 2026
https://github.com/thesimonho/overwatch-ranked-data
Dataset of my ranked Overwatch matches
data-analysis data-science data-visualization dataset overwatch statistics video-game
Last synced: 26 Mar 2025
https://github.com/wittline/data-analytics-with-r
Repository for data analytics course using R
cassandra-database cql data-analysis genetic-algorithm pentaho-data-integration r
Last synced: 07 Jul 2025
https://github.com/arzan101/ola-data-analytics
Ola - Identified the reason and trends for ride cancellation. Process - Cleaned and Processed Data from multiple sources, applied Sql queries and visualized data using PoweBi . Motive - To reduce the cancellation rate
dashboard data-analysis data-mining data-visualization dataanalytics excel powerbi sql
Last synced: 06 Jan 2026
https://github.com/llnl/hdtopology
High-dimensional topological data analysis library for NDDAV
analysis cpp data-analysis data-viz high-dimensional-data topological-data-analysis visualization
Last synced: 29 Apr 2025
https://github.com/its-kanii/predictive-maintenance-for-healthcare-equipment
Predictive Maintenance for Healthcare Equipment utilizes machine learning to analyze operational metrics and predict equipment failures. This project leverages a dataset of usage hours, temperature, and maintenance history to enhance equipment reliability and reduce downtime.
data-analysis data-science failure-prediction feature-engineering healthcare-equipment jupyter-notebook machine-learning predictive-maintenance python time-series-analysis
Last synced: 09 May 2026
https://github.com/henrylin03/video-games
Using Python and SQL to clean, analyse and visualise video games' data from Metacritic. Includes scraping using BeautifulSoup.
analysis beautifulsoup beautifulsoup4 data data-analysis data-science eda jupyter-notebook pandas python sql sqlite3 video-game video-games
Last synced: 14 Apr 2026
https://github.com/naso7y/students-performance-analysis
A project analyzing students' academic performance to identify trends and factors affecting outcomes. Built with Python, using data visualization and statistical techniques to derive actionable insights.
data-analysis data-visualization machine-learning python
Last synced: 23 Feb 2026
https://github.com/ivanildobarauna-dev/data-pipeline-sync-ingest
ETL Process for Currency Quotes Data" project is a complete solution dedicated to extracting, transforming and loading (ETL) currency quote data. This project uses several advanced techniques and architectures to ensure the efficiency and robustness of the ETL process.
business-intelligence data-analysis data-analytics data-engineering data-pipeline data-visualization etl-pipeline python
Last synced: 28 Oct 2025
https://github.com/jpquast/icp-ms-data-explorer
A shiny app for the exploration of ICP-MS data.
data-analysis icp-ms r shiny shiny-apps
Last synced: 17 Jan 2026
https://github.com/emso-c/stream-analyser
A tool that analyses YouTube live streams.
cli data-analysis guessing highlights python youtube-video
Last synced: 18 Jan 2026
https://github.com/markmelnic/scalg
List scoring algorithm. Analyse data using a range based procentual proximity algorithm.
algorithm data-analysis pypi pypi-package score scorer scoring scoring-algorithm
Last synced: 08 Oct 2025
https://github.com/markmelnic/carsen-desktop
A python dashboard app for scraping and tracking cars for sale on websites such as mobile.de
automation dashboard data-analysis interface scraping scraping-websites tkinter-python
Last synced: 08 Oct 2025
https://github.com/fenghaojiang/ethereum-etl
ETL(Extract, Transform, Load) data from Ethereum like EVM Block chain
Last synced: 14 Jan 2026
https://github.com/ahmednasef3/titanic-full-eda
Simple EDA for Titanic Dataset.
data-analysis data-visualization eda exploratory-data-analysis matplotlib pandas seaborn titanic titanic-data-analytics
Last synced: 27 Jan 2026
https://github.com/lacerbi/vbmc
Variational Bayesian Monte Carlo (VBMC) algorithm for posterior and model inference in MATLAB (old location)
bayesian-inference data-analysis gaussian-processes machine-learning matlab variational-inference
Last synced: 11 Oct 2025
https://github.com/briatte/asr
Applied Stats with R and RStudio (first-year social-science tutorials)
course data-analysis data-science data-visualization r statistics
Last synced: 14 Apr 2026
https://github.com/erictleung/erictleung.github.io
:memo: Source code for my website, portfolio of projects, and more
bioinformatics blog data data-analysis data-science github-jekyll github-page jekyll lanyon open-science open-source software-engineering
Last synced: 21 Jan 2026
https://github.com/codebypinar/reta
🍃 Explore the world of renewable energy production, analyze historical data, and predict sustainable energy trends. Join us on the journey to a greener future!
arima clean-energy data-analysis data-science data-visualization energy-future forecasting-models innovation renewable-energy sustainability time-series
Last synced: 12 Oct 2025
https://github.com/1sumer/sql
This repository contains SQL scripts and data for various analytical and database management tasks. The project is designed to demonstrate SQL capabilities in handling complex queries, data analysis, and database design. It includes datasets related to e-commerce and streaming services, with a focus on real-world scenarios and use cases.
analytics data data-analysis data-storage sql vscode
Last synced: 19 Jan 2026
https://github.com/hvignolo87/ortex-programming-challenge
Coding challenges required for the Python Developer and Data Engineer job positions.
challenge data-analysis finance pandas python scripting sql sqlalchemy
Last synced: 17 May 2026
https://github.com/winter000boy/dsa-practice
This repository holds my solutions for LeetCode’s Pandas playlists. Each section includes code and notes on using Pandas to handle real-world data tasks efficiently. Perfect for anyone looking to deepen their understanding of data manipulation with Pandas.
data-analysis data-science leetcode leetcode-python pandas-python python3
Last synced: 06 Feb 2026
https://github.com/afondiel/ibm-data-science-professional-certificate-coursera
IBM Data Science Professional Certificate Coursera Notes
ai classification clustering coursera data-analysis data-engineering data-mining data-science data-science-challenges data-science-projects data-scientist data-visualization ibm ibm-certificate ibm-professional-certificate linear-algebra machine-learning python regression statistics
Last synced: 13 Oct 2025
https://github.com/kyle-wannacott/DataCamp-Projects
DataCamp project solutions.
data-analysis data-mining data-science datacamp-projects machine-learning python r
Last synced: 13 Oct 2025
https://github.com/jupyterphysscilab/documentation
Documentation for the Jupyter Physical Science Lab Suite of Packages
analog-to-digital-converter data-acquisition data-analysis education jupyter-notebooks pandas physical-sciences plotting python raspberry-pi
Last synced: 22 Jan 2026
https://github.com/pizofreude/data-career-navigator
An interactive dashboard providing deep insights into career opportunities for data-related roles, utilizing a comprehensive dataset sourced from LinkedIn. Features include analysis of experience levels, salaries, key skills, job locations, and industry trends, aiding job seekers and professionals in exploring and identifying optimal career paths.
codeinplace data-analysis data-visualization standford-university
Last synced: 13 Mar 2026
https://github.com/hayesall/babybear
🐼 It's like pandas, but tiny.
data-analysis data-analysis-python data-science dataframe python teaching teaching-tool
Last synced: 31 May 2026
https://github.com/quantumudit/alteryx-weekly-challenges
This repository contains Alteryx solutions to the weekly challenges published in Alteryx Community
alteryx alteryx-workflow data-analysis data-science data-transformation data-visualization etl
Last synced: 27 Jan 2026
https://github.com/gher-uliege/liege-colloquium-on-ocean-dynamics
Python tools and latex files for the Colloquium
data-analysis data-assimilation numerical-simulations ocean-modelling oceanography remote-sensing submesoscale turbulence
Last synced: 14 Oct 2025
https://github.com/parisaroozgarian/ibm-data-analyst-professional-certificate
The IBM Data Analyst Professional Certificate, consisting of 9 courses, equips with essential skills in Excel, SQL, Python, data visualization, and analysis techniques
big-data business-analysis business-communication communication data-analysis data-management data-structures data-visualization databases general-statistics human-resources planning python-programming spreedsheet sql
Last synced: 27 Jan 2026
https://github.com/yeisonmontoya1815/machine-learning_prediction_can_inflation
we aim to predict trends in the Canadian market basket using sentiment analysis techniques. Sentiment analysis involves analyzing text data to determine the sentiment expressed, whether positive, negative, or neutral.
algorithms-and-data-structures data data-analysis data-science data-visualization feature-engineering machine-learning matplotlib-pyplot numerical-analysis numpy pandas pipelines python sklearn structured-data super unsupervised-learning
Last synced: 05 Feb 2026
https://github.com/dcs-training/bayesian-statistics
Materials for the CDCS Introduction to Bayesian Statistics course. Go to the readme file
bayesian-statistics data-analysis r statistics
Last synced: 05 Feb 2026
https://github.com/rvalla/chessevolution
Some code to analyze my chess games using the Lichess API.
chess data-analysis lichess lichess-api python
Last synced: 23 Oct 2025
https://github.com/trybnetic/tu7-acceleration-sleep-wake-classification
Supporting material for the paper ''Discrimination of sleep and wake periods from a hip-worn raw acceleration sensor using recurrent neural networks''
accelerometer accelerometry actigraphy data-analysis sensors sleep
Last synced: 01 Jun 2026
https://github.com/kevinschoon/qviz
QViz Interactive Plotting
data-analysis data-visualization go gonum qframe yaegi
Last synced: 01 Jun 2026
https://github.com/dcs-training/r-qgisintegratingspatialanalysis
This was an intermediate course of three sessions with a focus on developing skills in data visualisation, analysis and integration using both R studio and QGIS. Go to the readme file
data-analysis data-visualisation data-wrangling gis qgis r spatial-analysis
Last synced: 28 Jan 2026
https://github.com/elfgk/ogretmenanalizantalya
OgretmenAnalizAntalya
analysis data-analysis data-science data-visualization ogretmenanaliz
Last synced: 08 Feb 2026
https://github.com/thennen/py-ivtools
A package for reproducible measurement and analysis of current-voltage characteristics of electronic devices.
current-voltage data-analysis data-visualization electrical-engineering emerging-technology instrumentation measurements
Last synced: 23 Jan 2026
https://github.com/roland045/road_quality_measurement_analysis
Novel road quality measurement system for cost effective pavement monitoring, ML-based
azure data-analysis data-engineering data-science machine-learning mlops model-deployment python sql unsupervised-learning
Last synced: 24 Jan 2026
https://github.com/mrjxtr/tokyo_airbnb_analysis_project
Full project case study and analysis to show potential opportunities to start an AirBnb business in Tokyo, Japan.
data-analysis data-cleaning data-science data-visualization pandas python3
Last synced: 24 Feb 2026
https://github.com/anil951/early-detection-of-mental-health
This project develops a predictive model to identify early signs of mental health issues in adolescents using social media activity, school performance, health records, and an AI chatbot. It analyzes emotional tone, academic changes, and health data, offering personalized recommendations and resources for mental wellness.
data-analysis deep-learning early-detection lstm mental-health sentiment-analysis social-media
Last synced: 28 Jan 2026
https://github.com/aloth/power-bi-book-resources
Official resources for "Teach Yourself VISUALLY Power BI" by Alexander Loth (Wiley). Get all Power BI project files (.pbix) and datasets to follow along with the visual, step-by-step exercises in the book.
analytics bi business-analytics business-intelligence dashboards data-analysis data-cleaning data-modeling datavisualization dax etl microsoft microsoft-power-bi power-bi-desktop power-platform powerbi powerquery reporting sql visualization
Last synced: 19 Feb 2026
https://github.com/spacebody/mcm-icm-2018-problem-c
The source code of MCM/ICM 2018 Problem C
Last synced: 13 Apr 2025
https://github.com/mindful-ai-assistants/movierevenueanalysis
🎬💰 Analyze movie companies' revenue, release strategies, and financial performance using statistical techniques for actionable insights. This project explores data on total revenue, number of releases, and lifetime gross to uncover patterns that can drive strategic decisions in the film industry.
correlation-analysis data-analysis data-science heatmap jupyter-notebook oneness-consciousness open-source python statistical-analysis statistical-analysis-and-hypothesis-testing statistics ttest
Last synced: 14 Apr 2025
https://github.com/bocaletto-luca/world-bank-explorer
World Bank Explorer is an interactive and responsive web application that retrieves, visualizes, and compares global development indicators sourced from the World Bank Open Data API. The application allows users to explore data on multiple scales ... By Bocaletto Luca
api bocaletto-luca chartjs css3 data-analysis data-visualization economic-indicatos economic-trends free-data global-development html5 interactive-dashboard javascript open-data open-source publicdata responsive world-bank
Last synced: 12 Mar 2026
https://github.com/umbrellaleaf5/drugdesign_data_analysis
Module of the DrugDesign project responsible for loading and pre-processing data from ChEMBL and PubChem, necessary for further modeling and analysis in drug development
chembl chemistry dafe data-analysis doxygen-documentation mipt pubchem python requests
Last synced: 15 Aug 2025
https://github.com/enricocid/monitoraggio-vaccini-italia
Sito web statico per github.com/apalladi/covid_vaccini_monitoraggio
covid-19 covid-19-data covid-19-data-analysis data data-analysis data-visualization dataset python python3 python37 sars-cov-2
Last synced: 09 Sep 2025
https://github.com/shramkoweb/bookbot
A Python-based text analyzer that counts words and character frequencies in any .txt file, providing a detailed, sorted report. Perfect for quick text insights and learning text processing basics!
automation beginner-friendly character-frequency data-analysis file-processing open-source python text-analysis text-parser text-processing word-count
Last synced: 02 Feb 2026
https://github.com/dataopstix/modelt
Modelt(mow·delt) is a modern data integration solution that connects data to data for advanced analytics.
airbyte airflow airflow-docker data data-analysis data-visualization database dbt elt etl etl-automation metabase metadata modern modern-dev modernization
Last synced: 28 Mar 2025
https://github.com/lykmapipo/scala-spark-product-sales-analysis
Scala application to process, and analyze product sales using Spark
anomaly-detection apache-spark apache-spark-sql customer-segmentation data-analysis data-processing lykmapipo market-basket-analysis product-sales product-sales-analysis rolling-average running-total sbt scala summary-statistics time-series-analysis
Last synced: 18 May 2026
https://github.com/yukito0209/predict-podcast-listening-time
Kaggle · Playground Prediction Competition, Playground Series - Season 5, Episode 4
data-analysis ensemble-learning jupyter-notebook kaggle-competition machine-learning prediction
Last synced: 10 Apr 2025
https://github.com/arzan101/ev--car-data-analysis
This Power BI dashboard provides an interactive and data-driven overview of the electric vehicle (EV) landscape. It visualizes key insights across various dimensions including sales trends, model performance, manufacturer comparisons, and market growth. The purpose of the dashboard is to enable stakeholders to explore and analyze development
data-analysis data-science data-visualization database datacleaning excel powerbi
Last synced: 17 Jun 2025
https://github.com/shibam120302/black-friday-sales-data-analysis
This repository contain Data Analysis on Black Friday Sales Data using various Regression ML algorithms
data-analysis eda machine-learning python random-forest regression
Last synced: 20 May 2026
https://github.com/rgalyeon/machine_learning_and_data_analysis
Machine Learning and Data Analysis specialization by Yandex and MIPT
coursera data-analysis data-science machine-learning mipt python yandex
Last synced: 03 Mar 2025
https://github.com/nafisalawalidris/analyzing-nobel-prize-dataset-demographics-and-trends
This project analyses a Nobel Prize dataset using Python and data analysis libraries. It explores the distribution of winners by category and country, examines the proportion of female winners over time, investigates the age of winners when they received the prize and identifies the oldest and youngest recipients.
age-at-award country-distribution data-analysis data-manipulation dataset demographics filtering gender-balance grouping nobel-prize notable-laureates python trends visualisation winners
Last synced: 19 May 2026
https://github.com/dcs-training/from-spss-to-r-how-to-make-your-statistical-analysis-reproducible
Comfortable/aware of how to run your stats in SPSS? Curious to learn how to run them in R? You've come to the right place. Go to the readme file
data-analysis data-visualisation data-wrangling good-practices-digital-research r rmarkdown spss statistics
Last synced: 25 Jan 2026
https://github.com/poga/dat-ipynb-demo
use ipython notebook to analyze data in dat archive
dat data-analysis distributed jupyter-notebook
Last synced: 17 Aug 2025
https://github.com/prathmesh2507/india-superstore-powerbi-dashboard
Interactive India Superstore Sales Dashboard built using Power BI
business-intelligence dashboard data-analysis data-visualization powerbi
Last synced: 16 May 2026
https://github.com/michaelcurrin/water-crisis-scraper
Scrape and explore data related to Cape Town's water crisis (Python3 application)
cape-town cron csv dam-levels data-analysis html open-data python3 schedule scraping south-africa water-crisis water-level webscraping
Last synced: 28 Jul 2025
https://github.com/emptymalei/mini-lab
Some code snippets used to explain stuff to myself in my personal data science wiki
data-analysis data-mining data-science data-visualization datascience
Last synced: 07 Apr 2025
https://github.com/emelyantsev/digital-twin-data-analysis
Notebooks for R&D tasks
data-analysis data-visualization
Last synced: 21 Mar 2025
https://github.com/jpcadena/classification-tweets-national-security-ecuador
Classification of Tweets about national security at Ecuador 2022
classification classification-model data-analysis data-science ecuador insecurity machine-learning natural-language-processing nlp nltk numpy pandas python pytorch scikit-learn snscrape supervised-learning tensorflow tweet twitter
Last synced: 07 Apr 2026
https://github.com/cosmoduende/r-marvel-vs-dc
DC Comics vs Marvel Comics - Exploratory Data Analysis and Data Visualization with R. Who has the smartest, strongest, fastest, or most powerful hero or villain? How to answer this and more questions with R
comics data-analysis data-analysis-r data-analytics data-visualization dataviz dc-characters dc-comics eda exploratory-analysis exploratory-data-analysis exploratory-data-visualizations marvel-characters marvel-comics marvel-vs-dc shdb superherodb superheroes superheros
Last synced: 11 Apr 2025
https://github.com/mohd-faizy/07p_tumor-diagnosis-exploratory-data-analysis-on-breast-cancer-wisconsin-dataset
Tumor Diagnosis: Exploratory Data Analysis With Seaborn
data-analysis data-visualization eda exploratory-data-analysis knn-classification pca-analysis python random-forest random-forest-classifier statistics support-vector-machines tumor-detection visualization
Last synced: 17 May 2026
https://github.com/bkataru/physics-ia
Programs and files written for Astrostatistics for IB Physics IA. Topic: Visualizing and analyzing the habitable zones for 150,000 stars from the hipparcos catalogue.
astronomical-algorithms astronomy astrophysics astrostatistics data-analysis data-science data-visualization matplotlib plotting
Last synced: 07 Jul 2025
https://github.com/thisisashukla/survival-analysis
Hands-On Survival Analysis in Python
data-analysis data-science survival-analysis
Last synced: 28 Jul 2025
https://github.com/janheinrichmerker/song-analysis
Analysing the Million Song Dataset.
big-data data-analysis data-science hadoop hadoop-mapreduce java kotlin songs
Last synced: 04 May 2026
https://github.com/jakebrehm/demesstify
📱Demystifies your messages and allows for easy analysis and visualization of conversations.
data-analysis data-science imessage messages messaging nlp pandas python sentiment-analysis visualization wordcloud
Last synced: 13 Apr 2025
https://github.com/sarincr/data-analytics-with-knime
Data Analytics with KNIME (Konstanz Information Miner), a free and open-source data analytics, reporting and integration platform. KNIME integrates various components for machine learning and data mining through its modular data pipelining concept. A graphical user interface and use of JDBC allows assembly of nodes blending different data sources, including preprocessing (ETL: Extraction, Transformation, Loading), for modeling, data analysis and visualization without, or with only minimal, programming.
ai artificial-intelligence artificial-intelligence-algorithms artificial-neural-networks data-analysis data-mining data-science data-structures data-visualization database datascience deep-learning machine-intelligence machine-learning machine-learning-algorithms machinelearning mining mining-software
Last synced: 14 Mar 2025
https://github.com/kevinyang372/san-francisco-crime-data-analysis
An ARIMA prediction model for forecasting potential crimes based on users' time and location
data-analysis machine-learning
Last synced: 29 Oct 2025
https://github.com/emaasit/pydata-book
Learning data analysis with python
data-analysis jupyter pandas python
Last synced: 12 Jul 2025
https://github.com/praju-1/pandas
The library is widely used in data science and machine learning for data cleaning, preparation, and analysis.
Last synced: 17 Feb 2026