data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-07-02 00:07:45 UTC
- JSON Representation
https://github.com/kirillsemyonkin/lsd
LSD (Less Syntax Data) configuration/data transfer format.
configuration data java parsing rust
Last synced: 27 Feb 2026
https://github.com/mwiatrzyk/modelity
Data parsing and validation library for Python
data library model parsing python tool validation
Last synced: 18 Jan 2026
https://github.com/quonverbat/ordner
A simple, customizable and cross-platform data tracker.
data datatracker javafx management
Last synced: 07 Jul 2025
https://github.com/danieljdufour/fast-b64
Quickly Convert between B64 and Binary Strings
b64 base64 base64-decoding base64-encoding binary bits compression data
Last synced: 08 Oct 2025
https://github.com/rorovic/rorovic.github.io
my github blog
code data datawarehouse devops realtime
Last synced: 01 Feb 2026
https://github.com/jacob-pitsenberger/python-electronics-inventory-management-system-object-oriented-programming-project
Welcome to the Python Electronics Inventory Management System project repository! This project is a demonstration of Object-Oriented Programming (OOP) principles in Python for managing an electronic parts inventory.
data data-structures dictionary exception-handling file-io filesystem input-output inventory-management-system management-system modules oop pickle python user-interface
Last synced: 08 Oct 2025
https://github.com/rahulthedevil/metric-converter
A simple utility package for converting between metric units such as meters, kilometers, grams, kilograms, liters, and more. Simple and powerful way for Units Convert solution
convert converter data fraction imperial length mass measurements metric metrics ratio system temperature unit unit-conversion unit-converter units uom utilities weight
Last synced: 08 Oct 2025
https://github.com/aiwithqasim/project_allocation_system
Project Allocation System (PAS) automates and simplifies the process of Allocating projects to students. Teachers can simply add details on prompting for input and perform a number of operation modules including Adding Projects, Updating Projects, Searching Projects , Deleting Projects and Display All Projects
algorithms-and-data-structures algorthims c-plus-plus data data-structures linked-list
Last synced: 08 Oct 2025
https://github.com/nits2612/data-science-projects
Portfolio of data science projects completed by me during PGP AI/ML, self learning, and hobby purposes.
data data-science dataanalysis deep deep-learning keras machine-learning matplotlib numpy opencv pandas python scikit-learn seaborn surprise-python tensorflow transfer-learning
Last synced: 01 Feb 2026
https://github.com/h-sutiwas/r2de-2025
This repository is related to the Road To Data Engineer Bootcamp by DataTH. It contains all related coursework, some mini projects and other resources within the field of Data Engineering.
data data-engineering data-visualization docker gcp pipeline spark
Last synced: 30 Apr 2026
https://github.com/stoyank7/football-prediction
This is my Semester 7 Project for my "AI for Society" minor at Fontys University of Applied Sciences.
ai betting data football machine-learning university-project
Last synced: 25 Mar 2025
https://github.com/ohspc89/better_call_jin
A repository containing mentoring materials for a Ph.D. student in Neuroscience
data matlab spss-statistics visualization visualization-tools wrangling-data
Last synced: 08 Oct 2025
https://github.com/pythoncoderunicorn/jamesbeardaward
a repo for James Beard Award data
Last synced: 07 Feb 2026
https://github.com/keminghe/osu
Unofficial and publicly-available NPM data-package about The Ohio State University.
college data majors ohio-state organizations public students university unofficial
Last synced: 06 Jan 2026
https://github.com/rissh/titanicsurvivalpredictionusingml
Predicting Titanic passenger survival through machine learning. This project includes data preprocessing, exploratory data analysis, feature engineering, and model training using Python. 🚢
data data-analysis data-science data-visualization dataanalysis jupiter-notebook machine-learning machine-learning-algorithms machinelearning matplotlib numpy pandas prediction prediction-model python python3 seaborn tenserflow tflearn titanic
Last synced: 01 Feb 2026
https://github.com/adithivs/prodigyy_ds_03
data data-visualization datapreprocessing decision-tree-classifier
Last synced: 07 Oct 2025
https://github.com/abdellah-laassairi/thyroid-disease-analysis
Thyroid dataset visualization dashboard in R
dashboard data flexdashboard imputation-methods rshiny visualization
Last synced: 18 Jan 2026
https://github.com/pythoncoderunicorn/startrek
a repo for Star Trek data from Technical Manuals
data klingon-language star-trek vulcan
Last synced: 07 Oct 2025
https://github.com/okieraised/rke2-deployment
Single-node RKE2 deployment
data helm helm-charts helm-deployment rke2
Last synced: 17 Mar 2026
https://github.com/iankitnegi/tableautales
"Discover my Tableau journey! Dive into data-driven stories, visualizations, and projects as I explore the power of data visualization."
data data-visualization tableau
Last synced: 21 Jan 2026
https://github.com/ms140569/loki-example-store
Testdata for loki password manager
Last synced: 26 Feb 2026
https://github.com/prajjwol09/sql_retail_analysis_project
This project demonstrates SQL-based data cleaning, exploration, and business analysis on a retail sales dataset. It involves setting up a database, removing null values, performing EDA, and using SQL queries to extract key insights such as top customers, best-selling categories, and monthly sales trends.
data data-analysis datacleaning dataexploration pgadmin4 sql
Last synced: 15 Feb 2026
https://github.com/ompreetham/data-structures
binary-search-tree c data data-structures datastructures graph linked-list list stack structures tree
Last synced: 25 Mar 2025
https://github.com/mlkav/digital-talent-scholarship
Learn in Digital Talent Scholarship Program
data data-science digital-talent-scholarship dts google-cloud google-cloud-platform science
Last synced: 26 Feb 2026
https://github.com/drostlab/biodbretrievr
Retrieve and efficiently index entire biological sequence databases
biological-data biological-sequences data databasestoring retrieval
Last synced: 26 Feb 2026
https://github.com/eharshit/end-to-end-vendor-insights
End-to-end analysis of vendor performance for wholesale/retail businesses, featuring data ingestion, cleaning, insights, and interactive Power BI dashboards.
analysis analysis-algorithms analytics dashboard data data-analysis datascience jupyter jupyter-notebook pandas powerbi powerbi-report retail wholesale
Last synced: 07 Oct 2025
https://github.com/alextanhongpin/node-github-api
:page_with_curl: sample github api queries with nodejs for scraping purposes
Last synced: 06 May 2026
https://github.com/assada/free-words
Data for/from NLP
corpus-data data nlp-machine-learning npl
Last synced: 26 Feb 2026
https://github.com/mehmetkahya0/earthquake-tracker
Earthquake Tracker, A real-time earthquake monitoring application that visualizes seismic activity worldwide using interactive maps and data visualization.
ai api css cursor data data-vizualisation earth-observation earthquake earthquake-data earthquake-visualization earthquakes html js modern-web scrape ui ui-design web
Last synced: 15 Apr 2026
https://github.com/ymorsi7/quranicvisualization
A visual exploration tool for the Holy Quran using D3.js treemaps.
css d3 d3js data data-visualization html islam islamic javascript js quran quranic treemaps visualization
Last synced: 15 Apr 2026
https://github.com/mahtabranjbar/onlineshopping_analysis_dashboard
This project analyzes online shopper behavior using various machine learning models and EDA techniques.
dashboard data dataanalysis eda machine-learning streamlit
Last synced: 08 Feb 2026
https://github.com/gman-au/white-knight
Experimental .NET data abstraction using specification pattern
abstractions data datastore dotnet repository-pattern specification-pattern
Last synced: 17 Mar 2026
https://github.com/vim89/flowforge
Let's be honest - most data pipeline frameworks treat types as suggestions. Config files are strings. Schemas are "validated" at runtime. Data quality is an afterthought. So, let's do differently
archetype data data-contracts data-engineering data-pipelines data-quality data-science database dataengineering datapipeline etl etl-framework pipelines scala scalability spark spark-sql spark-streaming
Last synced: 14 Apr 2026
https://github.com/lohithgsk/dynamic-qr-generator
A Python-based QR generator application was developed using the qrcode and Pillow libraries, dynamically generating QR codes for custom data inputs. Designed for a college grievance management system, the application creates QR codes containing block, floor, room, and machine numbers, allowing easy placement and identification on each floor.
data pillow python qrcode qrcode-generator
Last synced: 16 Mar 2025
https://github.com/tanyagarg25/project_covidanalysis
This repository is a project for analyzing COVID-19 data using SQL and visualizing it with Tableau. Technologies used include SQL for querying and Tableau for data visualization.
analysis dashboard data data-visualization sql tableau
Last synced: 08 Feb 2026
https://github.com/pawamoy/keycut-data
Keyboard shortcuts data stored in YAML files
Last synced: 12 Feb 2026
https://github.com/andykee/aurora
A lightweight tool for indexing, cataloging, and browsing data.
catalog data data-catalog data-discovery indexing metadata metadata-extraction search-and-discovery
Last synced: 17 Jan 2026
https://github.com/abdullahashfaqvirk/earth-engine-data-scraper
A Python based web scraper designed to extract and organize dataset metadata from the Google Earth Engine Datasets Catalog for research, and analysis purposes.
beautifulsoup data data-science python requests scraper web-scraping
Last synced: 10 May 2026
https://github.com/rysteq/abstract-data-structures
This repository contains two programs written in C about the stack and queue ADT's
abstract-data-structures c data queue stack
Last synced: 06 Oct 2025
https://github.com/michaelfromyeg/lyrics
Lyric-store and API hosted on Git.
Last synced: 08 Feb 2026
https://github.com/danicaalana/wine-dataset-decision-tree
This project is developed as part of Digital Skill Fair (DSF) 35.0 - Data Science by Dibimbing. I am using Wine Recognition Dataset from scikit-learn, which is the results of a chemical analysis of wines grown in the same region in Italy by three different cultivators.
data data-analysis-python data-science decision-tree-classification machine-learning python scikit-learn wine-dataset
Last synced: 18 Apr 2026
https://github.com/tsbarr/belly-button-challenge
Using front-end development tools (javascript, html and css) I built an interactive dashboard to explore the Belly Button Biodiversity dataset, which catalogs the microbes that colonize human navels.
data data-visualization javascript
Last synced: 04 Mar 2026
https://github.com/soenneker/soenneker.dtos.requestdataoptions
A flexible request options object for paging, sorting, and filtering queryable data, similar to OData-style parameters.
controller coordinator csharp data dotnet dto dtos http manager object odata options request requestdataoptions
Last synced: 12 Mar 2026
https://github.com/marielachirinosr/analysis-urgencias-hospital-pitalito
This project involves analyzing emergency room admission data from the E.S.E Hospital Departamental de Pitalito using a star schema model.
bigquery data data-analysis etl-pipeline tableau
Last synced: 21 Jan 2026
https://github.com/lightdash/quickstart-github
Instant analytics for Github
analytics business-intelligence data dbt github
Last synced: 14 Sep 2025
https://github.com/dhruvil-26/powerbi-projects
This repository contains Power BI projects showcasing data analysis and interactive dashboards. Each project includes detailed visualizations and insights on diverse topics such as loan analysis, sales performance, and customer behavior.
customer-behavior-analysis data data-analysis interactive-dashboards loan-analysis powerbi sales-performance visualization
Last synced: 04 Feb 2026
https://github.com/flyconnectome/hnf
Documentation for the hierarchical neuron format
annotations data dotprops hdf5 mesh neurons skeleton storage
Last synced: 17 Jan 2026
https://github.com/foundationallm/.github
A platform accelerating delivery of secure, trustworthy enterprise copilots.
agent ai data enterprise generative-ai large-language-model llm ml tool
Last synced: 12 Feb 2026
https://github.com/matt-dray/draytasets
:1234::disguised_face: Miscellaneous datasets I've collected or prepared
Last synced: 09 Feb 2026
https://github.com/bishtrishu/netflix_movies_dashboard
This project is a comprehensive dashboard for analyzing Netflix movies and shows. Using a combination of Power BI, Python, and Excel, this dashboard provides insights into various aspects of Netflix's content library.
ai artifical-intelligense dashboard data dataanalysis dataanalyst dataanalytics datacleaning datahandling datascience datavisualization excel machine-learning msexcel powerbi report
Last synced: 09 Feb 2026
https://github.com/dativo-io/dativo-ingest
big big-data data data-ingestion etl etl-framework gitops iceberg ingest nessie self-hosted
Last synced: 26 Feb 2026
https://github.com/sebastianhochreiter/sql-projects
business-intelligence data datascience microsoft microsoft-sql-server sql
Last synced: 22 Feb 2026
https://github.com/manishjanky/wrangle-weratedogs-dataset
A data wrangling project done ad part of Udacity DAND
data data-wrangling twitter udacity udacity-data-analyst-nanodegree udacity-nanodegree weratedogs
Last synced: 15 Apr 2026
https://github.com/samaalharbi2/project-recommendation-system
This project focuses on building a Recommendation System using real interaction data from IBM's Watson Studio platform.
clustering data ibm-watson kmeans nlp python rec svd udacity-nanodegree
Last synced: 09 Feb 2026
https://github.com/amethyst-php/catalogue-product
amethyst amethyst-catalogue-product api catalogue-product data laravel
Last synced: 20 May 2026
https://github.com/affan005-ai/tesla-stock-prediction
This project analyzes Tesla stock data and builds machine learning models to predict and classify stock movements. The analysis includes EDA, feature correlation, moving averages, and two models
data data-analysis data-science data-visualization-project eda machine-learning matplotlib pandas predictive-analytics predictive-modeling python scikit-learn
Last synced: 05 Oct 2025
https://github.com/bishtrishu/super_store_sales_dashboard
This repository contains a comprehensive sales analysis dashboard for a Superstore, created using Power BI. The objective is to contribute to the success of a business by utilizing data analysis technique, specially focusing on time series analysis, to provide valuable insights and accurate sales forecasting.
analytics data data-science dataanalysis dataanalyst datacleaning datascience datavisualization-project excel microsoft-azure microsoft-excel powerbi report sql
Last synced: 28 Feb 2026
https://github.com/suchi25sathavara/data-wrangling-with-r
Analyzing Road Accidents in Victoria, Australia
data r reporting rstudio wrangling-data
Last synced: 01 Apr 2025
https://github.com/metapsy-project/data-panic-psyctr
Database of psychotherapy for panic disorder compared to control conditions
Last synced: 18 Mar 2026
https://github.com/lananolana/test_data_generator
Generate test data with Telegram bot in one click: random users, files, texts and credit cards.
credit-card data data-generation fake-data random telegram-bot test-data test-data-generator test-file-generator testing testing-tools text-generation user-generator
Last synced: 18 Jan 2026
https://github.com/ludwing-mj/manipulacion_ej
Ejercicio utilizado en la seccion numero ocho del manual para ejemplificar las herramientas proporcionadas por el tydyverse para la manipulacion de datos.
data manipulate-data package r
Last synced: 01 Apr 2025
https://github.com/pathilink/ebury_case
Technical case study in Analytics Engineering using BigQuery, focusing on dimensional modeling and SQL queries for payment and client analysis.
Last synced: 05 Oct 2025
https://github.com/lukakerr/us-surnames
US Surname data visualisation using R. Displays top 25 US surnames and race/ethnic percentage per name.
Last synced: 05 Oct 2025
https://github.com/neurazum-ai-department/tumor-stages-dataset---v1
Synthetic MRI data generated by the ‘HF’ and 'Vbai' models based on real data.
brain data dataset datasets image mri neuroscience tumor tumor-segmentation
Last synced: 18 Mar 2026
https://github.com/shubhamsoni98/project_using_knn
This project applies the K-Nearest Neighbors (KNN) algorithm to predict iPhone purchases based on customer data. Using features like age, salary, and previous purchase behavior, the KNN model classifies customers into buyers and non-buyers.
anaconda analytics data data-science eda knn knn-classification machine-learning-algorithms predict project python scikit-learn tableau
Last synced: 03 Jan 2026
https://github.com/nicolaeiotu/dbindjs
Data Binding for Javascript
bind binding data data-binding databind dbind dbindjs javascript
Last synced: 09 Feb 2026
https://github.com/ludreinsalvador/global-covid-19-data-analysis
Contains Power BI dashboards that visualizes and analyzes global COVID-19 cases, deaths, and vaccination trends using data from the World Health Organization (WHO). The project aims to provide insights into the pandemic’s impact and vaccination progress worldwide through dynamic reports and advanced analytics.
analytics covid-19 covid19-data data data-analysis data-collection data-transformation data-visualization
Last synced: 26 Feb 2026
https://github.com/haroontrailblazer/machine_learning
About This Repository A curated resource hub for learning machine learning, featuring tutorials, code examples, datasets, and hands-on projects to build foundational skills and explore real-world applications.
data data-analysis data-visualization database dataset gradient-descent machine-learning pandas python3 random-forest sklearn statistics
Last synced: 16 Apr 2026
https://github.com/fnu-ankit/nyc_parking_violation
data dataengineering dbt githubactions python
Last synced: 16 Apr 2026
https://github.com/dysnomia-studio/achieve-games-dump
Dump parts of achieve.games database to public including Steam Games List
data dump games steam steam-api steam-game steam-games
Last synced: 27 Feb 2026
https://github.com/enescidem/twitter-topic-modeling
Topic modeling is an unsupervised method to identify topics in text. This project analyzes tweets from prominent Turkish accounts to uncover underlying themes in their shared content.
data data-science machine-learning nlp topic-modeling twitter x
Last synced: 10 Feb 2026
https://github.com/samhollings/nhs_data_cleansing
A repo of reusable functions for cleansing data
cleansing data data-cleaning data-cleansing preprocessing pyspark python python3
Last synced: 05 Oct 2025
https://github.com/ivinnyaraujo/sql-database-analyticsengineer-powerbi
SQL | SQL Spatial | Database | Microsoft Fabric | More
analytics data data-engineering data-science database microsoft-fabric sql sql-server
Last synced: 16 Apr 2026
https://github.com/irsol/udacity-data-foundations-nd
data data-analysis data-visualization exel sql udacity udacity-data udacity-nanodegree
Last synced: 05 Mar 2026
https://github.com/javdomgom/nifi-custom-processors
Apache NiFi custom processors
apache-nifi bigdata data data-engineering datascience flowfile nifi nifi-custom-processor
Last synced: 27 Feb 2026
https://github.com/robertoostenveld/bird
BagIt Research Data
bagit data fair open-datasets repository
Last synced: 18 Mar 2026
https://github.com/shubhamsoni98/prediction-with-binomial-logistic-regression
To predict client subscription to term deposits and optimize marketing strategies by identifying potential subscribers.
binomial data data-science eda machine-learning matplotlib pipeline python scikit-learn seaborn sklearn sql visualization
Last synced: 06 Feb 2026
https://github.com/andrewl/danelaw
Geopackage containing the boundary of the Danelaw
data geospatial medieval viking
Last synced: 23 Jan 2026
https://github.com/gabrieldim/complete-analysis-covid-19
Analysis of the Covid 19.
analysis covid-19 covid19 data data-science science virus
Last synced: 23 Jan 2026
https://github.com/athari22/analyzing-the-yelp-dataset
SQL for Data Science
analytics data data-science data-structures er sql
Last synced: 27 Jan 2026
https://github.com/sanand0/iss-location
Tracks the International Space Station position. A demo of how to use GitHub Actions to schedule commits weekly.
Last synced: 14 Feb 2026
https://github.com/namratha2301/sales-orders-analysis
Wanted to experiment with Looker. This dashboard visualizes sales trends across regions, customer segments, and product categories.
business-analytics dashboard data dataanalysis datavisualization excel looker looker-studio
Last synced: 13 Feb 2026
https://github.com/j-sephb-lt-n/data-warehouse-and-etl-best-practice
A catalogue of best practices for managing data
data data-cleaning data-engineering data-validation data-warehouse etl
Last synced: 23 Jan 2026
https://github.com/jigyasag18/bird-strikes-in-aviation-project
This project analyzes over a decade of U.S. bird strike data (2000–2011) to evaluate safety risks, damage trends, and cost implications in aviation. Using PostgreSQL for database management and Power BI for dashboard visualization, it uncovers critical insights into when, where, and how wildlife impacts aircraft. Key findings inform strategically.
bird-strike-prevention bird-strike-prevention-in-real-airport data data-analysis data-analysis-project data-visualisation data-visualization data-visualization-project data-visualizations database dataset dax-query postgresql postgresql-database powerbi powerbi-desktop powerbi-report powerbi-visuals sql sql-database
Last synced: 09 May 2026
https://github.com/softloud/spunk
Nutritional interventions for male infertility: a systematic review and meta-analysis
Last synced: 18 Mar 2026
https://github.com/eshan-sud/secureit
A Blockchain-based Data Sovereignty Platform
blockchain data decentralised-application platform sovereignty
Last synced: 21 Jan 2026