data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-06-23 00:07:41 UTC
- JSON Representation
https://github.com/courtois-neuromod/anat
Anatomical sub-dataset of Courtois-Neuromod project.
Last synced: 17 Jan 2026
https://github.com/gavinr/minor-league-baseball
Data set of minor league baseball teams
baseball data hacktoberfest maps open-data open-datasets sports
Last synced: 06 Apr 2025
https://github.com/sun-lab-nbb/sl-experiment
A Python library that provides tools to acquire, manage, and preprocess scientific data in the Sun (NeuroAI) lab.
ataraxis data experiment methods neuroscience
Last synced: 07 Mar 2026
https://github.com/bgmp/tesis-german-deuster
Datos estadísticos para tercería de una tésis
Last synced: 28 Mar 2025
https://github.com/unaygney/js-challenges-data-structures-and-algorithms
Repo of the challenges I'm trying to solve to understand data structures and algorithms..
algorithms-and-data-structures data javascript structure
Last synced: 29 Oct 2025
https://github.com/thealphadollar/messiah
Messiah: The Mighty Son Of God Is Here To Help You Through Times Of Calamity
azure backend data data-analysis flask frontend materialize natural-disasters
Last synced: 19 Jan 2026
https://github.com/laurensius/covid-19-info
data datatabel grafik peta visualisasi
Last synced: 21 May 2026
https://github.com/dataopstix/modelt
Modelt(mow·delt) is a modern data integration solution that connects data to data for advanced analytics.
airbyte airflow airflow-docker data data-analysis data-visualization database dbt elt etl etl-automation metabase metadata modern modern-dev modernization
Last synced: 28 Mar 2025
https://github.com/mohammadkarbalaee/introduction-to-data-science-sbu
Reports and full documentation of the introduction to data science course held at SBU
data data-science python shahid-beheshti-university
Last synced: 27 Mar 2025
https://github.com/enricocid/monitoraggio-vaccini-italia
Sito web statico per github.com/apalladi/covid_vaccini_monitoraggio
covid-19 covid-19-data covid-19-data-analysis data data-analysis data-visualization dataset python python3 python37 sars-cov-2
Last synced: 09 Sep 2025
https://github.com/inspect-js/is-data-view
Is this value a JS DataView? This module works cross-realm/iframe, does not depend on instanceof or mutable properties, and despite ES6 Symbol.toStringTag.
data dataview ecmascript javascript typedarray typedarrays view
Last synced: 05 Apr 2025
https://github.com/headless-start/data-augmentation-impact
This repository contains effect of Data Augmentation of Training Set during Model Training.
augmented-images cuda data gpu keras matplotlib mnist opencv-python python3 tensorflow training-data
Last synced: 05 Apr 2026
https://github.com/heikomuller/histore
Library for maintaining snapshots of evolving tabular data sets
Last synced: 10 Apr 2025
https://github.com/real-veersandhu/scifaa-covid-19-project
📈 COVID-19 Data Science Project (2021 Internship @ SCI-FAA)
covid-19 data data-science data-visualization python
Last synced: 14 May 2026
https://github.com/andrewjbateman/mevn-stack-data
:clipboard: MEVN Info & Full stack MEVN app with CRUD functions
data database express expressjs full-stack info mevn mevn-stack middleware mongodb mongodb-atlas nodejs typescript vue vue3 vue3-typescript
Last synced: 07 Apr 2026
https://github.com/johnmackintosh/simd2016_tmap
Mapping SIMD with tmap - static & interactive
data data-science data-visualization mapping r visualisation
Last synced: 20 Mar 2025
https://github.com/cheminfo/cheminfo-types
chemistry data hacktoberfest schema typescript
Last synced: 03 Apr 2026
https://github.com/zarr-developers/cookiecutter-zarr-store
Cookiecutter for Zarr store implementations
chunked data n-dimensional zarr
Last synced: 16 Jun 2025
https://github.com/meetyildiz/pandazip
Reduce RAM footprint of Pandas DataFrame without losing information.
compression data data-mining data-science machine-learning pandas utilities
Last synced: 20 Jan 2026
https://github.com/sheweny/discord-resolve
This module groups together functions to retrieve data from different types of arguments.
data discord discord-js mentions resolver sheweny utility
Last synced: 29 Oct 2025
https://github.com/helixspiral/ndbc
Golang wrapper for the National Data Buoy Center (NDBC)
data data-science golang government-data ndbc ndbc-buoy-data noaa noaa-api noaa-buoys noaa-data noaa-weather wrapper wrapper-api wrapper-library
Last synced: 14 Jun 2025
https://github.com/simoneas02/data-science
🐍 A planning study to become a data scientist and to improve my current skills. 🤘🏼🌻
data data-analysis data-science data-visualization deep-learning machine-learning pandas python3 r sql
Last synced: 12 Apr 2026
https://github.com/zenwor/table_editor
A simple table data editor, with easily scalable functions and operations & a nice GUI
data data-science formula java parser parsing preprocessing swing tokenizer
Last synced: 22 Jun 2025
https://github.com/writetome51/big-dataset-paginator
A TypeScript/JavaScript class for pagination in a real-world web app.
app data javascript pagination paginator typescript
Last synced: 17 May 2026
https://github.com/kabeech/real-dice
Random number generation based on physical media touched by humans
data dice haskell human-computer-interaction physical random random-generation random-number-generators rng
Last synced: 10 Apr 2025
https://github.com/felixklauke/atomizer
Playing around with butter knife, android bindings and rx java.
binding butterknife data java react rx rxjava
Last synced: 15 May 2026
https://github.com/antvis/create-antv-demo
A simple CV-dashboard framework for practicing how to use AntV.
antv cv dashboard data resume resume-template resume-website visualization
Last synced: 09 Apr 2025
https://github.com/royruddle/vizdataquality
Python package for visualizing data quality
data data-science data-visualization jupyter-notebook missing-data python
Last synced: 05 May 2025
https://github.com/justjavac/deno_open_data
data deno deno-mod deno-module denomod
Last synced: 17 May 2026
https://github.com/max-tonny8/android_web3
This is a library for Android to call data from Node on Ethereum Chain or Solana Chain
android blockchain coroutines coroutines-android data eth-call ethereum kotlin ktx retrofit rpc smart-contracts solana web3 web3j
Last synced: 27 Mar 2025
https://github.com/maskedsyntax/covid-tracker
Qt app to keep a track of Covid-19 records of different countries.
coronavirus coronavirus-tracking covid-19 data parsing scraping scraping-websites tracker web-scraping
Last synced: 29 Mar 2025
https://github.com/shahiakhilesh1304/dsa
This repository is my personal space for practicing Data Structures and Algorithms. I'll be adding questions, solutions, and explanations as I progress, covering topics from basics to advanced.
algorithms code contribution contributions-and-collaboration contributions-welcome data data-structures data-structures-and-algorithms dsa dsa-algorithm dsa-learning-series dsa-practice hacktober hacktoberfest hacktoberfest-accepted hacktoberfest-starter practice programming python-3
Last synced: 13 Apr 2025
https://github.com/stdlib-js/array-typed-integer-ctors
Integer-valued typed array constructors.
array constructor constructors ctor ctors data dtype dtypes javascript node node-js nodejs stdlib structure type typed typed-array types utilities
Last synced: 12 Jan 2026
https://github.com/nikoshet/exploratory-data-analysis-using-r
Exploratory Data Analysis using R Course Project for M.Sc. 'Data Science and Machine Learning' in NTUA
data data-analysis data-science eda exploratory-data-analysis ggplot2 r
Last synced: 14 May 2026
https://github.com/eliot-akira/png-compressor
Compress and encode data as PNG image
Last synced: 17 Mar 2025
https://github.com/rahulraikwar00/advault
Advault is a adhaar data vault generation tool
aadhaar data hacktoberfest uidai vault
Last synced: 05 Apr 2025
https://github.com/antoineaugusti/purchasing-power
Archive daily data about purchasing power parity: how much goods should cost in various countries
archive data purchasing-power-parity
Last synced: 28 Oct 2025
https://github.com/imadsaddik/bodmaghdataset
BoDmagh dataset is a Supervised Fine-Tuning (SFT) dataset for the Darija language
arabic-llm arabic-nlp darija-llm darija-nlp data dataset fine-tuning llm nlp sft
Last synced: 03 Apr 2025
https://github.com/kale-ko/ejcl
An advanced configuration library for Java with support for local files, mysql databases, and more.
config configs configuration configuration-files data java java-serialization json mariadb mysql parser serialization serializer smile yaml
Last synced: 02 Feb 2026
https://github.com/mickeyshi-syd/actuarial-hackathon-2019
2019 Actuarial Hackathon
actuarial actuaries analytics data data-science hackathon
Last synced: 15 Jul 2025
https://github.com/sharmadhiraj/free-json-datasets
Collection of free JSON data that are scraped and parsed from different websites.
collection crawler data data-scraping datasets json sports statistics web-scraping
Last synced: 28 Mar 2025
https://github.com/mostafanabieh/image-classification-with-data-augmentation
Project for Data augmentation with tensorflow v2
data deep-learning image-classification machine-learning tensorflow tensorflow2
Last synced: 07 May 2026
https://github.com/stdlib-js/ndarray-base-buffer-ctors
ndarray data buffer constructors.
array base buffer constructor constructors ctor ctors data dtype dtypes javascript multidimensional ndarray node node-js nodejs stdlib types utilities utility
Last synced: 06 Apr 2025
https://github.com/reppon97/cryptosnake
Simple, unofficial python wrapper for Binance API. You'll find this easy-to-use package helpful if you're interested in general market data and cryptocurrency values. You don't need to have a Binance account or API Key since you can't purchase/trade cryptocurrencies using this package.
api binance bitcoin crypto cryptocurrencies cryptocurrency cryptocurrency-exchanges cryptography data ethereum json litecoin python python3 statistics
Last synced: 05 Mar 2025
https://github.com/thejeshgn/thejeshgn
data data-visualization datameet india opendata public-interest
Last synced: 15 Jan 2026
https://github.com/victoorv/breast_cancer
Mammographic images classification.
breast-cancer breast-cancer-classification classification cnn cnn-classification convnext convnext-tiny convolutional-neural-networks data data-science data-visualization feature-tuning image image-classification mammogram-images mammographic-images neural-network resnet-50 resnet50 transfer-learning
Last synced: 27 Jan 2026
https://github.com/mzazakeith/puppetmaster
Puppeteer & Crawl4AI microservice for web automation, scraping, and AI processing with Bull queues
agent ai automation bull bullmq chrome crawl4ai crawler data data-extraction extraction gemini llm llms openai playwright puppeteer web-automation
Last synced: 13 May 2025
https://github.com/danlsn/causality
A Personal Data Platform and the culmination of years of curiosity and learning in the Data Engineering space.
data data-engineering datawarehousing personal-data quantified-self
Last synced: 06 Mar 2026
https://github.com/d2hydro/hydrodashboards
Open Source Dashboards for hydro data
bokeh data geopandas hydrology hydrometrics pandas sheetjs
Last synced: 26 Jul 2025
https://github.com/wpp-public/akqa-nz-tagmanager-connector
A simple javascript library to send events to a tag manager container
Last synced: 05 Apr 2025
https://github.com/ange007/jquery.mydata
jQuery.myData - Small jQuery&Zepto plugin for two-ways data binding.
data data-binding jquery jquery-plugin zepto zepto-plugin zeptojs
Last synced: 19 May 2026
https://github.com/rrighart/rrighart.github.io
A webpage about data science, programming, statistics and related topics
analyses data data-mining programming statistics
Last synced: 20 Jan 2026
https://github.com/marlenezw/speech-to-text
Turn any video or audio recording into a written transcript using python
data data-science python speech speech-recognition speech-synthesis speech-to-text
Last synced: 27 Apr 2026
https://github.com/pjmagee/starwars-data
A Star Wars Web app with Charts and entire Timeline events!
aspire blazor blazor-webassembly data database dataset docker dotnet json starwars starwars-data starwars-fandom
Last synced: 07 Mar 2026
https://github.com/ctechhindi/auto-fill-form-data
AUTO FILL AND AUTOCOMPLETE USER DATA WITH KEY NAME
autocomplete chrome-extension data extension
Last synced: 17 Apr 2026
https://github.com/emrecpp/datapacket-cpp
Send, recv, encrypt, decrypt, compress data as Packet and send it with socket for C++.
compress data deserialization deserialize deserializer encrypt packet recv send serialization serialize serializer socket storage
Last synced: 28 Mar 2025
https://github.com/mongoexpuser/synthetic-drilling-data-app-for-sqlite-ml
Generate synthetic drilling data that can be used for testing machine learning (ML) models.
classification data drilling events inference machine-learning ml-models node-sqlite3 nodejs prediction python sqlite3 training
Last synced: 08 Apr 2026
https://github.com/dhruvldrp9/simpledht
A Python-based Distributed Hash Table (DHT) implementation enabling cross-network key-value storage, automatic node discovery, and data replication with a simple CLI and library interface.
cross-network-node-communation data data-replication data-synchronization dht dht-python distributed-hash-table key-value-storage nat netowork node-discovery peer-to-peer peer-to-peer-network python sha-256 simple udp udp-socket-communication
Last synced: 28 Feb 2026
https://github.com/jesusgraterol/bitcoin-lightning-network-stats-dataset-builder
The dataset builder script extracts Bitcoin's Lightnining Network statistics through Mempool.space's public API. The data is stored in a .csv file, facilitating its use in data science and machine learning projects.
bitcoin blockchain blockchain-technology data data-science dataset dataset-generation lightning-network machine-learning
Last synced: 16 May 2026
https://github.com/longzheng/southeastwater-usage-scraper
Extract hourly water usage data from South East Water portal website for digital water meters
australia data iot playwright southeastwater victoria water
Last synced: 06 Feb 2026
https://github.com/karlinarayberinger/karbytes_23andme_ancestry_data
This repository contains web page source code and media files which constitute (some of) the intellectual property encapsulated by the website named Karbytes For Life Blog dot WordPress dot Com.
2023 23andme ancestry blog data dna genome karbytes website wordpress
Last synced: 12 May 2025
https://github.com/Ekey/ER.DATA.Tool
Tool for extract data archives from mobile game Earth Revival (Project Arrival)
data earth-revival idx project-arrival
Last synced: 19 May 2026
https://github.com/alexandregazagnes/scikit-res
Very Basic package to store results of ML models Grid search results are hard to exploit. This package aims to store them in a more convenient way.
data machine-learning mlops mlops-workflow results scikit-learn
Last synced: 20 Jan 2026
https://github.com/samboycoding/hungergames-data
data hunger-games javascript json
Last synced: 15 May 2026
https://github.com/minightdev/paperclip
Paperclip is a powerful privacy-focused data breach search engine that empowers users to swiftly and securely investigate breaches using email addresses and phone numbers. Our robust search engine delivers real-time results while prioritizing the privacy and security of user queries.
beaches data database pwn pwned search-engine
Last synced: 22 Mar 2025
https://github.com/joisino/twinpaper
Code for "Twin Papers: A Simple Framework of Causal Inference for Citations via Coupling" (CIKM 2022)
causal-inference data research science-of-science
Last synced: 21 Mar 2025
https://github.com/stdlib-js/datasets-suthaharan-single-hop-sensor-network
Labeled wireless sensor network data set collected from a simple single-hop wireless sensor network deployment using TelosB motes.
data dataset datasets javascript labeled machine-learning ml mote motes network node node-js nodejs outlier outliers sample sensor statistics stats stdlib
Last synced: 03 Mar 2025
https://github.com/sermetpekin/evdscpp
evdscpp is a C++ library for fast, efficient, and user-friendly interaction with the EVDS API Server. Designed with performance in mind, it provides built-in caching, an Excel export option, and an intuitive user interface for configuring and retrieving data. evdscpp can be extended for integration with other C++ projects and offers options for use
cbrt central-bank cpp data edds evds evds-api evdscpp tcmb tcmb-api
Last synced: 07 Sep 2025
https://github.com/hariprashad-ravikumar/ai-datascience-lab
AI‑DataScience‑Lab is a web app for uploading CSV datasets, cleaning with Pandas, and running quick exploratory analyses and regression models using scikit‑learn. Its modular design supports future AI extensions, like deep learning with TensorFlow or insight generation via the OpenAI API.
ai api azure cloudcomputing data data-analysis data-science data-visualization mathplotlib numpy openai pandas python scikit-learn
Last synced: 02 Aug 2025
https://github.com/jongirard/unique_names_generator
A Unique Names Generator built in Elixir
data data-generator elixir elixir-lang fake-data name-generator phoenix seed
Last synced: 21 Oct 2025
https://github.com/jaffarabbas/library-management-system-in-java-
GUI base + Database functionality
data database datastructures-algorithms dbms gson java javafx javafx-application javafx-desktop-apps javamail library-management-system mysql sql xammp
Last synced: 05 May 2026
https://github.com/jinsyin/datalink
⚡ 数据集成 | DataLink is a lightweight data integration framework build on top of DataX, Spark and Flink
batch big-data bigdata cdc data data-collection data-exchange data-integration data-pipeline data-synchronization datalink etl flink flink-cdc framework integration pipeline spark streaming
Last synced: 19 Jul 2025
https://github.com/peterdavehello/nrd-list-archive
🌐📂 A collection of past NRD lists to explore—perfect for fun, research, or just plain curiosity! 🎉🔍✨
Last synced: 17 Mar 2026
https://github.com/plabayo/datapoints.earth
Earth data liberation for and by its citizens.
Last synced: 15 Mar 2026
https://github.com/agnosticeng/agx
Query and explore local and remote data with Clickhouse
clickhouse d3 data rust svelte
Last synced: 26 Oct 2025
https://github.com/ymougenel/referencecollector
Helps you gather, store and share references links
ansible data docker keycloak kotlin spring-boot thymeleaf
Last synced: 14 Apr 2026
https://github.com/stdlib-js/array-int32
Int32Array.
array data int int32 int32array integer javascript long node node-js nodejs signed stdlib structure typed typed-array types
Last synced: 27 May 2026
https://github.com/rastmob/wordpress-llms-output-plugin
A WordPress plugin to export posts, pages, and custom post types as JSON for training Language Models (LLMs).
ai data llm llms training training-data wordpress wordpress-development wordpress-plugin
Last synced: 03 May 2026
https://github.com/kuraydev/react-native-modal-data-passing
Seamless Data Passing to React-Native-Modal
android apple data data-passing google hook ios modal react react-native react-native-modal useimperativehandle
Last synced: 18 Apr 2026
https://github.com/ivangrigorov/neutrino-search-engine
Creating Java search engine both for HTML or document type of files
data data-analysis data-knowledge information-extraction information-retrieval java-language search-engine
Last synced: 31 Mar 2025
https://github.com/junkwaxhero/cardlists
Sports Card set lists in easily consumable JSON Format for databases, apps, websites, and more!
baseball baseball-cards baseball-data bowman data dataset datasets donruss fleer json json-schema panini topps upper-deck
Last synced: 24 Apr 2025
https://github.com/nafisalawalidris/advanced-fraud-detection-with-anomaly-detection
This repository demonstrates how to build a robust fraud detection system that combines supervised learning techniques with anomaly detection models. It provides end-to-end implementation, from data preprocessing and model training to deploying a real-time fraud detection API using FastAPI.
anomaly-detection creditcardfrauddetection data dataanalytics fastapi fraud-detection machinelearning modeldeployment python supervised-machine-learning unsupervised-machine-learning
Last synced: 20 Apr 2026
https://github.com/stdlib-js/array-typed-float-ctors
Floating-point typed array constructors.
array constructor constructors ctor ctors data dtype dtypes javascript node node-js nodejs stdlib structure type typed typed-array types utilities
Last synced: 24 Apr 2025
https://github.com/lindsaygelle/emojipedia
Go application. Simple program that scrapes unicode.org for Emoji content. Parses out HTML into categorically ordered data subsets. Explored from the command line.
cli data data-mining emoji emojipedia encyclopedia go golang golang-application html-scraping unicode-characters
Last synced: 11 Mar 2026
https://github.com/mahmoud-saeed-mahmoud/loading_state_handler
The StateHandlerWidget manages different UI states—loading, error, empty, and normal—allowing you to customize the displayed widgets for each state.
dart data error flutter flutter-package flutter-widget loading state
Last synced: 10 Mar 2026
https://github.com/luminati-io/Pinterest-dataset-samples
Two sample datasets of over 1000 Pinterest profiles and posts, extracted using the Bright Data API, ideal for market research, influencer marketing, and product development.
data data-extraction data-mining database datasets pinterest pinterest-api structured-data web-scraping
Last synced: 09 Apr 2025
https://github.com/1sumer/sql
This repository contains SQL scripts and data for various analytical and database management tasks. The project is designed to demonstrate SQL capabilities in handling complex queries, data analysis, and database design. It includes datasets related to e-commerce and streaming services, with a focus on real-world scenarios and use cases.
analytics data data-analysis data-storage sql vscode
Last synced: 19 Jan 2026
https://github.com/evoluteur/web-scraper-sitemaps
Sitemaps for the Web Scraper Chrome extension.
chrome-extension data dataset scraper scraping scrapper scrapping scrapy-crawler sitemap web-scraper web-scraping
Last synced: 04 Jun 2026
https://github.com/wamphlett/input-collection
A smarter and stricter way to capture and validate request data
Last synced: 27 May 2026