data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-01-24 00:07:28 UTC
- JSON Representation
https://github.com/nomihq/nomi
Nomi enable people to use computer more simply.
action ai automation data devtools llm multimodal privacy productivity realtime reinforcement-learning security taskrunner voice
Last synced: 06 Apr 2025
https://github.com/zhangyoujia/hd_write_verify
LBA tools(hd_write_verify & hd_write_verify_dump) are very useful for testing Storage stability and verifying DATA consistency, there are much better than FIO & vdbench's verifying functions. for example: physical disk: ide/sata/scsi/ssd/iscsi/fc/raid/...; virtual disk: loop/nbd/lvm/soft raid/...; VM disk: ide/sata/scsi/virtio-blk/virtio-scsi/...;
consistency data filesystem migration physical-disk snapshot stability storage testing verifying virtual-disk vm-backup vm-disk
Last synced: 26 Jan 2026
https://github.com/cxmeel/sift
Immutable data library for Luau.
conversion data dictionary immutability immutable luau roact roblox roblox-lua roblox-rojo robloxdev robloxlua rojo typescript utility wally
Last synced: 21 Jan 2026
https://github.com/r3nt0n/wiper
Secure destruction of sensitive virtual data, temporary files and swap partitions
cyber-security cyberpunk data data-destruction delete delete-files destruction hacking-tool python python-script python3 secure secure-delete shred shredder shredding-algorithms shredding-files wipe-disk wipe-files wipe-out
Last synced: 18 Jan 2026
https://github.com/spine-tools/spine-toolbox
Spine Toolbox is an open source Python package to manage data, scenarios and workflows for modelling and simulation. You can have your local workflow, but work as a team through version control and SQL databases.
anaconda data energy miniconda python simulation-model spine-toolbox workflow
Last synced: 04 Apr 2025
https://github.com/hi-folks/data-block
PHP Package for handling, querying, filtering, and setting nested data structures
data data-structure hacktoberfest json-data php
Last synced: 04 Jan 2026
https://github.com/virtadpt/exocortex-halo
Various and sundry additional pieces of software I've written to incorporate into my exocortex.
bots conversation data exocortex interactive parser python rest-api
Last synced: 09 Apr 2025
https://github.com/glynnbird/countriesgeojson
Countries of the world as GeoJSON
data geography geojson geospatial json
Last synced: 21 Mar 2025
https://github.com/dkandalov/activity-tracker
Plugin for IntelliJ IDEs to track and record user activity
Last synced: 11 Sep 2025
https://github.com/anish-agnihotri/blog-effective-nft-launches-data
Data+code for NFT launch guide blogpost.
data exploiting fairness launches nft
Last synced: 14 Apr 2025
https://github.com/9b/netinfo
Simple IP enrichment service and API wrapping PyASN and MaxMind GeoIP.
cybersecurity data devops enrichment ip-address-lookup network osint python3 webservice
Last synced: 25 Jan 2026
https://github.com/trevorhobenshield/amazon_photos
Amazon Photos API
amazon amazon-photos api archive automation data
Last synced: 01 Aug 2025
https://github.com/ylem-co/ylem
Ylem is an open-source platform for real-time data streaming orchestration
data data-visualization dataorchestration etl etl-framework etl-pipeline ide ingestion orchestration pipelines processing real-time reverse-etl scheduler streaming streaming-data transformation workflows
Last synced: 12 Jan 2026
https://github.com/stephanakkerman/tensortrade-extras
Discover a curated list of projects complementing TensorTrade, distinct from those mentioned in its official documentation. Contributions are welcome; if you spot a missing project, please submit a pull request!
crypto cryptocurrency cryptocurrency-trading-bot data finance live-trading openai-gym reinforcement-learning reinforcement-learning-agent stock-trading stocks technical-analysis tensortrade
Last synced: 18 Mar 2025
https://github.com/joelgmsec/invoke-transfer
PowerShell Clipboard Data Transfer
base64 citrix clipboard data guacamole keystrokes ocr powershell rdp transfer
Last synced: 08 Sep 2025
https://github.com/stonecypher/flocks.js
A radically simpler alternative to Flux - opinionated React state and rendering management
data flux javascript js layer react react-js reactjs redux screw-flux simple simplicity
Last synced: 02 May 2025
https://github.com/packtworkshops/the-data-analysis-workshop
A New Interactive Approach to Learning Data Analysis
bivariate-analysis clustering data data-analysis exploratory-data-analysis feature-engineering hypothesis-testing kolmogorov-smirnov-tests linear-regression logistic-regression pandas-profiling python univariate-analysis visual-analysis visualizations
Last synced: 04 Sep 2025
https://github.com/anandchowdhary/bookshelf-action
📚 Track your reading using GitHub Actions
api books data generator github-actions reading tracker
Last synced: 05 Apr 2025
https://github.com/alibaba/dimbin
High-performance serialization for multi-dimension arrays 海量数据高性能序列化方案
binary csharp data serialization typescript
Last synced: 14 Oct 2025
https://github.com/packtworkshops/the-applied-sql-data-analytics-workshop
A Quick, Interactive Approach to Learning Analytics with SQL
aggregate-functions analysis copy data data-analytics database excel exporting group-by hash-scan having importing joins killing-queries performant python3 sequential-scan sql sqlalchemy window-function
Last synced: 16 Apr 2025
https://github.com/rishit-dagli/cppe-dataset
Code for our paper CPPE - 5 (Medical Personal Protective Equipment), a new challenging object detection dataset
artificial-intelligence computer-vision cppe5 data dataset deep-learning machine-learning models object-detection pretrained-models pytorch tensorflow vision
Last synced: 15 Jun 2025
https://github.com/chuongmep/aps-toolkit
An Libray Unlock BIM Data With Autodesk Platform Services
acc ai aps aps-toolkit autodesk-docs autodesk-forge automation big-data bim bim360 data data-analyst data-science database forge llm
Last synced: 14 Apr 2025
https://github.com/publici/us-polling-places
Standardized data on historical general election polling places in the United States.
data elections open-data open-elections polling-locations polling-places
Last synced: 03 Jul 2025
https://github.com/purarue/HPI
Human Programming Interface - a way to unify, access and interact with all of my personal data [my modules]
data gdpr history lifelogging personal-api quantified-self
Last synced: 23 Oct 2025
https://github.com/kirankunigiri/Apple-Family
A simple framework that brings Apple devices together - like a family
bluetooth connectivitiy data ios macos usb wifi
Last synced: 30 Jul 2025
https://github.com/AnandChowdhary/bookshelf-action
📚 Track your reading using GitHub Actions
api books data generator github-actions reading tracker
Last synced: 18 Apr 2025
https://github.com/enviodev/hyperindex
📖 Blazing-fast multi-chain indexer
blockchain dapp data envio evm framework fuel indexer rescript
Last synced: 04 Apr 2025
https://github.com/visual-layer/visuallayer
Simplify Your Visual Data Ops. Find and visualize issues with your computer vision datasets such as duplicates, anomalies, data leakage, mislabels and others.
cleaning computer computer-vision data data-science dataset datasets-preparation generative machine-learning python vision
Last synced: 19 Apr 2025
https://github.com/kirankunigiri/apple-family
A simple framework that brings Apple devices together - like a family
bluetooth connectivitiy data ios macos usb wifi
Last synced: 15 Sep 2025
https://github.com/tensorflow/tfjs-data
Simple APIs to load and prepare data for use in machine learning models
data deep-learning javascript machine-learning neural-network tensorflow tfjs
Last synced: 30 Sep 2025
https://github.com/doodlewind/bumpover
🚧 Async data transforming with simple rules.
data json markup migration tree validation xml
Last synced: 11 Sep 2025
https://github.com/0xplaygrounds/subgrounds
An intuitive Python library for interfacing with subgraphs and GraphQL
analytics blockchain data decentralized graph graphql pandas python subgraphs substreams
Last synced: 12 Sep 2025
https://github.com/malloydata/malloy-composer
Malloy Composer is a simple application to build dashboards or run ad-hoc queries using an existing Malloy model
business-analytics business-intelligence data data-modeling data-visualization malloy semantic-modeling
Last synced: 08 May 2025
https://github.com/ArtesiaWater/hydropandas
Module for loading observation data into custom DataFrames
data groundwater hydrology observations pandas timeseries
Last synced: 20 Jul 2025
https://github.com/javierbyte/canvas-image-utils
Utils for image data.
base64 canvas data image image-resizer javascript matrix tools utils
Last synced: 22 Apr 2025
https://github.com/matheusfelipeog/worldometer
Get live, population, geography, projected, and historical data from around the world 🌍
api data historical historical-data live livedata metrics mit-license projected pypi python scraping world worldometer worldometer-api worldometer-scraping worldometers
Last synced: 12 Apr 2025
https://github.com/noahgift/data-engineering-and-dataops
Duke MIDS: Data Engineering and DataOps Course
book cloud course data data-science dataengineering dataops duke mlops software-engineering
Last synced: 03 Sep 2025
https://github.com/0xPlaygrounds/subgrounds
An intuitive Python library for interfacing with subgraphs and GraphQL
analytics blockchain data decentralized graph graphql pandas python subgraphs substreams
Last synced: 09 May 2025
https://github.com/artesiawater/hydropandas
Module for loading observation data into custom DataFrames
data groundwater hydrology observations pandas timeseries
Last synced: 26 Jun 2025
https://github.com/ngnjs/ngn
A systems development platform. A revamped portal is coming soon at:
browser data data-model deno event-driven eventemitter javascript middleware ngn nodejs queue systems web
Last synced: 16 May 2025
https://github.com/paintedbicycle/sketch-data-faker
A Sketch plugin providing 130+ types of smart placeholder content for your mockups from Faker.js and other sources.
api content data fakerjs javascript json mock-data plugin sketch sketch-plugin sketchapp
Last synced: 26 Jun 2025
https://github.com/pedrokehl/caminho
Tool for creating efficient data pipelines in a JavaScript environment
backpressure concurency data dataprocessing functional javascript parallel pipeline reactive typescript
Last synced: 06 Apr 2025
https://github.com/whitfin/jen
A fast utility to generate fake/test documents based on a template
data dataset generator json template templating
Last synced: 22 Aug 2025
https://github.com/octoenergy/timeserio
Better `keras` models for time series and beyond
Last synced: 24 Jun 2025
https://github.com/nom-tam-fits/nom-tam-fits
A full featured 100% Java library for reading and writing FITS files
astronomy data fits fits-files fits-image fitsio java open-source scientific
Last synced: 11 Jan 2026
https://github.com/greenelab/adage
Data and code related to the paper "ADAGE-Based Integration of Publicly Available Pseudomonas aeruginosa..." Jie Tan, et al · mSystems · 2016
autoencoders data dataset denoising-autoencoders gene-expression machine-learning manuscript methodology neural-networks paper pseudomonas-aeruginosa research supplement
Last synced: 09 Oct 2025
https://github.com/alannikos/edg4llm
A unified tool to generate fine-tuning datasets for LLMs, including questions, answers, and dialogues. ✨🤖📚💬
chagpt chatglm data deepseek fine-tuning generation internlm llm
Last synced: 14 Jan 2026
https://github.com/altaurog/pdfforms
Populate fillable pdf forms from csv data file
Last synced: 12 May 2025
https://github.com/disease-sh/node-api
A JavaScript API Wrapper for NovelCOVID/API
coronavirus data javascript library open-source wrapper
Last synced: 21 Apr 2025
https://github.com/chartshq/datamodel
Relational algebra compliant in memory tabular data store.
data datagrid datamodel datatable datatables javascript relational-algebra rust rust-language schema tabular-data wasm webassembly
Last synced: 04 Oct 2025
https://github.com/514-labs/moose
The developer framework for your data & analytics stack
analytics data dataengineering deployment framework insights metrics python rust typescript
Last synced: 05 Apr 2025
https://github.com/rolyatmax/citibike-trips
Visualizing citibike trips with webgl
citibike data javascript regl visualization webgl
Last synced: 14 Apr 2025
https://github.com/hswick/jutsu
Graphing tool for Clojure built with the web and interactivity in mind
clojure data plotlyjs visualization
Last synced: 09 Sep 2025
https://github.com/micromata/generator-http-fake-backend
Yeoman generator for building a fake backend by providing the content of JSON files or JavaScript objects through configurable routes.
api backend data fake fake-data http http-server json mock mocking mocking-server mocks node nodejs rest rest-api restful restful-api yeoman yeoman-generator
Last synced: 14 Jan 2026
https://github.com/bluzi/name-db
:rocket: A multilingual collection of names from around the world
data language names translations
Last synced: 09 Oct 2025
https://github.com/mkearney/kaggler
🏁 API client for Kaggle
api-client api-wrapper data data-api kaggle kaggle-api kaggle-competition machine-learning mkearney-dataset mkearney-r-package neural-networks r r-package rstats
Last synced: 30 Oct 2025
https://github.com/z3z1ma/alto
Alto is a versatile data integration tool that allows you to easily run Singer plugins, build and cache PEX files encapsulating those plugins, and create a data reservoir whereby you can extract once and replay to as many destinations as you want.
data data-pipeline etl meltano singer
Last synced: 17 Mar 2025
https://github.com/fillmula/jsonclasses
🌎 The Modern Declarative Data Flow Framework for the AI Empowered Generation.
Last synced: 07 Sep 2025
https://github.com/noopeeks/datanvim
A fully-featured batteries-included Neovim distribution for the world of Data Science. Prepared to run code and interact with Jupyter Notebooks without ever leaving your terminal.
data data-science distribution jupyter-notebook machine-learning neovim nvim nvim-config text-editor vim
Last synced: 06 Oct 2025
https://github.com/unsw-ceem/nemosis
NEMOSIS: NEM Open-source information service. A Python package for downloading historical data published by the Australian Energy Market Operator (AEMO)
aemo australia data energy national-electricity-market nem nemweb python
Last synced: 04 Apr 2025
https://github.com/nikitastupin/orgs-data
Mapping from bug bounty and vulnerability disclosure programs to respective GitHub organizations
bug-bounty data github reconnaissance vulnerability-disclosure
Last synced: 09 Apr 2025
https://github.com/queryverse/excelreaders.jl
ExcelReaders is a package that provides functionality to read Excel files.
Last synced: 12 Apr 2025
https://github.com/geirolz/fly4s
A lightweight, simple and functional wrapper of Flyway using cats effect.
cats cats-effect data database database-migrations db flyway flyway-migrations flywaydb functional-programming persistence scala
Last synced: 11 Apr 2025
https://github.com/fityannugroho/idn-area-data
Provides the administrative areas data of Indonesia based on the latest official sources 🇮🇩
data data-sources hacktoberfest idn-area indonesia island javascript open-data pulau wilayah wilayah-indonesia
Last synced: 06 Apr 2025
https://github.com/kristoferjoseph/redeux
Minimal unidirectional data flow utility library
data flux reducer store unidirectional utility
Last synced: 06 May 2025
https://github.com/idouble/pandas-python-data-analysis-playground
🐍 Data Analysis with the Pandas Library & Notes 📊📈
analysis csv csv-files data data-analysis data-science data-visualization dataframe examples library pandas pandas-dataframe pandas-library pandas-python python
Last synced: 10 Sep 2025
https://github.com/leonawicz/rtrek
R package for Star Trek datasets and related R functions.
data r-package stapi star-trek
Last synced: 06 Sep 2025
https://github.com/r3dhulk/python-for-ethical-hacking
Build tools for hacking ethically using python.
ceh cehv10 cehv11 cyber-security cybersecurity data ethical ethical-hacking ethical-hacking-tools hackerrank hacking pentest pentest-tool pentesting pentesting-tools python python-for-ethical-hacker python-for-everybody python3 security
Last synced: 11 Jul 2025
https://github.com/worldbank/llm4data
LLM4Data is a Python library designed to facilitate the application of large language models (LLMs) and artificial intelligence for development data and knowledge discovery.
ai data development-data gpt gpt-4 indicators llm llm4data metadata sql wdi world-development-indicators
Last synced: 07 Apr 2025
https://github.com/hadley/neiss
Data from National Electronic Injury Surveillance System
Last synced: 19 Oct 2025
https://github.com/digao-dalpiaz/dztalkapp
Delphi non-visual component to communicate between applications
applications communication component data delphi pascal
Last synced: 12 Jul 2025
https://github.com/dxxxxy/EssentialCosmeticsUnlocker
Client-side only patch that allows you to unlock ALL cosmetics (+ emotes) in the Essential mod. Works on every version of Essential MC (1.8.9 - 1.20.6).
client compatibility config cosmetics data dump emotes essential fabric forge free inject minecraft mixin mod patch side universal unlocker
Last synced: 09 Aug 2025
https://github.com/kevinwang15/treebox
an interactive TreeMap visualization - Please star if you like this project
canvas canvas2d data javascript treemap visualization
Last synced: 13 May 2025
https://github.com/turbot/steampipe-sqlite
Steampipe SQLite is a zero-ETL engine for SQLite. Virtual tables translate queries into live API calls for cloud services and APIs. Hundreds of plugins with thousands of documented examples.
aws azure data devsecops etl gcp golang kubernetes security sql sqlite steampipe steampipe-engine zero-etl
Last synced: 28 Jul 2025
https://github.com/mysociety/yournextrepresentative
A website for crowd-sourcing structured election candidate data
civic-tech data elections politics
Last synced: 02 Aug 2025
https://github.com/vijinho/iso-country-data
ISO Country data in JSON and CSV format.
country-data currencies data iso-country json-data
Last synced: 21 Jan 2026
https://github.com/m4rz910/NYISOToolkit
Access data, statistics, and visualizations for New York's electricity grid.
analysis clcpa clean-energy data datascience datasets decarbonization electricity energy kaggle kaggle-competition kaggle-dataset machine-learning ml newyork nyiso renewable-energy visualization
Last synced: 07 May 2025
https://github.com/emberexperts/ember-await
Await component for Ember Applications. Resolve your data on demand, just when needed.
await data ember-addon ember-await emberjs javascript loading
Last synced: 12 Apr 2025
https://github.com/jonschlinkert/cache-base
Basic object store with methods like get/set/extend/omit
cache config data dot-notation emit extend inherit javascript node nodejs object store
Last synced: 05 Apr 2025
https://github.com/colour-science/colour-datasets
Colour science datasets for use with Colour
color color-science color-space color-spaces colorspace colorspaces colour colour-science colour-space colour-spaces colourspace colourspaces data dataset datasets python spectral-data spectral-dataset spectral-datasets
Last synced: 06 Apr 2025
https://github.com/tg12/script-toolbox
This repository contains a collection of scripts and tools that I have written to solve various problems that I have come across.
data exif exif-metadata exif-reader exiftool scripts scripts-collection toolbox toolkit toolkits
Last synced: 27 Jul 2025
https://github.com/datakitchen/dataops-testgen
DataOps Data Quality TestGen is part of DataKitchen's Open Source Data Observability. DataOps TestGen delivers simple, fast data quality test generation and execution by data profiling, new dataset hygiene review, AI generation of data quality validation tests, ongoing testing of data refreshes, & continuous anomaly monitoring
data data-engineering data-observability data-quality data-science data-testing datachecker dataops dataprofiling dataquality datavalidation mssql postgresql python redshift self-hosted snowflake
Last synced: 06 Apr 2025
https://github.com/alinski29/stonks.jl
Julia library for standardizing financial data retrieval and storage from multiple APIs.
data data-mining data-science dataframe finance julia trading trading-algorithms
Last synced: 06 May 2025
https://github.com/zachowj/xfinity-data-usage
Fetch Xfinity data usage and serve it via an HTTP endpoint, publish it to MQTT or post it to an URL.
data home-assistant mqtt usage xfinity xfinity-data
Last synced: 09 Mar 2025
https://github.com/zbrookle/sql_to_ibis
A Python package that parses sql and converts it to ibis expressions
data databases dataframes etl hacktoberfest ibis sql
Last synced: 14 Apr 2025
https://github.com/ioos/bio_data_guide
Standardizing Marine Biological Data Working Group - An open community to facilitate the mobilization of biological data to OBIS.
darwin-core data data-management marine-biology marine-data obis tutorials
Last synced: 30 Oct 2025
https://github.com/chase-manning/pokemon-tcg-pocket-cards
An open source repo for data on the Pokemon TCG Cards
api data pokemon pokemon-tcg pokemon-tcg-pocket pokemon-trading-card-game-pocket
Last synced: 12 Nov 2025
https://github.com/mcarlucci/decky-storage-cleaner
A Decky Loader plugin for tidying up your Steam Deck's storage. Quickly visualize, select and clear shader cache and compatibility data.
cache cleaner compat compatibility data decky decky-loader plugin shader steam steamdeck storage utility
Last synced: 03 Mar 2025
https://github.com/spratiher9/sparkora
Powerful rapid automatic EDA and feature engineering library with a very easy to use API 🌟
apache apache-spark data data-analysis data-analysis-python data-analytics easy-to-use eda exploratory-data-analysis open-source opensource pyspark python python3 toolkit
Last synced: 10 Jul 2025
https://github.com/OElesin/querypal
Web UI for Amazon Athena
analytics aws aws-athena data data-lake sql
Last synced: 30 Jul 2025
https://github.com/tomhanika/conexp-clj
A General-Purpose Tool for Formal Concept Analysis
clojure closure-systems conceptual-knowledge data data-analysis data-science formal-concept-analysis lattice order order-theory
Last synced: 30 Dec 2025