data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum
- Last updated: 2026-02-03 00:08:04 UTC
https://github.com/daveoncode/pyvaru
Rule based data validation library for python 3.
data data-validation form-validation model validation validator
Last synced: 06 May 2025
https://github.com/marcskovmadsen/holoviz-mcp
✨ An MCP server that provides intelligent access to the HoloViz ecosystem for humans and AIs.
ai analytics data datascience dataviz holoviews holoviz hvplot mcp model-context-protocol panel python
Last synced: 25 Jan 2026
https://github.com/jef1056/riceteacatpanda
repo with challenge material for riceteacatpanda (2020)
ai artificial-intelligence artificial-intelligence-algorithms binary-exploitation computer-science computer-vision cryptography ctf ctf-challenges cyber-security cybersecurity data data-analysis data-analytics natural-language-processing neural-network neural-networks website
Last synced: 14 Jul 2025
https://github.com/izelnakri/memoria
Single JS/TS ORM for frontend, backend & in-memory in-browser testing. Based on typeorm API, allows you to change adapters: MemoryAdapter, RESTAdapter, SQLAdapter etc.
browser data database decorators frontend graphql in-memory-database javascript json-api mock-server orm rest rest-client server sql state state-management testing-tools typeorm typescript
Last synced: 29 Jun 2025
https://github.com/nasa/ziggy
Ziggy, a portable, scalable infrastructure for science data processing pipelines, is the child of the Transiting Exoplanet Survey Satellite (TESS) pipeline and the grandchild of the Kepler Pipeline.
algorithm analysis arc data data-analysis data-reduction java k2 kepler linux macos nasa open-source pipeline science tess ziggy
Last synced: 26 Jan 2026
https://github.com/stefen-taime/car-price-predictor
Predicting Car Prices with FastAPI, Streamlit, MLflow, Kafka, and Debezium: A Practical Demonstration
data data-science dataanalysis-projects engineering machine-learning mlops predictive-modeling
Last synced: 04 Aug 2025
https://github.com/hitsz-ids/argus
Argus is a result review engine that prevents data leakage and ensures out-of-domain result traceability.
data data-protection data-security privacy secruity
Last synced: 01 Jul 2025
https://github.com/jgmdev/lessram
Pure PHP implementation of array data structures that use less memory.
data extension less memory php ram structures
Last synced: 15 May 2025
https://github.com/siddharthpatelde/distance-to-next-edge
This project builds the logic to calculate the distance to the next edge when a robot equipped with a 2D LIDAR sensor is placed on a table. It uses the RPlidar.h library and a Raspberry Pi Pico to work with the LIDAR sensor.
2dlidar arduino cpp data data-visualization filtering-data functions jason lidar linux lowpass-filter mathematics physics raspberry-pi-pico ros serial-communication trignometry uart
Last synced: 12 Aug 2025
https://github.com/DesignandHuman/qui-possede-les-medias
Who owns the major media outlets we read?
data extension holders media web-extension
Last synced: 02 Aug 2025
https://github.com/xiaodaigh/fstfileformat.jl
Julia bindings for the fst format
data fst julia julia-language julialang
Last synced: 07 May 2025
https://github.com/ibhavikmakwana/sample_data
Generate random sample data to test your application.
dart data flutter sample sample-data
Last synced: 26 Mar 2025
https://github.com/ff137/bitstamp-btcusd-minute-data
Daily updates of Bitstamp BTC/USD 1-minute OHLC data, with historical data since 2012
bitcoin bitstamp candle data ohlcv-data price-data
Last synced: 09 Aug 2025
https://github.com/grimen/python-attributedict
A dictionary object with attributes support - for Python.
attribute attributes custom data dict dictionary object properties property python struct
Last synced: 03 May 2025
https://github.com/snowplow/quickstart-examples
Examples of how to automate creating a Snowplow Community Edition pipeline
analysis aws azure data gcp snowplow snowplow-analytics snowplow-pipeline terraform
Last synced: 21 Apr 2025
https://github.com/knime/knime-r
KNIME Interactive R Statistics Integration
analysis data knime learning machine mining r statistics
Last synced: 21 Jan 2026
https://github.com/hasnep/dataskimmer.jl
📊 A Julia package that summarises tabular data in the REPL
Last synced: 28 Dec 2025
https://github.com/malloydata/malloy-samples
Malloy model examples and associated datasets
data data-modeling malloy semantic-modeling sql
Last synced: 16 Oct 2025
https://github.com/atolcd/hop-gis-plugins
🗺 GIS plugins for Apache Hop Orchestration Platform
data dxf etl geojson geopackage gis gpx hop java mif-mid shp spatialite
Last synced: 23 Jan 2026
https://github.com/malloydata/malloy-vscode-extension
The Malloy Visual Studio Code extension facilitates building Malloy data models, querying and transforming data, and creating simple visualizations and dashboards
data data-modeling malloy semantic-modeling sql
Last synced: 26 Apr 2025
https://github.com/osl-pocs/skdata
Python tools for data analysis
data data-analysis data-science open-data python
Last synced: 12 Dec 2025
https://github.com/castelao/gsw-rs
Unofficial Gibbs Sea Water Oceanographic Toolbox of TEOS-10 in Rust
data ocean ocean-sciences oceanography rust
Last synced: 03 Apr 2025
https://github.com/vzhufk/z1p
ZIP code validation and parsing.
data geo geocode geolocation latitude longitude zip zipcode
Last synced: 12 Jan 2026
https://github.com/m-clark/noiris
Any data but iris 👁
big-five-model data fashion-mnist gapminder google-apps iris-dataset kiva movielens movielens-dataset mtcars pisa-data r sp500 starwars starwars-api water-risk world-happiness-report
Last synced: 30 Apr 2025
https://github.com/stephanakkerman/crypto-ohlcv
Gets historical OHLCV data from supported exchanges and converts it into dataframe readable by TensorTrade.
binance bitcoin crypto cryptocurrency data ethereum ftx ohlcv ohlcv-data
Last synced: 10 Apr 2025
https://github.com/bradleyboehmke/completejourney
An R data 📦 of retail shopping transactions for 2469 households over one year
Last synced: 13 Apr 2025
https://github.com/siongui/data
Data files for Pāḷi Tipiṭaka, Pāḷi Dictionaries, and external libraries
Last synced: 08 May 2025
https://github.com/axetroy/struct
A modern, scalable, graceful, easy-to-use data structure validator
Last synced: 13 Sep 2025
https://github.com/dkoguciuk/mesh2pointcloud
Mini scripts to sample ModelNet40 or ShapeNetCore55v2 meshes into 3D point clouds
3d data deep-learning machine-learning
Last synced: 20 Mar 2025
https://github.com/gsurma/twitter_data_parser
Python scripts that download metadata and tweets for given users.
data machine-learning parser python python2 twitter twitter-api
Last synced: 20 Jul 2025
https://github.com/peterdavehello/docker-azcopy
🐳 Tiny Dockerized AzCopy (Azure Storage data transfer utility) inside Alpine Linux 🐧 (~10MB)
azcopy azure cli container copy data docker docker-image hacktoberfest storage sync
Last synced: 18 Mar 2025
https://github.com/rririanto/redash-query-cheatsheets-mongodb
Redash (redash.io) query cheatsheet for MongoDB user metrics
data data-analysis data-visualization mongodb query-cheatsheet-redash redash redashio visualize-data
Last synced: 11 Apr 2025
https://github.com/Canner/vulcan-sql-examples
Curated VulcanSQL showcases
analytics api-builder bigquery data data-lake data-warehouse database duckdb examples postgresql reporting restful-api sql vulcan-sql vulcansql
Last synced: 11 Apr 2025
https://github.com/hodur-org/hodur-lacinia-schema
Hodur is a domain modeling approach and a collection of libraries for Clojure. By using Hodur you can define your domain model as data, parse and validate it, and then either consume your model via an API or use one of the many plugins to help you achieve mechanical results faster and in a purely functional manner.
clojure data graphql lacinia modeling schema
Last synced: 12 Dec 2025
https://github.com/rezapace/komputasi-big-data
This repository contains materials and practical exercises for learning Python in the context of Big Data Computation. The focus is on analyzing and processing large datasets using various tools and techniques.
ai big data data-science git-reza gunadarma gundar komputasi-big-data
Last synced: 28 Sep 2025
https://github.com/benoitvx/data-gouv-skill
🇫🇷 Professional skill for Claude Code - Access to the data.gouv.fr open data catalog
claude-code data datagouv france opendata python
Last synced: 13 Jan 2026
https://github.com/senrok/yadal
Yet Another Data Access Layer: Accessing S3, POSIX in the same way. Deeply inspired by Databend's OpenDAL
cloud-native data go minio s3 storage
Last synced: 12 Jan 2026
https://github.com/meomundep/meomundep-airdrop-data-base.
Contribute stars if you want me to make scripts fast :)
airdrop airdrop-claim-bot airdrop-farm airdrop-free airdrops-bot airdrops-tools data data-base github meomundep scrape web
Last synced: 08 Aug 2025
https://github.com/espoirx/elegantdata
Work with SharedPreferences and File files the same way you work with Room.
data database db elegantdata file room sharedpreferences
Last synced: 01 Sep 2025
https://github.com/gragland/react-component-data
🍯 Data fetching for server-rendered React applications.
data props react resolving server-rendered universal-app
Last synced: 07 May 2025
https://github.com/kiwicom/contessa
Easy way to define, execute and store quality rules for your data.
data data-engineering data-quality framework mysql postgres python quality-assurance sqlite3
Last synced: 29 Jul 2025
https://github.com/aramshiva/nomen
✍️ A web viewer of every name
baby-names data drizzle mysql names nextjs shadcn social-security-administration ssa tailwind
Last synced: 12 Oct 2025
https://github.com/dimitryzub/hotels-scraper-js
Scrape Airbnb, Booking, Hotels.com from a single JavaScript module. ❗No longer maintained.
airbnb booking data datascraping hotels hotels-api playwright puppeteer puppeteer-extra webscraping
Last synced: 07 Sep 2025
https://github.com/ahmetfurkandemir/trendyol-smartphone-price-prediction
Trendyol Smartphone Price Prediction
aws aws-ec2 data datascience flask flask-api linear-regression machine-learning python scikit-learn trendyol
Last synced: 14 Oct 2025
https://github.com/hodur-org/hodur-spec-schema
Hodur is a domain modeling approach and a collection of libraries for Clojure. By using Hodur you can define your domain model as data, parse and validate it, and then either consume your model via an API or use one of the many plugins to help you achieve mechanical results faster and in a purely functional manner.
clojure data modeling schema spec types validation
Last synced: 12 Dec 2025
https://github.com/ajayarunachalam/gui-pandas-ai
GUIPandasAI - Integrating generative AI capabilities into Pandas as a web interface, along with keyword-based data analysis services
ai chatgpt data data-analysis data-analytics data-science generative-ai gpt-3 gpt-4 llm pandas python streamlit web-app
Last synced: 06 Jul 2025
https://github.com/uladz-zubrycki/Reseed
Initialize and clean integration tests database in a convenient, reliable and fast way.
csharp data data-seeding database dotnet integration-testing mssql mssql-database ndbunit2 netcore seed tests
Last synced: 06 Aug 2025
https://github.com/juliahealth/icd_gems.jl
ICD_GEMs.jl is a Julia package that translates ICD-9 codes into ICD-10 and vice versa via the General Equivalence Mappings (GEMs) of the International Classification of Diseases (ICD).
cdc clinical-data clinical-research data death-certificates epidemiology health-data icd-10 icd-10-cm icd-9 icd-codes julia julia-language julia-package mortality-data public-health who
Last synced: 22 Apr 2025
https://github.com/marykdb/maryk
Maryk is a Kotlin Multiplatform library that helps you store, query and send data in a structured way across multiple platforms. The data store stores any value with a version, so it is possible to request only the changed data or listen for live updates.
data database graph json kotlin kotlin-multiplatform rocksdb serialization versioned yaml
Last synced: 14 Jan 2026
https://github.com/danilofreire/prisonbrief
An R package that returns tidy data from the World Prison Brief website.
data prison rstats world-prison-brief
Last synced: 09 Apr 2025
https://github.com/mark-hoffmann/fastteradata
Tools for faster and optimized interaction with Teradata and large datasets.
data fastexport fastteradata python teradata
Last synced: 26 Jan 2026
https://github.com/brightway-lca/brightway2-data
Tools for the management of inventory databases and impact assessment methods. Part of the Brightway LCA framework.
brightway data life-cycle-assessment python
Last synced: 19 Oct 2025
https://github.com/glamboyosa/mey
A react package that exports hooks for handling the request lifecycle.
data data-fetching fetch hooks react react-native
Last synced: 12 May 2025
https://github.com/kettanaito/react-data-preview
Fancy interactive preview of your JavaScript data.
data data-preview javascript preview react react-data-preview
Last synced: 06 May 2025
https://github.com/tarantool/sdvg
Synthetic Data Values Generator
csv-generator data data-generation data-generator generation generator http-generator parquet-generator random-data random-data-generation synthetic-data synthetic-data-generation synthetic-dataset-generation test-data test-data-generator
Last synced: 12 Jan 2026
https://github.com/zq99/pgn2data
A library that converts a chess pgn file into a tabulated CSV data set.
chess chess-analysis csv data dataset fen library pgn
Last synced: 17 Jan 2026
https://github.com/ECCC-MSC/msc-animet
MSC AniMet is a simple tool enabling users to interact with MSC Open Data weather data and create custom weather animations for any area in the world. The resulting animations can be downloaded and shared with a permalink.
animation canada data visualization weather wms
Last synced: 20 Jul 2025
https://github.com/hoangsonww/latticedb-nextgen-dbms
🗂️ A next-gen relational database with mergeable CRDT tables, time-travel queries, vector search, and differential privacy built-in. Written in C++17 with a SQL engine, WAL storage, and a modern web Studio.
cmake cplusplus data database databases db dbms dbms-project docker file relational-database relational-databases sql sql-parser
Last synced: 13 Sep 2025
https://github.com/chainguard-dev/image-comparison
Comparison of Chainguard Images to others
containers data no-ghaudit-branch-protections security
Last synced: 29 Oct 2025
https://github.com/d-wasserman/shared-row
This is an open data specification for describing the right-of-way (ROW) for street centerline networks. It is intended to establish a common set of attributes (schema) to describe how space is allocated along a street's right-of-way from sidewalk edge to sidewalk edge.
data right-of-way row schema sharedstreets specification standard streets
Last synced: 05 May 2025
https://github.com/marioruiz/string_pattern
Generate strings from a simple pattern. Perfect for use in test data factories. Validate whether a text fulfills a specific pattern. You can also use regular expressions (Regexp) to generate strings: `/[a-z0-9]{2,5}\w+/.gen`. Generate words in English or Spanish.
data error-detection factories generation pattern random regex-pattern regexp regular-expressions ruby ruby-gem string test
Last synced: 05 May 2025
https://github.com/retailmenotsandbox/dart
Self-service data workflow management
Last synced: 10 Apr 2025
https://github.com/lcsb-biocore/distributeddata.jl
Simple distributed data manipulation and processing routines in Julia
Last synced: 22 Apr 2025
https://github.com/itsjafer/tv-show-recommendations
Machine learning pipeline trained offline that, given a TV Show, recommends 10 similar TV Shows using cosine similarities based on a variety of features
data engine learning machine python recommendation science tv-shows
Last synced: 09 Jul 2025
https://github.com/chase-manning/eth-twitter-accounts
Data dump of 15,451 Twitter accounts and their Ethereum addresses
Last synced: 12 Apr 2025
https://github.com/minhaskamal/alphabetrecognizer
Simple Optical Character Recognizer (english-ocr-image-to-text-recognition-sample-trainig-alphabet-photo-data-database-dataset)
alphabet-recognizer data database english image-processing java machine-learning ocr sample template-matching text-recognition training-data writing
Last synced: 11 Apr 2025
https://github.com/anuran-roy/serpytor
A distributed, low-code, end-to-end data collection and analysis tool for data folks. Take the pain out of data collection from your pipeline!
data dataengineering datascience distributed-computing distributed-systems low-code lowcode open-source pipeline python python3
Last synced: 16 May 2025
https://github.com/randomfractals/duckdb-sql-tools
DuckDB SQL Tools add DuckDB support to VSCode, and provide database schema and SQL query interfaces for the popular SQLTools extension, SQL query editor, language server, and data processing tools.
data data-tools duckdb sql sql-tools sqltools sqltools-driver viewer vscode
Last synced: 22 Mar 2025
https://github.com/nxr-deen/student-records
This repository contains a C program that manages student data in a binary file, allowing for input and retrieval of records.
binaryfiles c data management records students
Last synced: 06 Aug 2025
https://github.com/nevillelyh/scio-koans
A collection of Scio exercises inspired by Ruby Koans and many others.
Last synced: 22 Aug 2025
https://github.com/w2sv/koala
A poor man's version of a pandas DataFrame for Dart.
dart dartlang data dataframe datamanagement datamanipulation flutter pandas-dataframe
Last synced: 09 Sep 2025
https://github.com/vkcom/vkdata-sketchplugin
Sketch plugin for using data from your account at vk.com
data sketch sketch-plugin sketchapp vk vkontakte
Last synced: 27 Sep 2025
https://github.com/magoo-magoo/keyrier-json
SQL queries on JSON & CSV
data desktop html json keyrier-json react sql web webapp
Last synced: 05 Oct 2025
https://github.com/marco-roy/DDO
A DBT package to perform DataOps & administrative CI/CD on your data warehouse.
data dataops datawarehouse datawarehouseautomation dbt snowflake
Last synced: 05 May 2025
https://github.com/geoscienceaustralia/gnssanalysis
Basic Python module for GNSS analysis
coordinate-systems coordinate-transformation crustal-deformation data data-analysis data-analysis-python geodesy geodesy-functions geophysics geospatial gnss gnss-signals gps gps-data transformation
Last synced: 20 Aug 2025
https://jhildenbiddle.github.io/class-change/
A micro-library for manipulating CSS class names, triggering change events using HTML data attributes, and creating declarative class-related event listeners
attributes change class classlist css data event event-listener html listener polyfill ponyfill
Last synced: 11 May 2025
https://github.com/mikestefanello/batcher
Type-safe, automatic, asynchronous batch processing.
batch batch-processing concurrency data goroutines
Last synced: 26 Sep 2025
https://github.com/vojay-dev/sc2-data-pipeline
StarCraft 2 Data Pipeline with Airflow, DuckDB and Streamlit
airflow data data-engineering data-science duckdb starcraft2 streamlit
Last synced: 20 Sep 2025
https://github.com/tushar2704/everyday_python
Welcome to Everyday Python Sheets – your go-to resource for everyday Python cheat sheets, pro tips, interview questions, Python one-liners, and Python data structures. Whether you're a beginner looking to learn Python or an experienced developer seeking quick reference materials, this Streamlit application has got you covered.
artificial-intelligence cheatsheet data data-analysis data-science data-structures data-visualization database protips python streamlit streamlit-tushar2704 tushar2704
Last synced: 04 Nov 2025
https://github.com/jhildenbiddle/class-change
A micro-library for manipulating CSS class names, triggering change events using HTML data attributes, and creating declarative class-related event listeners
attributes change class classlist css data event event-listener html listener polyfill ponyfill
Last synced: 17 Aug 2025