data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum
- Last updated: 2026-06-20 00:07:41 UTC
https://github.com/benthosdev/benthos-captain
A Kubernetes Operator to orchestrate Benthos pipelines
benthos data data-engineering gitops go golang helm kubernetes kustomize pipelines stream-processing
Last synced: 22 Jan 2026
https://github.com/greenelab/pubtator
Retrieve and process PubTator annotations
data nlp pubmed pubtator snorkel text-mining tool
Last synced: 05 May 2025
https://github.com/yzfly/mcp-excel-server
The Excel MCP Server is a powerful tool that enables natural language interaction with Excel files through the Model Context Protocol (MCP). It provides a comprehensive set of capabilities for reading, analyzing, visualizing, and writing Excel data.
claude claude-mcp data excel mcp mcp-excel-server
Last synced: 06 Jul 2025
https://github.com/albar965/navdatareader
Navdatareader is a command line tool that uses the atools fs/bgl and fs/writer to store a full flight simulator scenery database into a relational database like Sqlite or MySql.
compiler data flight fsx map navigation prepar3d simulator x-plane
Last synced: 02 May 2025
https://github.com/guocaoyi/meituan-spider
Meituan™ crawler practice project (Region, POI, shops, products)
china-city data learning meituan meituan-pois poi puppeteer reptile reptile-nodejs
Last synced: 17 Aug 2025
https://github.com/vida-nyu/data-polygamy
Data Polygamy is a topology-based framework that allows users to query for statistically significant relationships between spatio-temporal data sets.
Last synced: 10 Apr 2025
https://github.com/ethicnology/ophois
Creates street graph from OpenStreetMap
data graph network openstreetmap osm street
Last synced: 11 Oct 2025
https://github.com/rOpenSpain/climaemet
R Climate AEMET Tools
aemet climate cran data forecast-api r r-package ropenspain rstats science spain weather-api
Last synced: 20 Jul 2025
https://github.com/cdcgov/cdc-open-viz
CDC OpenViz is a library of React packages for data visualization.
data react visualization visualization-library
Last synced: 04 Apr 2025
https://github.com/nasdaq/hackathons
Nasdaq's realtime streaming stock market data for hackathons.
data hackathon market market-data nasdaq real-time realtime stock-market streaming
Last synced: 18 Oct 2025
https://github.com/ausaki/python-validator
a data validator like Django ORM
data python python-validator schema validate validation validation-library validator
Last synced: 04 Feb 2026
https://github.com/j535d165/cbsodata
Unofficial Statistics Netherlands (CBS) open data API client for Python
census-api census-data data national-statistics netherlands open-data python-library
Last synced: 05 Apr 2025
https://github.com/fedora-infra/datagrepper
HTTP API for datanommer and the fedmsg bus
data data-analysis data-science fedora fedora-project postgres postgresql python
Last synced: 12 May 2025
https://github.com/adieuadieu/japan-train-data
🇯🇵 🚂 A circular object of train data for Japan including translations & station geocoding and a tool to generate it.
data eki japan nihon train translations
Last synced: 18 Mar 2025
https://github.com/itext/itext-pdfocr-dotnet
pdfOCR is an iText 7 add-on to recognize and extract text in scanned documents and images. It can also convert them into fully ISO-compliant PDF or PDF/A-3u files that are accessible, searchable, and suitable for archiving
archival character data diacritic extractable glyphs hindi image iso-compliant ligatures mandarin ocr optical pdf portuguese recognition scan searchable spanish tesseract
Last synced: 08 Jan 2026
https://github.com/asreview/asreview-makita
Workflow generator for simulation studies using the command line interface of ASReview LAB
asreview data machine-learning python simulation systematic-literature-reviews systematic-reviews utrecht-university
Last synced: 14 Feb 2026
https://github.com/paezha/idealista18
Open data product with real estate listings from Idealista. The datasets are for three major cities in Spain and the year 2018. https://doi.org/10.1177/23998083241242844
data open-data-products packages r real-estate spain spatial
Last synced: 29 Jun 2025
https://github.com/data-fair/data-fair
Findable, Accessible, Interoperable and Reusable Data. A complete open-source solution for your open and private data needs. French only for the time being, internationalization coming soon.
api data datasets docker nocode nocodeapi nodejs open-data openapi3
Last synced: 23 Apr 2026
https://github.com/PatrickCuba-zz/thedatamustflow
Visio stencils and artefacts related to data vault guru
data data-vault stencil vault visio
Last synced: 20 Jul 2025
https://github.com/jpmorganchase/py-avro-schema
Generate Apache Avro schemas for Python types including standard library data-classes and Pydantic data models.
avro data dataclasses deserialization generate jpmorganchase kafka messaging pydantic python schema serialization types
Last synced: 28 Jun 2025
https://github.com/tombarr/open-source-words
Visualization of the most frequent words used in open source projects
d3 data data-visualization javascript python
Last synced: 13 Apr 2025
https://github.com/xefi/faker-php-symfony
Symfony integration of the xefi\faker-php package
data fake faker php symfony symfony-bundle
Last synced: 18 Mar 2025
https://github.com/alir3z4/django-databrowse
Databrowse is a Django application that lets you browse your data.
Last synced: 11 Apr 2025
https://github.com/jobovy/apogee
Tools for dealing with APOGEE data
astronomy astrophysics data data-analysis python spectroscopy
Last synced: 02 Oct 2025
https://github.com/the-alchemists-of-arland/gray-matter-rs
A tool for easily extracting front matter out of a string. It is a fast Rust implementation of gray-matter. Parses YAML, JSON, and TOML, with support for custom parsers. Use it and let me know by giving it a star!
data front-matter front-matter-parsers frontmatter gray-matter gray-matter-rs gray-matter-rust markdown matter parse rust rust-crate yaml
Last synced: 10 Apr 2025
https://github.com/Articdive/ArticData
Collection of data extracted from Minecraft.
data data-extraction data-mining java json mc minecraft minecraft-data minecraft-server minecraft-servers registry
Last synced: 08 May 2025
https://github.com/webankblockchain/data-export
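Data-Export supports exporting on-chain data to storage media such as MySQL and ES that are convenient for big-data processing, solving the problems of complex querying, analysis, visualization, and processing of blockchain data.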
blockchain consortium data data-governance export webank-blockchain
Last synced: 09 Jul 2025
https://github.com/Maicius/UniversityRecruitment-sSurvey
Using serious data to answer the question "What kinds of companies recruit at what kinds of universities?"
analysis beautifulsoup crawler data redis university
Last synced: 06 Mar 2025
https://github.com/jonschlinkert/write-yaml
Basic node.js utility for converting JSON to YAML and writing formatted YAML files to disk.
data disk file file-system fs write yaml
Last synced: 11 Apr 2025
https://github.com/hyunjoonbok/Python-Projects
Portfolio in Python
augmentation cnn-classification data data-visualization dataanalytics datascience deep-learning forecasting gan lightgbm machine-learning nlp rnn rnn-pytorch textclassification timeseries xgboost
Last synced: 16 Apr 2025
https://github.com/bitol-io/open-data-product-standard
Home of the Open Data Product Standard (ODPS).
data data-engineering data-mesh data-product data-products data-quality standard
Last synced: 10 Mar 2026
https://github.com/0015/python-data-sampling-app
Data Sampling App from Serial to CSV file
accelerometer arduino csv data esp32 gyroscope pysimplegui python sampling serial-communication serialport
Last synced: 26 Apr 2025
https://github.com/prioritizr/wdpar
Interface to the World Database on Protected Areas
biodiversity conservation cran data database protected-areas r r-package rstats spatial
Last synced: 01 Jul 2025
https://github.com/emilyriederer/data-disasters
book bookdown data data-analysis data-science
Last synced: 21 Feb 2026
https://github.com/rmax/databrewer
The missing datasets manager. Like Homebrew but for datasets. A CLI tool to search and discover datasets!
command-line data datasets discovery python
Last synced: 20 Mar 2025
https://github.com/ropenspain/climaemet
R Climate AEMET Tools
aemet climate cran data forecast-api r r-package ropenspain rstats science spain weather-api
Last synced: 07 Apr 2025
https://github.com/shannonmoeller/handlebars-wax
The missing Handlebars API for data, partials, helpers, and decorators.
data decorators expressjs glob handlebars helpers nodejs partials
Last synced: 14 Apr 2026
https://github.com/maicius/universityrecruitment-ssurvey
Using serious data to answer the question "What kinds of companies recruit at what kinds of universities?"
analysis beautifulsoup crawler data redis university
Last synced: 28 Apr 2025
https://github.com/tradewelltech/beavers
Python stream processing for analytics
analytics apache-arrow data kafka pandas python realtime stream-processing
Last synced: 14 Jan 2026
https://github.com/ipeagit/flightsbr
R Package to Download Flight and Airport Data from Brazil
aviation-data brazil data r rstats rstats-package
Last synced: 02 May 2025
https://github.com/Link-/uber_data
Uber web interface crawler / scraper - Convert the trips table into a CSV file
analysis data jupyter uber-crawler uber-data
Last synced: 04 May 2025
https://github.com/yiuman/data-visulaization
:scream_cat: Data visualization: implements draggable data visualization views and data-fetching configuration
data data-visualization datavalidation draggable vcharts-echarts visual visualization vue
Last synced: 19 Mar 2025
https://github.com/junyuan-chen/readstattables.jl
Read and write Stata, SAS and SPSS data files with Julia tables
data dataframe datasets julia sas spss stata statistics tables tabular-data
Last synced: 27 Jan 2026
https://github.com/teamreboott/data-modori
data data-analysis data-preprocessing data-visualization llm lmops
Last synced: 14 Jan 2026
https://github.com/curran/d3-in-motion
Code examples and references for the course "D3.js in Motion"
chart d3js data dataviz html5 programming teaching visualization web
Last synced: 05 Feb 2026
https://github.com/ahdinosaur/rimu
Template language for structured data: functional YAML 🌱
configuration configuration-language data data-structures expression-evaluator expression-language functional json serde string-interpolation template template-engine toml untrusted-values yaml
Last synced: 20 Sep 2025
https://github.com/ashleydavis/sql-to-mongodb
A Node.js script to convert an SQL table to a MongoDB database.
convert-sql-to-mongodb data database javascript mongodb mongodb-database nodejs nosql sql sql-table
Last synced: 24 Oct 2025
https://github.com/cityofaustin/knackpy
A Python client for interacting with Knack applications
api api-wrapper csv data etl hacktoberfest knack knackhq python python-client
Last synced: 04 Jun 2026
https://github.com/JuliaClimate/ClimateBase.jl
Tools to analyze and manipulate climate (spatiotemporal) data. Also used by ClimateTools and ClimatePlots
analysis climate data hacktoberfest julia spatiotemporal
Last synced: 20 Jul 2025
https://github.com/iodepo/odis-arch
Development of the Ocean Data and Information System (ODIS) architecture
catalogue data interoperability knowledge-graph metadata ocean ogc-services rdf sharing
Last synced: 20 Jul 2025
https://github.com/sghall/chord-transitions
Transitioning Chord Diagram Demo with Angular/D3
angularjs d3js data visualisation
Last synced: 22 Mar 2025
https://github.com/m-ahmadi/tse-client
A client for fetching stock data from the Tehran Stock Exchange (TSETMC). Works in Browser, Node and as CLI.
browser caching cli cli-app compression crawler data dataset downloader iran node-module stock stock-data stock-market stock-prices tehran ticker tsetmc universal
Last synced: 18 Feb 2026
https://github.com/visualize-admin/visualization-tool
The tool for visualizing Swiss Open Government Data. Project ownership: Federal Office for the Environment FOEN
data data-visualization linked-data open-data open-government visualization
Last synced: 05 Mar 2026
https://github.com/zakarialaoui10/mapfun
mapfun is a function that applies a mapping function to an infinite number of input elements, with options to skip certain elements and selectively apply the mapping to keys and/or values of objects. The origin of this function traces back to zikojs
awesome data function javascript map php python
Last synced: 17 Mar 2026
https://github.com/geoscienceaustralia/gnssanalysis
basic python module for gnss analysis
coordinate-systems coordinate-transformation crustal-deformation data data-analysis data-analysis-python geodesy geodesy-functions geophysics geospatial gnss gnss-signals gps gps-data transformation
Last synced: 12 Mar 2026
https://github.com/runprism/alto
Serverless for data practitioners. The fastest ⚡️ way to run your code in the cloud. Effortlessly run scripts, functions, and Jupyter notebooks in virtual machines.
aws cli cloud data data-analysis data-science deployment ec2 entrypoint function gcp infrastructure jupyter python serverless
Last synced: 31 May 2026
https://github.com/juliaclimate/climatebase.jl
Tools to analyze and manipulate climate (spatiotemporal) data. Also used by ClimateTools and ClimatePlots
analysis climate data hacktoberfest julia spatiotemporal
Last synced: 11 Apr 2025
https://github.com/WSAyan/medicinedb
sqlite medicine database
bangladesh csv data database drugs json medicine sqlite
Last synced: 11 Jun 2026
https://github.com/Corfucinas/crypto-candlesticks
Download candlestick data fast & easy for analysis
bitcoin bitfinex candlesticks crypto-candlesticks cryptocurrency cryptocurrency-candlesticks-data data download historical-prices ohlcv prices python
Last synced: 07 Apr 2025
https://github.com/mathiasrichter/shapiro
Modelling data with JSON-LD, Turtle, SHACL
data data-structures json-ld json-schema linked-data model-as-code openapi rdf schema semantic semantic-web shacl sparql turtle
Last synced: 21 Nov 2025
https://github.com/fsolt/swiid
Standardized World Income Inequality Database
Last synced: 29 Oct 2025
https://github.com/gagolews/clustering-benchmarks
A framework for benchmarking clustering algorithms
benchmark-suite benchmarking cluster cluster-analysis clustering clustering-algorithms clustering-benchmarks clustering-evaluation data data-science dataset datasets ground-truth machine-learning
Last synced: 10 Apr 2025
https://github.com/intothedev/scriptable-object-loader
Load Scriptable Objects via code
config data inspector scriptableobject unity unity-scripts unity2d unity3d
Last synced: 24 Oct 2025
https://github.com/TomFevrier/kiwis
A Pandas-inspired data wrangling toolkit in JavaScript
data data-manipulation data-wrangling pandas
Last synced: 15 Mar 2025
https://github.com/tomfevrier/kiwis
A Pandas-inspired data wrangling toolkit in JavaScript
data data-manipulation data-wrangling pandas
Last synced: 05 Apr 2026
https://github.com/remotesensinginfo/rsgislib-tutorials
A set of notebook tutorials for RSGISLib.
analysis classification data earth learning machine observatiob python remote rsgislib sensing tutorials
Last synced: 21 Feb 2026
https://github.com/arturoeanton/go-notebook
Go-Notebook is inspired by the Jupyter Project and is meant for documenting Golang code.
data data-science data-visualization documentation gobook godoc golang golang-examples golang-notebook golang-tools gomacro jupyter notebook notebook-jupyter plot repl shell-go
Last synced: 29 Oct 2025
https://github.com/daoodaba975/galsenify
A comprehensive library for Senegalese data; it offers a lot of information about the country of Teranga 📦
data hacktoberfest made-in-senegal npm-package
Last synced: 04 Mar 2026
https://github.com/pityka/saddle
SADDLE: Scala Data Library
data data-science dataframe linear-algebra matrix numpy pandas scala
Last synced: 02 Oct 2025
https://github.com/howprogrammingworks/datatypes
Built-in data types
bigint boolean data data-type data-types function javascript js number object string symbol
Last synced: 04 Apr 2025
https://github.com/fahad19/tydel
Typed Models & Collections for JavaScript data structure
data immutable javascript models structure
Last synced: 19 Apr 2025
https://github.com/mathiasbynens/unicode-tr51
Emoji data extracted from Unicode Technical Report #51.
Last synced: 20 Jun 2025
https://github.com/epiforecasts/covidregionaldata
An interface to subnational and national level COVID-19 data. For all countries supported, this includes a daily time-series of cases. Wherever available we also provide data on deaths, hospitalisations, and tests. National level data is also supported using a range of data sources as well as linelist data and links to intervention data sets.
covid-19 data open-science r6 regional-data rstats
Last synced: 14 Jun 2025
https://github.com/rafzamb/sknifedatar
sknifedatar is a package that serves primarily as an extension to the modeltime 📦 ecosystem, in addition to offering some functionality for spatial data and visualization.
data data-analysis data-science data-visualization forecasting r statistics time-series
Last synced: 07 Mar 2026
https://github.com/rsquaredacademy/xplorerr
Shiny apps for interactive data analysis, visualization and modeling.
data exploration r rstats shiny-apps statistics visualization
Last synced: 02 Jul 2025
https://github.com/dodoex/dodoex_v2_subgraph
Subgraphs to index data for DODOEX V2
assembly blockchain-technology data subgraph typescript
Last synced: 11 Apr 2025
https://github.com/47degrees/org
Easily create a webpage with your organization's open source projects
clojure clojurescript data github graphql react rum
Last synced: 11 Apr 2025
https://github.com/ssbuild/aigc_data
share data, prompt data, pretraining data
aigc-data data instruct llm open open-data pretraining prompt
Last synced: 24 Apr 2025
https://github.com/openschemas/schemaorg
python functions for applied use of schema.org
curation data schemaorg software
Last synced: 20 Feb 2026
https://github.com/sircryptic/autoexif
Want to remove sensitive data from photos, or even view it? Use autoexif to do that easily; no more remembering syntax with this user-friendly tool.
data data-analysis exif-data exif-data-extraction exif-interface exif-metadata exif-reader exif-remover exiftool image meta metadata osint osint-tool viewer
Last synced: 14 Apr 2025
https://github.com/theronione/cleaner.jl
A toolbox of simple solutions for common data cleaning problems.
Last synced: 24 Oct 2025
https://github.com/dsietz/test-data-generation
Test Data Generation
algorithm archconf data data-privacy generate json machine-learning markov-decision-processes nfjs privacy profile rust-lang testing
Last synced: 17 Mar 2025
https://github.com/itext/itext-pdfocr-java
pdfOCR is an iText add-on to recognize and extract text in scanned documents and images. It can also convert them into fully ISO-compliant PDF or PDF/A-3u files that are accessible, searchable, and suitable for archiving
archival character data diacritic extractable glyphs hindi image iso-compliant ligatures mandarin ocr optical pdf portuguese recognition scan searchable spanish tesseract
Last synced: 09 Jan 2026
https://github.com/wchatx/direct-access-py
Enverus Drillinginfo Direct Access Developer API Python Client
api data drillinginfo enverus gas oil python
Last synced: 27 Mar 2026
https://github.com/nagix/ukraine-livecams
Ukraine live camera 3D map
data mapping open-data ukraine-invasion
Last synced: 03 Mar 2026
https://github.com/theoyinbooke/30days-of-learning-data-analysis-using-power-bi-for-students
This is a 30-day guided learning journey for beginner data analysts using Microsoft Power BI
data data-analysis powerbi powerbi-desktop powerbi-service
Last synced: 27 Feb 2026
https://github.com/artigraph/artigraph
Batteries included toolkit for data engineering.
Last synced: 14 Jan 2026
https://github.com/ropenspain/spanishoddata
Access national high-quality and open-access datasets on movement patterns derived from mobile telephone datasets / Accede y usa datos nacionales abiertos sobre movimientos basados en teléfonos móviles.
cdr data data-package mobile-telephone-data mobility origin-destination rstats
Last synced: 28 Apr 2025
https://github.com/smarie/pytest-patterns
A couple of examples showing how pytest and its plugins can be combined to solve real-world needs.
benchmark case concerns data decorator design file fixture incremental modular parameter parametrize pattern pytest result separate share state step test
Last synced: 20 Mar 2025
https://github.com/rbren/vizzy
Data Visualization with LLMs
chatgpt data data-visualization llm
Last synced: 07 May 2025
https://github.com/kristijorgji/goseeder
Go database seeder inspired from Laravel/Lumen seeder and more
data database go seeder seeders table test-seeds testing
Last synced: 14 May 2025
https://github.com/SirCryptic/autoexif
Want to remove sensitive data from photos, or even view it? Use autoexif to do that easily; no more remembering syntax with this user-friendly tool.
data data-analysis exif-data exif-data-extraction exif-interface exif-metadata exif-reader exif-remover exiftool image meta metadata osint osint-tool viewer
Last synced: 04 Mar 2025