data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum
- Last updated: 2026-02-04 00:07:41 UTC
https://github.com/thearyadev/top500-aggregator
A suite of tools and a web service to collect and provide data on the Overwatch 2 Top 500 leaderboards.
Last synced: 16 Jan 2026
https://github.com/michalporeba/odis
Search in decentralised systems: search federation, result moderation, aggregation, and feedback, with hypermedia in a ReSTful API to round it all off.
data data-discovery discoverability federated information-discovery mesh-networks search
Last synced: 18 Jan 2026
https://github.com/turbot/steampipe-export
Steampipe Export is a zero-ETL CLI to fetch data from cloud services and APIs. Hundreds of plugins with thousands of documented examples.
aws azure backup data devsecops etl gcp golang kubernetes security steampipe steampipe-engine zero-etl
Last synced: 31 Jul 2025
https://github.com/devscast/cd-data
Important background data for the creation of a solution for the DRC.
congo congo-kinshasa data data-science json rdata rdc rdc-data
Last synced: 06 Apr 2025
https://github.com/mara/mara-mondrian
A Python integration for the Saiku ad hoc analysis tool
adhoc-analysis data mara mondrian mondrian-olap-engine python reporting saiku
Last synced: 30 Apr 2025
https://github.com/casbin/confita
An open-source version of Kaggle written in Go and React
casbin casdoor conference data go javascript kaggle react
Last synced: 09 Aug 2025
https://github.com/nrennie/national-highways
R package for accessing the National Highways WebTRIS API via R.
Last synced: 14 Aug 2025
https://github.com/lchsk/sanchosql
SanchoSQL - Linux desktop PostgreSQL client
data database database-gui database-management desktop development editor linux linuxapps postgres postgresql sql
Last synced: 16 Aug 2025
https://github.com/mewmix/gh_llm_loader
Clone GitHub repositories and prepare their data for ingestion by LLMs.
context data data-structures github llm llm-training python
Last synced: 19 Sep 2025
https://github.com/wahyudesu/predicting-hotel-booking-cancellations
This project will help hotel managers optimize their booking policies, reduce cancellations, and improve revenue.
data data-analysis data-science python
Last synced: 07 Jul 2025
https://github.com/komed3/airportmap-database
Airportmap project structure & data
airports data database database-structure frequencies navaids runways
Last synced: 27 Jun 2025
https://github.com/globalgov/manydata
The portal for global governance data
Last synced: 16 Apr 2025
https://github.com/MohammedSardar/Bive
Bive is a Kurdish profanity language processing project.
data dataanalysis kurdish kurdish-corpus kurdish-dataset kurdish-language-processing kurdishdata kurdishnlp
Last synced: 07 May 2025
https://github.com/flor91/data-structures-and-algorithms
Theory and Implementation of Data Structures and Algorithms using Python
algorith data data-structures python
Last synced: 19 Apr 2025
https://github.com/lukasmosser/oklahomaproductiondata
A repository of Oklahoma oil and gas (O&G) production data in a machine-learnable format.
data data-mining energy machine-learning
Last synced: 03 Aug 2025
https://github.com/edoardottt/postgressql-db
Simple implementations of several PostgreSQL databases for practicing conceptual analysis of requirements, relational database design, and SQL queries.
data database pgplsql pgsql plsql postgres postgresql postgresql-database rdbm rdbms relational-databases sql
Last synced: 27 Oct 2025
https://github.com/chalk-ai/chalk-ts
TypeScript client for working with Chalk
chalk data feature-engineering pipelines typescript
Last synced: 17 Jan 2026
https://github.com/lens-vm/spec
LensVM specifications and ABI definition
abi data interoperability lenses schema transformations web-assembly
Last synced: 12 Jan 2026
https://github.com/datadesk/calfire-wildfires
Download wildfires data from CalFire
cli data data-journalism geojson journalism news python wildfires
Last synced: 05 Jan 2026
https://github.com/AurelienAubry/Spotlight
Spotlight is a Spotify dashboard that lets users visualize their listening habits.
backend bootstrap chartjs data data-analysis data-science data-visualization flask frontend javascript js pandas python python3 react react-bootstrap spotify
Last synced: 15 Apr 2025
https://github.com/ganeshkandu/kdbv
MySQL database automatic schema migration library
autodeploy automation composer data database database-migrations latest-version mariadb migrations-generator mysql mysql-database php schema seed update upgrade upgrade-tool version-changer version-control versioning
Last synced: 12 Oct 2025
https://github.com/colour-science/colour-mitsuba
Various resources for Mitsuba 3
color color-science color-space color-spaces colorspace colorspaces colour colour-science colour-space colour-spaces colourspace colourspaces data dataset datasets mitsuba spectral-data spectral-dataset spectral-datasets
Last synced: 21 Apr 2025
https://github.com/bloomberggraphics/2017-trump-llc-documents
Dataset of Trump LLC documents
Last synced: 16 Oct 2025
https://github.com/danieljdufour/xdim
Multi-Dimensional Functions. Create, Query, and Transform Multi-Dimensional Data.
array binary data dimensions format formatter functions image javascript js layout math multidimensional ndarray rearrange reorganize reshape shape theory
Last synced: 13 Jun 2025
https://github.com/emilyriederer/dbt-convo-covid
Demo repo with full code described in blog post
controlled-vocabulary data dbt sql variable-names
Last synced: 26 Oct 2025
https://github.com/glassflow/glassflow-python-sdk
GlassFlow Python SDK to publish and consume data to your pipelines at Glassflow.dev
data data-processing datastreaming python real-time sdk stream-processing
Last synced: 12 Oct 2025
https://github.com/nejdetkadir/turkish-taboo-words
Includes 500+ Turkish taboo words in several formats: JSON, XML, and YAML.
data json taboo taboo-word turkish words xml yaml yml
Last synced: 07 Sep 2025
https://github.com/jordan-iralde/probestojarvisai
JarvisIA is an autonomous artificial intelligence system designed to learn, optimize, and execute complex tasks without human intervention. It combines automation, voice cloning, facial recognition, and interaction with operating systems to create an adaptable, efficient AI capable of continuous improvement.
data ia-automator python react tensorflow
Last synced: 23 Jan 2026
https://github.com/dsdanielpark/arxiv2text
Converting PDF files to text, mainly with a focus on arXiv papers.
crawling data datamining translation
Last synced: 05 Sep 2025
https://github.com/klaudiosinani/binoheap
Binomial heaps for ES6
binomial data es6 heap structure typescript
Last synced: 12 Jun 2025
https://github.com/nrc-cnrc/nrc-gamma
Large labelled dataset of real-life gas meter images.
ai computer-vision creative-commons crowdsourcing data data-science datanalytics dataset iot machine-learning machinelearning opendata
Last synced: 01 Dec 2025
https://github.com/thomwright/balamb
🌱 Concurrently run a set of dependent, asynchronous tasks with type-safe dependencies
concurrent dag dags data data-seeding dependency-injection di seed seeding tasks
Last synced: 07 May 2025
https://github.com/cipherstash/protectjs
Encrypt and protect data using industry standard algorithms, field level encryption, a unique data key per record, bulk encryption operations, and decryption level identity verification. Powered by CipherStash Encryption.
data data-security encryption javascript postgres postgresql security typescript
Last synced: 29 Oct 2025
https://github.com/lifyzer/data-parser-system
Simple script that parses data from open source databases to the standard Lifyzer database structure
data data-parser databases food food-data health ingredients lifyzer nutrition parsed-data parser parses-data
Last synced: 09 Apr 2025
https://github.com/qzcool/cpef
Data query API for private equity fund managers. Chinese Private Equity Funds APIs.
china crawler data finance fund funds hedge-funds private-equity python python3 scraper scraping-websites spider
Last synced: 11 Jul 2025
https://github.com/dkv204p/c-programming
Welcome to the C-Programming repository! This repository is a comprehensive collection of resources, examples, and exercises for learning and mastering the C programming language.
algorithm and c c-enums c-file-handling c-functions c-programming c-programming-language c-structures c-tutorial data dsa dsa-in-c structure
Last synced: 28 Aug 2025
https://github.com/lukasmosser/geolink_dataset
Analysis notebooks for the geolink well log dataset
data datasets eda facies geology lithology machine-learning petrophysics wireline
Last synced: 15 Apr 2025
https://github.com/chrissimpkins/vectora
A Rust library for n-dimensional vector computation with real and complex scalar data
2d 3d complex-number data data-analysis floating-point integer math mathematics real-number rust rust-crate rust-lang rust-library scalar vector vector-computations vector-math
Last synced: 16 Oct 2025
https://github.com/klaudiosinani/shtack
LIFO Stacks for ES6
data es6 lifo stack structure typescript
Last synced: 24 Apr 2025
https://github.com/zdavatz/oddb2xml
oddb2xml creates XML files from Refdata, Swissmedic, and BAG XML files.
bag data drug open refdata ruby source swissmedic switzerland xml
Last synced: 19 Oct 2025
https://github.com/aminkhani/db
Database Tutorial
data database database-concepts database-systems database-tutorial db dbms mongodb mysql sql
Last synced: 11 Jul 2025
https://github.com/coryleach/unityinfotables
Unity package of ScriptableObjects for building static data info tables. Generates enum source code for easy access of table entry ids.
data enum scriptableobject scriptableobjects unity unity-plugin unity-scripts unity3d unity3d-plugin unitypackage
Last synced: 24 Oct 2025
https://github.com/soasis/encoding_tables
Shared tables between C and C++ for encoding infrastructure
Last synced: 09 Apr 2025
https://github.com/sl-solution/inmemorydatasetstutorial
A tutorial for working with InMemoryDatasets.jl.
data data-manipulation data-science data-wrangling dataset flight-data-analysis inmemorydatasets julia jupyter-notebook tutorial
Last synced: 22 Apr 2025
https://github.com/wdataorg/wdata
A database of multiple data sets that support drawing. The data sets include: world population, world carbon dioxide concentration, world number of cities, China population, China number of space vehicles, ...
chinese data database pip pypi python3
Last synced: 25 Mar 2025
https://github.com/realign/localstorage
Encapsulate a simple LocalStorage Class
Last synced: 07 Oct 2025
https://github.com/anuraganalog/try-every-ml-algorithm
Trying every Machine learning algorithm on a given dataset and measuring the efficiency.
accuracy algorithms analysis classification data deep-learning efficiency learning machine metrics neural-networks regression streamlit
Last synced: 04 Jul 2025
https://github.com/edoardottt/covid-19
Info/Data (global/Italy) about COVID-19. PRs welcome for other countries.
coronavirus coronavirus-disease coronavirus-info coronavirus-real-time coronavirus-tracking covid-19 covid-data data data-science data-visualization disease droplets epidemics epidemiology epidemiology-analysis pandemic sars-cov-2 spread symptoms virus
Last synced: 27 Oct 2025
https://github.com/rtmigo/tabular_dart
Dart library for displaying tabular data in a visually appealing ASCII table format.
ascii dart data flutter formatting github markdown prettytable pubdev readme spreadsheet table tabulate
Last synced: 10 Jul 2025
https://github.com/andredarcie/best-games-of-all-time-data-based
🏆 Definitive best games of all time, based on data from multiple sources
best critics data dataset game rank video-game video-games web-crawling web-scraping
Last synced: 28 Apr 2025
https://github.com/rririanto/unstructured-demo-streamlit
Extract your docs (CSV, PDF, JSON, HTML, DOCS, Sheets and more) for your own GPT and LLM projects using Unstructured.io via Streamlit
ai data data-extraction gpt unstructured unstructured-data
Last synced: 09 Apr 2025
https://github.com/orico/flexeegile
Extending Agile For AI & Data Teams
agile ai data data-science flexeegile methodology
Last synced: 08 Jan 2026
https://github.com/finos/legend-community-delta
Combining the best of open data standards with open source technologies
data database delta-lake spark
Last synced: 15 Oct 2025
https://github.com/dagshub/open-source-ml-datasets
This repository holds open source datasets for various machine learning domains with a link to download and use them
ai dagshub data data-engine dataset hacktoberfest hacktoberfest2023 machine-learning mlops open-source
Last synced: 19 Oct 2025
https://github.com/samedwardes/pydatafaker
A Python package to create fake data with relationships between tables.
data data-science fake-data python
Last synced: 23 Apr 2025
https://github.com/r3li4nt/datafaker
Fake data generator.
data datafaker hacking kali-linux pentesting python
Last synced: 15 Oct 2025
https://github.com/disruptek/skiplists
generic skip list implementations💃
collection data linked list nim search skip structure
Last synced: 09 Apr 2025
https://github.com/secnot/leaky_diode
Leaky diode is a data exfiltration test tool for data diodes.
cybersecurity data diodes exfiltration pentesting
Last synced: 17 Jan 2026
https://github.com/ikp4success/shopasource
The easiest way to find the lowest-priced products online.
async celery collection css data data-mining flask flask-sqlalchemy html javascript json postgresql python python3 quart scrapy spider spiders webscraper webscraping
Last synced: 07 Sep 2025
https://github.com/c3n7ral051nt4g3ncy/justtrakem
Sports Tracker Large Scale Profile Checker
chromedriver data intelligence opensourceforgood osint osint-python osint-tool python3 selenium sports tracking
Last synced: 08 May 2025
https://github.com/n1ghtf1re/map-of-emergency-incidents
Emergency Map lets you effectively visualize multi-dimensional information through an intuitive interface. The code is easy to modify for use in a variety of areas. Color mixing enhances the perception and analysis of information.
big-data big-data-analytics big-data-visualization bigdata color-mixing colors data data-analytics data-science data-visualization data-visualization-challenges data-visualization-simpler mysql open-source-project php student-project
Last synced: 18 Mar 2025
https://github.com/davidssmith/ra
RawArray file format reference implementation
c data data-structures dimensions hdf5 hdf5-format julia library matlab metadata python ra-format storage-container
Last synced: 07 May 2025
https://github.com/instafluff/coronavirus
COVID-19 Coronavirus Data Tracker
2019-ncov coronavirus covid-19 data ncov ncov-2019 sars-cov-2 wuhan
Last synced: 29 Oct 2025
https://github.com/epiverse-trace/linelist
R package for handling linelist data
data data-structures epidemiology epiverse outbreaks r r-package sdg-3 structured-data
Last synced: 06 May 2025
https://github.com/cloudposse/terraform-aws-dms
Terraform modules for provisioning and managing AWS DMS resources
data dms dynamodb migration mysql oracle postgres postgresql s3 sql sql-server
Last synced: 09 Sep 2025
https://github.com/akeneo/transporteo
Migration Tool for Akeneo PIM from 1.7 to 2.0
akeneo akeneo-pim data migration php symfony
Last synced: 29 Jul 2025
https://github.com/bilalhameed248/urdu-to-english-machine-translation
Fine-tuned a pretrained Urdu-to-English machine translation model using the Hugging Face Trainer API on a custom dataset.
bert bert-fine-tuning bert-model data data-preprocessing data-science deep-learning deep-neural-networks machine-translation pytorch seq2seq seq2seq-model seq2seq-tensorflow tensorflow
Last synced: 13 Apr 2025
https://github.com/netcorestack/datatransform
Sql2Sql or Sql2MongoDb Transform Tool
data data-transform database migration mongodb mssql nosql sql sql2mongo sql2mongodb sql2sql tooling transform
Last synced: 11 Apr 2025
https://github.com/motasimfoad/emr
“EMR” is a platform built using leading-edge web technologies and APIs to help doctors, patients, hospitals, and pharmacies better deal with medical documentation.
apollo data doctor emr graphcool graphql hospital medical patients pharmacy reactjs record yarn
Last synced: 10 Apr 2025
https://github.com/arverma/data_diode
A unidirectional network (also referred to as a unidirectional security gateway or data diode) is a network appliance or device that allows data to travel in only one direction. It is used to guarantee information security. Such devices are most commonly found in high-security environments such as defense, where they serve as connections between two or more networks of differing security classification, also known as a "cross-domain solution." This technology is also found at the industrial control level for such facilities as nuclear power plants, electric power generation/distribution, oil and gas production, water/wastewater, airplanes (between flight control units and in-flight entertainment systems), and manufacturing.
c client client-server client-server-architecture data data-diode diode networking server socket-programming
Last synced: 23 Aug 2025
https://github.com/civicdatalab/working-with-data-workshops
Learn to work with data.
data social-science training-materials workshop xaringan
Last synced: 20 Jun 2025
https://github.com/purarue/hpi-template
A cookiecutter template for creating an HPI repository
data lifelogging quantified-self
Last synced: 18 Mar 2025
https://github.com/modmuss50/cursemapper
A tool to make graphs and stuff from downloads on curse
curseforge data google-charts gradle graph javafx js kotlin php
Last synced: 18 Jul 2025
https://github.com/pysat/pysatmodels
Interface for model analysis and model-data comparisons within the pysat ecosystem
comparison data dineof model pysat python sami2 tie-gcm validation
Last synced: 22 Jul 2025
https://github.com/stas00/reddit-to-threads
Convert arctic_shift Reddit data dumps into thread-view documents
Last synced: 20 Mar 2025
https://github.com/texora/ssol-da
Data Availability for Solana Layer 2 Blockchain - SuperSol
data data-availability docs layer2 solana
Last synced: 10 Apr 2025
https://github.com/purarue/HPI-template
A cookiecutter template for creating an HPI repository
data lifelogging quantified-self
Last synced: 01 May 2025
https://github.com/chifisource/parsenoteval.jl
Expands the usage of Base.parse to work with more Base structures.
data data-structures evaluator julia parse parsing
Last synced: 13 Apr 2025
https://github.com/SamEdwardes/pydatafaker
A Python package to create fake data with relationships between tables.
data data-science fake-data python
Last synced: 09 Jul 2025