data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-06-20 00:07:41 UTC
- JSON Representation
https://github.com/kedarvj/mysql-random-data-generator
This is the easiest MySQL random test data generator tool. Load the procedure and execute to auto detect column types and load data.
data dataset dummy-data dummy-data-generator mysql procedure random-generation testdata testdatabuilder
Last synced: 16 Mar 2025
https://github.com/asad70/wallstreetbets-sentiment-analysis
This program finds the most mentioned ticker on r/wallstreetbets and uses Vader SentimentIntensityAnalyzer to calculate the sentiment analysis.
algotrading analysis data docker-container reddit sentiment-analysis trading vader-sentimentintensityanalyzer wallstreetbets wallstreetbets-sentiment-analysis
Last synced: 23 Oct 2025
https://github.com/DanielBok/copulae
Multivariate data modelling with Copulas in Python
conda copula copula-models copulae copulas data data-analysis dependency-analysis dependency-modeling modeling pypi pypi-packages python python3 statistics
Last synced: 26 Mar 2025
https://github.com/Appsilon/data.validator
validate your data and create nice reports straight from R
data r reporting rhinoverse rstudio validation
Last synced: 06 May 2025
https://github.com/openstates/people
Curated information on all state legislators & governors.
data hacktoberfest legislators united-states-data
Last synced: 15 Apr 2026
https://github.com/Tauffer-Consulting/domino
User friendly and open source platform for workflow creation and monitoring
ai airflow containers data gui kubernetes open-source python workflows
Last synced: 22 Apr 2025
https://github.com/ingoscholtes/pathpy
pathpy is an OpenSource python package for the modeling and analysis of pathways and temporal networks using higher-order and multi-order graphical models
analysis data data-mining graph graphical-models machine-learning model-selection multi-order network-analysis networks pathways python sequential-data temporal-correlations temporal-networks
Last synced: 25 Sep 2025
https://github.com/leinelissen/aeon
๐ก Scan the internet for your personal information and modify or remove it
Last synced: 05 Apr 2025
https://github.com/robjhyndman/fpp3
All data sets required for the examples and exercises in the book "Forecasting: principles and practice" (3rd ed, 2020) by Rob J Hyndman and George Athanasopoulos <http://OTexts.org/fpp3/>. All packages required to run the examples are also loaded.
Last synced: 13 Feb 2026
https://github.com/kirlovon/aloedb
Light, Embeddable, NoSQL database for Deno ๐ฆ
data database datastore db deno embeddable embedded embedded-database javascript json light nosql nosql-database typescript
Last synced: 08 May 2025
https://github.com/Kirlovon/AloeDB
Light, Embeddable, NoSQL database for Deno ๐ฆ
data database datastore db deno embeddable embedded embedded-database javascript json light nosql nosql-database typescript
Last synced: 13 May 2025
https://github.com/petera2c/simple-table
Lightweight React data grid/table for fast, modern web apps
component data grid javascript library react simple typescript
Last synced: 02 Feb 2026
https://github.com/bps-statistics/stadata
STADATA is a Python package that simplifies access to statistical data provided by BPS - Statistics Indonesia
data data-analytics data-science national-statistics nso official-statistics python python-package statistics
Last synced: 14 Oct 2025
https://github.com/gbif/pygbif
GBIF Python client
api biodiversity data ecology gbif
Last synced: 17 Mar 2026
https://github.com/expo/entity
Entity is a privacy-aware data layer for defining, caching, and authorizing access to application data models.
Last synced: 02 Mar 2026
https://github.com/infinum/datx
DatX is an opinionated JS/TS data store. It features support for simple property definition, references to other models and first-class TypeScript support.
collection data data-store hacktoberfest javascript mobx model open-source prop state-management
Last synced: 23 Mar 2025
https://github.com/dbt-labs/jaffle_shop_duckdb
Get started with dbt in less than 1 minute from `git clone` to `dbt docs serve` for free!
Last synced: 07 Apr 2025
https://github.com/rello/analytics
Analytics - Open source data warehouse and reporting for Nextcloud
analytics data data-warehouse datasources nextcloud visualization
Last synced: 18 Feb 2026
https://github.com/cipherstash/stack
Data level access controls via field level searchable encryption SDK for JavaScript/TypeScript
data data-security encryption encryption-sdk javascript postgres postgresql searchable-encryption security typescript
Last synced: 18 May 2026
https://github.com/odota/mobile
React Native apps for viewing Dota 2 data on Android/iOS
android data dota ios mobile react-native
Last synced: 21 Apr 2025
https://github.com/royal-statistical-society/datavisguide
Introductory guide to the art and science of data visualisation. Insights, advice, and examples (with code) to make data outputs more readable, accessible, and impactful.
charts data graphics tables visualization
Last synced: 27 Mar 2025
https://github.com/sheetjs/js-adler32
:ballot_box_with_check: ADLER-32 checksum
Last synced: 12 Apr 2025
https://github.com/Kirlovon/aloedb
Light, Embeddable, NoSQL database for Deno ๐ฆ
data database datastore db deno embeddable embedded embedded-database javascript json light nosql nosql-database typescript
Last synced: 06 Aug 2025
https://github.com/noahgift/cloud-data-analysis-at-scale
[Course-2020-2023] taught at Duke MIDS. This is also a Coursera Course that covers MLOps, ML Engineering and the foundations of Cloud Computing for Data Science.
analytics cloud data duke github hugging huggingface machine-learning mids syllabus
Last synced: 18 Feb 2026
https://github.com/ls9512/ubind
UBind is a value binding component for Unity, which is used to quickly realize the association binding between UI and logical data.
data databind databinding databinding-library datadriven framework u3d ugui uguicomponent ui unity unity3d unity3d-plugin unityplugin
Last synced: 09 Oct 2025
https://github.com/raystack/guardian
Guardian is universal data access management tool with automated access workflows and security controls across data stores, analytical systems, and cloud products.
access compliance control data dataops
Last synced: 24 Jul 2025
https://github.com/hadley/babynames
An R package containing US baby names from the SSA
Last synced: 05 Apr 2025
https://github.com/Meeshkan/micro-jaymock
Tiny API mocking microservice for generating fake JSON data.
data fake fake-data micro microservice test-data testing
Last synced: 07 May 2025
https://github.com/meeshkan/micro-jaymock
Tiny API mocking microservice for generating fake JSON data.
data fake fake-data micro microservice test-data testing
Last synced: 13 Mar 2025
https://github.com/glotzerlab/signac
Manage large and heterogeneous data spaces on the file system.
data data-management database reproducibility
Last synced: 08 Apr 2025
https://github.com/siriusastrebe/jsynchronous
Jsynchronous.js - Data synchronization for games and real-time web apps.
data games javascript nodejs realtime synchronization websockets
Last synced: 10 Apr 2025
https://github.com/c-3lab/dim
๐ฆ dim: Manage the open data in your project like a package manager.
cli commads command-line-tool data dataops dim gpt gpt-3 llm opendata package-manager public-data public-dataset
Last synced: 17 Jan 2026
https://github.com/quickbirdeng/survey_kit
Flutter library to create beautiful surveys (aligned with ResearchKit on iOS)
clinical-trials dart data flutter flutter-library flutter-package flutter-plugin flutter-ui flutter-widget forms ios-researchkit-surveys medical poll research study-conduct survey
Last synced: 01 Apr 2026
https://github.com/stethoscope-js/stethoscope
๐ฉบ Track, visualize, and embed your health and life data โ location, health, work, play, and more
automation clockify data github-actions goodreads google-fit last-fm life oura oura-ring pocket-casts rescue-time spotify time-tracking wakatime
Last synced: 22 Sep 2025
https://github.com/ali1k/ld-r
Linked Data Reactor (LD-R)
adaptive authoring browse components data javascript ld-reactor linked react visualization
Last synced: 15 Feb 2026
https://github.com/decileapp/decile
Simple, open-source analytics tool for any Postgres database.
analytics automation business dashboard data intelligence postgres reporting sql sql-editor
Last synced: 02 Aug 2025
https://github.com/bitquery/widgets
Widgets for blockchain data visualizations
analytics blockchain data graphql statistics widgets
Last synced: 23 Jun 2025
https://github.com/zero-one-group/fxl
fxl is a Clojure spreadsheet library
clojure-library data data-oriented excel functional-programming spreadsheet xlsx
Last synced: 11 Apr 2025
https://github.com/ghiscoding/aurelia-slickgrid
Aurelia-Slickgrid is a wrapper of the lightning fast & customizable SlickGrid datagrid, it also includes multiple Styling Themes
aurelia bootstrap bootstrap4 bootstrap5 data datagrid datatable graphql grid odata slickgrid treeview typescript
Last synced: 06 Apr 2025
https://github.com/QuickBirdEng/survey_kit
Flutter library to create beautiful surveys (aligned with ResearchKit on iOS)
clinical-trials dart data flutter flutter-library flutter-package flutter-plugin flutter-ui flutter-widget forms ios-researchkit-surveys medical poll research study-conduct survey
Last synced: 02 Apr 2025
https://github.com/airbytehq/airbyte-agent-sdk
๐ Drop-in tools that give AI agents reliable, permission-aware access to external systems.
ai ai-agents airbyte anthropic connectors data enterprise gemini integrations langchain llm mcp open-source openai pydantic-ai rag
Last synced: 19 Jun 2026
https://github.com/globeandmail/startr
A template for data journalism in R
data data-analysis data-journalism data-visualization journalism news r
Last synced: 26 Jun 2025
https://github.com/miniql/miniql
A tiny JSON-based query language inspired by GraphQL
data data-analysis data-science graphql javascript json queries query query-language typescript
Last synced: 15 Apr 2025
https://github.com/ls9512/UBind
UBind is a value binding component for Unity, which is used to quickly realize the association binding between UI and logical data.
data databind databinding databinding-library datadriven framework u3d ugui uguicomponent ui unity unity3d unity3d-plugin unityplugin
Last synced: 25 Apr 2025
https://github.com/mdzio/ccu-historian
Der CCU-Historian erfasst die Betriebsdaten des Hausautomations-Systems HomeMatic der Firma eQ-3.
aquisition ccu ccu-historian data homematic pims smarthome
Last synced: 19 Jul 2025
https://github.com/propeldata/ui-kit
React components for data visualization and quickly building data analytics dashboards
analytics dashboards data datavisualization react reactjs
Last synced: 07 Apr 2026
https://github.com/Canner/wren-engine
๐ค The semantic engine for LLMs, bringing semantic context to AI agents. ๐ฅ
business-intelligence data data-analysis data-analytics data-lake data-warehouse hacktoberfest llm semantic semantic-layer sql
Last synced: 01 Apr 2025
https://github.com/gita/gita
Bhagavad Gita in JSON
bhagavad-gita bhagavad-gita-api data dataset dharma gita hacktoberfest hacktoberfest-accepted hindu hinduism india json raw-data sanatandharma sanskrit
Last synced: 11 May 2025
https://github.com/nicohlr/ipychart
The power of Chart.js with Python
charting-library chartjs charts data data-analysis data-science data-visualization ipywidgets javascript-es6 jupyter jupyter-notebook notebook python
Last synced: 09 Apr 2025
https://github.com/cially/cially
๐ชผCially is a powerful, open-source dashboard designed to provide in-depth insights, real-time analytics, and detailed statistics for your Discord server. Monitor member activity, track engagement trends, and make data-driven decisions with ease.
analytics api dashboard data discord discord-bot discord-js express expressjs full-stack nextjs pocketbase website
Last synced: 20 Jan 2026
https://github.com/pierridotite/stonks-dashboard
A cyberpunk-style, real-time financial monitor that runs directly in the terminal. It visualizes cryptocurrency and stock market data using ASCII charts and dynamic coloring.
cli dashboard data finance market market-data real-time stocks terminal trading tui
Last synced: 29 Dec 2025
https://github.com/the-osint-toolbox/people-search-osint
Search tools to help you find people, focused towards UK resources.
data data-aggregation osint people privacy search searching
Last synced: 15 Feb 2026
https://github.com/kayak/fireant
Data analysis and reporting tool for quick access to custom charts and tables in Jupyter Notebooks and in the shell.
aggregated analysis analytics bi builder business data database fireant juypter mysql oracle pandas postgres python query rdbms science sql vertica
Last synced: 19 Jul 2025
https://github.com/juliaearth/geospatial-data-science-with-julia
Geospatial Data Science with Julia
book computational data geo geometry geospatial geostatistics julia science statistics
Last synced: 30 Jan 2026
https://github.com/jasonkuhrt/alge
Type safe library for creating Algebraic Data Types (ADTs) in TypeScript. ๐ฑ
adt algebraic-data-types data typescript
Last synced: 16 May 2025
https://github.com/ropensci/spocc
Species occurrence data toolkit for R
antweb api bison data ebird ecoengine gbif idigbio inaturalist obis occurrence r r-package rstats species species-occurrence spocc vertnet
Last synced: 13 Jul 2025
https://github.com/piersyork/owidR
An R Package for Importing Data from Our World in Data
data data-visualisation economics r r-package
Last synced: 13 Jul 2025
https://github.com/data-engineering-community/data-engineering-project-template
This is a template you can use for your next data engineering portfolio project.
data data-engineering data-warehouse etl etl-pipeline python sql
Last synced: 06 Apr 2025
https://github.com/codeclimate/platform
Code Climate Engineering Data Platform
analytics code-quality codeclimate data specification
Last synced: 05 Mar 2026
https://github.com/chesslablab/php-chess
A chess library offering move validation, common formats, multiple variants, UCI engine support, explanation of chess positions, image recognition, PGN parsing and knowledge extraction from games.
board chess data database db fen game machinelearning ml pgn php
Last synced: 06 Mar 2025
https://github.com/autoviml/pandas_dq
Find data quality issues and clean your data in a single line of code with a Scikit-Learn compatible Transformer.
data data-science dataquality dataqualitycheck machine-learning pandas python scikit-learn
Last synced: 07 Apr 2025
https://github.com/piersyork/owidr
An R Package for Importing Data from Our World in Data
data data-visualisation economics r r-package
Last synced: 10 Apr 2025
https://github.com/kaarthik108/snowbrain
snowBrain - AI-Driven Insights with Snowflake (New version- https://github.com/kaarthik108/snowbrain-AGUI)
chatgpt data fastapi langchain nextjs pinecone snowflake tailwindcss
Last synced: 17 Mar 2025
https://github.com/airbytehq/airbyte-agent-connectors
๐ Drop-in tools that give AI agents reliable, permission-aware access to external systems.
ai ai-agents airbyte anthropic connectors data enterprise gemini integrations langchain llm mcp open-source openai pydantic-ai rag
Last synced: 09 Apr 2026
https://github.com/DataKitchen/data-observability-installer
Installer for DataKitchen's Open Source Data Observability Products. Data breaks. Servers break. Your toolchain breaks. Ensure your team is the first to know and the first to solve with visibility across and down your data estate. Save time with simple, fast data quality test generation and execution. Trust your data, tools, and systems end to end.
data data-engineering data-observability data-profiling data-quality data-reliability data-science datachecker datacleaner datacleaning dataops dataquality datatesting datavalidation mssql pipeline-tests postgresql redshift self-hosted snowflake
Last synced: 05 May 2025
https://github.com/MaJerle/lwpkt
Lightweight packet protocol structure for multi-device communication focused on RS-485
command data devices length master microcontroller multi-slaves optimization packet packet-structure protocol rs-485 simple stm32
Last synced: 14 May 2025
https://github.com/dsebastien/obsidian-life-tracker-base-view
Capture and visualize the data that matters in your life
chartjs charts charts-and-graphs daily dailynotes data data-analysis data-visualization journaling life-os life-tracking obsidian obsidian-plugin obsidian-plugins
Last synced: 13 Jan 2026
https://github.com/majerle/lwpkt
Lightweight packet protocol structure for multi-device communication focused on RS-485
command data devices length master microcontroller multi-slaves optimization packet packet-structure protocol rs-485 simple stm32
Last synced: 04 Mar 2025
https://github.com/postlight/account
๐๏ธ โ ๐ข Tell little stories with numbers
business-models data data-journalism diagonal-hamburger expr-eval journalism labs parsimmon react sliders spreadsheet storytelling
Last synced: 06 Aug 2025
https://github.com/anna-geller/dataflow-ops
Project demonstrating how to automate Prefect 2.0 deployments to AWS ECS Fargate
analytics analytics-engineering automation aws cicd data data-engineering data-engineering-infrastructure data-engineering-pipeline data-science dataflow dataflow-ops infrastructure-as-code observability orchestration pipeline prefect python serverless
Last synced: 24 Mar 2025
https://github.com/Rello/analytics
Analytics - Open source data warehouse and reporting for Nextcloud
analytics data data-warehouse datasources nextcloud visualization
Last synced: 03 Apr 2025
https://github.com/aahouzi/Instagram-Scraper-2021
Scrape Instagram content and stories, using a new technique based on the har file (No Token + No public API).
browsermob-proxy data facebook facebook-graph-api graphql-api instagram instagram-api instagram-bot instagram-crawler instagram-feed instagram-scraper instagram-stories meta scraper selenium webscraping
Last synced: 15 Mar 2025
https://github.com/bps-statistics/form-gear
FormGear is a framework engine for dynamic form creation and complex form processing and validation for data collection.
census data data-collection form-builder form-engine form-generator national-statistics official-statistics survey survey-builder survey-form
Last synced: 13 Oct 2025
https://github.com/cosmo/binarykit
๐พ๐๐งฎ BinaryKit helps you to break down binary data into bits and bytes, easily access specific parts and write data to binary.
binary binary-data bit bits byte bytes data hacktoberfest nsdata swift
Last synced: 27 Feb 2026
https://github.com/Cosmo/BinaryKit
๐พ๐๐งฎ BinaryKit helps you to break down binary data into bits and bytes, easily access specific parts and write data to binary.
binary binary-data bit bits byte bytes data hacktoberfest nsdata swift
Last synced: 16 Mar 2025
https://github.com/aahouzi/instagram-scraper-2021
Scrape Instagram content and stories, using a new technique based on the har file (No Token + No public API).
browsermob-proxy data facebook facebook-graph-api graphql-api instagram instagram-api instagram-bot instagram-crawler instagram-feed instagram-scraper instagram-stories meta scraper selenium webscraping
Last synced: 27 Oct 2025
https://github.com/ardalis/dotnetdataaccesstour
A tour of different data access approaches in .NET 8+.
dapper data design-patterns dotnet dotnet-core entity-framework-core repository repository-pattern specification sql-server
Last synced: 19 Jun 2025
https://github.com/bartobri/data-structures-c
A collection of algorithms for data structure manipulation in C
Last synced: 26 Feb 2026
https://github.com/datakitchen/data-observability-installer
Installer for DataKitchen's Open Source Data Observability Products. Data breaks. Servers break. Your toolchain breaks. Ensure your team is the first to know and the first to solve with visibility across and down your data estate. Save time with simple, fast data quality test generation and execution. Trust your data, tools, and systems end to end.
data data-engineering data-observability data-profiling data-quality data-reliability data-science datachecker datacleaner datacleaning dataops dataquality datatesting datavalidation mssql pipeline-tests postgresql redshift self-hosted snowflake
Last synced: 06 Apr 2026
https://github.com/joelgmsec/invoke-dnsteal
Simple & Customizable DNS Data Exfiltrator
data delay dns domain exfiltrator fake random tcp udp
Last synced: 10 Mar 2026
https://github.com/dbt-labs/dbt-meshify
A dbt-core python package that automates the management and creation of dbt groups, contracts, access, and versions.
Last synced: 09 Apr 2025
https://github.com/nberlette/canbus
Controller Area Network (CAN) reference, wiki, and DBC files.
automotive bmw can can-bus canbus data database dbc networking vector wiki
Last synced: 21 Mar 2025
https://github.com/DataWithBaraa/sql-data-analytics-project
This repository contains a collection of SQL scripts demonstrating various analytical techniques, such as changes over time, cumulative, performance, data segmentation, part-to-whole analysis.
analytics business-analytics business-intelligence data data-analysis data-analyst data-analytics data-engineering data-science data-scientist database datascience query reporting sql sql-queries sql-query sql-server window-functions window-functions-in-sql
Last synced: 14 Oct 2025
https://github.com/anchore/vunnel
Tool for collecting vulnerability data from various sources (used to build the grype database)
data grype hacktoberfest vulnerability
Last synced: 14 Feb 2026
https://github.com/JoelGMSec/Invoke-DNSteal
Simple & Customizable DNS Data Exfiltrator
data delay dns domain exfiltrator fake random tcp udp
Last synced: 12 Jul 2025