data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-01-21 00:07:59 UTC
- JSON Representation
https://github.com/theseus-rs/rsql
Command line SQL interface for relational databases and common data file formats
cockroachdb command-line csv data database duckdb json mariadb mysql parquet postgres postgresql redshift snowflake sql sqlite sqlite3 sqlserver
Last synced: 16 May 2025
https://github.com/MLWhiz/data_science_blogs
A repository to keep track of all the code that I end up writing for my blog posts.
blogging chatbot data datascience gan graphs machine-learning mcmc python spark streamlit time-series xgboost
Last synced: 05 May 2025
https://github.com/mlwhiz/data_science_blogs
A repository to keep track of all the code that I end up writing for my blog posts.
blogging chatbot data datascience gan graphs machine-learning mcmc python spark streamlit time-series xgboost
Last synced: 06 Apr 2025
https://github.com/artus9033/chartjs-plugin-dragdata
Draggable data points plugin for Chart.js
Last synced: 12 Apr 2025
https://github.com/przemek83/volbx
Graphical tool for data manipulation written in C++/Qt.
c-plus-plus c-plus-plus-17 cpp cpp17 csv data data-analysis data-export data-filtering data-import data-manipulation data-visualization dynamic graphical ods plots qt spreadsheet statistical-analysis xlsx
Last synced: 08 Apr 2025
https://github.com/dat-ecosystem-archive/datBase
Open data sharing powered by Dat [ DEPRECATED - More info on active projects and modules at https://dat-ecosystem.org/ ]
dat data datproject p2p registry search sharing
Last synced: 03 Apr 2025
https://github.com/jldbc/coffee-quality-database
Building the Coffee Quality Institute Database
agriculture coffee data data-science dataset
Last synced: 09 Apr 2025
https://github.com/graphia-app/graphia
A visualisation tool for the creation and analysis of graphs
analysis data data-analysis data-science data-visualization graphs interpretation networks visualisation visualization
Last synced: 02 Apr 2025
https://github.com/awslabs/amazon-s3-find-and-forget
Amazon S3 Find and Forget is a solution to handle data erasure requests from data lakes stored on Amazon S3, for example, pursuant to the European General Data Protection Regulation (GDPR)
amazon-s3 aws big-data ccpa data data-erasure data-lake gdpr parquet privacy right-to-be-forgotten s3
Last synced: 04 Apr 2025
https://github.com/OpenIntroStat/openintro
📦 R package for data and supplemental functions for OpenIntro resources
data openintro rstats rstats-package
Last synced: 30 Jul 2025
https://github.com/shukkkur/volleyvision
Applying Deep Learning Approaches to Volleyball Data
action-recognition ball-tracking court-detection data dataset deep-learning event-detection group-activity-recognition machine-learning object-detection object-tracking roboflow tracking volleyball volleyball-games volleyball-tracking
Last synced: 12 Oct 2025
https://github.com/stocknear/frontend
UI of stocknear - Open Source Stock Analysis
ai community data data-science finance machine-learning memes pocketbase prediction stock-market svelte sveltekit
Last synced: 12 Apr 2025
https://github.com/www-zerocode-net-cn/ERD-Online
ERD Online is an online collaborative data warehouse design software. It does not need to install applications locally and operate databases online. It is an excellent alternative to desktop data modeling tools.
bigdata collaborative data database design erd java lowcode metadata nocode online sql
Last synced: 24 Mar 2025
https://github.com/Synthoid/ExportSheetData
Add-on for Google Sheets that allows sheets to be exported as JSON or XML.
data esd google-sheets json tools xml
Last synced: 14 Mar 2025
https://github.com/anaconda/anaconda-project
Tool for encapsulating, running, and reproducing data science projects
anaconda conda-environment data datascience encapsulation reproducibility running
Last synced: 11 Dec 2025
https://github.com/data-dot-all/dataall
A modern data marketplace that makes collaboration among diverse users (like business, analysts and engineers) easier, increasing efficiency and agility in data projects on AWS.
aws aws-glue aws-lake-formation aws-s3 data data-science etl-framework lakeformation lakehouse redshift
Last synced: 29 Jul 2025
https://github.com/everypolitician/everypolitician-data
data for national legislatures worldwide
civic-tech data dataset everypolitician political politicians politics ruby
Last synced: 27 Jul 2025
https://github.com/koldlight/curso-python-analisis-datos
Curso de python básico orientado al análisis de datos, en español
course data data-analysis folium hacktoberfest numpy pandas python requests seaborn spanish
Last synced: 12 Apr 2025
https://github.com/RTradeLtd/Temporal
☄️ Temporal is an easy-to-use, enterprise-grade interface into distributed and decentralized storage
data ethereum ethereum-swarm golang i2p infrastructure ipfs ipfs-cluster ipns pinning storage swarm temporal
Last synced: 06 Apr 2025
https://github.com/rtradeltd/temporal
☄️ Temporal is an easy-to-use, enterprise-grade interface into distributed and decentralized storage
data ethereum ethereum-swarm golang i2p infrastructure ipfs ipfs-cluster ipns pinning storage swarm temporal
Last synced: 30 Sep 2025
https://github.com/harinij/100daysofcode
#100DaysOfCode - Learn by developing 100 unique apps to explore exciting tech stacks
100daysofcode ai appdev coding-challenge data developer-challenge learning-by-doing machine-learning opensource reactjs
Last synced: 17 Jun 2025
https://github.com/eurostat/gridviz
A library for visualizing gridded data 🌐
cartography census csv d3 data data-analysis data-science data-visualization datascience geospatial gis gridded-statistics grids map map-making mapping mapping-tools maps visualization webgl
Last synced: 24 Dec 2025
https://github.com/vinyzu/chrome-fingerprints
A Collection of 10.000 collected Windows Chrome Fingerprints. Usable with an easy-to-use API, available as a compressed (lzma) or full-size Json (view Releases). Its just 1.4mb in size in compressed form, and fast in read times.
automation botright browser data dataset fingerprinting fingerprints fingerprints-generator playwright
Last synced: 16 May 2025
https://github.com/bcapathshala/dsa-supreme-2-0-notes
DATA STRUCTURE USING CPP NOTES
algorithms cpp data data-structures dsa
Last synced: 12 Apr 2025
https://github.com/datadesk/california-coronavirus-data
The Los Angeles Times' open-source archive of California coronavirus data
altair binder coronavirus covid csv data data-journalism journalism jupyter news pandas python
Last synced: 06 Apr 2025
https://github.com/nycdb/nycdb
Database of NYC Housing Data
civic-data data database housing nyc open-data psql python3
Last synced: 15 May 2025
https://github.com/ronellsicat/DxR
DXR is a Unity package for rapid prototyping of immersive data visualizations in augmented, mixed, and virtual reality (AR, MR, VR) or XR for short.
ar augmented charts data graphs hololens immersive mixed mixed-reality reality unity virtual visualization vr
Last synced: 01 Apr 2025
https://github.com/hiyali/vue-smooth-picker
🏄🏼 A SmoothPicker for Vue 2 (like native datetime picker of iOS)
awesome data datetime picker smooth vue vue-picker vue2
Last synced: 13 Apr 2025
https://github.com/mendableai/firecrawl-app-examples
🔥 This repository contains complete application examples, including websites and other projects, developed using Firecrawl.
ai ai-scraping data examples html-to-markdown llm markdown rag scrapers templates web-crawler
Last synced: 13 Apr 2025
https://github.com/mahmoudparsian/data-algorithms-with-spark
O'Reilly Book: [Data Algorithms with Spark] by Mahmoud Parsian
algorithms bigdata data data-abstractions data-algorithms data-transformation dataframes design design-patterns machine-learning mappers mapreduce monoid partitioning-algorithms pyspark python rdd reducers spark transformations
Last synced: 07 Apr 2025
https://github.com/umitkaanusta/reddit-detective
Play detective on Reddit: Discover political disinformation campaigns, secret influencers and more
analysis analytics api data database elt etl graph graph-database neo4j network politics reddit social social-media social-network
Last synced: 06 Apr 2025
https://github.com/telefonicaid/fiware-orion
Context Broker and CEF building block for context data management, providing NGSI interfaces.
context-information-management data fiware fiware-ngsi fiware-orion orion-context-broker
Last synced: 04 Apr 2025
https://github.com/bitol-io/open-data-contract-standard
Home of the Open Data Contract Standard (ODCS).
data data-contract data-contracts data-engineering data-mesh data-quality
Last synced: 18 Jul 2025
https://github.com/dataplane-app/dataplane
Dataplane is an Airflow inspired unified data platform with additional data mesh and RPA capability to automate, schedule and design data pipelines and workflows. Dataplane is written in Golang with a React front end.
airflow data data-analysis data-engineering data-integration data-pipelines data-science dataplane datawarehouse etl finance golang kubernetes pipelines robotics-process-automation rpa scheduler workflow workflow-automation workflows
Last synced: 27 Dec 2025
https://github.com/K3V1991/Disable-Firefox-Telemetry-and-Data-Collection
How to disable Firefox Telemetry and Data Collection
blocking browser config data data-collection disable firefox how-to list mozilla mozilla-firefox off options privacy reporting security server settings telemetry tutorial
Last synced: 13 Apr 2025
https://github.com/tirendazacademy/pandas-tutorial
Jupyter Notebooks and Data Sets for Pandas Library
data data-analysis data-preprocessing data-science machine-learning pandas pandas-dataframe pandas-datareader pandas-library pandas-python pandas-series pandas-tricks-for-data-manipulation pandas-tutorial python
Last synced: 06 Apr 2025
https://github.com/iterative/vscode-dvc
Machine learning experiment tracking and data versioning with DVC extension for VS Code
data data-science dvc machine-learning python visual-studio-code vscode vscode-extension
Last synced: 18 Jun 2025
https://github.com/openfintechio/openfintech
Opensource FinTech standards & payment provider data
collaboration community data finance fintech images json openfintech payment payment-gateway payment-integration payment-methods payment-team payments payments-data payments-domain payments-hub payout standard standards
Last synced: 17 Jan 2026
https://github.com/storieswithsiva/Data-Science-Resources
👨🏽🏫You can learn about what data science is and why it's important in today's modern world. Are you interested in data science?🔋
artificial-intelligence artificial-neural-networks data data-analysis data-analytics data-mining data-science data-science-resource data-science-resources data-scientist data-scientists data-visualization data-world datascience dataset learning learning-kit machine-learning python repository
Last synced: 10 Apr 2025
https://github.com/robmsmt/ASR-Audio-Data-Links
A list of publically available audio data that anyone can download for ASR or other speech activities
asr audio-data data speech speech-activities speech-recognition speech-to-text
Last synced: 11 Mar 2025
https://github.com/silverton-io/buz
Serverless multi-protocol + multi-destination event collection system.
analytics analytics-tracking cloudevents cloudevents-schema contracts data data-collection data-platform eventbridge jsonschema product-analytics redpanda redpanda-console schema-registry schema-validation snowplow-analytics streaming-analytics streaming-data webhook-receiver webhook-server
Last synced: 12 Apr 2025
https://github.com/felikcat/unlimited-hotspot
Remove speed restrictions on your hotspot internet (iOS, iPadOS, Android, Quectel), and allows hotspots on any plan (rooted Android & Quectel only).
android bypass-throttling cellphone data hotspot hotspot-wifi internet ios linux macos mobile phone qualcomm quectel tablet tether tethering unlimited unlimited-data windows
Last synced: 05 Apr 2025
https://github.com/robmarkcole/hass-data-detective
Explore and analyse your Home Assistant data
data data-science home home-assistant home-automation
Last synced: 16 May 2025
https://github.com/Swirrl/grafter
Linked Data & RDF Manufacturing Tools in Clojure
clojure data etl grafter linked-data rdf semantic-web
Last synced: 02 Apr 2025
https://github.com/swirrl/grafter
Linked Data & RDF Manufacturing Tools in Clojure
clojure data etl grafter linked-data rdf semantic-web
Last synced: 04 Apr 2025
https://github.com/mirador/mirador
Tool for visual exploration of complex data.
data exploratory-data-analysis tabular-data visualization
Last synced: 17 Jan 2026
https://github.com/robmarkcole/HASS-data-detective
Explore and analyse your Home Assistant data
data data-science home home-assistant home-automation
Last synced: 06 Apr 2025
https://github.com/not-a-bank/open-banking-tracker-data
The open banking API directory
api bank banking data finance fintech institutions obie openbanking psd2 tracker
Last synced: 20 Jul 2025
https://github.com/igorkamyshev/farfetched
The advanced data fetching tool for web applications
async data data-fetching effector fetch
Last synced: 07 Apr 2025
https://github.com/apache/texera
Collaborative Machine-Learning-Centric Data Analytics Using Workflows
artificial-intelligence data data-analytics data-science machine-learning texera workflow
Last synced: 15 Dec 2025
https://github.com/easystats/datawizard
Magic potions to clean and transform your data 🧙
data dplyr hacktoberfest janitor manipulation r-package reshape rstats tidyr wrangling
Last synced: 04 Apr 2025
https://github.com/streamthoughts/azkarra-streams
🚀 Azkarra is a lightweight java framework to make it easy to develop, deploy and manage cloud-native streaming microservices based on Apache Kafka Streams.
apache-kafka azkarra-streams cloud-native data interactive-queries java kafka kafka-streams micro-framework microservices webui
Last synced: 19 Aug 2025
https://github.com/criccomini/awesome-infra
A curated list of infrastructure projects and companies.
ai awesome awesome-list data database infrastructure ml stream-processing streaming workflow
Last synced: 24 Jul 2025
https://github.com/taleshape-com/shaper
Build Data Dashboards all in SQL. Powered by DuckDB.
analytics dashboards data duckdb
Last synced: 12 Jan 2026
https://github.com/HubSpot/general-store
Simple, flexible store implementation for Flux. #hubspot-open-source
data dispatcher flux hubspot javascript react store
Last synced: 31 Mar 2025
https://github.com/hubspot/general-store
Simple, flexible store implementation for Flux. #hubspot-open-source
data dispatcher flux hubspot javascript react store
Last synced: 14 Oct 2025
https://github.com/OWOX/owox-data-marts
Open-Source Self-Service Analytics Platform
analytics athena bigquery dashboard data data-analysis data-marts facebook linkedin looker-studio reddit reporting self-service sheets sql sql-editor tiktok
Last synced: 16 Oct 2025
https://github.com/cawfree/react-native-quiet
🤫 Quiet for React Native.
audio chirp data quiet quietjs react react-native send transmit ultrasonic
Last synced: 24 Jun 2025
https://github.com/triggerdotdev/apihero
Make every API you use faster and more reliable with one line of code ⚡️
api data gateway http http-client observability proxy rest typescript
Last synced: 15 Apr 2025
https://github.com/unytics/airbyte_serverless
Airbyte made simple (no UI, no database, no cluster)
airbyte bigquery data data-analysis data-engineering data-warehouse elt etl pipeline
Last synced: 16 May 2025
https://github.com/looker/lookerbot
Lookerbot lets you access all your Looker data from Slack! Super fun!
chat chatbot data data-visualization looker slack slash-commands
Last synced: 01 Apr 2025
https://github.com/esri/geodev-hackerlabs
A place to learn how to build geo apps with the ArcGIS Platform.
arcgis-js-api arcgis-online arcgis-platform data design geodev javascript
Last synced: 18 Jul 2025
https://github.com/apis-is/apis
Making data readily available to anyone interested
api data iceland javascript node public-data
Last synced: 13 May 2025
https://github.com/tomwhite/covid-19-uk-data
Coronavirus (COVID-19) UK Historical Data
confirmed-cases coronavirus covid-19 csv daily-counts data dataset deaths england historical-data northern-ireland scotland uk united-kingdom wales
Last synced: 01 Nov 2025
https://github.com/jonschlinkert/data-store
Easily get, set and persist config data. Fast. Supports dot-notation in keys. No dependencies.
cache conf config configstore data javascript json nodejs persist store stort
Last synced: 04 Apr 2025
https://github.com/luojilab/datatranshub
跨平台Android/iOS海量数据上报组件,基于Xlog完善,解决Xlog痛点问题。
android data data-report ios logger xlog
Last synced: 21 Aug 2025
https://github.com/censusreporter/census-api
The home for the API that powers the Census Reporter project.
Last synced: 27 Jul 2025
https://github.com/lironmiz/computer-science-in-java
Designed for saving assignments, submission exercises and projects
algrothm arrays bluej computer-science coursework data datastructures degree implementation-of-data-structures java linked-list oop oop-principles openuniversity queue recursion search-algorithm sorting-algorithms stack trees
Last synced: 31 Oct 2025
https://github.com/oxinabox/DataDeps.jl
reproducible data setup for reproducible science
data data-science open-science
Last synced: 13 Nov 2025
https://github.com/ropensci/dataspice
:hot_pepper: Create lightweight schema.org descriptions of your datasets
data dataset metadata r r-package rstats schema-org unconf unconf18
Last synced: 13 Jul 2025
https://github.com/scriptin/kanji-frequency
Kanji usage frequency data collected from various sources
cjk cjk-characters corpus corpus-linguistics data data-visualization frequency-lists japanese japanese-language kanji kanji-frequency
Last synced: 18 Jan 2026
https://github.com/bacinger/f1-circuits
A repository of Formula 1™ circuits in GeoJSON format.
data data-repository f1-circuits formula-1 geojson geojson-data geojson-format
Last synced: 15 Apr 2025
https://github.com/edwindj/daff
Diff, patch and merge for data.frames, see http://paulfitz.github.io/daff/
Last synced: 14 May 2025
https://github.com/oxinabox/datadeps.jl
reproducible data setup for reproducible science
data data-science open-science
Last synced: 14 Mar 2025
https://github.com/datafusion-contrib/datafusion-dft
Batteries included CLI, TUI, and server implementations for DataFusion.
arrow cli data database datafusion tui
Last synced: 10 May 2025
https://github.com/kedarvj/mysql-random-data-generator
This is the easiest MySQL random test data generator tool. Load the procedure and execute to auto detect column types and load data.
data dataset dummy-data dummy-data-generator mysql procedure random-generation testdata testdatabuilder
Last synced: 16 Mar 2025
https://github.com/asad70/wallstreetbets-sentiment-analysis
This program finds the most mentioned ticker on r/wallstreetbets and uses Vader SentimentIntensityAnalyzer to calculate the sentiment analysis.
algotrading analysis data docker-container reddit sentiment-analysis trading vader-sentimentintensityanalyzer wallstreetbets wallstreetbets-sentiment-analysis
Last synced: 23 Oct 2025
https://github.com/DanielBok/copulae
Multivariate data modelling with Copulas in Python
conda copula copula-models copulae copulas data data-analysis dependency-analysis dependency-modeling modeling pypi pypi-packages python python3 statistics
Last synced: 26 Mar 2025
https://github.com/Appsilon/data.validator
validate your data and create nice reports straight from R
data r reporting rhinoverse rstudio validation
Last synced: 06 May 2025
https://github.com/Tauffer-Consulting/domino
User friendly and open source platform for workflow creation and monitoring
ai airflow containers data gui kubernetes open-source python workflows
Last synced: 22 Apr 2025
https://github.com/ingoscholtes/pathpy
pathpy is an OpenSource python package for the modeling and analysis of pathways and temporal networks using higher-order and multi-order graphical models
analysis data data-mining graph graphical-models machine-learning model-selection multi-order network-analysis networks pathways python sequential-data temporal-correlations temporal-networks
Last synced: 25 Sep 2025
https://github.com/leinelissen/aeon
📡 Scan the internet for your personal information and modify or remove it
Last synced: 05 Apr 2025
https://github.com/kirlovon/aloedb
Light, Embeddable, NoSQL database for Deno 🦕
data database datastore db deno embeddable embedded embedded-database javascript json light nosql nosql-database typescript
Last synced: 08 May 2025
https://github.com/Kirlovon/AloeDB
Light, Embeddable, NoSQL database for Deno 🦕
data database datastore db deno embeddable embedded embedded-database javascript json light nosql nosql-database typescript
Last synced: 13 May 2025
https://github.com/robjhyndman/fpp3
All data sets required for the examples and exercises in the book "Forecasting: principles and practice" (3rd ed, 2020) by Rob J Hyndman and George Athanasopoulos <http://OTexts.org/fpp3/>. All packages required to run the examples are also loaded.
Last synced: 19 Jul 2025