data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum
- Last updated: 2026-06-23 00:07:41 UTC
https://github.com/tinymins/luadata
A JavaScript (JS) npm package that serializes arrays and objects to Lua tables and deserializes Lua tables back to arrays and objects.
data javascript js lua luadata
Last synced: 24 Apr 2025
https://github.com/longnguyen010203/ecommerce-elt-pipeline
🌄📈📉 A Data Engineering Project 🌈 that implements an ELT data pipeline using Dagster, Docker, dbt, Polars, Snowflake, and PostgreSQL. Data comes from the Kaggle website 🔥
dagster data data-engineering dbt docker docker-compose dockerfile elt elt-pipeline extract kaggle load polars postgresql raw-data relational-databases snowflake transform
Last synced: 27 Feb 2026
https://github.com/stdlib-js/array-complex64
Complex64Array.
array cmplx complex complex64 complex64array data float imag imaginary javascript node node-js nodejs real stdlib structure typed typed-array types
Last synced: 09 Apr 2025
https://github.com/ahmetfurkandemir/iot
Internet of Things (IoT)
accelerometer arduino data dht22sensor esp32-arduino gyroscope iot iot-device lcd-display sensor wokwii
Last synced: 15 Apr 2025
https://github.com/cgivre/drill-geoip-functions
GeoIP Functions for Apache Drill
apache-drill city country data data-analysis data-science drill geoip-functions ip-address ipv4
Last synced: 12 Apr 2025
https://github.com/apache/incubator-devlake-playground
Apache DevLake is an open-source dev data platform to ingest, analyze, and visualize the fragmented data from DevOps tools, extracting insights for engineering excellence, developer experience, and community growth.
dashboard-friendly data data-analysis data-engineering data-integration data-transfers devops domain-layer dora etl hacktoberfest integration jira open-source python user-friendly
Last synced: 19 Oct 2025
https://github.com/jesusgraterol/binance-futures-dataset-builder
The dataset builder script extracts the most relevant market data straight from Binance's API and builds a series of datasets that can be used in data science and machine learning projects.
bitcoin blockchain blockchain-technology data datascience datascience-machinelearning dataset dataset-generation futures futures-long-short futures-market machine-learning
Last synced: 06 Mar 2026
https://github.com/orfium/s3-parquetifier
This is a tool that takes a file from an S3 bucket and transforms it to Parquet format
Last synced: 12 Apr 2025
https://github.com/dsietz/pbd
Privacy by Design SDK
actix-web best-practices data data-privacy development-kit nfjs pbd pbd-sdk privacy privacy-by-design rust rust-lang sdk sdk-rust strategies
Last synced: 09 Apr 2025
https://github.com/hitsz-ids/dbmasker
DBMasker is an open-source Java project for mainstream database systems that aims to provide a unified and secure access interface.
data data-security database mask sdk security
Last synced: 26 Apr 2025
https://github.com/r-js/mangos
🥭's is a monorepo collecting data wrangling and data validation utilities
counterculture data data-wrangling fold functional isomorphism javascript json lens optics schema traversal validation
Last synced: 22 Feb 2026
https://github.com/amol-/datapyground
Easy to study Data Platform for fun and profit
compute-engine data data-engineering database python
Last synced: 28 Jul 2025
https://github.com/simranjeet97/financial_analysis_python
This repository contains code and videos related to financial data analysis using Python.
data data-analysis data-science finance financial-data-analysis indian-stock-market machine-learning nifty50 numpy pandas portfolio-optimization python stock-analysis stock-market stock-price-prediction stock-price-simulation stock-simulation
Last synced: 22 Sep 2025
https://github.com/saifulaiub123/elasticsync.net
Real-time high-performance synchronization engine for syncing relational database changes to Elasticsearch index
background-service data database dotnet dotnetcore elastic elastic-search elasticsearch fulltext-search fulltextsearch npgsql postgresql sqlserver sync
Last synced: 27 Sep 2025
https://github.com/woctezuma/steam-reviews-to-sales
Study the relationship between review numbers and sales for Steam games
boxleiter data data-leak data-leakage data-leaks game games leak leakage leaks mike-boxleiter player review steam steam-api steam-game steam-games steam-reviews steamspy vginsights
Last synced: 02 Aug 2025
https://github.com/datadistillr/datadistillr-python-sdk
A Python SDK for Programmatically Interacting with DataDistillr
apache-drill data data-science datadistillr jupyter sql
Last synced: 01 Jul 2025
https://github.com/joschnitzbauer/dalymi
A lightweight, data-focused and non-opinionated pipeline manager written in and for Python.
dag data data-science pipeline python workflow
Last synced: 14 Jan 2026
https://github.com/spsanderson/healthyr.data
Data sets for the healthyR package.
data data-science data-sets healthcare healthcare-analysis healthcare-application healthcare-datasets r rstats
Last synced: 07 Apr 2025
https://github.com/spsanderson/steveondata
Repository for mainly R tips and tricks for my blog. I also include some VBA, SQL, C and Linux Usage.
ai blog c data data-science linux machinelearning-r ml ms-sql r sql time-series tipoftheday vba vba-excel
Last synced: 07 Apr 2025
https://github.com/mozahran/data-mapper
A data mapping tool that helps you map JSON with configuration files (JSON structure transformation). It also supports if conditions, casting, and mutators (custom or built-in functions).
data json mapper mappings mutator transformer
Last synced: 13 Jan 2026
https://github.com/dprokop/querier
Simple declarative data layer for React apps
data declarative react typescript
Last synced: 23 Mar 2025
https://github.com/maelgangloff/sfrmobile-api
Unofficial support for the mobile API of the French ISP SFR/RED
consommation data giga mobile phone sfr
Last synced: 10 Mar 2026
https://github.com/erfidev/ds_golang
Data structures implemented in the Go programming language
Last synced: 16 May 2025
https://github.com/RDeconomist/RDeconomist.github.io
RapidCharts - a site for teaching and demonstrating Data Science and Visualisation techniques
data data-science data-visualization economics politics sports
Last synced: 08 Apr 2025
https://github.com/Scetrov/FrontierSharp
C# / .NET API clients for EVE Frontier: an API client for the static data exposed by CCP's HTTP API, plus an HTTP client tuned to the specific API design patterns implemented by CCP.
api data eve-frontier static-data
Last synced: 30 May 2026
https://github.com/wazzabeee/twitter-sentiment-analysis-pyspark
Comparative study of classification algorithms implemented in PySpark on the Sentiment 140 dataset.
apache-spark data data-science gcp google-cloud logistic-regression naive-bayes-classifier natural-language-processing nlp nlp-machine-learning pyspark python python3 sentiment-analysis sentiment-classification sentiment140-dataset sentimental-analysis spark tweet twitter
Last synced: 06 May 2025
https://github.com/olamide100/data-engineering-project
This project summarizes a data set of Tweets related to the forthcoming 2023 general election in Nigeria, focusing on the two leading presidential aspirants in the country
airflow data data-engineering datatalks docker election nigeria politics twitter
Last synced: 05 May 2025
https://github.com/firelink-sh/evolve-py
A highly efficient, composable, and lightweight ETL and data integration framework.
analytics arrow big-data data data-engineering data-integration data-science duckdb elt etl ingestion ingress ml olap pipeline polars postgresql python s3
Last synced: 10 Mar 2026
https://github.com/theadam/pivoter
A library to create pivot tables that you can plug into any view library
Last synced: 16 Dec 2025
https://github.com/russss/coviddata
Yet another python package for accessing COVID-19 data
Last synced: 13 May 2025
https://github.com/The-Swarm-Corporation/BackTesterAgent
An enterprise-grade AI-powered backtesting framework built on the Swarms framework for automated trading strategy validation and optimization.
ai backtesting data finance finance-agents jpmorgan yahoofinance
Last synced: 11 Sep 2025
https://github.com/guilt/neuralinkcompression
Neuralink Compression Submission
Last synced: 14 Apr 2025
https://github.com/nrennie/shakespeare
CSV files of the works of William Shakespeare.
Last synced: 20 Mar 2025
https://github.com/bullet-db/bullet-record
The generic AVRO record container for plugging in your data into Bullet
avro bullet bullet-record container data
Last synced: 15 Jul 2025
https://github.com/piotrlaczkowski/keras-data-processor
Data preprocessing model based on Keras preprocessing layers that can be used as a standalone model or incorporated into a Keras model as its first layers.
data keras layers preprocessing tensorflow
Last synced: 06 May 2025
https://github.com/lacerbi/changeprob
Human Online Adaptation to Changes in Prior Probability
bayesian-inference change-point-detection data modeling psychophysics
Last synced: 13 May 2025
https://github.com/makomweb/curious-moon-exercise
My notes about: A Curious Moon by Rob Conery
data data-science docker make postgres postgresql sql
Last synced: 06 Apr 2025
https://github.com/stdlib-js/array-pool
Typed array pool.
array data float32array float64array int16array int32array javascript matrix ndarray node node-js nodejs stdlib structure typed typed-array types uint32array vector
Last synced: 05 May 2025
https://github.com/tillathehun0/tilla
Permission based object transformation made easy.
asynchronous data datatransferobject dto mapper mapping node permissions promise-api transform transformer
Last synced: 12 May 2025
https://github.com/katilingban/ppitables
Poverty Probability Index (PPI) Lookup Tables
data poverty poverty-likelihoods poverty-probability ppi
Last synced: 04 Oct 2025
https://github.com/thiagodnf/qwood
Data structure visualization tool based on Qt
data data-structure qt5 tree visualization
Last synced: 15 Jul 2025
https://github.com/tjpalanca/education-analytics
Data-driven analysis of the Philippine education system
Last synced: 20 Oct 2025
https://github.com/lykmapipo/nyc-tlc-trip-data
Python scripts to download, process, and analyze the New York City Taxi and Limousine Commission (TLC) Trip Record Data dataset
apache-arrow apache-spark data data-engineering data-extraction data-transformation etl fsspec geopandas joblib jupyterlab lykmapipo metadata nyc nyc-taxi-dataset pandas pyarrow python s3
Last synced: 17 Sep 2025
https://github.com/aeternalis-ingenium/anomalytics
The ultimate anomaly detection and its analytics.
analysis analytics anomaly-detection data detection engineering inference mathematics numpy open-source pandas prediction python3 research scientific scipy statistics
Last synced: 12 Apr 2025
https://github.com/femtotrader/dukascopyticksreader.jl
A Julia library to download tick data from Dukascopy https://www.dukascopy.com/swiss/english/marketwatch/historical/
data data-analysis dataset dukascopy html julia stock-data
Last synced: 10 Apr 2025
https://github.com/strmprivacy/data-plane-helm-chart
Care about your data leaving your VPC/environment in SaaS mode? With our self-hosted option you can run our privacy focused Data Plane in your own Kubernetes Cluster. Just (1) sign-up, (2) request a self-hosted installation, (3) use our values.yaml on your own k8s clusters and (4) run your (customer) data inside your own cloud like 🪄
charts data helm kubernetes privacy security
Last synced: 23 Jun 2025
https://github.com/woctezuma/steam-api-data
Data consisting of Steam app details.
Last synced: 13 Apr 2025
https://github.com/blockception/bc-minecraft-bedrock-vanilla-data
A TypeScript library that provides vanilla Minecraft data
bedrock data hacktoberfest hacktoberfest2022 minecraft ts typescript vanilla
Last synced: 08 Sep 2025
https://github.com/ygor-j/sql-guard
A small package for data quality rules using Standard SQL
assertions bigquery data data-cleaning data-quality data-quality-checks duckdb ducklake gcp in-memory pure-sql sql testing-tools
Last synced: 26 Oct 2025
https://github.com/lgatto/rpx
R Interface to the ProteomeXchange Repository
bioconductor data mass-spectrometry proteomexchange proteomics
Last synced: 21 Mar 2025
https://github.com/iguptashubham/online-retail-sales
This Power BI dashboard, designed for marketing strategists, analyzes sales trends and customer behavior. It provides key insights empowering them to identify sales opportunities and optimize marketing campaigns, ultimately boosting business sales.
dashboard data data-analysis data-analysis-project data-analysis-project-powerbi data-analysis-python data-project data-science powerbi project
Last synced: 19 Mar 2026
https://github.com/lynnlangit/sample-data
Small datasets and files in many formats, used for testing cloud SQL, NoSQL or Machine Learning Services
Last synced: 24 Mar 2025
https://github.com/ifttt/stripe-webhook-event-ingester
An AWS CDK-based project for ingesting Stripe webhook events into an SQS queue
aws cdk data lambda-functions pipeline python stripe
Last synced: 24 Dec 2025
https://github.com/ahmet-dedeler/concussify-csya-hacks
AI Powered Concussion Detection ⚡ On your browser, in one minute.
ai data github javascript learn web
Last synced: 08 Jul 2025
https://github.com/dhershman1/simply_valid
A simple to use data driven validation system
data easy-to-use plug-and-play simple validation
Last synced: 03 May 2025
https://github.com/FCC/pirateaction2
pirate action version 2
data data-visualization gis map
Last synced: 27 Jul 2025
https://github.com/bbc/verify-it
Randomised test property/data generation for NodeJS
data node nodejs property random randomization test testing typescript
Last synced: 17 Jun 2025
https://github.com/tim-crisp/javascript.lol
An in-browser JavaScript REPL with flow-based prototyping and dynamic module imports. https://javascript.lol
browser data javascript js json node playground repl
Last synced: 14 May 2025
https://github.com/gsantiago/extract-data-options
Extract `data-(namespace)-*` options from an HTML element
attributes data element extract html options
Last synced: 10 Apr 2025
https://github.com/hadiazt/medias-info
Scrape media info such as details and download links
data downloader media movie music
Last synced: 17 Jul 2025
https://github.com/navchandar/wifi_usage_tracker
Used to track WiFi usage from ISP website and track network speed using fast.com
data fast isp isp-website python scraper speedtest task-scheduler track-wifi tracker wifi
Last synced: 10 Apr 2025
https://github.com/lmacken/covidmon
A tool to monitor and visualize COVID-19 case status across many locations
covid covid-19 data dataviz global matplotlib monitoring pandas pandemic python virus
Last synced: 10 Apr 2025
https://github.com/ritiek/infrared-serial-comm
Transmits data using an IR led to an IR receiver on a Raspberry Pi
Last synced: 10 Apr 2025
https://github.com/aerisweather/mapsgl-ios-webview
Displays a MapsGL map view in a WKWebView with native controls using a Swift and Javascript bridge.
data forecast googlemaps leaflet map mapbox maplibre mapsgl visualization weather webgl
Last synced: 13 Apr 2025
https://github.com/hevalhazalkurt/exploring_the_data_of_lego_history
A data exploration project on LEGO history in Python with pandas, matplotlib etc. (WIP)
data data-analysis data-science data-visualization datascience datasets lego lego-history matplotlib pandas python python3
Last synced: 13 Apr 2025
https://github.com/xfhy/datastructure
Data structures, language: C++
cpp data data-structures structure
Last synced: 13 Apr 2025
https://github.com/nashory/loader-torch
A multi-threaded data loader module for Torch.
data dataloader loader module multi-thread preprocessing toolbox torch
Last synced: 14 May 2025
https://github.com/maltyxx/images-generator
Generator of placeholder-type images using GD for FakerPHP/Faker
data faker fixtures image-generator imagesgenerator maltyxx php-faker
Last synced: 15 Jan 2026
https://github.com/astrapi69/data-api
Defines interfaces for data beans like jpa entities or DTO(Data Transfer Object) objects
active data dto identity java jpa modification tree-structure versioning
Last synced: 01 Apr 2025
https://github.com/trostalski/fhan
A simple client to query FHIR servers
data fhir fhir-client healthcare interoperability
Last synced: 17 Jan 2026
https://github.com/octoenergy/s3migrate
Bulk delete/copy/move files or modify Hive/Drill/Athena partitions using pythonic pattern matching
Last synced: 24 Jun 2025
https://github.com/navchandar/Wifi_Usage_Tracker
Used to track WiFi usage from ISP website and track network speed using fast.com
data fast isp isp-website python scraper speedtest task-scheduler track-wifi tracker wifi
Last synced: 15 Jul 2025
https://github.com/contributte/dag
:running: Dag is an opinionated expression data generator written in PHP to fake your API.
alice api data expression faker generator ninjify php serverless
Last synced: 11 Jan 2026
https://github.com/weecology/retriever-recipes
data data-retrieval data-science dataset datasets hacktobefest
Last synced: 17 Feb 2026
https://github.com/anasfik/claude-code-prompts-reference
Leaked Claude code research material (how it does everything)
claude claude-code code data leak leaks prompts research skills system-prompts tools
Last synced: 08 Apr 2026
https://github.com/scottgriv/river-charts
🌊 📉 A Python, Django, Plotly, and Pandas web application that visualizes river data in real time, pulled using an API from the United States Geological Survey (USGS).
api charts data data-analysis data-visualization dataset django pandas plotly python usgs usgs-api visualization webapp
Last synced: 12 Aug 2025
https://github.com/fahim1049/data-structure-and-algorithm
Data Structure and Algorithm
array array-manipulations binary data data-science linear peek pop push search-algorithm stack structures
Last synced: 24 Apr 2025
https://github.com/smilyorg/tinygpkg-data
Small geographic datasets based on open data + tools
data dataset geo geodata geography geojson geopackage golang naturalearth naturalearthdata
Last synced: 27 Jun 2025
https://github.com/ropensci/dataaimsr
Australian Institute of Marine Science (AIMS) Data Platform API Client which provides easy access to AIMS Data Platform scientific data and information.
aims australia data marine monitoring sst weather
Last synced: 27 Dec 2025
https://github.com/sckott/gbifrb
GBIF Ruby client; docs: http://www.rubydoc.info/gems/gbifrb/0.1.0
biodiversity data ecology gbif gbif-api
Last synced: 15 Sep 2025