data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-06-28 00:07:46 UTC
- JSON Representation
https://github.com/hitsz-ids/dbmasker
DBMasker 是一个针对主流数据库系统的 Java 开源项目,旨在提供统一且安全的访问接口。
data data-security database mask sdk security
Last synced: 26 Apr 2025
https://github.com/simranjeet97/financial_analysis_python
This repository contains code and videos related to financial data analysis using python.
data data-analysis data-science finance financial-data-analysis indian-stock-market machine-learning nifty50 numpy pandas portfolio-optimization python stock-analysis stock-market stock-price-prediction stock-price-simulation stock-simulation
Last synced: 22 Sep 2025
https://github.com/samashi47/ml-toolkit-project
A general-purpose toolkit for data preprocessing, machine learning modeling, and visualization.
classification data data-preprocessing machine-learning python3 visualization
Last synced: 30 Jul 2025
https://github.com/navchandar/file-convertor-utils
Set of custom Python Utilities to convert one file format into another. Filetypes supported: Excel, Images, PDF, GIF, MP4, XML, etc.
conversions convertor-utils data dataconversion excel file-conversion fileconversion fileformats image pdf python video xml
Last synced: 21 Sep 2025
https://github.com/longnguyen010203/ecommerce-elt-pipeline
🌄📈📉 A Data Engineering Project 🌈 that implements an ELT data pipeline using Dagster, Docker, Dbt, Polars, Snowflake, PostgreSQL. Data from kaggle website 🔥
dagster data data-engineering dbt docker docker-compose dockerfile elt elt-pipeline extract kaggle load polars postgresql raw-data relational-databases snowflake transform
Last synced: 27 Feb 2026
https://github.com/aircloud/use-groot
React Hooks for Data Fetching
cache data data-fetching fetch hook hooks react react-native stale-while-revalidate suspense swr
Last synced: 30 Jul 2025
https://github.com/tomaztk/datasetR
Generate datasets for R projects
data data-frame data-science r-language r-programming sample sample-data sample-data-generator
Last synced: 29 Jul 2025
https://github.com/joschnitzbauer/dalymi
A lightweight, data-focused and non-opinionated pipeline manager written in and for Python.
dag data data-science pipeline python workflow
Last synced: 14 Jan 2026
https://github.com/mozahran/data-mapper
A data mapping tool that helps you map JSON with configuration files (JSON structure transformation). It also supports if conditions, casting, and mutators (custom or built-in functions).
data json mapper mappings mutator transformer
Last synced: 13 Jan 2026
https://github.com/bkuhlmann/lode
A monadic store of marshaled objects.
data objects persistence pstore storage transactions value
Last synced: 29 Jul 2025
https://github.com/r-js/mangos
🥭's is monorepo collecting data wrangling and data validation utilities
counterculture data data-wrangling fold functional isomorphism javascript json lens optics schema traversal validation
Last synced: 22 Feb 2026
https://github.com/Scetrov/FrontierSharp
C# / .NET API Clients for EVE Frontier — API client for the static data exposed by CCPs HTTP API plus a HTTP Client tuned to the specific API design patterns implemented by CCP.
api data eve-frontier static-data
Last synced: 30 May 2026
https://github.com/weavechain/weave-py-api
Weavechain Python API
confidential-compute data data-replication distributed-computing homomorphic-encryption layer-0 mpc self-sovereign verifiable-credentials weavechain zero-knowledge-proofs
Last synced: 14 Jan 2026
https://github.com/tuanai-vireox/dataplatform-stack
How to build a complete Data Platform -> Here
airflow cdc data data-warehouse datalake dataplatform dbt flink k8s kafka spark-streaming
Last synced: 22 Aug 2025
https://github.com/amol-/datapyground
Easy to study Data Platform for fun and profit
compute-engine data data-engineering database python
Last synced: 28 Jul 2025
https://github.com/pottekkat/bulldozer-prize-predictions
Predict the auction sale price for a piece of heavy equipment to create a "blue book" for bulldozers.
bluebook bulldozer data data-science jupyter-notebook kaggle-competition machine-learning
Last synced: 20 Jun 2026
https://github.com/juliaearth/geoartifacts.jl
Artifacts (e.g., datasets) for Geospatial Data Science
Last synced: 10 Apr 2026
https://github.com/mooxphp/data
[READ-ONLY] Static Language Data for Filament
countries currencies data filament languages laravel static timezones
Last synced: 20 Feb 2026
https://github.com/gregyjames/mapperic
Automatically generate DTO Classes and AutoMapper Configurations.
automapper automapper-profiles code-generation csharp data dotnet dotnet-core dto dto-entity-mapper dto-generator dto-mapper dto-pattern object-oriented rosyln syntax-analysis syntax-tree
Last synced: 07 May 2025
https://github.com/extratone/routinehubreport
Maintaining a git-tracked record of analytics-esque data for my RoutineHub Library generated by Martin de Boer's exceptional Routinehub Report Siri Shortcut.
analytics automation data routinehub siri-shortcuts
Last synced: 03 Sep 2025
https://github.com/arindal1/striversdsasheet
Solutions of all the problems in Striver's A2Z DSA Sheet
cpp data datastructures datastructures-algorithms striver strivers-sde-sheet
Last synced: 04 Apr 2025
https://github.com/unicef/magasin
Cloud native open-source end-to-end data / AI / ML platform
cloud dagster data data-pipelines data-science data-visualization helm-charts kubernetes magasin
Last synced: 21 Apr 2025
https://github.com/ahmetfurkandemir/iot
Internet of Things (IoT)
accelerometer arduino data dht22sensor esp32-arduino gyroscope iot iot-device lcd-display sensor wokwii
Last synced: 15 Apr 2025
https://github.com/apache/incubator-devlake-playground
Apache DevLake is an open-source dev data platform to ingest, analyze, and visualize the fragmented data from DevOps tools, extracting insights for engineering excellence, developer experience, and community growth.
dashboard-friendly data data-analysis data-engineering data-integration data-transfers devops domain-layer dora etl hacktoberfest integration jira open-source python user-friendly
Last synced: 19 Oct 2025
https://github.com/effect-deprecated/morphic
Domain Modelling and Structural Derivation (port of morphic-ts)
data domain functional typeclasses
Last synced: 29 Jun 2025
https://github.com/RDeconomist/RDeconomist.github.io
RapidCharts - a site for teaching and demonstrating Data Science and Visualisation techniques
data data-science data-visualization economics politics sports
Last synced: 08 Apr 2025
https://github.com/flintsh/outlier-tools
A collection of free open-source tools to help you better understand your Outlier account, entirely handled in-browser.
Last synced: 27 Feb 2025
https://github.com/sjefvanleeuwen/rqlite-dotnet
A lightweight database HTTP API client for rqlite. rqlite is a lightweight, distributed relational database, which uses RAFT and SQLite as its storage engine.
cluster data database distributed distributed-computing distributed-database distributed-systems dotnet raft rqlite
Last synced: 12 May 2025
https://github.com/DSCmatter/TechNews-API
This API allows you to retrieve tech news from various sources around the world using simple GET commands.
api api-rest backend-api data express javascript newsapi nodejs
Last synced: 19 Aug 2025
https://github.com/gsantiago/extract-data-options
Extract `data-(namespace)-*` options from a HTML element
attributes data element extract html options
Last synced: 10 Apr 2025
https://github.com/femtotrader/dukascopyticksreader.jl
A Julia library to download tick data from Dukascopy https://www.dukascopy.com/swiss/english/marketwatch/historical/
data data-analysis dataset dukascopy html julia stock-data
Last synced: 10 Apr 2025
https://github.com/navchandar/wifi_usage_tracker
Used to track WiFi usage from ISP website and track network speed using fast.com
data fast isp isp-website python scraper speedtest task-scheduler track-wifi tracker wifi
Last synced: 10 Apr 2025
https://github.com/trostalski/fhan
A simple client to query FHIR servers
data fhir fhir-client healthcare interoperability
Last synced: 17 Jan 2026
https://github.com/ritiek/infrared-serial-comm
Transmits data using an IR led to an IR receiver on a Raspberry Pi
Last synced: 10 Apr 2025
https://github.com/lmacken/covidmon
A tool to monitor and visualize COVID-19 case status across many locations
covid covid-19 data dataviz global matplotlib monitoring pandas pandemic python virus
Last synced: 10 Apr 2025
https://github.com/dhershman1/simply_valid
A simple to use data driven validation system
data easy-to-use plug-and-play simple validation
Last synced: 03 May 2025
https://github.com/ahmet-dedeler/concussify-csya-hacks
AI Powered Concussion Detection ⚡ On your browser, in one minute.
ai data github javascript learn web
Last synced: 08 Jul 2025
https://github.com/iguptashubham/online-retail-sales
This Power BI dashboard, designed for marketing strategists, analyzes sales trends and customer behavior. It provides key insights empowering them to identify sales opportunities and optimize marketing campaigns, ultimately boosting business sales.
dashboard data data-analysis data-analysis-project data-analysis-project-powerbi data-analysis-python data-project data-science powerbi project
Last synced: 19 Mar 2026
https://github.com/ygor-j/sql-guard
A small package for data quality rules using Standard SQL
assertions bigquery data data-cleaning data-quality data-quality-checks duckdb ducklake gcp in-memory pure-sql sql testing-tools
Last synced: 26 Oct 2025
https://github.com/blockception/bc-minecraft-bedrock-vanilla-data
A Typescript library for dealing provides vanilla minecraft data
bedrock data hacktoberfest hacktoberfest2022 minecraft ts typescript vanilla
Last synced: 08 Sep 2025
https://github.com/katilingban/ppitables
Poverty Probability Index (PPI) Lookup Tables
data poverty poverty-likelihoods poverty-probability ppi
Last synced: 04 Oct 2025
https://github.com/stdlib-js/array-pool
Typed array pool.
array data float32array float64array int16array int32array javascript matrix ndarray node node-js nodejs stdlib structure typed typed-array types uint32array vector
Last synced: 05 May 2025
https://github.com/nrennie/shakespeare
CSV files of the works of William Shakespeare.
Last synced: 20 Mar 2025
https://github.com/theadam/pivoter
A library to create pivot tables that you can plug into any view library
Last synced: 16 Dec 2025
https://github.com/olamide100/data-engineering-project
This Project summarizes the data set of Tweets related to the forth coming 2023 general election in Nigeria targeted at the two leading presidential aspirants in the country
airflow data data-engineering datatalks docker election nigeria politics twitter
Last synced: 05 May 2025
https://github.com/wazzabeee/twitter-sentiment-analysis-pyspark
Comparative study of classification algorithms implemented in PySpark on the Sentiment 140 dataset.
apache-spark data data-science gcp google-cloud logistic-regression naive-bayes-classifier natural-language-processing nlp nlp-machine-learning pyspark python python3 sentiment-analysis sentiment-classification sentiment140-dataset sentimental-analysis spark tweet twitter
Last synced: 06 May 2025
https://github.com/piotrlaczkowski/keras-data-processor
Data Preprocessing model based on Keras preprocessing layers that can be used as a standalone model or incorporated to Keras model as first layers.
data keras layers preprocessing tensorflow
Last synced: 06 May 2025
https://github.com/aeternalis-ingenium/anomalytics
The ultimate anomaly detection and its analytics.
analysis analytics anomaly-detection data detection engineering inference mathematics numpy open-source pandas prediction python3 research scientific scipy statistics
Last synced: 12 Apr 2025
https://github.com/strmprivacy/data-plane-helm-chart
Care about your data leaving your VPC/environment in SaaS mode? With our self-hosted option you can run our privacy focused Data Plane in your own Kubernetes Cluster. Just (1) sign-up, (2) request a self-hosted installation, (3) use our values.yaml on your own k8s clusters and (4) run your (customer) data inside your own cloud like 🪄
charts data helm kubernetes privacy security
Last synced: 23 Jun 2025
https://github.com/hadiazt/medias-info
Scrap Media Info Such As Information & Download Link
data downloader media movie music
Last synced: 17 Jul 2025
https://github.com/hevalhazalkurt/exploring_the_data_of_lego_history
A data exploration project on LEGO history in Python with pandas, matplotlib etc. (WIP)
data data-analysis data-science data-visualization datascience datasets lego lego-history matplotlib pandas python python3
Last synced: 13 Apr 2025
https://github.com/xfhy/datastructure
数据结构,语言:C++
cpp data data-structures structure
Last synced: 13 Apr 2025
https://github.com/maltyxx/images-generator
Generator of placeholder-type images using GD for FakerPHP/Faker
data faker fixtures image-generator imagesgenerator maltyxx php-faker
Last synced: 15 Jan 2026
https://github.com/contributte/dag
:running: Dag is opinionated expression data generator written in PHP to fake your API.
alice api data expression faker generator ninjify php serverless
Last synced: 11 Jan 2026
https://github.com/weecology/retriever-recipes
data data-retrieval data-science dataset datasets hacktobefest
Last synced: 17 Feb 2026
https://github.com/octoenergy/s3migrate
Bulk delete/copy/move files or modify Hive/Drill/Athena partitions using pythonic pattern matching
Last synced: 24 Jun 2025
https://github.com/navchandar/Wifi_Usage_Tracker
Used to track WiFi usage from ISP website and track network speed using fast.com
data fast isp isp-website python scraper speedtest task-scheduler track-wifi tracker wifi
Last synced: 15 Jul 2025
https://github.com/aerisweather/mapsgl-ios-webview
Displays a MapsGL map view in a WKWebView with native controls using a Swift and Javascript bridge.
data forecast googlemaps leaflet map mapbox maplibre mapsgl visualization weather webgl
Last synced: 13 Apr 2025
https://github.com/lgatto/rpx
R Interface to the ProteomeXchange Repository
bioconductor data mass-spectrometry proteomexchange proteomics
Last synced: 21 Mar 2025
https://github.com/woctezuma/steam-api-data
Data consisting of Steam app details.
Last synced: 13 Apr 2025
https://github.com/tillathehun0/tilla
Permission based object transformation made easy.
asynchronous data datatransferobject dto mapper mapping node permissions promise-api transform transformer
Last synced: 12 May 2025
https://github.com/makomweb/curious-moon-exercise
My notes about: A Curious Moon by Rob Conery
data data-science docker make postgres postgresql sql
Last synced: 06 Apr 2025
https://github.com/guilt/neuralinkcompression
Neuralink Compression Submission
Last synced: 14 Apr 2025
https://github.com/anasfik/claude-code-prompts-reference
Leaked Claude code research material (how it does everything)
claude claude-code code data leak leaks prompts research skills system-prompts tools
Last synced: 08 Apr 2026
https://github.com/russss/coviddata
Yet another python package for accessing COVID-19 data
Last synced: 13 May 2025
https://github.com/The-Swarm-Corporation/BackTesterAgent
An enterprise-grade AI-powered backtesting framework built on the Swarms framework for automated trading strategy validation and optimization.
ai backtesting data finance finance-agents jpmorgan yahoofinance
Last synced: 11 Sep 2025
https://github.com/bullet-db/bullet-record
The generic AVRO record container for plugging in your data into Bullet
avro bullet bullet-record container data
Last synced: 15 Jul 2025
https://github.com/lacerbi/changeprob
Human Online Adaptation to Changes in Prior Probability
bayesian-inference change-point-detection data modeling psychophysics
Last synced: 13 May 2025
https://github.com/thiagodnf/qwood
Data structure visualization tool based on Qt
data data-structure qt5 tree visualization
Last synced: 15 Jul 2025
https://github.com/lynnlangit/sample-data
Small datasets and files in many formats, used for testing cloud SQL, NoSQL or Machine Learning Services
Last synced: 24 Mar 2025
https://github.com/ifttt/stripe-webhook-event-ingester
An AWS CDK-based project for ingesting Stripe webhook events into an SQS queue
aws cdk data lambda-functions pipeline python stripe
Last synced: 24 Dec 2025
https://github.com/bbc/verify-it
Randomised test property/data generation for NodeJS
data node nodejs property random randomization test testing typescript
Last synced: 17 Jun 2025
https://github.com/tim-crisp/javascript.lol
An in-browser javascript repl with flow based prototyping and dynamic module imports. https://javascript.lol
browser data javascript js json node playground repl
Last synced: 14 May 2025
https://github.com/nashory/loader-torch
An Multi-threaded Data Loader Module for Torch.
data dataloader loader module multi-thread preprocessing toolbox torch
Last synced: 14 May 2025
https://github.com/astrapi69/data-api
Defines interfaces for data beans like jpa entities or DTO(Data Transfer Object) objects
active data dto identity java jpa modification tree-structure versioning
Last synced: 01 Apr 2025
https://github.com/firelink-sh/evolve-py
A highly efficient, composable, and lightweight ETL and data integration framework.
analytics arrow big-data data data-engineering data-integration data-science duckdb elt etl ingestion ingress ml olap pipeline polars postgresql python s3
Last synced: 10 Mar 2026
https://github.com/tjpalanca/education-analytics
Data-driven analysis of the Philippine education system
Last synced: 20 Oct 2025
https://github.com/lykmapipo/nyc-tlc-trip-data
Python scripts to download, process, and analyze the New York City Taxi and Limousine Commission (TLC) Trip Record Data dataset
apache-arrow apache-spark data data-engineering data-extraction data-transformation etl fsspec geopandas joblib jupyterlab lykmapipo metadata nyc nyc-taxi-dataset pandas pyarrow python s3
Last synced: 17 Sep 2025
https://github.com/FCC/pirateaction2
pirate action version 2
data data-visualization gis map
Last synced: 27 Jul 2025
https://github.com/acacode/serializy
❤️ Isolate server side models at frontend side ❤️
convert data json jsonschema mapper object schema serialization serializer validation validator
Last synced: 08 May 2025