data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-06-23 00:07:41 UTC
- JSON Representation
https://github.com/fx31337/fx-data-download-action
:chart_with_upwards_trend:🐳Downloads Forex historical data via GitHub Actions
backtesting csv data docker forex fx-data github-actions historical-data
Last synced: 21 Jun 2025
https://github.com/zh-plus/faker-openai
Generate fake data with OpenAI's GPT-3 API.
data fake fake-data fake-data-generator faker openai-api python test-data test-data-generator testing
Last synced: 18 Mar 2025
https://github.com/aphp/dsfaker
Time series Generator
data faker generation time-series
Last synced: 18 Jun 2025
https://github.com/strmprivacy/cli
This is the STRM Privacy Command Line Interface, to define and manage your privacy streams, data schemas, event contracts and much more.
cli data data-pipeline data-privacy data-privacy-compliance data-processing privacy
Last synced: 23 Jun 2025
https://github.com/jonschlinkert/gulp-data-contents
Gulp plugin that replaces the contents of a file with the contents of another file using the filepath specified on the 'contents' property in front-matter. Customizable, useful for generating scaffolding or defining placeholder files.
contents data front-matter generate generator gulp gulp-plugins gulpplugin placeholder replace scaffolding templates
Last synced: 12 May 2025
https://github.com/prakaa/mms-monthly-cli
Source code and CLI tool to query and download data from the Australian Energy Market Operator's Monthly Data Archive
aemo australia data energy national-electricity-market nem nemweb python
Last synced: 30 Oct 2025
https://github.com/alexcarpenter/us-coffee-roasters
☕️ Crowd-sourced list of US coffee roasters
Last synced: 12 Apr 2025
https://github.com/vatshayan/data-sets
Different Data-set on various Important topic on Real-world Problems
artificial-intelligence data data-mining datanalysis datascience datascientist dataset datavisualization machine-learning-algorithms
Last synced: 06 Mar 2026
https://github.com/DeutscheAktuarvereinigung/insurance_scr_data
How to Work With Comprehensive Internal Model Data for Three Portfolios
actuaries actuary data insurance internal-models scr
Last synced: 20 Jul 2025
https://github.com/yisaienkov/tinysets
The project aims to collect various datasets for tasks such as classification, clustering, object detection... The purpose of this datasets is quick checking models and algorithms performance.
algorithms classification data data-science dataset datasets kaggle kaggle-dataset lego lego-minifigures lego-sets object-detection pypi python regression text-classification tinysets
Last synced: 14 Apr 2025
https://github.com/mckraqs/dataride
Lightning-fast data platform setup toolkit for small projects and PoCs
data data-engineering python terraform
Last synced: 24 Oct 2025
https://github.com/frederickgeek8/lyql
📈 Free realtime stock data. Streamed straight from Yahoo.
data data-mining finance realtime stocks stream stream-api yahoo
Last synced: 05 Mar 2026
https://github.com/dbt-labs/jaffle-shop-mesh-finance
A ✨ meshified ✨ open source sandbox project exploring dbt workflows via a fictional sandwich shop's data. This is a domain-focused node in the mesh focused on finance models, built on the jaffle-shop-mesh-platform project.
analytics analytics-engineering data data-engineering dbt dbt-cloud
Last synced: 05 Mar 2026
https://github.com/abcnews/data-australian-political-donations
A data package about political donations in Australia.
Last synced: 27 Jan 2026
https://github.com/koddachad/dq_tester
A lightweight simple data quality testing tool.
data database dataengineering dataquality dataqualitycheck
Last synced: 08 Oct 2025
https://github.com/andrewrporter/goiex
A go interface for accessing IEX finanical information
data fetch finance golang iex iex-api iextrading
Last synced: 28 Apr 2025
https://github.com/paulgrammer/ug-locale
Uganda districts, sub-counties, counties, parishes and villages
data districts nodejs npm-package uganda
Last synced: 02 Mar 2026
https://github.com/heysupratim/android-app-categories
A JSON having 19K Android package name entries with their Play Store Categories. Useful for people looking to create App Category Based things. Eg Smart Launcher
android crawled-data data json
Last synced: 26 Mar 2025
https://github.com/jrdnbradford/readmdtable
R 📦 for reading markdown tables into tibbles
data data-analysis data-analytics data-extraction data-mining data-science markdown markdown-parser markdown-table r r-package r-programming
Last synced: 23 Oct 2025
https://github.com/biglocalnews/bln-python-client
Python client for the biglocalnews.org API
api api-client data data-journalism graphql graphql-client journalism news python
Last synced: 03 Apr 2026
https://github.com/synzen/Discord.Stats
Data visualization for Discord server activities
charts data discord statistics tracking visualization
Last synced: 12 Oct 2025
https://github.com/szczyglis-dev/ultimate-chain-parser
[PHP] Advanced, extendable, and configurable text data parsing and processing toolkit working in a chain-based flow. The concept of the application is based on processing in subsequent iterations using configurable data processing modules in a configured manner. Each element in the execution chain accesses the output of the previous element.
composer-library csv csv-parser data json-parser parsing plugin-architecture processing rearrange-array recordset regex regex-match regex-pattern repack repair-processes reparse text text-generation text-processing yaml-parser
Last synced: 08 Oct 2025
https://github.com/simonsfoundation/spectrum-drug-tracker
Python files and datasets underlying the Spectrum Drug Tracker.
autism data data-visualization python python3
Last synced: 27 Feb 2026
https://github.com/filippobovo/betfair_data
Simple script to collect market data from Betfair.
betfair betfair-api collection data python
Last synced: 27 Feb 2026
https://github.com/cmpadden/dagster-pipes-rust
Dagster pipes implementation in Rust
dagster data integrations orchestration rust
Last synced: 11 Oct 2025
https://github.com/anthonykrivonos/nba-ml
🏀 Hardcoded ML classifiers from scratch to create predictive models on the outcomes of NBA games!
basketball classifiers data fromscratch hardcoded machine-learning ml nba python science sports
Last synced: 08 Oct 2025
https://github.com/shreshthvashisht/imdb-movie-analysis
Advanced MS Excel
data data-analysis-excel pivot-tables visualisation
Last synced: 01 Mar 2026
https://github.com/earthinversion/geospatial-data-visualization-using-pygmt
Example script to visualize topographic data, earthquake data, and tomographic data on a map
data geophysics pygmt python3 seismology visualization
Last synced: 10 Apr 2025
https://github.com/buabaj/byte
data transmission over sound modulator-demodulator model
Last synced: 08 Oct 2025
https://github.com/fdmorison/tiozin
Tiozin, your friendly ETL framework
data declarative etl framework pipeline
Last synced: 26 Apr 2026
https://github.com/m-dadej/downloading-and-aggregating-stocks
Scripts for downloading WSE/GPW stock prices. Allows for downloading historical price for every stock into a single dataset
data finance gpw historical-data stock
Last synced: 29 Apr 2025
https://github.com/datasets/genome-sequencing-costs
Costs associated with DNA sequencing since 2001
Last synced: 19 Oct 2025
https://github.com/claudiucreanga/data-science
Data Science notebooks
competitions data kaggle science
Last synced: 14 Oct 2025
https://github.com/gematik/spec-isip
FHIR resources for information technology systems in nursing care (ISiP – Informationstechnische Systeme in der pflegerischen Versorgung) are determined through the affirmative action process of the same name. Through ISiP, open and standardized interfaces are defined for the interoperable exchange of health data in care.
Last synced: 03 Mar 2026
https://github.com/tiramizoo/simple_data_migrations
Data migrations
data hacktoberfest migration rails ruby
Last synced: 12 Oct 2025
https://github.com/zeybek/node-matlab
NodeJS Package for MATLAB
algebra analytics data matlab matrix signal-processing
Last synced: 13 Mar 2026
https://github.com/colour-science/colour-demosaicing-examples-datasets
Colour - Demosaicing - Examples Datasets
color color-science color-space color-spaces colorspace colorspaces colour colour-science colour-space colour-spaces colourspace colourspaces data dataset datasets de-mosaicing debayering demosaicing demosaicking raw
Last synced: 27 Feb 2026
https://github.com/caerbannogwhite/aargh
A library that helps you out of data nightmares in Go. 🧙♂️
csv data data-science data-wrangling dataframe go golang html json linq statistics stats xlsx xpt
Last synced: 14 Jan 2026
https://github.com/zq99/optionsview
This library downloads option chain data for a given symbol from yahoo finance in a trader friendly format.
data options options-trading trading yahoo-finance
Last synced: 14 Jan 2026
https://github.com/dathere/qsvpro.dathere.com
🌐 Promo website for qsv pro, a spreadsheet data wrangling desktop app. Includes download links for Windows, macOS, & Linux. Website built with Astro as a static site.
astro ckan csv data data-wrangling framer-motion javascript product qsv react saas tailwindcss website
Last synced: 28 Feb 2026
https://github.com/insightsoftwareconsortium/rirewebsite
Website sources for The Retrospective Image Registration Evaluation Project (RIRE)
data grand-challenge imaging open-access open-science registeration
Last synced: 12 Oct 2025
https://github.com/jetsly/ddrx
A lightweight front-end framework based on rxjs. (Inspired by camel)
Last synced: 13 Oct 2025
https://github.com/cafali/pathscan
PathScan exports information about the contents of directories and hard drives. With a single click, you can create a complete list of all files and paths within a specific folder or across an entire hard drive.
backup command-line data data-analysis data-migration data-mining data-recovery directory folder-management folders forensics hard-drive keyword-extraction logging pathfinding recovery string-search tools utility windows
Last synced: 10 Oct 2025
https://github.com/freeipcc/freedatascrm
工商数据,电话获客,智能客户关系管理,数据驱动营销,自动化销售线索,B2B营销,客户洞察分析,精准营销!
ai bigdata bigdataanalytics data scrm
Last synced: 08 Feb 2026
https://github.com/adityashrm21/exploratory_data_analysis
A collection of exploratory data analysis techniques and resources
data data-analysis data-exploration data-science data-visualization dataset datasets eda exploratory-data-analysis insights kaggle
Last synced: 29 Apr 2025
https://github.com/cmstatr/cmstatr
An R Package for Statistical Analysis of Composite Material Data
composite-material-data cran data materials-science r statistical-analysis statistics
Last synced: 22 Oct 2025
https://github.com/iondv/report
IONDV. Framework: Report module is to form the analytical reports.
analytics businessintelligence css data data-analysis data-visualization iondv iondv-module reporting
Last synced: 12 Mar 2026
https://github.com/route1io/route1io-python-connectors
Connectors for interacting with popular APIs and services used in marketing analytics via clean and concise Python code.
analytics api api-connector data data-engineering marketing marketing-analytics python python3
Last synced: 13 Apr 2026
https://github.com/dotflow-io/dotflow
🎲 Business Logic Code in a flow!
data data-structures database dataflow dataflow-programming etl etl-framework etl-pipeline flow python python3 workflow workflow-engine
Last synced: 11 Apr 2026
https://github.com/reiniervlinschoten/castoredc_api
Python Wrapper for Castor EDC API
castor-edc castor-edc-api clinical-research clinical-trials data data-science python3 wrapper-api
Last synced: 15 Oct 2025
https://github.com/spatialcurrent/go-math
Math functions that support varied types
Last synced: 29 Jan 2026
https://github.com/koffisani/coding-data-togo
Données sur les langages et outils de développement utilisés ou sollicités au Togo
data python python3 scrapy scrapy-crawler
Last synced: 26 Mar 2025
https://github.com/giscience/osm-transform
Filter, enrich and prepare your OSM data for openrouteservice 🚙
cleanup data elevation enrichment filter graphs openrouteservice openstreetmap pbf routing
Last synced: 01 Apr 2026
https://github.com/siongui/7rsk9vjkm4p8z5xrdtqc
Pāli chanting resources and dhammatalk books
Last synced: 19 Jan 2026
https://github.com/matheusfelipeog/filometro
Obtenha os dados dos postos de vacinação da covid-19 em São Paulo
coronavirus covid-19 data de-olho-na-fila filometro python sao-paulo vacina vacinasampa wrapper
Last synced: 07 Oct 2025
https://github.com/adhar-io/adhar
ADHAR - The Open Cloud-Native Foundation
adhar adhar-patform ai analytics architecture cloud-native data developerexperience devops enterprise gitops governance helm idp k8s kubernetes microservices rapid-development security
Last synced: 25 Feb 2026
https://github.com/franloza/running-races-insights
Web application created with Evidence and DuckDB to share stats about the running races in Cuenca.
data dataengineering duckdb elt evidence markdown netlify running sql visualization
Last synced: 23 Jun 2026
https://github.com/gematik/spec-templateforsimplifierprojects
Template for creating gematik FHIR profiles
data fhir fsh miscellaneous template
Last synced: 25 Feb 2026
https://github.com/kevinrecuerda/recshark
:shark: Provide some C# tools
actions data di dotnet expression-evaluator netcore testing tools
Last synced: 10 May 2026
https://github.com/tuanai-vireox/dataplatform-stack
How to build a complete Data Platform -> Here
airflow cdc data data-warehouse datalake dataplatform dbt flink k8s kafka spark-streaming
Last synced: 22 Aug 2025
https://github.com/vishnu-t-r/data-analytics-portfolio-projects
This repository contain data analyst portfolio projects developed using various data analytics tools including SQL, Python, Tableau, Looker etc.
data data-analysis data-cleaning data-modeling data-visualization looker looker-studio python sql ssms tableau
Last synced: 23 Apr 2025
https://github.com/zillow/intake-dal
Dataset abstraction over disparate storage systems (eg: bulk, streaming, serving, ...).
Last synced: 23 Apr 2025
https://github.com/unicef/magasin
Cloud native open-source end-to-end data / AI / ML platform
cloud dagster data data-pipelines data-science data-visualization helm-charts kubernetes magasin
Last synced: 21 Apr 2025
https://github.com/DSCmatter/TechNews-API
This API allows you to retrieve tech news from various sources around the world using simple GET commands.
api api-rest backend-api data express javascript newsapi nodejs
Last synced: 19 Aug 2025
https://github.com/weavechain/weave-py-api
Weavechain Python API
confidential-compute data data-replication distributed-computing homomorphic-encryption layer-0 mpc self-sovereign verifiable-credentials weavechain zero-knowledge-proofs
Last synced: 14 Jan 2026
https://github.com/tmsalab/edmdata
Supplementary data package for the edm package
cognitive-diagnostic-models data edm r
Last synced: 15 Aug 2025
https://github.com/navchandar/file-convertor-utils
Set of custom Python Utilities to convert one file format into another. Filetypes supported: Excel, Images, PDF, GIF, MP4, XML, etc.
conversions convertor-utils data dataconversion excel file-conversion fileconversion fileformats image pdf python video xml
Last synced: 21 Sep 2025
https://github.com/RDeconomist/RDeconomist.github.io
RapidCharts - a site for teaching and demonstrating Data Science and Visualisation techniques
data data-science data-visualization economics politics sports
Last synced: 08 Apr 2025
https://github.com/stefen-taime/kafka-pipeline
In the following post, we will learn how to build a data pipeline using a combination of open-source software (OSS), including Debezium, Apache Kafka, Kafka Connect.
bash data docker elasticsearch etl-pipeline k kafka kafka-connect kafka-streams kafka-topic kibana ksqldb masking mongodb mysql pii pipeline postgresql
Last synced: 15 Apr 2025
https://github.com/tinymins/luadata
This is a javascript(js) npm package that can serialize array and object to Lua table, or unserialize Lua table to array and object.
data javascript js lua luadata
Last synced: 24 Apr 2025
https://github.com/extratone/routinehubreport
Maintaining a git-tracked record of analytics-esque data for my RoutineHub Library generated by Martin de Boer's exceptional Routinehub Report Siri Shortcut.
analytics automation data routinehub siri-shortcuts
Last synced: 03 Sep 2025
https://github.com/bjascob/pythondataserve
A module for serving up python data in a stand-alone process.
Last synced: 23 Apr 2025
https://github.com/gregyjames/mapperic
Automatically generate DTO Classes and AutoMapper Configurations.
automapper automapper-profiles code-generation csharp data dotnet dotnet-core dto dto-entity-mapper dto-generator dto-mapper dto-pattern object-oriented rosyln syntax-analysis syntax-tree
Last synced: 07 May 2025
https://github.com/bkuhlmann/lode
A monadic store of marshaled objects.
data objects persistence pstore storage transactions value
Last synced: 29 Jul 2025
https://github.com/longnguyen010203/ecommerce-elt-pipeline
🌄📈📉 A Data Engineering Project 🌈 that implements an ELT data pipeline using Dagster, Docker, Dbt, Polars, Snowflake, PostgreSQL. Data from kaggle website 🔥
dagster data data-engineering dbt docker docker-compose dockerfile elt elt-pipeline extract kaggle load polars postgresql raw-data relational-databases snowflake transform
Last synced: 27 Feb 2026
https://github.com/samashi47/ml-toolkit-project
A general-purpose toolkit for data preprocessing, machine learning modeling, and visualization.
classification data data-preprocessing machine-learning python3 visualization
Last synced: 30 Jul 2025
https://github.com/aircloud/use-groot
React Hooks for Data Fetching
cache data data-fetching fetch hook hooks react react-native stale-while-revalidate suspense swr
Last synced: 30 Jul 2025
https://github.com/pottekkat/bulldozer-prize-predictions
Predict the auction sale price for a piece of heavy equipment to create a "blue book" for bulldozers.
bluebook bulldozer data data-science jupyter-notebook kaggle-competition machine-learning
Last synced: 20 Jun 2026
https://github.com/stdlib-js/array-complex64
Complex64Array.
array cmplx complex complex64 complex64array data float imag imaginary javascript node node-js nodejs real stdlib structure typed typed-array types
Last synced: 09 Apr 2025
https://github.com/cgivre/drill-geoip-functions
GeoIP Functions for Apache Drill
apache-drill city country data data-analysis data-science drill geoip-functions ip-address ipv4
Last synced: 12 Apr 2025
https://github.com/jesusgraterol/binance-futures-dataset-builder
The dataset builder script extracts the most relevant market data straight from Binance's API and builds a series of datasets that can be used in data science and machine learning projects.
bitcoin blockchain blockchain-technology data datascience datascience-machinelearning dataset dataset-generation futures futures-long-short futures-market machine-learning
Last synced: 06 Mar 2026
https://github.com/haroldeustaquio/analysis-of-oncological-diseases-in-peru
This repository presents an analysis of care related to the 7 most frequent cancers in 2022, compiling open data provided by FISSAL. This report includes a breakdown of consultations by department, a patient profile by age, insurance and sex, as well as the most requested types of services. The findings are complemented by an interactive dashboard
analysis data diseases powerbi r sql sqlite
Last synced: 03 Apr 2025
https://github.com/orfium/s3-parquetifier
This is a tool that takes a file from an S3 bucket and transforms it to Parquet format
Last synced: 12 Apr 2025
https://github.com/wzbsocialsciencecenter/gemeindeverzeichnis
Python-Modul zum Einlesen von Gemeindeverzeichnisdaten des Statistischen Bundesamts als pandas DataFrame
administrative-data data germany pandas pandas-dataframe python
Last synced: 12 Apr 2025
https://github.com/flintsh/outlier-tools
A collection of free open-source tools to help you better understand your Outlier account, entirely handled in-browser.
Last synced: 27 Feb 2025
https://github.com/dsietz/pbd
Privacy by Design SDK
actix-web best-practices data data-privacy development-kit nfjs pbd pbd-sdk privacy privacy-by-design rust rust-lang sdk sdk-rust strategies
Last synced: 09 Apr 2025
https://github.com/wzbsocialsciencecenter/mdb-twitter-network
Twitter network of members of the 19th German Bundestag
bundestag data network-analysis r scraping twitter
Last synced: 12 Apr 2025