data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-06-23 00:07:41 UTC
- JSON Representation
https://github.com/moderocky/fern
A human-readable and modifiable data-expression language with minimal clutter.
configuration data data-structures data-visualization language
Last synced: 18 Sep 2025
https://github.com/cansik/duit
Duit is a Python library that provides a set of tools for working with data in a structured and efficient way.
configuration data gui helper open3d python settings tkinter tool
Last synced: 05 May 2025
https://github.com/wmarquardt/cassandra-csv
A simple way to export cassandra query result to CSV format
Last synced: 18 Mar 2025
https://github.com/khaouitiabdelhakim/etl-real-example
This repository contains a real example of an Extract, Transform, Load (ETL) process using SQL Server Management Studio (SSMS), SQL Server Integration Services (SSIS), and AdventureWorks2012 data. The objective is to load data into our LightAdventureDW data warehouse.
data database database-management sql sql-server ssis ssms warehouse
Last synced: 18 Mar 2025
https://github.com/nishkarshraj/operating-systems-with-c-and-shell
Operating Systems with Shell and C programming
algorithms algorithms-and-data-structures bash c cpp data data-structures file filesystem fork kernel linux open-source opensource operating-system piping scheduler scheduling-algorithms shell shell-scripts
Last synced: 22 Apr 2025
https://github.com/lykmapipo/data-science-learning
Data Science Learning
analytics big-data data data-mining data-visualization data-wrangling gis julia lykmapipo python r scientific-computing statistics
Last synced: 19 Mar 2025
https://github.com/alexcarpenter/us-coffee-roasters
☕️ Crowd-sourced list of US coffee roasters
Last synced: 12 Apr 2025
https://github.com/buabaj/byte
data transmission over sound modulator-demodulator model
Last synced: 08 Oct 2025
https://github.com/szczyglis-dev/ultimate-chain-parser
[PHP] Advanced, extendable, and configurable text data parsing and processing toolkit working in a chain-based flow. The concept of the application is based on processing in subsequent iterations using configurable data processing modules in a configured manner. Each element in the execution chain accesses the output of the previous element.
composer-library csv csv-parser data json-parser parsing plugin-architecture processing rearrange-array recordset regex regex-match regex-pattern repack repair-processes reparse text text-generation text-processing yaml-parser
Last synced: 08 Oct 2025
https://github.com/earthinversion/geospatial-data-visualization-using-pygmt
Example script to visualize topographic data, earthquake data, and tomographic data on a map
data geophysics pygmt python3 seismology visualization
Last synced: 10 Apr 2025
https://github.com/filippobovo/betfair_data
Simple script to collect market data from Betfair.
betfair betfair-api collection data python
Last synced: 27 Feb 2026
https://github.com/simonsfoundation/spectrum-drug-tracker
Python files and datasets underlying the Spectrum Drug Tracker.
autism data data-visualization python python3
Last synced: 27 Feb 2026
https://github.com/cmpadden/dagster-pipes-rust
Dagster pipes implementation in Rust
dagster data integrations orchestration rust
Last synced: 11 Oct 2025
https://github.com/frederickgeek8/lyql
📈 Free realtime stock data. Streamed straight from Yahoo.
data data-mining finance realtime stocks stream stream-api yahoo
Last synced: 05 Mar 2026
https://github.com/dbt-labs/jaffle-shop-mesh-finance
A ✨ meshified ✨ open source sandbox project exploring dbt workflows via a fictional sandwich shop's data. This is a domain-focused node in the mesh focused on finance models, built on the jaffle-shop-mesh-platform project.
analytics analytics-engineering data data-engineering dbt dbt-cloud
Last synced: 05 Mar 2026
https://github.com/heysupratim/android-app-categories
A JSON having 19K Android package name entries with their Play Store Categories. Useful for people looking to create App Category Based things. Eg Smart Launcher
android crawled-data data json
Last synced: 26 Mar 2025
https://github.com/fdmorison/tiozin
Tiozin, your friendly ETL framework
data declarative etl framework pipeline
Last synced: 26 Apr 2026
https://github.com/datasets/genome-sequencing-costs
Costs associated with DNA sequencing since 2001
Last synced: 19 Oct 2025
https://github.com/biglocalnews/bln-python-client
Python client for the biglocalnews.org API
api api-client data data-journalism graphql graphql-client journalism news python
Last synced: 03 Apr 2026
https://github.com/andrewrporter/goiex
A go interface for accessing IEX finanical information
data fetch finance golang iex iex-api iextrading
Last synced: 28 Apr 2025
https://github.com/jrdnbradford/readmdtable
R 📦 for reading markdown tables into tibbles
data data-analysis data-analytics data-extraction data-mining data-science markdown markdown-parser markdown-table r r-package r-programming
Last synced: 23 Oct 2025
https://github.com/synzen/Discord.Stats
Data visualization for Discord server activities
charts data discord statistics tracking visualization
Last synced: 12 Oct 2025
https://github.com/paulgrammer/ug-locale
Uganda districts, sub-counties, counties, parishes and villages
data districts nodejs npm-package uganda
Last synced: 02 Mar 2026
https://github.com/koddachad/dq_tester
A lightweight simple data quality testing tool.
data database dataengineering dataquality dataqualitycheck
Last synced: 08 Oct 2025
https://github.com/mckraqs/dataride
Lightning-fast data platform setup toolkit for small projects and PoCs
data data-engineering python terraform
Last synced: 24 Oct 2025
https://github.com/anthonykrivonos/nba-ml
🏀 Hardcoded ML classifiers from scratch to create predictive models on the outcomes of NBA games!
basketball classifiers data fromscratch hardcoded machine-learning ml nba python science sports
Last synced: 08 Oct 2025
https://github.com/m-dadej/downloading-and-aggregating-stocks
Scripts for downloading WSE/GPW stock prices. Allows for downloading historical price for every stock into a single dataset
data finance gpw historical-data stock
Last synced: 29 Apr 2025
https://github.com/abcnews/data-australian-political-donations
A data package about political donations in Australia.
Last synced: 27 Jan 2026
https://github.com/shreshthvashisht/imdb-movie-analysis
Advanced MS Excel
data data-analysis-excel pivot-tables visualisation
Last synced: 01 Mar 2026
https://github.com/insightsoftwareconsortium/rirewebsite
Website sources for The Retrospective Image Registration Evaluation Project (RIRE)
data grand-challenge imaging open-access open-science registeration
Last synced: 12 Oct 2025
https://github.com/adityashrm21/exploratory_data_analysis
A collection of exploratory data analysis techniques and resources
data data-analysis data-exploration data-science data-visualization dataset datasets eda exploratory-data-analysis insights kaggle
Last synced: 29 Apr 2025
https://github.com/claudiucreanga/data-science
Data Science notebooks
competitions data kaggle science
Last synced: 14 Oct 2025
https://github.com/spatialcurrent/go-math
Math functions that support varied types
Last synced: 29 Jan 2026
https://github.com/freeipcc/freedatascrm
工商数据,电话获客,智能客户关系管理,数据驱动营销,自动化销售线索,B2B营销,客户洞察分析,精准营销!
ai bigdata bigdataanalytics data scrm
Last synced: 08 Feb 2026
https://github.com/iondv/report
IONDV. Framework: Report module is to form the analytical reports.
analytics businessintelligence css data data-analysis data-visualization iondv iondv-module reporting
Last synced: 12 Mar 2026
https://github.com/jetsly/ddrx
A lightweight front-end framework based on rxjs. (Inspired by camel)
Last synced: 13 Oct 2025
https://github.com/reiniervlinschoten/castoredc_api
Python Wrapper for Castor EDC API
castor-edc castor-edc-api clinical-research clinical-trials data data-science python3 wrapper-api
Last synced: 15 Oct 2025
https://github.com/colour-science/colour-demosaicing-examples-datasets
Colour - Demosaicing - Examples Datasets
color color-science color-space color-spaces colorspace colorspaces colour colour-science colour-space colour-spaces colourspace colourspaces data dataset datasets de-mosaicing debayering demosaicing demosaicking raw
Last synced: 27 Feb 2026
https://github.com/matheusfelipeog/filometro
Obtenha os dados dos postos de vacinação da covid-19 em São Paulo
coronavirus covid-19 data de-olho-na-fila filometro python sao-paulo vacina vacinasampa wrapper
Last synced: 07 Oct 2025
https://github.com/kevinrecuerda/recshark
:shark: Provide some C# tools
actions data di dotnet expression-evaluator netcore testing tools
Last synced: 10 May 2026
https://github.com/cafali/pathscan
PathScan exports information about the contents of directories and hard drives. With a single click, you can create a complete list of all files and paths within a specific folder or across an entire hard drive.
backup command-line data data-analysis data-migration data-mining data-recovery directory folder-management folders forensics hard-drive keyword-extraction logging pathfinding recovery string-search tools utility windows
Last synced: 10 Oct 2025
https://github.com/zq99/optionsview
This library downloads option chain data for a given symbol from yahoo finance in a trader friendly format.
data options options-trading trading yahoo-finance
Last synced: 14 Jan 2026
https://github.com/koffisani/coding-data-togo
Données sur les langages et outils de développement utilisés ou sollicités au Togo
data python python3 scrapy scrapy-crawler
Last synced: 26 Mar 2025
https://github.com/adhar-io/adhar
ADHAR - The Open Cloud-Native Foundation
adhar adhar-patform ai analytics architecture cloud-native data developerexperience devops enterprise gitops governance helm idp k8s kubernetes microservices rapid-development security
Last synced: 25 Feb 2026
https://github.com/tiramizoo/simple_data_migrations
Data migrations
data hacktoberfest migration rails ruby
Last synced: 12 Oct 2025
https://github.com/cmstatr/cmstatr
An R Package for Statistical Analysis of Composite Material Data
composite-material-data cran data materials-science r statistical-analysis statistics
Last synced: 22 Oct 2025
https://github.com/caerbannogwhite/aargh
A library that helps you out of data nightmares in Go. 🧙♂️
csv data data-science data-wrangling dataframe go golang html json linq statistics stats xlsx xpt
Last synced: 14 Jan 2026
https://github.com/siongui/7rsk9vjkm4p8z5xrdtqc
Pāli chanting resources and dhammatalk books
Last synced: 19 Jan 2026
https://github.com/gematik/spec-isip
FHIR resources for information technology systems in nursing care (ISiP – Informationstechnische Systeme in der pflegerischen Versorgung) are determined through the affirmative action process of the same name. Through ISiP, open and standardized interfaces are defined for the interoperable exchange of health data in care.
Last synced: 03 Mar 2026
https://github.com/zeybek/node-matlab
NodeJS Package for MATLAB
algebra analytics data matlab matrix signal-processing
Last synced: 13 Mar 2026
https://github.com/dotflow-io/dotflow
🎲 Business Logic Code in a flow!
data data-structures database dataflow dataflow-programming etl etl-framework etl-pipeline flow python python3 workflow workflow-engine
Last synced: 11 Apr 2026
https://github.com/giscience/osm-transform
Filter, enrich and prepare your OSM data for openrouteservice 🚙
cleanup data elevation enrichment filter graphs openrouteservice openstreetmap pbf routing
Last synced: 01 Apr 2026
https://github.com/dathere/qsvpro.dathere.com
🌐 Promo website for qsv pro, a spreadsheet data wrangling desktop app. Includes download links for Windows, macOS, & Linux. Website built with Astro as a static site.
astro ckan csv data data-wrangling framer-motion javascript product qsv react saas tailwindcss website
Last synced: 28 Feb 2026
https://github.com/gematik/spec-templateforsimplifierprojects
Template for creating gematik FHIR profiles
data fhir fsh miscellaneous template
Last synced: 25 Feb 2026
https://github.com/route1io/route1io-python-connectors
Connectors for interacting with popular APIs and services used in marketing analytics via clean and concise Python code.
analytics api api-connector data data-engineering marketing marketing-analytics python python3
Last synced: 13 Apr 2026
https://github.com/cveira/synology-tools
Scripts I've coded to automate routine tasks on my NAS
automation data nas operations replication scripts shell synology system-administration
Last synced: 30 Jul 2025
https://github.com/flintsh/outlier-tools
A collection of free open-source tools to help you better understand your Outlier account, entirely handled in-browser.
Last synced: 27 Feb 2025
https://github.com/effect-deprecated/morphic
Domain Modelling and Structural Derivation (port of morphic-ts)
data domain functional typeclasses
Last synced: 29 Jun 2025
https://github.com/sjefvanleeuwen/rqlite-dotnet
A lightweight database HTTP API client for rqlite. rqlite is a lightweight, distributed relational database, which uses RAFT and SQLite as its storage engine.
cluster data database distributed distributed-computing distributed-database distributed-systems dotnet raft rqlite
Last synced: 12 May 2025
https://github.com/aircloud/use-groot
React Hooks for Data Fetching
cache data data-fetching fetch hook hooks react react-native stale-while-revalidate suspense swr
Last synced: 30 Jul 2025
https://github.com/tomaztk/datasetR
Generate datasets for R projects
data data-frame data-science r-language r-programming sample sample-data sample-data-generator
Last synced: 29 Jul 2025
https://github.com/juliaearth/geoartifacts.jl
Artifacts (e.g., datasets) for Geospatial Data Science
Last synced: 10 Apr 2026
https://github.com/sabyasachi-seal/summer-olympics-data-analysis
Analyzing 2020 Summer Olympic Dataset
analysis colab-notebook data data-science dataset jypyternotebook olympics
Last synced: 08 May 2025
https://github.com/arindal1/striversdsasheet
Solutions of all the problems in Striver's A2Z DSA Sheet
cpp data datastructures datastructures-algorithms striver strivers-sde-sheet
Last synced: 04 Apr 2025
https://github.com/nasa-pds/naif-pds4-bundler
Package to generate PDS4 SPICE Kernels Archives
archive data geometry geometry-processing navigation planetary-data planetary-science python spice
Last synced: 06 Jan 2026
https://github.com/unicef/magasin
Cloud native open-source end-to-end data / AI / ML platform
cloud dagster data data-pipelines data-science data-visualization helm-charts kubernetes magasin
Last synced: 21 Apr 2025
https://github.com/wzbsocialsciencecenter/mdb-twitter-network
Twitter network of members of the 19th German Bundestag
bundestag data network-analysis r scraping twitter
Last synced: 12 Apr 2025
https://github.com/wzbsocialsciencecenter/gemeindeverzeichnis
Python-Modul zum Einlesen von Gemeindeverzeichnisdaten des Statistischen Bundesamts als pandas DataFrame
administrative-data data germany pandas pandas-dataframe python
Last synced: 12 Apr 2025
https://github.com/haroldeustaquio/analysis-of-oncological-diseases-in-peru
This repository presents an analysis of care related to the 7 most frequent cancers in 2022, compiling open data provided by FISSAL. This report includes a breakdown of consultations by department, a patient profile by age, insurance and sex, as well as the most requested types of services. The findings are complemented by an interactive dashboard
analysis data diseases powerbi r sql sqlite
Last synced: 03 Apr 2025
https://github.com/bjascob/pythondataserve
A module for serving up python data in a stand-alone process.
Last synced: 23 Apr 2025
https://github.com/stefen-taime/kafka-pipeline
In the following post, we will learn how to build a data pipeline using a combination of open-source software (OSS), including Debezium, Apache Kafka, Kafka Connect.
bash data docker elasticsearch etl-pipeline k kafka kafka-connect kafka-streams kafka-topic kibana ksqldb masking mongodb mysql pii pipeline postgresql
Last synced: 15 Apr 2025
https://github.com/mooxphp/data
[READ-ONLY] Static Language Data for Filament
countries currencies data filament languages laravel static timezones
Last synced: 20 Feb 2026
https://github.com/extratone/routinehubreport
Maintaining a git-tracked record of analytics-esque data for my RoutineHub Library generated by Martin de Boer's exceptional Routinehub Report Siri Shortcut.
analytics automation data routinehub siri-shortcuts
Last synced: 03 Sep 2025
https://github.com/samber/ansible-role-airbyte
Ansible role for Airbyte
3rd-party airbyte ansible connector data data-analysis data-science data-visualization datawarehouse elt etl incremental integration pipeline replication reverse-etl role saas sync
Last synced: 12 Apr 2025
https://github.com/navchandar/file-convertor-utils
Set of custom Python Utilities to convert one file format into another. Filetypes supported: Excel, Images, PDF, GIF, MP4, XML, etc.
conversions convertor-utils data dataconversion excel file-conversion fileconversion fileformats image pdf python video xml
Last synced: 21 Sep 2025
https://github.com/gregyjames/mapperic
Automatically generate DTO Classes and AutoMapper Configurations.
automapper automapper-profiles code-generation csharp data dotnet dotnet-core dto dto-entity-mapper dto-generator dto-mapper dto-pattern object-oriented rosyln syntax-analysis syntax-tree
Last synced: 07 May 2025
https://github.com/tuanai-vireox/dataplatform-stack
How to build a complete Data Platform -> Here
airflow cdc data data-warehouse datalake dataplatform dbt flink k8s kafka spark-streaming
Last synced: 22 Aug 2025
https://github.com/weavechain/weave-py-api
Weavechain Python API
confidential-compute data data-replication distributed-computing homomorphic-encryption layer-0 mpc self-sovereign verifiable-credentials weavechain zero-knowledge-proofs
Last synced: 14 Jan 2026
https://github.com/vishnu-t-r/data-analytics-portfolio-projects
This repository contain data analyst portfolio projects developed using various data analytics tools including SQL, Python, Tableau, Looker etc.
data data-analysis data-cleaning data-modeling data-visualization looker looker-studio python sql ssms tableau
Last synced: 23 Apr 2025
https://github.com/zillow/intake-dal
Dataset abstraction over disparate storage systems (eg: bulk, streaming, serving, ...).
Last synced: 23 Apr 2025
https://github.com/samashi47/ml-toolkit-project
A general-purpose toolkit for data preprocessing, machine learning modeling, and visualization.
classification data data-preprocessing machine-learning python3 visualization
Last synced: 30 Jul 2025
https://github.com/DSCmatter/TechNews-API
This API allows you to retrieve tech news from various sources around the world using simple GET commands.
api api-rest backend-api data express javascript newsapi nodejs
Last synced: 19 Aug 2025
https://github.com/pottekkat/bulldozer-prize-predictions
Predict the auction sale price for a piece of heavy equipment to create a "blue book" for bulldozers.
bluebook bulldozer data data-science jupyter-notebook kaggle-competition machine-learning
Last synced: 20 Jun 2026
https://github.com/tmsalab/edmdata
Supplementary data package for the edm package
cognitive-diagnostic-models data edm r
Last synced: 15 Aug 2025
https://github.com/bkuhlmann/lode
A monadic store of marshaled objects.
data objects persistence pstore storage transactions value
Last synced: 29 Jul 2025