data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-02-03 00:08:04 UTC
- JSON Representation
https://github.com/blenderskool/lexico
🔭 Powerful data searching with terse syntax
data dsl extended fuzzy hacktoberfest indexes logic operators search
Last synced: 12 Apr 2025
https://github.com/jravi2/chat-analyzer
A python program to analyze your Social Chats
Last synced: 12 Apr 2025
https://github.com/agile-lab-dev/governance-decision-record
The Governance Decision Record (GDR) is a specification model for (computational) data governance policies inspired from the ADR (Architectural Decision Record).
architectural-decision-records data data-governance data-management data-management-platform data-mesh federated-computational-governance governance-decision-record platform policy-as-code
Last synced: 30 Jun 2025
https://github.com/benkingcode/react-data-fetching-components
♻️ Asynchronously load data for your React components with SSR
async code-splitting data loading react server-side-rendering ssr
Last synced: 14 Apr 2025
https://github.com/lungben/tableio.jl
A glue package for reading and writing tabular data. It aims to provide a uniform api for reading and writing tabular data from and to multiple sources.
arrow csv data data-science database dataframe dataframes excel jdf json-format parquet postgresql sqlite zip
Last synced: 12 Oct 2025
https://github.com/dimitryzub/google-finance-py
Scripts for scraping Google Finance data in Python.
data data-analysis data-mining finance financial-data google google-finance market-data python python-programming python-script scraper scraping stock stock-market stocks web-scraping webscraping
Last synced: 20 Jun 2025
https://github.com/hoangsonww/standard-deviation-calculator
📊 This repository contains a Standard Deviation Calculator implemented in C++. It provides an efficient algorithm for calculating the statistical standard deviation of a dataset, making it a valuable tool for students, researchers, and analysts seeking a reliable method for data analysis.
algorithms cplusplus cpp data data-analysis data-analytics data-science standard-deviation standard-deviation-calculator standard-deviations
Last synced: 22 Sep 2025
https://github.com/lazarus-org/dj-data-generator
A package for generating data
data django django-packages fake-data python
Last synced: 28 Jun 2025
https://github.com/thorium/fsharp.data.jsonprovider.serializer
Plugin to replace FSharp.Data.JsonProvider default serialization with fast System.Text.Json
Last synced: 23 Jul 2025
https://github.com/dotnet-ad/Faker.Portable
C# faked data generation for testing and prototyping purpose.
Last synced: 27 Jun 2025
https://github.com/rxn4chemistry/rxn-reaction-preprocessing
Preprocessing of datasets of chemical reactions: standardization, filtering, augmentation, tokenization, etc.
chemistry data processing reactions retrosynthesis rxn
Last synced: 16 Jan 2026
https://github.com/rousan/samples-viewer-generator
:tada: A CLI utility tool to generate web app of data visualization samples for presentation purpose
boilerplate chart cli data generator viz
Last synced: 13 Jul 2025
https://github.com/lmangani/videoparquet
Python library for converting Parquet files (containing array-like or tabular data) to video files and back using ffmpeg
Last synced: 07 Oct 2025
https://github.com/ask-sage/asksage-open-source-community
🚀 Welcome to AskSage Open Source Repo, a community aimed at providing code 🧑💻, documentation📝, and proof-of-concept projects for interacting with the AskSage API. This repository serves as an unofficial resource for paid subscribers seeking practical guidance on utilizing the API. If you have any questions email us asksage.community@gmail.com
agnostic-llm api artificial-intelligence asksage chatgpt cohere data gemini generative-ai government large-language-model large-language-models llama machine-learning open-source openai prompt-engineering rag
Last synced: 06 Oct 2025
https://github.com/cveira/social-media-scripting-framework
Social Media Scripting Framework
api cli data data-analysis data-analytics excel facebook linkedin powershell rss-feed social social-media social-network-analysis twitter web-scraping
Last synced: 06 Oct 2025
https://github.com/emptymalei/audiorepr
A python package to represent data using musical notes.
audiolization data data-audiolization data-science
Last synced: 12 Oct 2025
https://github.com/lisad/phaser
The missing layer for complex data batch integration pipelines
data data-integration etl etl-pipeline
Last synced: 23 Apr 2025
https://github.com/anthonydb/data-wrangling-python-nicar-2017
Materials for the NICAR 2017 Data Wrangling with Python hands-on class
agate csvkit data jupyter nicar-2017 nicar17 python
Last synced: 22 Apr 2025
https://github.com/codeforcroatia/open-data
Croatia open data repository (Open Data HR)
data datasets open-data open-datasets opendata
Last synced: 27 Feb 2025
https://github.com/kevinmusgrave/record-keeper
Record experiment data easily
data experiments logging machine-learning pytorch
Last synced: 23 Apr 2025
https://github.com/davdroman/swift-builders
Result builders for Swift and Foundation types
array builder data dictionary resultbuilder set string substring swift
Last synced: 09 Apr 2025
https://github.com/floriank13/skyimages
Download and manage sky images for solar power forecasting.
data dataset datasets energy forecasting machine-learning python python-package pytorch
Last synced: 13 May 2025
https://github.com/klaudiosinani/kiu
FIFO Queues for ES6
data es6 fifo queue structure typescript
Last synced: 24 Apr 2025
https://github.com/base-repos/base-pkg
Base plugin for adding a `pkg` object with get/set methods for getting data from package.json or setting data to package.json.
config data libary module npm package package-json project store
Last synced: 07 May 2025
https://github.com/electricbolt/bindkit
Two-way data binding framework for iOS. Only one API to learn.
binding data ios objective-c reactive rxswift swift uikit
Last synced: 04 May 2025
https://github.com/dawidolko/algorithms-data-structures
Tasks studies - laboratory
data java labs structures works
Last synced: 06 Apr 2025
https://github.com/hgraph-os/hgraph-react
(Note: Use main hGraph repo) An open source visualization for patient health data, as a React component using d3.
boston care-plan data design health-data healthcare hgraph open-source patient-health-data ux visualization wellness
Last synced: 12 Jan 2026
https://github.com/KevinMusgrave/record-keeper
Record experiment data easily
data experiments logging machine-learning pytorch
Last synced: 08 May 2025
https://github.com/newrelic-experimental/newrelic-logs-for-salesforce-commerce-cloud
A Docker image purpose-built to monitor Salesforce Commerce Cloud (fka Demandware) logs using New Relic Logs.
data data-acquisition new-relic-logs newrelic nrlabs nrlabs-odp observability-data salesforce-commerce-cloud
Last synced: 10 Apr 2025
https://github.com/travishorn/diabetes-food-database
Food information for people with diabetes.
data database diabetes diabetic food food-safety information medical
Last synced: 14 Apr 2025
https://github.com/ipeagit/aopdata
Download data from the Access to Opportunities Project (AOP)
data r transport transport-accessibility transport-planning
Last synced: 02 May 2025
https://github.com/aria-ml/dataeval
Python library for analyzing data quality and its impact on model performance across classification and object-detection tasks.
ai bias-detection data linter metrics out-of-distribution-detection outlier-detection sufficiency
Last synced: 16 Jan 2026
https://github.com/wmo-raf/adl
Automated Weather Observation Data Collection and Processing System
adl automated-data-loader automatic-weather-station aws climate-data climate-services data data-collection-and-processing data-transmission weather-data weather-station wis2 wis2box
Last synced: 17 Jan 2026
https://github.com/amir9ume/urdu_ghazals_rekhta
Dataset for Urdu Ghazals
data dataset language-model machine-learning nlp parser rekhta urdu
Last synced: 13 May 2025
https://github.com/jonschlinkert/merge-configs
Find, load and merge JSON and YAML config settings from one or more files, in the specified order.
combine conf config configuration data eslint find jonschlinkert lookup merge namespace node nodejs object package rc runtime-config search store
Last synced: 26 Jun 2025
https://github.com/flowforfrank/d3-bubble
💭 Bubble chart created with D3.js
bubble-chart d3 d3js data data-visualization javascript svg tutorial webtips
Last synced: 09 Oct 2025
https://github.com/ineelhere/clintrialx
R package to fetch and explore clinical trials data from freely available registries. Fetch data in bulk, customize data and build comprehensive html reports. Currently, it supports the ClinicalTrials.gov registry and CTTI AACT (Access to Aggregate Content of ClinicalTrials.gov).
aact bioinformatics clinical-data clinical-trials clinicaltrialsgov ctti data data-management medical-informatics r-language r-package trials
Last synced: 28 Oct 2025
https://github.com/moscarde/foliumtools
Um breve guia de como utilizar ferramentas básicas do Folium para gerar mapas interativos através de dados geográficos em conjunto com Python, Pandas e GoogleMaps.
data data-visualization folium googlemaps-api maps pandas phyton
Last synced: 14 Apr 2025
https://github.com/randomfractals/tabular-data-viewer
Tabular Data Viewer 🀄 VSCode extension for viewing very large local and remote CSV and TSV data files with Tabulator Table, Perspective Datagrid and D3FC Chart Views 📊📈
charts csv d3fc data data-packages datapackage dsv flat-data large-data perspective remote-data tabular tabulator tsv view viewer vscode
Last synced: 11 Apr 2025
https://github.com/the-akira/datascience
Coleção de recursos sobre Ciência de Dados com Python.
data data-analysis data-science data-structures data-visualization machine-learning machine-learning-algorithms mathematics pandas pandas-dataframe portuguese-language python3 scikit-learn statistics sympy
Last synced: 07 May 2025
https://github.com/ronin-co/client
Access RONIN via TypeScript.
bun client data javascript js library node nodejs query ts typescript
Last synced: 10 Apr 2025
https://github.com/streamr-dev/datav2
The second incarnation of the DATA token
Last synced: 13 Jul 2025
https://github.com/warisgill/feddefender
FedDefender is a novel defense mechanism designed to safeguard Federated Learning from the poisoning attacks (i.e., backdoor attacks).
backdoor-attacks data differential-testing federated-learning poisoning-attack
Last synced: 21 Mar 2025
https://github.com/dhhruv/stock-price-prediction
A deep learning project in which the model was trained using LSTM layers and Tata Stock prices were predicted and compared with thier actual values.
algorithm cli college-project data data-science dataset deep-learning jupyter jupyter-notebook lstm machine-learning prediction science shell stock-price-prediction tata-beverages terminal
Last synced: 03 May 2025
https://github.com/stdlib-js/ndarray
Multidimensional arrays.
array buffer data javascript matrix multidimensional namespace ndarray node node-js nodejs ns stdlib structures tensor typed typed-array types vector
Last synced: 06 Apr 2025
https://github.com/kyryl-opens-ml/ml-in-production-practice
Practice for Machine Learning in Production course
data inference-api infrastructure llm ml mlops monitoring pipelines platform
Last synced: 16 Jan 2026
https://github.com/erichenry/swagger-data-gen
Tool to generate random data from a Sagger/OpenAPI spec
cli data generator mock-data openapi openapi-specification random swagger tool
Last synced: 14 Apr 2025
https://github.com/benjamincharity/angular-json-calendar
:calendar: An AngularJS module that generates calendar data as a JSON object and/or HTML :muscle:
Last synced: 22 Mar 2025
https://github.com/tuanai-vireox/gcp-professional-data-engineer
GCP Professional Data Engineer Certification- Learning
data dataengineering gcp professional
Last synced: 22 Aug 2025
https://github.com/anishkumar127/data-structures-and-algorithms
Solutions to Arrays, Strings, Lists, Sorting, Stacks, Trees and General DS problems using JAVA.
algorithms algorithms-and-data-structures codestudio-solutions data data-structures gfg-solutions hackerrank-solutions hacktoberfest hacktoberfest2022 hacktoebrfest-accepted java leetcode leetcode-java leetcode-solution leetcode-solutions notes pdf solutions
Last synced: 31 Jul 2025
https://github.com/viraltux/datawrangler.jl
Data transformation tools for analytics
data transformations wrangling
Last synced: 15 Aug 2025
https://github.com/bst04/tools-for-data-recovery
All tools for Data Recovery
data data-recovery programs recovery tools
Last synced: 29 Jun 2025
https://github.com/kachayev/timely0
Minimalistic implementation of Naiad paper "A Timely Dataflow System" in Scala
data dataflow dataflow-programming graph timely
Last synced: 12 Apr 2025
https://github.com/nas5w/imdb-data
A JSON file of 50,000 IMDB movie reviews to be used in machine learning applications.
data data-science imdb javascript machine-learning
Last synced: 19 Apr 2025
https://github.com/williamtroup/tree.js
🌲 A lightweight JavaScript library that allows you to create responsive and customizable interactive tree diagrams from an array of JS objects.
css3 data grid html5 javascript map tree treemap visualization
Last synced: 15 Apr 2025
https://github.com/ludbek/validatex
A simple yet powerful data validator for javascript.
async data javascript schema validator
Last synced: 29 Jul 2025
https://github.com/spatialcurrent/go-simple-serializer
Simple library and command line program for converting between JSON, YAML, TOML, and many more common serialization formats.
Last synced: 29 Jan 2026
https://github.com/cfpb/cfpb-chart-builder
Charts for the Consumer Financial Protection Bureau
Last synced: 09 Apr 2025
https://github.com/dimitryzub/ecommerce-scraper-py
Scrape ecommerce websites such as Amazon, eBay, Walmart, Home Depot, Google Shopping from a single module in Python🐍
data datamining ecommerce ecommerce-website python python3 selectolax selenium serpapi webscraper webscraping
Last synced: 03 Sep 2025
https://github.com/the-swarm-corporation/backtesteragent
An enterprise-grade AI-powered backtesting framework built on the Swarms framework for automated trading strategy validation and optimization.
ai backtesting data finance finance-agents jpmorgan yahoofinance
Last synced: 13 Oct 2025
https://github.com/mzjp2/kedro-dataframe-dropin
A Kedro plugin that provides pandas dropin replacements for the pandas datasets (e.g modin and cuDF)
data gpu-acceleration kedro-catalog kedro-plugin modin rapidsai
Last synced: 09 Apr 2025
https://github.com/purarue/autotui
quickly create UIs to interactively prompt, validate, and persist python objects to disk (JSON/YAML) and back using type hints
cli data deserialization json namedtuple serialization tui typehints
Last synced: 16 Mar 2025
https://github.com/jcrodriguez1989/thesimpsons
Package (dataset): The Simpsons episodes dataset
Last synced: 26 Oct 2025
https://github.com/ethicnology/blockchain-ekstrakto
Blockchain ekstrakto is a Python program which extracts all Bitcoin blockchain data using Bitcoin Core.
Last synced: 11 Oct 2025
https://github.com/abcnews/census-100-people
Census 2016: This is Australia as 100 people
australia census data visualisation
Last synced: 27 Jan 2026
https://github.com/philss/brazil-in-notebooks
A collection of notebooks presenting data about Brazil.
brazil data data-visualization elixir livebook vega-lite
Last synced: 13 Oct 2025
https://github.com/bradlindblad/cheatsheet
A simple package to grab cheat sheets and save them to your local computer
cheatsheets data datascience r
Last synced: 22 Oct 2025
https://github.com/hodur-org/hodur-graphviz-schema
Hodur is a domain modeling approach and collection of libraries to Clojure. By using Hodur you can define your domain model as data, parse and validate it, and then either consume your model via an API or use one of the many plugins to help you achieve mechanical results faster and in a purely functional manner.
clojure data modeling schema visualization
Last synced: 12 Dec 2025
https://github.com/huemulsolutions/huemul-bigdatagovernance
Huemul BigDataGovernance, es una framework que trabaja sobre Spark, Hive y HDFS. Permite la implementación de una estrategia corporativa de dato único, basada en buenas prácticas de Gobierno de Datos. Permite implementar tablas con control de Primary Key y Foreing Key al insertar y actualizar datos utilizando la librería, Validación de nulos, largos de textos, máximos/mínimos de números y fechas, valores únicos y valores por default. También permite clasificar los campos en aplicabilidad de derechos ARCO para facilitar la implementación de leyes de protección de datos tipo GDPR, identificar los niveles de seguridad y si se está aplicando algún tipo de encriptación. Adicionalmente permite agregar reglas de validación más complejas sobre la misma tabla.
bigdata chile cloudera data data-engineer data-engineering data-governance data-warehouse datamart dataquality gdpr hadoop hive hortonworks huemul huemul-bigdatagovernance parquet spark spark-sql trabaja-sobre-spark
Last synced: 26 Apr 2025
https://github.com/fairdataihub/fairdataihub.org
Website of the FAIR Data Innovations Hub
data fair nextjs organization software typescript
Last synced: 27 Jan 2026
https://github.com/jacquietran/wnblr
An R package containing game stats from the Women's National Basketball League (WNBL).
basketball data r r-package wnbl
Last synced: 06 Oct 2025
https://github.com/hyriver/pydaymet
A part of HyRiver software stack for retrieving and post-processing climate data from the Daymet Webservice.
climate data daymet hydrology python webservice
Last synced: 12 Dec 2025
https://github.com/bfolkens/pandas-datareader-gdax
GDAX data for Pandas in the style of DataReader
bitcoin cryptocurrency data data-analysis dataset finance gdax pandas quant
Last synced: 28 Jan 2026
https://github.com/underlay/tasl
An algebraic data model for strongly typed semantic data
adt algebraic-data-types category-theory data data-model graph-data knowledge-graph property-graph relational-model typescript
Last synced: 09 Oct 2025
https://github.com/richardlitt/ebird-ext
Tools for doing stuff with eBird data
birding birds community-science data ebird science
Last synced: 27 Oct 2025
https://github.com/nishkarshraj/cpp-programming-with-data-structures
Advanced Data Structure using C programming
c cpp cpp-library data data-structures devops git github object-oriented-programming oops oops-in-cpp sorting-algorithms standard-template-library
Last synced: 22 Apr 2025
https://github.com/justintime50/dad-node
Dummy Address Data (DAD) - Retrieve real addresses from all around the world. (Node Client Library)
address addresses dad data dataset dummy dummy-data real retrieving-addresses world
Last synced: 30 Apr 2025
https://github.com/lamm-mit/moleculediffusiontransformer
Molecular generation using diffusion models and autoregressive transformer models
ai chemistry data design dft generative modeling molecular-design quantum-mechanics
Last synced: 13 Apr 2025
https://github.com/aradfarahani/Geoelectricspy
This project presents an interactive 3D visualization of subsurface resistivity using geoelectric data. It utilizes Python and its scientific libraries to process and visualize the data across multiple depths, ranging from 1 meter to 464 meters.
data goelectrics visualization
Last synced: 28 Mar 2025
https://github.com/simranjeet97/top-machine-learning-algorithms-python
This Repository contains the Machine Learning Algorithms with Mathematical Explanation behind them along with Implementation in Python.
data data-analysis data-science data-structures database machine machine-learning machine-learning-algorithms machine-learning-library machine-learning-playlist machinelearning machinelearning-python python python-programming python-script python3 youtube youtube-tutorial youtube-tutorial-series
Last synced: 11 Apr 2025
https://github.com/blackcipher101/spaceye
A tool to decode star spectrum images using OpenCV to make predictions about its charatestics.
data data-visualizations hacktoberfest opencv pysimplegui space
Last synced: 20 Mar 2025
https://github.com/synthesized-io/insight
🧿 Metrics & Monitoring of Datasets
data data-analysis data-science framework insights metrics monitoring python
Last synced: 24 Jun 2025
https://github.com/buccaneerai/rxjs-stats
Moved to @bottlenose/rxstats (https://github.com/buccaneerai/bottlenose)
analytics data data-mining data-science observables reactive rxjs statistics
Last synced: 15 Jul 2025
https://github.com/ikramagix/faussaire
Generate ultra-realistic fake data in French and Greek with credible, context-aware, and culturally relevant options.
data faker-generator france french french-language french-speaking french-translation gem rspec rspec-testing rspec-tests ruby ruby-gem ruby-on-rails rubygem rubygems testing yaml
Last synced: 10 Apr 2025
https://github.com/alemidev/dashboard
my custom data collector and visualizer dashboard
dashboard data egui rust timeseries
Last synced: 29 Oct 2025