data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-06-22 00:07:43 UTC
- JSON Representation
https://github.com/anthonydb/data-wrangling-python-nicar-2017
Materials for the NICAR 2017 Data Wrangling with Python hands-on class
agate csvkit data jupyter nicar-2017 nicar17 python
Last synced: 22 Apr 2025
https://github.com/viraltux/datawrangler.jl
Data transformation tools for analytics
data transformations wrangling
Last synced: 15 Aug 2025
https://github.com/kachayev/timely0
Minimalistic implementation of Naiad paper "A Timely Dataflow System" in Scala
data dataflow dataflow-programming graph timely
Last synced: 12 Apr 2025
https://github.com/dimitryzub/ecommerce-scraper-py
Scrape ecommerce websites such as Amazon, eBay, Walmart, Home Depot, Google Shopping from a single module in Python🐍
data datamining ecommerce ecommerce-website python python3 selectolax selenium serpapi webscraper webscraping
Last synced: 03 Sep 2025
https://github.com/anishkumar127/data-structures-and-algorithms
Solutions to Arrays, Strings, Lists, Sorting, Stacks, Trees and General DS problems using JAVA.
algorithms algorithms-and-data-structures codestudio-solutions data data-structures gfg-solutions hackerrank-solutions hacktoberfest hacktoberfest2022 hacktoebrfest-accepted java leetcode leetcode-java leetcode-solution leetcode-solutions notes pdf solutions
Last synced: 31 Jul 2025
https://github.com/purarue/autotui
quickly create UIs to interactively prompt, validate, and persist python objects to disk (JSON/YAML) and back using type hints
cli data deserialization json namedtuple serialization tui typehints
Last synced: 16 Mar 2025
https://github.com/ludbek/validatex
A simple yet powerful data validator for javascript.
async data javascript schema validator
Last synced: 29 Jul 2025
https://github.com/the-swarm-corporation/backtesteragent
An enterprise-grade AI-powered backtesting framework built on the Swarms framework for automated trading strategy validation and optimization.
ai backtesting data finance finance-agents jpmorgan yahoofinance
Last synced: 13 Oct 2025
https://github.com/cfpb/cfpb-chart-builder
Charts for the Consumer Financial Protection Bureau
Last synced: 09 Apr 2025
https://github.com/williamtroup/tree.js
🌲 A lightweight JavaScript library that allows you to create responsive and customizable interactive tree diagrams from an array of JS objects.
css3 data grid html5 javascript map tree treemap visualization
Last synced: 15 Apr 2025
https://github.com/alexanderkasten/use-next-sse
use-next-sse is a lightweight and easy-to-use React hook library for implementing Server-Sent Events (SSE) in Next.js applications, enabling real-time, unidirectional data streaming from server to client.
data events eventsource nextjs react real-time server-sent-events serverless sse streaming
Last synced: 19 Feb 2026
https://github.com/kyryl-opens-ml/ml-in-production-practice
Practice for Machine Learning in Production course
data inference-api infrastructure llm ml mlops monitoring pipelines platform
Last synced: 16 Jan 2026
https://github.com/mzjp2/kedro-dataframe-dropin
A Kedro plugin that provides pandas dropin replacements for the pandas datasets (e.g modin and cuDF)
data gpu-acceleration kedro-catalog kedro-plugin modin rapidsai
Last synced: 09 Mar 2026
https://github.com/bst04/tools-for-data-recovery
All tools for Data Recovery
data data-recovery programs recovery tools
Last synced: 29 Jun 2025
https://github.com/nas5w/imdb-data
A JSON file of 50,000 IMDB movie reviews to be used in machine learning applications.
data data-science imdb javascript machine-learning
Last synced: 19 Apr 2025
https://github.com/moscarde/foliumtools
Um breve guia de como utilizar ferramentas básicas do Folium para gerar mapas interativos através de dados geográficos em conjunto com Python, Pandas e GoogleMaps.
data data-visualization folium googlemaps-api maps pandas phyton
Last synced: 14 Apr 2025
https://github.com/stdlib-js/ndarray
Multidimensional arrays.
array buffer data javascript matrix multidimensional namespace ndarray node node-js nodejs ns stdlib structures tensor typed typed-array types vector
Last synced: 06 Apr 2025
https://github.com/benjamincharity/angular-json-calendar
:calendar: An AngularJS module that generates calendar data as a JSON object and/or HTML :muscle:
Last synced: 22 Mar 2025
https://github.com/ineelhere/clintrialx
R package to fetch and explore clinical trials data from freely available registries. Fetch data in bulk, customize data and build comprehensive html reports. Currently, it supports the ClinicalTrials.gov registry and CTTI AACT (Access to Aggregate Content of ClinicalTrials.gov).
aact bioinformatics clinical-data clinical-trials clinicaltrialsgov ctti data data-management medical-informatics r-language r-package trials
Last synced: 28 Oct 2025
https://github.com/streamr-dev/datav2
The second incarnation of the DATA token
Last synced: 13 Jul 2025
https://github.com/wmo-raf/adl
Automated Weather Observation Data Collection and Processing System
adl automated-data-loader automatic-weather-station aws climate-data climate-services data data-collection-and-processing data-transmission weather-data weather-station wis2 wis2box
Last synced: 17 Jan 2026
https://github.com/warisgill/feddefender
FedDefender is a novel defense mechanism designed to safeguard Federated Learning from the poisoning attacks (i.e., backdoor attacks).
backdoor-attacks data differential-testing federated-learning poisoning-attack
Last synced: 21 Mar 2025
https://github.com/randomfractals/tabular-data-viewer
Tabular Data Viewer 🀄 VSCode extension for viewing very large local and remote CSV and TSV data files with Tabulator Table, Perspective Datagrid and D3FC Chart Views 📊📈
charts csv d3fc data data-packages datapackage dsv flat-data large-data perspective remote-data tabular tabulator tsv view viewer vscode
Last synced: 11 Apr 2025
https://github.com/erichenry/swagger-data-gen
Tool to generate random data from a Sagger/OpenAPI spec
cli data generator mock-data openapi openapi-specification random swagger tool
Last synced: 14 Apr 2025
https://github.com/jonschlinkert/merge-configs
Find, load and merge JSON and YAML config settings from one or more files, in the specified order.
combine conf config configuration data eslint find jonschlinkert lookup merge namespace node nodejs object package rc runtime-config search store
Last synced: 26 Jun 2025
https://github.com/flowforfrank/d3-bubble
💭 Bubble chart created with D3.js
bubble-chart d3 d3js data data-visualization javascript svg tutorial webtips
Last synced: 09 Oct 2025
https://github.com/dhhruv/stock-price-prediction
A deep learning project in which the model was trained using LSTM layers and Tata Stock prices were predicted and compared with thier actual values.
algorithm cli college-project data data-science dataset deep-learning jupyter jupyter-notebook lstm machine-learning prediction science shell stock-price-prediction tata-beverages terminal
Last synced: 03 May 2025
https://github.com/amir9ume/urdu_ghazals_rekhta
Dataset for Urdu Ghazals
data dataset language-model machine-learning nlp parser rekhta urdu
Last synced: 13 May 2025
https://github.com/the-akira/datascience
Coleção de recursos sobre Ciência de Dados com Python.
data data-analysis data-science data-structures data-visualization machine-learning machine-learning-algorithms mathematics pandas pandas-dataframe portuguese-language python3 scikit-learn statistics sympy
Last synced: 07 May 2025
https://github.com/ronin-co/client
Access RONIN via TypeScript.
bun client data javascript js library node nodejs query ts typescript
Last synced: 10 Apr 2025
https://github.com/datalpia/laketower
Oversee your lakehouse
apache-iceberg arrow data deltalake duckdb lakehouse sql
Last synced: 20 Jun 2026
https://github.com/spatialcurrent/go-simple-serializer
Simple library and command line program for converting between JSON, YAML, TOML, and many more common serialization formats.
Last synced: 29 Jan 2026
https://github.com/abcnews/census-100-people
Census 2016: This is Australia as 100 people
australia census data visualisation
Last synced: 27 Jan 2026
https://github.com/ethicnology/blockchain-ekstrakto
Blockchain ekstrakto is a Python program which extracts all Bitcoin blockchain data using Bitcoin Core.
Last synced: 11 Oct 2025
https://github.com/jcrodriguez1989/thesimpsons
Package (dataset): The Simpsons episodes dataset
Last synced: 26 Oct 2025
https://github.com/bfolkens/pandas-datareader-gdax
GDAX data for Pandas in the style of DataReader
bitcoin cryptocurrency data data-analysis dataset finance gdax pandas quant
Last synced: 28 Jan 2026
https://github.com/richardlitt/ebird-ext
Tools for doing stuff with eBird data
birding birds community-science data ebird science
Last synced: 27 Oct 2025
https://github.com/aoemods/attrib
Age of Empires 4 attribute dump, keeping track of patch changes. Converted with AOEMods.Essence. Join our discord on the website link.
age-of-empires-iv aoe4 data information mods stats
Last synced: 23 Feb 2026
https://github.com/bradlindblad/cheatsheet
A simple package to grab cheat sheets and save them to your local computer
cheatsheets data datascience r
Last synced: 22 Oct 2025
https://github.com/hyriver/pydaymet
A part of HyRiver software stack for retrieving and post-processing climate data from the Daymet Webservice.
climate data daymet hydrology python webservice
Last synced: 12 Dec 2025
https://github.com/qzcool/cpef
私募基金管理人查询数据接口。Chinese Private Equity Funds APIs.
china crawler data finance fund funds hedge-funds private-equity python python3 scraper scraping-websites spider
Last synced: 26 Feb 2026
https://github.com/seancolsen/music-theory-data
A data source containing names and details for musical scales, chords, intervals, and notes.
data music music-theory musical-scales musicology
Last synced: 17 Mar 2026
https://github.com/jacquietran/wnblr
An R package containing game stats from the Women's National Basketball League (WNBL).
basketball data r r-package wnbl
Last synced: 06 Oct 2025
https://github.com/underlay/tasl
An algebraic data model for strongly typed semantic data
adt algebraic-data-types category-theory data data-model graph-data knowledge-graph property-graph relational-model typescript
Last synced: 09 Oct 2025
https://github.com/hodur-org/hodur-graphviz-schema
Hodur is a domain modeling approach and collection of libraries to Clojure. By using Hodur you can define your domain model as data, parse and validate it, and then either consume your model via an API or use one of the many plugins to help you achieve mechanical results faster and in a purely functional manner.
clojure data modeling schema visualization
Last synced: 12 Dec 2025
https://github.com/fairdataihub/fairdataihub.org
Website of the FAIR Data Innovations Hub
data fair nextjs organization software typescript
Last synced: 03 May 2026
https://github.com/judehunter/reactivefile
Parse and reactively auto-save JSON, TOML, YAML and any other data file with ease.
data data-structures file filesystem javascript reactive reactive-programming typescript typescript-definitions
Last synced: 15 Apr 2025
https://github.com/kishyassin/goframe
goframe is a Go package inspired by Python's pandas, designed for data manipulation and analysis.
data dataframe go goframe golang good-first-issue package
Last synced: 04 Oct 2025
https://github.com/huemulsolutions/huemul-bigdatagovernance
Huemul BigDataGovernance, es una framework que trabaja sobre Spark, Hive y HDFS. Permite la implementación de una estrategia corporativa de dato único, basada en buenas prácticas de Gobierno de Datos. Permite implementar tablas con control de Primary Key y Foreing Key al insertar y actualizar datos utilizando la librería, Validación de nulos, largos de textos, máximos/mínimos de números y fechas, valores únicos y valores por default. También permite clasificar los campos en aplicabilidad de derechos ARCO para facilitar la implementación de leyes de protección de datos tipo GDPR, identificar los niveles de seguridad y si se está aplicando algún tipo de encriptación. Adicionalmente permite agregar reglas de validación más complejas sobre la misma tabla.
bigdata chile cloudera data data-engineer data-engineering data-governance data-warehouse datamart dataquality gdpr hadoop hive hortonworks huemul huemul-bigdatagovernance parquet spark spark-sql trabaja-sobre-spark
Last synced: 26 Apr 2025
https://github.com/philss/brazil-in-notebooks
A collection of notebooks presenting data about Brazil.
brazil data data-visualization elixir livebook vega-lite
Last synced: 13 Oct 2025
https://github.com/nishkarshraj/cpp-programming-with-data-structures
Advanced Data Structure using C programming
c cpp cpp-library data data-structures devops git github object-oriented-programming oops oops-in-cpp sorting-algorithms standard-template-library
Last synced: 22 Apr 2025
https://github.com/epsoft/explainable
explainable
data database dataset explainable numpy seaborn tensorflow
Last synced: 29 Jul 2025
https://github.com/emreyalvac/sulfur
Shaping, Processing, and Transforming Data with the Power of Sulfur with Rust
data data-analysis data-flow database
Last synced: 19 Aug 2025
https://github.com/domaindrivenarchitecture/data-test
framework for data-driven tests
clojure data test test-driven-development
Last synced: 13 Aug 2025
https://github.com/mgroves/sqlservertocouchbase
Library to automatically best effort move and remodel data from relational databases (like SQL Server) to Couchbase
couchbase data json migration sql sql-server tables
Last synced: 26 Sep 2025
https://github.com/dpguthrie/bankfind
Python interface to the FDIC's API for publically available bank data
api api-wrapper banking data finance pandas python united-states
Last synced: 02 Aug 2025
https://github.com/isoverse/isoreader
Read IRMS (Isotope Ratio Mass Spectrometry) data files into R
data ecology geochemistry isotopes r
Last synced: 19 Feb 2026
https://github.com/prioritizr/aoh
Create Area of Habitat Data
conservation data ecology gis-data rstats rstats-package
Last synced: 15 Apr 2025
https://github.com/fwd/reddit
Graph Visualization UI for Reddit.
data data-science datasets worldnews
Last synced: 24 Apr 2025
https://github.com/giacomopiccinini/rush
Swiss-army knife for media inspection and manipulation
cli data data-engineering multimedia rust
Last synced: 09 Mar 2026
https://github.com/durgeshsamariya/100daysofdatascience
A 100 Day DS Challenge to learn and implement DS concepts ranging from the beginner of Data Science to Data Scientist.
100days 100daysofcode 100daysofdscode 100daysofmlcode data data-science
Last synced: 15 Apr 2025
https://github.com/mitevpi/urban-insights-frontend
Winning AEC Hackathon 2019 Silicon Valley Project. AR/VR Application for visualizing proposed buildings on their sites and overlaying environmental and zoning analysis.
a-frame analysis ar architecture city computation data design rhino smart ui vr vue vuejs
Last synced: 08 Aug 2025
https://github.com/alemidev/dashboard
my custom data collector and visualizer dashboard
dashboard data egui rust timeseries
Last synced: 29 Oct 2025
https://github.com/synthesized-io/insight
🧿 Metrics & Monitoring of Datasets
data data-analysis data-science framework insights metrics monitoring python
Last synced: 24 Jun 2025
https://github.com/aradfarahani/Geoelectricspy
This project presents an interactive 3D visualization of subsurface resistivity using geoelectric data. It utilizes Python and its scientific libraries to process and visualize the data across multiple depths, ranging from 1 meter to 464 meters.
data goelectrics visualization
Last synced: 28 Mar 2025
https://github.com/ikramagix/faussaire
Generate ultra-realistic fake data in French and Greek with credible, context-aware, and culturally relevant options.
data faker-generator france french french-language french-speaking french-translation gem rspec rspec-testing rspec-tests ruby ruby-gem ruby-on-rails rubygem rubygems testing yaml
Last synced: 10 Apr 2025
https://github.com/justintime50/dad-node
Dummy Address Data (DAD) - Retrieve real addresses from all around the world. (Node Client Library)
address addresses dad data dataset dummy dummy-data real retrieving-addresses world
Last synced: 30 Apr 2025
https://github.com/kovah/taboo-data
A data set for Taboo games. Plain JSON files which contain the keyword and some buzzwords like in the original Taboo game
data data-structures dataset datasets game game-resources taboo tabu
Last synced: 14 Apr 2025
https://github.com/caerbannogwhite/preludio
Preludio is a data wrangling language based on PRQL and written in Go. 🎭
csv data data-analysis data-cleaning data-engineering dplyr dsl go golang language manipulation pipeline programming-language prql sql stack-oriented wrangling
Last synced: 17 Jan 2026
https://github.com/simranjeet97/top-machine-learning-algorithms-python
This Repository contains the Machine Learning Algorithms with Mathematical Explanation behind them along with Implementation in Python.
data data-analysis data-science data-structures database machine machine-learning machine-learning-algorithms machine-learning-library machine-learning-playlist machinelearning machinelearning-python python python-programming python-script python3 youtube youtube-tutorial youtube-tutorial-series
Last synced: 11 Apr 2025
https://github.com/mzubairtahir/google-maps-scraper
Scraper that scrapes data from search results of google maps
business-data data data-scraping google-map-api google-map-scraper google-maps lead-generator python-scraper python-scraping python3 scraper scraping-websites
Last synced: 05 Apr 2025
https://github.com/tusharnankani/data-analysis-with-python
A complete introduction to data analysis covering the basics of Python, Numpy, Pandas, Data Visualization, and Exploratory Data Analysis.
data data-analysis data-visualization hacktoberfest jovian jovian-ml jupyter-notebook matplotlib numpy pandas python seaborn
Last synced: 07 May 2025
https://github.com/buccaneerai/rxjs-stats
Moved to @bottlenose/rxstats (https://github.com/buccaneerai/bottlenose)
analytics data data-mining data-science observables reactive rxjs statistics
Last synced: 15 Jul 2025
https://github.com/yagoluiz/meuremedio-extracao
[PT-BR] Extração de dados de preço de medicamentos disponibilizados pela ANVISA
Last synced: 15 Jul 2025
https://github.com/gamemann/xdp-access-last-byte
Repository to store information accessing the last byte of a packet in BPF and XDP.
bpf byte data express last lastbyte network network-programming networking packet path payload processing xdp
Last synced: 18 Mar 2025
https://github.com/lamm-mit/moleculediffusiontransformer
Molecular generation using diffusion models and autoregressive transformer models
ai chemistry data design dft generative modeling molecular-design quantum-mechanics
Last synced: 13 Apr 2025
https://github.com/ismet55555/logging-for-matlab
A flexible message logger for your MATLAB scripts and programs
best-practices class daq data logging logging-library matlab mfiles utility
Last synced: 20 Mar 2025
https://github.com/blackcipher101/spaceye
A tool to decode star spectrum images using OpenCV to make predictions about its charatestics.
data data-visualizations hacktoberfest opencv pysimplegui space
Last synced: 20 Mar 2025
https://github.com/keshiarose/toggl-web-data-connector
A Tableau Web Data Connector that pulls in data from the Detail Reports view from toggl.com
data tableau tableau-desktop toggl toggl-track wdc web-data-connector
Last synced: 24 Dec 2025
https://github.com/akoury/ml-helper
Python library with helpers to speed up and structure machine learning projects.
data data-visualization machine-learning ml python scikit-learn sklearn
Last synced: 24 Oct 2025
https://github.com/contextlab/data-wrangler
Wrangle messy numerical, image, and text data into consistent well-organized formats
data data-analysis data-science data-wrangling hugging-face image-data machine-learning nlp numpy pandas python scikit-learn
Last synced: 10 Apr 2025
https://github.com/cfnptr/pack
Runtime optimized multi-platform data packing library for realtime game resources loading
c c99 compression compressor container cpp cross-platform csharp data library multi-platform pack package packer packing resource resources runtime storage zstd
Last synced: 30 Oct 2025
https://github.com/karanpratapsingh/scale-etl
Partition, Transform, Load, and Search large CSV files
Last synced: 10 Jul 2025
https://github.com/vin-cento/fakesnake
Do you need to quickly generate a 🎲random dataset to work with? This is the 🔨tool for you! You can generate a quick list of random or quickly populate a 🏬database.
Last synced: 04 Nov 2025
https://github.com/hinto-janai/someday
Lock-free MVCC primitive
atomic concurrency data lock-free mvcc rust
Last synced: 18 Mar 2025