data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-02-03 00:08:04 UTC
- JSON Representation
https://github.com/synthesized-io/insight
🧿 Metrics & Monitoring of Datasets
data data-analysis data-science framework insights metrics monitoring python
Last synced: 24 Jun 2025
https://github.com/yagoluiz/meuremedio-extracao
[PT-BR] Extração de dados de preço de medicamentos disponibilizados pela ANVISA
Last synced: 15 Jul 2025
https://github.com/dawidolko/data-bases
Tasks studies - laboratory
bases courses data documentation lab labs projects
Last synced: 15 Jul 2025
https://github.com/simranjeet97/top-machine-learning-algorithms-python
This Repository contains the Machine Learning Algorithms with Mathematical Explanation behind them along with Implementation in Python.
data data-analysis data-science data-structures database machine machine-learning machine-learning-algorithms machine-learning-library machine-learning-playlist machinelearning machinelearning-python python python-programming python-script python3 youtube youtube-tutorial youtube-tutorial-series
Last synced: 11 Apr 2025
https://github.com/alemidev/dashboard
my custom data collector and visualizer dashboard
dashboard data egui rust timeseries
Last synced: 29 Oct 2025
https://github.com/justintime50/dad-node
Dummy Address Data (DAD) - Retrieve real addresses from all around the world. (Node Client Library)
address addresses dad data dataset dummy dummy-data real retrieving-addresses world
Last synced: 30 Apr 2025
https://github.com/dpguthrie/bankfind
Python interface to the FDIC's API for publically available bank data
api api-wrapper banking data finance pandas python united-states
Last synced: 02 Aug 2025
https://github.com/emreyalvac/sulfur
Shaping, Processing, and Transforming Data with the Power of Sulfur with Rust
data data-analysis data-flow database
Last synced: 19 Aug 2025
https://github.com/durgeshsamariya/100daysofdatascience
A 100 Day DS Challenge to learn and implement DS concepts ranging from the beginner of Data Science to Data Scientist.
100days 100daysofcode 100daysofdscode 100daysofmlcode data data-science
Last synced: 15 Apr 2025
https://github.com/fwd/reddit
Graph Visualization UI for Reddit.
data data-science datasets worldnews
Last synced: 24 Apr 2025
https://github.com/domaindrivenarchitecture/data-test
framework for data-driven tests
clojure data test test-driven-development
Last synced: 13 Aug 2025
https://github.com/kishyassin/goframe
goframe is a Go package inspired by Python's pandas, designed for data manipulation and analysis.
data dataframe go goframe golang good-first-issue package
Last synced: 04 Oct 2025
https://github.com/mitevpi/urban-insights-frontend
Winning AEC Hackathon 2019 Silicon Valley Project. AR/VR Application for visualizing proposed buildings on their sites and overlaying environmental and zoning analysis.
a-frame analysis ar architecture city computation data design rhino smart ui vr vue vuejs
Last synced: 08 Aug 2025
https://github.com/prioritizr/aoh
Create Area of Habitat Data
conservation data ecology gis-data rstats rstats-package
Last synced: 15 Apr 2025
https://github.com/judehunter/reactivefile
Parse and reactively auto-save JSON, TOML, YAML and any other data file with ease.
data data-structures file filesystem javascript reactive reactive-programming typescript typescript-definitions
Last synced: 15 Apr 2025
https://github.com/mgroves/sqlservertocouchbase
Library to automatically best effort move and remodel data from relational databases (like SQL Server) to Couchbase
couchbase data json migration sql sql-server tables
Last synced: 26 Sep 2025
https://github.com/epsoft/explainable
explainable
data database dataset explainable numpy seaborn tensorflow
Last synced: 29 Jul 2025
https://github.com/caerbannogwhite/preludio
Preludio is a data wrangling language based on PRQL and written in Go. 🎭
csv data data-analysis data-cleaning data-engineering dplyr dsl go golang language manipulation pipeline programming-language prql sql stack-oriented wrangling
Last synced: 17 Jan 2026
https://github.com/kovah/taboo-data
A data set for Taboo games. Plain JSON files which contain the keyword and some buzzwords like in the original Taboo game
data data-structures dataset datasets game game-resources taboo tabu
Last synced: 14 Apr 2025
https://github.com/dimitryzub/webscraping-py
Web Scraping scripts for all Google, other search engines, and other websites (currently outdated, something may not be working).
api bs4 data google-maps-api googleapi googlescraping googlesearchapi lxml parsel playwright python requests scraper scraping scrapy selenium webscraper webscraping webscraping-data webscraping-search
Last synced: 12 Aug 2025
https://github.com/vin-cento/fakesnake
Do you need to quickly generate a 🎲random dataset to work with? This is the 🔨tool for you! You can generate a quick list of random or quickly populate a 🏬database.
Last synced: 04 Nov 2025
https://github.com/eddienubes/validness
🟢 Your favourite library for validating incoming data in express.js.
data dto express expressjs http http-server nestjs nodejs server validation
Last synced: 28 Jun 2025
https://github.com/fd0/split
Split large files into smaller ones using deterministic Content Defined Chunking
Last synced: 18 Aug 2025
https://github.com/gibbsbravo/datadelta
The best Python package for comparing two dataframes
analytics comparison data data-analytics database database-management databases dataops dataops-platform devops pandas pandas-dataframe testing testing-tools version-control
Last synced: 18 Aug 2025
https://github.com/bovem/stock-tracker
An interactive data visualization application developed in Python
data data-analysis data-visualization iex-api plotly-dash python stock-data stock-tracker visualization
Last synced: 19 Sep 2025
https://github.com/hariharan-devarajan/vanidl
VaniDL is an tool for analyzing I/O patterns and behavior with Deep Learning Applications.
ai analysis data deep-learning deep-neural-networks machine-learning profile storage tensorflow2
Last synced: 07 Aug 2025
https://github.com/stefen-taime/etl-data-pipeline-rdbms-to-hdfs-using-airflow-apache-sqoop-spark-postgres-and-hive
This project aims to move the data from a Relational database system (RDBMS) to a Hadoop file system (HDFS)
airflow big-data data docker-compose etl-pipeline hdfs hive infrastructure-as-code rdbms spark sql sqoop
Last synced: 03 Jul 2025
https://github.com/lablnet/pakweather_scraper
A multi-threaded Pakistan Weather crawler written in JavaScript
crawler data mit-license open-source pakistan scraping weather weather-channel
Last synced: 22 Aug 2025
https://github.com/ethjs/ethjs-schema
The complete Ethereum RPC spec as a JSON object export.
data ethereum ethjs json rpc solidity specification web3
Last synced: 05 Oct 2025
https://github.com/colour-science/colour-science.org
https://www.colour-science.org
color color-science color-space color-spaces colorspace colorspaces colour colour-science colour-space colour-spaces colourspace colourspaces data dataset datasets python spectral-data spectral-dataset spectral-datasets website
Last synced: 21 Apr 2025
https://github.com/gibbs/currency-data
ISO 4217 currency data for consumption in CSV, JSON, PHP, XML and YAML
currencies currency data dataset
Last synced: 23 Apr 2025
https://github.com/correia-jpv/fucking-awesome-bigdata
A curated list of awesome big data frameworks, resources and other awesomeness. With repository stars⭐ and forks🍴
awesome awesome-list bigdata data data-analytics data-science data-stream data-visualization data-warehouse database distributed-database series-database stream-processing streaming-data visualize-data
Last synced: 27 Apr 2025
https://github.com/aflah02/easy-data-augmentation-implementation
My Implementation for the paper EDA: Easy Data Augmentation Techniques for Boosting Performance on Text Classification Tasks using Tensorflow
data deep-learning lstm nlp tensorflow2
Last synced: 09 Jul 2025
https://github.com/equinor/fmu-dataio
FMU data standard and data export with rich metadata in the FMU context
data fmu jsonschema python subsurface sumo
Last synced: 07 Jan 2026
https://github.com/zengfr/arcade_game_romhacking_sourcecode_top_secret_data
arcade_game_romhacking_sourcecode_top_secret_datafor mess sfc snes sega md geoneo data asm mame m68k m68000 cps1 capcom rom assember
68000 68k arcade asm asmem assember capcom cheat cheat-engine cps cps1 data game m68000 m68k mame rom romhacking sourcecode
Last synced: 14 Oct 2025
https://github.com/aoemods/attrib
Age of Empires 4 attribute dump, keeping track of patch changes. Converted with AOEMods.Essence. Join our discord on the website link.
age-of-empires-iv aoe4 data information mods stats
Last synced: 25 Oct 2025
https://github.com/cpscript/termux-security
This software has a simple VPN using "OpenVPN" and a "Static generator" which aims to make your internet traffic less interesting to sell, keeping your personal data safe and secure.
android beta-testing data hack hacking network networking openvpn private static termux vpn vpn-server
Last synced: 28 Sep 2025
https://github.com/psyteachr/ads-v1
Applied Data Skills: Processing & Presenting Data
Last synced: 11 Oct 2025
https://github.com/capire/xtravels
Travel booking app using master data from xflights
cap cds data federation flights reuse
Last synced: 23 Jan 2026
https://github.com/data-forge/data-forge-fs
This library contains the file system extensions to Data-Forge that allow it to directly read and write CSV and JSON files in Node.js
csv data data-analysis data-cleaning data-cleansing data-forge data-management data-manipulation data-munging data-visualization data-wrangling javascript json linq nodejs pandas visualization
Last synced: 04 Sep 2025
https://github.com/giscience/measures-rest
A REST server to provide measures for geospatial datasets
data dggs geospatial measure rest
Last synced: 10 Oct 2025
https://github.com/octue/octue-sdk-python
The python SDK for @Octue services and digital twins.
data data-service data-service-development-kit data-services digital-twin digital-twin-application digital-twin-web digital-twins microservice microservices python python3 renewable-energy renewables sdk sdk-python wind-energy wind-energy-analytics
Last synced: 18 Aug 2025
https://github.com/zgbjgg/quetzal
Quetzal - Analytical web apps, fast, easy and real-time using Elixir. No Javascript required.
analytical data data-visualization elixir erlang plotly web-app
Last synced: 12 Apr 2025
https://github.com/codeforafrica/ckanext-openafrica
A CKAN extension to style and add features to the openAFRICA platform. Accessible at http://openafrica.net
africa ckan ckan-extension data open-data openafrica
Last synced: 16 Mar 2025
https://github.com/rqluo/mixtex-datahub
LaTeXDataHub is an open-source platform dedicated to the sharing and contribution of real-world LaTeX image datasets and their annotations, allows users to upload, download, and contribute to a growing collection of high-quality LaTeX datasets.
data deep-learning latex machine-learning ocr
Last synced: 24 Oct 2025
https://github.com/benedekrozemberczki/hullcoverconditionedunitdiskgraph
A generator for unit disk graphs conditioned on concave hull cover.
data data-generator data-science data-visualization deep-learning fun funny graph graph-clustering graph-embedding graph-visualization hull-cover joke machine-learning network-visualization networkx node-embedding non-planar-graph synthetic unit-disk-graph
Last synced: 06 Jul 2025
https://github.com/hinto-janai/someday
Lock-free MVCC primitive
atomic concurrency data lock-free mvcc rust
Last synced: 18 Mar 2025
https://github.com/neuroglia-io/framework
A collection of libraries to extend the .NET Framework
asp caching data eventing expression framework mapping mediation net serialization templating
Last synced: 13 Apr 2025
https://github.com/jincheng9/python-tutorial
Python tutorial,量化交易,涵盖基础、中级和高级教程
data data-analysis-python data-analyst data-science django flask numpy pandas python quant quant-dev tutorial
Last synced: 07 May 2025
https://github.com/karanpratapsingh/scale-etl
Partition, Transform, Load, and Search large CSV files
Last synced: 10 Jul 2025
https://github.com/Ingenjorsarbete-For-Klimatet/ifk-smhi
SMHI climate data client.
Last synced: 20 Jul 2025
https://github.com/contextlab/data-wrangler
Wrangle messy numerical, image, and text data into consistent well-organized formats
data data-analysis data-science data-wrangling hugging-face image-data machine-learning nlp numpy pandas python scikit-learn
Last synced: 10 Apr 2025
https://github.com/govau/galileo
Quantifying interactions with government services to support delivery teams to improve their own products and services
analytics data data-science government observatory pandas python r shiny website
Last synced: 10 Jul 2025
https://github.com/cfnptr/pack
Runtime optimized multi-platform data packing library for realtime game resources loading
c c99 compression compressor container cpp cross-platform csharp data library multi-platform pack package packer packing resource resources runtime storage zstd
Last synced: 30 Oct 2025
https://github.com/juliadatascience/juliadatascience-pt
Book on Julia for Data Science (Portuguese Edition)
book data data-manipulation data-science data-visualization julia julia-language
Last synced: 24 Jun 2025
https://github.com/keshiarose/toggl-web-data-connector
A Tableau Web Data Connector that pulls in data from the Detail Reports view from toggl.com
data tableau tableau-desktop toggl toggl-track wdc web-data-connector
Last synced: 24 Dec 2025
https://github.com/cobertos/tld-data
Get yer TLD data here! Scraped straight from DNS, ICANN and IANA. Including branded gTLDs and whether or not there's registry restrictions.
data dataset domain gtld gtlds javascript tld
Last synced: 13 Apr 2025
https://github.com/kroncrv/datasets
Datasets used for articles and stories made available on Pointer (www.pointer.nl)
csv data datasets excel structured-data
Last synced: 19 Jul 2025
https://github.com/jonschlinkert/write-json
Write a JSON to file disk, also creates directories in the dest path if they don't already exist.
data disk file file-system fs json object write
Last synced: 07 May 2025
https://github.com/akoury/ml-helper
Python library with helpers to speed up and structure machine learning projects.
data data-visualization machine-learning ml python scikit-learn sklearn
Last synced: 24 Oct 2025
https://github.com/arevi/mouse-data-visualizer
A visual playground for the WindMouse JavaScript library. Edit settings in real time and fine tune your mouse movements.
data javascript jsx mouse nodejs react typescript visualizer windmouse
Last synced: 22 Jun 2025
https://github.com/kehvinbehvin/json-mcp-filter
JSON MCP server to filter only relevant data for your LLM
claude-mcp data data-extraction data-filtering json json-analysis json-filter json-mcp-server json-parser json-schema-inference json-to-typescript json-utilities large-files mcp mcp-server query type-generation
Last synced: 07 Sep 2025
https://github.com/TRASAL/psrdada-python
Python bindings to the PSRDada ringbuffer implementation
astronomy data nlesc psrdada python ringbuffer
Last synced: 31 Mar 2025
https://github.com/pratapvardhan/elections-india-2014
Results related to General Assembly (Lok Sabha) elections 2014 in India.
data elections india python web-scraping
Last synced: 13 Apr 2025
https://github.com/waylonwalker/steel-toes
a kedro hook to protect against breaking changes to data
cli data kedro kedro-hook kedro-plugin python
Last synced: 05 May 2025
https://github.com/mohammadreza-mohammadi94/data-analysis-projects-with-pandas
A repository featuring practical data analysis projects using Pandas, demonstrating data manipulation, visualization, and real-world problem-solving techniques. Ideal for learning and applying Pandas for data analysis.
data data-science jupyter-notebook pandas
Last synced: 05 May 2025
https://github.com/reycn/data-analytics-in-julia
Notebooks for data analysis in social science using Julia, replicating frequent analytical steps in Python & R.
data data-analysis data-science data-visualization julia
Last synced: 07 May 2025
https://github.com/nobrainr/axios-morphism
Axios plugin to transform data requests/responses based on a schema.
api axios client data http interceptor javascript network nodejs request response transform typescript
Last synced: 10 Jul 2025
https://github.com/malcolmgreaves/avro-codegen
Scala code generator for Avro schemas.
avro avro-schema codegen data scala serialization
Last synced: 07 May 2025
https://github.com/chuongmep/ifc-to-excel
Convert Metadata From IFC To Excel
autodesk big-data data ifc ifc-excel ifc-viewer
Last synced: 30 Apr 2025
https://github.com/purarue/discord_data
Library to parse messages/activity from the discord data export
Last synced: 18 Mar 2025
https://github.com/scribe-org/scribe-server
Backend service for Scribe data downloads
api autosuggest backend data data-downloader data-pipeline dictionary education elt emoji go golang grammar language learning open-source translation wikidata wikipedia
Last synced: 30 Oct 2025
https://github.com/enkidevs/driveql
1. Sync your files from Google Drive. 2. access them with an automatically generated API
Last synced: 12 Apr 2025
https://github.com/OpenCourseAPI/OwlAPI
An open source REST API written in Python to scrape and serve Foothill / De Anza course data :ledger:
api course data de-anza foothill myportal owl-api webscraping
Last synced: 13 Jul 2025
https://github.com/gagniuc/prototype-software-for-photon-pixel-coupling
Photon-pixel coupling is a novel method that allows a parallel sampling of an unlimited number of sensors. In the case shown here, 200 sensors are sampled in parallel at video rate frequency. This implementation is done in Visual Basic 6.0 (VB6).
biosensors coupling curent data electronics led photon-pixel sampling sensors skin vb6 voltage webcam
Last synced: 04 Mar 2025
https://github.com/iamhosseindhv/lstm-classification
Comment toxicity classification using Karas/TensorFlow
classification cnn data data-mining keras lstm machine-learning python rnn tensorflow
Last synced: 08 May 2025
https://github.com/castelao/directip
Iridium SBD Direct-IP (satellite communication)
communication-systems data remote-sensors satellite
Last synced: 11 Jul 2025
https://github.com/moumen-soliman/hashed-device-fingerprint-js
A lightweight JavaScript/TypeScript package that generates device-specific hashed fingerprints for devices in both browser and server environments.
data device expressjs fingerprint fingerprinting javascript nodejs sha256 sha256-hash typescript
Last synced: 13 Apr 2025
https://github.com/lchsk/sanchosql
SanchoSQL - Linux desktop PostgreSQL client
data database database-gui database-management desktop development editor linux linuxapps postgres postgresql sql
Last synced: 16 Aug 2025
https://github.com/mewmix/gh_llm_loader
clone GitHub repositories and prepare their data for ingestion for LLMs.
context data data-structures github llm llm-training python
Last synced: 19 Sep 2025