data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum
- Last updated: 2026-06-22 00:07:43 UTC
https://github.com/bovem/stock-tracker
An interactive data visualization application developed in Python
data data-analysis data-visualization iex-api plotly-dash python stock-data stock-tracker visualization
Last synced: 19 Sep 2025
https://github.com/cobertos/tld-data
Get yer TLD data here! Scraped straight from DNS, ICANN and IANA. Including branded gTLDs and whether or not there's registry restrictions.
data dataset domain gtld gtlds javascript tld
Last synced: 13 Apr 2025
https://github.com/Ingenjorsarbete-For-Klimatet/ifk-smhi
SMHI climate data client.
Last synced: 20 Jul 2025
https://github.com/eddienubes/validness
🟢 Your favourite library for validating incoming data in express.js.
data dto express expressjs http http-server nestjs nodejs server validation
Last synced: 28 Jun 2025
https://github.com/pratapvardhan/elections-india-2014
Results related to General Assembly (Lok Sabha) elections 2014 in India.
data elections india python web-scraping
Last synced: 13 Apr 2025
https://github.com/codepawl/loclean
An AI Data Cleaning Library
automated-cleaning data data-cleaning data-engineering data-preprocessing data-science data-wrangling etl llm normalization open-source polars privacy-preserving python semantic-analysis slm structured-data
Last synced: 04 Apr 2026
https://github.com/juliadatascience/juliadatascience-pt
Book on Julia for Data Science (Portuguese Edition)
book data data-manipulation data-science data-visualization julia julia-language
Last synced: 24 Jun 2025
https://github.com/arevi/mouse-data-visualizer
A visual playground for the WindMouse JavaScript library. Edit settings in real time and fine tune your mouse movements.
data javascript jsx mouse nodejs react typescript visualizer windmouse
Last synced: 22 Jun 2025
https://github.com/TRASAL/psrdada-python
Python bindings to the PSRDada ringbuffer implementation
astronomy data nlesc psrdada python ringbuffer
Last synced: 31 Mar 2025
https://github.com/govau/galileo
Quantifying interactions with government services to support delivery teams to improve their own products and services
analytics data data-science government observatory pandas python r shiny website
Last synced: 10 Jul 2025
https://github.com/kehvinbehvin/json-mcp-filter
JSON MCP server to filter only relevant data for your LLM
claude-mcp data data-extraction data-filtering json json-analysis json-filter json-mcp-server json-parser json-schema-inference json-to-typescript json-utilities large-files mcp mcp-server query type-generation
Last synced: 07 Sep 2025
https://github.com/jincheng9/python-tutorial
Python tutorial, quantitative trading, covering basic, intermediate, and advanced tutorials
data data-analysis-python data-analyst data-science django flask numpy pandas python quant quant-dev tutorial
Last synced: 07 May 2025
https://github.com/kroncrv/datasets
Datasets used for articles and stories made available on Pointer (www.pointer.nl)
csv data datasets excel structured-data
Last synced: 19 Jul 2025
https://github.com/equinor/fmu-dataio
FMU data standard and data export with rich metadata in the FMU context
data fmu jsonschema python subsurface sumo
Last synced: 19 Feb 2026
https://github.com/capire/xtravels
Travel booking app using master data from xflights
cap cds data federation flights reuse
Last synced: 23 Jan 2026
https://github.com/giscience/measures-rest
A REST server to provide measures for geospatial datasets
data dggs geospatial measure rest
Last synced: 10 Oct 2025
https://github.com/zengfr/arcade_game_romhacking_sourcecode_top_secret_data
arcade_game_romhacking_sourcecode_top_secret_data for mess sfc snes sega md geoneo data asm mame m68k m68000 cps1 capcom rom assember
68000 68k arcade asm asmem assember capcom cheat cheat-engine cps cps1 data game m68000 m68k mame rom romhacking sourcecode
Last synced: 14 Oct 2025
https://github.com/psyteachr/ads-v1
Applied Data Skills: Processing & Presenting Data
Last synced: 11 Oct 2025
https://github.com/correia-jpv/fucking-awesome-bigdata
A curated list of awesome big data frameworks, resources and other awesomeness. With repository stars⭐ and forks🍴
awesome awesome-list bigdata data data-analytics data-science data-stream data-visualization data-warehouse database distributed-database series-database stream-processing streaming-data visualize-data
Last synced: 27 Apr 2025
https://github.com/rqluo/mixtex-datahub
LaTeXDataHub is an open-source platform dedicated to the sharing and contribution of real-world LaTeX image datasets and their annotations. It allows users to upload, download, and contribute to a growing collection of high-quality LaTeX datasets.
data deep-learning latex machine-learning ocr
Last synced: 24 Oct 2025
https://github.com/neuroglia-io/framework
A collection of libraries to extend the .NET Framework
asp caching data eventing expression framework mapping mediation net serialization templating
Last synced: 15 Mar 2026
https://github.com/jgraving/deepposekit-data
Example datasets for DeepPoseKit
data deepposekit pose-estimation posture
Last synced: 05 Mar 2026
https://github.com/gibbs/currency-data
ISO 4217 currency data for consumption in CSV, JSON, PHP, XML and YAML
currencies currency data dataset
Last synced: 23 Apr 2025
https://github.com/scimusmn/earth-latest-data
Download latest wind data for the Earth global map.
conner-prairie data earth gis grib grib2json map
Last synced: 03 Feb 2026
https://github.com/hariharan-devarajan/vanidl
VaniDL is a tool for analyzing I/O patterns and behavior of Deep Learning applications.
ai analysis data deep-learning deep-neural-networks machine-learning profile storage tensorflow2
Last synced: 07 Aug 2025
https://github.com/zgbjgg/quetzal
Quetzal - Analytical web apps, fast, easy and real-time using Elixir. No Javascript required.
analytical data data-visualization elixir erlang plotly web-app
Last synced: 12 Apr 2025
https://github.com/codeforafrica/ckanext-openafrica
A CKAN extension to style and add features to the openAFRICA platform. Accessible at http://openafrica.net
africa ckan ckan-extension data open-data openafrica
Last synced: 16 Mar 2025
https://github.com/ethjs/ethjs-schema
The complete Ethereum RPC spec as a JSON object export.
data ethereum ethjs json rpc solidity specification web3
Last synced: 05 Oct 2025
https://github.com/benedekrozemberczki/hullcoverconditionedunitdiskgraph
A generator for unit disk graphs conditioned on concave hull cover.
data data-generator data-science data-visualization deep-learning fun funny graph graph-clustering graph-embedding graph-visualization hull-cover joke machine-learning network-visualization networkx node-embedding non-planar-graph synthetic unit-disk-graph
Last synced: 06 Jul 2025
https://github.com/fd0/split
Split large files into smaller ones using deterministic Content Defined Chunking
Last synced: 18 Aug 2025
https://github.com/lablnet/pakweather_scraper
A multi-threaded Pakistan Weather crawler written in JavaScript
crawler data mit-license open-source pakistan scraping weather weather-channel
Last synced: 22 Aug 2025
https://github.com/quantium-ai/patternity
Stock price prediction using deterministic algorithm inspired by LSTM, focusing on pattern recognition in historical data.
algorithm algotrading chart crypto data detection deterministic finance forecasting forex lstm pattern prediction price stock trading
Last synced: 11 Mar 2026
https://github.com/stefen-taime/etl-data-pipeline-rdbms-to-hdfs-using-airflow-apache-sqoop-spark-postgres-and-hive
This project aims to move the data from a Relational database system (RDBMS) to a Hadoop file system (HDFS)
airflow big-data data docker-compose etl-pipeline hdfs hive infrastructure-as-code rdbms spark sql sqoop
Last synced: 03 Jul 2025
https://github.com/gibbsbravo/datadelta
The best Python package for comparing two dataframes
analytics comparison data data-analytics database database-management databases dataops dataops-platform devops pandas pandas-dataframe testing testing-tools version-control
Last synced: 18 Aug 2025
https://github.com/dimitryzub/webscraping-py
Web Scraping scripts for all Google, other search engines, and other websites (currently outdated, something may not be working).
api bs4 data google-maps-api googleapi googlescraping googlesearchapi lxml parsel playwright python requests scraper scraping scrapy selenium webscraper webscraping webscraping-data webscraping-search
Last synced: 12 Aug 2025
https://github.com/octue/octue-sdk-python
The python SDK for @Octue services and digital twins.
data data-service data-service-development-kit data-services digital-twin digital-twin-application digital-twin-web digital-twins microservice microservices python python3 renewable-energy renewables sdk sdk-python wind-energy wind-energy-analytics
Last synced: 18 Aug 2025
https://github.com/colour-science/colour-science.org
https://www.colour-science.org
color color-science color-space color-spaces colorspace colorspaces colour colour-science colour-space colour-spaces colourspace colourspaces data dataset datasets python spectral-data spectral-dataset spectral-datasets website
Last synced: 21 Apr 2025
https://github.com/jonschlinkert/write-json
Write a JSON file to disk; also creates directories in the dest path if they don't already exist.
data disk file file-system fs json object write
Last synced: 09 Mar 2026
https://github.com/cpscript/termux-security
This software has a simple VPN using "OpenVPN" and a "Static generator" which aims to make your internet traffic less interesting to sell, keeping your personal data safe and secure.
android beta-testing data hack hacking network networking openvpn private static termux vpn vpn-server
Last synced: 28 Sep 2025
https://github.com/data-forge/data-forge-fs
This library contains the file system extensions to Data-Forge that allow it to directly read and write CSV and JSON files in Node.js
csv data data-analysis data-cleaning data-cleansing data-forge data-management data-manipulation data-munging data-visualization data-wrangling javascript json linq nodejs pandas visualization
Last synced: 04 Sep 2025
https://github.com/aflah02/easy-data-augmentation-implementation
My Implementation for the paper EDA: Easy Data Augmentation Techniques for Boosting Performance on Text Classification Tasks using Tensorflow
data deep-learning lstm nlp tensorflow2
Last synced: 09 Jul 2025
https://github.com/lens-vm/spec
LensVM specifications and ABI definition
abi data interoperability lenses schema transformations web-assembly
Last synced: 12 Jan 2026
https://github.com/wahyudesu/predicting-hotel-booking-cancellations
This project will help hotel managers optimize their booking policies, reduce cancellations, and improve revenue.
data data-analysis data-science python
Last synced: 07 Jul 2025
https://github.com/turbot/steampipe-export
Steampipe Export is a zero-ETL CLI to fetch data from cloud services and APIs. Hundreds of plugins with thousands of documented examples.
aws azure backup data devsecops etl gcp golang kubernetes security steampipe steampipe-engine zero-etl
Last synced: 31 Jul 2025
https://github.com/lukasmosser/oklahomaproductiondata
A repository of Oklahoma O&G production data formatted for machine learning.
data data-mining energy machine-learning
Last synced: 03 Aug 2025
https://github.com/chalk-ai/chalk-ts
Typescript client for working with Chalk
chalk data feature-engineering pipelines typescript
Last synced: 09 Mar 2026
https://github.com/danieljdufour/xdim
Multi-Dimensional Functions. Create, Query, and Transform Multi-Dimensional Data.
array binary data dimensions format formatter functions image javascript js layout math multidimensional ndarray rearrange reorganize reshape shape theory
Last synced: 13 Jun 2025
https://github.com/cipherstash/protectjs
Encrypt and protect data using industry standard algorithms, field level encryption, a unique data key per record, bulk encryption operations, and decryption level identity verification. Powered by CipherStash Encryption.
data data-security encryption javascript postgres postgresql security typescript
Last synced: 29 Oct 2025
https://github.com/casbin/confita
An open-source version of Kaggle written in Go and React
casbin casdoor conference data go javascript kaggle react
Last synced: 09 Aug 2025
https://github.com/isoverse/clumpedr
Clumped isotope data analysis in R
analysis clumped data isotope processing r stable stable-isotopes
Last synced: 19 Feb 2026
https://github.com/lifyzer/data-parser-system
🍎 Simple script that parses data from open source databases to the standard Lifyzer database structure 🍏
data data-parser databases food food-data health ingredients lifyzer nutrition parsed-data parser parses-data
Last synced: 09 Apr 2025
https://github.com/klaudiosinani/shtack
LIFO Stacks for ES6
data es6 lifo stack structure typescript
Last synced: 24 Apr 2025
https://github.com/dsdanielpark/arxiv2text
Converting PDF files to text, mainly with a focus on arXiv papers.
crawling data datamining translation
Last synced: 05 Sep 2025
https://github.com/nrennie/national-highways
R package for accessing the National Highways WebTRIS API via R.
Last synced: 14 Aug 2025
https://github.com/lchsk/sanchosql
SanchoSQL - Linux desktop PostgreSQL client
data database database-gui database-management desktop development editor linux linuxapps postgres postgresql sql
Last synced: 16 Aug 2025
https://github.com/klaudiosinani/binoheap
Binomial heaps for ES6
binomial data es6 heap structure typescript
Last synced: 12 Jun 2025
https://github.com/thomwright/balamb
🌱 Concurrently run a set of dependent, asynchronous tasks with type-safe dependencies
concurrent dag dags data data-seeding dependency-injection di seed seeding tasks
Last synced: 07 May 2025
https://github.com/mara/mara-mondrian
A python integration for the Saiku ad hoc analysis tool
adhoc-analysis data mara mondrian mondrian-olap-engine python reporting saiku
Last synced: 30 Apr 2025
https://github.com/colour-science/colour-mitsuba
Various resources for Mitsuba 3
color color-science color-space color-spaces colorspace colorspaces colour colour-science colour-space colour-spaces colourspace colourspaces data dataset datasets mitsuba spectral-data spectral-dataset spectral-datasets
Last synced: 21 Apr 2025
https://github.com/abhay557/fakedata
The fakedata package generates realistic synthetic user profiles for machine learning, deep learning, data analysis, and data science workflows.
abhay557 anime data data-analysis data-science deep-learning fake fake-data generator joke machine-learning mock mock-data
Last synced: 30 May 2026
https://github.com/flor91/data-structures-and-algorithms
Theory and Implementation of Data Structures and Algorithms using Python
algorith data data-structures python
Last synced: 19 Apr 2025
https://github.com/AurelienAubry/Spotlight
Spotlight is a Spotify dashboard that allows users to visualize their listening habits.
backend bootstrap chartjs data data-analysis data-science data-visualization flask frontend javascript js pandas python python3 react react-bootstrap spotify
Last synced: 15 Apr 2025
https://github.com/datadesk/calfire-wildfires
Download wildfires data from CalFire
cli data data-journalism geojson journalism news python wildfires
Last synced: 05 Jan 2026
https://github.com/mdlincoln/europop
Historical Populations of European Cities, 1500-1800
Last synced: 21 Feb 2026
https://github.com/edoardottt/postgressql-db
Easy implementation of some PostgreSQL databases for practicing conceptual analysis of requirements, design of relational databases, and SQL queries
data database pgplsql pgsql plsql postgres postgresql postgresql-database rdbm rdbms relational-databases sql
Last synced: 27 Oct 2025
https://github.com/thearyadev/top500-aggregator
A suite of tools and a web service to collect and provide data on the Overwatch 2 Top 500 leaderboards.
Last synced: 16 Jan 2026
https://github.com/michalporeba/odis
Search in decentralised systems. Search federation, result moderation, aggregation and feedback with hypermedia in ReSTful API to round it all off.
data data-discovery discoverability federated information-discovery mesh-networks search
Last synced: 18 Jan 2026
https://github.com/MohammedSardar/Bive
Bive is a Kurdish profanity language processing project.
data dataanalysis kurdish kurdish-corpus kurdish-dataset kurdish-language-processing kurdishdata kurdishnlp
Last synced: 07 May 2025
https://github.com/globalgov/manydata
The portal for global governance data
Last synced: 16 Apr 2025
https://github.com/reycn/data-analytics-in-julia
Notebooks for data analysis in social science using Julia, replicating frequent analytical steps in Python & R.
data data-analysis data-science data-visualization julia
Last synced: 07 May 2025
https://github.com/scribe-org/scribe-server
Backend service for Scribe data downloads
api autosuggest backend data data-downloader data-pipeline dictionary education elt emoji go golang grammar language learning open-source translation wikidata wikipedia
Last synced: 30 Oct 2025
https://github.com/gagniuc/prototype-software-for-photon-pixel-coupling
Photon-pixel coupling is a novel method that allows a parallel sampling of an unlimited number of sensors. In the case shown here, 200 sensors are sampled in parallel at video rate frequency. This implementation is done in Visual Basic 6.0 (VB6).
biosensors coupling curent data electronics led photon-pixel sampling sensors skin vb6 voltage webcam
Last synced: 08 Mar 2026
https://github.com/purarue/discord_data
Library to parse messages/activity from the discord data export
Last synced: 18 Mar 2025
https://github.com/enkidevs/driveql
1. Sync your files from Google Drive. 2. Access them with an automatically generated API.
Last synced: 12 Apr 2025
https://github.com/iamhosseindhv/lstm-classification
Comment toxicity classification using Keras/TensorFlow
classification cnn data data-mining keras lstm machine-learning python rnn tensorflow
Last synced: 08 May 2025
https://github.com/moumen-soliman/hashed-device-fingerprint-js
A lightweight JavaScript/TypeScript package that generates device-specific hashed fingerprints for devices in both browser and server environments.
data device expressjs fingerprint fingerprinting javascript nodejs sha256 sha256-hash typescript
Last synced: 13 Apr 2025
https://github.com/devscast/cd-data
Important background data for the creation of a solution for the DRC
congo congo-kinshasa data data-science json rdata rdc rdc-data
Last synced: 06 Apr 2025
https://github.com/malcolmgreaves/avro-codegen
Scala code generator for Avro schemas.
avro avro-schema codegen data scala serialization
Last synced: 07 May 2025
https://github.com/tuttlepower/predictit_markets
Simple Python code that helps to retrieve PredictIt market data.
data political-science politics predictit predictit-api pypi pypi-package python
Last synced: 08 Apr 2026
https://github.com/chuongmep/ifc-to-excel
Convert Metadata From IFC To Excel
autodesk big-data data ifc ifc-excel ifc-viewer
Last synced: 30 Apr 2025