data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum
- Last updated: 2026-06-23 00:07:41 UTC
https://github.com/mongoexpuser/synthetic-drilling-data-app-for-sqlite-ml
Generate synthetic drilling data that can be used for testing machine learning (ML) models.
classification data drilling events inference machine-learning ml-models node-sqlite3 nodejs prediction python sqlite3 training
Last synced: 08 Apr 2026
https://github.com/dataopstix/modelt
Modelt (mow·delt) is a modern data integration solution that connects data to data for advanced analytics.
airbyte airflow airflow-docker data data-analysis data-visualization database dbt elt etl etl-automation metabase metadata modern modern-dev modernization
Last synced: 28 Mar 2025
https://github.com/shahiakhilesh1304/dsa
This repository is my personal space for practicing Data Structures and Algorithms. I'll be adding questions, solutions, and explanations as I progress, covering topics from basics to advanced.
algorithms code contribution contributions-and-collaboration contributions-welcome data data-structures data-structures-and-algorithms dsa dsa-algorithm dsa-learning-series dsa-practice hacktober hacktoberfest hacktoberfest-accepted hacktoberfest-starter practice programming python-3
Last synced: 13 Apr 2025
https://github.com/cherylisabella/statistics--caret
Training Regression and Classification Models using caret
data data-analysis data-mining data-science datascience dataset r statistics
Last synced: 24 Jun 2025
https://github.com/fiedsch/datamanagement
Data management helpers (PHP-CLI)
csv-data data datamanagement helper php
Last synced: 05 Apr 2025
https://github.com/sharmadhiraj/free-json-datasets
Collection of free JSON data that are scraped and parsed from different websites.
collection crawler data data-scraping datasets json sports statistics web-scraping
Last synced: 28 Mar 2025
https://github.com/alexandregazagnes/scikit-res
A very basic package to store results of ML models. Grid search results are hard to exploit; this package aims to store them in a more convenient way.
data machine-learning mlops mlops-workflow results scikit-learn
Last synced: 20 Jan 2026
https://github.com/philhawksworth/netlify-plugin-trello-lists
A plugin that fetches the JSON data of a public Trello board and stashes the data for each list in a JSON file before your build runs, making the data available to your static site generator at build time.
api data eleventy netlify plugin trello
Last synced: 20 Jan 2026
https://github.com/lemmotresto/migrational
A data migration library
data java migration versioning
Last synced: 30 Oct 2025
https://github.com/robjg/dido
Data In/Data Out in many formats
csv-parser data etl java json-parser
Last synced: 11 Jan 2026
https://github.com/writetome51/big-dataset-paginator
A TypeScript/JavaScript class for pagination in a real-world web app.
app data javascript pagination paginator typescript
Last synced: 17 May 2026
https://github.com/justjavac/deno_open_data
data deno deno-mod deno-module denomod
Last synced: 17 May 2026
https://github.com/max-tonny8/android_web3
An Android library for calling data from a node on the Ethereum or Solana chain.
android blockchain coroutines coroutines-android data eth-call ethereum kotlin ktx retrofit rpc smart-contracts solana web3 web3j
Last synced: 27 Mar 2025
https://github.com/wfamous/fiv_update-data
This project automates the retrieval, processing, and publishing of digital product data for our Shopify store. It integrates Google Cloud Platform (GCP), Amazon Web Services (AWS), Terraform (Tofu), Python, Bash, Ansible, and GitHub Actions to manage data pipelines efficiently.
ansible aws bash data data-analysis data-science devops gcp python pythonpackage shopify terraform tofu
Last synced: 17 Feb 2026
https://github.com/joisino/twinpaper
Code for "Twin Papers: A Simple Framework of Causal Inference for Citations via Coupling" (CIKM 2022)
causal-inference data research science-of-science
Last synced: 21 Mar 2025
https://github.com/eliot-akira/png-compressor
Compress and encode data as a PNG image
Last synced: 17 Mar 2025
https://github.com/jasondrawdy/compendio
Collection of common and noteworthy extension methods, security tools, and filesystem functions generally found in most applications; focusing on extensibility and portability.
compendium converters cryptography data extensions generators hashing library security utilities validation windows
Last synced: 18 May 2026
https://github.com/nowosad/cllc
Country-level Land Cover - categories and transitions
data dataset land-cover land-cover-transitions r
Last synced: 04 Apr 2025
https://github.com/kale-ko/ejcl
An advanced configuration library for Java with support for local files, mysql databases, and more.
config configs configuration configuration-files data java java-serialization json mariadb mysql parser serialization serializer smile yaml
Last synced: 02 Feb 2026
https://github.com/randomfractals/unfolded-map-renderer
Unfolded Map 🗺️ Notebook 📓 Renderer for VSCode.
cell data flat-data geo geo-location geo-spatial map notebook output renderer unfolded view vscode
Last synced: 21 Mar 2025
https://github.com/tbille/github-stars-datastudio-connector
GitHub Stars connector for datastudio
data data-visualization datastudio github github-stars
Last synced: 18 May 2026
https://github.com/meetyildiz/pandazip
Reduce RAM footprint of Pandas DataFrame without losing information.
compression data data-mining data-science machine-learning pandas utilities
Last synced: 20 Jan 2026
https://github.com/gabrielu3/ori-inverted-list
Inverted list made for a college course named Organization and Retrieval of Information
c data data-structures index inverted-index list
Last synced: 24 Feb 2026
https://github.com/jorgeatgu/clau
📊 Responsive, reusable charts with d3
charts d3js d3v5 data data-visualization data-viz graphs
Last synced: 12 Sep 2025
https://github.com/kabeech/real-dice
Random number generation based on physical media touched by humans
data dice haskell human-computer-interaction physical random random-generation random-number-generators rng
Last synced: 10 Apr 2025
https://github.com/stdlib-js/ndarray-base-buffer-ctors
ndarray data buffer constructors.
array base buffer constructor constructors ctor ctors data dtype dtypes javascript multidimensional ndarray node node-js nodejs stdlib types utilities utility
Last synced: 06 Apr 2025
https://github.com/karlinarayberinger/karbytes_23andme_ancestry_data
This repository contains web page source code and media files which constitute (some of) the intellectual property encapsulated by the website named Karbytes For Life Blog dot WordPress dot Com.
2023 23andme ancestry blog data dna genome karbytes website wordpress
Last synced: 12 May 2025
https://github.com/anonympins/data-primals-engine
Manage and automate your data at scale 🚀 With data-primals-engine you get workflows, dashboards, alerts, i18n, client integration & AI assistant — all open-source, all MongoDB powered.
api automation data data-analysis data-engineer data-visualization database expressjs low-code mongodb nodejs rest-api
Last synced: 07 Mar 2026
https://github.com/chickennungets/ifsc-data-analysis
An implementation of Elo-MMR ranking in the boulder and lead disciplines.
d3 data elo elo-rating ifsc visualization webscraping
Last synced: 20 Jan 2026
https://github.com/smithsonian/massdigi-tools
Scripts that support Mass Digitization projects by the Digitization Program Office, OCIO
data data-validation digital-files digitization digitization-workflows museums
Last synced: 13 Apr 2025
https://github.com/stdlib-js/array-typed-integer-ctors
Integer-valued typed array constructors.
array constructor constructors ctor ctors data dtype dtypes javascript node node-js nodejs stdlib structure type typed typed-array types utilities
Last synced: 12 Jan 2026
https://github.com/Ekey/ER.DATA.Tool
Tool for extracting data archives from the mobile game Earth Revival (Project Arrival)
data earth-revival idx project-arrival
Last synced: 19 May 2026
https://github.com/ange007/jquery.mydata
jQuery.myData - Small jQuery & Zepto plugin for two-way data binding.
data data-binding jquery jquery-plugin zepto zepto-plugin zeptojs
Last synced: 19 May 2026
https://github.com/exaluc/webhookcatcher
Catch your webhooks like a dream
api catcher data webhook webhook-callbacks webhooks-catcher
Last synced: 14 Apr 2025
https://github.com/kodie/migrate-acf-field-data-to-repeater
A WordPress plugin that migrates field metadata for ACF fields that have been moved inside of a repeater
acf acf-field acf-fields advance-custom-field data data-migration data-migration-tool wordpress wordpress-plugin
Last synced: 19 May 2026
https://github.com/mzazakeith/puppetmaster
Puppeteer & Crawl4AI microservice for web automation, scraping, and AI processing with Bull queues
agent ai automation bull bullmq chrome crawl4ai crawler data data-extraction extraction gemini llm llms openai playwright puppeteer web-automation
Last synced: 13 May 2025
https://github.com/toviszsolt/stormflow
StormFlow - A Lightweight Data Modeling and Storage Library
data database datamanagement datastore db document jsondatabase memorydatabase model mongodb mongoose nodb nosql query schema stormflow
Last synced: 13 May 2025
https://github.com/stdlib-js/array-bool
BooleanArray.
array binary bool boolean booleanarray data javascript mask node node-js nodejs stdlib structure typed typed-array types
Last synced: 13 May 2025
https://github.com/longzheng/southeastwater-usage-scraper
Extract hourly water usage data from South East Water portal website for digital water meters
australia data iot playwright southeastwater victoria water
Last synced: 06 Feb 2026
https://github.com/nazar-pc/fixed-size-multiplexer
A tiny library for multiplexing data chunks into blocks of fixed size and vice versa
chunk data demultiplex demux fixed multiplex mux size
Last synced: 31 Oct 2025
https://github.com/ugurcanerdogan/cross-validation-with-imbalanced-dataset
BBM467*SDSP - Small Data Science Project - Things to consider in cross-validation and resampling when dealing with imbalanced data: what is the right way?
bbm467 cross-validation data data-science kfold-cross-validation logistic-regression machine-learning oversampling sdsp smote
Last synced: 21 Jun 2025
https://github.com/minightdev/paperclip
Paperclip is a powerful privacy-focused data breach search engine that empowers users to swiftly and securely investigate breaches using email addresses and phone numbers. Our robust search engine delivers real-time results while prioritizing the privacy and security of user queries.
beaches data database pwn pwned search-engine
Last synced: 22 Mar 2025
https://github.com/quantalabs/climate
Graphs temperatures of the US, Caribbean Nations, and the world from 1967, 1910, and 1880 to 2020, respectively.
caribbean caribbean-temp caribbean-temperature carribean-climate climate climate-change data data-visualization global global-climate global-temp global-temperature global-warming graphing-with-python graphs us us-climate us-temp us-temperature world
Last synced: 29 Mar 2025
https://github.com/horizom/dto
Data Transfer Objects for all PHP applications.
Last synced: 14 Sep 2025
https://github.com/emrecpp/datapacket-cpp
Send, receive, encrypt, decrypt, and compress data as a packet and send it over a socket, for C++.
compress data deserialization deserialize deserializer encrypt packet recv send serialization serialize serializer socket storage
Last synced: 28 Mar 2025
https://github.com/gavinr/minor-league-baseball
Data set of minor league baseball teams
baseball data hacktoberfest maps open-data open-datasets sports
Last synced: 06 Apr 2025
https://github.com/automators-com/tweaked
Fine tune your data
ai data nextjs rust synthetic tailwindcss tauri
Last synced: 08 Apr 2026
https://github.com/coatless-rpkg/ucimlrepo
An unofficial R port of the Python package for downloading data from the UCI ML repository
data data-science machine-learning rstats statistics uci-machine-learning uci-machine-learning-repository web-api
Last synced: 28 Jun 2025
https://github.com/rawsashimi1604/jobextract
Scrapes LinkedIn data. Conducts sentiment analysis on what traits and qualifications employers are looking for.
data data-analysis data-analytics data-cleaning linkedin mvc python webscraper
Last synced: 06 Nov 2025
https://github.com/tuananh/opentravel
✈ A collection of travel related data
Last synced: 09 Oct 2025
https://github.com/richardschoen/ileaccess
Save file packaging of IBM i libraries ILEASTIC, NOXDB and ILEUSION for convenient installation. ILEastic allows HTTP microservices to be created on IBM i. NOXDB makes SQL, JSON and XML easy for IBM i. ILEusion packages it all up to make IBM i data and program call services easier.
as400 call cl command data database db2 http https ibmi ile ileastic ileusion microservice noxdb program queue rpg sql xmlservice
Last synced: 24 Jul 2025
https://github.com/d2hydro/hydrodashboards
Open Source Dashboards for hydro data
bokeh data geopandas hydrology hydrometrics pandas sheetjs
Last synced: 26 Jul 2025
https://github.com/weisscharlesj/data_scicompforchem
Zipped data for SciCompforChem book for easy download
chemistry chemistry-education data data-visualization python
Last synced: 07 Nov 2025
https://github.com/quetz-al/quetzal-client
Python client for the Quetzal API
client data data-science openapi-client openapi3 python quetzal
Last synced: 28 Jul 2025
https://github.com/kocyigitkim/realtime.io
Real time data streaming & socket programming library
data realtime socket streaming
Last synced: 29 Jul 2025
https://github.com/windwalker-io/data
[READ ONLY] A library containing data/collection objects with the null-object pattern.
collection collections data data-object iterator nullobject value-object
Last synced: 12 Mar 2026
https://github.com/hdk101/credentials-validator
A quick way to validate credentials on the server side
backend credentials data email frontend javascript login node npm npm-install password register server-side
Last synced: 21 Sep 2025
https://github.com/mo-karbalaee/introduction-to-data-science-sbu
Reports and full documentation of the introduction to data science course held at SBU
data data-science python shahid-beheshti-university
Last synced: 02 Aug 2025
https://github.com/ctechhindi/auto-fill-form-data
Auto-fill and autocomplete user data with key name
autocomplete chrome-extension data extension
Last synced: 17 Apr 2026
https://github.com/sadcenter/messenger
Data messaging system between servers using popular messaging brokers
Last synced: 06 Aug 2025
https://github.com/yash22222/data-analysis-with-python
This repository provides a practical introduction to data acquisition and analysis using Pandas. It covers loading datasets, exploring data, manipulating data, and gaining insights through statistical summaries. Ideal for beginners, it offers code examples and explanations to enhance your data manipulation skills using Pandas for Python.
binning data data-acquisition data-analysis data-binning data-cleaning data-formatting data-integration data-normalization data-preprocessing data-science data-transformation data-wrangling dataframe description numpy pandas pandas-dataframe python python3
Last synced: 09 Apr 2026
https://github.com/cgossain/genericresultscontroller
A generic NSFetchedResultsController replacement for iOS, written in Swift.
api client connector controller coredata data database fetch firebase firebase-firestore firebase-realtime-database generic ios mongodb nsfetchedresultscontroller results source swift-generics tableview ui
Last synced: 19 Feb 2026
https://github.com/rulox/faker
A Go library to create Fake Data for your projects
data dummy dummy-data fake fake-data faker go golang
Last synced: 28 May 2026
https://github.com/jaldekoa/nyfedapi
A Python wrapper to easily retrieve data from the Federal Reserve Bank of New York (FRBoNY) official API in pandas format.
api api-wrapper banking data finance pandas python united-states
Last synced: 08 Feb 2026
https://github.com/erictleung/erictleung.github.io
📝 Source code for my website, portfolio of projects, and more
bioinformatics blog data data-analysis data-science github-jekyll github-page jekyll lanyon open-science open-source software-engineering
Last synced: 21 Jan 2026
https://github.com/thomas-nyanumba/r-programming-air-pollution_disease-project
Personal R Programming Project
aggregate-functions boxplot-visualization data dpylr ggplot2 leftjoin linear-regression patchwork powerquery r readxl scatter-plot tidyr visualization
Last synced: 25 Mar 2025
https://github.com/yanpitangui/iteminfoconverter
Application that converts ragnarok legacy data files to iteminfo.lua
data itemdbconf iteminfo luafiles ragnarok
Last synced: 12 Oct 2025
https://github.com/nononoexe/setariaviridis
🌾 Field-collected data of green foxtail
data data-science dataset rpackage
Last synced: 27 Feb 2026
https://github.com/danielbayley/schemas
A collection of useful @JSON-schema-org schemas for data validation.
ajv config configuration data data-science data-structures data-validation json json-schema linter linting schema schema-org validation yaml yaml-configuration
Last synced: 13 Oct 2025
https://github.com/eby8zevin/android-pos4122020
The Next Project . . .
android android-app android-application android-database android-studio androidstudio create data database database-sqlite delete point-of-sale pos read search sqlite update
Last synced: 13 Oct 2025
https://github.com/kaos599/apollo-synthetic-data-generator
Apollo is a Python GUI application designed to simplify the complex process of generating random data based on fixed values. It allows users to generate various types of binary datasets, such as Yes/No type questions, by specifying probabilities.
data data-engineering data-generation data-generator data-science faker-library machine-learning tkinter-gui
Last synced: 22 Jul 2025
https://github.com/bkamapantula/india-pc-nfhs4
Parliamentary constituency factsheet for indicators of nutrition, health, and development in India using NFHS4 data.
data government health india nfhs nfhs4
Last synced: 19 Mar 2026
https://github.com/countervolts/apple-music-stats-calculator
how to get your most streamed songs/artists
apple apple-music applemusic calculator data
Last synced: 11 Feb 2026
https://github.com/saleh0987/mohamed_saleh
That's my personal website where I show my skills and projects.
aos-animation axios boot data json nextjs portfolio portfolio-website projects react-icons reactjs sass swiper
Last synced: 09 Mar 2026
https://github.com/datahub-local/datahub-local
DataHub.local is a powerful data platform designed for edge devices, enabling seamless analytics and insights at home
data data-engineering devops kubernetes raspberrypi
Last synced: 21 Jan 2026
https://github.com/bredalis/kpopnews
A place to see kpop news 📝
backend css data feedparser flask frameworks frontend html jinja2 kpop mongodb mongodb-atlas news newsletter os pages pymongo python requests web
Last synced: 12 Feb 2026
https://github.com/chaitanyac22/hr_policy_query_resolution_with_retrieval_augmented_generation_rag
This repository contains an HR Policy Query Resolution system using Retrieval-Augmented Generation (RAG). It leverages a 4-bit quantized Mistral-7B-Instruct-v0.2 LLM and JP Morgan Chase’s publicly available Code of Conduct documents to generate accurate, contextually relevant responses for HR policy queries.
artificial-intelligence data hr large-language-models llm mistral-7b nlp pipeline prompt-engineering quantization rag retrieval-augmented-generation
Last synced: 12 Feb 2026
https://github.com/stdlib-js/datasets-anscombes-quartet
Anscombe's quartet.
anscombe anscombes-quartet data dataset datasets javascript node node-js nodejs quartet sample statistics stats stdlib
Last synced: 13 Oct 2025
https://github.com/achraf-oujjir/chatgpt-users-tweets-pipeline
🐦🔵 End-to-end ChatGPT Users' Tweets Data Pipeline with Python 🐍, Hive 🐝, and Power BI 📊
bash-script cloudera data data-engineering data-vizualisation datawarehouse hdfs hive networking powerbi python sentiment-analysis sftp shell tweepy twitter-api ubuntu virtualization vmware-workstation
Last synced: 28 Feb 2026
https://github.com/colour-science/colour-hdri-examples-datasets
Colour - HDRI - Examples Datasets
color color-science color-space color-spaces colorspace colorspaces colour colour-science colour-space colour-spaces colourspace colourspaces data dataset datasets hdr hdri raw tone-mapping tonemapping
Last synced: 19 Mar 2026
https://github.com/colour-science/colour-demosaicing-tests-datasets
Colour - Demosaicing - Tests Datasets
color color-science color-space color-spaces colorspace colorspaces colour colour-science colour-space colour-spaces colourspace colourspaces data dataset datasets de-mosaicing debayering demosaicing demosaicking raw
Last synced: 19 Mar 2026