data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum
- Last updated: 2026-06-23 00:07:41 UTC
https://github.com/ugurcanerdogan/cross-validation-with-imbalanced-dataset
BBM467*SDSP - Small Data Science Project - Things to consider in cross-validation and resampling when dealing with imbalanced data: what is the right way?
bbm467 cross-validation data data-science kfold-cross-validation logistic-regression machine-learning oversampling sdsp smote
Last synced: 21 Jun 2025
https://github.com/sheweny/discord-resolve
This module groups together functions to retrieve data from different types of arguments.
data discord discord-js mentions resolver sheweny utility
Last synced: 29 Oct 2025
https://github.com/schbenedikt/datamining
Heise (https://heise.de) News Crawler
data data-science heise postgresql web-crawler
Last synced: 10 Apr 2025
https://github.com/coatless-rpkg/ucimlrepo
An unofficial R port of the Python package to download data off of the UCI ML repository
data data-science machine-learning rstats statistics uci-machine-learning uci-machine-learning-repository web-api
Last synced: 28 Jun 2025
https://github.com/nikoshet/exploratory-data-analysis-using-r
Exploratory Data Analysis using R Course Project for M.Sc. 'Data Science and Machine Learning' in NTUA
data data-analysis data-science eda exploratory-data-analysis ggplot2 r
Last synced: 14 May 2026
https://github.com/eosdis-nasa/earthdata-pub-dashboard
Front-end Dashboard for Earthdata Pub
data earthdata edpub publication
Last synced: 15 Jan 2026
https://github.com/nazar-pc/fixed-size-multiplexer
A tiny library for multiplexing data chunks into blocks of fixed size and vice versa
chunk data demultiplex demux fixed multiplex mux size
Last synced: 31 Oct 2025
https://github.com/mongoexpuser/synthetic-drilling-data-app-for-sqlite-ml
Generate synthetic drilling data that can be used for testing machine learning (ML) models.
classification data drilling events inference machine-learning ml-models node-sqlite3 nodejs prediction python sqlite3 training
Last synced: 08 Apr 2026
https://github.com/defano/chicago-oasis
A visualization of Chicago business accessibility by neighborhood or census tract.
census chicago data data-science javascript neighborhood
Last synced: 11 Mar 2026
https://github.com/ahmedkhalf/arabic-keyword-scraper
Stop wasting your time! Obtain Arabic definitions without having to look them up.
arabic data definitions scraper sentences wordsearch
Last synced: 12 Mar 2025
https://github.com/hmeleiro/alquilermad
Housing rent map in the Comunidad de Madrid
data data-science data-visualization datascience housing-location-visualization rent renting
Last synced: 13 Sep 2025
https://github.com/richardschoen/ileaccess
Save file packaging of the IBM i libraries ILEastic, NOXDB and ILEusion for convenient installation. ILEastic allows HTTP microservices to be created on IBM i. NOXDB makes SQL, JSON and XML easy on IBM i. ILEusion packages it all up to make IBM i data and program call services easier.
as400 call cl command data database db2 http https ibmi ile ileastic ileusion microservice noxdb program queue rpg sql xmlservice
Last synced: 24 Jul 2025
https://github.com/zarr-developers/cookiecutter-zarr-store
Cookiecutter for Zarr store implementations
chunked data n-dimensional zarr
Last synced: 16 Jun 2025
https://github.com/dataopstix/modelt
Modelt (mow·delt) is a modern data integration solution that connects data to data for advanced analytics.
airbyte airflow airflow-docker data data-analysis data-visualization database dbt elt etl etl-automation metabase metadata modern modern-dev modernization
Last synced: 28 Mar 2025
https://github.com/longzheng/southeastwater-usage-scraper
Extract hourly water usage data from South East Water portal website for digital water meters
australia data iot playwright southeastwater victoria water
Last synced: 06 Feb 2026
https://github.com/eliot-akira/png-compressor
Compress and encode data as PNG image
Last synced: 17 Mar 2025
https://github.com/marlenezw/speech-to-text
Turn any video or audio recording into a written transcript using python
data data-science python speech speech-recognition speech-synthesis speech-to-text
Last synced: 27 Apr 2026
https://github.com/dhruvldrp9/simpledht
A Python-based Distributed Hash Table (DHT) implementation enabling cross-network key-value storage, automatic node discovery, and data replication with a simple CLI and library interface.
cross-network-node-communation data data-replication data-synchronization dht dht-python distributed-hash-table key-value-storage nat netowork node-discovery peer-to-peer peer-to-peer-network python sha-256 simple udp udp-socket-communication
Last synced: 28 Feb 2026
https://github.com/joisino/twinpaper
Code for "Twin Papers: A Simple Framework of Causal Inference for Citations via Coupling" (CIKM 2022)
causal-inference data research science-of-science
Last synced: 21 Mar 2025
https://github.com/johntocci/nullaxe
Nullaxe is a powerful and user-friendly Python library designed for cleaning and preprocessing data. It works seamlessly with both pandas and polars DataFrames, making it a versatile tool for data scientists and developers.
data data-analysis data-science datacleaning pandas polars python
Last synced: 06 Apr 2026
https://github.com/ciscorn/tinybufr
A Rust library for decoding BUFR meteorological observation data format
bufr data meteorology rust weather wmo
Last synced: 11 Jan 2026
https://github.com/utrechtuniversity/dataprivacysurvey
Code for analysing data from the Data Privacy Survey (2022)
data gdpr open-science privacy rdm research research-data-management survey utrecht-university
Last synced: 16 Jun 2025
https://github.com/memair/apps
App Store for Memair
apps appstore data data-science quantified-self
Last synced: 06 Apr 2026
https://github.com/andrei-vataselu/data-science-snippets
🧰 Essential EDA and Data Cleaning Helpers for Any DataFrame. This collection of functions is designed to accelerate exploratory data analysis (EDA), quickly surface data quality issues, and offer high-level insights into the structure and content of your dataset.
artificial-intelligence data data-science eda feature-engineering hyperparamater-tunning library loading model-evaluation modeling preprocessing python snippets text-processing time-series visualization
Last synced: 10 Mar 2026
https://github.com/radekbednarik/data_generator
Random data generator using Python. Generate data files with random strings, floats, ints, and dates via the console or TOML files.
csv data generator python python3 random test-data-generator
Last synced: 13 Dec 2025
https://github.com/kocyigitkim/realtime.io
Real time data streaming & socket programming library
data realtime socket streaming
Last synced: 29 Jul 2025
https://github.com/shahiakhilesh1304/dsa
This repository is my personal space for practicing Data Structures and Algorithms. I'll be adding questions, solutions, and explanations as I progress, covering topics from basics to advanced.
algorithms code contribution contributions-and-collaboration contributions-welcome data data-structures data-structures-and-algorithms dsa dsa-algorithm dsa-learning-series dsa-practice hacktober hacktoberfest hacktoberfest-accepted hacktoberfest-starter practice programming python-3
Last synced: 13 Apr 2025
https://github.com/windwalker-io/data
[READ ONLY] A library containing data/collection objects with the null-object pattern.
collection collections data data-object iterator nullobject value-object
Last synced: 12 Mar 2026
https://github.com/antvis/create-antv-demo
A simple CV-dashboard framework for practicing how to use AntV.
antv cv dashboard data resume resume-template resume-website visualization
Last synced: 09 Apr 2025
https://github.com/hdk101/credentials-validator
A quick way to validate credentials on the server side
backend credentials data email frontend javascript login node npm npm-install password register server-side
Last synced: 21 Sep 2025
https://github.com/stdlib-js/array-bool
BooleanArray.
array binary bool boolean booleanarray data javascript mask node node-js nodejs stdlib structure typed typed-array types
Last synced: 13 May 2025
https://github.com/shysolocup/aepl
A Node.js multi-layered class creation package with built-in parenting systems that let you get info from classes above, better function and property makers for easier-to-read development, and modding support inspired by Roblox's Studio API.
aepl backend classes data framework game-development game-framework javascript js js-class js-framework lightweight nodejs package
Last synced: 28 Oct 2025
https://github.com/satyam4229/college-predictor-system
The college predictor system is a Python-based application that utilizes a machine learning model to predict colleges and their corresponding degree programs and branches based on a student's JEE (Joint Entrance Examination) score.
data data-science jupyter-notebook kaggle prediction python
Last synced: 06 Apr 2026
https://github.com/danlsn/causality
A Personal Data Platform and the culmination of years of curiosity and learning in the Data Engineering space.
data data-engineering datawarehousing personal-data quantified-self
Last synced: 06 Mar 2026
https://github.com/felixklauke/atomizer
Playing around with butter knife, android bindings and rx java.
binding butterknife data java react rx rxjava
Last synced: 15 May 2026
https://github.com/toviszsolt/stormflow
StormFlow - A Lightweight Data Modeling and Storage Library
data database datamanagement datastore db document jsondatabase memorydatabase model mongodb mongoose nodb nosql query schema stormflow
Last synced: 13 May 2025
https://github.com/mzazakeith/puppetmaster
Puppeteer & Crawl4AI microservice for web automation, scraping, and AI processing with Bull queues
agent ai automation bull bullmq chrome crawl4ai crawler data data-extraction extraction gemini llm llms openai playwright puppeteer web-automation
Last synced: 13 May 2025
https://github.com/kodie/migrate-acf-field-data-to-repeater
A WordPress plugin that migrates field metadata for ACF fields that have been moved inside of a repeater
acf acf-field acf-fields advance-custom-field data data-migration data-migration-tool wordpress wordpress-plugin
Last synced: 19 May 2026
https://github.com/cherylisabella/statistics--caret
Training Regression and Classification Models using caret
data data-analysis data-mining data-science datascience dataset r statistics
Last synced: 24 Jun 2025
https://github.com/heikomuller/histore
Library for maintaining snapshots of evolving tabular data sets
Last synced: 10 Apr 2025
https://github.com/stimulsoft/samples-dashboards.js-for-react
JavaScript samples for Dashboards.JS data analysis tool for React applications
analyzer chart components constructor dashboard dashboards data designer export expression javascript js library parser react react-dashboard reactjs relation text viewer
Last synced: 09 Aug 2025
https://github.com/frefrik/covid19norge-data
🦠 COVID-19 Datasets for Norway
covid covid-19 covid19 covid19-data csv data datasets norge norway norwegian smittestopp vaccine
Last synced: 09 Apr 2026
https://github.com/woctezuma/download-steam-banners-data
Data consisting of Steam banners.
Last synced: 06 Jan 2026
https://github.com/polina-prokofieva/viewjson
A class for convenient visualization of JSON with some settings.
data data-visualization es5 es6 javascript json
Last synced: 15 May 2026
https://github.com/simoneas02/data-science
🐍 A planning study to become a data scientist and to improve my current skills. 🤘🏼🌻
data data-analysis data-science data-visualization deep-learning machine-learning pandas python3 r sql
Last synced: 12 Apr 2026
https://github.com/mmaithani/loan-approvel-ml-model-with-insights
This project approves or rejects loan applications. A public API, data insights, and predictive models for the loan prediction project are also provided.
data data-science loan-prediction-analysis machine-learning visualization
Last synced: 16 Aug 2025
https://github.com/fiedsch/datamanagement
Data management helpers (PHP-CLI)
csv-data data datamanagement helper php
Last synced: 05 Apr 2025
https://github.com/kabeech/real-dice
Random number generation based on physical media touched by humans
data dice haskell human-computer-interaction physical random random-generation random-number-generators rng
Last synced: 10 Apr 2025
https://github.com/jorgeatgu/clau
📊 Responsive and reusable charts with d3
charts d3js d3v5 data data-visualization data-viz graphs
Last synced: 12 Sep 2025
https://github.com/stdlib-js/ndarray-base-buffer-ctors
ndarray data buffer constructors.
array base buffer constructor constructors ctor ctors data dtype dtypes javascript multidimensional ndarray node node-js nodejs stdlib types utilities utility
Last synced: 06 Apr 2025
https://github.com/sharmadhiraj/free-json-datasets
Collection of free JSON data that are scraped and parsed from different websites.
collection crawler data data-scraping datasets json sports statistics web-scraping
Last synced: 28 Mar 2025
https://github.com/gabrielu3/ori-inverted-list
Inverted list made for a college course named Organization and Retrieval of Information
c data data-structures index inverted-index list
Last synced: 24 Feb 2026
https://github.com/alexandregazagnes/scikit-res
Very basic package to store results of ML models. Grid search results are hard to exploit; this package aims to store them in a more convenient way.
data machine-learning mlops mlops-workflow results scikit-learn
Last synced: 20 Jan 2026
https://github.com/meetyildiz/pandazip
Reduce RAM footprint of Pandas DataFrame without losing information.
compression data data-mining data-science machine-learning pandas utilities
Last synced: 20 Jan 2026
https://github.com/philhawksworth/netlify-plugin-trello-lists
A plugin that fetches the JSON data of a public Trello board and stashes the data for each list in a JSON file before your build runs, making the data available to your static site generator at build time.
api data eleventy netlify plugin trello
Last synced: 20 Jan 2026
https://github.com/karlinarayberinger/karbytes_23andme_ancestry_data
This repository contains web page source code and media files which constitute (some of) the intellectual property encapsulated by the website named Karbytes For Life Blog dot WordPress dot Com.
2023 23andme ancestry blog data dna genome karbytes website wordpress
Last synced: 12 May 2025
https://github.com/lemmotresto/migrational
A data migration library
data java migration versioning
Last synced: 30 Oct 2025
https://github.com/tbille/github-stars-datastudio-connector
GitHub Stars connector for datastudio
data data-visualization datastudio github github-stars
Last synced: 18 May 2026
https://github.com/robjg/dido
Data In/Data Out in many formats
csv-parser data etl java json-parser
Last synced: 11 Jan 2026
https://github.com/randomfractals/unfolded-map-renderer
Unfolded Map 🗺️ Notebook 📓 Renderer for VSCode.
cell data flat-data geo geo-location geo-spatial map notebook output renderer unfolded view vscode
Last synced: 21 Mar 2025
https://github.com/kale-ko/ejcl
An advanced configuration library for Java with support for local files, mysql databases, and more.
config configs configuration configuration-files data java java-serialization json mariadb mysql parser serialization serializer smile yaml
Last synced: 02 Feb 2026
https://github.com/writetome51/big-dataset-paginator
A TypeScript/JavaScript class for pagination in a real-world web app.
app data javascript pagination paginator typescript
Last synced: 17 May 2026
https://github.com/justjavac/deno_open_data
data deno deno-mod deno-module denomod
Last synced: 17 May 2026
https://github.com/max-tonny8/android_web3
An Android library for calling data from a node on the Ethereum or Solana chain
android blockchain coroutines coroutines-android data eth-call ethereum kotlin ktx retrofit rpc smart-contracts solana web3 web3j
Last synced: 27 Mar 2025
https://github.com/godeltech/godeltech.data.entityframeworkcore
Library to access database with Unit of Work, Repository and Entity classes for Entity Framework Core.
data entity entity-framework-core repository unitofwork
Last synced: 30 Apr 2025
https://github.com/exaluc/webhookcatcher
Catch your webhooks like a dream
api catcher data webhook webhook-callbacks webhooks-catcher
Last synced: 14 Apr 2025
https://github.com/ange007/jquery.mydata
jQuery.myData - Small jQuery&Zepto plugin for two-ways data binding.
data data-binding jquery jquery-plugin zepto zepto-plugin zeptojs
Last synced: 19 May 2026
https://github.com/anonympins/data-primals-engine
Manage and automate your data at scale 🚀 With data-primals-engine you get workflows, dashboards, alerts, i18n, client integration & AI assistant — all open-source, all MongoDB powered.
api automation data data-analysis data-engineer data-visualization database expressjs low-code mongodb nodejs rest-api
Last synced: 07 Mar 2026
https://github.com/chickennungets/ifsc-data-analysis
An implementation of Elo-MMR ranking in the boulder and lead disciplines.
d3 data elo elo-rating ifsc visualization webscraping
Last synced: 20 Jan 2026
https://github.com/Ekey/ER.DATA.Tool
Tool for extracting data archives from the mobile game Earth Revival (Project Arrival)
data earth-revival idx project-arrival
Last synced: 19 May 2026
https://github.com/stdlib-js/array-typed-integer-ctors
Integer-valued typed array constructors.
array constructor constructors ctor ctors data dtype dtypes javascript node node-js nodejs stdlib structure type typed typed-array types utilities
Last synced: 12 Jan 2026
https://github.com/smithsonian/massdigi-tools
Scripts that support Mass Digitization projects by the Digitization Program Office, OCIO
data data-validation digital-files digitization digitization-workflows museums
Last synced: 13 Apr 2025
https://github.com/nowosad/cllc
Country-level Land Cover - categories and transitions
data dataset land-cover land-cover-transitions r
Last synced: 04 Apr 2025
https://github.com/wfamous/fiv_update-data
This project automates the retrieval, processing, and publishing of digital product data for our Shopify store. It integrates Google Cloud Platform (GCP), Amazon Web Service (AWS), Terraform (Tofu), Python, Bash, Ansible and GitHub Actions to manage data pipelines efficiently.
ansible aws bash data data-analysis data-science devops gcp python pythonpackage shopify terraform tofu
Last synced: 17 Feb 2026
https://github.com/sermetpekin/evdscpp
evdscpp is a C++ library for fast, efficient, and user-friendly interaction with the EVDS API Server. Designed with performance in mind, it provides built-in caching, an Excel export option, and an intuitive user interface for configuring and retrieving data. evdscpp can be extended for integration with other C++ projects and offers options for use
cbrt central-bank cpp data edds evds evds-api evdscpp tcmb tcmb-api
Last synced: 07 Sep 2025
https://github.com/maskedsyntax/covid-tracker
Qt app to keep track of Covid-19 records of different countries.
coronavirus coronavirus-tracking covid-19 data parsing scraping scraping-websites tracker web-scraping
Last synced: 29 Mar 2025
https://github.com/pjmagee/starwars-data
A Star Wars Web app with Charts and entire Timeline events!
aspire blazor blazor-webassembly data database dataset docker dotnet json starwars starwars-data starwars-fandom
Last synced: 07 Mar 2026
https://github.com/yash22222/data-analysis-with-python
This repository provides a practical introduction to data acquisition and analysis using Pandas. It covers loading datasets, exploring data, manipulating data, and gaining insights through statistical summaries. Ideal for beginners, it offers code examples and explanations to enhance your data manipulation skills using Pandas for Python.
binning data data-acquisition data-analysis data-binning data-cleaning data-formatting data-integration data-normalization data-preprocessing data-science data-transformation data-wrangling dataframe description numpy pandas pandas-dataframe python python3
Last synced: 09 Apr 2026
https://github.com/inspect-js/is-data-view
Is this value a JS DataView? This module works cross-realm/iframe, does not depend on instanceof or mutable properties, and despite ES6 Symbol.toStringTag.
data dataview ecmascript javascript typedarray typedarrays view
Last synced: 05 Apr 2025
https://github.com/marek-jakub/monitoring
A university project concerning field data management for bird ringers.
bird data fieldwork management ringing
Last synced: 24 Jun 2026
https://github.com/wklee610/datapush
MySQL Data Generator
automatic data data-generator database dataset db dynamic generator mysql test testing-tools
Last synced: 25 Jan 2026
https://github.com/reppon97/cryptosnake
Simple, unofficial python wrapper for Binance API. You'll find this easy-to-use package helpful if you're interested in general market data and cryptocurrency values. You don't need to have a Binance account or API Key since you can't purchase/trade cryptocurrencies using this package.
api binance bitcoin crypto cryptocurrencies cryptocurrency cryptocurrency-exchanges cryptography data ethereum json litecoin python python3 statistics
Last synced: 05 Mar 2025
https://github.com/laurensius/covid-19-info
data datatabel grafik peta visualisasi
Last synced: 21 May 2026
https://github.com/masesgroup/datadistributionmanager
A reliable subsystem to distribute data across multiple datacenters using multiple languages (C/C++, .NET, JVM enabled languages) over multiple technologies (e.g. Apache Kafka, OpenDDS, etc)
apachekafka availability business-solutions businesscontinuity data durability opendds reliability softwareneverlockdown
Last synced: 14 Apr 2025
https://github.com/d2hydro/hydrodashboards
Open Source Dashboards for hydro data
bokeh data geopandas hydrology hydrometrics pandas sheetjs
Last synced: 26 Jul 2025