data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-06-29 00:07:49 UTC
- JSON Representation
https://github.com/nop-dev/learning-js
Esse repositório contem todas as anotações que fiz enquanto estudava um módulo da trilha Explorer da Rocketseat sobre JavaScript. 🔰
data data-structures functions javascript js
Last synced: 17 Apr 2026
https://github.com/jesusgraterol/bitcoin-blockchain-dataset-builder
The dataset builder script extracts all the relevant block information from the Bitcoin Blockchain through Mempool.space's public API. The data is stored in a .csv file, facilitating its use in data science and machine learning projects.
bitcoin blockchain blockchain-technology data datascience datascience-machinelearning dataset dataset-generation machine-learning
Last synced: 06 May 2026
https://github.com/bradlindblad/quotableoffice
Repo for the quotable office R Shiny app
data datascience golem-apps r shiny shiny-apps text text-mining
Last synced: 26 May 2026
https://github.com/csadorf/pydata-ann-arbor-2018
Slides and notebooks demonstrating signac for PyData Ann Arbor Meetup 2018
data data-management jupyter signac workflow
Last synced: 04 Jun 2026
https://github.com/norton120/dfmock
Python Pandas DataFrame mock generator. You need mock'd data in a dataframe? this is what you need.
data mock pandas pandas-dataframe python python37
Last synced: 19 Jan 2026
https://github.com/audeering/emodb
Publishes Berlin Database of Emotional Speech with audb
Last synced: 19 Oct 2025
https://github.com/doriclaudino/canarinho_nlp
labels, classify, summarization string for canarinho app
chrome-console classification classifier-model data labels nlp nlu python spacy spacy-models spacy-nlp summarization-string
Last synced: 08 May 2026
https://github.com/machu-gwu/constant2-project
provide extensive way of managing your constant variable.
configuration constants data developer-tools python
Last synced: 26 May 2026
https://github.com/stdlib-js/array-int32
Int32Array.
array data int int32 int32array integer javascript long node node-js nodejs signed stdlib structure typed typed-array types
Last synced: 27 May 2026
https://github.com/stdlib-js/array-typed-integer-ctors
Integer-valued typed array constructors.
array constructor constructors ctor ctors data dtype dtypes javascript node node-js nodejs stdlib structure type typed typed-array types utilities
Last synced: 12 Jan 2026
https://github.com/johnmackintosh/simd2016_tmap
Mapping SIMD with tmap - static & interactive
data data-science data-visualization mapping r visualisation
Last synced: 20 Mar 2025
https://github.com/dhruvldrp9/simpledht
A Python-based Distributed Hash Table (DHT) implementation enabling cross-network key-value storage, automatic node discovery, and data replication with a simple CLI and library interface.
cross-network-node-communation data data-replication data-synchronization dht dht-python distributed-hash-table key-value-storage nat netowork node-discovery peer-to-peer peer-to-peer-network python sha-256 simple udp udp-socket-communication
Last synced: 28 Feb 2026
https://github.com/Ekey/ER.DATA.Tool
Tool for extract data archives from mobile game Earth Revival (Project Arrival)
data earth-revival idx project-arrival
Last synced: 19 May 2026
https://github.com/andrewjbateman/mevn-stack-data
:clipboard: MEVN Info & Full stack MEVN app with CRUD functions
data database express expressjs full-stack info mevn mevn-stack middleware mongodb mongodb-atlas nodejs typescript vue vue3 vue3-typescript
Last synced: 07 Apr 2026
https://github.com/ange007/jquery.mydata
jQuery.myData - Small jQuery&Zepto plugin for two-ways data binding.
data data-binding jquery jquery-plugin zepto zepto-plugin zeptojs
Last synced: 19 May 2026
https://github.com/exaluc/webhookcatcher
Catch your webhooks like a dream
api catcher data webhook webhook-callbacks webhooks-catcher
Last synced: 14 Apr 2025
https://github.com/johntocci/nullaxe
Nullaxe is a powerful and user-friendly Python library designed for cleaning and preprocessing data. It works seamlessly with both pandas and polars DataFrames, making it a versatile tool for data scientists and developers.
data data-analysis data-science datacleaning pandas polars python
Last synced: 06 Apr 2026
https://github.com/joisino/twinpaper
Code for "Twin Papers: A Simple Framework of Causal Inference for Citations via Coupling" (CIKM 2022)
causal-inference data research science-of-science
Last synced: 21 Mar 2025
https://github.com/cheminfo/cheminfo-types
chemistry data hacktoberfest schema typescript
Last synced: 03 Apr 2026
https://github.com/ciscorn/tinybufr
A Rust library for decoding BUFR meteorological observation data format
bufr data meteorology rust weather wmo
Last synced: 11 Jan 2026
https://github.com/utrechtuniversity/dataprivacysurvey
Code for analysing data from the Data Privacy Survey (2022)
data gdpr open-science privacy rdm research research-data-management survey utrecht-university
Last synced: 16 Jun 2025
https://github.com/eliot-akira/png-compressor
Compress and encode data as PNG image
Last synced: 17 Mar 2025
https://github.com/memair/apps
App Store for Memair
apps appstore data data-science quantified-self
Last synced: 06 Apr 2026
https://github.com/andrei-vataselu/data-science-snippets
🧰 Essential EDA and Data Cleaning Helpers for Any DataFrame This collection of functions is designed to accelerate exploratory data analysis (EDA), quickly surface data quality issues, and offer high-level insights into the structure and content of your dataset.
artificial-intelligence data data-science eda feature-engineering hyperparamater-tunning library loading model-evaluation modeling preprocessing python snippets text-processing time-series visualization
Last synced: 10 Mar 2026
https://github.com/jasondrawdy/compendio
Collection of common and noteworthy extension methods, security tools, and filesystem functions generally found in most applications; focusing on extensibility and portability.
compendium converters cryptography data extensions generators hashing library security utilities validation windows
Last synced: 18 May 2026
https://github.com/radekbednarik/data_generator
Random data generator using Python. Generate data files with random string, floats, ints, dates via console or TOML files..
csv data generator python python3 random test-data-generator
Last synced: 13 Dec 2025
https://github.com/kodie/migrate-acf-field-data-to-repeater
A WordPress plugin that migrates field metadata for ACF fields that have been moved inside of a repeater
acf acf-field acf-fields advance-custom-field data data-migration data-migration-tool wordpress wordpress-plugin
Last synced: 19 May 2026
https://github.com/mzazakeith/puppetmaster
Puppeteer & Crawl4AI microservice for web automation, scraping, and AI processing with Bull queues
agent ai automation bull bullmq chrome crawl4ai crawler data data-extraction extraction gemini llm llms openai playwright puppeteer web-automation
Last synced: 13 May 2025
https://github.com/toviszsolt/stormflow
StromFlow - A Lightweight Data Modeling and Storage Library
data database datamanagement datastore db document jsondatabase memorydatabase model mongodb mongoose nodb nosql query schema stormflow
Last synced: 13 May 2025
https://github.com/stdlib-js/array-bool
BooleanArray.
array binary bool boolean booleanarray data javascript mask node node-js nodejs stdlib structure typed typed-array types
Last synced: 13 May 2025
https://github.com/antvis/create-antv-demo
A simple CV-dashboard framework for practicing how to use AntV.
antv cv dashboard data resume resume-template resume-website visualization
Last synced: 09 Apr 2025
https://github.com/victoorv/breast_cancer
Mammographic images classification.
breast-cancer breast-cancer-classification classification cnn cnn-classification convnext convnext-tiny convolutional-neural-networks data data-science data-visualization feature-tuning image image-classification mammogram-images mammographic-images neural-network resnet-50 resnet50 transfer-learning
Last synced: 27 Jan 2026
https://github.com/longzheng/southeastwater-usage-scraper
Extract hourly water usage data from South East Water portal website for digital water meters
australia data iot playwright southeastwater victoria water
Last synced: 06 Feb 2026
https://github.com/cherylisabella/statistics--caret
Training Regression and Classification Models using caret
data data-analysis data-mining data-science datascience dataset r statistics
Last synced: 24 Jun 2025
https://github.com/nowosad/cllc
Country-level Land Cover - categories and transitions
data dataset land-cover land-cover-transitions r
Last synced: 04 Apr 2025
https://github.com/nazar-pc/fixed-size-multiplexer
A tiny library for multiplexing data chunks into blocks of fixed size and vice versa
chunk data demultiplex demux fixed multiplex mux size
Last synced: 31 Oct 2025
https://github.com/mongoexpuser/synthetic-drilling-data-app-for-sqlite-ml
Generate synthetic drilling data that can be used for testing machine learning (ML) models.
classification data drilling events inference machine-learning ml-models node-sqlite3 nodejs prediction python sqlite3 training
Last synced: 08 Apr 2026
https://github.com/mohammadkarbalaee/introduction-to-data-science-sbu
Reports and full documentation of the introduction to data science course held at SBU
data data-science python shahid-beheshti-university
Last synced: 27 Mar 2025
https://github.com/weisscharlesj/data_scicompforchem
Zipped data for SciCompforChem book for easy download
chemistry chemistry-education data data-visualization python
Last synced: 07 Nov 2025
https://github.com/ugurcanerdogan/cross-validation-with-imbalanced-dataset
BBM467*SDSP - Small Data Science Project - Things to consider in cross validation and resampling when dealing with Imbalanced Data : What is the right way?
bbm467 cross-validation data data-science kfold-cross-validation logistic-regression machine-learning oversampling sdsp smote
Last synced: 21 Jun 2025
https://github.com/shysolocup/aepl
A Node.JS multi-layered class creation package with built-in parenting systems that let you get info from classes above as well as better function and property makers for easier to read and understand development and modding support inspired by Roblox's Studio API.
aepl backend classes data framework game-development game-framework javascript js js-class js-framework lightweight nodejs package
Last synced: 28 Oct 2025
https://github.com/satyam4229/college-predictor-system
The college predictor system is a Python-based application that utilizes a machine learning model to predict colleges and their corresponding degree programs and branches based on a student's JEE (Joint Entrance Examination) score.
data data-science jupyter-notebook kaggle prediction python
Last synced: 06 Apr 2026
https://github.com/danlsn/causality
A Personal Data Platform and the culmination of years of curiosity and learning in the Data Engineering space.
data data-engineering datawarehousing personal-data quantified-self
Last synced: 06 Mar 2026
https://github.com/rrighart/rrighart.github.io
A webpage about data science, programming, statistics and related topics
analyses data data-mining programming statistics
Last synced: 20 Jan 2026
https://github.com/aaronmeder/social-history
A quick look into your history on social media. Drop in the archives you've downloaded from Facebook and Instagram and see some stats about your time on the networks.
archives data facebook instagram statistics stats
Last synced: 27 Mar 2025
https://github.com/lukanedimovic/table_editor
A simple table data editor, with easily scalable functions and operations & a nice GUI
data data-science formula java parser parsing preprocessing swing tokenizer
Last synced: 04 Apr 2025
https://github.com/felixklauke/atomizer
Playing around with butter knife, android bindings and rx java.
binding butterknife data java react rx rxjava
Last synced: 15 May 2026
https://github.com/mickeyshi-syd/actuarial-hackathon-2019
2019 Actuarial Hackathon
actuarial actuaries analytics data data-science hackathon
Last synced: 15 Jul 2025
https://github.com/minightdev/paperclip
Paperclip is a powerful privacy-focused data breach search engine that empowers users to swiftly and securely investigate breaches using email addresses and phone numbers. Our robust search engine delivers real-time results while prioritizing the privacy and security of user queries.
beaches data database pwn pwned search-engine
Last synced: 22 Mar 2025
https://github.com/fiedsch/datamanagement
Data management helpers (PHP-CLI)
csv-data data datamanagement helper php
Last synced: 05 Apr 2025
https://github.com/sermetpekin/evdscpp
evdscpp is a C++ library for fast, efficient, and user-friendly interaction with the EVDS API Server. Designed with performance in mind, it provides built-in caching, an Excel export option, and an intuitive user interface for configuring and retrieving data. evdscpp can be extended for integration with other C++ projects and offers options for use
cbrt central-bank cpp data edds evds evds-api evdscpp tcmb tcmb-api
Last synced: 07 Sep 2025
https://github.com/quantalabs/climate
Graphs temperatures of the US, Caribbean Nations, and the world from 1967, 1910, and 1880 to 2020, respectively.
caribbean caribbean-temp caribbean-temperature carribean-climate climate climate-change data data-visualization global global-climate global-temp global-temperature global-warming graphing-with-python graphs us us-climate us-temp us-temperature world
Last synced: 29 Mar 2025
https://github.com/heikomuller/histore
Library for maintaining snapshots of evolving tabular data sets
Last synced: 10 Apr 2025
https://github.com/kale-ko/ejcl
An advanced configuration library for Java with support for local files, mysql databases, and more.
config configs configuration configuration-files data java java-serialization json mariadb mysql parser serialization serializer smile yaml
Last synced: 02 Feb 2026
https://github.com/sun-lab-nbb/sl-experiment
A Python library that provides tools to acquire, manage, and preprocess scientific data in the Sun (NeuroAI) lab.
ataraxis data experiment methods neuroscience
Last synced: 07 Mar 2026
https://github.com/woctezuma/download-steam-banners-data
Data consisting of Steam banners.
Last synced: 06 Jan 2026
https://github.com/hariprashad-ravikumar/ai-datascience-lab
AI‑DataScience‑Lab is a web app for uploading CSV datasets, cleaning with Pandas, and running quick exploratory analyses and regression models using scikit‑learn. Its modular design supports future AI extensions, like deep learning with TensorFlow or insight generation via the OpenAI API.
ai api azure cloudcomputing data data-analysis data-science data-visualization mathplotlib numpy openai pandas python scikit-learn
Last synced: 02 Aug 2025
https://github.com/serhatkacmaz/cpp-datastructuresandalgortihms
Contains codes related to data structures
algorithms cplusplus data data-structures
Last synced: 10 Jul 2025
https://github.com/randomfractals/unfolded-map-renderer
Unfolded Map 🗺️ Notebook 📓 Renderer for VSCode.
cell data flat-data geo geo-location geo-spatial map notebook output renderer unfolded view vscode
Last synced: 21 Mar 2025
https://github.com/polina-prokofieva/viewjson
The class for convenient visualization of json with some settings.
data data-visualization es5 es6 javascript json
Last synced: 15 May 2026
https://github.com/maskedsyntax/covid-tracker
Qt app to keep a track of Covid-19 records of different countries.
coronavirus coronavirus-tracking covid-19 data parsing scraping scraping-websites tracker web-scraping
Last synced: 29 Mar 2025
https://github.com/pjmagee/starwars-data
A Star Wars Web app with Charts and entire Timeline events!
aspire blazor blazor-webassembly data database dataset docker dotnet json starwars starwars-data starwars-fandom
Last synced: 07 Mar 2026
https://github.com/tbille/github-stars-datastudio-connector
GitHub Stars connector for datastudio
data data-visualization datastudio github github-stars
Last synced: 18 May 2026
https://github.com/horizom/dto
Data Transfer Objects for all PHP applications.
Last synced: 14 Sep 2025
https://github.com/dataopstix/modelt
Modelt(mow·delt) is a modern data integration solution that connects data to data for advanced analytics.
airbyte airflow airflow-docker data data-analysis data-visualization database dbt elt etl etl-automation metabase metadata modern modern-dev modernization
Last synced: 28 Mar 2025
https://github.com/emrecpp/datapacket-cpp
Send, recv, encrypt, decrypt, compress data as Packet and send it with socket for C++.
compress data deserialization deserialize deserializer encrypt packet recv send serialization serialize serializer socket storage
Last synced: 28 Mar 2025
https://github.com/meetyildiz/pandazip
Reduce RAM footprint of Pandas DataFrame without losing information.
compression data data-mining data-science machine-learning pandas utilities
Last synced: 20 Jan 2026
https://github.com/sharmadhiraj/free-json-datasets
Collection of free JSON data that are scraped and parsed from different websites.
collection crawler data data-scraping datasets json sports statistics web-scraping
Last synced: 28 Mar 2025
https://github.com/gabrielu3/ori-inverted-list
Inverted List made for a college discipline named Organization and Retrieval of Information
c data data-structures index inverted-index list
Last synced: 24 Feb 2026
https://github.com/godeltech/godeltech.data
.NET library to access data storage with Unit of Work, Repository and Entity classes
data entity repository unitofwork
Last synced: 30 Apr 2025
https://github.com/godeltech/godeltech.data.entityframeworkcore
Library to access database with Unit of Work, Repository and Entity classes for Entity Framework Core.
data entity entity-framework-core repository unitofwork
Last synced: 30 Apr 2025
https://github.com/thealphadollar/messiah
Messiah: The Mighty Son Of God Is Here To Help You Through Times Of Calamity
azure backend data data-analysis flask frontend materialize natural-disasters
Last synced: 19 Jan 2026
https://github.com/gavinr/minor-league-baseball
Data set of minor league baseball teams
baseball data hacktoberfest maps open-data open-datasets sports
Last synced: 06 Apr 2025
https://github.com/bgmp/tesis-german-deuster
Datos estadísticos para tercería de una tésis
Last synced: 28 Mar 2025
https://github.com/d2hydro/hydrodashboards
Open Source Dashboards for hydro data
bokeh data geopandas hydrology hydrometrics pandas sheetjs
Last synced: 26 Jul 2025
https://github.com/alexandregazagnes/scikit-res
Very Basic package to store results of ML models Grid search results are hard to exploit. This package aims to store them in a more convenient way.
data machine-learning mlops mlops-workflow results scikit-learn
Last synced: 20 Jan 2026
https://github.com/crazywolf132/jungla
🌲🌲🌲 Your new favourite data manipulator
backend data data-manipulation easy-to-use frontend fullstack help-wanted interpreter language library microservices mobile nodejs parser programming-language
Last synced: 05 Apr 2025
https://github.com/samboycoding/hungergames-data
data hunger-games javascript json
Last synced: 15 May 2026
https://github.com/schbenedikt/datamining
Heise (https://heise.de) News Crawler
data data-science heise postgresql web-crawler
Last synced: 10 Apr 2025
https://github.com/sheweny/discord-resolve
This module groups together functions to retrieve data from different types of arguments.
data discord discord-js mentions resolver sheweny utility
Last synced: 29 Oct 2025
https://github.com/rahulraikwar00/advault
Advault is a adhaar data vault generation tool
aadhaar data hacktoberfest uidai vault
Last synced: 05 Apr 2025
https://github.com/wklee610/datapush
MySQL Data Generator
automatic data data-generator database dataset db dynamic generator mysql test testing-tools
Last synced: 25 Jan 2026
https://github.com/imadsaddik/bodmaghdataset
BoDmagh dataset is a Supervised Fine-Tuning (SFT) dataset for the Darija language
arabic-llm arabic-nlp darija-llm darija-nlp data dataset fine-tuning llm nlp sft
Last synced: 03 Apr 2025
https://github.com/philhawksworth/netlify-plugin-trello-lists
A plugin to fetch the JSON data of a public Trello board, and stash the data for each list in a JSON file before your build runs making the data available to your static site generator at build time.
api data eleventy netlify plugin trello
Last synced: 20 Jan 2026