data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum
- Last updated: 2026-07-01 00:07:35 UTC
https://github.com/mickeyshi-syd/actuarial-hackathon-2019
2019 Actuarial Hackathon
actuarial actuaries analytics data data-science hackathon
Last synced: 15 Jul 2025
https://github.com/mongoexpuser/synthetic-drilling-data-app-for-sqlite-ml
Generate synthetic drilling data that can be used for testing machine learning (ML) models.
classification data drilling events inference machine-learning ml-models node-sqlite3 nodejs prediction python sqlite3 training
Last synced: 08 Apr 2026
https://github.com/dataopstix/modelt
Modelt (mow·delt) is a modern data integration solution that connects data to data for advanced analytics.
airbyte airflow airflow-docker data data-analysis data-visualization database dbt elt etl etl-automation metabase metadata modern modern-dev modernization
Last synced: 28 Mar 2025
https://github.com/shahiakhilesh1304/dsa
This repository is my personal space for practicing Data Structures and Algorithms. I'll be adding questions, solutions, and explanations as I progress, covering topics from basics to advanced.
algorithms code contribution contributions-and-collaboration contributions-welcome data data-structures data-structures-and-algorithms dsa dsa-algorithm dsa-learning-series dsa-practice hacktober hacktoberfest hacktoberfest-accepted hacktoberfest-starter practice programming python-3
Last synced: 13 Apr 2025
https://github.com/cherylisabella/statistics--caret
Training Regression and Classification Models using caret
data data-analysis data-mining data-science datascience dataset r statistics
Last synced: 24 Jun 2025
https://github.com/fiedsch/datamanagement
Data management helpers (PHP-CLI)
csv-data data datamanagement helper php
Last synced: 05 Apr 2025
https://github.com/sharmadhiraj/free-json-datasets
Collection of free JSON data that are scraped and parsed from different websites.
collection crawler data data-scraping datasets json sports statistics web-scraping
Last synced: 28 Mar 2025
https://github.com/alexandregazagnes/scikit-res
Very basic package to store results of ML models. Grid search results are hard to exploit. This package aims to store them in a more convenient way.
data machine-learning mlops mlops-workflow results scikit-learn
Last synced: 20 Jan 2026
https://github.com/philhawksworth/netlify-plugin-trello-lists
A plugin to fetch the JSON data of a public Trello board and stash the data for each list in a JSON file before your build runs, making the data available to your static site generator at build time.
api data eleventy netlify plugin trello
Last synced: 20 Jan 2026
https://github.com/lemmotresto/migrational
A data migration library
data java migration versioning
Last synced: 30 Oct 2025
https://github.com/robjg/dido
Data In/Data Out in many formats
csv-parser data etl java json-parser
Last synced: 11 Jan 2026
https://github.com/writetome51/big-dataset-paginator
A TypeScript/JavaScript class for pagination in a real-world web app.
app data javascript pagination paginator typescript
Last synced: 17 May 2026
https://github.com/justjavac/deno_open_data
data deno deno-mod deno-module denomod
Last synced: 17 May 2026
https://github.com/max-tonny8/android_web3
This is a library for Android to call data from a node on the Ethereum chain or the Solana chain.
android blockchain coroutines coroutines-android data eth-call ethereum kotlin ktx retrofit rpc smart-contracts solana web3 web3j
Last synced: 27 Mar 2025
https://github.com/wfamous/fiv_update-data
This project automates the retrieval, processing, and publishing of digital product data for our Shopify store. It integrates Google Cloud Platform (GCP), Amazon Web Services (AWS), Terraform (Tofu), Python, Bash, Ansible and GitHub Actions to manage data pipelines efficiently.
ansible aws bash data data-analysis data-science devops gcp python pythonpackage shopify terraform tofu
Last synced: 17 Feb 2026
https://github.com/joisino/twinpaper
Code for "Twin Papers: A Simple Framework of Causal Inference for Citations via Coupling" (CIKM 2022)
causal-inference data research science-of-science
Last synced: 21 Mar 2025
https://github.com/eliot-akira/png-compressor
Compress and encode data as PNG image
Last synced: 17 Mar 2025
https://github.com/jasondrawdy/compendio
Collection of common and noteworthy extension methods, security tools, and filesystem functions generally found in most applications; focusing on extensibility and portability.
compendium converters cryptography data extensions generators hashing library security utilities validation windows
Last synced: 18 May 2026
https://github.com/nowosad/cllc
Country-level Land Cover - categories and transitions
data dataset land-cover land-cover-transitions r
Last synced: 04 Apr 2025
https://github.com/kale-ko/ejcl
An advanced configuration library for Java with support for local files, mysql databases, and more.
config configs configuration configuration-files data java java-serialization json mariadb mysql parser serialization serializer smile yaml
Last synced: 02 Feb 2026
https://github.com/randomfractals/unfolded-map-renderer
Unfolded Map 🗺️ Notebook 📓 Renderer for VSCode.
cell data flat-data geo geo-location geo-spatial map notebook output renderer unfolded view vscode
Last synced: 21 Mar 2025
https://github.com/tbille/github-stars-datastudio-connector
GitHub Stars connector for datastudio
data data-visualization datastudio github github-stars
Last synced: 18 May 2026
https://github.com/meetyildiz/pandazip
Reduce RAM footprint of Pandas DataFrame without losing information.
compression data data-mining data-science machine-learning pandas utilities
Last synced: 20 Jan 2026
https://github.com/gabrielu3/ori-inverted-list
Inverted list made for a college course named Organization and Retrieval of Information
c data data-structures index inverted-index list
Last synced: 24 Feb 2026
https://github.com/jorgeatgu/clau
📊 Charts with d3, responsive and reusable
charts d3js d3v5 data data-visualization data-viz graphs
Last synced: 12 Sep 2025
https://github.com/kabeech/real-dice
Random number generation based on physical media touched by humans
data dice haskell human-computer-interaction physical random random-generation random-number-generators rng
Last synced: 10 Apr 2025
https://github.com/stdlib-js/ndarray-base-buffer-ctors
ndarray data buffer constructors.
array base buffer constructor constructors ctor ctors data dtype dtypes javascript multidimensional ndarray node node-js nodejs stdlib types utilities utility
Last synced: 06 Apr 2025
https://github.com/karlinarayberinger/karbytes_23andme_ancestry_data
This repository contains web page source code and media files which constitute (some of) the intellectual property encapsulated by the website named Karbytes For Life Blog dot WordPress dot Com.
2023 23andme ancestry blog data dna genome karbytes website wordpress
Last synced: 12 May 2025
https://github.com/anonympins/data-primals-engine
Manage and automate your data at scale 🚀 With data-primals-engine you get workflows, dashboards, alerts, i18n, client integration & AI assistant — all open-source, all MongoDB powered.
api automation data data-analysis data-engineer data-visualization database expressjs low-code mongodb nodejs rest-api
Last synced: 07 Mar 2026
https://github.com/chickennungets/ifsc-data-analysis
An implementation of Elo-MMR ranking in the boulder and lead disciplines.
d3 data elo elo-rating ifsc visualization webscraping
Last synced: 20 Jan 2026
https://github.com/smithsonian/massdigi-tools
Scripts that support Mass Digitization projects by the Digitization Program Office, OCIO
data data-validation digital-files digitization digitization-workflows museums
Last synced: 13 Apr 2025
https://github.com/stdlib-js/array-typed-integer-ctors
Integer-valued typed array constructors.
array constructor constructors ctor ctors data dtype dtypes javascript node node-js nodejs stdlib structure type typed typed-array types utilities
Last synced: 12 Jan 2026
https://github.com/Ekey/ER.DATA.Tool
Tool for extracting data archives from the mobile game Earth Revival (Project Arrival)
data earth-revival idx project-arrival
Last synced: 19 May 2026
https://github.com/ange007/jquery.mydata
jQuery.myData - Small jQuery & Zepto plugin for two-way data binding.
data data-binding jquery jquery-plugin zepto zepto-plugin zeptojs
Last synced: 19 May 2026
https://github.com/exaluc/webhookcatcher
Catch your webhooks like a dream
api catcher data webhook webhook-callbacks webhooks-catcher
Last synced: 14 Apr 2025
https://github.com/kodie/migrate-acf-field-data-to-repeater
A WordPress plugin that migrates field metadata for ACF fields that have been moved inside of a repeater
acf acf-field acf-fields advance-custom-field data data-migration data-migration-tool wordpress wordpress-plugin
Last synced: 19 May 2026
https://github.com/mzazakeith/puppetmaster
Puppeteer & Crawl4AI microservice for web automation, scraping, and AI processing with Bull queues
agent ai automation bull bullmq chrome crawl4ai crawler data data-extraction extraction gemini llm llms openai playwright puppeteer web-automation
Last synced: 13 May 2025
https://github.com/toviszsolt/stormflow
StormFlow - A Lightweight Data Modeling and Storage Library
data database datamanagement datastore db document jsondatabase memorydatabase model mongodb mongoose nodb nosql query schema stormflow
Last synced: 13 May 2025
https://github.com/stdlib-js/array-bool
BooleanArray.
array binary bool boolean booleanarray data javascript mask node node-js nodejs stdlib structure typed typed-array types
Last synced: 13 May 2025
https://github.com/longzheng/southeastwater-usage-scraper
Extract hourly water usage data from South East Water portal website for digital water meters
australia data iot playwright southeastwater victoria water
Last synced: 06 Feb 2026
https://github.com/nazar-pc/fixed-size-multiplexer
A tiny library for multiplexing data chunks into blocks of fixed size and vice versa
chunk data demultiplex demux fixed multiplex mux size
Last synced: 31 Oct 2025
https://github.com/ugurcanerdogan/cross-validation-with-imbalanced-dataset
BBM467*SDSP - Small Data Science Project - Things to consider in cross validation and resampling when dealing with Imbalanced Data: What is the right way?
bbm467 cross-validation data data-science kfold-cross-validation logistic-regression machine-learning oversampling sdsp smote
Last synced: 21 Jun 2025
https://github.com/minightdev/paperclip
Paperclip is a powerful privacy-focused data breach search engine that empowers users to swiftly and securely investigate breaches using email addresses and phone numbers. Our robust search engine delivers real-time results while prioritizing the privacy and security of user queries.
beaches data database pwn pwned search-engine
Last synced: 22 Mar 2025
https://github.com/quantalabs/climate
Graphs temperatures of the US, Caribbean Nations, and the world from 1967, 1910, and 1880 to 2020, respectively.
caribbean caribbean-temp caribbean-temperature carribean-climate climate climate-change data data-visualization global global-climate global-temp global-temperature global-warming graphing-with-python graphs us us-climate us-temp us-temperature world
Last synced: 29 Mar 2025
https://github.com/sun-lab-nbb/sl-experiment
A Python library that provides tools to acquire, manage, and preprocess scientific data in the Sun (NeuroAI) lab.
ataraxis data experiment methods neuroscience
Last synced: 07 Mar 2026
https://github.com/maskedsyntax/covid-tracker
Qt app to keep track of Covid-19 records of different countries.
coronavirus coronavirus-tracking covid-19 data parsing scraping scraping-websites tracker web-scraping
Last synced: 29 Mar 2025
https://github.com/pjmagee/starwars-data
A Star Wars Web app with Charts and entire Timeline events!
aspire blazor blazor-webassembly data database dataset docker dotnet json starwars starwars-data starwars-fandom
Last synced: 07 Mar 2026
https://github.com/horizom/dto
Data Transfer Objects for all PHP applications.
Last synced: 14 Sep 2025
https://github.com/emrecpp/datapacket-cpp
Send, recv, encrypt, decrypt, compress data as Packet and send it with socket for C++.
compress data deserialization deserialize deserializer encrypt packet recv send serialization serialize serializer socket storage
Last synced: 28 Mar 2025
https://github.com/gavinr/minor-league-baseball
Data set of minor league baseball teams
baseball data hacktoberfest maps open-data open-datasets sports
Last synced: 06 Apr 2025
https://github.com/crazywolf132/jungla
🌲🌲🌲 Your new favourite data manipulator
backend data data-manipulation easy-to-use frontend fullstack help-wanted interpreter language library microservices mobile nodejs parser programming-language
Last synced: 05 Apr 2025
https://github.com/rahulraikwar00/advault
Advault is an Aadhaar data vault generation tool
aadhaar data hacktoberfest uidai vault
Last synced: 05 Apr 2025
https://github.com/wklee610/datapush
MySQL Data Generator
automatic data data-generator database dataset db dynamic generator mysql test testing-tools
Last synced: 25 Jan 2026
https://github.com/elggem/shapeset-generator
Generates varying shapes as training data for neural nets: ShapeSet.
data machine-learning numpy opencv python svg training
Last synced: 11 Apr 2026
https://github.com/garciparedes/matlab-examples
Set of awesome Matlab Examples
data data-science examples garciparedes matlab statistics university-of-valladolid
Last synced: 05 Mar 2025
https://github.com/automators-com/tweaked
Fine tune your data
ai data nextjs rust synthetic tailwindcss tauri
Last synced: 08 Apr 2026
https://github.com/reppon97/cryptosnake
Simple, unofficial python wrapper for Binance API. You'll find this easy-to-use package helpful if you're interested in general market data and cryptocurrency values. You don't need to have a Binance account or API Key since you can't purchase/trade cryptocurrencies using this package.
api binance bitcoin crypto cryptocurrencies cryptocurrency cryptocurrency-exchanges cryptography data ethereum json litecoin python python3 statistics
Last synced: 05 Mar 2025
https://github.com/coatless-rpkg/ucimlrepo
An unofficial R port of the Python package to download data off of the UCI ML repository
data data-science machine-learning rstats statistics uci-machine-learning uci-machine-learning-repository web-api
Last synced: 28 Jun 2025
https://github.com/real-veersandhu/scifaa-covid-19-project
📈 COVID-19 Data Science Project (2021 Internship @ SCI-FAA)
covid-19 data data-science data-visualization python
Last synced: 14 May 2026
https://github.com/laurensius/covid-19-info
data datatabel grafik peta visualisasi
Last synced: 21 May 2026
https://github.com/masesgroup/datadistributionmanager
A reliable subsystem to distribute data across multiple datacenters using multiple languages (C/C++, .NET, JVM enabled languages) over multiple technologies (e.g. Apache Kafka, OpenDDS, etc.)
apachekafka availability business-solutions businesscontinuity data durability opendds reliability softwareneverlockdown
Last synced: 14 Apr 2025
https://github.com/rawsashimi1604/jobextract
Scrapes LinkedIn data. Conducts sentiment analysis on what traits and qualifications employers are looking for.
data data-analysis data-analytics data-cleaning linkedin mvc python webscraper
Last synced: 06 Nov 2025
https://github.com/tuananh/opentravel
✈ A collection of travel related data
Last synced: 09 Oct 2025
https://github.com/richardschoen/ileaccess
Save file packaging of the IBM i libraries ILEASTIC, NOXDB and ILEUSION for convenient installation. ILEastic allows HTTP microservices to be created on IBM i. NOXDB makes SQL, JSON and XML easy for IBM i. ILEusion packages it all up to make IBM i data and program call services easier.
as400 call cl command data database db2 http https ibmi ile ileastic ileusion microservice noxdb program queue rpg sql xmlservice
Last synced: 24 Jul 2025
https://github.com/d2hydro/hydrodashboards
Open Source Dashboards for hydro data
bokeh data geopandas hydrology hydrometrics pandas sheetjs
Last synced: 26 Jul 2025
https://github.com/weisscharlesj/data_scicompforchem
Zipped data for SciCompforChem book for easy download
chemistry chemistry-education data data-visualization python
Last synced: 07 Nov 2025
https://github.com/rudxain/ideas
A collection of my non-started projects
brain-storms brainstorming broken concepts crap data dreams experiments graphics hardware inspiration lazy mono-repository monorepo pet-project proposals software text unfinished wishes
Last synced: 06 Feb 2026
https://github.com/jmbhughes/goes_solar_retriever
Tool to retrieve GOES-R Solar Data
data data-retrieval data-science goes-16 goes-satellite goes16 goes17 solar solar-physics
Last synced: 07 Jan 2026
https://github.com/espoirmur/balobi_nini
An End to End Data Science Project, where I used Tweepy and Airflow to collect tweets related to the DRC and topic modeling techniques to discover which topics Congolese are talking about on Twitter.
Last synced: 24 Aug 2025
https://github.com/lafayettegabe/g2m-insight-for-cab-investment-firm
📊 Exploratory Data Analysis (EDA) on multiple datasets related to the cab industry in the US, to provide actionable insights and recommendations to a private firm looking to invest in the market. The analysis includes data cleaning, transformation, visualization, and hypothesis testing.
big-data data data-analysis data-science data-visualization eda gotomarket
Last synced: 13 Jun 2025
https://github.com/zalweny26/tools
Just a bunch of tools made in TypeScript.
algorithms data dimensionality distances helpers reduction sortings structures tools utils
Last synced: 03 Feb 2026
https://github.com/ashwinpn/visualization
Data Visualization using Matplotlib, Pandas Visualization, Seaborn, ggplot, and Plotly.
analysis data data-analysis data-science data-visualization graphs plots python python3 visualization
Last synced: 13 Apr 2026
https://github.com/joelllllll/up-sync
Sync account and transaction data from up bank to your local environment
accounts bank data postgres sync transactions up upbank
Last synced: 06 Jul 2025
https://github.com/datafold/vhol-demo
Get hands-on examples of dbt + Datafold CI/CD workflows
data data-engineering datafold dbt diff
Last synced: 28 Dec 2025
https://github.com/sadcenter/messenger
Data messaging system between servers using popular messaging brokers
Last synced: 06 Aug 2025
https://github.com/steelcake/cherry-pipelines
A collection of pipelines built with cherry
blockchain clickhouse data pipeline pyhton
Last synced: 09 Mar 2026
https://github.com/ctechhindi/auto-fill-form-data
AUTO FILL AND AUTOCOMPLETE USER DATA WITH KEY NAME
autocomplete chrome-extension data extension
Last synced: 17 Apr 2026
https://github.com/a3r0id/lightshot-data-miner
A random idea I had a while back to make a data miner for lightshot. Never released this but after a friend sent me a post about lightshot's transparency I figured it'd be a good time to release this. I've included some output from a run before making the repo. I am not responsible for the imagery or its contents.
brute-force bruteforce data dataset face-recognition image-processing lightshot mining scraper scraping text-recognition
Last synced: 19 Oct 2025
https://github.com/peterdavehello/nrd-list-archive
🌐📂 A collection of past NRD lists to explore—perfect for fun, research, or just plain curiosity! 🎉🔍✨
Last synced: 17 Mar 2026
https://github.com/vikyw89/usesyncv
A simplistic React global store with pregenerated CRUD and built-in async fetch
data fetch mobx reactjs reactquery redux state state-management store swr zustand
Last synced: 06 Jan 2026
https://github.com/bastgau/snow-revoke-privileges
Script designed to simplify the management of permissions in your Snowflake databases.
data database dba dev-container python snowflake
Last synced: 20 Apr 2025
https://github.com/priyanka7411/customer-segmentation-churn-dashboard
📊 Streamlit + Plotly dashboard for customer segmentation, RFM analysis, and churn prediction using machine learning.
churn data machine-learning pandas prediction python rfm rfm-analysis streamlit visualization
Last synced: 14 Apr 2026