data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-07-02 00:07:45 UTC
- JSON Representation
https://github.com/woctezuma/download-steam-banners-data
Data consisting of Steam banners.
Last synced: 06 Jan 2026
https://github.com/quantalabs/climate
Graphs temperatures of the US, Caribbean Nations, and the world from 1967, 1910, and 1880 to 2020, respectively.
caribbean caribbean-temp caribbean-temperature carribean-climate climate climate-change data data-visualization global global-climate global-temp global-temperature global-warming graphing-with-python graphs us us-climate us-temp us-temperature world
Last synced: 29 Mar 2025
https://github.com/mostafanabieh/image-classification-with-data-augmentation
Project for Data augmentation with tensorflow v2
data deep-learning image-classification machine-learning tensorflow tensorflow2
Last synced: 07 May 2026
https://github.com/mickeyshi-syd/actuarial-hackathon-2019
2019 Actuarial Hackathon
actuarial actuaries analytics data data-science hackathon
Last synced: 15 Jul 2025
https://github.com/sharmadhiraj/free-json-datasets
Collection of free JSON data that are scraped and parsed from different websites.
collection crawler data data-scraping datasets json sports statistics web-scraping
Last synced: 28 Mar 2025
https://github.com/minightdev/paperclip
Paperclip is a powerful privacy-focused data breach search engine that empowers users to swiftly and securely investigate breaches using email addresses and phone numbers. Our robust search engine delivers real-time results while prioritizing the privacy and security of user queries.
beaches data database pwn pwned search-engine
Last synced: 22 Mar 2025
https://github.com/heikomuller/histore
Library for maintaining snapshots of evolving tabular data sets
Last synced: 10 Apr 2025
https://github.com/alexandregazagnes/scikit-res
Very Basic package to store results of ML models Grid search results are hard to exploit. This package aims to store them in a more convenient way.
data machine-learning mlops mlops-workflow results scikit-learn
Last synced: 20 Jan 2026
https://github.com/philhawksworth/netlify-plugin-trello-lists
A plugin to fetch the JSON data of a public Trello board, and stash the data for each list in a JSON file before your build runs making the data available to your static site generator at build time.
api data eleventy netlify plugin trello
Last synced: 20 Jan 2026
https://github.com/ugurcanerdogan/cross-validation-with-imbalanced-dataset
BBM467*SDSP - Small Data Science Project - Things to consider in cross validation and resampling when dealing with Imbalanced Data : What is the right way?
bbm467 cross-validation data data-science kfold-cross-validation logistic-regression machine-learning oversampling sdsp smote
Last synced: 21 Jun 2025
https://github.com/lemmotresto/migrational
A data migration library
data java migration versioning
Last synced: 30 Oct 2025
https://github.com/enricocid/monitoraggio-vaccini-italia
Sito web statico per github.com/apalladi/covid_vaccini_monitoraggio
covid-19 covid-19-data covid-19-data-analysis data data-analysis data-visualization dataset python python3 python37 sars-cov-2
Last synced: 09 Sep 2025
https://github.com/johnmackintosh/simd2016_tmap
Mapping SIMD with tmap - static & interactive
data data-science data-visualization mapping r visualisation
Last synced: 20 Mar 2025
https://github.com/masesgroup/datadistributionmanager
A reliable subsystem to distribute data across multiple datacenters using multiple languages (C/C++, .NET, JVM enabled languages) over multiple technologies (e.g. Apache Kafka, OpenDDS, etc)
apachekafka availability business-solutions businesscontinuity data durability opendds reliability softwareneverlockdown
Last synced: 14 Apr 2025
https://github.com/robjg/dido
Data In/Data Out in many formats
csv-parser data etl java json-parser
Last synced: 11 Jan 2026
https://github.com/felixklauke/atomizer
Playing around with butter knife, android bindings and rx java.
binding butterknife data java react rx rxjava
Last synced: 15 May 2026
https://github.com/antoineaugusti/purchasing-power
Archive daily data about purchasing power parity: how much goods should cost in various countries
archive data purchasing-power-parity
Last synced: 28 Oct 2025
https://github.com/nazar-pc/fixed-size-multiplexer
A tiny library for multiplexing data chunks into blocks of fixed size and vice versa
chunk data demultiplex demux fixed multiplex mux size
Last synced: 31 Oct 2025
https://github.com/helixspiral/ndbc
Golang wrapper for the National Data Buoy Center (NDBC)
data data-science golang government-data ndbc ndbc-buoy-data noaa noaa-api noaa-buoys noaa-data noaa-weather wrapper wrapper-api wrapper-library
Last synced: 14 Jun 2025
https://github.com/writetome51/big-dataset-paginator
A TypeScript/JavaScript class for pagination in a real-world web app.
app data javascript pagination paginator typescript
Last synced: 17 May 2026
https://github.com/justjavac/deno_open_data
data deno deno-mod deno-module denomod
Last synced: 17 May 2026
https://github.com/max-tonny8/android_web3
This is a library for Android to call data from Node on Ethereum Chain or Solana Chain
android blockchain coroutines coroutines-android data eth-call ethereum kotlin ktx retrofit rpc smart-contracts solana web3 web3j
Last synced: 27 Mar 2025
https://github.com/danlsn/causality
A Personal Data Platform and the culmination of years of curiosity and learning in the Data Engineering space.
data data-engineering datawarehousing personal-data quantified-self
Last synced: 06 Mar 2026
https://github.com/headless-start/data-augmentation-impact
This repository contains effect of Data Augmentation of Training Set during Model Training.
augmented-images cuda data gpu keras matplotlib mnist opencv-python python3 tensorflow training-data
Last synced: 05 Apr 2026
https://github.com/satyam4229/college-predictor-system
The college predictor system is a Python-based application that utilizes a machine learning model to predict colleges and their corresponding degree programs and branches based on a student's JEE (Joint Entrance Examination) score.
data data-science jupyter-notebook kaggle prediction python
Last synced: 06 Apr 2026
https://github.com/longzheng/southeastwater-usage-scraper
Extract hourly water usage data from South East Water portal website for digital water meters
australia data iot playwright southeastwater victoria water
Last synced: 06 Feb 2026
https://github.com/shysolocup/aepl
A Node.JS multi-layered class creation package with built-in parenting systems that let you get info from classes above as well as better function and property makers for easier to read and understand development and modding support inspired by Roblox's Studio API.
aepl backend classes data framework game-development game-framework javascript js js-class js-framework lightweight nodejs package
Last synced: 28 Oct 2025
https://github.com/jesusgraterol/bitcoin-lightning-network-stats-dataset-builder
The dataset builder script extracts Bitcoin's Lightnining Network statistics through Mempool.space's public API. The data is stored in a .csv file, facilitating its use in data science and machine learning projects.
bitcoin blockchain blockchain-technology data data-science dataset dataset-generation lightning-network machine-learning
Last synced: 16 May 2026
https://github.com/stdlib-js/array-bool
BooleanArray.
array binary bool boolean booleanarray data javascript mask node node-js nodejs stdlib structure typed typed-array types
Last synced: 13 May 2025
https://github.com/toviszsolt/stormflow
StromFlow - A Lightweight Data Modeling and Storage Library
data database datamanagement datastore db document jsondatabase memorydatabase model mongodb mongoose nodb nosql query schema stormflow
Last synced: 13 May 2025
https://github.com/mzazakeith/puppetmaster
Puppeteer & Crawl4AI microservice for web automation, scraping, and AI processing with Bull queues
agent ai automation bull bullmq chrome crawl4ai crawler data data-extraction extraction gemini llm llms openai playwright puppeteer web-automation
Last synced: 13 May 2025
https://github.com/kodie/migrate-acf-field-data-to-repeater
A WordPress plugin that migrates field metadata for ACF fields that have been moved inside of a repeater
acf acf-field acf-fields advance-custom-field data data-migration data-migration-tool wordpress wordpress-plugin
Last synced: 19 May 2026
https://github.com/antvis/create-antv-demo
A simple CV-dashboard framework for practicing how to use AntV.
antv cv dashboard data resume resume-template resume-website visualization
Last synced: 09 Apr 2025
https://github.com/radekbednarik/data_generator
Random data generator using Python. Generate data files with random string, floats, ints, dates via console or TOML files..
csv data generator python python3 random test-data-generator
Last synced: 13 Dec 2025
https://github.com/laurensius/covid-19-info
data datatabel grafik peta visualisasi
Last synced: 21 May 2026
https://github.com/exaluc/webhookcatcher
Catch your webhooks like a dream
api catcher data webhook webhook-callbacks webhooks-catcher
Last synced: 14 Apr 2025
https://github.com/ange007/jquery.mydata
jQuery.myData - Small jQuery&Zepto plugin for two-ways data binding.
data data-binding jquery jquery-plugin zepto zepto-plugin zeptojs
Last synced: 19 May 2026
https://github.com/mohammadkarbalaee/introduction-to-data-science-sbu
Reports and full documentation of the introduction to data science course held at SBU
data data-science python shahid-beheshti-university
Last synced: 27 Mar 2025
https://github.com/zenwor/table_editor
A simple table data editor, with easily scalable functions and operations & a nice GUI
data data-science formula java parser parsing preprocessing swing tokenizer
Last synced: 22 Jun 2025
https://github.com/andrei-vataselu/data-science-snippets
🧰 Essential EDA and Data Cleaning Helpers for Any DataFrame This collection of functions is designed to accelerate exploratory data analysis (EDA), quickly surface data quality issues, and offer high-level insights into the structure and content of your dataset.
artificial-intelligence data data-science eda feature-engineering hyperparamater-tunning library loading model-evaluation modeling preprocessing python snippets text-processing time-series visualization
Last synced: 10 Mar 2026
https://github.com/memair/apps
App Store for Memair
apps appstore data data-science quantified-self
Last synced: 06 Apr 2026
https://github.com/Ekey/ER.DATA.Tool
Tool for extract data archives from mobile game Earth Revival (Project Arrival)
data earth-revival idx project-arrival
Last synced: 19 May 2026
https://github.com/utrechtuniversity/dataprivacysurvey
Code for analysing data from the Data Privacy Survey (2022)
data gdpr open-science privacy rdm research research-data-management survey utrecht-university
Last synced: 16 Jun 2025
https://github.com/wfamous/fiv_update-data
This project automates the retrieval, processing, and publishing of digital product data for our Shopify store. It integrates Google Cloud Platform (GCP), Amazon Web Service (AWS), Terraform (Tofu), Python, Bash, Ansible and GitHub Actions to manage data pipelines efficiently.
ansible aws bash data data-analysis data-science devops gcp python pythonpackage shopify terraform tofu
Last synced: 17 Feb 2026
https://github.com/rrighart/rrighart.github.io
A webpage about data science, programming, statistics and related topics
analyses data data-mining programming statistics
Last synced: 20 Jan 2026
https://github.com/aaronmeder/social-history
A quick look into your history on social media. Drop in the archives you've downloaded from Facebook and Instagram and see some stats about your time on the networks.
archives data facebook instagram statistics stats
Last synced: 27 Mar 2025
https://github.com/stdlib-js/array-typed-integer-ctors
Integer-valued typed array constructors.
array constructor constructors ctor ctors data dtype dtypes javascript node node-js nodejs stdlib structure type typed typed-array types utilities
Last synced: 12 Jan 2026
https://github.com/acaciaman/db-autotest
DB Database test automation. This python package allows to create database object structure and load data from database.
Last synced: 05 May 2026
https://github.com/xxczaki/parsify-plugin-covid19
Parsify plugin, that adds COVID 19-related variables 🦠
confirmed coronavirus covid19 data deaths fun math parser parsify parsify-plugin plugin variable variables
Last synced: 13 Mar 2026
https://github.com/planarnetwork/feeds.planar.network
GTFS feeds for bus, train and plane
data feeds gtfs transit transportation
Last synced: 11 Feb 2026
https://github.com/d3oxy/country-state-data
A comprehensive JSON dataset containing countries, states, cities, regions, and languages with TypeScript support. Perfect for building location-based dropdowns, address forms, and geographical applications.
address cities countries currency data dropdown geographical iso json languages location regions states typescript
Last synced: 24 Jan 2026
https://github.com/pawelzny/vo
DDD Value Object implementation
data ddd-patterns object python3 value
Last synced: 15 Feb 2026
https://github.com/jaldekoa/nyfedapi
A Python wrapper to easily retrieve data from the Federal Reserve Bank of New York (FRBoNY) official API in pandas format.
api api-wrapper banking data finance pandas python united-states
Last synced: 08 Feb 2026
https://github.com/dhimmel/het.io-rep-data
Data from Project Rephetio for the het.io website
browser data datatables drug-repurposing rephetio
Last synced: 07 Feb 2026
https://github.com/skylinenando/javascript
autocomplete browser data disable events javascript language loop
Last synced: 14 Feb 2026
https://github.com/a3r0id/lightshot-data-miner
A random idea I had a while back to make a data miner for lightshot. Never released this but after a friend sent me a post about lightshot's transparency I figured it'd be a good time to release this. I've included some output from a run before making the repo. I am not responsible for the imagery or it's contents.
brute-force bruteforce data dataset face-recognition image-processing lightshot mining scraper scraping text-recognition
Last synced: 19 Oct 2025
https://github.com/tomwhite/chernoff
A visual mood indicator. One of the first Java programs I ever wrote.
chernoff-faces data visualization
Last synced: 20 Apr 2026
https://github.com/ipstack/finder
Define data by IP Address
composer data geo geoip info ip ip-database ip-search ipstack ipstack-finder php search
Last synced: 14 May 2026
https://github.com/rcourivaud/rcourivaud.github.io
Raphaël Courivaud
data database datascience python
Last synced: 21 Apr 2026
https://github.com/sapienzanlp/exploring-srl
Repository for the paper "Exploring Non-Verbal Predicates in Semantic Role Labeling: Challenges and Opportunities"
acl acl2023 conllu data dataset natural-language-processing nlp semantic-role-labeling srl
Last synced: 31 Jan 2026
https://github.com/zgbjgg/quetzal-examples
Examples using Quetzal :rocket: :bird:
analytics dashboard data data-visualization elixir erlang plotly web-app
Last synced: 24 Apr 2026
https://github.com/kefniark/kaaya
JS Library for State management and Data synchronization between Applications
data game kaaya mutation network serialization state-management
Last synced: 06 Jun 2026
https://github.com/openpeeps/zxc-nim
Bindings to the ZXC compression library, a LZ77-based compressor optimized for high decompression speed
archive compression compressor data decompression game-assets lossless lossless-compression lz77 nim nim-bindings nim-package nim-wrapper openpeeps zxc
Last synced: 07 Jun 2026
https://github.com/csadorf/pydata-ann-arbor-2018
Slides and notebooks demonstrating signac for PyData Ann Arbor Meetup 2018
data data-management jupyter signac workflow
Last synced: 04 Jun 2026
https://github.com/poncoe/passdatatoanotherfragment
Latihan Passing data Ke Fragment Lain
android android-app android-application android-studio data fragment fragments kotlin kotlin-android passing-parameters passingdataintent viewmodel
Last synced: 23 Jun 2026
https://github.com/noklam/blog_archive_fastpage
Nok's data science blog
blog data data-science machine-learning python sceince
Last synced: 01 May 2026
https://github.com/gauravkoradiya/tensorflow-data-and-deployement
This repository contains usage of data and deployment pipline in tensorflow.
data deployment machine-learning-algorithms pipline tensorflowjs
Last synced: 06 Oct 2025
https://github.com/stdlib-js/datasets-anscombes-quartet
Anscombe's quartet.
anscombe anscombes-quartet data dataset datasets javascript node node-js nodejs quartet sample statistics stats stdlib
Last synced: 13 Oct 2025
https://github.com/stefen-taime/real-time-data-pipeline-snake-game
Dynamic Snake Game: Unleashing Real-Time Streaming Analytics with Redis, Kafka, Flink, ClickHouse & Chart.js in an Online Snake Game via Flask API
chartjs clickhouse confluent-cloud data flask kafka-streams pipeline redis
Last synced: 04 May 2026
https://github.com/1sumer/sql
This repository contains SQL scripts and data for various analytical and database management tasks. The project is designed to demonstrate SQL capabilities in handling complex queries, data analysis, and database design. It includes datasets related to e-commerce and streaming services, with a focus on real-world scenarios and use cases.
analytics data data-analysis data-storage sql vscode
Last synced: 19 Jan 2026
https://github.com/yanpitangui/iteminfoconverter
Application that converts ragnarok legacy data files to iteminfo.lua
data itemdbconf iteminfo luafiles ragnarok
Last synced: 12 Oct 2025
https://github.com/jaffarabbas/library-management-system-in-java-
GUI base + Database functionality
data database datastructures-algorithms dbms gson java javafx javafx-application javafx-desktop-apps javamail library-management-system mysql sql xammp
Last synced: 05 May 2026
https://github.com/bdpedigo/neuropull
A (soon to be) lightweight Python package for accessing single-cell connectome networks with metadata.
connectome connectomes connectomics data dataset networks networks-biology
Last synced: 05 Oct 2025
https://github.com/mednour2019/devolap
OLAP Cube Dispatcher Tool
analysis-services csharp data excel excel-export kpi mdx metroframework mvvm-architecture sql wpf
Last synced: 27 Jan 2026
https://github.com/yorkulibraries/vendorpol
URLs for vendor privacy policies and terms of use.
Last synced: 15 Oct 2025
https://github.com/amethyst-php/customer
A person or an organization that pays for goods or services
amethyst amethyst-package api customer data laravel
Last synced: 11 May 2026
https://github.com/dantesc03/uberpool-case-study
This project was designed to understand the statistical effects of longer wait times on uber rides. Particularly on the user and driver experience with the Uber Pool System.
analysis data excel jupyter jupyternotebooks learn python seaborn statistics t-tests uber visualization
Last synced: 16 Apr 2026
https://github.com/bastianolea/siedu_indicadores_urbanos
Datos del Sistema de Indicadores y Estándares de Desarrollo Urbano, con datos comunales sobre temas como transporte, urbanismo, servicios básicos, calidad de vida y más.
ambiental app chile ciudad comunas data estado social
Last synced: 19 Feb 2026
https://github.com/achraf-oujjir/chatgpt-users-tweets-pipeline
🐦🔵End-to-end ChatGPT Users' Tweets Data Pipeline with Python 🐍, Hive 🐝, and Power BI 📊
bash-script cloudera data data-engineering data-vizualisation datawarehouse hdfs hive networking powerbi python sentiment-analysis sftp shell tweepy twitter-api ubuntu virtualization vmware-workstation
Last synced: 28 Feb 2026
https://github.com/ium101/files-and-folders-lister-z
Files and Folders Lister Z is a utility for listing the contents of directories on your computer. It provides both a command-line and a graphical user interface (GUI) for easy use.
application application-code brasil brazil cmd command data database databases exe filemanagement filesystem linux lowcode macos python sh tool utility windows
Last synced: 09 Oct 2025
https://github.com/sabujxi/python-scraper-and-data-analysts-admin-panel-in-django
A data scraper from texas govt site and a helping web app for managing, reviewing and editing the data
analyst data data-analysis data-entry data-scraper django django-application python python-scraper real-estate regex scraper texas
Last synced: 30 Apr 2026
https://github.com/missiontoscale/bluesky-scraper
This is a work of art that enables you to scrape data off BlueSky.
analytics bluesky bluesky-api bluesky-client data datascraper-framework datascraping scraping social-media web webscraping
Last synced: 19 Jun 2026
https://github.com/mohasarc/treeviz
The best tree data-structures visualization tool
data structures visualization visualization-tools
Last synced: 25 Apr 2026
https://github.com/oliverhennhoefer/shiny-template-interactive-table
Example of interactively adding rows / deleting rows by selecting directly in a data.table (DT) in Shiny
button data delete dt r select selection server shiny shiny-applications shiny-apps shiny-r shinyapps table ui userinterface
Last synced: 16 Apr 2026
https://github.com/bastianolea/sinim_info_municipal
Base de datos del Sistema Nacional de Información Municipal, que incluye datos comunales sobre finanzas municipales, recursos humanos, educación, salud, pensiones, organizaciones sociales, y más.
chile comunas data estado laboral politica social tiempo
Last synced: 26 Oct 2025
https://github.com/audeering/emodb
Publishes Berlin Database of Emotional Speech with audb
Last synced: 19 Oct 2025
https://github.com/kuraydev/react-native-modal-data-passing
Seamless Data Passing to React-Native-Modal
android apple data data-passing google hook ios modal react react-native react-native-modal useimperativehandle
Last synced: 18 Apr 2026