data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-06-23 00:07:41 UTC
- JSON Representation
https://github.com/unaygney/js-challenges-data-structures-and-algorithms
Repo of the challenges I'm trying to solve to understand data structures and algorithms..
algorithms-and-data-structures data javascript structure
Last synced: 29 Oct 2025
https://github.com/stdlib-js/array-bool
BooleanArray.
array binary bool boolean booleanarray data javascript mask node node-js nodejs stdlib structure typed typed-array types
Last synced: 13 May 2025
https://github.com/toviszsolt/stormflow
StromFlow - A Lightweight Data Modeling and Storage Library
data database datamanagement datastore db document jsondatabase memorydatabase model mongodb mongoose nodb nosql query schema stormflow
Last synced: 13 May 2025
https://github.com/mzazakeith/puppetmaster
Puppeteer & Crawl4AI microservice for web automation, scraping, and AI processing with Bull queues
agent ai automation bull bullmq chrome crawl4ai crawler data data-extraction extraction gemini llm llms openai playwright puppeteer web-automation
Last synced: 13 May 2025
https://github.com/kodie/migrate-acf-field-data-to-repeater
A WordPress plugin that migrates field metadata for ACF fields that have been moved inside of a repeater
acf acf-field acf-fields advance-custom-field data data-migration data-migration-tool wordpress wordpress-plugin
Last synced: 19 May 2026
https://github.com/thejeshgn/thejeshgn
data data-visualization datameet india opendata public-interest
Last synced: 15 Jan 2026
https://github.com/joisino/twinpaper
Code for "Twin Papers: A Simple Framework of Causal Inference for Citations via Coupling" (CIKM 2022)
causal-inference data research science-of-science
Last synced: 21 Mar 2025
https://github.com/imadsaddik/bodmaghdataset
BoDmagh dataset is a Supervised Fine-Tuning (SFT) dataset for the Darija language
arabic-llm arabic-nlp darija-llm darija-nlp data dataset fine-tuning llm nlp sft
Last synced: 03 Apr 2025
https://github.com/stdlib-js/datasets-suthaharan-single-hop-sensor-network
Labeled wireless sensor network data set collected from a simple single-hop wireless sensor network deployment using TelosB motes.
data dataset datasets javascript labeled machine-learning ml mote motes network node node-js nodejs outlier outliers sample sensor statistics stats stdlib
Last synced: 03 Mar 2025
https://github.com/tuananh/opentravel
✈ A collection of travel related data
Last synced: 09 Oct 2025
https://github.com/samboycoding/hungergames-data
data hunger-games javascript json
Last synced: 15 May 2026
https://github.com/philhawksworth/netlify-plugin-trello-lists
A plugin to fetch the JSON data of a public Trello board, and stash the data for each list in a JSON file before your build runs making the data available to your static site generator at build time.
api data eleventy netlify plugin trello
Last synced: 20 Jan 2026
https://github.com/denko5/sales-analysis
A complete SQL-based sales analysis project covering Africa, showcasing data cleaning, exploratory analysis, insights, and lessons learned. The project highlights sales trends, regional performances, and marketing effectiveness across multiple platforms.
africa data data-analysis data-science exploratory-data-analysis insights kenya sales sql
Last synced: 24 Jan 2026
https://github.com/yessasvini23/cisco-data-analytics-essentials_-virtual-_internship
From the CISCO Networking Academy
data dataanalysis database datascience excel relational-databases sql statistics structured-query tableau
Last synced: 17 Jul 2025
https://github.com/andrianllmm/wika-data
Philippine language resources.
data language low-resource-languages parser philippines scraper
Last synced: 17 Jul 2025
https://github.com/bacross/datamunger
python package for handling nan's and outliers
data data-frame datamunger knn nan outliers python scikit-learn
Last synced: 17 May 2026
https://github.com/hughrawlinson/github-data-scripts
Scripts to grab data about repos of interest to compare
data github-graphql github-repo-organizer graphql scripts typescript
Last synced: 09 Jul 2025
https://github.com/mustika-putri-m/-tableu-laporan-data-karyawan-growian
I am currently pursuing a data analysis certification at GROWIA, where I've learned to use tools such as Python, SQL, Google Big Query, Google Data Studio, Advanced Microsoft Excel, and Tableau. This course has enhanced my ability to analyze data using KPIs and business metrics, enabling me to solve business problems more effectively
data data-visualization tableau
Last synced: 17 Feb 2026
https://github.com/clabe45/kaz
Minimalistic local storage cli
cli data minimalistic storage utility
Last synced: 17 Jul 2025
https://github.com/simranjeet97/gpt4_applications
Applications build using OpenAI API and GPT4
ai ai-applications artificial-intelligence chatgpt data data-science gpt3 gpt4 large-language-models llm machine-learning openai openai-api project python
Last synced: 05 May 2026
https://github.com/shuklayash02/complete_data_analysis_project
A Full Data Analysis project where a sales data is ask,prepare,process,analyze,share and act through data analysis process
data data-visualization dataanalysis database datacleaning powerbi sql
Last synced: 16 Jul 2025
https://github.com/dineshpinto/geist-finance-subgraph
Subgraph for the Geist Finance protocol on the Fantom blockchain.
assemblyscript blockchain data fantom graphql typescript
Last synced: 17 May 2026
https://github.com/davedupplaw/jquery.bargraph
Moving, sliding bargraph display for jQuery
barchart bargraphs data javascript javascript-library jquery jquery-library jquery-plugin jquery-widgets realtime scrolling visualization
Last synced: 17 May 2026
https://github.com/reiiyuki/once-data-manager
Once Data Manager is temporary data management utility kit for Unity.
data manager playerprefs preference scene temporary unity
Last synced: 17 May 2026
https://github.com/wamphlett/smart-data-objects
An easy solution for capturing and validating data into usable DTO's
data dto forms php php7 validation
Last synced: 17 May 2026
https://github.com/nafisalawalidris/dr.-semmelweis-and-the-discovery-of-handwashing
Uncover the revolutionary impact of handwashing on mortality rates in healthcare. Explore the story of Dr. Semmelweis and his groundbreaking findings.
data data-analysis handwashing healthcare-analysis medical-breakthrough mortality-rates
Last synced: 13 Jul 2025
https://github.com/novecento99/nuvolino
air cloud data ikea iot pm pm25 sensor vindstyrka
Last synced: 13 Jul 2025
https://github.com/nichtich/wikidata-taxonomy-examples
Extract classifications from Wikidata
coli-conc data knowledge-organization wikidata
Last synced: 12 Jul 2025
https://github.com/elazar/pycopyql
Exports a subset of data from a relational database.
data database export relational tool utility
Last synced: 16 May 2026
https://github.com/flownrecords/flightTracker
A mobile app built to record essential flight data for post-flight review and debriefing.
Last synced: 23 Jun 2025
https://github.com/evoluteur/madeleinology
Playing with data science by taking a look at the proportions of flour, sugar, butter, and eggs in 147 Madeleine recipes (the traditional French sponge cake).
baking cake cooking cooking-recipes data data-science data-visualization dessert exploratory-analysis exploratory-data-analysis exploratory-data-visualizations food histogram longtail madeleine recipe visualization
Last synced: 23 Jun 2025
https://github.com/priyanshubiswas-tech/ev-data-analysis-dashboard
An interactive dashboard analyzing EV trends, including total vehicles, BEV vs. PHEV breakdown, model popularity, state-wise distribution, and CAFV eligibility. Visualizes key insights for data-driven decisions in the EV industry. 📊
dashboard data data-analysis data-science data-visualization tableau tableau-public
Last synced: 17 Feb 2026
https://github.com/marians/tour-tracker
Track the general classification development of the Tour De France, stage over stage
cycling data sports statistics
Last synced: 24 Jun 2025
https://github.com/nia-cloud-official/datascript
DataScript: A Hypothetical Data Scripting Language, DataScript is designed for simplifying data manipulation and analysis tasks. It serves as a scripting language tailored specifically for handling various data operations efficiently.
data data-scripting scripting-language
Last synced: 22 Jun 2025
https://github.com/dennyglee/open-covid19-public
A collaboration between SCRI and Databricks on the analysis of open COVID-19 datasets.
covid-19 data data-analytics data-engineering data-science nlp
Last synced: 22 Jun 2025
https://github.com/DevAthul-88/random-fakedata.js
A package to generate random data
data data-generator fake fake-data fake-data-generator javascipt javascript nodejs npm-package package
Last synced: 22 Jun 2025
https://github.com/shgysk8zer0/schema
A PHP implementation of schema.org structured data objects
data microdata schema seo structured-data
Last synced: 24 Jun 2025
https://github.com/dostuffthatmatters/circadian-scp-upload
Resumable, interruptible, SCP upload client for any files or directories generated day by day
checksum daily data directories files library python scp ssh synchronization time-series upload utilities
Last synced: 24 Jun 2025
https://github.com/lunastev/reflectlm
ReflectLM is a self-reflective, language-structure-only AI model that learns exclusively through interaction. It starts with zero factual knowledge but can engage in dialogue, evaluate its own responses, and remember conversations for future learning.
ai data language-model llm model open-source ts web
Last synced: 22 Jun 2025
https://github.com/harmonydata/harmony_examples
Example Jupyter notebook and R scripts using Harmony in real research problems
data data-harmonisation data-harmonization harmonisation psychology python r research
Last synced: 11 Jul 2025
https://github.com/nafisalawalidris/elfeenah
Configuration files for my GitHub profile. Welcome to my GitHub profile! I'm Nafisa Lawal Idris, a passionate Data Scientist with a strong interest for blockchain technology. Explore my GitHub portfolio to delve into the exciting world where data science and blockchain converge.
artificial-intelligence bitcoin blockchain config data data-science-portfolio data-science-projects datascience datascientist deep-learning github-config machinelearning
Last synced: 11 Sep 2025
https://github.com/legopitstop/mcextract
Extract assets and data from the Minecraft jar.
assets customtkinter data jar java minecraft pypi python pythonpackage reports serverjars userfolder
Last synced: 17 May 2026
https://github.com/alireza29675/goudi
GOUDI is a multi-layer data visualization application, inspired by mind maps and some other thinking and describing methods.
analysis data goudi visualization
Last synced: 11 Jul 2025
https://github.com/utkarshverma439/simple-sms-spam-detector
Built a Python text classification model for spam detection in SMS. Explored data, preprocessed text, utilized TF-IDF, trained a classifier, and addressed visualization challenges, yielding practical insights.
data data-science data-visualization spam-detection
Last synced: 20 Jun 2025
https://github.com/yasir13001/moonai_api
This MoonAI API service built with FastAPI that calculates and provides detailed Moon and Sun astronomical data based on user input such as date, latitude, longitude, elevation, and timezone.
ai almanac api astro-ai astronomy data data-science fastapi fastapi-api gemini groq-api hilal-detection html islamic-calenda llama llm-integration moon python
Last synced: 20 Jun 2025
https://github.com/divithraju/divith-raju-data-mining
This project focuses on customer segmentation using data mining techniques, specifically K-Means clustering, to classify customers into distinct groups based on their purchasing behaviors. The goal is to analyze customer data and segment them into clusters for targeted marketing strategies and better customer relationship management.
algorthims analytics apache business client connector data dataarchitecture database dataengineering datamining datascience hadoop k-means-clustering mysql project project-repository pyspark python3 spark
Last synced: 06 Mar 2026
https://github.com/jub0t/Eso
An application to manage all your Encryption & Decryption keys and other related tools.
data encryption encryption-decryption hacking hacking-tool keys pgp privacy private
Last synced: 10 May 2025
https://github.com/giscience/measures-rest-sparql
A SPARQL endpoint for the Measures REST OSHDB App framework.
data osm quality semantics sparql sparql-endpoints
Last synced: 24 Jun 2025
https://github.com/ayush585/fireducksblog
BLOG: Unlocking AI Efficiency: How FireDucks Revolutionizes Data Preprocessing
Last synced: 28 Apr 2026
https://github.com/stdlib-js/array-nans
Create an array filled with NaNs and having a specified length.
array complex128 complex128array complex64array data float32array float64array javascript matrix ndarray node node-js nodejs stdlib structure typed typed-array types vector
Last synced: 06 Mar 2026
https://github.com/uhstray-io/just-dashboards
Light and Easy Rust-Fullstack/WASM application to build dashboards from any data source
analytics data dioxus rust visualization
Last synced: 29 Mar 2025
https://github.com/fbraza/paris_airbnb
Analysis of Paris AirBnB data using R and Shiny
analysis data data-analysis paris-airbnb r shiny
Last synced: 21 Mar 2025
https://github.com/dbrennand/rm-content
A Python 3.7 script to remove a specific string from all files and repos (owned by the user).
content data erase eraser privacy privacy-protection privacy-tools remove remover rm-content
Last synced: 29 Mar 2025
https://github.com/hamzacham/data_set_projet-3
analysis data project rstudio visualization
Last synced: 29 Oct 2025
https://github.com/benji-lewis/archivord
An archival bot for Discord servers designed to retain as much data as possible to show future generations how we communicated.
archive data data-mining discord discord-bot typescript
Last synced: 16 May 2026
https://github.com/wklee610/de_project
[Data Engineer] Personal Toy Project For Study
Last synced: 31 Mar 2025
https://github.com/danieljdufour/fast-bin
Quickly Convert an Array of Numbers into their Minimal Binary Representations
array binarize binary bits data nbits numbers unbinarize
Last synced: 13 Apr 2025
https://github.com/whatheheckisthis/pwc_project-
Successfully completed a PwC virtual case, advancing Power BI skills to address cybersecurity and cloud architecture requirements. Developed comprehensive dashboards that effectively communicated key performance indicators (KPIs), showcasing proficiency in data visualization and deliver
case-study data data-science dataanalytics databases datavisualization powerbi virtual
Last synced: 05 Apr 2025
https://github.com/danieljdufour/easy-file-saver
Very Easily Save a File
csv data download file file-saver javascript js json save
Last synced: 21 Apr 2026
https://github.com/linas/archeo
File Recovery, Integrity and Archive Management
corruption data monitoring recovery
Last synced: 29 Mar 2025
https://github.com/d-ganchar/thedus
Thedus is a lightweight migration tool for Clickhouse
cli clickhouse data database migration migrations python
Last synced: 12 Apr 2025
https://github.com/elvis-not-presley-one/lostcassowary
LostCassowary is an Minecraft data miner that searches region files/.MCA files for data from the game, this one can search for banners, signs, biomes, blocks
data data-mining data-science dataminer minecraft nbt nbt-parser scraper
Last synced: 12 Apr 2025
https://github.com/epsoft/dataset-generator
dataset generator
data dataset dataset-generation matplotlib matplotlib-figures tensorflow tensorflow-datasets
Last synced: 18 May 2026
https://github.com/yash22222/tsf-grip-tasks
The Sparks Foundation Data Science & Business Analytics Internship Tasks
buisness-intelligence business-analytics data data-science data-science-projects data-structures grip gripjune23 internship internship-task machine-learning projects python simple-linear-regression the-sparks-foundation tsf
Last synced: 27 Apr 2026
https://github.com/sottey/shon
SHON (Structured Human-Optimized Notation) is a data serialization format designed for readability, schema support, and practical use in modern systems. Version 0.6 introduces advanced types and syntax improvements.
data golang json spec specification
Last synced: 18 May 2026
https://github.com/conduitio/conduit-site
data data-ingestion data-integration documentation
Last synced: 06 May 2025
https://github.com/stdlib-js/ndarray-base-assert-is-integer-data-type
Test if an input value is a supported ndarray integer data type.
array assert base check data dtype is javascript multidimensional ndarray node node-js nodejs stdlib test types util utilities utility utils
Last synced: 12 Apr 2025
https://github.com/alhonaut/quant-assigment
Code for quant analyz Morpho Markets and simulation reallocation process in MetaMorpho
analysis data defi quantitative-finance
Last synced: 16 May 2026
https://github.com/rifqanzalbina/libraryjs
A Library js
data data-science database datascience javascript javascript-library
Last synced: 17 Jan 2026
https://github.com/definetlynotai/test_generator
A tool to create datasets based on configurations from a csv file, This tool can be used as a skeleton for other software.
algorithim csv data development dynamic exam generator huge nirt powerful python skeleton test tools
Last synced: 21 Jul 2025
https://github.com/geo-c/oct-ckan
The Open City Toolkit (more information about the project: http://geo-c.eu)
cities collaboration data open participation transparency
Last synced: 16 May 2026
https://github.com/openfoodfacts/openfoodfacts-corrector
Ruby script to correct and enhance data on OpenFoodFacts
Last synced: 24 Apr 2026
https://github.com/lennart080/esp8266-tinyconfig
Esp8266 library to store configuration data
arduino arduino-ide arduino-library config configuration credential-storage credentials data data-config esp8266 esp8266-arduino iot platformio platformio-library
Last synced: 03 May 2026
https://github.com/rrwen/slides-covid19-geosocial-db
Presentation titled "A Real-time Geo-social Media Database for Large-scale Coronavirus Disease 2019 (COVID-19) Research" for my second research seminar at Ryerson University
covid covid-19 covid19 data database disease geo gis index media ncov-2019 ncov19 postgres postgresql presentation research seminar slides social virus
Last synced: 18 May 2026
https://github.com/ajitharunai/covid-tracker-using-python
Covid-Tracker-Using-Python
data datavisualization python python3 pythonapplications
Last synced: 25 Jun 2025
https://github.com/junkwaxdata/cardlists
Sports Card set lists in easily consumable JSON Format for databases, apps, websites, and more!
baseball baseball-cards baseball-data bowman data dataset datasets donruss fleer json json-schema panini topps upper-deck
Last synced: 13 Mar 2025
https://github.com/dr-saad-la/r-distilled
R Programming Language distilled
data data-analysis learning programming-language r rlanguage rprogramming statistical-analysis
Last synced: 18 May 2026
https://github.com/randomfractals/chicago-transport
Exploratory data analysis of public Chicago transportation datasets.
chicago data data-tools duckdb sql transportation
Last synced: 01 May 2026
https://github.com/jigyasag18/sonar-rock-vs-mine-prediction-ml-project
This repository contains a machine learning project that classifies SONAR reading data to distinguish between rocks and mines. It implements various classification models,evaluates their performance,and features a user-friendly web application deployed with Streamlit for real-time predictions. The project is aimed to help in safe marine operations.
classification data dataset machine-learning machine-learning-algorithms machinelearning machinelearning-python machinelearningmodel machinelearningproject machinelearningprojects modelevaluation modeltraining prediction-model streamlit streamlit-webapp
Last synced: 18 May 2026
https://github.com/jvrck/australianpayphones
Get Australian payphone data in GeoJSON format.
australia data geojson geojson-data scraper
Last synced: 04 Apr 2025
https://github.com/suyashkumar/deeplesion-gcp-loader
Get the DeepLesion CT Image data set into a GCP Storage Bucket
bucket data data-loader data-loading data-science deep-learning deep-lesion deeplesion gcp gcp-bucket loader storage
Last synced: 04 Apr 2025
https://github.com/amyflo/cs448b
Exploring r/LoveLetters
d3-visualization d3js data react reactjs visualization
Last synced: 18 May 2026
https://github.com/anuveyatsu/cloudflare-data-fabric
Cloudflare Data Fabric: Use Cloudflare's global infrastructure to build a flexible, resilient framework for data solutions.
cloudflare data data-lake fabric lakehouse mesh
Last synced: 12 Sep 2025