data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-07-01 00:07:35 UTC
- JSON Representation
https://github.com/josechirif/reviews-and-satisfaction-analysis-of-airbnb-brazil-and-mexico-from-june-2010-to-february-2021
This project analyzes the reviews and satisfaction of customers who used AirBnB services. It also studies if there is a relationship between another variables.
data data-analysis data-visualization powerbi sql-server
Last synced: 25 Feb 2026
https://github.com/msampathkumar/fakereceiptimagegenerator
Receipt Generator using PIL, Python
data fake generator image python receipt synthetic-data
Last synced: 06 Sep 2025
https://github.com/metapsy-project/data-gambling-psyctr
Database of psychological interventions for problem gambling and gambling disorder.
Last synced: 02 Apr 2026
https://github.com/flowforfrank/d3-treemap
🌲 Treemap created with D3.js
d3 d3js data data-visualization javascript svg treemap tutorial webtips
Last synced: 09 Apr 2025
https://github.com/exsokamabay/encoderdecoder
Encoder Decoder Your Data
data decoder decryption encoder encoder-decoder encryption security-tools
Last synced: 14 Jan 2026
https://github.com/anicolaspp/mapr-data-gen
Data generator for MapR Data Platform
data mapr mapr-db mapr-es mapr-streams maprdb parquet scala spark
Last synced: 29 Apr 2026
https://github.com/flexiodata/functions-covid-19-feed
Import Covid-19 data from Johns Hopkins University into Microsoft Excel and Google Sheets.
covid-19 data excel google-sheets import johns-hopkins-csse johns-hopkins-university spreadsheet
Last synced: 10 Mar 2025
https://github.com/ggreen/data-orchestration-with-scdf-showcase
data-orchestration-with-scdf-showcase
data orchestration scdf spring
Last synced: 14 Jan 2026
https://github.com/tkd-alex/php-dmi-api
PHP-DMI-API is the api of DMI (CT) website developed in PHP.
bot data dmi json php unict university webscraping
Last synced: 20 Feb 2026
https://github.com/aramshiva/babies
👶 A parser for every name listed on a Social Security Card between 1880-2023
babies data datagov db graphs mysql names social-security social-security-data sql statistics stats
Last synced: 22 Aug 2025
https://github.com/hsyntes/data-modeling
A Backend application that provides Advanced Data Modeling and Schema Design with MongoDB, mongoose in Node.js & Express
data database datamodeling express modeling mongodb mongoose nodejs schema
Last synced: 10 Apr 2026
https://github.com/mrlynn/30-min-data-web-form
30 Minutes to a Data Enabled Web Form with MongoDB
beginner data html html-form javascript mongodb mongodb-atlas mongodb-database web webforms
Last synced: 15 Apr 2026
https://github.com/smolsoftboi/php-faker-providers
Faker providers that generate fake data for you.
data faker faker-generator faker-provider generator php
Last synced: 22 Apr 2025
https://github.com/hadro/brewery-guides
The data for guides to breweries across the United States from 1896 to 1918
brewers brewery-guides brewing brewing-history data dataset digital-collections digital-humanities hocr nypl open-data
Last synced: 16 Mar 2026
https://github.com/arda-guler/binsonograph
Encode any binary file into an audio file. Sister project of https://github.com/arda-guler/binGallery
audio converter data encoder proof-of-concept sonification sound
Last synced: 21 Jun 2025
https://github.com/joamag/pandas
Loads of pandas data from China with awesome data
data data-analysis jupyter notebook pandas
Last synced: 25 Apr 2026
https://github.com/robertoentringer/lod-opendata
A NPM package for get data of Lëtzebuerger Online Dictionnaire (LOD) from data.public.lu.
api data dictionary json-api lod-lu luxembourg luxemburgish open-data package parse public public-api
Last synced: 05 Sep 2025
https://github.com/rousan/bytevault
A command line application that stores sensitive data as key-value pair securely in local machine
application byte c command-line data encrypts key-value sensitive vault
Last synced: 16 Mar 2025
https://github.com/sneels/parkds
Connect all your Data Sources via 1 process (Cross-Domain + Single-Domain)
cross-domain data database datasource datasources javascript source
Last synced: 24 Feb 2026
https://github.com/qeeqbox/data-compliance
Data compliance is the process of following various regulations and standards to ensure that sensitive digital assets (data) are guarded against loss, theft, and misuse
compliance data data-compliance infosecsimplified qeeqbox
Last synced: 19 Mar 2026
https://github.com/robertmyles/riscobrasil
An R package to download 'Brazil Risk' data :chart_with_upwards_trend:
Last synced: 08 Apr 2025
https://github.com/1sumer/sql
This repository contains SQL scripts and data for various analytical and database management tasks. The project is designed to demonstrate SQL capabilities in handling complex queries, data analysis, and database design. It includes datasets related to e-commerce and streaming services, with a focus on real-world scenarios and use cases.
analytics data data-analysis data-storage sql vscode
Last synced: 19 Jan 2026
https://github.com/quetz-al/quetzal-client
Python client for the Quetzal API
client data data-science openapi-client openapi3 python quetzal
Last synced: 28 Jul 2025
https://github.com/frefrik/covid19norge-data
🦠 COVID-19 Datasets for Norway
covid covid-19 covid19 covid19-data csv data datasets norge norway norwegian smittestopp vaccine
Last synced: 09 Apr 2026
https://github.com/ctechhindi/auto-fill-form-data
AUTO FILL AND AUTOCOMPLETE USER DATA WITH KEY NAME
autocomplete chrome-extension data extension
Last synced: 17 Apr 2026
https://github.com/programmer-rd-ai/open-images-v6
Open-Images-V6
ai data dataset dl images ml object-detection open open-images programming python v6
Last synced: 03 Aug 2025
https://github.com/steelcake/cherry-pipelines
A collection of pipelines built with cherry
blockchain clickhouse data pipeline pyhton
Last synced: 09 Mar 2026
https://github.com/yash22222/data-analysis-with-python
This repository provides a practical introduction to data acquisition and analysis using Pandas. It covers loading datasets, exploring data, manipulating data, and gaining insights through statistical summaries. Ideal for beginners, it offers code examples and explanations to enhance your data manipulation skills using Pandas for Python.
binning data data-acquisition data-analysis data-binning data-cleaning data-formatting data-integration data-normalization data-preprocessing data-science data-transformation data-wrangling dataframe description numpy pandas pandas-dataframe python python3
Last synced: 09 Apr 2026
https://github.com/divithraju/divith-raju-openmetadata
Open Standard for Metadata. A Single place to Discover, Collaborate and Get your data right.
automation bigdata bigdataanalytics data data-structures dataengineering datascience hacktoberfest2022 metadata metadata-extraction
Last synced: 20 Feb 2026
https://github.com/instaclustr/cassandra-parquet-transformer
Transform SSTables from Apache Cassandra to Parquet or Avro files, locally or remotely via Apache Cassandra Sidecar
analytics apache apache-cassandra avro big cassandra data parquet spark sstable transformation
Last synced: 29 Aug 2025
https://github.com/slipke/eurlex-model-go
This projects implements the EUR-Lex XML data model in Golang. For more information see README.md
data datamodel eur-lex eurlex webservice
Last synced: 09 Mar 2026
https://github.com/rikvdh/zabuffer
Zero-Allocation buffer handling in C
buffer c clib data embedded memory string zero-allocation
Last synced: 03 Mar 2025
https://github.com/mawburn/across-a-thousand-dead-worlds-data
Across a Thousand Dead Worlds Data
Last synced: 21 Apr 2026
https://github.com/zalweny26/tools
Just a bunch of tools made in TypeScript.
algorithms data dimensionality distances helpers reduction sortings structures tools utils
Last synced: 03 Feb 2026
https://github.com/jimut123/scrapers
All Scrapers that I'll build
bs4 data python3 real-time-visualisations scrapers scrapy wget
Last synced: 16 Jan 2026
https://github.com/wamphlett/input-collection
A smarter and stricter way to capture and validate request data
Last synced: 27 May 2026
https://github.com/stdlib-js/array-int32
Int32Array.
array data int int32 int32array integer javascript long node node-js nodejs signed stdlib structure typed typed-array types
Last synced: 27 May 2026
https://github.com/lmantw/binarion
A simple binary format for storing JavaScript objects.
binary data decoding encoding format javascript
Last synced: 02 Sep 2025
https://github.com/sdhutchins/jxn-open-data-api
Access Jackson, MS open government data using a python API wrapper.
api data jackson jxn mississippi open-gov
Last synced: 08 Apr 2025
https://github.com/parimala24-ds/datascientistmlinterviewprep24
DATASCIENTST ML INTERVIEW PREP24
data decisiontree interviewquestions linear-regression logistic machine-learning matplotlib numpy pandas python seaborn sklearn
Last synced: 12 Apr 2025
https://github.com/leeper/mcode
Functions to merge and recode across multiple variables
data data-transformation r recode recoding
Last synced: 16 May 2025
https://github.com/datafold/vhol-demo
Get hands-on examples of dbt + Datafold CI/CD workflows
data data-engineering datafold dbt diff
Last synced: 28 Dec 2025
https://github.com/vikashpr/18cse301j_ra2011003010737
This website tells the story of a nation's GDP through data visualization, providing insights on global GDP, state-wise GDP, sector-wise GDP, and the vision for India's economy. It includes data sets and sources for further reference.
css3 d3-visualization d3js data data-vizualisation gephi-visualizations html5 indian-economy indian-gdp information-visualization js python-word-cloud python3 storytelling tableau tableau-public threejs wordcloud-visualization
Last synced: 03 May 2026
https://github.com/purarue/listenbrainz_export
Export your scrobbling history from ListenBrainz
data data-export music scrobbling
Last synced: 24 Jan 2026
https://github.com/mujadded/facebook_scrapper
The fcebook scrapper gem that dont need the api
data data-mining facebook ruby-gem scrapper selenium-webdriver
Last synced: 28 Oct 2025
https://github.com/j1sk1ss/dateapppc.exmpl
Простое нативное приложение для Windows с демонстрацией ООП и SQL баз данных на примере приложения для знакомств.
data oop-principles parsing pgadmin4 sql wpf
Last synced: 11 Apr 2026
https://github.com/junkwaxhero/cardlists
Sports Card set lists in easily consumable JSON Format for databases, apps, websites, and more!
baseball baseball-cards baseball-data bowman data dataset datasets donruss fleer json json-schema panini topps upper-deck
Last synced: 24 Apr 2025
https://github.com/stdlib-js/array-ones
Create an array filled with ones and having a specified length.
array data float32array float64array int16array int32array javascript matrix ndarray node node-js nodejs stdlib structure typed typed-array types uint32array vector
Last synced: 09 Apr 2025
https://github.com/perezrd5/publicdataprojects
These are public database and data analysis projects from the portfolio of Doug Perez
data data-model data-modeling data-models data-science data-structure data-structures database microsoft-sql-server mysql olap olap-cube oltp postgresql ssas ssis ssrs t-sql
Last synced: 13 Apr 2026
https://github.com/ozanarkancan/sailx
This repo contains the code for generating artificial navigational instruction following data.
data grounded-language-learning
Last synced: 08 Jan 2026
https://github.com/agnosticeng/agx
Query and explore local and remote data with Clickhouse
clickhouse d3 data rust svelte
Last synced: 26 Oct 2025
https://github.com/caelean/twittermap
Map of twitter user's influence as defined on by influencetracker
data google-maps maps sparql twitter visualization
Last synced: 14 Jun 2025
https://github.com/thomas-nyanumba/r-programming-air-pollution_disease-project
Personal R Programming Project
aggregate-functions boxplot-visualization data dpylr ggplot2 leftjoin linear-regression patchwork powerquery r readxl scatter-plot tidyr visualization
Last synced: 25 Mar 2025
https://github.com/gauravkoradiya/tensorflow-data-and-deployement
This repository contains usage of data and deployment pipline in tensorflow.
data deployment machine-learning-algorithms pipline tensorflowjs
Last synced: 06 Oct 2025
https://github.com/henrylin03/video-games
Using Python and SQL to clean, analyse and visualise video games' data from Metacritic. Includes scraping using BeautifulSoup.
analysis beautifulsoup beautifulsoup4 data data-analysis data-science eda jupyter-notebook pandas python sql sqlite3 video-game video-games
Last synced: 14 Apr 2026
https://github.com/woo071002/parcel-management-system
A Parcel Delivery Management System streamlining deliveries with features for admin, users, and delivery personnel, including real-time tracking, delivery requests, and personalized dashboards.
cors csharp data dotenv html-css iconfont jkuat land-information-system mongodb python react-router-dom sass tech-expo xaml
Last synced: 08 Oct 2025
https://github.com/utrechtuniversity/dataprivacyproject
This is the repository underlying the landing page for the Data Privacy Project @UtrechtUniversity, the Netherlands.
data gdpr open-science privacy rdm research research-data-management utrecht-university
Last synced: 10 Oct 2025
https://github.com/automators-com/datamaker-js
The official Node.js / Typescript library for the DataMaker API
data javascript nodejs typescript
Last synced: 11 Oct 2025
https://github.com/yanpitangui/iteminfoconverter
Application that converts ragnarok legacy data files to iteminfo.lua
data itemdbconf iteminfo luafiles ragnarok
Last synced: 12 Oct 2025
https://github.com/eby8zevin/android-pos4122020
The Next Project . . .
android android-app android-application android-database android-studio androidstudio create data database database-sqlite delete point-of-sale pos read search sqlite update
Last synced: 13 Oct 2025
https://github.com/datahub-local/datahub-local
DataHub.local is a powerful data platform designed for edge devices, enabling seamless analytics and insights at home
data data-engineering devops kubernetes raspberrypi
Last synced: 21 Jan 2026
https://github.com/stdlib-js/datasets-anscombes-quartet
Anscombe's quartet.
anscombe anscombes-quartet data dataset datasets javascript node node-js nodejs quartet sample statistics stats stdlib
Last synced: 13 Oct 2025
https://github.com/skywarth/fenrir-wolfpack-simulator
Simulating wolfpack behaviours and future of the pack in an environment using Javascript and data trees.
data data-structures javascript max-heap simulation simulations wolfpack
Last synced: 14 Oct 2025
https://github.com/davemlz/master_of_datascience
Master of Data Science repository
data data-mining data-science database r rmd sql sqlite statistics
Last synced: 14 Apr 2026
https://github.com/cerema/groum
Utilitaire en ligne de commande pour convertir les données d'arrêtés de circulation
Last synced: 06 Feb 2026
https://github.com/nononoexe/setariaviridis
🌾 Field-collected data of green foxtail
data data-science dataset rpackage
Last synced: 27 Feb 2026
https://github.com/bkamapantula/india-pc-nfhs4
Parliamentary constituency factsheet for indicators of nutrition, health, and development in India using NFHS4 data.
data government health india nfhs nfhs4
Last synced: 19 Mar 2026
https://github.com/chaitanyac22/hr_policy_query_resolution_with_retrieval_augmented_generation_rag
This repository contains an HR Policy Query Resolution system using Retrieval-Augmented Generation (RAG). It leverages a 4-bit quantized Mistral-7B-Instruct-v0.2 LLM and JP Morgan Chase’s publicly available Code of Conduct documents to generate accurate, contextually relevant responses for HR policy queries.
artificial-intelligence data hr large-language-models llm mistral-7b nlp pipeline prompt-engineering quantization rag retrieval-augmented-generation
Last synced: 12 Feb 2026
https://github.com/achraf-oujjir/chatgpt-users-tweets-pipeline
🐦🔵End-to-end ChatGPT Users' Tweets Data Pipeline with Python 🐍, Hive 🐝, and Power BI 📊
bash-script cloudera data data-engineering data-vizualisation datawarehouse hdfs hive networking powerbi python sentiment-analysis sftp shell tweepy twitter-api ubuntu virtualization vmware-workstation
Last synced: 28 Feb 2026
https://github.com/colour-science/colour-demosaicing-tests-datasets
Colour - Demosaicing - Tests Datasets
color color-science color-space color-spaces colorspace colorspaces colour colour-science colour-space colour-spaces colourspace colourspaces data dataset datasets de-mosaicing debayering demosaicing demosaicking raw
Last synced: 19 Mar 2026
https://github.com/mihasm/arso-scraper
Unofficial Python CLI tool for downloading automated sensor weather data from the Slovenian Environment Agency.
api arso cli data historical-data meteorological python slovenia weather
Last synced: 14 Feb 2026
https://github.com/oliverhennhoefer/shiny-template-interactive-table
Example of interactively adding rows / deleting rows by selecting directly in a data.table (DT) in Shiny
button data delete dt r select selection server shiny shiny-applications shiny-apps shiny-r shinyapps table ui userinterface
Last synced: 16 Apr 2026
https://github.com/snandasena/disaster-response-pipeline
Disaster Response Pipeline | Data Engineering
data data-engineering-pipeline etl flask machine-learning nlp nlp-pipeline
Last synced: 24 Apr 2026
https://github.com/sabujxi/python-scraper-and-data-analysts-admin-panel-in-django
A data scraper from texas govt site and a helping web app for managing, reviewing and editing the data
analyst data data-analysis data-entry data-scraper django django-application python python-scraper real-estate regex scraper texas
Last synced: 30 Apr 2026
https://github.com/noklam/blog_archive_fastpage
Nok's data science blog
blog data data-science machine-learning python sceince
Last synced: 01 May 2026
https://github.com/assem-elqersh/creativa-data-science-bootcamp
Jupyter notebooks from the Creativa Data Science Bootcamp, covering key data science concepts and practices across multiple sessions, from data preprocessing to model building and time series analysis.
data data-science eda exploratory-data-analysis machine-learning pandas time-series-analysis xgboost xgboost-classifier
Last synced: 03 May 2026
https://github.com/stefen-taime/real-time-data-pipeline-snake-game
Dynamic Snake Game: Unleashing Real-Time Streaming Analytics with Redis, Kafka, Flink, ClickHouse & Chart.js in an Online Snake Game via Flask API
chartjs clickhouse confluent-cloud data flask kafka-streams pipeline redis
Last synced: 04 May 2026