data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum
- Last updated: 2026-06-30 00:07:50 UTC
https://github.com/sungchun12/sqlmesh-demos
SQLMesh project for live demos - provides instructions so you can run this on your own!
data data-engineering sql sqlmesh
Last synced: 24 Oct 2025
https://github.com/arda-guler/binsonograph
Encode any binary file into an audio file. Sister project of https://github.com/arda-guler/binGallery
audio converter data encoder proof-of-concept sonification sound
Last synced: 21 Jun 2025
https://github.com/openintrostat/airports
📦 R package for data on airports 🛫
data openintro rstats rstats-package
Last synced: 22 Feb 2026
https://github.com/devtin/duckfficer
Zero-dependency lightweight library for modeling, validating, and sanitizing data 🦆 🐵 👁
coercion data duck-typing json parsing schema validation
Last synced: 01 Mar 2025
https://github.com/mrlynn/30-min-data-web-form
30 Minutes to a Data Enabled Web Form with MongoDB
beginner data html html-form javascript mongodb mongodb-atlas mongodb-database web webforms
Last synced: 15 Apr 2026
https://github.com/ravi-prakash1907/caterapp
A Quick & Secured Data Sharing Application!
application cater caterapp data data-sharing pip python
Last synced: 06 Sep 2025
https://github.com/daiangm/joker-validator
JSON data validator for NodeJS
customized data data-validation javascript json json-schema nodejs schema validador validation validator
Last synced: 04 Jul 2025
https://github.com/exsokamabay/encoderdecoder
Encode and decode your data
data decoder decryption encoder encoder-decoder encryption security-tools
Last synced: 14 Jan 2026
https://github.com/baked-libs/bstats-discord-integration
A simple program which queries https://bstats.org/ and presents this data in a highly customizable discord webhook
bstats data discord discord-webhook javascript minecraft notifications paper plugin spigot statistic stats typescript webhook
Last synced: 28 Apr 2025
https://github.com/zkan/hello-airflow
Hello, Airflow!
airflow apache-airflow data data-pipeline pipeline python
Last synced: 19 Aug 2025
https://github.com/msampathkumar/fakereceiptimagegenerator
Receipt Generator using PIL, Python
data fake generator image python receipt synthetic-data
Last synced: 06 Sep 2025
https://github.com/eosdis-nasa/earthdata-pub
core code repository for Earthdata Pub (EDPub)
data earthdata edpub publication
Last synced: 17 Jan 2026
https://github.com/mews-labs/crep
This simple module provides functions for handling tabular data that have a continuous axis. In some situations this index can represent time, but the tool was originally developed to describe railways.
data pandas pandas-dataframe python python3 rails-application time-series
Last synced: 23 Feb 2026
https://github.com/onaio/gisida-react
React Dashboard library for Gisida.
dashboard data gisida map react visualization
Last synced: 28 Apr 2025
https://github.com/astrid-project/lcp
In each local agent, the control plane is responsible for programmability, i.e., changing the behaviour of the data plane at run-time.
agent beats control data ebpf elasticsearch log logstash management programmability security
Last synced: 06 Apr 2025
https://github.com/sureshpandiyan1/smartweather
detect real-time weather data of any country
data no-api python smart software users weather weather-app weather-data weather-information windows windows-app
Last synced: 12 May 2026
https://github.com/abdussattar-70/oop-school-library
The OOP-School-Library project demonstrates the principles of data abstraction, inheritance, encapsulation, and polymorphism, which are fundamental concepts in object-oriented programming (OOP).
abstraction data encapsulation inheritance polymorphism rubocop-configuration ruby
Last synced: 29 Mar 2025
https://github.com/biglocalnews/upload-files
Upload comma-delimited files to biglocalnews.org in your GitHub Action
action actions archiving csv data data-journalism github-actions journalism news
Last synced: 27 Apr 2026
https://github.com/nalgeon/nalgeon.github.io
Everything about SQLite, Python, open data and awesome software
Last synced: 14 Jul 2025
https://github.com/psfried/dgen
Generate evil test data
csv data data-generation data-generator language testing-tools
Last synced: 18 Mar 2025
https://github.com/huseyincenik/data_science
Data Science materials
data data-science data-structures data-visualization dataanalysis dataengineering datapreparation dataprocessing datascience dataset time-series time-series-analysis timeline timeseries timeseries-analysis timeseriesforecasting
Last synced: 25 Jul 2025
https://github.com/stefanbohacek/fediverse-explorations
Exploring the fediverse through data, studies, and polls.
data data-visualization fediverse mastodon social-media
Last synced: 12 Apr 2025
https://github.com/muneeb1030/finetune-tiny-llama
Fine-tuning the Tiny Llama model to mimic my professor's writing style using the Llama Factory. The project involves data collection, preprocessing, preparation, fine-tuning, and evaluation.
data data-preparation data-preprocessing finetuning llama-factory llm pymupdf selenium-python spacy tinyllama webscraping
Last synced: 08 Apr 2026
https://github.com/edoardottt/computerphile-pong
Pong game with a little bit of Data Science. Computerphile.
2d-game computerphile csv data data-science datascience game game-2d game-development games pandas pong pong-game pygame pygame-library python python-3 python-library python3
Last synced: 30 Oct 2025
https://github.com/FCC/contours-api-node
Enterprise Contours Node API
api contours data data-visualization geospatial gis map
Last synced: 27 Jul 2025
https://github.com/divinemonk/dataentrywebapp
Data Entry Web App is a lightweight web application built with Flask, a Python web framework, designed to streamline data entry and management processes. It provides a user-friendly interface for efficient data entry, viewing, editing, and deletion.
data data-entry flask flask-application production production-server web webapp
Last synced: 13 Apr 2025
https://github.com/v4ss3ur/hierarchicaldatagrid.wpf
A WPF control that mixes DataGrid and TreeView functionalities, allowing for hierarchical, recursive data display with expandable nested rows. Ideal for complex data structures in an easy-to-use, MVVM-friendly tabular format.
controls data datagrid hierarchical hierarchical-data mvvm nested nested-objects nested-structures treeview wpf xaml
Last synced: 13 May 2025
https://github.com/nickmcintyre/processing-netcdf
Simple access to scientific datasets with Processing
Last synced: 11 Apr 2025
https://github.com/mbrn/dbmixer
A project that masks database columns using several algorithms
Last synced: 19 Jul 2025
https://github.com/simranjeet97/docker_python_flask-dash_app
Docker Image and Container Build for Python Flask/Dash App
data data-science data-structures data-visualization docker docker-compose docker-container docker-image python python-script uwsgi-nginx
Last synced: 07 May 2026
https://github.com/rpidanny/streamline.js
A JavaScript class that reads and processes a stream line-by-line in order.
big-data data data-processing file-stream javascript stream streams typescript
Last synced: 08 Sep 2025
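The line-by-line, in-order stream processing described in the entry above can be illustrated with a minimal Python sketch. It is not the streamline.js API: the names `iter_lines` and `process_in_order` and the tiny `chunk_size` are assumptions made up for this illustration, and the handler is synchronous for simplicity.

```python
import io


def iter_lines(stream, chunk_size=4):
    """Yield complete lines from a text stream that is read in fixed-size chunks."""
    buffer = ""
    while True:
        chunk = stream.read(chunk_size)
        if not chunk:
            break
        buffer += chunk
        # Everything before the last newline is complete; keep the tail buffered.
        *lines, buffer = buffer.split("\n")
        yield from lines
    if buffer:
        yield buffer  # final line that had no trailing newline


def process_in_order(stream, handler):
    """Apply `handler` (hypothetical callback) to each line sequentially."""
    return [handler(line) for line in iter_lines(stream)]


results = process_in_order(io.StringIO("alpha\nbeta\ngamma"), str.upper)
print(results)
```

Buffering the unfinished tail of each chunk is what allows lines that straddle chunk boundaries to be emitted whole and in their original order.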
https://github.com/siongui/gopaliwordvfs
Serve JSON data of Pali words, embedded in Go code
data go golang pali vfs virtual-file-system virtualfilesystem
Last synced: 04 Apr 2025
https://github.com/urunov/algorithms
algorithm, data structure, dynamic array, dynamic programming
algorithms algorithms-and-data-structures data dynamic-programming
Last synced: 20 Mar 2025
https://github.com/mbanq/dupe
Fake banking data for your front- or backend
backend data datagenerator fake faker frontend javascript nodejs npm npm-package
Last synced: 13 May 2025
https://github.com/abistarun/resseract-lite
A Data Analytics Tool with flexible architecture to visualize and analyse data
analystics charts custom-dashboards data data-science data-visualization diverse-data-sources expression-language geochart geocharts interactive-data-exploration light-weight lightweight monitoring resseract resseract-lite slice tool visualisations visualization
Last synced: 14 May 2025
https://github.com/gaurav-van/data_science__machine_learning-projects
Compilation of Data Science and Machine Learning Projects Completed During My Bachelor's Degree
classification data data-science data-science-projects data-visualization deep-learning deep-learning-models deep-learning-projects deep-neural-networks inceptionv3 machine-learning machine-learning-projects python regression streamlit
Last synced: 23 Jun 2025
https://github.com/alexgustafsson/systembolaget-api-data
An up to date data mirror of Systembolaget's APIs
data data-science sweden systembolaget
Last synced: 28 Oct 2025
https://github.com/legopitstop/datapacks
All legopitstop's datapacks in one place.
assets data datapack hacktoberfest minecraft mods modtoberfest resroucepack vanilla
Last synced: 03 Jan 2026
https://github.com/gianlucatruda/project_sleep
A Quantified Self project in which I use ±40 nights of data to determine what helps and hinders my sleep.
data experiment matplotlib python quantified science self sleep visualization
Last synced: 03 Apr 2025
https://github.com/aa-sikkkk/twitterdatamining
A Simple Script to mine data from X/Twitter
Last synced: 24 Jan 2026
https://github.com/dicook/tutorial_make_better_data_plots
Materials for a workshop in June 2025
data data-analysis data-science data-visualization r statistical-graphics statistics
Last synced: 25 Jun 2025
https://github.com/guiferviz/tuberia
Data engineering meets software engineering
data data-engineering expectations pipeline python spark
Last synced: 08 Mar 2026
https://github.com/umitkaanusta/smol-elt
a smol elt (not etl) pipeline for smol tasks
analytics automation aws aws-sns data data-engineering data-pipeline elt etl google-sheets pandas pipeline python spreadsheet web-scraping
Last synced: 10 May 2026
https://github.com/equinor/data-marketplace
Easily find and check out data products
Last synced: 01 May 2025
https://github.com/kawai-senpai/potatodb
PotatoDB is a lightweight, file-based NoSQL database for Python projects, designed for easy setup and use in small-scale applications. Ideal for developers seeking simple data persistence without the complexity of traditional databases.
data database easy-to-use file-based json key-value lightweight nosql nosql-database persistence python simple
Last synced: 23 Oct 2025
https://github.com/nix1707/webscrapper-browserextension
Scraper Master is a Chrome extension for effortless web data extraction. Built with React, TypeScript, and the Chrome Scripting API, it ensures efficient, high-quality, and seamless scraping. Utilizing HTML and CSS, ScrapeEase offers a clean, responsive design. Simplify your data collection with Scraper Master.
chrome-extension chrome-extensions css data frontend html html-parser modern parser parsing react scraper scraping typescript ui validation webparser webparsing webscraping
Last synced: 21 Jun 2025
https://github.com/intercloud/gotsgen
Golang Time Series Data Generator
data generator golang library timeseries
Last synced: 20 Jun 2025
https://github.com/deveel/deveel.repository
Implementations of the repository pattern for .NET to support domain-driven modeling
clean-architechture csharp data dotnet-core dotnetcore efcore entity entity-framework entity-manager layered-architecture mongodb repository repository-manager repository-pattern
Last synced: 22 Apr 2025
https://github.com/rclement/romain-clement.net
Freelance Software Engineer & Trainer
data freelancer machine-learning mkdocs mkdocs-material python
Last synced: 21 Mar 2025
https://github.com/monfireboose/monfireboose
A lightweight JavaScript library that provides a high level and model based API for interacting with Firebase.
api data database firebase firestore high-level-api interact javascript library model storage
Last synced: 18 Feb 2026
https://github.com/fabriquebeweb/dao
The 'Data Access Object' for dummies!
Last synced: 18 Feb 2026
https://github.com/daninet/audio-annotator
Simple app for annotating audio segments
ai annotate annotation artificial audio data intelligence label labeling labeling-tool learning machine ml science wav
Last synced: 04 Apr 2025
https://github.com/aydinnyunus/dictionary
Dictionary
data data-analysis data-science data-structures data-visualization database dataset dictionaries dictionary dictionary-learning python python-2 python-3 python-3-6 python-library python-script python2 python27 python3 python36
Last synced: 09 May 2025
https://github.com/datawookie/data-diaspora
Various datasets used in tutorials and workshops.
Last synced: 20 Mar 2025
https://github.com/codiepp/elykseer-base
cryptographic data archive; written in F#; envisaged to stay another 10 years
archive cli cryptography data distributed-storage dotnet fsharp longterm-storage
Last synced: 19 May 2026
https://github.com/hoanganhngo610/introduction-r-packages
This repository is an introduction to the most essential packages in R programming, intended to support any requirement and customised workflow
Last synced: 28 Jun 2025
https://github.com/rn0x/app.altaqwaa.org
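A comprehensive Islamic website containing adhkar (remembrances) and the Holy Quran recited by a large number of reciters, in addition to tafsir (exegesis) and Hisn al-Muslim (Fortress of the Muslim). The site also includes an electronic tasbih counter, Islamic radio stations, and prayer times.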
app broadcasts data islam muslim prayer quran tafsir website website-design
Last synced: 07 Aug 2025
https://github.com/ferhatgec/kedi
Fegeya Kedi, Experimental Data Interface.
cpp cpp17 data data-interchange data-interface fegeya gnu json library linux xml
Last synced: 14 Apr 2025
https://github.com/randomgamingdev/mc_block_color_mapper
Python scripts & libraries for generating and mapping the average colors for each of the Minecraft blocks
average average-calculator cli data data-generator documented-api extract extract-data extractor fast minecraft python3 simple small texture texture-pack textures
Last synced: 22 May 2026
https://github.com/louis030195/ega
ai artificial-intelligence data data-science data-visualization
Last synced: 07 Mar 2026
https://github.com/ikstream/dns-handler
Data collection server for the dalec user collection system
collection dalec data data-collection dns dns-server python python3
Last synced: 13 Mar 2025
https://github.com/deveripon/assignment-6-assets
These assets are only for Reactive Accelerator Batch 2 - Assignment 6
Last synced: 30 Apr 2025
https://github.com/charconstpointer/markovbot
PoC markov chain sentence generator, powered by discord for data gathering
bot chain collection data discord markov parsing
Last synced: 16 May 2026
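As a rough illustration of the Markov chain sentence generation mentioned in the entry above, here is a minimal first-order, word-level sketch in Python. It is not taken from the markovbot repository; the function names `build_chain` and `generate` and the sample messages are assumptions for illustration only.

```python
import random
from collections import defaultdict


def build_chain(messages):
    """Map each word to the list of words that followed it in the input."""
    chain = defaultdict(list)
    for message in messages:
        words = message.split()
        for current, following in zip(words, words[1:]):
            chain[current].append(following)
    return chain


def generate(chain, start, max_words=10, rng=random):
    """Walk the chain from `start`, picking a random successor at each step."""
    words = [start]
    # Stop when the length limit is hit or the last word has no known successor.
    while len(words) < max_words and chain.get(words[-1]):
        words.append(rng.choice(chain[words[-1]]))
    return " ".join(words)


# Hypothetical sample data standing in for collected chat messages.
messages = ["the cat sat", "the cat ran", "a dog sat"]
chain = build_chain(messages)
print(generate(chain, "the", rng=random.Random(0)))
```

Storing repeated successors in a plain list means that more frequent word pairs are chosen proportionally more often, without keeping explicit probabilities.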
https://github.com/cttynul/elsoftware
⚽ Win at Fantacalcio using pandas libraries, making everyone believe you are using machine learning
data data-science fantacalcio machine-learning pandas
Last synced: 30 Jun 2026
https://github.com/dsietz/daas
Overview of the Data as a Service (DaaS) architecture
archconf architecture daas data message-broker microservice nfjs patterns rust-lang uberconf
Last synced: 21 May 2026
https://github.com/dyazincahya/kbbi-sql-database
Kamus Besar Bahasa Indonesia (KBBI) SQL Database | 115,978 words in total | Available for MySQL, SQLite, and PostgreSQL. Also available in CSV, JSON, Markdown, PHP Array, XML, DbUnit, and HTML data formats
bahasa-indonesia csv data database dictionary indonesian-language json kamus kamus-besar-bahasa-indonesia kamus-indonesia kbbi kbbi-api kbbi-sql mysql php-array postgresql sql sqlite xml
Last synced: 19 Aug 2025
https://github.com/andygol/yamap
Yamap Ain't Map – deployment of OSM infrastructure project inspired by osm-seed
api data extract geo-data map openstreetmap osm
Last synced: 24 Jun 2025
https://github.com/pseudomuto/iceberg-rest-go
A Go client library for working with Iceberg Rest catalogs
Last synced: 25 Jan 2026
https://github.com/andreped/chatbot-streamlit-demo
Develop accessible ChatBot with Azure OpenAI and Streamlit
azure chatbot data data-mining huggingface huggingface-spaces large-language-models llm openai python research streamlit web-application
Last synced: 01 Aug 2025
https://github.com/arverma/data-engineer-interview-experience
My interview experience with the companies I interviewed with
big-data data data-engineer data-engineering engineering interview interview-practice interview-preparation interview-questions python3 spark sql
Last synced: 19 May 2026
https://github.com/DataHerb/dataherb-python
Python Package for DataHerb: create, search, and load datasets.
data data-analysis data-mining database dataset python
Last synced: 08 May 2025
https://github.com/edgardleal/thanos-for-data
A Thanos implementation to restore the balance of your data
Last synced: 15 Jun 2025
https://github.com/yashika-malhotra/data-exploration-and-visualization-for-streaming-platform
Data Analysis and Visualization for streaming platform to provide insights and recommendations to improve their userbase.
colab-notebook data data-visualization jupyter-notebook matplotlib numpy pandas python seaborn
Last synced: 18 Apr 2026
https://github.com/aidinhamedi/pneumonia-detection-ai
This project uses a deep learning model built with the TensorFlow Library to detect pneumonia in X-ray images. The model architecture is based on the EfficientNetB7 model, which has achieved an accuracy of approximately 97.12% (97.11538%) on our test data. This high accuracy rate is one of the strengths of our AI model.
ai artificial-intelligence artificial-neural-networks cli data data-science database gui interface kaggle keras pneumonia pneumonia-classification pneumonia-detection pneumonia-detector pneumoniac-xray python tensorflow tensorflow2 training
Last synced: 06 Apr 2025
https://github.com/dotnet-ad/staticbind
Generated and compiled data binding for .NET (Xamarin.iOS, Xamarin.Android,...)
Last synced: 19 May 2026
https://github.com/njraladdin/newspapers-com-scraper
A Node.js scraper for extracting article data from Newspapers.com based on keywords, dates, and locations.
archive data newspapers scraper scraper-api scraping
Last synced: 06 Apr 2025
https://github.com/priyanka7411/dataspark-electronics-retail-analytics
DataSpark is a data analysis project using Python, SQL, and Power BI to analyze global electronics retail sales, focusing on customer behavior, sales performance, product profitability, and store performance to optimize sales strategies.
analytics-providers business-intelligence customer-segmentation data data-analysis electronics-industry global-sales pandas powerbi powerbi-visuals product-profitability python retail-analytics sales-performance sql store-analysis visualization
Last synced: 10 Jul 2025
https://github.com/burakboduroglu/data_structures_and_algorithms
This repo contains my data structures and algorithms code.
alghorithm data data-structures dynamic-programming graph hash interview interview-questions linked-list structures tree-structure
Last synced: 04 Apr 2025
https://github.com/aymericzip/api-refetch
Alternative to SWR or react-query. A hook that stores your API calls and provides states such as isLoading, isFetched, data, and error. It can fetch the API as soon as the hook is mounted, and it provides retry and revalidation options.
api async autofetch cache data fetch loading react-query retry revalidate session-storage state store swr zustand
Last synced: 11 Apr 2025
https://github.com/varletjs/ruler-factory
A flexible, chainable validation rule factory for TypeScript/JavaScript.
chainable data factory form javascript rules typescript validation validator
Last synced: 12 Sep 2025
https://github.com/thamerh/web-scraper-with-node.js-and-cheerio
A simple example of how to scrape data, based on "Build a Web Scraper with Node.js and Cheerio"
cheer data expressjs nodejs scarper webscraping
Last synced: 08 Apr 2026
https://github.com/olajideolagunju/gcp_mage_data_pipeline
An end-to-end data engineering pipeline that processes and analyzes Maintenance Work Orders using Mage, Docker, Google BigQuery, MariaDB, and Looker Studio. It features a seamless integration of cloud and open-source tools for scalable data storage, transformation, and visualization.
automation bigquery cloud compute-engine data data-engineering database database-schema docker-compose excel gcp mage-ai maintenance mariadb orchestration python sql virtual-machine visualization-dashboard work-orders
Last synced: 07 Mar 2025
https://github.com/themitosan/grpp
GRPP is a simple tool written in TS that helps preserve git repositories.
cli data git grpp linux preservation project repo repository
Last synced: 15 Jul 2025
https://github.com/alexandregazagnes/global-biodiversity-score
CDC Biodiversité is a subsidiary of the Caisse des Dépôts et Consignations, the largest French financial institution. It specializes in providing biodiversity-positive solutions to businesses, such as ecological offsets and biodiversity footprinting.
analytics biodiversity data data-science environment ghg python
Last synced: 28 Jul 2025
https://github.com/yessasvini23/machine_learning_specialization_deeplearning.ai
Contains all course modules, exercises, and notes of the ML Specialization by Andrew Ng, Stanford University, and DeepLearning.ai on Coursera
andrew-ng andrew-ng-course andrew-ng-machine-learning classification data data-science deep-learning machine-learning machine-learning-algorithms neural-network nlp-machine-learning regression rnn-tensorflow
Last synced: 18 May 2026
https://github.com/vijishmadhavan/parse-clip
A simple CLIP based project for combining images from multiple datasets.
clip data datacleaning dataexploration dataset fastai image python
Last synced: 14 May 2026
https://github.com/ultreon/ubo
NBT inspired data I/O. Made for games.
api binary-data data data-storage file-type game-data io library ubo
Last synced: 16 Jun 2025