data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-07-02 00:07:45 UTC
- JSON Representation
https://github.com/msmenegol/datapark
Datapark: a self-hosted data platform
airflow data data-engineering data-science jupyter-notebook machine-learning minio mlflow postgresql spark
Last synced: 19 Feb 2026
https://github.com/qiyangduan/schemaindex
SchemaIndex is designed for data scientists to index and search metadata more efficiently.
data database engineering hdfs indexing reflection reverse scientists
Last synced: 16 Jan 2026
https://github.com/msrd0/gotham_formdata
Form data parsing for the gotham web framework
data form gotham html http multipart rust server urlencoded
Last synced: 16 Mar 2025
https://github.com/ymorsi7/ayatica
Exploring patterns, miracles, and linguistic features in Islamic sacred texts through data visualization
d3 d3js data data-visualization hadith hadiths islam islamic quran visualization
Last synced: 19 Apr 2025
https://github.com/pratapvardhan/maharashtra-state-elections-2014
Maharashtra State Elections - 2014
data elections india maharashtra maharashtra-state-elections python web-scraping
Last synced: 11 Jun 2025
https://github.com/maltzsama/sumeh
Sumeh — Unified Data Quality Framework Sumeh is a unified data quality validation framework supporting multiple backends (PySpark, Dask, Polars, DuckDB, Pandas) with centralized rule configuration.
dask-dataframes data data-quality data-quality-analysis data-quality-assessment data-quality-checks data-quality-framework data-quality-measurement data-quality-report duckdb duckdb-extension pandas pandas-library polars polars-dataframe polars-extensions pyspark pyspark-dataframes
Last synced: 09 Mar 2026
https://github.com/jacob-shuman/static_lists
astro clsx data lists netlify pnpm random react static-data static-site tailwindcss
Last synced: 29 Apr 2026
https://github.com/flexiodata/functions-covid-19-feed
Import Covid-19 data from Johns Hopkins University into Microsoft Excel and Google Sheets.
covid-19 data excel google-sheets import johns-hopkins-csse johns-hopkins-university spreadsheet
Last synced: 10 Mar 2025
https://github.com/facekapow/struffer
Struct + Buffer = Struffer. Also works with Uint8Arrays
buffer data javascript node node-js nodejs struct structure structured-data typescript uint8array
Last synced: 10 Apr 2026
https://github.com/andreabozzo/osservatorio
Osservatorio - Open Data Processing Platform
api-rest data data-analysis data-visualization database datamodelling docker duckdb etl fastapi jwt-authentication open-source pipeline postgresql python react sqlite
Last synced: 02 Sep 2025
https://github.com/erasmosoares/UnityDataManager
This project allows game developers to create and manager level atributes using xls files. Using a xls file can be a simple way to edit character attributes in addition to having a broad view of the attributes for each level of the game, besides making this edition in a separate file enable you to share the same file with other team members such as game design and level design.
data levelup manager unity unity-scripts
Last synced: 26 Apr 2025
https://github.com/dbt-labs/jaffle-shop-mesh-marketing
A ✨ meshified ✨ open source sandbox project exploring dbt workflows via a fictional sandwich shop's data. This is a domain-focused node in the mesh focused on marketing models, built on the jaffle-shop-mesh-platform project.
analytics analytics-engineering data data-engineering dbt dbt-cloud
Last synced: 04 Mar 2026
https://github.com/tangentlin/indexed-collection
A zero-dependency library of classes that make filtering, sorting and observing changes to arrays easier and more efficient.
collectionview data data-indexing index javascript typescript
Last synced: 19 Apr 2025
https://github.com/hellomaxime/data-platform-on-kubernetes
Open Source Data Platform on Kubernetes
bigdata data data-pipeline dbt druid etl kubernetes ml open-source platform spark superset
Last synced: 09 Nov 2025
https://github.com/datawookie/data-diaspora
Various datasets used in tutorials and workshops.
Last synced: 20 Mar 2025
https://github.com/olajideolagunju/gcp_mage_data_pipeline
An end-to-end data engineering pipeline that processes and analyzes Maintenance Work Orders using Mage, Docker, Google BigQuery, MariaDB, and Looker Studio. It features a seamless integration of cloud and open-source tools for scalable data storage, transformation, and visualization.
automation bigquery cloud compute-engine data data-engineering database database-schema docker-compose excel gcp mage-ai maintenance mariadb orchestration python sql virtual-machine visualization-dashboard work-orders
Last synced: 07 Mar 2025
https://github.com/abistarun/resseract-lite
A Data Analytics Tool with flexible architecture to visualize and analyse data
analystics charts custom-dashboards data data-science data-visualization diverse-data-sources expression-language geochart geocharts interactive-data-exploration light-weight lightweight monitoring resseract resseract-lite slice tool visualisations visualization
Last synced: 14 May 2025
https://github.com/ebsco/builde
Open source Bibframe vocabulary files
bibframe data libraries linked
Last synced: 08 Mar 2026
https://github.com/urunov/algorithms
algorithm, data structure, dynamic array, dynamic programming
algorithms algorithms-and-data-structures data dynamic-programming
Last synced: 20 Mar 2025
https://github.com/rn0x/app.altaqwaa.org
موقع إسلامي شامل يحتوي على الأذكار والقرآن الكريم بأصوات عدد كبير من القراء، بالإضافة إلى تفسير وحصن المسلم. يتضمن الموقع أيضًا مسبحة إلكترونية وإذاعات إسلامية وأوقات الصلاة.
app broadcasts data islam muslim prayer quran tafsir website website-design
Last synced: 07 Aug 2025
https://github.com/edoardottt/computerphile-pong
Pong game with a little bit of Data Science. Computerphile.
2d-game computerphile csv data data-science datascience game game-2d game-development games pandas pong pong-game pygame pygame-library python python-3 python-library python3
Last synced: 30 Oct 2025
https://github.com/aidinhamedi/pneumonia-detection-ai
This project uses a deep learning model built with the TensorFlow Library to detect pneumonia in X-ray images. The model architecture is based on the EfficientNetB7 model, which has achieved an accuracy of approximately 97.12% (97.11538%) on our test data. This high accuracy rate is one of the strengths of our AI model.
ai artificial-intelligence artificial-neural-networks cli data data-science database gui interface kaggle keras pneumonia pneumonia-classification pneumonia-detection pneumonia-detector pneumoniac-xray python tensorflow tensorflow2 training
Last synced: 06 Apr 2025
https://github.com/swaymm7/open-source-prompt-library
Here is where I store all my useful prompts
chatgpt-prompt data data-analytics data-engineering deepseek gpt ios llm macos prompt prompts prompts-template swift-package-manager tracker
Last synced: 16 Jul 2025
https://github.com/aydinnyunus/dictionary
Dictionary
data data-analysis data-science data-structures data-visualization database dataset dictionaries dictionary dictionary-learning python python-2 python-3 python-3-6 python-library python-script python2 python27 python3 python36
Last synced: 09 May 2025
https://github.com/fabriquebeweb/dao
Le 'Data Access Object' pour les nuls !
Last synced: 18 Feb 2026
https://github.com/mbrn/dbmixer
A project that mask database columns by several algorithms
Last synced: 19 Jul 2025
https://github.com/dsietz/daas
Overview of the Data as a Service (DaaS) architecture
archconf architecture daas data message-broker microservice nfjs patterns rust-lang uberconf
Last synced: 21 May 2026
https://github.com/aa-sikkkk/twitterdatamining
A Simple Script to mine data from X/Twitter
Last synced: 24 Jan 2026
https://github.com/codiepp/elykseer-base
cryptographic data archive; written in F#; envisaged to stay another 10 years
archive cli cryptography data distributed-storage dotnet fsharp longterm-storage
Last synced: 19 May 2026
https://github.com/edgardleal/thanos-for-data
A Thanos implementation to restore the balance of your data
Last synced: 15 Jun 2025
https://github.com/stefanbohacek/fediverse-explorations
Exploring the fediverse through data, studies, and polls.
data data-visualization fediverse mastodon social-media
Last synced: 12 Apr 2025
https://github.com/vijishmadhavan/parse-clip
A simple CLIP based project for combining images from multiple datasets.
clip data datacleaning dataexploration dataset fastai image python
Last synced: 14 May 2026
https://github.com/louis030195/ega
ai artificial-intelligence data data-science data-visualization
Last synced: 07 Mar 2026
https://github.com/dyazincahya/kbbi-sql-database
Kamus Besar Bahasa Indonesia (KBBI) SQL Database | Total data 115.978 Kata | Tersedia untuk MySQL, SQLite dan PostgreSQL. Juga tersedia untuk format data CSV, JSON, Markdown, PHP Array, XML, DbUnit, HTML
bahasa-indonesia csv data database dictionary indonesian-language json kamus kamus-besar-bahasa-indonesia kamus-indonesia kbbi kbbi-api kbbi-sql mysql php-array postgresql sql sqlite xml
Last synced: 19 Aug 2025
https://github.com/gaurav-van/data_science__machine_learning-projects
Compilation of Data Science and Machine Learning Projects Completed During My Bachelor's Degree
classification data data-science data-science-projects data-visualization deep-learning deep-learning-models deep-learning-projects deep-neural-networks inceptionv3 machine-learning machine-learning-projects python regression streamlit
Last synced: 23 Jun 2025
https://github.com/oliver021/ecmalinq
The linq runtime and support to typescript/javascript ecosystem
collection data iterable iteration javascript library linq linq-expressions nodejs query stream stream-data structure typescript
Last synced: 13 May 2025
https://github.com/emrecpp/datapacket-csharp
Send, recv, encrypt, decrypt, compress data as Packet and send it with socket for C#.
compress data deserialization deserialize deserializer encrypt packet send serialization serialize serializer socket
Last synced: 15 Sep 2025
https://github.com/dicook/tutorial_make_better_data_plots
Materials for a workshop in June 2025
data data-analysis data-science data-visualization r statistical-graphics statistics
Last synced: 25 Jun 2025
https://github.com/burakboduroglu/data_structures_and_algorithms
This repo contains my sata structures and algorithms codes.
alghorithm data data-structures dynamic-programming graph hash interview interview-questions linked-list structures tree-structure
Last synced: 04 Apr 2025
https://github.com/opendatablend/opendatablend-py
The fastest way to get data from the Open Data Blend Dataset API
data data-engineering data-science dataset frictionless-data frictionlessdata koalas pandas python
Last synced: 14 Dec 2025
https://github.com/mbanq/dupe
Fake banking data for your front- or backend
backend data datagenerator fake faker frontend javascript nodejs npm npm-package
Last synced: 13 May 2025
https://github.com/nickmcintyre/processing-netcdf
Simple access to scientific datasets with Processing
Last synced: 11 Apr 2025
https://github.com/muradisazade777/vaultedge
**VaultEdge** is a secure, modular, and scalable backend system built with C#. It provides robust user authentication, encrypted vault storage, and a clean RESTful API architecture.
api backend backend-api backend-server backend-service core csharp data json json-server testing token
Last synced: 29 Oct 2025
https://github.com/rcorrero/light-pipe
A high-level syntax for data pipelines, designed to make pipeline development quick and painless.
data data-pipelines data-processing geospatial-analysis geospatial-processing pipeline
Last synced: 14 Dec 2025
https://github.com/sureshpandiyan1/smartweather
detect real-time weather data of any country
data no-api python smart software users weather weather-app weather-data weather-information windows windows-app
Last synced: 12 May 2026
https://github.com/deveripon/assignment-6-assets
This assets is only for Reactive Accelarator Batch 2 - Assignment 6
Last synced: 30 Apr 2025
https://github.com/romelperez/empanada
Simple data mock generator.
data generator javascript mock typescript
Last synced: 11 Apr 2025
https://github.com/andreped/chatbot-streamlit-demo
Develop accessible ChatBot with Azure OpenAI and Streamlit
azure chatbot data data-mining huggingface huggingface-spaces large-language-models llm openai python research streamlit web-application
Last synced: 01 Aug 2025
https://github.com/nalgeon/nalgeon.github.io
Everything about SQLite, Python, open data and awesome software
Last synced: 14 Jul 2025
https://github.com/varletjs/ruler-factory
A flexible, chainable validation rule factory for typeScript/javaScript.
chainable data factory form javascript rules typescript validation validator
Last synced: 12 Sep 2025
https://github.com/vasturiano/data-bind-mapper
Bind data arrays with any type of JS objects
bind data digest joins mapper performance
Last synced: 26 Jul 2025
https://github.com/abdelmajidlh/eportfolio
ePortfolio Abdelmajid EL HOU
bioinformatics data data-analysis data-science data-visualization database datascience genetics
Last synced: 22 Mar 2025
https://github.com/monfireboose/monfireboose
A lightweight JavaScript library that provides a high level and model based API for interacting with Firebase.
api data database firebase firestore high-level-api interact javascript library model storage
Last synced: 18 Feb 2026
https://github.com/faster-games/whiskey
Data and Events framework for Unity. 🥃⚡
Last synced: 19 May 2026
https://github.com/divinemonk/dataentrywebapp
Data Entry Web App is a lightweight web application built with Flask, a Python web framework, designed to streamline data entry and management processes. It provides a user-friendly interface for efficient data entry, viewing, editing, and deletion.
data data-entry flask flask-application production production-server web webapp
Last synced: 13 Apr 2025
https://github.com/yessasvini23/machine_learning_specialization_deeplearning.ai
Contains all course modules, exercises and notes of ML Specialization by Andrew Ng, Stanford Un. and DeepLearning.ai in Coursera
andrew-ng andrew-ng-course andrew-ng-machine-learning classification data data-science deep-learning machine-learning machine-learning-algorithms neural-network nlp-machine-learning regression rnn-tensorflow
Last synced: 18 May 2026
https://github.com/FCC/contours-api-node
Enterprise Contours Node API
api contours data data-visualization geospatial gis map
Last synced: 27 Jul 2025
https://github.com/daninet/audio-annotator
Simple app for annotating audio segments
ai annotate annotation artificial audio data intelligence label labeling labeling-tool learning machine ml science wav
Last synced: 04 Apr 2025
https://github.com/huseyincenik/data_science
Data Science materials
data data-science data-structures data-visualization dataanalysis dataengineering datapreparation dataprocessing datascience dataset time-series time-series-analysis timeline timeseries timeseries-analysis timeseriesforecasting
Last synced: 25 Jul 2025
https://github.com/csengupta1101/dig-student-files
This Repository will contain all student submissions at one place.
data datascience education machine-learning python students visualization
Last synced: 17 Jul 2025
https://github.com/hoanganhngo610/introduction-r-packages
This repository is an introduction to the most essential packages in R programming, for the sake of satisfying any demand and customised work flow
Last synced: 28 Jun 2025
https://github.com/ultreon/ubo
NBT inspired data I/O. Made for games.
api binary-data data data-storage file-type game-data io library ubo
Last synced: 16 Jun 2025
https://github.com/ljharb/define-data-property
Define a data property on an object. Will fall back to assignment in an engine without descriptors.
accessor configurable data define ecmascript enumerable javascript object property writable
Last synced: 13 Apr 2025
https://github.com/codenoid/lazy-mongo
Insert data to mongo from text plain or file
crystal crystal-language data database mongoclient mongodb
Last synced: 13 Apr 2026
https://github.com/priyanka7411/dataspark-electronics-retail-analytics
DataSpark is a data analysis project using Python, SQL, and Power BI to analyze global electronics retail sales, focusing on customer behavior, sales performance, product profitability, and store performance to optimize sales strategies.
analytics-providers business-intelligence customer-segmentation data data-analysis electronics-industry global-sales pandas powerbi powerbi-visuals product-profitability python retail-analytics sales-performance sql store-analysis visualization
Last synced: 10 Jul 2025
https://github.com/strmprivacy/docs
With STRM Privacy you can easily build privacy-by-design data pipelines and define data contracts to encode privacy inside your data. Data streams are pseudonymised or anonymised in real-time or batch. These are our docs.
data documentation docusaurus privacy privacy-enhancing-technologies
Last synced: 12 Jul 2025
https://github.com/njraladdin/newspapers-com-scraper
A Node.js scraper for extracting article data from Newspapers.com based on keywords, dates, and locations.
archive data newspapers scraper scraper-api scraping
Last synced: 06 Apr 2025
https://github.com/weecology/ratdat
R package version of Portal Project Teaching Database
data database ecology teaching teaching-data
Last synced: 17 Feb 2026
https://github.com/rclement/romain-clement.net
Freelance Software Engineer & Trainer
data freelancer machine-learning mkdocs mkdocs-material python
Last synced: 21 Mar 2025
https://github.com/psfried/dgen
Generate evil test data
csv data data-generation data-generator language testing-tools
Last synced: 18 Mar 2025
https://github.com/siongui/gopaliwordvfs
Serve JSON data of Pali words, embedded in Go code
data go golang pali vfs virtual-file-system virtualfilesystem
Last synced: 04 Apr 2025
https://github.com/ferhatgec/kedi
Fegeya Kedi, Experimental Data Interface.
cpp cpp17 data data-interchange data-interface fegeya gnu json library linux xml
Last synced: 14 Apr 2025
https://github.com/techiaith/brawddegau-tagiedig
Corpws o frawddegau CC0 mewn fformat jsonl, gyda rhannau ymadrodd y tocynnau (geiriau etc.) wedi'u tagio â thagiau Universal Dependencies. // A Corpus of CC0 sentences in the jsonl format, tagged with Universal Dependency part-of-speech tags.
annotated cc0 commonvoice data nlp welsh
Last synced: 17 Jan 2026
https://github.com/arverma/data-engineer-interview-experience
My interview experience with the companies I interviewed with
big-data data data-engineer data-engineering engineering interview interview-practice interview-preparation interview-questions python3 spark sql
Last synced: 19 May 2026
https://github.com/gappeah/apocalypse-food-prep-report
This PowerBI project focuses on visualising data for Apocalypse Food Prep, a company specialising in emergency food supplies. The dataset consists of various CSV files containing information on customers, locations, products, sales, sales teams, and state regions.
data data-visualization powerbi powerbi-report powerbi-visuals
Last synced: 25 Feb 2025
https://github.com/alexgustafsson/systembolaget-api-data
An up to date data mirror of Systembolaget's APIs
data data-science sweden systembolaget
Last synced: 28 Oct 2025
https://github.com/astrid-project/lcp
In each local agent, the control plane is responsible for programmability, i.e., changing the behaviour of the data plane at run-time.
agent beats control data ebpf elasticsearch log logstash management programmability security
Last synced: 06 Apr 2025
https://github.com/aymericzip/api-refetch
Alternative to SWC or react-query. Hook that store your API calls and provide states as isLoading, isFetched, data, error. Allow to instantly fetch the API when the hook is mounted. Provide retry and revalidation options.
api async autofetch cache data fetch loading react-query retry revalidate session-storage state store swr zustand
Last synced: 11 Apr 2025
https://github.com/bohnacker/data-manipulation
Some Javascript and Python scripts to manipulate (large) CSV files and JSON data.
data data-mining data-structures javascript python
Last synced: 18 May 2026
https://github.com/jujuadams/ini-to-json
JSON+buffer replacement for native GameMaker INI functions.
data gamemaker gamemaker-studio-2 gms2 ini json save
Last synced: 21 Jul 2025
https://github.com/thamerh/web-scraper-with-node.js-and-cheerio
used simple exemple how Scraper data from Build a Web Scraper with Node.js and Cheerio
cheer data expressjs nodejs scarper webscraping
Last synced: 08 Apr 2026
https://github.com/dnth/mafat-fastdup-blogpost
Data insights from the MAFAT Satellite Vision challenge.
clustering computer-vision data data-visualization dataset duplicate-detection mafat-radar-challenge validation vision
Last synced: 27 Mar 2025