data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-06-23 00:07:41 UTC
- JSON Representation
https://github.com/olajideolagunju/gcp_mage_data_pipeline
An end-to-end data engineering pipeline that processes and analyzes Maintenance Work Orders using Mage, Docker, Google BigQuery, MariaDB, and Looker Studio. It features a seamless integration of cloud and open-source tools for scalable data storage, transformation, and visualization.
automation bigquery cloud compute-engine data data-engineering database database-schema docker-compose excel gcp mage-ai maintenance mariadb orchestration python sql virtual-machine visualization-dashboard work-orders
Last synced: 07 Mar 2025
https://github.com/strmprivacy/docs
With STRM Privacy you can easily build privacy-by-design data pipelines and define data contracts to encode privacy inside your data. Data streams are pseudonymised or anonymised in real-time or batch. These are our docs.
data documentation docusaurus privacy privacy-enhancing-technologies
Last synced: 12 Jul 2025
https://github.com/monfireboose/monfireboose
A lightweight JavaScript library that provides a high level and model based API for interacting with Firebase.
api data database firebase firestore high-level-api interact javascript library model storage
Last synced: 18 Feb 2026
https://github.com/jujuadams/ini-to-json
JSON+buffer replacement for native GameMaker INI functions.
data gamemaker gamemaker-studio-2 gms2 ini json save
Last synced: 21 Jul 2025
https://github.com/dnth/mafat-fastdup-blogpost
Data insights from the MAFAT Satellite Vision challenge.
clustering computer-vision data data-visualization dataset duplicate-detection mafat-radar-challenge validation vision
Last synced: 27 Mar 2025
https://github.com/biglocalnews/upload-files
Upload comma-delimited files to biglocalnews.org in your GitHub Action
action actions archiving csv data data-journalism github-actions journalism news
Last synced: 27 Apr 2026
https://github.com/csengupta1101/dig-student-files
This Repository will contain all student submissions at one place.
data datascience education machine-learning python students visualization
Last synced: 17 Jul 2025
https://github.com/njraladdin/newspapers-com-scraper
A Node.js scraper for extracting article data from Newspapers.com based on keywords, dates, and locations.
archive data newspapers scraper scraper-api scraping
Last synced: 06 Apr 2025
https://github.com/priyanka7411/dataspark-electronics-retail-analytics
DataSpark is a data analysis project using Python, SQL, and Power BI to analyze global electronics retail sales, focusing on customer behavior, sales performance, product profitability, and store performance to optimize sales strategies.
analytics-providers business-intelligence customer-segmentation data data-analysis electronics-industry global-sales pandas powerbi powerbi-visuals product-profitability python retail-analytics sales-performance sql store-analysis visualization
Last synced: 10 Jul 2025
https://github.com/yashika-malhotra/data-exploration-and-visualization-for-streaming-platform
Data Analysis and Visualization for streaming platform to provide insights and recommendations to improve their userbase.
colab-notebook data data-visualization jupyter-notebook matplotlib numpy pandas python seaborn
Last synced: 18 Apr 2026
https://github.com/astrid-project/lcp
In each local agent, the control plane is responsible for programmability, i.e., changing the behaviour of the data plane at run-time.
agent beats control data ebpf elasticsearch log logstash management programmability security
Last synced: 06 Apr 2025
https://github.com/v4ss3ur/hierarchicaldatagrid.wpf
A WPF control that mix DataGrid and TreeView functionalities, allowing for hierarchical, recursive data display with expandable nested rows. Ideal for complex data structures in an easy-to-use, MVVM-friendly tabular format.
controls data datagrid hierarchical hierarchical-data mvvm nested nested-objects nested-structures treeview wpf xaml
Last synced: 13 May 2025
https://github.com/andreped/chatbot-streamlit-demo
Develop accessible ChatBot with Azure OpenAI and Streamlit
azure chatbot data data-mining huggingface huggingface-spaces large-language-models llm openai python research streamlit web-application
Last synced: 01 Aug 2025
https://github.com/vasturiano/data-bind-mapper
Bind data arrays with any type of JS objects
bind data digest joins mapper performance
Last synced: 26 Jul 2025
https://github.com/arverma/data-engineer-interview-experience
My interview experience with the companies I interviewed with
big-data data data-engineer data-engineering engineering interview interview-practice interview-preparation interview-questions python3 spark sql
Last synced: 19 May 2026
https://github.com/edgardleal/thanos-for-data
A Thanos implementation to restore the balance of your data
Last synced: 15 Jun 2025
https://github.com/burakboduroglu/data_structures_and_algorithms
This repo contains my sata structures and algorithms codes.
alghorithm data data-structures dynamic-programming graph hash interview interview-questions linked-list structures tree-structure
Last synced: 04 Apr 2025
https://github.com/aa-sikkkk/twitterdatamining
A Simple Script to mine data from X/Twitter
Last synced: 24 Jan 2026
https://github.com/psfried/dgen
Generate evil test data
csv data data-generation data-generator language testing-tools
Last synced: 18 Mar 2025
https://github.com/emrecpp/datapacket-csharp
Send, recv, encrypt, decrypt, compress data as Packet and send it with socket for C#.
compress data deserialization deserialize deserializer encrypt packet send serialization serialize serializer socket
Last synced: 15 Sep 2025
https://github.com/alexandregazagnes/global-biodiversity-score
CDC Biodiversité is a subsidiary of the Caisse des Dépôts et Consignation, the largest French financial institution. It is specialized in providing biodiversity-positive solutions to businesses such as ecological offsets and biodiversity footprinting.
analytics biodiversity data data-science environment ghg python
Last synced: 28 Jul 2025
https://github.com/FCC/contours-api-node
Enterprise Contours Node API
api contours data data-visualization geospatial gis map
Last synced: 27 Jul 2025
https://github.com/andygol/yamap
Yamap Ain't Map – deployment of OSM infrastructure project inspired by osm-seed
api data extract geo-data map openstreetmap osm
Last synced: 24 Jun 2025
https://github.com/satyam4229/college-predictor-system
The college predictor system is a Python-based application that utilizes a machine learning model to predict colleges and their corresponding degree programs and branches based on a student's JEE (Joint Entrance Examination) score.
data data-science jupyter-notebook kaggle prediction python
Last synced: 06 Apr 2026
https://github.com/sermetpekin/evdscpp
evdscpp is a C++ library for fast, efficient, and user-friendly interaction with the EVDS API Server. Designed with performance in mind, it provides built-in caching, an Excel export option, and an intuitive user interface for configuring and retrieving data. evdscpp can be extended for integration with other C++ projects and offers options for use
cbrt central-bank cpp data edds evds evds-api evdscpp tcmb tcmb-api
Last synced: 07 Sep 2025
https://github.com/antoineaugusti/purchasing-power
Archive daily data about purchasing power parity: how much goods should cost in various countries
archive data purchasing-power-parity
Last synced: 28 Oct 2025
https://github.com/helixspiral/ndbc
Golang wrapper for the National Data Buoy Center (NDBC)
data data-science golang government-data ndbc ndbc-buoy-data noaa noaa-api noaa-buoys noaa-data noaa-weather wrapper wrapper-api wrapper-library
Last synced: 14 Jun 2025
https://github.com/headless-start/data-augmentation-impact
This repository contains effect of Data Augmentation of Training Set during Model Training.
augmented-images cuda data gpu keras matplotlib mnist opencv-python python3 tensorflow training-data
Last synced: 05 Apr 2026
https://github.com/stimulsoft/samples-dashboards.js-for-react
JavaScript samples for Dashboards.JS data analysis tool for React applications
analyzer chart components constructor dashboard dashboards data designer export expression javascript js library parser react react-dashboard reactjs relation text viewer
Last synced: 09 Aug 2025
https://github.com/amethyst-php/address
The place where a person or organization can be found or communicated with. Contains fields such as: street, postal code, city, country etc... Can be used for example as a shipment address or as an invoice address.
address amethyst amethyst-package api data laravel
Last synced: 13 Aug 2025
https://github.com/thekartikeyamishra/data_cleaning_project
Welcome to the Data Cleaning and Visualization project! This repository demonstrates how to clean messy data and create insightful visualizations using Python with Pandas and Matplotlib.
data dataanalysis matplotlib matplotlib-pyplot pandas python
Last synced: 02 May 2026
https://github.com/courtois-neuromod/anat
Anatomical sub-dataset of Courtois-Neuromod project.
Last synced: 17 Jan 2026
https://github.com/kom-senapati/ghw-data-hacks
🌍 Global Hack Week data projects, 📊 focused on exploration, manipulation, and analysis...
Last synced: 12 Mar 2025
https://github.com/imadsaddik/bodmaghdataset
BoDmagh dataset is a Supervised Fine-Tuning (SFT) dataset for the Darija language
arabic-llm arabic-nlp darija-llm darija-nlp data dataset fine-tuning llm nlp sft
Last synced: 03 Apr 2025
https://github.com/sheweny/discord-resolve
This module groups together functions to retrieve data from different types of arguments.
data discord discord-js mentions resolver sheweny utility
Last synced: 29 Oct 2025
https://github.com/schbenedikt/datamining
Heise (https://heise.de) News Crawler
data data-science heise postgresql web-crawler
Last synced: 10 Apr 2025
https://github.com/thealphadollar/messiah
Messiah: The Mighty Son Of God Is Here To Help You Through Times Of Calamity
azure backend data data-analysis flask frontend materialize natural-disasters
Last synced: 19 Jan 2026
https://github.com/godeltech/godeltech.data.entityframeworkcore
Library to access database with Unit of Work, Repository and Entity classes for Entity Framework Core.
data entity entity-framework-core repository unitofwork
Last synced: 30 Apr 2025
https://github.com/godeltech/godeltech.data
.NET library to access data storage with Unit of Work, Repository and Entity classes
data entity repository unitofwork
Last synced: 30 Apr 2025
https://github.com/polina-prokofieva/viewjson
The class for convenient visualization of json with some settings.
data data-visualization es5 es6 javascript json
Last synced: 15 May 2026
https://github.com/woctezuma/download-steam-banners-data
Data consisting of Steam banners.
Last synced: 06 Jan 2026
https://github.com/heikomuller/histore
Library for maintaining snapshots of evolving tabular data sets
Last synced: 10 Apr 2025
https://github.com/felixklauke/atomizer
Playing around with butter knife, android bindings and rx java.
binding butterknife data java react rx rxjava
Last synced: 15 May 2026
https://github.com/danlsn/causality
A Personal Data Platform and the culmination of years of curiosity and learning in the Data Engineering space.
data data-engineering datawarehousing personal-data quantified-self
Last synced: 06 Mar 2026
https://github.com/shysolocup/aepl
A Node.JS multi-layered class creation package with built-in parenting systems that let you get info from classes above as well as better function and property makers for easier to read and understand development and modding support inspired by Roblox's Studio API.
aepl backend classes data framework game-development game-framework javascript js js-class js-framework lightweight nodejs package
Last synced: 28 Oct 2025
https://github.com/antvis/create-antv-demo
A simple CV-dashboard framework for practicing how to use AntV.
antv cv dashboard data resume resume-template resume-website visualization
Last synced: 09 Apr 2025
https://github.com/radekbednarik/data_generator
Random data generator using Python. Generate data files with random string, floats, ints, dates via console or TOML files..
csv data generator python python3 random test-data-generator
Last synced: 13 Dec 2025
https://github.com/andrei-vataselu/data-science-snippets
🧰 Essential EDA and Data Cleaning Helpers for Any DataFrame This collection of functions is designed to accelerate exploratory data analysis (EDA), quickly surface data quality issues, and offer high-level insights into the structure and content of your dataset.
artificial-intelligence data data-science eda feature-engineering hyperparamater-tunning library loading model-evaluation modeling preprocessing python snippets text-processing time-series visualization
Last synced: 10 Mar 2026
https://github.com/memair/apps
App Store for Memair
apps appstore data data-science quantified-self
Last synced: 06 Apr 2026
https://github.com/utrechtuniversity/dataprivacysurvey
Code for analysing data from the Data Privacy Survey (2022)
data gdpr open-science privacy rdm research research-data-management survey utrecht-university
Last synced: 16 Jun 2025
https://github.com/ciscorn/tinybufr
A Rust library for decoding BUFR meteorological observation data format
bufr data meteorology rust weather wmo
Last synced: 11 Jan 2026
https://github.com/johntocci/nullaxe
Nullaxe is a powerful and user-friendly Python library designed for cleaning and preprocessing data. It works seamlessly with both pandas and polars DataFrames, making it a versatile tool for data scientists and developers.
data data-analysis data-science datacleaning pandas polars python
Last synced: 06 Apr 2026
https://github.com/dhruvldrp9/simpledht
A Python-based Distributed Hash Table (DHT) implementation enabling cross-network key-value storage, automatic node discovery, and data replication with a simple CLI and library interface.
cross-network-node-communation data data-replication data-synchronization dht dht-python distributed-hash-table key-value-storage nat netowork node-discovery peer-to-peer peer-to-peer-network python sha-256 simple udp udp-socket-communication
Last synced: 28 Feb 2026
https://github.com/marlenezw/speech-to-text
Turn any video or audio recording into a written transcript using python
data data-science python speech speech-recognition speech-synthesis speech-to-text
Last synced: 27 Apr 2026
https://github.com/zarr-developers/cookiecutter-zarr-store
Cookiecutter for Zarr store implementations
chunked data n-dimensional zarr
Last synced: 16 Jun 2025
https://github.com/hmeleiro/alquilermad
Housing rent map in Comunidad de Madrid / Mapa del alquiler en la Comunidad de Madrid
data data-science data-visualization datascience housing-location-visualization rent renting
Last synced: 13 Sep 2025
https://github.com/ahmedkhalf/arabic-keyword-scraper
Stop wasting your time! And obtain Arabic definitions without having to look it up.
arabic data definitions scraper sentences wordsearch
Last synced: 12 Mar 2025
https://github.com/eosdis-nasa/earthdata-pub-dashboard
Front-end Dashboard for Earthdata Pub
data earthdata edpub publication
Last synced: 15 Jan 2026
https://github.com/nikoshet/exploratory-data-analysis-using-r
Exploratory Data Analysis using R Course Project for M.Sc. 'Data Science and Machine Learning' in NTUA
data data-analysis data-science eda exploratory-data-analysis ggplot2 r
Last synced: 14 May 2026
https://github.com/unaygney/js-challenges-data-structures-and-algorithms
Repo of the challenges I'm trying to solve to understand data structures and algorithms..
algorithms-and-data-structures data javascript structure
Last synced: 29 Oct 2025
https://github.com/thejeshgn/thejeshgn
data data-visualization datameet india opendata public-interest
Last synced: 15 Jan 2026
https://github.com/stdlib-js/datasets-suthaharan-single-hop-sensor-network
Labeled wireless sensor network data set collected from a simple single-hop wireless sensor network deployment using TelosB motes.
data dataset datasets javascript labeled machine-learning ml mote motes network node node-js nodejs outlier outliers sample sensor statistics stats stdlib
Last synced: 03 Mar 2025
https://github.com/samboycoding/hungergames-data
data hunger-games javascript json
Last synced: 15 May 2026
https://github.com/hariprashad-ravikumar/ai-datascience-lab
AI‑DataScience‑Lab is a web app for uploading CSV datasets, cleaning with Pandas, and running quick exploratory analyses and regression models using scikit‑learn. Its modular design supports future AI extensions, like deep learning with TensorFlow or insight generation via the OpenAI API.
ai api azure cloudcomputing data data-analysis data-science data-visualization mathplotlib numpy openai pandas python scikit-learn
Last synced: 02 Aug 2025
https://github.com/aaronmeder/social-history
A quick look into your history on social media. Drop in the archives you've downloaded from Facebook and Instagram and see some stats about your time on the networks.
archives data facebook instagram statistics stats
Last synced: 27 Mar 2025
https://github.com/rrighart/rrighart.github.io
A webpage about data science, programming, statistics and related topics
analyses data data-mining programming statistics
Last synced: 20 Jan 2026
https://github.com/mohammadkarbalaee/introduction-to-data-science-sbu
Reports and full documentation of the introduction to data science course held at SBU
data data-science python shahid-beheshti-university
Last synced: 27 Mar 2025
https://github.com/johnmackintosh/simd2016_tmap
Mapping SIMD with tmap - static & interactive
data data-science data-visualization mapping r visualisation
Last synced: 20 Mar 2025
https://github.com/enricocid/monitoraggio-vaccini-italia
Sito web statico per github.com/apalladi/covid_vaccini_monitoraggio
covid-19 covid-19-data covid-19-data-analysis data data-analysis data-visualization dataset python python3 python37 sars-cov-2
Last synced: 09 Sep 2025
https://github.com/mostafanabieh/image-classification-with-data-augmentation
Project for Data augmentation with tensorflow v2
data deep-learning image-classification machine-learning tensorflow tensorflow2
Last synced: 07 May 2026
https://github.com/stanford-oval/medxchange
Medical Data Exchange (MedXchange) platform
data ethereum exchange medical medxchange
Last synced: 16 May 2026
https://github.com/royruddle/vizdataquality
Python package for visualizing data quality
data data-science data-visualization jupyter-notebook missing-data python
Last synced: 05 May 2025
https://github.com/bgmp/tesis-german-deuster
Datos estadísticos para tercería de una tésis
Last synced: 28 Mar 2025
https://github.com/lukanedimovic/table_editor
A simple table data editor, with easily scalable functions and operations & a nice GUI
data data-science formula java parser parsing preprocessing swing tokenizer
Last synced: 04 Apr 2025
https://github.com/cheminfo/cheminfo-types
chemistry data hacktoberfest schema typescript
Last synced: 03 Apr 2026
https://github.com/wpp-public/akqa-nz-tagmanager-connector
A simple javascript library to send events to a tag manager container
Last synced: 05 Apr 2025
https://github.com/inspect-js/is-data-view
Is this value a JS DataView? This module works cross-realm/iframe, does not depend on instanceof or mutable properties, and despite ES6 Symbol.toStringTag.
data dataview ecmascript javascript typedarray typedarrays view
Last synced: 05 Apr 2025
https://github.com/serhatkacmaz/cpp-datastructuresandalgortihms
Contains codes related to data structures
algorithms cplusplus data data-structures
Last synced: 10 Jul 2025
https://github.com/victoorv/breast_cancer
Mammographic images classification.
breast-cancer breast-cancer-classification classification cnn cnn-classification convnext convnext-tiny convolutional-neural-networks data data-science data-visualization feature-tuning image image-classification mammogram-images mammographic-images neural-network resnet-50 resnet50 transfer-learning
Last synced: 27 Jan 2026
https://github.com/itzshoaib/hashtegrity
A library for generating hash, validating data integrity, monitoring file/directory integrity, offchain data integrity
crypto-hash data data-integrity hacktoberfest hash integrity
Last synced: 07 Mar 2026
https://github.com/ournet/news-sources
A repository of news sources for every country
data news news-sources sources
Last synced: 11 Jul 2025
https://github.com/jesusgraterol/bitcoin-lightning-network-stats-dataset-builder
The dataset builder script extracts Bitcoin's Lightnining Network statistics through Mempool.space's public API. The data is stored in a .csv file, facilitating its use in data science and machine learning projects.
bitcoin blockchain blockchain-technology data data-science dataset dataset-generation lightning-network machine-learning
Last synced: 16 May 2026
https://github.com/zenwor/table_editor
A simple table data editor, with easily scalable functions and operations & a nice GUI
data data-science formula java parser parsing preprocessing swing tokenizer
Last synced: 22 Jun 2025
https://github.com/andrewjbateman/mevn-stack-data
:clipboard: MEVN Info & Full stack MEVN app with CRUD functions
data database express expressjs full-stack info mevn mevn-stack middleware mongodb mongodb-atlas nodejs typescript vue vue3 vue3-typescript
Last synced: 07 Apr 2026
https://github.com/mickeyshi-syd/actuarial-hackathon-2019
2019 Actuarial Hackathon
actuarial actuaries analytics data data-science hackathon
Last synced: 15 Jul 2025