data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-06-23 00:07:41 UTC
- JSON Representation
https://github.com/burakboduroglu/data_structures_and_algorithms
This repo contains my sata structures and algorithms codes.
alghorithm data data-structures dynamic-programming graph hash interview interview-questions linked-list structures tree-structure
Last synced: 04 Apr 2025
https://github.com/olajideolagunju/gcp_mage_data_pipeline
An end-to-end data engineering pipeline that processes and analyzes Maintenance Work Orders using Mage, Docker, Google BigQuery, MariaDB, and Looker Studio. It features a seamless integration of cloud and open-source tools for scalable data storage, transformation, and visualization.
automation bigquery cloud compute-engine data data-engineering database database-schema docker-compose excel gcp mage-ai maintenance mariadb orchestration python sql virtual-machine visualization-dashboard work-orders
Last synced: 07 Mar 2025
https://github.com/jujuadams/ini-to-json
JSON+buffer replacement for native GameMaker INI functions.
data gamemaker gamemaker-studio-2 gms2 ini json save
Last synced: 21 Jul 2025
https://github.com/njraladdin/newspapers-com-scraper
A Node.js scraper for extracting article data from Newspapers.com based on keywords, dates, and locations.
archive data newspapers scraper scraper-api scraping
Last synced: 06 Apr 2025
https://github.com/csengupta1101/dig-student-files
This Repository will contain all student submissions at one place.
data datascience education machine-learning python students visualization
Last synced: 17 Jul 2025
https://github.com/weecology/ratdat
R package version of Portal Project Teaching Database
data database ecology teaching teaching-data
Last synced: 17 Feb 2026
https://github.com/priyanka7411/dataspark-electronics-retail-analytics
DataSpark is a data analysis project using Python, SQL, and Power BI to analyze global electronics retail sales, focusing on customer behavior, sales performance, product profitability, and store performance to optimize sales strategies.
analytics-providers business-intelligence customer-segmentation data data-analysis electronics-industry global-sales pandas powerbi powerbi-visuals product-profitability python retail-analytics sales-performance sql store-analysis visualization
Last synced: 10 Jul 2025
https://github.com/DataHerb/dataherb-python
Python Package for DataHerb: create, search, and load datasets.
data data-analysis data-mining database dataset python
Last synced: 08 May 2025
https://github.com/kawai-senpai/potatodb
PotatoDB is a lightweight, file-based NoSQL database for Python projects, designed for easy setup and use in small-scale applications. Ideal for developers seeking simple data persistence without the complexity of traditional databases.
data database easy-to-use file-based json key-value lightweight nosql nosql-database persistence python simple
Last synced: 23 Oct 2025
https://github.com/muneeb1030/finetune-tiny-llama
Fine-tuning the Tiny Llama model to mimic my professor's writing style using the Llama Factory. The project involves data collection, preprocessing, preparation, fine-tuning, and evaluation.
data data-preparation data-preprocessing finetuning llama-factory llm pymupdf selenium-python spacy tinyllama webscraping
Last synced: 08 Apr 2026
https://github.com/techiaith/brawddegau-tagiedig
Corpws o frawddegau CC0 mewn fformat jsonl, gyda rhannau ymadrodd y tocynnau (geiriau etc.) wedi'u tagio â thagiau Universal Dependencies. // A Corpus of CC0 sentences in the jsonl format, tagged with Universal Dependency part-of-speech tags.
annotated cc0 commonvoice data nlp welsh
Last synced: 17 Jan 2026
https://github.com/dnth/mafat-fastdup-blogpost
Data insights from the MAFAT Satellite Vision challenge.
clustering computer-vision data data-visualization dataset duplicate-detection mafat-radar-challenge validation vision
Last synced: 27 Mar 2025
https://github.com/amethyst-php/invoice
amethyst amethyst-package api data invoice laravel
Last synced: 10 Apr 2025
https://github.com/arverma/data-engineer-interview-experience
My interview experience with the companies I interviewed with
big-data data data-engineer data-engineering engineering interview interview-practice interview-preparation interview-questions python3 spark sql
Last synced: 19 May 2026
https://github.com/vasturiano/data-bind-mapper
Bind data arrays with any type of JS objects
bind data digest joins mapper performance
Last synced: 26 Jul 2025
https://github.com/faster-games/whiskey
Data and Events framework for Unity. 🥃⚡
Last synced: 19 May 2026
https://github.com/FCC/contours-api-node
Enterprise Contours Node API
api contours data data-visualization geospatial gis map
Last synced: 27 Jul 2025
https://github.com/codiepp/elykseer-base
cryptographic data archive; written in F#; envisaged to stay another 10 years
archive cli cryptography data distributed-storage dotnet fsharp longterm-storage
Last synced: 19 May 2026
https://github.com/daninet/audio-annotator
Simple app for annotating audio segments
ai annotate annotation artificial audio data intelligence label labeling labeling-tool learning machine ml science wav
Last synced: 04 Apr 2025
https://github.com/yashika-malhotra/data-exploration-and-visualization-for-streaming-platform
Data Analysis and Visualization for streaming platform to provide insights and recommendations to improve their userbase.
colab-notebook data data-visualization jupyter-notebook matplotlib numpy pandas python seaborn
Last synced: 18 Apr 2026
https://github.com/satyam4229/college-predictor-system
The college predictor system is a Python-based application that utilizes a machine learning model to predict colleges and their corresponding degree programs and branches based on a student's JEE (Joint Entrance Examination) score.
data data-science jupyter-notebook kaggle prediction python
Last synced: 06 Apr 2026
https://github.com/sermetpekin/evdscpp
evdscpp is a C++ library for fast, efficient, and user-friendly interaction with the EVDS API Server. Designed with performance in mind, it provides built-in caching, an Excel export option, and an intuitive user interface for configuring and retrieving data. evdscpp can be extended for integration with other C++ projects and offers options for use
cbrt central-bank cpp data edds evds evds-api evdscpp tcmb tcmb-api
Last synced: 07 Sep 2025
https://github.com/antoineaugusti/purchasing-power
Archive daily data about purchasing power parity: how much goods should cost in various countries
archive data purchasing-power-parity
Last synced: 28 Oct 2025
https://github.com/helixspiral/ndbc
Golang wrapper for the National Data Buoy Center (NDBC)
data data-science golang government-data ndbc ndbc-buoy-data noaa noaa-api noaa-buoys noaa-data noaa-weather wrapper wrapper-api wrapper-library
Last synced: 14 Jun 2025
https://github.com/headless-start/data-augmentation-impact
This repository contains effect of Data Augmentation of Training Set during Model Training.
augmented-images cuda data gpu keras matplotlib mnist opencv-python python3 tensorflow training-data
Last synced: 05 Apr 2026
https://github.com/stimulsoft/samples-dashboards.js-for-react
JavaScript samples for Dashboards.JS data analysis tool for React applications
analyzer chart components constructor dashboard dashboards data designer export expression javascript js library parser react react-dashboard reactjs relation text viewer
Last synced: 09 Aug 2025
https://github.com/amethyst-php/address
The place where a person or organization can be found or communicated with. Contains fields such as: street, postal code, city, country etc... Can be used for example as a shipment address or as an invoice address.
address amethyst amethyst-package api data laravel
Last synced: 13 Aug 2025
https://github.com/thekartikeyamishra/data_cleaning_project
Welcome to the Data Cleaning and Visualization project! This repository demonstrates how to clean messy data and create insightful visualizations using Python with Pandas and Matplotlib.
data dataanalysis matplotlib matplotlib-pyplot pandas python
Last synced: 02 May 2026
https://github.com/courtois-neuromod/anat
Anatomical sub-dataset of Courtois-Neuromod project.
Last synced: 17 Jan 2026
https://github.com/kom-senapati/ghw-data-hacks
🌍 Global Hack Week data projects, 📊 focused on exploration, manipulation, and analysis...
Last synced: 12 Mar 2025
https://github.com/imadsaddik/bodmaghdataset
BoDmagh dataset is a Supervised Fine-Tuning (SFT) dataset for the Darija language
arabic-llm arabic-nlp darija-llm darija-nlp data dataset fine-tuning llm nlp sft
Last synced: 03 Apr 2025
https://github.com/sheweny/discord-resolve
This module groups together functions to retrieve data from different types of arguments.
data discord discord-js mentions resolver sheweny utility
Last synced: 29 Oct 2025
https://github.com/schbenedikt/datamining
Heise (https://heise.de) News Crawler
data data-science heise postgresql web-crawler
Last synced: 10 Apr 2025
https://github.com/thealphadollar/messiah
Messiah: The Mighty Son Of God Is Here To Help You Through Times Of Calamity
azure backend data data-analysis flask frontend materialize natural-disasters
Last synced: 19 Jan 2026
https://github.com/godeltech/godeltech.data.entityframeworkcore
Library to access database with Unit of Work, Repository and Entity classes for Entity Framework Core.
data entity entity-framework-core repository unitofwork
Last synced: 30 Apr 2025
https://github.com/godeltech/godeltech.data
.NET library to access data storage with Unit of Work, Repository and Entity classes
data entity repository unitofwork
Last synced: 30 Apr 2025
https://github.com/polina-prokofieva/viewjson
The class for convenient visualization of json with some settings.
data data-visualization es5 es6 javascript json
Last synced: 15 May 2026
https://github.com/woctezuma/download-steam-banners-data
Data consisting of Steam banners.
Last synced: 06 Jan 2026
https://github.com/heikomuller/histore
Library for maintaining snapshots of evolving tabular data sets
Last synced: 10 Apr 2025
https://github.com/felixklauke/atomizer
Playing around with butter knife, android bindings and rx java.
binding butterknife data java react rx rxjava
Last synced: 15 May 2026
https://github.com/danlsn/causality
A Personal Data Platform and the culmination of years of curiosity and learning in the Data Engineering space.
data data-engineering datawarehousing personal-data quantified-self
Last synced: 06 Mar 2026
https://github.com/shysolocup/aepl
A Node.JS multi-layered class creation package with built-in parenting systems that let you get info from classes above as well as better function and property makers for easier to read and understand development and modding support inspired by Roblox's Studio API.
aepl backend classes data framework game-development game-framework javascript js js-class js-framework lightweight nodejs package
Last synced: 28 Oct 2025
https://github.com/antvis/create-antv-demo
A simple CV-dashboard framework for practicing how to use AntV.
antv cv dashboard data resume resume-template resume-website visualization
Last synced: 09 Apr 2025
https://github.com/radekbednarik/data_generator
Random data generator using Python. Generate data files with random string, floats, ints, dates via console or TOML files..
csv data generator python python3 random test-data-generator
Last synced: 13 Dec 2025
https://github.com/andrei-vataselu/data-science-snippets
🧰 Essential EDA and Data Cleaning Helpers for Any DataFrame This collection of functions is designed to accelerate exploratory data analysis (EDA), quickly surface data quality issues, and offer high-level insights into the structure and content of your dataset.
artificial-intelligence data data-science eda feature-engineering hyperparamater-tunning library loading model-evaluation modeling preprocessing python snippets text-processing time-series visualization
Last synced: 10 Mar 2026
https://github.com/memair/apps
App Store for Memair
apps appstore data data-science quantified-self
Last synced: 06 Apr 2026
https://github.com/utrechtuniversity/dataprivacysurvey
Code for analysing data from the Data Privacy Survey (2022)
data gdpr open-science privacy rdm research research-data-management survey utrecht-university
Last synced: 16 Jun 2025
https://github.com/ciscorn/tinybufr
A Rust library for decoding BUFR meteorological observation data format
bufr data meteorology rust weather wmo
Last synced: 11 Jan 2026
https://github.com/johntocci/nullaxe
Nullaxe is a powerful and user-friendly Python library designed for cleaning and preprocessing data. It works seamlessly with both pandas and polars DataFrames, making it a versatile tool for data scientists and developers.
data data-analysis data-science datacleaning pandas polars python
Last synced: 06 Apr 2026
https://github.com/dhruvldrp9/simpledht
A Python-based Distributed Hash Table (DHT) implementation enabling cross-network key-value storage, automatic node discovery, and data replication with a simple CLI and library interface.
cross-network-node-communation data data-replication data-synchronization dht dht-python distributed-hash-table key-value-storage nat netowork node-discovery peer-to-peer peer-to-peer-network python sha-256 simple udp udp-socket-communication
Last synced: 28 Feb 2026
https://github.com/marlenezw/speech-to-text
Turn any video or audio recording into a written transcript using python
data data-science python speech speech-recognition speech-synthesis speech-to-text
Last synced: 27 Apr 2026
https://github.com/zarr-developers/cookiecutter-zarr-store
Cookiecutter for Zarr store implementations
chunked data n-dimensional zarr
Last synced: 16 Jun 2025
https://github.com/hmeleiro/alquilermad
Housing rent map in Comunidad de Madrid / Mapa del alquiler en la Comunidad de Madrid
data data-science data-visualization datascience housing-location-visualization rent renting
Last synced: 13 Sep 2025
https://github.com/ahmedkhalf/arabic-keyword-scraper
Stop wasting your time! And obtain Arabic definitions without having to look it up.
arabic data definitions scraper sentences wordsearch
Last synced: 12 Mar 2025
https://github.com/eosdis-nasa/earthdata-pub-dashboard
Front-end Dashboard for Earthdata Pub
data earthdata edpub publication
Last synced: 15 Jan 2026
https://github.com/nikoshet/exploratory-data-analysis-using-r
Exploratory Data Analysis using R Course Project for M.Sc. 'Data Science and Machine Learning' in NTUA
data data-analysis data-science eda exploratory-data-analysis ggplot2 r
Last synced: 14 May 2026
https://github.com/unaygney/js-challenges-data-structures-and-algorithms
Repo of the challenges I'm trying to solve to understand data structures and algorithms..
algorithms-and-data-structures data javascript structure
Last synced: 29 Oct 2025
https://github.com/thejeshgn/thejeshgn
data data-visualization datameet india opendata public-interest
Last synced: 15 Jan 2026
https://github.com/stdlib-js/datasets-suthaharan-single-hop-sensor-network
Labeled wireless sensor network data set collected from a simple single-hop wireless sensor network deployment using TelosB motes.
data dataset datasets javascript labeled machine-learning ml mote motes network node node-js nodejs outlier outliers sample sensor statistics stats stdlib
Last synced: 03 Mar 2025
https://github.com/samboycoding/hungergames-data
data hunger-games javascript json
Last synced: 15 May 2026
https://github.com/hariprashad-ravikumar/ai-datascience-lab
AI‑DataScience‑Lab is a web app for uploading CSV datasets, cleaning with Pandas, and running quick exploratory analyses and regression models using scikit‑learn. Its modular design supports future AI extensions, like deep learning with TensorFlow or insight generation via the OpenAI API.
ai api azure cloudcomputing data data-analysis data-science data-visualization mathplotlib numpy openai pandas python scikit-learn
Last synced: 02 Aug 2025
https://github.com/aaronmeder/social-history
A quick look into your history on social media. Drop in the archives you've downloaded from Facebook and Instagram and see some stats about your time on the networks.
archives data facebook instagram statistics stats
Last synced: 27 Mar 2025
https://github.com/rrighart/rrighart.github.io
A webpage about data science, programming, statistics and related topics
analyses data data-mining programming statistics
Last synced: 20 Jan 2026
https://github.com/mohammadkarbalaee/introduction-to-data-science-sbu
Reports and full documentation of the introduction to data science course held at SBU
data data-science python shahid-beheshti-university
Last synced: 27 Mar 2025
https://github.com/johnmackintosh/simd2016_tmap
Mapping SIMD with tmap - static & interactive
data data-science data-visualization mapping r visualisation
Last synced: 20 Mar 2025
https://github.com/enricocid/monitoraggio-vaccini-italia
Sito web statico per github.com/apalladi/covid_vaccini_monitoraggio
covid-19 covid-19-data covid-19-data-analysis data data-analysis data-visualization dataset python python3 python37 sars-cov-2
Last synced: 09 Sep 2025
https://github.com/mostafanabieh/image-classification-with-data-augmentation
Project for Data augmentation with tensorflow v2
data deep-learning image-classification machine-learning tensorflow tensorflow2
Last synced: 07 May 2026
https://github.com/stanford-oval/medxchange
Medical Data Exchange (MedXchange) platform
data ethereum exchange medical medxchange
Last synced: 16 May 2026
https://github.com/royruddle/vizdataquality
Python package for visualizing data quality
data data-science data-visualization jupyter-notebook missing-data python
Last synced: 05 May 2025
https://github.com/bgmp/tesis-german-deuster
Datos estadísticos para tercería de una tésis
Last synced: 28 Mar 2025
https://github.com/lukanedimovic/table_editor
A simple table data editor, with easily scalable functions and operations & a nice GUI
data data-science formula java parser parsing preprocessing swing tokenizer
Last synced: 04 Apr 2025
https://github.com/cheminfo/cheminfo-types
chemistry data hacktoberfest schema typescript
Last synced: 03 Apr 2026
https://github.com/wpp-public/akqa-nz-tagmanager-connector
A simple javascript library to send events to a tag manager container
Last synced: 05 Apr 2025
https://github.com/inspect-js/is-data-view
Is this value a JS DataView? This module works cross-realm/iframe, does not depend on instanceof or mutable properties, and despite ES6 Symbol.toStringTag.
data dataview ecmascript javascript typedarray typedarrays view
Last synced: 05 Apr 2025
https://github.com/serhatkacmaz/cpp-datastructuresandalgortihms
Contains codes related to data structures
algorithms cplusplus data data-structures
Last synced: 10 Jul 2025
https://github.com/victoorv/breast_cancer
Mammographic images classification.
breast-cancer breast-cancer-classification classification cnn cnn-classification convnext convnext-tiny convolutional-neural-networks data data-science data-visualization feature-tuning image image-classification mammogram-images mammographic-images neural-network resnet-50 resnet50 transfer-learning
Last synced: 27 Jan 2026
https://github.com/itzshoaib/hashtegrity
A library for generating hash, validating data integrity, monitoring file/directory integrity, offchain data integrity
crypto-hash data data-integrity hacktoberfest hash integrity
Last synced: 07 Mar 2026
https://github.com/ournet/news-sources
A repository of news sources for every country
data news news-sources sources
Last synced: 11 Jul 2025
https://github.com/jesusgraterol/bitcoin-lightning-network-stats-dataset-builder
The dataset builder script extracts Bitcoin's Lightnining Network statistics through Mempool.space's public API. The data is stored in a .csv file, facilitating its use in data science and machine learning projects.
bitcoin blockchain blockchain-technology data data-science dataset dataset-generation lightning-network machine-learning
Last synced: 16 May 2026
https://github.com/zenwor/table_editor
A simple table data editor, with easily scalable functions and operations & a nice GUI
data data-science formula java parser parsing preprocessing swing tokenizer
Last synced: 22 Jun 2025
https://github.com/andrewjbateman/mevn-stack-data
:clipboard: MEVN Info & Full stack MEVN app with CRUD functions
data database express expressjs full-stack info mevn mevn-stack middleware mongodb mongodb-atlas nodejs typescript vue vue3 vue3-typescript
Last synced: 07 Apr 2026
https://github.com/mickeyshi-syd/actuarial-hackathon-2019
2019 Actuarial Hackathon
actuarial actuaries analytics data data-science hackathon
Last synced: 15 Jul 2025