data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-06-27 00:07:33 UTC
- JSON Representation
https://github.com/flowsynx/plugin-json
FlowSynx plugin to loads and parses local JSON files. Supports transformation, extraction, and mapping of hierarchical data structures in workflows.
data data-platform flowsynx json
Last synced: 10 Mar 2026
https://github.com/gmersy/data-carbon
Repository accompanying the paper: Toward a Life Cycle Assessment for the Carbon Footprint of Data
carbon-emissions carbon-footprint climate-change data data-science sustainability sustainable-software
Last synced: 31 Mar 2025
https://github.com/inc44/raqua
Raqua 💧, a set of Python scripts and Rust program, is designed to scan an ocean of disk copies and retrieve files lacking conventional signatures, by creating an overflowing cache
cli console data data-recovery files linux macos python python3 recovery rust search terminal tool windows
Last synced: 11 Apr 2026
https://github.com/spectrochempy/spectrochempy_data
Test and examples data repository for SpectroChemPy
Last synced: 04 Apr 2025
https://github.com/katiesaund/dresden_maps
Contains a data file with locations from The Dresden Files. The data file is to be used for my map tutorial in R.
Last synced: 05 Jan 2026
https://github.com/zonggen/data-structure
Course notes on data structures and analysis (CSC263)
Last synced: 23 Mar 2025
https://github.com/SAP-archive/signavio-qualtrics-di
Setup an SAP Data Intelligence data pipeline to connect Qualtrics surveys data to SAP Signavio Process Intelligence via Ingestion API.
data intelligence process-intelligence qualtrics sample sap-data-intelligence sap-signavio-process-intelligence signavio
Last synced: 09 May 2025
https://github.com/ginga1402/chinook_database
Microsoft SQL Server Management Studio
business-query data sql-server
Last synced: 30 Mar 2025
https://github.com/victorowinoke/after-work-data-science-project-showcase-eda
You work for Lublu as a Data Science Consultant and you have been tasked to perform analysis on pricing, product and assortment of Adidas and Nike. Create a descriptive analysis report, making relevant observations and recommendations that will help Lublu in the launch of such similar products.
adidas analysis data deliverables nike pythonanalysis ranges
Last synced: 28 May 2026
https://github.com/rafaelfloressouza/Covid-19-Dashboard
Python web application to display COVID19 data from the world using Plotly and Dash
bootstrap covid-19 css data datavisualization plotly-dash python3
Last synced: 10 Mar 2025
https://github.com/diegoperea20/own_dataset_segmentation_yolov8
Segmentacion y detection de objetos con propio dataset usando YOLOV8 , en el que se utiliza un dataset propio de una moneda de 200 pesos colombianos del año 2023.
coins colombia data opencv own python segmentation tensorflow yolov8
Last synced: 12 Apr 2026
https://github.com/dev-owdenmag/dataflow-manager
A dynamic and versatile web application for managing, collecting, and presenting data with an integrated printing feature.
data data-management data-management-platform data-visualization python
Last synced: 30 Mar 2025
https://github.com/rayenfathallah/students_analysis
This projects contains an analysis of the different fadtors affecting students performance in their final exams. The project uses D3.js to create interactive dashboards that are compelling and easy to interpret.
analysis d3 data education javascript python students
Last synced: 12 Apr 2026
https://github.com/metriccoders/metriccoders_datasets
This is the Metric Coders repository containing all the datasets for machine learning.
data datasets machine-learning natural-language-processing scikit-learn
Last synced: 08 Apr 2025
https://github.com/dwidevelopes/database-input-pelanggran-mahasiswa
Menginput data Mahasiswa Yang Melakukan Pelanggran yang siap di data dan di hukum Dan juga siap Terkena Sanksi
aplikasi aplikasi-sekolah data data-analysis database input-method mahasiswa sekolah siswa siswi website
Last synced: 02 May 2026
https://github.com/gher-uliege/bluecloud-plankton
Spatial interpolation of plankton data using a neural network
data data-analysis data-visualization neural-network oceanography
Last synced: 30 Mar 2025
https://github.com/igorskyflyer/npm-adblock-header-extract
✂️ Parse and extract ad-block filter list headers with ease. Works on strings or files, trims whitespace, and returns clean metadata for tooling and automation. 📃
adblock back-end biome data filter header igorskyflyer javascript js metadata node nodejs npm string ts typescript utility
Last synced: 11 Mar 2026
https://github.com/open-i18n/data-unicode-math
Git mirror for Unicode Support for Mathematics data
data i18n internationalization math mathematics open-i18n unicode unicode-consortium unicode-data
Last synced: 11 Mar 2026
https://github.com/GiveMePseudonyms/PiVisualisations
A way to visualise millions of digits of Pi. Written in Python using Pygame and Tkinter.
data data-visualization pi pygame python self-organising-criticality tkinter
Last synced: 08 Apr 2025
https://github.com/khalyomede/fetch
Quickly retrieve your PHP data
config configuration data fetch php php7
Last synced: 15 Mar 2025
https://github.com/bijx/firestore-data-fetcher
A simple Python script to fetch documents from a Firebase Firestore collection and save them to a local `.json` file.
automation data database downloader exporter fetcher firebase firestore open-source script
Last synced: 12 Apr 2026
https://github.com/cqllum/schema2dwh
⚡ Automatically produce a data model on your database using its information schema using GenAI.
ai data data-structures dataengineering datawarehousing dwh gemini gemini-api genai reporting reporting-tool schema-design
Last synced: 13 Mar 2025
https://github.com/nik-kusanagi/bash.sh-treinamento
Versão mais organizada (+ ou -)
data database debian gnome gnome-extension gnu gnu-linux linux shell shell-script
Last synced: 05 May 2026
https://github.com/castelao/bufr
BUFR binary data format from WMO
binary data format meteorology oceanography wmo
Last synced: 13 Jul 2025
https://github.com/shivam1808/data-cleaning-project
We take raw housing data and transform it in SQL Server to make it more usable for analysis.
analysis data datacleaning sql sqlserver
Last synced: 29 May 2026
https://github.com/ntia/compound_radar_waveforms-data
Data used by NTIA/ITS TR-23-566 Examining the Effects of Resolution Bandwidth when Measuring Compound Radar Waveforms.
bandwidth data measurement p0n q3n radar resolution stepped waveform
Last synced: 27 Jan 2026
https://github.com/fredhutch/gdscnsoilsites
Homepage for BioDIGS Project. Learn about the project and download data.
biodigs data metagenomics student-research
Last synced: 25 Mar 2025
https://github.com/datenoio/internacia-db
Public registry of the intergovernmental organizations, country groups and countries. Available as JSONl, Parquet, YAML and DuckDB database datasets
countries data datasets international international-trade reference
Last synced: 29 May 2026
https://github.com/lmuffato/project-ting-trybe
Projeto ting - Projeto avaliativo da Trybe do Bloco 37: Estrutura de Dados II: Listas, Filas e Pilhas
data data-analysis python queue read-file stack trybe trybe-projects
Last synced: 12 Jun 2025
https://github.com/toransahu/metoffice
Data visualisation - MetOffice
data metoffice uk visualization weather
Last synced: 25 Mar 2025
https://github.com/lane-romuald/iot-irrigation-data-collection-system
An IoT-based data collection system using the ESP32 microcontroller programmed with Arduino to monitor environmental conditions for smart irrigation. The system measures soil moisture, temperature, air temperature, humidity, and rain probability. Data is stored locally on an SD card and uploaded to the ThingSpeak platform.
arduino cloud data data-collection esp32 openweather openweathermap thingspeak wi-fi
Last synced: 12 Apr 2026
https://github.com/osiota10/alx-low_level_programming
C Low Level Programming - Data Structures, Linux/Unix System Programming and Algorithms with ALX Software Engineering
algorithms assembly c data data-structures linux shell unix
Last synced: 25 Jun 2025
https://github.com/edugmenes/azure-data-engineering
This repository contains my first end-to-end Data Engineering project, built using Microsoft Azure Cloud and Azure Databricks with PySpark.
azure cloud data data-engineering data-lakehouse data-structures databricks delta-lake etl-pipelines lakehouse lakehouse-architectures medallion-architecture microsoft-azure pyspark spark
Last synced: 29 Jan 2026
https://github.com/eugenedakin/caesarcipher
Native Xojo code for the Caesar Cipher algorithm with an example program
caesar-cipher data decryption encryption xojo
Last synced: 07 Jan 2026
https://github.com/themost-framework/jspa
JavaScript Persistent API
api data database-schema jspa object-relational-mapping orm orm-framework
Last synced: 31 Aug 2025
https://github.com/cleanzr/restaurant
Restaurant data set for entity resolution
Last synced: 11 Mar 2026
https://github.com/fiskeben/meetjescraper
HTTP proxy for Meet je stad project
api data go iot meetjestad proxy scraper weather
Last synced: 29 May 2026
https://github.com/grycap/cdmi-client-go
A basic Go library to perform CDMI core operations
Last synced: 21 Jan 2026
https://github.com/quasilyte/phpcorpus
A collection of various PHP code; useful for PHP tools writers to get some insights on how "real-world" PHP code looks like
analysis corpus data php php-corpus
Last synced: 04 Jul 2025
https://github.com/codeforafrica/ckanext-followy
[ARCHIVED] A CKAN extension to show the datasets a user is following.
ckan ckan-extension ckanext-followy data dataset followy-extension open-data
Last synced: 16 Mar 2025
https://github.com/e-panourgia/data-science-projects
Data Science Projects
annotations augmentation data data-preprocessing-and-cleaning hyperparameter-tuning llm logistic-regression nlp random-forest-classifier xboost-classifier
Last synced: 09 Apr 2025
https://github.com/avahoffman/dataplay
🤸♂️ Load data to play with
data data-package r r-package rstats
Last synced: 25 Mar 2025
https://github.com/stdlib-js/ndarray-base-assert-is-complex-floating-point-data-type
Test if an input value is a supported ndarray complex-valued floating-point data type.
array assert base check data dtype is javascript multidimensional ndarray node node-js nodejs stdlib test types util utilities utility utils
Last synced: 08 Mar 2026
https://github.com/bolajiolayinka/graph-api-automation
An End to End Automation from Facebook Business to Data Visualization of Campaigns
Last synced: 07 May 2025
https://github.com/tether/tether-schema
Custom protocol buffer schema for data validation
data protocol schema validation
Last synced: 09 Apr 2025
https://github.com/clinton-mwachia/data-analysis-in-r
Various Analysis in R
data data-science machine-learning machine-learning-algorithms r random-forest rstats
Last synced: 30 Nov 2025
https://github.com/xpotify/scraper
Scraper designed for Xpotify's client to gather information from websites🌟
axios cheerio data javascript scraper webscraper
Last synced: 07 Jul 2025
https://github.com/desininja/data-engineer-interview-questions
This repository contains all the Data Engineer Interview Questions asked by interviewers.
data data-engineer-interview-questions
Last synced: 31 Mar 2025
https://github.com/bredalis/datastructure
📚 Estructuras de Datos en Python
algorithms data data-structure python
Last synced: 12 Apr 2026
https://github.com/stdlib-js/ndarray-base-to-reversed
Return a new ndarray where the order of elements of an input ndarray is reversed along each dimension.
base data flip javascript matrix ndarray node node-js nodejs reverse slice stdlib structure to-reversed types vector view
Last synced: 12 Apr 2026
https://github.com/stdlib-js/array-float32
Float32Array.
array data float float32 float32array ieee754 javascript node node-js nodejs single single-precision stdlib structure typed typed-array types
Last synced: 14 Jan 2026
https://github.com/agavitalis/sample-c-codes
A collection of small projects I carried out on audino as an electronic engineering student despite felling in love with website development.
ageteller atm binary data gpcalculator logging
Last synced: 09 Apr 2025
https://github.com/devlive-community/mockaroo
一个轻量级的 HTTP Mock 服务器,用于快速构建模拟数据接口,适用于前后端开发和接口测试场景。
Last synced: 08 Jul 2025
https://github.com/shawnduong/pacman-digest
Generate a digest of package space usage for Linux systems using pacman.
Last synced: 13 May 2026
https://github.com/yasenstar/powerbi_tutorial
Base on "PowerBI Tutorial" book, provide step by step video demo on learning and mastering Power BI tool
analytics data microsoft powerbi tutorial visualization
Last synced: 07 Jan 2026
https://github.com/bukalapak/bukadata
Data supplier plugin for populating design with real data.
data plugin sketch sketch-plugin
Last synced: 05 Jul 2025
https://github.com/jigyasag18/gold-price-prediction-project-using-machine-learning
This repository contains a machine learning project focused on predicting gold prices (GLD) using historical stock market data, including indicators such as SPX, USO, SLV, and EUR/USD. The project implements a Random Forest Regressor for accurate price forecasting, complete with data visualization, correlation analysis, and model evaluation metrics
data dataset jupyter-notebook jupyter-notebooks machine-learning machinelearing machinelearningalgorithms machinelearningmodel machinelearningprojects matplotlib mlproject numpy pandas randomforestregressor seaborn
Last synced: 23 Jul 2025
https://github.com/so-cool/uobrain
My solution to the University of Bristol PURE Data Challenge
Last synced: 09 Sep 2025
https://github.com/jaldekoa/fdicapi
A Python wrapper to easily retrieve data from the BankFind Suite official API from FDIC in pandas format.
api api-wrapper banking data finance pandas python united-states
Last synced: 07 Jan 2026
https://github.com/san089/black-friday-sales-analysis
This Project gives an insight into few statistics related to black Friday Sale.
custom data dataanalysis insights sales statistics
Last synced: 13 Jul 2025
https://github.com/camara94/introduction-to-data-engineering
Describe the different entities that form a modern data ecosystem. Describe and differentiate between the role and responsibilities of Data Engineers, Data Scientists, Data Analysts, Business Analysts, and Business Intelligence Analysts. Explain what Data Engineering is. List the tasks that need to be performed in a typical data engineering lifecycle. Describe what a day in the life of a Data Engineer looks like.
business-analytics business-intelligence data dataingestion dataintegration datascience machinelearning python statistical-analysis
Last synced: 09 Apr 2025
https://github.com/stdlib-js/array-zero-to-like
Generate a linearly spaced numeric array whose elements increment by 1 starting from zero and having the same length and data type as a provided input array.
array data float32array float64array int16array int32array javascript matrix ndarray node node-js nodejs stdlib structure typed typed-array types uint32array vector
Last synced: 07 Jan 2026
https://github.com/gkapfham/ast2016-paper
Source Code of and Supporting Files for a Paper Published at AST 2016
data latex-document paper research
Last synced: 19 Oct 2025
https://github.com/danreynolds/data_batcher
Data batcher batches and de-dupes data fetched in the same task of the event loop.
batching data flutter hacktoberfest
Last synced: 19 May 2026
https://github.com/harmanveer-2546/supply-chain
Supply chain analytics is a valuable part of data-driven decision-making in various industries such as manufacturing, retail, healthcare, and logistics. It is the process of collecting, analyzing and interpreting data related to the movement of products and services from suppliers to customers.
customer-segmentation-analysis data data-analysis data-cleaning data-insights ggplot2 numpy pandas performance-evaluation predictive-analytics-for-business python risk-assessment sales-analysis statistical-analysis supply-chain tidyverse trend-analysis
Last synced: 10 Apr 2026
https://github.com/goncaloperes/datavisualization
Here I will share some of my data visualizations using a variety of datasets, technologies and tools.
d3js data dataset datavisualization dataviz ggplot matplotlib rawgraphs seaborn tableau visualization yellowbrick
Last synced: 04 Feb 2026
https://github.com/ispyhumanfly/prowler
Query the web, extract data from the results, and transform that data into a format you can use.
ai analytics business cryptocurrency data extract-data machine-learning mining scraping web
Last synced: 06 Sep 2025
https://github.com/tylerben/data-spring
Easily generate a dummy dataset based on a provided config
data data-spring datagenerator fake-data generator javascript typescript
Last synced: 27 May 2026
https://github.com/stdlib-js/array-zero-to
Generate a linearly spaced numeric array whose elements increment by 1 starting from zero.
array data float32array float64array int16array int32array javascript matrix ndarray node node-js nodejs stdlib structure typed typed-array types uint32array vector
Last synced: 08 Jan 2026
https://github.com/luminati-io/Crunchbase-dataset-samples
A sample of 1001 Crunchbase companies with key data points, extracted using the Bright Data API.
crunchbase crunchbase-api crunchbase-scraper data database datasets webscraper-api webscraping
Last synced: 09 Apr 2025
https://github.com/cintia0528/data_cleaning_and_analytics-python
Evaluate if aggressive discounting benefits Eniac long-term, considering differing views on customer acquisition and brand positioning. Focus on data cleaning for informed decision-making.
colab-notebook data data-analysis datacleaning dataquality jupyter-notebook matplotlib pandas python seaborn
Last synced: 08 Jan 2026
https://github.com/jrcichra/ingestd
HTTP server that easily ingests data into a database
data gin hacktoberfest ingest ingestion restful-api
Last synced: 28 Apr 2026
https://github.com/marabesi/d3-visualization
Different visualizations using data and d3.js
charts css d3js data html js json timeline-chart visualization
Last synced: 01 May 2026
https://github.com/rayyan9477/dep
data data-science machine-learning python visualization web-scraping
Last synced: 08 May 2026
https://github.com/ayushai/salesfoce-hospital-management
A custom Salesforce-based Hospital Management System with powerful dashboards and data analysis tools. It provides real-time insights into patient care, appointment scheduling, and inventory management, optimizing healthcare operations and decision-making.
analytics dashboard data salesforce-developers visualization
Last synced: 22 Feb 2026
https://github.com/ronaldkanyepi/python-streamlit-covid-19-dashboard
This is a responsive streamlit covid 19 Dashboard
analytics data data-analysis data-visualization datascience python streamlit
Last synced: 18 May 2026
https://github.com/garcane/global-shipping-analytics-dashboard
This Tableau project provides a comprehensive visual analysis of global sales, shipping costs, and quality metrics across different regions and countries.
data data-analysis data-analyst data-visualization metrics tableau
Last synced: 01 Mar 2026
https://github.com/wioniqle-q/tower-modelling
Data science
data data-science ndarray-odeint ndjson science
Last synced: 16 Mar 2025
https://github.com/bredalis/scikitlearn
🤖 Library to create ML models 🤖
data ia learning-python librery ml python
Last synced: 30 May 2026
https://github.com/xtao-org/tree-annotation
What is TAO
annotation data intercommunication json notation s-expressions simplicity syntax tao tree tree-annotation universal xml
Last synced: 25 May 2026
https://github.com/nafisalawalidris/buybuy-e-commerce-company
The BuyBuy E-commerce Company repository is a comprehensive hub for the company's e-commerce platform. It includes source code, documentation, and data analysis insights, providing a data-driven approach to improve customer experience, drive revenue, and inform decision-making.
buybuy cleaning-data company customer-experience data data-analysis decision-making documentation e-commerce excel insights postgresql repository revenue source-code sql
Last synced: 16 Mar 2025
https://github.com/vagnerbellacosa/029_analisededadoscompythonpandas
Neste Labs será apresentada a biblioteca Pandas, uma biblioteca Python de código aberto para análise de dados. Ela dá ao Python a capacidade de trabalhar com dados do tipo planilha, permitindo carregar, manipular e combinar dados rapidamente, entre outras funções. Python
data digital-innovation-one dio jupiter-notebook labs ms-excel panda python
Last synced: 14 May 2026
https://github.com/jmcanterafonseca/leaflet-context-information
A Leaflet plugin + infrastructure for getting access to Context Information (i.e. data) exposed through FIWARE NGSIv2
context data fiware information leaflet map open visualization web
Last synced: 21 Apr 2026
https://github.com/gagolews/clustering-results-v1
A framework for benchmarking clustering algorithms – Benchmark results (for version 1 of the Suite)
benchmark benchmark-datasets clustering data dataset datasets machine-learning
Last synced: 16 Mar 2025