data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum
- Last updated: 2026-06-29 00:07:49 UTC
https://github.com/ucd-cws/nitrates-cv
california centralvalley data frep groundwater model nitrates
Last synced: 16 Jan 2026
https://github.com/astrid-project/cb-manager
APIs to interact with the Context Broker's database. Through a REST Interface, it exposes data and events stored in the internal storage system in a structured way. It provides uniform access to the capabilities of monitoring agents.
agent beats control data ebpf elasticsearch log logstash management programmability security
Last synced: 30 Jun 2025
https://github.com/mohsinali08000/myportfolio
I’m Mohsin Ali, a passionate software engineer with over 2 years of experience in developing robust software solutions. Currently transitioning into the field of data science.
Last synced: 22 Apr 2026
https://github.com/oefenweb/python-untraceables
Randomizes IDs for a given set of tables making them untraceable across environments
anonymize data database mysql privacy python python2 python3 randomization
Last synced: 03 Feb 2026
https://github.com/kingabzpro/makefile-actions
GitHub Actions and MakeFile tutorial and project for beginners.
actions analytics automation data data-science makefile
Last synced: 18 Apr 2026
https://github.com/ahmadjamil888/facial-recognition-ai-model
A facial recognition AI model powered by a CNN and trained on thousands of images.
ai cnn data data-science facial facial-recognition recognition
Last synced: 30 Jun 2025
https://github.com/jimut123/web-crawller
A web crawler which crawls through the whole internet
beautifulsoup collector data databases glance internet link links mining python3 scrapping-python web-crawler
Last synced: 16 Jan 2026
https://github.com/akin-mustapha/portfolio-management-platform
Portfolio data ingestion pipeline
alembic-migration api dash dash-ui dashboard data data-engineering docker-compose ingestion-pipeline kafka postgres prefect stock-market system-design
Last synced: 27 May 2026
https://github.com/ishanoshada/matplot3dex
A Matplotlib 3D Extension package for enhanced data visualization
data data-science matplotlib python-packages scikit-learn
Last synced: 05 Jan 2026
https://github.com/lookininward/data-formatter-demo
You have directories containing data files and specification files. The specification files describe the structure of the data files. Write an app that reads format definitions from specification files. Use these definitions to convert the parsed files to NDJSON files.
csv data demo files json ndjson python txt unittest
Last synced: 27 Apr 2026
https://github.com/nesterenko-kv/object-id
ObjectIDs are a special type of identifier mainly used in MongoDB to uniquely identify documents within a collection. They consist of a 12-byte binary value that includes a timestamp, a machine identifier, a process identifier, and a counter.
c-sharp data id net object-id unique-identifier
Last synced: 16 May 2025
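The ObjectID description above can be illustrated with a minimal sketch. It assumes the legacy field widths of 4 bytes for the timestamp, 3 for the machine identifier, 2 for the process identifier, and 3 for the counter; the description only names the four fields, so those widths are an assumption here. The function name `parse_object_id` is hypothetical and is not taken from the listed repository, which is written in C#.

```python
from datetime import datetime, timezone


def parse_object_id(hex_id: str) -> dict:
    """Split a 24-character hex ObjectID into its four fields.

    Assumed layout (legacy): 4-byte timestamp, 3-byte machine id,
    2-byte process id, 3-byte counter, all big-endian.
    """
    raw = bytes.fromhex(hex_id)
    if len(raw) != 12:
        raise ValueError("an ObjectID must be exactly 12 bytes (24 hex characters)")
    seconds = int.from_bytes(raw[0:4], "big")
    return {
        "seconds": seconds,  # Unix time in seconds
        "timestamp": datetime.fromtimestamp(seconds, tz=timezone.utc),
        "machine": raw[4:7].hex(),
        "process": int.from_bytes(raw[7:9], "big"),
        "counter": int.from_bytes(raw[9:12], "big"),
    }


if __name__ == "__main__":
    fields = parse_object_id("507f1f77bcf86cd799439011")
    print(fields["timestamp"].isoformat(), fields["machine"], fields["counter"])
```

Because the timestamp occupies the leading bytes, sorting such identifiers as raw bytes roughly orders them by creation time.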
https://github.com/kgryte/talks-sfnode-may-2017
Talk for SFNode (May, 2017).
analysis data javascript machine-learning math nodejs numeric-computing presentation statistics talk
Last synced: 22 May 2026
https://github.com/sbdk-dev/sbdk.dev
A complete reference implementation of a local-first ecosystem for AI-powered analytics. This repository contains the source code for the SBDK.dev website, the central hub for the SBDK suite of open-source tools.
ai-powered-analytics data data-engineering data-engineeringlocal-first data-pipeline-automation data-pipelines dbt dlt duckdb elt etl-pipeline llm local-first machine-learning pipeline sbdk semantic-layer
Last synced: 27 May 2026
https://github.com/emnetdegafe/allesoverfilm-backend
AllesOverFilm-backend is part of the AllesOverFilm mobile app development project and contains the database structure, server query scripts, and Sequelize-cli database structures.
backend data data-model express postgresql sequelize-cli
Last synced: 11 Apr 2026
https://github.com/spine-tools/metreload
Python application for downloading meteorological reanalysis data
Last synced: 01 Jul 2025
https://github.com/cosmos-loops/cosmos-dapper
Cosmos.Dapper is a part of Cosmos.Data, an inline project of COSMOS LOOPS PROGRAMME. This repository provides a package of StackExchange.Dapper to improve development efficiency.
dapper data mysql mysqlconnector oracle postgresql sql-query sqlite sqlkata sqlserver
Last synced: 11 Apr 2026
https://github.com/cintia0528/data_analytics_and_visualization-sql_tableau
Evaluate Magist as a strategic partner for Eniac's Brazilian expansion. Use SQL to analyze growth, tech accessory sales potential, delivery times, and customer satisfaction in Magist's database.
data dataanalysis datavisualization sql strategy tableau
Last synced: 31 Mar 2025
https://github.com/tsvikas/covid-19-israel-data
Unofficial GitHub repository with the data published by the Israel Ministry of Health regarding the coronavirus disease
coronavirus-disease covid-19 csv daily-reports data health israel
Last synced: 05 Jan 2026
https://github.com/ttitcombe/timekeep
Defensive timeseries analysis in Python
data data-science sklearn time-series time-series-analysis timeseries
Last synced: 05 Jan 2026
https://github.com/dataship/beam
Get collimate'd data into Frame, in Node or the Browser
column-store data data-science
Last synced: 27 Apr 2026
https://github.com/hamzacham/data_set_projet-4
analysis analytics data data-science datawarehouse sas sql sql-server
Last synced: 24 Mar 2025
https://github.com/benmaier/boarding_school_sir
Fit SIR dynamics to the prevalence curve of an H1N1 outbreak of a British boarding school in 1978.
boarding data disease epidemiology modeling school spreading
Last synced: 31 Mar 2025
https://github.com/waylonwalker/exceltocsv
A useful tool to convert Excel spreadsheets to CSV files without launching Excel
csv-converter csv-files data excel python spreadsheet
Last synced: 05 May 2025
https://github.com/stdlib-js/datasets-cdc-nchs-us-births-1969-1988
US birth data from 1969 to 1988, as provided by the Centers for Disease Control and Prevention's National Center for Health Statistics.
america babies births data dataset datasets javascript node node-js nodejs stdlib time-series timeseries united-states us usa
Last synced: 19 Apr 2025
https://github.com/flowsynx/plugin-csv
FlowSynx plugin that reads and writes CSV files, enabling easy batch data import/export operations and integration with spreadsheet-based data workflows.
comma-separated-values csv data data-platform flowsynx
Last synced: 10 Mar 2026
https://github.com/gbowne1/jsonhelix
An X11 GUI application for editing, debugging, and converting JSON, schemas, and API data.
api data gui gui-application json x11
Last synced: 10 Jun 2025
https://github.com/willdev12/rjson
Encryptable JSON file format for .NET projects!
csharp csharp-library data dotnet json json-data json-plugin variables vbdotnet vbnet
Last synced: 11 Apr 2026
https://github.com/flowsynx/plugin-json
FlowSynx plugin that loads and parses local JSON files. Supports transformation, extraction, and mapping of hierarchical data structures in workflows.
data data-platform flowsynx json
Last synced: 10 Mar 2026
https://github.com/spectrochempy/spectrochempy_data
Test and example data repository for SpectroChemPy
Last synced: 04 Apr 2025
https://github.com/zonggen/data-structure
Course notes on data structures and analysis (CSC263)
Last synced: 23 Mar 2025
https://github.com/jorgeatgu/apaga-luz
💡 How much does electricity cost? 💶
data data-visualization flat-data
Last synced: 04 Feb 2026
https://github.com/danish-foundation-models/dfm-processing
Toolkit for processing data in the Danish Foundation Models project.
Last synced: 02 Jul 2025
https://github.com/ginga1402/chinook_database
Microsoft SQL Server Management Studio
business-query data sql-server
Last synced: 30 Mar 2025
https://github.com/dev-owdenmag/dataflow-manager
A dynamic and versatile web application for managing, collecting, and presenting data with an integrated printing feature.
data data-management data-management-platform data-visualization python
Last synced: 30 Mar 2025
https://github.com/rayenfathallah/students_analysis
This project contains an analysis of the different factors affecting students' performance in their final exams. The project uses D3.js to create interactive dashboards that are compelling and easy to interpret.
analysis d3 data education javascript python students
Last synced: 12 Apr 2026
https://github.com/gher-uliege/bluecloud-plankton
Spatial interpolation of plankton data using a neural network
data data-analysis data-visualization neural-network oceanography
Last synced: 30 Mar 2025
https://github.com/jcasbin/jcasbin-menu-permission
Casbin Menu Permission Example (Based on jCasbin)
abac acl auth authorization authz casbin data go java jcasbin menu permission rbac spring springboot
Last synced: 11 Jul 2025
https://github.com/bijx/firestore-data-fetcher
A simple Python script to fetch documents from a Firebase Firestore collection and save them to a local `.json` file.
automation data database downloader exporter fetcher firebase firestore open-source script
Last synced: 12 Apr 2026
https://github.com/cqllum/schema2dwh
⚡ Automatically produce a data model for your database from its information schema using GenAI.
ai data data-structures dataengineering datawarehousing dwh gemini gemini-api genai reporting reporting-tool schema-design
Last synced: 13 Mar 2025
https://github.com/shivam1808/data-cleaning-project
We take raw housing data and transform it in SQL Server to make it more usable for analysis.
analysis data datacleaning sql sqlserver
Last synced: 29 May 2026
https://github.com/fredhutch/gdscnsoilsites
Homepage for BioDIGS Project. Learn about the project and download data.
biodigs data metagenomics student-research
Last synced: 25 Mar 2025
https://github.com/agahkarakuzu/datavis_edu
Presented in BrainHack School 2019-2020, QBIN SciComm 2021
binder dashboard data notebooks repo2docker visualization
Last synced: 01 Apr 2025
https://github.com/lmuffato/project-ting-trybe
Ting project - Trybe assessment project for Block 37: Data Structures II: Lists, Queues, and Stacks
data data-analysis python queue read-file stack trybe trybe-projects
Last synced: 12 Jun 2025
https://github.com/azrunguraya/kabyle-corpus-dataset
In the world of Natural Language Processing, access to diverse, well-annotated datasets is essential for developing high-performing models. This project aims to fill this specific gap for Taqbaylit, a Berber language spoken mainly in Kabylia.
ber berber berber-dataset corpus data dataset ia kabyle kabyle-art kb machine-learning nlp nlp-machine-learning python taqbaylit text words
Last synced: 31 Jul 2025
https://github.com/lane-romuald/iot-irrigation-data-collection-system
An IoT-based data collection system using the ESP32 microcontroller programmed with Arduino to monitor environmental conditions for smart irrigation. The system measures soil moisture, temperature, air temperature, humidity, and rain probability. Data is stored locally on an SD card and uploaded to the ThingSpeak platform.
arduino cloud data data-collection esp32 openweather openweathermap thingspeak wi-fi
Last synced: 12 Apr 2026
https://github.com/eugenedakin/caesarcipher
Native Xojo code for the Caesar Cipher algorithm with an example program
caesar-cipher data decryption encryption xojo
Last synced: 07 Jan 2026
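For readers unfamiliar with the Caesar cipher named above, here is a minimal sketch in Python. It is not the repository's Xojo code; the function name `caesar` and the choice to shift only ASCII letters while leaving other characters unchanged are assumptions made for illustration.

```python
def caesar(text: str, shift: int) -> str:
    """Shift each ASCII letter by `shift` positions, wrapping around the alphabet."""
    out = []
    for ch in text:
        if ch.isascii() and ch.isalpha():
            base = ord("A") if ch.isupper() else ord("a")
            # Map the letter to 0..25, rotate, and map back.
            out.append(chr((ord(ch) - base + shift) % 26 + base))
        else:
            # Digits, punctuation, spaces, and non-ASCII text pass through.
            out.append(ch)
    return "".join(out)


if __name__ == "__main__":
    secret = caesar("Hello, World!", 3)
    print(secret)              # Khoor, Zruog!
    print(caesar(secret, -3))  # Hello, World!
```

Decryption is the same operation with the negated shift, which is why a single function covers both directions.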
https://github.com/cleanzr/restaurant
Restaurant data set for entity resolution
Last synced: 11 Mar 2026
https://github.com/quasilyte/phpcorpus
A collection of various PHP code; useful for PHP tool writers to get some insight into what "real-world" PHP code looks like
analysis corpus data php php-corpus
Last synced: 04 Jul 2025
https://github.com/codeforafrica/ckanext-followy
[ARCHIVED] A CKAN extension to show the datasets a user is following.
ckan ckan-extension ckanext-followy data dataset followy-extension open-data
Last synced: 16 Mar 2025
https://github.com/e-panourgia/data-science-projects
Data Science Projects
annotations augmentation data data-preprocessing-and-cleaning hyperparameter-tuning llm logistic-regression nlp random-forest-classifier xboost-classifier
Last synced: 09 Apr 2025
https://github.com/bolajiolayinka/graph-api-automation
End-to-end automation from Facebook Business to data visualization of campaigns
Last synced: 07 May 2025
https://github.com/melinteflxrin/softserve-bigdata-project
End-to-end data warehousing project integrating APIs, ETL workflows, and PostgreSQL for analytics and reporting.
analytics api bigdata data datawarehousing externalapi pipeline postgres postgresql python warehouse
Last synced: 26 Jan 2026
https://github.com/thiagopanini/datadelivery
An open-source Terraform module that provides a complete infrastructure toolkit so that users can begin exploring Analytics services on AWS.
analytics athena aws catalog crawler data datamesh glue s3 terraform
Last synced: 29 Nov 2025
https://github.com/tether/tether-schema
Custom protocol buffer schema for data validation
data protocol schema validation
Last synced: 09 Apr 2025
https://github.com/whitehathackerpr/data-visualization-tool
This is a Python-based web application that allows users to upload datasets, analyze data, and create visualizations interactively. The tool is designed for ease of use and provides a simple interface to perform basic data analysis and generate visualizations
data data-analysis data-visualization python python3
Last synced: 05 Sep 2025
https://github.com/xpotify/scraper
Scraper designed for Xpotify's client to gather information from websites🌟
axios cheerio data javascript scraper webscraper
Last synced: 07 Jul 2025
https://github.com/desininja/data-engineer-interview-questions
This repository contains all the Data Engineer Interview Questions asked by interviewers.
data data-engineer-interview-questions
Last synced: 31 Mar 2025
https://github.com/stdlib-js/ndarray-base-to-reversed
Return a new ndarray where the order of elements of an input ndarray is reversed along each dimension.
base data flip javascript matrix ndarray node node-js nodejs reverse slice stdlib structure to-reversed types vector view
Last synced: 12 Apr 2026
https://github.com/stdlib-js/array-float32
Float32Array.
array data float float32 float32array ieee754 javascript node node-js nodejs single single-precision stdlib structure typed typed-array types
Last synced: 14 Jan 2026
https://github.com/agavitalis/sample-c-codes
A collection of small projects I carried out on Arduino as an electronic engineering student, despite falling in love with website development.
ageteller atm binary data gpcalculator logging
Last synced: 09 Apr 2025
https://github.com/shawnduong/pacman-digest
Generate a digest of package space usage for Linux systems using pacman.
Last synced: 13 May 2026
https://github.com/sefakcmn00/tensorflow_car_price_analysis
In this project, after extracting the data sets as csv, we tried to represent the car prices graphically and schematically by using data analysis and data visualization methods. We checked the connection of the car prices we analyzed with other data, then we created a 4-layer and 12-neuron system.
data datatrain keras machine-learning matplotlib-pyplot pandas seaborn sklearn tensorflow
Last synced: 14 Apr 2026
https://github.com/geo-y20/uber-rides-data-analysis
This project aims to analyze Uber ride data to understand various aspects of ride usage, such as the distribution of rides across different categories, purposes, months, days, and times.
dashboard dashboard-templates data data-analysis data-analysis-python data-analytics data-visualization pandas powerbi python recommendation-system rides uber
Last synced: 13 Apr 2026
https://github.com/jigyasag18/gold-price-prediction-project-using-machine-learning
This repository contains a machine learning project focused on predicting gold prices (GLD) using historical stock market data, including indicators such as SPX, USO, SLV, and EUR/USD. The project implements a Random Forest Regressor for accurate price forecasting, complete with data visualization, correlation analysis, and model evaluation metrics
data dataset jupyter-notebook jupyter-notebooks machine-learning machinelearing machinelearningalgorithms machinelearningmodel machinelearningprojects matplotlib mlproject numpy pandas randomforestregressor seaborn
Last synced: 23 Jul 2025
https://github.com/alexscigalszky/palabras-aleatorias-data
This package has a set of datasets of random words, animals, colors, jokes, onomatopoeias, and types
aleatorias data palabras random words
Last synced: 04 Oct 2025
https://github.com/nikhilash45/live_ipl_report
This repository hosts the source code for an interactive IPL (Indian Premier League) Dashboard built using PowerBI. The dashboard provides real-time updates on ongoing matches, including live scores, batting and bowling statistics for both teams, and the points table.
analysts cleaning-data cricket-data dashboard data data-analysis data-visualization dax powerbi
Last synced: 19 Mar 2026
https://github.com/camara94/introduction-to-data-engineering
Describe the different entities that form a modern data ecosystem. Describe and differentiate between the role and responsibilities of Data Engineers, Data Scientists, Data Analysts, Business Analysts, and Business Intelligence Analysts. Explain what Data Engineering is. List the tasks that need to be performed in a typical data engineering lifecycle. Describe what a day in the life of a Data Engineer looks like.
business-analytics business-intelligence data dataingestion dataintegration datascience machinelearning python statistical-analysis
Last synced: 09 Apr 2025
https://github.com/stdlib-js/ndarray-base
Base ndarray.
array base buffer data javascript matrix multidimensional namespace ndarray node node-js nodejs ns stdlib structures types vector
Last synced: 09 Apr 2025
https://github.com/stdlib-js/array-zero-to-like
Generate a linearly spaced numeric array whose elements increment by 1 starting from zero and having the same length and data type as a provided input array.
array data float32array float64array int16array int32array javascript matrix ndarray node node-js nodejs stdlib structure typed typed-array types uint32array vector
Last synced: 07 Jan 2026
https://github.com/danreynolds/data_batcher
Data batcher batches and de-dupes data fetched in the same task of the event loop.
batching data flutter hacktoberfest
Last synced: 19 May 2026
https://github.com/luminati-io/Crunchbase-dataset-samples
A sample of 1001 Crunchbase companies with key data points, extracted using the Bright Data API.
crunchbase crunchbase-api crunchbase-scraper data database datasets webscraper-api webscraping
Last synced: 09 Apr 2025
https://github.com/cintia0528/data_cleaning_and_analytics-python
Evaluate if aggressive discounting benefits Eniac long-term, considering differing views on customer acquisition and brand positioning. Focus on data cleaning for informed decision-making.
colab-notebook data data-analysis datacleaning dataquality jupyter-notebook matplotlib pandas python seaborn
Last synced: 08 Jan 2026
https://github.com/dbriane208/omdena-apprenticeship-project
This is part of my contribution to the Omdena apprenticeship program.
data data-science feature-engineering machine-learning
Last synced: 14 Mar 2026
https://github.com/stdlib-js/array-base-to-deduped
Copy elements to a new generic array after removing consecutive duplicated values.
array compress copy data dedupe deduplicate deduplication duplicate generic javascript node node-js nodejs stdlib structure types uniq unique
Last synced: 14 Jun 2025
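The operation described above (copying elements to a new array while dropping consecutive duplicates) can be sketched in Python. This is an illustration of the idea only, not the stdlib JavaScript API; the function name `to_deduped` and the use of `itertools.groupby` are assumptions.

```python
from itertools import groupby


def to_deduped(values):
    """Return a new list in which each run of equal adjacent values is collapsed to one."""
    # groupby yields one (key, group) pair per run of consecutive equal items,
    # so keeping only the keys removes adjacent repeats and nothing else.
    return [key for key, _group in groupby(values)]


if __name__ == "__main__":
    print(to_deduped([1, 1, 2, 2, 2, 1, 3, 3]))  # [1, 2, 1, 3]
```

Only adjacent repeats are removed, so a value that reappears later in the input (the second `1` here) is kept. The input list is left unmodified.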
https://github.com/garcane/global-shipping-analytics-dashboard
This Tableau project provides a comprehensive visual analysis of global sales, shipping costs, and quality metrics across different regions and countries.
data data-analysis data-analyst data-visualization metrics tableau
Last synced: 01 Mar 2026
https://github.com/neelravi/fairtool
A CLI tool for FAIR processing of computational materials science data.
computational data data-analytics fair management materials physics python science
Last synced: 14 Jan 2026
https://github.com/izaaccoding36/dados-dinamicos
This repository presents a website built with an API for creating charts, reporting on social media usage on a global scale
api data redes-sociais social-media website
Last synced: 26 Mar 2025
https://github.com/tomasoak/datahopper
Python package for data engineering and data wrangling
data data-analysis data-engineering data-mining data-science data-structures data-wrangling datascience pandas python
Last synced: 12 Mar 2026
https://github.com/nafisalawalidris/buybuy-e-commerce-company
The BuyBuy E-commerce Company repository is a comprehensive hub for the company's e-commerce platform. It includes source code, documentation, and data analysis insights, providing a data-driven approach to improve customer experience, drive revenue, and inform decision-making.
buybuy cleaning-data company customer-experience data data-analysis decision-making documentation e-commerce excel insights postgresql repository revenue source-code sql
Last synced: 16 Mar 2025
https://github.com/vagnerbellacosa/029_analisededadoscompythonpandas
This lab introduces Pandas, an open-source Python library for data analysis. It gives Python the ability to work with spreadsheet-like data, making it possible to load, manipulate, and combine data quickly, among other functions.
data digital-innovation-one dio jupiter-notebook labs ms-excel panda python
Last synced: 14 May 2026
https://github.com/jmcanterafonseca/leaflet-context-information
A Leaflet plugin + infrastructure for getting access to Context Information (i.e. data) exposed through FIWARE NGSIv2
context data fiware information leaflet map open visualization web
Last synced: 21 Apr 2026
https://github.com/gagolews/clustering-results-v1
A framework for benchmarking clustering algorithms – Benchmark results (for version 1 of the Suite)
benchmark benchmark-datasets clustering data dataset datasets machine-learning
Last synced: 16 Mar 2025
https://github.com/mattythedev01/easydatadb
A quick and easy way to store data!
data database discord-bot discord-js discord-ts discordbot discordjs discordts npm npm-package package quick-db quickdb
Last synced: 13 Apr 2026
https://github.com/mini-ware/mini-ware
Just some very simple markdown for my GitHub profile
codewars ctf data hackthebox javascript markdown minimalistic profile-readme python readme-profile simple stattistics svg
Last synced: 13 Apr 2026
https://github.com/sandipbera35/blogapp.spring.boot
A proof-of-concept blog application in Java Spring Boot and Spring Data JPA with MySQL and MinIO object storage. It integrates with a JWT authservice project (written in Go).
data java jpa jpa-entity-manager jpa-hibernate mysql mysql-server postman postmanapi spring-boot
Last synced: 13 Apr 2026