data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum
- Last updated: 2026-07-03 00:07:49 UTC
https://github.com/open-i18n/data-unicode-math
Git mirror for Unicode Support for Mathematics data
data i18n internationalization math mathematics open-i18n unicode unicode-consortium unicode-data
Last synced: 11 Mar 2026
https://github.com/jcasbin/jcasbin-menu-permission
Casbin Menu Permission Example (Based on jCasbin)
abac acl auth authorization authz casbin data go java jcasbin menu permission rbac spring springboot
Last synced: 11 Jul 2025
https://github.com/stefanbohacek/exploring-the-mapping-police-violence-dataset
Using my Gutenberg Data Visualization plugin to explore police violence against civilians.
data dataviz police police-brutality police-misconduct
Last synced: 03 Dec 2025
https://github.com/khalyomede/fetch
Quickly retrieve your PHP data
config configuration data fetch php php7
Last synced: 15 Mar 2025
https://github.com/bijx/firestore-data-fetcher
A simple Python script to fetch documents from a Firebase Firestore collection and save them to a local `.json` file.
automation data database downloader exporter fetcher firebase firestore open-source script
Last synced: 12 Apr 2026
https://github.com/cqllum/schema2dwh
⚡ Automatically produce a data model for your database from its information schema, using GenAI.
ai data data-structures dataengineering datawarehousing dwh gemini gemini-api genai reporting reporting-tool schema-design
Last synced: 13 Mar 2025
https://github.com/nik-kusanagi/bash.sh-treinamento
A more organized version (more or less)
data database debian gnome gnome-extension gnu gnu-linux linux shell shell-script
Last synced: 05 May 2026
https://github.com/castelao/bufr
BUFR binary data format from WMO
binary data format meteorology oceanography wmo
Last synced: 13 Jul 2025
https://github.com/shivam1808/data-cleaning-project
We take raw housing data and transform it in SQL Server to make it more usable for analysis.
analysis data datacleaning sql sqlserver
Last synced: 29 May 2026
https://github.com/ntia/compound_radar_waveforms-data
Data used by NTIA/ITS TR-23-566 Examining the Effects of Resolution Bandwidth when Measuring Compound Radar Waveforms.
bandwidth data measurement p0n q3n radar resolution stepped waveform
Last synced: 27 Jan 2026
https://github.com/fredhutch/gdscnsoilsites
Homepage for BioDIGS Project. Learn about the project and download data.
biodigs data metagenomics student-research
Last synced: 25 Mar 2025
https://github.com/datenoio/internacia-db
Public registry of intergovernmental organizations, country groups, and countries. Available as JSONL, Parquet, YAML, and DuckDB database datasets.
countries data datasets international international-trade reference
Last synced: 29 May 2026
https://github.com/lmuffato/project-ting-trybe
Ting project - Trybe assessment project for Block 37: Data Structures II: Lists, Queues, and Stacks
data data-analysis python queue read-file stack trybe trybe-projects
Last synced: 12 Jun 2025
https://github.com/azrunguraya/kabyle-corpus-dataset
In the world of Natural Language Processing, access to diverse, well-annotated datasets is essential for developing high-performing models. This project aims to fill this specific gap for Taqbaylit, a Berber language spoken mainly in Kabylia.
ber berber berber-dataset corpus data dataset ia kabyle kabyle-art kb machine-learning nlp nlp-machine-learning python taqbaylit text words
Last synced: 31 Jul 2025
https://github.com/toransahu/metoffice
Data visualisation - MetOffice
data metoffice uk visualization weather
Last synced: 25 Mar 2025
https://github.com/lane-romuald/iot-irrigation-data-collection-system
An IoT-based data collection system using the ESP32 microcontroller programmed with Arduino to monitor environmental conditions for smart irrigation. The system measures soil moisture, temperature, air temperature, humidity, and rain probability. Data is stored locally on an SD card and uploaded to the ThingSpeak platform.
arduino cloud data data-collection esp32 openweather openweathermap thingspeak wi-fi
Last synced: 12 Apr 2026
https://github.com/osiota10/alx-low_level_programming
C Low Level Programming - Data Structures, Linux/Unix System Programming and Algorithms with ALX Software Engineering
algorithms assembly c data data-structures linux shell unix
Last synced: 25 Jun 2025
https://github.com/edugmenes/azure-data-engineering
This repository contains my first end-to-end Data Engineering project, built using Microsoft Azure Cloud and Azure Databricks with PySpark.
azure cloud data data-engineering data-lakehouse data-structures databricks delta-lake etl-pipelines lakehouse lakehouse-architectures medallion-architecture microsoft-azure pyspark spark
Last synced: 29 Jan 2026
https://github.com/eugenedakin/caesarcipher
Native Xojo code for the Caesar Cipher algorithm with an example program
caesar-cipher data decryption encryption xojo
Last synced: 07 Jan 2026
https://github.com/svelterun/store
Persisted version of svelte/store.
data state state-management store svelte svelte-store sveltekit svelterun typescript
Last synced: 08 Jan 2026
https://github.com/fiskeben/meetjescraper
HTTP proxy for Meet je stad project
api data go iot meetjestad proxy scraper weather
Last synced: 29 May 2026
https://github.com/vvipjain/bike-sales-dashboard
Bike Sales Dashboard
dashboards data data-analysis data-cleaning data-normalisation data-visualization excel pivot-chart pivot-tables
Last synced: 04 Feb 2026
https://github.com/e-panourgia/data-science-projects
Data Science Projects
annotations augmentation data data-preprocessing-and-cleaning hyperparameter-tuning llm logistic-regression nlp random-forest-classifier xboost-classifier
Last synced: 09 Apr 2025
https://github.com/avahoffman/dataplay
🤸‍♂️ Load data to play with
data data-package r r-package rstats
Last synced: 25 Mar 2025
https://github.com/stdlib-js/ndarray-base-assert-is-complex-floating-point-data-type
Test if an input value is a supported ndarray complex-valued floating-point data type.
array assert base check data dtype is javascript multidimensional ndarray node node-js nodejs stdlib test types util utilities utility utils
Last synced: 08 Mar 2026
https://github.com/bolajiolayinka/graph-api-automation
An End to End Automation from Facebook Business to Data Visualization of Campaigns
Last synced: 07 May 2025
https://github.com/melinteflxrin/softserve-bigdata-project
End-to-end data warehousing project integrating APIs, ETL workflows, and PostgreSQL for analytics and reporting.
analytics api bigdata data datawarehousing externalapi pipeline postgres postgresql python warehouse
Last synced: 26 Jan 2026
https://github.com/tether/tether-schema
Custom protocol buffer schema for data validation
data protocol schema validation
Last synced: 09 Apr 2025
https://github.com/clinton-mwachia/data-analysis-in-r
Various Analysis in R
data data-science machine-learning machine-learning-algorithms r random-forest rstats
Last synced: 30 Nov 2025
https://github.com/cainmi/data-page-project
A repository to pull code and files from; it may also be used to store page data links, code, etc. Mainly used for Python for now.
data html javascript python schema
Last synced: 21 Oct 2025
https://github.com/desininja/data-engineer-interview-questions
This repository contains all the Data Engineer Interview Questions asked by interviewers.
data data-engineer-interview-questions
Last synced: 31 Mar 2025
https://github.com/eve-ning/osumania_data
Processed osu!mania data from the osu!API
Last synced: 24 Feb 2026
https://github.com/stdlib-js/ndarray-base-to-reversed
Return a new ndarray where the order of elements of an input ndarray is reversed along each dimension.
base data flip javascript matrix ndarray node node-js nodejs reverse slice stdlib structure to-reversed types vector view
Last synced: 12 Apr 2026
https://github.com/stdlib-js/array-float32
Float32Array.
array data float float32 float32array ieee754 javascript node node-js nodejs single single-precision stdlib structure typed typed-array types
Last synced: 14 Jan 2026
https://github.com/devlive-community/mockaroo
A lightweight HTTP mock server for quickly building mock data APIs, suited to front-end and back-end development and API testing scenarios.
Last synced: 08 Jul 2025
https://github.com/shawnduong/pacman-digest
Generate a digest of package space usage for Linux systems using pacman.
Last synced: 13 May 2026
https://github.com/geo-y20/uber-rides-data-analysis
This project aims to analyze Uber ride data to understand various aspects of ride usage, such as the distribution of rides across different categories, purposes, months, days, and times.
dashboard dashboard-templates data data-analysis data-analysis-python data-analytics data-visualization pandas powerbi python recommendation-system rides uber
Last synced: 13 Apr 2026
https://github.com/jigyasag18/gold-price-prediction-project-using-machine-learning
This repository contains a machine learning project focused on predicting gold prices (GLD) using historical stock market data, including indicators such as SPX, USO, SLV, and EUR/USD. The project implements a Random Forest Regressor for accurate price forecasting, complete with data visualization, correlation analysis, and model evaluation metrics
data dataset jupyter-notebook jupyter-notebooks machine-learning machinelearing machinelearningalgorithms machinelearningmodel machinelearningprojects matplotlib mlproject numpy pandas randomforestregressor seaborn
Last synced: 23 Jul 2025
https://github.com/so-cool/uobrain
My solution to the University of Bristol PURE Data Challenge
Last synced: 09 Sep 2025
https://github.com/jaldekoa/fdicapi
A Python wrapper to easily retrieve data from the BankFind Suite official API from FDIC in pandas format.
api api-wrapper banking data finance pandas python united-states
Last synced: 07 Jan 2026
https://github.com/nikhilash45/live_ipl_report
This repository hosts the source code for an interactive IPL (Indian Premier League) Dashboard built using PowerBI. The dashboard provides real-time updates on ongoing matches, including live scores, batting and bowling statistics for both teams, and the points table.
analysts cleaning-data cricket-data dashboard data data-analysis data-visualization dax powerbi
Last synced: 19 Mar 2026
https://github.com/camara94/introduction-to-data-engineering
Describe the different entities that form a modern data ecosystem. Describe and differentiate between the role and responsibilities of Data Engineers, Data Scientists, Data Analysts, Business Analysts, and Business Intelligence Analysts. Explain what Data Engineering is. List the tasks that need to be performed in a typical data engineering lifecycle. Describe what a day in the life of a Data Engineer looks like.
business-analytics business-intelligence data dataingestion dataintegration datascience machinelearning python statistical-analysis
Last synced: 09 Apr 2025
https://github.com/stdlib-js/array-zero-to-like
Generate a linearly spaced numeric array whose elements increment by 1 starting from zero and having the same length and data type as a provided input array.
array data float32array float64array int16array int32array javascript matrix ndarray node node-js nodejs stdlib structure typed typed-array types uint32array vector
Last synced: 07 Jan 2026
https://github.com/danreynolds/data_batcher
Data batcher batches and de-dupes data fetched in the same task of the event loop.
batching data flutter hacktoberfest
Last synced: 19 May 2026
https://github.com/harmanveer-2546/supply-chain
Supply chain analytics is a valuable part of data-driven decision-making in various industries such as manufacturing, retail, healthcare, and logistics. It is the process of collecting, analyzing and interpreting data related to the movement of products and services from suppliers to customers.
customer-segmentation-analysis data data-analysis data-cleaning data-insights ggplot2 numpy pandas performance-evaluation predictive-analytics-for-business python risk-assessment sales-analysis statistical-analysis supply-chain tidyverse trend-analysis
Last synced: 10 Apr 2026
https://github.com/goncaloperes/datavisualization
Here I will share some of my data visualizations using a variety of datasets, technologies and tools.
d3js data dataset datavisualization dataviz ggplot matplotlib rawgraphs seaborn tableau visualization yellowbrick
Last synced: 04 Feb 2026
https://github.com/tylerben/data-spring
Easily generate a dummy dataset based on a provided config
data data-spring datagenerator fake-data generator javascript typescript
Last synced: 27 May 2026
https://github.com/stdlib-js/array-zero-to
Generate a linearly spaced numeric array whose elements increment by 1 starting from zero.
array data float32array float64array int16array int32array javascript matrix ndarray node node-js nodejs stdlib structure typed typed-array types uint32array vector
Last synced: 08 Jan 2026
https://github.com/luminati-io/Twitter-X-dataset-samples
A sample dataset of over 1000 Twitter (X) posts, extracted using the Bright Data API, ideal for trend discovery, brand monitoring, and competitive insights.
api data dataset twitter twitter-api twitter-scraper web-scraping x
Last synced: 09 Apr 2025
https://github.com/cintia0528/data_cleaning_and_analytics-python
Evaluate if aggressive discounting benefits Eniac long-term, considering differing views on customer acquisition and brand positioning. Focus on data cleaning for informed decision-making.
colab-notebook data data-analysis datacleaning dataquality jupyter-notebook matplotlib pandas python seaborn
Last synced: 08 Jan 2026
https://github.com/marabesi/d3-visualization
Different visualizations using data and d3.js
charts css d3js data html js json timeline-chart visualization
Last synced: 01 May 2026
https://github.com/rayyan9477/dep
data data-science machine-learning python visualization web-scraping
Last synced: 08 May 2026
https://github.com/ayushai/salesfoce-hospital-management
A custom Salesforce-based Hospital Management System with powerful dashboards and data analysis tools. It provides real-time insights into patient care, appointment scheduling, and inventory management, optimizing healthcare operations and decision-making.
analytics dashboard data salesforce-developers visualization
Last synced: 22 Feb 2026
https://github.com/spiceai/datasets
Spice AI curated dataset definitions for Spice.ai
ai bitcoin blockchain data ethereum polygon
Last synced: 20 Apr 2026
https://github.com/garcane/global-shipping-analytics-dashboard
This Tableau project provides a comprehensive visual analysis of global sales, shipping costs, and quality metrics across different regions and countries.
data data-analysis data-analyst data-visualization metrics tableau
Last synced: 01 Mar 2026
https://github.com/izaaccoding36/dados-dinamicos
This repository presents a website built with an API for creating charts, reporting on social media usage on a global scale.
api data redes-sociais social-media website
Last synced: 26 Mar 2025
https://github.com/bredalis/scikitlearn
🤖 Library to create ML models 🤖
data ia learning-python librery ml python
Last synced: 30 May 2026
https://github.com/tomasoak/datahopper
Python package for data engineering and data wrangling
data data-analysis data-engineering data-mining data-science data-structures data-wrangling datascience pandas python
Last synced: 12 Mar 2026
https://github.com/xtao-org/tree-annotation
What is TAO
annotation data intercommunication json notation s-expressions simplicity syntax tao tree tree-annotation universal xml
Last synced: 25 May 2026
https://github.com/nafisalawalidris/buybuy-e-commerce-company
The BuyBuy E-commerce Company repository is a comprehensive hub for the company's e-commerce platform. It includes source code, documentation, and data analysis insights, providing a data-driven approach to improve customer experience, drive revenue, and inform decision-making.
buybuy cleaning-data company customer-experience data data-analysis decision-making documentation e-commerce excel insights postgresql repository revenue source-code sql
Last synced: 16 Mar 2025
https://github.com/vagnerbellacosa/029_analisededadoscompythonpandas
This Lab introduces Pandas, an open-source Python library for data analysis. It gives Python the ability to work with spreadsheet-like data, making it possible to load, manipulate, and combine data quickly, among other functions.
data digital-innovation-one dio jupiter-notebook labs ms-excel panda python
Last synced: 14 May 2026
https://github.com/jmcanterafonseca/leaflet-context-information
A Leaflet plugin + infrastructure for getting access to Context Information (i.e. data) exposed through FIWARE NGSIv2
context data fiware information leaflet map open visualization web
Last synced: 21 Apr 2026
https://github.com/gagolews/clustering-results-v1
A framework for benchmarking clustering algorithms – Benchmark results (for version 1 of the Suite)
benchmark benchmark-datasets clustering data dataset datasets machine-learning
Last synced: 16 Mar 2025
https://github.com/mini-ware/mini-ware
Just some very simple markdown for my GitHub profile
codewars ctf data hackthebox javascript markdown minimalistic profile-readme python readme-profile simple stattistics svg
Last synced: 13 Apr 2026
https://github.com/robertopatino1/oscars2023_data_analysis
A deep data science analysis involving tweets regarding the upcoming Academy Awards
data data-analysis-python data-science data-visualization html jupyter-notebook lda-model machine-learning python trends tweepy twitter
Last synced: 24 Apr 2026
https://github.com/programmer-rd-ai/library-management-system-oraclesql
The Library Management System project, part of the CI6320 Advanced Data Modelling coursework, features comprehensive SQL scripts utilizing OracleSQL to facilitate efficient data modeling and management.
adm advanced ci6320 cw data icw library management modelling oracle oraclesql report sql system
Last synced: 29 Oct 2025
https://github.com/programmer-rd-ai/moviedatascraper
Explore the cinematic universe with our IMDb web scraping project! Dive into movie data with ease, uncovering insights from cast to critical reviews. With dynamic visualizations and reliable data, let's journey through the world of movies like never before. Lights, camera, analysis!
beautifulsoup beautifulsoup4 data data-analysis jupyter-notebook matplotlib numpy pandas programming python python3 scraping seaborn software web
Last synced: 01 Mar 2025
https://github.com/basemax/buskool.com-data
This repository contains the collected product data from the Buskool website (باسکول). The data is stored in 20k+ JSON files, each containing detailed information about products available on the website.
buskool buskoolcom data farsi information ir iran json persian
Last synced: 03 Apr 2025
https://github.com/sandipbera35/blogapp.spring.boot
A proof-of-concept blog application in Java Spring Boot and Spring Data JPA, with MySQL and MinIO object storage. It integrates with a JWT authservice project (written in Go).
data java jpa jpa-entity-manager jpa-hibernate mysql mysql-server postman postmanapi spring-boot
Last synced: 13 Apr 2026
https://github.com/unownone/spenddy-link
Simple Privacy Friendly chrome extension to track your spends and more!
Last synced: 12 Mar 2026
https://github.com/ncgl-git/eriparse
Python code to parse the cost-of-living HTML from erieri.com, i.e. https://www.erieri.com/cost-of-living/united-states/illinois/chicago
cost-of-living crime crime-data data economic-research-institute erieri webscraper
Last synced: 14 Jan 2026
https://github.com/aiwithqasim/competitive-programming
I will add all the material I have done, or will do in the future, to improve my programming skills and become a competitive programmer.
c-plus-plus code data java programming structured-data
Last synced: 20 May 2026
https://github.com/bunnysunny24/bluepulse
A Smart Water Management System
data data-processing data-visualization firebase iot machine-learning mysql-database reactjs
Last synced: 17 Mar 2025
https://github.com/s-raza/csvio
Wrapper for conveniently processing CSV files
csv data file processing wrapper
Last synced: 14 Jan 2026
https://github.com/avto-dev/static-references-data
Data for static references
Last synced: 05 Oct 2025
https://github.com/wangshouh/cryptofinancedata
An ipynb file containing data acquisition of futures, options and other financial derivatives
Last synced: 05 Oct 2025
https://github.com/igorwastaken/math-problems
Solve math problems easily with this utility library.
algorithm area data demography geography javascript math npm package population school typescript util utils
Last synced: 23 Feb 2026
https://github.com/iwconfig/svtplay-data
Daily JSON backup of content metadata from SVTPlay
data metadata streamlink svtplay svtplay-dl youtube-dl
Last synced: 24 Oct 2025
https://github.com/hyperversal-blocks/averveil
Averveil is OpenSea for Data.
blockchain data golang iot privacy zero-knowledge zkp
Last synced: 14 Jan 2026
https://github.com/joocer/data_expectations
Are your data meeting your expectations?
data data-engineering data-quality data-science data-unit-tests observability pipelines quality validation
Last synced: 07 Oct 2025
https://github.com/ahmad-ali-rafique/comment-generation-tool
This repository hosts a Jupyter Notebook-based Comment Generation Tool exploring advanced NLP techniques for automated, contextually relevant comment generation from input data. Ideal for developers and researchers in NLP and automated text generation.
ai aitools artificial-intelligence content-based-recommendation data datascience jupyter-notebook machine-learning
Last synced: 07 Oct 2025