data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum
- Last updated: 2026-07-02 00:07:45 UTC
https://github.com/j1sk1ss/dateapppc.exmpl
A simple native Windows application demonstrating OOP and SQL databases, using a dating app as the example.
data oop-principles parsing pgadmin4 sql wpf
Last synced: 11 Apr 2026
https://github.com/rastmob/wordpress-llms-output-plugin
A WordPress plugin to export posts, pages, and custom post types as JSON for training Language Models (LLMs).
ai data llm llms training training-data wordpress wordpress-development wordpress-plugin
Last synced: 03 May 2026
https://github.com/marek-jakub/monitoring
A university project concerning field data management for bird ringers.
bird data fieldwork management ringing
Last synced: 24 Jun 2026
https://github.com/bdpedigo/neuropull
A (soon to be) lightweight Python package for accessing single-cell connectome networks with metadata.
connectome connectomes connectomics data dataset networks networks-biology
Last synced: 05 Oct 2025
https://github.com/woctezuma/geforce-leak
Fetch data from the GeForce leak.
data datamining egs epic epic-games epic-games-launcher epic-games-store geforce geforce-experience geforce-leak geforce-now geforce-now-leak geforcenow geforcenow-leak graphql leak leaks nvidia steam steam-games
Last synced: 02 May 2026
https://github.com/gauravkoradiya/tensorflow-data-and-deployement
This repository demonstrates the use of data and deployment pipelines in TensorFlow.
data deployment machine-learning-algorithms pipline tensorflowjs
Last synced: 06 Oct 2025
https://github.com/henrylin03/video-games
Using Python and SQL to clean, analyse and visualise video games' data from Metacritic. Includes scraping using BeautifulSoup.
analysis beautifulsoup beautifulsoup4 data data-analysis data-science eda jupyter-notebook pandas python sql sqlite3 video-game video-games
Last synced: 14 Apr 2026
https://github.com/imranhsayed/programming-in-c
Programming in C
array c c-programming circular-linked-list cprogramming data data-structures-and-algorithms file-handling linked-list pointers
Last synced: 28 Jan 2026
https://github.com/pommes-public/pommesdata
A full-featured transparent data preparation routine from raw data to POMMES model inputs
data opensource power raw-data transparent
Last synced: 07 Oct 2025
https://github.com/mark-summerfield/uxf
Uniform eXchange Format (uxf) is a plain text human readable optionally typed storage format that supports custom types. It may serve as a convenient alternative to csv, ini, json, sqlite, toml, xml, or yaml.
data ini json parser pretty-printer sqlite storage-engine toml xml yaml
Last synced: 08 Oct 2025
https://github.com/p32929/use-megamind
A simple react hook for managing asynchronous function calls with ease on the client side
async asynchronous-tasks axios client-side-javascript data data-fetching easy fetch generics hooks javascript npm painless promise query react rest simple small typescript
Last synced: 23 Jan 2026
https://github.com/woo071002/parcel-management-system
A Parcel Delivery Management System streamlining deliveries with features for admin, users, and delivery personnel, including real-time tracking, delivery requests, and personalized dashboards.
cors csharp data dotenv html-css iconfont jkuat land-information-system mongodb python react-router-dom sass tech-expo xaml
Last synced: 08 Oct 2025
https://github.com/doctorlai/hex-viewer
Simple File Viewer in HEX
application data files hacktoberfest hex-viewer hexeditor hexidecimal web-app
Last synced: 09 Oct 2025
https://github.com/gbv/cocoda-mappings
concordances, mappings and conversion scripts to create JSKOS mappings
Last synced: 28 Oct 2025
https://github.com/toransahu/metoffice
Data visualisation - MetOffice
data metoffice uk visualization weather
Last synced: 25 Mar 2025
https://github.com/lmuffato/project-ting-trybe
Ting project - Trybe assessment project for Block 37: Data Structures II: Lists, Queues, and Stacks
data data-analysis python queue read-file stack trybe trybe-projects
Last synced: 12 Jun 2025
https://github.com/svelterun/store
Persisted version of svelte/store.
data state state-management store svelte svelte-store sveltekit svelterun typescript
Last synced: 08 Jan 2026
https://github.com/datenoio/internacia-db
Public registry of intergovernmental organizations, country groups, and countries. Available as JSONL, Parquet, YAML, and DuckDB database datasets.
countries data datasets international international-trade reference
Last synced: 29 May 2026
https://github.com/vapourismo/binary-io
Read and write values of types that implement Binary from and to Handles
data haskell haskell-library io parsing
Last synced: 28 Mar 2025
https://github.com/agahkarakuzu/datavis_edu
Presented in BrainHack School 2019-2020, QBIN SciComm 2021
binder dashboard data notebooks repo2docker visualization
Last synced: 01 Apr 2025
https://github.com/quasilyte/phpcorpus
A collection of various PHP code; useful for PHP tool writers who want insight into what "real-world" PHP code looks like
analysis corpus data php php-corpus
Last synced: 04 Jul 2025
https://github.com/e-panourgia/data-science-projects
Data Science Projects
annotations augmentation data data-preprocessing-and-cleaning hyperparameter-tuning llm logistic-regression nlp random-forest-classifier xboost-classifier
Last synced: 09 Apr 2025
https://github.com/fredhutch/gdscnsoilsites
Homepage for BioDIGS Project. Learn about the project and download data.
biodigs data metagenomics student-research
Last synced: 25 Mar 2025
https://github.com/bolajiolayinka/graph-api-automation
An End to End Automation from Facebook Business to Data Visualization of Campaigns
Last synced: 07 May 2025
https://github.com/thiagopanini/datadelivery
An open source Terraform module that provides a complete infrastructure toolkit so users can begin exploring Analytics services on AWS.
analytics athena aws catalog crawler data datamesh glue s3 terraform
Last synced: 29 Nov 2025
https://github.com/castelao/bufr
BUFR binary data format from WMO
binary data format meteorology oceanography wmo
Last synced: 13 Jul 2025
https://github.com/whitehathackerpr/data-visualization-tool
This is a Python-based web application that allows users to upload datasets, analyze data, and create visualizations interactively. The tool is designed for ease of use and provides a simple interface to perform basic data analysis and generate visualizations
data data-analysis data-visualization python python3
Last synced: 05 Sep 2025
https://github.com/cainmi/data-page-project
A repository to pull code and files from; it may also be used to store page data links, code, etc. Mainly used for Python for now.
data html javascript python schema
Last synced: 21 Oct 2025
https://github.com/bredalis/datastructure
📚 Data Structures in Python
algorithms data data-structure python
Last synced: 12 Apr 2026
https://github.com/eve-ning/osumania_data
processed osu!mania data from osu!API
Last synced: 24 Feb 2026
https://github.com/devlive-community/mockaroo
A lightweight HTTP mock server for quickly building mock data endpoints, suited to front-end and back-end development and API testing.
Last synced: 08 Jul 2025
https://github.com/geo-y20/uber-rides-data-analysis
This project aims to analyze Uber ride data to understand various aspects of ride usage, such as the distribution of rides across different categories, purposes, months, days, and times.
dashboard dashboard-templates data data-analysis data-analysis-python data-analytics data-visualization pandas powerbi python recommendation-system rides uber
Last synced: 13 Apr 2026
https://github.com/toransahu/excel-implementation-of-regression-clustering
B.Tech. Major Project
btech-project-proposal clustering data kmeans-clustering machine-learning mining regression
Last synced: 25 Mar 2025
https://github.com/jigyasag18/gold-price-prediction-project-using-machine-learning
This repository contains a machine learning project focused on predicting gold prices (GLD) using historical stock market data, including indicators such as SPX, USO, SLV, and EUR/USD. The project implements a Random Forest Regressor for accurate price forecasting, complete with data visualization, correlation analysis, and model evaluation metrics
data dataset jupyter-notebook jupyter-notebooks machine-learning machinelearing machinelearningalgorithms machinelearningmodel machinelearningprojects matplotlib mlproject numpy pandas randomforestregressor seaborn
Last synced: 23 Jul 2025
https://github.com/bijx/firestore-data-fetcher
A simple Python script to fetch documents from a Firebase Firestore collection and save them to a local `.json` file.
automation data database downloader exporter fetcher firebase firestore open-source script
Last synced: 12 Apr 2026
https://github.com/nikhilash45/live_ipl_report
This repository hosts the source code for an interactive IPL (Indian Premier League) Dashboard built using PowerBI. The dashboard provides real-time updates on ongoing matches, including live scores, batting and bowling statistics for both teams, and the points table.
analysts cleaning-data cricket-data dashboard data data-analysis data-visualization dax powerbi
Last synced: 19 Mar 2026
https://github.com/camara94/introduction-to-data-engineering
Describe the different entities that form a modern data ecosystem. Describe and differentiate between the role and responsibilities of Data Engineers, Data Scientists, Data Analysts, Business Analysts, and Business Intelligence Analysts. Explain what Data Engineering is. List the tasks that need to be performed in a typical data engineering lifecycle. Describe what a day in the life of a Data Engineer looks like.
business-analytics business-intelligence data dataingestion dataintegration datascience machinelearning python statistical-analysis
Last synced: 09 Apr 2025
https://github.com/varbrad/mindb
🗄 🔍 ⚡️ Schema-less document-oriented collection model data-store for Node & Browsers.
browser data datastore db document javascript json-schema mongo mongodb nodejs nosql query schema
Last synced: 13 Apr 2026
https://github.com/gkapfham/ast2016-paper
Source Code of and Supporting Files for a Paper Published at AST 2016
data latex-document paper research
Last synced: 19 Oct 2025
https://github.com/nodef/infoods
Kit for International Network of Food Data Systems (INFOODS).
component data food identifier infoods international network systems tagnames
Last synced: 11 Mar 2026
https://github.com/danreynolds/data_batcher
Data batcher batches and de-dupes data fetched in the same task of the event loop.
batching data flutter hacktoberfest
Last synced: 19 May 2026
https://github.com/goncaloperes/datavisualization
Here I will share some of my data visualizations using a variety of datasets, technologies and tools.
d3js data dataset datavisualization dataviz ggplot matplotlib rawgraphs seaborn tableau visualization yellowbrick
Last synced: 04 Feb 2026
https://github.com/stdlib-js/array-zero-to
Generate a linearly spaced numeric array whose elements increment by 1 starting from zero.
array data float32array float64array int16array int32array javascript matrix ndarray node node-js nodejs stdlib structure typed typed-array types uint32array vector
Last synced: 08 Jan 2026
https://github.com/luminati-io/Twitter-X-dataset-samples
A sample dataset of over 1000 Twitter (X) posts, extracted using the Bright Data API, ideal for trend discovery, brand monitoring, and competitive insights.
api data dataset twitter twitter-api twitter-scraper web-scraping x
Last synced: 09 Apr 2025
https://github.com/open-i18n/data-unicode-math
Git mirror for Unicode Support for Mathematics data
data i18n internationalization math mathematics open-i18n unicode unicode-consortium unicode-data
Last synced: 11 Mar 2026
https://github.com/marabesi/d3-visualization
Different visualizations using data and d3.js
charts css d3js data html js json timeline-chart visualization
Last synced: 01 May 2026
https://github.com/rayyan9477/dep
data data-science machine-learning python visualization web-scraping
Last synced: 08 May 2026
https://github.com/ayushai/salesfoce-hospital-management
A custom Salesforce-based Hospital Management System with powerful dashboards and data analysis tools. It provides real-time insights into patient care, appointment scheduling, and inventory management, optimizing healthcare operations and decision-making.
analytics dashboard data salesforce-developers visualization
Last synced: 22 Feb 2026
https://github.com/dbriane208/omdena-apprenticeship-project
This is part of my contribution to the Omdena apprenticeship program.
data data-science feature-engineering machine-learning
Last synced: 14 Mar 2026
https://github.com/ronaldkanyepi/python-streamlit-covid-19-dashboard
This is a responsive Streamlit COVID-19 dashboard
analytics data data-analysis data-visualization datascience python streamlit
Last synced: 18 May 2026
https://github.com/spiceai/datasets
Spice AI curated dataset definitions for Spice.ai
ai bitcoin blockchain data ethereum polygon
Last synced: 20 Apr 2026
https://github.com/neelravi/fairtool
A CLI tool for FAIR processing of computational materials science data.
computational data data-analytics fair management materials physics python science
Last synced: 14 Jan 2026
https://github.com/wioniqle-q/tower-modelling
Data science
data data-science ndarray-odeint ndjson science
Last synced: 16 Mar 2025
https://github.com/dwidevelopes/database-input-pelanggran-mahasiswa
Enters data on students who have committed violations, so they can be recorded, disciplined, and made subject to sanctions.
aplikasi aplikasi-sekolah data data-analysis database input-method mahasiswa sekolah siswa siswi website
Last synced: 02 May 2026
https://github.com/metriccoders/metriccoders_datasets
This is the Metric Coders repository containing all the datasets for machine learning.
data datasets machine-learning natural-language-processing scikit-learn
Last synced: 08 Apr 2025
https://github.com/bredalis/scikitlearn
🤖 Library to create ML models 🤖
data ia learning-python librery ml python
Last synced: 30 May 2026
https://github.com/xtao-org/tree-annotation
What is TAO
annotation data intercommunication json notation s-expressions simplicity syntax tao tree tree-annotation universal xml
Last synced: 25 May 2026
https://github.com/nafisalawalidris/buybuy-e-commerce-company
The BuyBuy E-commerce Company repository is a comprehensive hub for the company's e-commerce platform. It includes source code, documentation, and data analysis insights, providing a data-driven approach to improve customer experience, drive revenue, and inform decision-making.
buybuy cleaning-data company customer-experience data data-analysis decision-making documentation e-commerce excel insights postgresql repository revenue source-code sql
Last synced: 16 Mar 2025
https://github.com/mftnakrsu/crm-rfm-analysis
CRM-RFM-Analysis
ai crm data data-analysis data-science deep-learning machine-learning python rfm rfm-analysis
Last synced: 16 Mar 2025
https://github.com/vagnerbellacosa/029_analisededadoscompythonpandas
This lab introduces the Pandas library, an open source Python library for data analysis. It gives Python the ability to work with spreadsheet-like data, allowing data to be loaded, manipulated, and combined quickly, among other functions.
data digital-innovation-one dio jupiter-notebook labs ms-excel panda python
Last synced: 14 May 2026
https://github.com/jmcanterafonseca/leaflet-context-information
A Leaflet plugin + infrastructure for getting access to Context Information (i.e. data) exposed through FIWARE NGSIv2
context data fiware information leaflet map open visualization web
Last synced: 21 Apr 2026
https://github.com/gagolews/clustering-results-v1
A framework for benchmarking clustering algorithms – Benchmark results (for version 1 of the Suite)
benchmark benchmark-datasets clustering data dataset datasets machine-learning
Last synced: 16 Mar 2025
https://github.com/mini-ware/mini-ware
Just some very simple markdown for my GitHub profile
codewars ctf data hackthebox javascript markdown minimalistic profile-readme python readme-profile simple stattistics svg
Last synced: 13 Apr 2026
https://github.com/robertopatino1/oscars2023_data_analysis
A deep data science analysis involving tweets regarding the upcoming Academy Awards
data data-analysis-python data-science data-visualization html jupyter-notebook lda-model machine-learning python trends tweepy twitter
Last synced: 24 Apr 2026
https://github.com/programmer-rd-ai/library-management-system-oraclesql
The Library Management System project, part of the CI6320 Advanced Data Modelling coursework, features comprehensive SQL scripts utilizing OracleSQL to facilitate efficient data modeling and management.
adm advanced ci6320 cw data icw library management modelling oracle oraclesql report sql system
Last synced: 29 Oct 2025
https://github.com/sandipbera35/blogapp.spring.boot
A proof-of-concept blog application in Java Spring Boot and Spring Data JPA, with MySQL and MinIO object storage; it integrates with a JWT authservice project (written in Go).
data java jpa jpa-entity-manager jpa-hibernate mysql mysql-server postman postmanapi spring-boot
Last synced: 13 Apr 2026
https://github.com/rapter1990/data-visualization-examples
Data Visualization Examples
data data-analysis data-visualization folium matplotlib plot plotly python seaborn visualization
Last synced: 13 Apr 2026
https://github.com/vincentlaucsb/csv-data
A curated repository of real and fake CSV data for use in testing suites
Last synced: 08 Mar 2026
https://github.com/s-raza/csvio
Wrapper for conveniently processing CSV files
csv data file processing wrapper
Last synced: 14 Jan 2026
https://github.com/dixslyf/nbparts
Unpack a Jupyter notebook into its sources, outputs and metadata.
data haskell jupyter jupyter-notebook nix nix-flake
Last synced: 05 Oct 2025
https://github.com/diegoperea20/own_dataset_segmentation_yolov8
Object segmentation and detection with a custom dataset using YOLOv8; the custom dataset is of a 2023 Colombian 200-peso coin.
coins colombia data opencv own python segmentation tensorflow yolov8
Last synced: 12 Apr 2026
https://github.com/helins/ex.clj
Java exceptions as clojure data
clojure data exception java java-exceptions
Last synced: 12 Dec 2025
https://github.com/jahilldev/immutable-parsejs
Parse a JS object or array/map into an Immutable collection. Makes use of ImmutableJs List, and Record primitives.
data immutablejs javascript json nodejs parse typescript
Last synced: 13 Apr 2026
https://github.com/joocer/data_expectations
Are your data meeting your expectations?
data data-engineering data-quality data-science data-unit-tests observability pipelines quality validation
Last synced: 07 Oct 2025
https://github.com/ahmad-ali-rafique/comment-generation-tool
This repository hosts a Jupyter Notebook-based Comment Generation Tool exploring advanced NLP techniques for automated, contextually relevant comment generation from input data. Ideal for developers and researchers in NLP and automated text generation.
ai aitools artificial-intelligence content-based-recommendation data datascience jupyter-notebook machine-learning
Last synced: 07 Oct 2025
https://github.com/nikoshet/rust-dms-cdc-operator
The rust-dms-cdc-operator is a Rust-based utility for comparing the state of a list of tables in an Amazon RDS database with data stored in Parquet files on Amazon S3, particularly useful for change data capture (CDC) scenarios.
aws cdc data dms parquet pgdatadiff polars postgres rds rust s3 validation
Last synced: 18 Jan 2026
https://github.com/ryanjoy0000/yt-notifier
YouTube Notifier (Telegram Bot) - A real-time data processing pipeline
data go kafka-streams real-time telegram-api youtube-api
Last synced: 14 Jan 2026
https://github.com/patrickdavies100/datapipeline37
Some Data Science practice using datasets available online. Currently test data is similar to this dataset: https://www.kaggle.com/datasets/asaniczka/amazon-uk-products-dataset-2023 but the plan is to expand.
data data-science pandas-dataframe python3
Last synced: 08 Oct 2025
https://github.com/jakakokosar/bioinformatics-serverfiles
Knowledge base for Orange3-bioinformatics add-on
bioinformatics data dictybase gene genesets go homologene markergenes ncbi serverfiles
Last synced: 16 Apr 2026
https://github.com/varun-khorgade/sentimentscope-e-commerce-review-analyzer
Analyzed customer reviews and purchase data to extract sentiment and behavioral insights. Built SQL-based ETL for data preparation and visualized results using Python and Power BI dashboards for actionable business decisions.
analytics customer-beheviour dashboard data data-visualization dataextraction natural-language-processing nlp pandas powerbi python sentiment-analysis sql textblob
Last synced: 17 Apr 2026
https://github.com/rafaelfloressouza/Covid-19-Dashboard
Python web application to display worldwide COVID-19 data using Plotly and Dash
bootstrap covid-19 css data datavisualization plotly-dash python3
Last synced: 10 Mar 2025
https://github.com/east-empire-trading-company/eetc-data-client
Client library for retrieving data managed by EETC Data Hub.
client-library data data-science finance library python
Last synced: 31 May 2026