data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-06-29 00:07:49 UTC
- JSON Representation
https://github.com/banbord/data-vis-tornados
This repository includes data files, processing scripts, visualization code, and documentation for our tornado data visualization project. It aims to provide insights into tornado patterns across the United States using interactive and informative visual representations.
d3-visualization d3js data javascript json visualization
Last synced: 24 Feb 2026
https://github.com/ballerina-platform/module-ballerina-data.csv
The Ballerina CSV Data Library is a comprehensive toolkit designed to facilitate the handling and manipulation of CSV data within Ballerina applications. It streamlines the process of converting CSV data to native Ballerina data types, enabling developers to work with CSV content seamlessly and efficiently.
ballerina ballerina-csv csv csv-data data
Last synced: 29 Jan 2026
https://github.com/cerema/groum
Utilitaire en ligne de commande pour convertir les données d'arrêtés de circulation
Last synced: 06 Feb 2026
https://github.com/peterdavehello/nrd-list-archive
🌐📂 A collection of past NRD lists to explore—perfect for fun, research, or just plain curiosity! 🎉🔍✨
Last synced: 17 Mar 2026
https://github.com/eesunmoon/algorithms
[Fall 2020] Algorithms
algorithms algorithms-and-data-structures c data data-structures
Last synced: 01 Feb 2026
https://github.com/jaldekoa/nyfedapi
A Python wrapper to easily retrieve data from the Federal Reserve Bank of New York (FRBoNY) official API in pandas format.
api api-wrapper banking data finance pandas python united-states
Last synced: 08 Feb 2026
https://github.com/nononoexe/setariaviridis
🌾 Field-collected data of green foxtail
data data-science dataset rpackage
Last synced: 27 Feb 2026
https://github.com/planarnetwork/feeds.planar.network
GTFS feeds for bus, train and plane
data feeds gtfs transit transportation
Last synced: 11 Feb 2026
https://github.com/bredalis/kpopnews
A place to see kpop news 📝
backend css data feedparser flask frameworks frontend html jinja2 kpop mongodb mongodb-atlas news newsletter os pages pymongo python requests web
Last synced: 12 Feb 2026
https://github.com/achraf-oujjir/chatgpt-users-tweets-pipeline
🐦🔵End-to-end ChatGPT Users' Tweets Data Pipeline with Python 🐍, Hive 🐝, and Power BI 📊
bash-script cloudera data data-engineering data-vizualisation datawarehouse hdfs hive networking powerbi python sentiment-analysis sftp shell tweepy twitter-api ubuntu virtualization vmware-workstation
Last synced: 28 Feb 2026
https://github.com/colour-science/colour-hdri-examples-datasets
Colour - HDRI - Examples Datasets
color color-science color-space color-spaces colorspace colorspaces colour colour-science colour-space colour-spaces colourspace colourspaces data dataset datasets hdr hdri raw tone-mapping tonemapping
Last synced: 19 Mar 2026
https://github.com/colour-science/colour-demosaicing-tests-datasets
Colour - Demosaicing - Tests Datasets
color color-science color-space color-spaces colorspace colorspaces colour colour-science colour-space colour-spaces colourspace colourspaces data dataset datasets de-mosaicing debayering demosaicing demosaicking raw
Last synced: 19 Mar 2026
https://github.com/mihasm/arso-scraper
Unofficial Python CLI tool for downloading automated sensor weather data from the Slovenian Environment Agency.
api arso cli data historical-data meteorological python slovenia weather
Last synced: 14 Feb 2026
https://github.com/rajatt95/python_rs
Programming | Python | PyCharm | Data Types | Tuple | Dictionary | If-Else | Loops - For, While | Functions | OOPS Principles | Constructor | String - SubString, Concatenation, Split, Strip | Read & Write data into files | JSON Parsing | CSV package | Web Scrapping
constructor csv-parser data dictionary functions if-else-statements json json-parser oops parser pycharm-ide python python-programming-language read-write-file strings tuple web-scrapping
Last synced: 15 Feb 2026
https://github.com/ngambip/diabetes_factors_2024
Exploring BMI Categories and Health Factors.
dashboards data datacleaning dax-languague powerbi sql sqlstudio tsql visualization
Last synced: 03 Mar 2026
https://github.com/csadorf/pydata-ann-arbor-2018
Slides and notebooks demonstrating signac for PyData Ann Arbor Meetup 2018
data data-management jupyter signac workflow
Last synced: 04 Jun 2026
https://github.com/slashdotted/pomapure
PoorMan's Pipeline
data json modular module pipeline processing
Last synced: 18 Apr 2026
https://github.com/pitmonticone/covid-italy
References for COVID-19 situation in Italy.
coronavirus covid-19 covid-19-italy data data-analysis documentation testing
Last synced: 05 Apr 2026
https://github.com/kuraydev/react-native-modal-data-passing
Seamless Data Passing to React-Native-Modal
android apple data data-passing google hook ios modal react react-native react-native-modal useimperativehandle
Last synced: 18 Apr 2026
https://github.com/0xdir/relief_web_dart
A Future-based wrapper around the Relief Web API, to retrieve information on humanitarian news, reports, training, jobs, and disasters
api dart data humanitarian jobs
Last synced: 11 Jun 2026
https://github.com/nafisalawalidris/advanced-fraud-detection-with-anomaly-detection
This repository demonstrates how to build a robust fraud detection system that combines supervised learning techniques with anomaly detection models. It provides end-to-end implementation, from data preprocessing and model training to deploying a real-time fraud detection API using FastAPI.
anomaly-detection creditcardfrauddetection data dataanalytics fastapi fraud-detection machinelearning modeldeployment python supervised-machine-learning unsupervised-machine-learning
Last synced: 20 Apr 2026
https://github.com/tomwhite/chernoff
A visual mood indicator. One of the first Java programs I ever wrote.
chernoff-faces data visualization
Last synced: 20 Apr 2026
https://github.com/rcourivaud/rcourivaud.github.io
Raphaël Courivaud
data database datascience python
Last synced: 21 Apr 2026
https://github.com/kefniark/kaaya
JS Library for State management and Data synchronization between Applications
data game kaaya mutation network serialization state-management
Last synced: 06 Jun 2026
https://github.com/14richa/patient-readmission-analysis
This project focuses on predictive modeling to foresee hospital readmissions of diabetic patients within 30 days post-discharge. By leveraging a dataset spanning a decade (1999-2008) and covering records from 130 US hospitals, the aim is to enhance healthcare management and patient outcomes.
analytics data jupyter-notebook numpy
Last synced: 29 Apr 2026
https://github.com/andrey-tech/data-storage-php
Простое хранилище данных в виде ключ-значение в JSON-файлах с разделяемой блокировкой на чтение и эксклюзивной блокировкой на запись.
data data-storage files json php php7 storage
Last synced: 29 Apr 2026
https://github.com/dongminlee94/data-visualization-tutorial
A repository for data visualization tutorial
data data-science data-visualization matp matplotlib pca plotly python seaborn t-sne tutorial umap visualization
Last synced: 29 Apr 2026
https://github.com/noklam/blog_archive_fastpage
Nok's data science blog
blog data data-science machine-learning python sceince
Last synced: 01 May 2026
https://github.com/banyan-team/banyan-julia-examples
Adventures in massively parallel cloud computing with Banyan Julia!
banyan data data-analytics data-processing data-science julia
Last synced: 02 May 2026
https://github.com/assem-elqersh/creativa-data-science-bootcamp
Jupyter notebooks from the Creativa Data Science Bootcamp, covering key data science concepts and practices across multiple sessions, from data preprocessing to model building and time series analysis.
data data-science eda exploratory-data-analysis machine-learning pandas time-series-analysis xgboost xgboost-classifier
Last synced: 03 May 2026
https://github.com/praveenpuglia/css-support
The source of truth for CSS browser support of info
api browser compatibility css data properties selectors support
Last synced: 31 Mar 2025
https://github.com/stefen-taime/real-time-data-pipeline-snake-game
Dynamic Snake Game: Unleashing Real-Time Streaming Analytics with Redis, Kafka, Flink, ClickHouse & Chart.js in an Online Snake Game via Flask API
chartjs clickhouse confluent-cloud data flask kafka-streams pipeline redis
Last synced: 04 May 2026
https://github.com/blakedrumm/scvmm-scripts-and-sql
The Scripts provided here are compatible with System Center Virtual Machine Manager
collector data powershell scripts scvmm sql
Last synced: 11 May 2025
https://github.com/marek-jakub/monitoring
A university project concerning field data management for bird ringers.
bird data fieldwork management ringing
Last synced: 24 Jun 2026
https://github.com/luminati-io/Pinterest-dataset-samples
Two sample datasets of over 1000 Pinterest profiles and posts, extracted using the Bright Data API, ideal for market research, influencer marketing, and product development.
data data-extraction data-mining database datasets pinterest pinterest-api structured-data web-scraping
Last synced: 09 Apr 2025
https://github.com/evoluteur/web-scraper-sitemaps
Sitemaps for the Web Scraper Chrome extension.
chrome-extension data dataset scraper scraping scrapper scrapping scrapy-crawler sitemap web-scraper web-scraping
Last synced: 04 Jun 2026
https://github.com/acaciaman/db-autotest
DB Database test automation. This python package allows to create database object structure and load data from database.
Last synced: 05 May 2026
https://github.com/junkwaxhero/cardlists
Sports Card set lists in easily consumable JSON Format for databases, apps, websites, and more!
baseball baseball-cards baseball-data bowman data dataset datasets donruss fleer json json-schema panini topps upper-deck
Last synced: 24 Apr 2025
https://github.com/asirihewage/simplest-xpath-web-scraper
Simplest web scraper created using Python3 and MongoDB
data data-mining python3 scraper web webscrping
Last synced: 29 Jan 2026
https://github.com/techbureau/zaifdata
:blue_book: Data Reader for zaif Exchange
bitcoin blockchain cryptocurrency data exchange nem token trading xem zaif
Last synced: 19 Apr 2026
https://github.com/lafayettegabe/g2m-insight-for-cab-investment-firm
📊 Exploratory Data Analysis (EDA) on multiple datasets related to the cab industry in the US, to provide actionable insights and recommendations to a private firm looking to invest in the market. The analysis includes data cleaning, transformation, visualization, and hypothesis testing.
big-data data data-analysis data-science data-visualization eda gotomarket
Last synced: 13 Jun 2025
https://github.com/ozanarkancan/sailx
This repo contains the code for generating artificial navigational instruction following data.
data grounded-language-learning
Last synced: 08 Jan 2026
https://github.com/pommes-public/pommesdata
A full-featured transparent data preparation routine from raw data to POMMES model inputs
data opensource power raw-data transparent
Last synced: 07 Oct 2025
https://github.com/jesusgraterol/bitcoin-blockchain-dataset-builder
The dataset builder script extracts all the relevant block information from the Bitcoin Blockchain through Mempool.space's public API. The data is stored in a .csv file, facilitating its use in data science and machine learning projects.
bitcoin blockchain blockchain-technology data datascience datascience-machinelearning dataset dataset-generation machine-learning
Last synced: 06 May 2026
https://github.com/purarue/listenbrainz_export
Export your scrobbling history from ListenBrainz
data data-export music scrobbling
Last synced: 24 Jan 2026
https://github.com/ashwinpn/visualization
Data Visualization using Matplotlib, Pandas Visualization, Seaborn, ggplot, and Plotly.
analysis data data-analysis data-science data-visualization graphs plots python python3 visualization
Last synced: 13 Apr 2026
https://github.com/financejs/discord-bot
A Discord Bot Used In Financejs Discord Server
data discord discord-bot discordjs-bot finance financejs financial
Last synced: 13 Apr 2026
https://github.com/rulox/faker
A Go library to create Fake Data for your projects
data dummy dummy-data fake fake-data faker go golang
Last synced: 28 May 2026
https://github.com/ymougenel/referencecollector
Helps you gather, store and share references links
ansible data docker keycloak kotlin spring-boot thymeleaf
Last synced: 14 Apr 2026
https://github.com/freebirdscrew/datastructures_python
Data Structures Implementation in Python and Explains each Steps.
data data-visualization datascience datastructures datastructures-algorithms datastructures-algorithms-python datastructures-implementation datastructuresandalgorithm freebirdscrew programming python simranjeet simranjeetsingh
Last synced: 16 Apr 2026
https://github.com/fabriciopsouza/covid-19-demographic-social-dataset
A social demographic dataset for analysis of the COVID-19 pandemic.
alteryx coronavirus coronavirus-analysis coronavirus-dataset covid-19 covid19 covid19-data data data-science dataset enrichment-analysis timeseries timeseries-analysis timeseries-clustering timeseries-covid-19 timeseries-database timeseries-segmentation timeseriesclassification
Last synced: 31 May 2026
https://github.com/mrnazu/eth-data-library
eth-data-library is a Nodejs library that provides tools for accessing and processing data on the Ethereum blockchain.
blockchain data ethereum nodejs smart-contracts web3
Last synced: 28 Jan 2026
https://github.com/ayemunhossain/firebase-realtime-db-advance-query
Firebase real time database, query with nodejs.
ayemunhossain data firebase firebase-functions firebase-realtime-database nodejs query
Last synced: 06 May 2026
https://github.com/freight-trust/edi-onboarding
ESC Guidelines for X12/EDIFACT Messages
b2b data data-interchange edi edi-xml edifact enterprise x12
Last synced: 04 Mar 2026
https://github.com/woo071002/parcel-management-system
A Parcel Delivery Management System streamlining deliveries with features for admin, users, and delivery personnel, including real-time tracking, delivery requests, and personalized dashboards.
cors csharp data dotenv html-css iconfont jkuat land-information-system mongodb python react-router-dom sass tech-expo xaml
Last synced: 08 Oct 2025
https://github.com/aisurjyasamantaray/sales-perfomance-analysis-dashboard
A comprehensive sales performance analysis dashboard built using Python, and visualization tools. This project includes data cleaning, descriptive statistics, correlation analysis, and insights into sales trends, profitability, and the impact of discounts. Key features include interactive visualizations using Seaborn, and Matplot
analytics annova data data-analysis data-visualization-project dataproject eda hypothesis-testing pandas-dataframe python sales-performance-analysis statistics
Last synced: 04 Apr 2026
https://github.com/codecentric/reedelk-bookingintegrationservice
Example service for the blog post series about Reedelk
api api-gateway data integration integration-flow
Last synced: 16 Oct 2025
https://github.com/nop-dev/learning-js
Esse repositório contem todas as anotações que fiz enquanto estudava um módulo da trilha Explorer da Rocketseat sobre JavaScript. 🔰
data data-structures functions javascript js
Last synced: 17 Apr 2026
https://github.com/norton120/dfmock
Python Pandas DataFrame mock generator. You need mock'd data in a dataframe? this is what you need.
data mock pandas pandas-dataframe python python37
Last synced: 19 Jan 2026
https://github.com/kaos599/apollo-synthetic-data-generator
Apollo is a Python GUI application designed to simplify the complex process of generating random data based on fixed values. It allows users to generate various types of binary datasets, such as Yes/No type questions, by specifying probabilities.
data data-engineering data-generation data-generator data-science faker-library machine-learning tkinter-gui
Last synced: 22 Jul 2025
https://github.com/skywarth/fenrir-wolfpack-simulator
Simulating wolfpack behaviours and future of the pack in an environment using Javascript and data trees.
data data-structures javascript max-heap simulation simulations wolfpack
Last synced: 14 Oct 2025
https://github.com/utrechtuniversity/dataprivacyproject
This is the repository underlying the landing page for the Data Privacy Project @UtrechtUniversity, the Netherlands.
data gdpr open-science privacy rdm research research-data-management utrecht-university
Last synced: 10 Oct 2025
https://github.com/pawelzny/vo
DDD Value Object implementation
data ddd-patterns object python3 value
Last synced: 15 Feb 2026
https://github.com/tayeva/eia-client-python
EIA Open Data API Client - Python
data open-source python python-3 python3
Last synced: 14 Oct 2025
https://github.com/ciscorn/tinygrib2
(experimental) A tiny toolkit for parsing JMA's GRIB2 files.
data grib grib2 meteorology rust weather
Last synced: 26 Apr 2026
https://github.com/natylaza89/covid19-il
Python package which brings a "Facade" interface for the client for using official covid 19 data of israeli data gov. ★19K+ Downloads★
api covid covid19 covid19-data data israel pandas python
Last synced: 13 Apr 2026
https://github.com/openpeeps/zxc-nim
Bindings to the ZXC compression library, a LZ77-based compressor optimized for high decompression speed
archive compression compressor data decompression game-assets lossless lossless-compression lz77 nim nim-bindings nim-package nim-wrapper openpeeps zxc
Last synced: 07 Jun 2026
https://github.com/Duartemartins/dados
Resultados de Eleições Portuguesas por Freguesia
data elections open-data portugal
Last synced: 20 Nov 2025
https://github.com/amethyst-php/customer
A person or an organization that pays for goods or services
amethyst amethyst-package api customer data laravel
Last synced: 11 May 2026
https://github.com/agnosticeng/agx
Query and explore local and remote data with Clickhouse
clickhouse d3 data rust svelte
Last synced: 26 Oct 2025
https://github.com/stdlib-js/datasets-suthaharan-multi-hop-sensor-network
Labeled wireless sensor network data set collected from a multi-hop wireless sensor network deployment using TelosB motes.
data dataset datasets javascript labeled machine-learning ml mote motes network node node-js nodejs outlier outliers sample sensor statistics stats stdlib
Last synced: 10 Oct 2025
https://github.com/nrennie/data
A collection of random datasets, either from web-scraping or processing more complex data.
Last synced: 30 May 2026
https://github.com/chaitanyac22/hr_policy_query_resolution_with_retrieval_augmented_generation_rag
This repository contains an HR Policy Query Resolution system using Retrieval-Augmented Generation (RAG). It leverages a 4-bit quantized Mistral-7B-Instruct-v0.2 LLM and JP Morgan Chase’s publicly available Code of Conduct documents to generate accurate, contextually relevant responses for HR policy queries.
artificial-intelligence data hr large-language-models llm mistral-7b nlp pipeline prompt-engineering quantization rag retrieval-augmented-generation
Last synced: 12 Feb 2026
https://github.com/snandasena/disaster-response-pipeline
Disaster Response Pipeline | Data Engineering
data data-engineering-pipeline etl flask machine-learning nlp nlp-pipeline
Last synced: 24 Apr 2026
https://github.com/rudxain/ideas
A collection of my non-started projects
brain-storms brainstorming broken concepts crap data dreams experiments graphics hardware inspiration lazy mono-repository monorepo pet-project proposals software text unfinished wishes
Last synced: 06 Feb 2026
https://github.com/bdpedigo/neuropull
A (soon to be) lightweight Python package for accessing single-cell connectome networks with metadata.
connectome connectomes connectomics data dataset networks networks-biology
Last synced: 05 Oct 2025
https://github.com/andrewrporter/my-analytics
Analyzes FireFox browsing history with modern python3 features and libraries
analytics data firefox matplotlib python python3 sqlite3
Last synced: 28 Apr 2026
https://github.com/rohan-paul/machine-learning-and-deep-learning-tutorial-notebooks
Various Machine Learning and Deep Learning Tutorial Notebooks in Blog Format
data data-analysis data-science deep-learning deep-learning-tutorial deep-neural-networks machine-learning machine-learning-algorithms machinelearning neural-network pytorch pytorch-implementation pytorch-tutorial tensorflow
Last synced: 09 May 2026
https://github.com/leapfrogtechnology/datamegh
Datamegh - Data Engineering for the cloud.
cloud cloud-native data datamegh docker megha python serverless
Last synced: 14 May 2026
https://github.com/alexandregazagnes/unilasalle-public-resources
UniLaSalle-Public-Ressources : This public repository contains the notebooks and the data used for both : 2nd Year - Practical Statistical Tests 4th Year - Data Analysis with Python
data data-analysis data-analytics data-cleaning data-storytelling education educational exploratory-data-analysis python python3 r r-programming rstudio statistics visualization
Last synced: 28 Apr 2026
https://github.com/ismet55555/pdw-asym-2link
Clear and easy way of simulating a passive dynamic walker (PDW) model derived and exectured using MATLAB.
data dynamics inverted-pendulum matlab numerical-simulations passive-dynamic-walker passive-dynamics ramp research robotics simulation slope walking-simulator
Last synced: 29 Apr 2026