data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-06-23 00:07:41 UTC
- JSON Representation
https://github.com/thekartikeyamishra/data_cleaning_project
Welcome to the Data Cleaning and Visualization project! This repository demonstrates how to clean messy data and create insightful visualizations using Python with Pandas and Matplotlib.
data dataanalysis matplotlib matplotlib-pyplot pandas python
Last synced: 02 May 2026
https://github.com/maskedsyntax/covid-tracker
Qt app to keep a track of Covid-19 records of different countries.
coronavirus coronavirus-tracking covid-19 data parsing scraping scraping-websites tracker web-scraping
Last synced: 29 Mar 2025
https://github.com/real-veersandhu/scifaa-covid-19-project
📈 COVID-19 Data Science Project (2021 Internship @ SCI-FAA)
covid-19 data data-science data-visualization python
Last synced: 14 May 2026
https://github.com/ciscorn/tinybufr
A Rust library for decoding BUFR meteorological observation data format
bufr data meteorology rust weather wmo
Last synced: 11 Jan 2026
https://github.com/johntocci/nullaxe
Nullaxe is a powerful and user-friendly Python library designed for cleaning and preprocessing data. It works seamlessly with both pandas and polars DataFrames, making it a versatile tool for data scientists and developers.
data data-analysis data-science datacleaning pandas polars python
Last synced: 06 Apr 2026
https://github.com/sun-lab-nbb/sl-experiment
A Python library that provides tools to acquire, manage, and preprocess scientific data in the Sun (NeuroAI) lab.
ataraxis data experiment methods neuroscience
Last synced: 07 Mar 2026
https://github.com/dhruvldrp9/simpledht
A Python-based Distributed Hash Table (DHT) implementation enabling cross-network key-value storage, automatic node discovery, and data replication with a simple CLI and library interface.
cross-network-node-communation data data-replication data-synchronization dht dht-python distributed-hash-table key-value-storage nat netowork node-discovery peer-to-peer peer-to-peer-network python sha-256 simple udp udp-socket-communication
Last synced: 28 Feb 2026
https://github.com/marlenezw/speech-to-text
Turn any video or audio recording into a written transcript using python
data data-science python speech speech-recognition speech-synthesis speech-to-text
Last synced: 27 Apr 2026
https://github.com/coatless-rpkg/ucimlrepo
An unofficial R port of the Python package to download data off of the UCI ML repository
data data-science machine-learning rstats statistics uci-machine-learning uci-machine-learning-repository web-api
Last synced: 28 Jun 2025
https://github.com/mickeyshi-syd/actuarial-hackathon-2019
2019 Actuarial Hackathon
actuarial actuaries analytics data data-science hackathon
Last synced: 15 Jul 2025
https://github.com/helixspiral/ndbc
Golang wrapper for the National Data Buoy Center (NDBC)
data data-science golang government-data ndbc ndbc-buoy-data noaa noaa-api noaa-buoys noaa-data noaa-weather wrapper wrapper-api wrapper-library
Last synced: 14 Jun 2025
https://github.com/quantalabs/climate
Graphs temperatures of the US, Caribbean Nations, and the world from 1967, 1910, and 1880 to 2020, respectively.
caribbean caribbean-temp caribbean-temperature carribean-climate climate climate-change data data-visualization global global-climate global-temp global-temperature global-warming graphing-with-python graphs us us-climate us-temp us-temperature world
Last synced: 29 Mar 2025
https://github.com/zarr-developers/cookiecutter-zarr-store
Cookiecutter for Zarr store implementations
chunked data n-dimensional zarr
Last synced: 16 Jun 2025
https://github.com/minightdev/paperclip
Paperclip is a powerful privacy-focused data breach search engine that empowers users to swiftly and securely investigate breaches using email addresses and phone numbers. Our robust search engine delivers real-time results while prioritizing the privacy and security of user queries.
beaches data database pwn pwned search-engine
Last synced: 22 Mar 2025
https://github.com/sharmadhiraj/free-json-datasets
Collection of free JSON data that are scraped and parsed from different websites.
collection crawler data data-scraping datasets json sports statistics web-scraping
Last synced: 28 Mar 2025
https://github.com/courtois-neuromod/anat
Anatomical sub-dataset of Courtois-Neuromod project.
Last synced: 17 Jan 2026
https://github.com/hmeleiro/alquilermad
Housing rent map in Comunidad de Madrid / Mapa del alquiler en la Comunidad de Madrid
data data-science data-visualization datascience housing-location-visualization rent renting
Last synced: 13 Sep 2025
https://github.com/wfamous/fiv_update-data
This project automates the retrieval, processing, and publishing of digital product data for our Shopify store. It integrates Google Cloud Platform (GCP), Amazon Web Service (AWS), Terraform (Tofu), Python, Bash, Ansible and GitHub Actions to manage data pipelines efficiently.
ansible aws bash data data-analysis data-science devops gcp python pythonpackage shopify terraform tofu
Last synced: 17 Feb 2026
https://github.com/ahmedkhalf/arabic-keyword-scraper
Stop wasting your time! And obtain Arabic definitions without having to look it up.
arabic data definitions scraper sentences wordsearch
Last synced: 12 Mar 2025
https://github.com/reppon97/cryptosnake
Simple, unofficial python wrapper for Binance API. You'll find this easy-to-use package helpful if you're interested in general market data and cryptocurrency values. You don't need to have a Binance account or API Key since you can't purchase/trade cryptocurrencies using this package.
api binance bitcoin crypto cryptocurrencies cryptocurrency cryptocurrency-exchanges cryptography data ethereum json litecoin python python3 statistics
Last synced: 05 Mar 2025
https://github.com/eosdis-nasa/earthdata-pub-dashboard
Front-end Dashboard for Earthdata Pub
data earthdata edpub publication
Last synced: 15 Jan 2026
https://github.com/ugurcanerdogan/cross-validation-with-imbalanced-dataset
BBM467*SDSP - Small Data Science Project - Things to consider in cross validation and resampling when dealing with Imbalanced Data : What is the right way?
bbm467 cross-validation data data-science kfold-cross-validation logistic-regression machine-learning oversampling sdsp smote
Last synced: 21 Jun 2025
https://github.com/automators-com/tweaked
Fine tune your data
ai data nextjs rust synthetic tailwindcss tauri
Last synced: 08 Apr 2026
https://github.com/nikoshet/exploratory-data-analysis-using-r
Exploratory Data Analysis using R Course Project for M.Sc. 'Data Science and Machine Learning' in NTUA
data data-analysis data-science eda exploratory-data-analysis ggplot2 r
Last synced: 14 May 2026
https://github.com/marek-jakub/monitoring
A university project concerning field data management for bird ringers.
bird data fieldwork management ringing
Last synced: 20 Jun 2026
https://github.com/kom-senapati/ghw-data-hacks
🌍 Global Hack Week data projects, 📊 focused on exploration, manipulation, and analysis...
Last synced: 12 Mar 2025
https://github.com/nazar-pc/fixed-size-multiplexer
A tiny library for multiplexing data chunks into blocks of fixed size and vice versa
chunk data demultiplex demux fixed multiplex mux size
Last synced: 31 Oct 2025
https://github.com/imadsaddik/bodmaghdataset
BoDmagh dataset is a Supervised Fine-Tuning (SFT) dataset for the Darija language
arabic-llm arabic-nlp darija-llm darija-nlp data dataset fine-tuning llm nlp sft
Last synced: 03 Apr 2025
https://github.com/alexandregazagnes/scikit-res
Very Basic package to store results of ML models Grid search results are hard to exploit. This package aims to store them in a more convenient way.
data machine-learning mlops mlops-workflow results scikit-learn
Last synced: 20 Jan 2026
https://github.com/andrewjbateman/mevn-stack-data
:clipboard: MEVN Info & Full stack MEVN app with CRUD functions
data database express expressjs full-stack info mevn mevn-stack middleware mongodb mongodb-atlas nodejs typescript vue vue3 vue3-typescript
Last synced: 07 Apr 2026
https://github.com/cherylisabella/statistics--caret
Training Regression and Classification Models using caret
data data-analysis data-mining data-science datascience dataset r statistics
Last synced: 24 Jun 2025
https://github.com/longzheng/southeastwater-usage-scraper
Extract hourly water usage data from South East Water portal website for digital water meters
australia data iot playwright southeastwater victoria water
Last synced: 06 Feb 2026
https://github.com/mongoexpuser/synthetic-drilling-data-app-for-sqlite-ml
Generate synthetic drilling data that can be used for testing machine learning (ML) models.
classification data drilling events inference machine-learning ml-models node-sqlite3 nodejs prediction python sqlite3 training
Last synced: 08 Apr 2026
https://github.com/unaygney/js-challenges-data-structures-and-algorithms
Repo of the challenges I'm trying to solve to understand data structures and algorithms..
algorithms-and-data-structures data javascript structure
Last synced: 29 Oct 2025
https://github.com/stdlib-js/array-bool
BooleanArray.
array binary bool boolean booleanarray data javascript mask node node-js nodejs stdlib structure typed typed-array types
Last synced: 13 May 2025
https://github.com/toviszsolt/stormflow
StromFlow - A Lightweight Data Modeling and Storage Library
data database datamanagement datastore db document jsondatabase memorydatabase model mongodb mongoose nodb nosql query schema stormflow
Last synced: 13 May 2025
https://github.com/mzazakeith/puppetmaster
Puppeteer & Crawl4AI microservice for web automation, scraping, and AI processing with Bull queues
agent ai automation bull bullmq chrome crawl4ai crawler data data-extraction extraction gemini llm llms openai playwright puppeteer web-automation
Last synced: 13 May 2025
https://github.com/kodie/migrate-acf-field-data-to-repeater
A WordPress plugin that migrates field metadata for ACF fields that have been moved inside of a repeater
acf acf-field acf-fields advance-custom-field data data-migration data-migration-tool wordpress wordpress-plugin
Last synced: 19 May 2026
https://github.com/joisino/twinpaper
Code for "Twin Papers: A Simple Framework of Causal Inference for Citations via Coupling" (CIKM 2022)
causal-inference data research science-of-science
Last synced: 21 Mar 2025
https://github.com/sheweny/discord-resolve
This module groups together functions to retrieve data from different types of arguments.
data discord discord-js mentions resolver sheweny utility
Last synced: 29 Oct 2025
https://github.com/thejeshgn/thejeshgn
data data-visualization datameet india opendata public-interest
Last synced: 15 Jan 2026
https://github.com/stdlib-js/datasets-suthaharan-single-hop-sensor-network
Labeled wireless sensor network data set collected from a simple single-hop wireless sensor network deployment using TelosB motes.
data dataset datasets javascript labeled machine-learning ml mote motes network node node-js nodejs outlier outliers sample sensor statistics stats stdlib
Last synced: 03 Mar 2025
https://github.com/schbenedikt/datamining
Heise (https://heise.de) News Crawler
data data-science heise postgresql web-crawler
Last synced: 10 Apr 2025
https://github.com/windwalker-io/data
[READ ONLY] A library contains data/collection objects with null-object pattern.
collection collections data data-object iterator nullobject value-object
Last synced: 12 Mar 2026
https://github.com/eliot-akira/png-compressor
Compress and encode data as PNG image
Last synced: 17 Mar 2025
https://github.com/nafisalawalidris/advanced-fraud-detection-with-anomaly-detection
This repository demonstrates how to build a robust fraud detection system that combines supervised learning techniques with anomaly detection models. It provides end-to-end implementation, from data preprocessing and model training to deploying a real-time fraud detection API using FastAPI.
anomaly-detection creditcardfrauddetection data dataanalytics fastapi fraud-detection machinelearning modeldeployment python supervised-machine-learning unsupervised-machine-learning
Last synced: 20 Apr 2026
https://github.com/ciscorn/tinygrib2
(experimental) A tiny toolkit for parsing JMA's GRIB2 files.
data grib grib2 meteorology rust weather
Last synced: 26 Apr 2026
https://github.com/feltex/datahora-java
Aprenda a trabalhar com Data e Hora em Java com as novas classes LocalDateTime, LocalDate, DateTimeFormatter e outras novidades do pacote java.time.
brasil data date dateformat dateformat-brazil datetime hora java java11 localdatetime locale localization zoneddatetime
Last synced: 18 Jun 2026
https://github.com/skylinenando/javascript
autocomplete browser data disable events javascript language loop
Last synced: 14 Feb 2026
https://github.com/stdlib-js/array-ones-like
Create an array filled with ones and having the same length and data type as a provided array.
array data float32array float64array int16array int32array javascript matrix ndarray node node-js nodejs stdlib structure typed typed-array types uint32array vector
Last synced: 05 Jan 2026
https://github.com/sdhutchins/jxn-open-data-api
Access Jackson, MS open government data using a python API wrapper.
api data jackson jxn mississippi open-gov
Last synced: 08 Apr 2025
https://github.com/parimala24-ds/datascientistmlinterviewprep24
DATASCIENTST ML INTERVIEW PREP24
data decisiontree interviewquestions linear-regression logistic machine-learning matplotlib numpy pandas python seaborn sklearn
Last synced: 12 Apr 2025
https://github.com/bernard-ng/drc-news-corpus
DRC News Corpus : Towards a scalable and efficient system for Congolese news dataset curation
aggregator data news nlp politics
Last synced: 06 Sep 2025
https://github.com/amacd31/daily_hydromet_sample_data
This repository contains streamflow, precipitation, and potential-evapotranspiration data for the Twentymile Creek USGS streamflow station.
data dataset hydrology potential-evapotranspiration precipitation public-domain streamflow
Last synced: 16 Jan 2026
https://github.com/rohan-paul/machine-learning-and-deep-learning-tutorial-notebooks
Various Machine Learning and Deep Learning Tutorial Notebooks in Blog Format
data data-analysis data-science deep-learning deep-learning-tutorial deep-neural-networks machine-learning machine-learning-algorithms machinelearning neural-network pytorch pytorch-implementation pytorch-tutorial tensorflow
Last synced: 09 May 2026
https://github.com/flrd/standardlastprofile
R Data Package for BDEW Standard Load Profiles in Electricity
Last synced: 16 Mar 2026
https://github.com/drkenreid/introductory-data-science
Hands-on machine learning tutorials in Google Colab, covering various algorithms and techniques for learners at different levels.
cnn data data-science deep-learning learning-datascience learning-machine-learning learning-python neural-network neural-networks regression rnn science tutorial tutorial-exercises tutorials
Last synced: 28 Jan 2026
https://github.com/andrew-johnson-4/misspeller
Take correctly spelled words and return common spelling mistakes
common-mistakes data language natural nlp processing rust
Last synced: 30 Apr 2025
https://github.com/colour-science/colour-hdri-examples-datasets
Colour - HDRI - Examples Datasets
color color-science color-space color-spaces colorspace colorspaces colour colour-science colour-space colour-spaces colourspace colourspaces data dataset datasets hdr hdri raw tone-mapping tonemapping
Last synced: 19 Mar 2026
https://github.com/andrewrporter/my-analytics
Analyzes FireFox browsing history with modern python3 features and libraries
analytics data firefox matplotlib python python3 sqlite3
Last synced: 28 Apr 2026
https://github.com/divithraju/divith-raju-searchengine-wikipedia
search engine optimizationA complete search engine experience built on top of 75 GB Wikipedia corpus with subsecond latency for searches. Results contain wiki pages ordered by TF/IDF relevance based on given search word/s. From an optimized code to the K-Way mergesort algorithm, this project addresses latency, indexing, and big data challenges.
algorithms data dataengineering inverted-index linux merge-sort nlp project project-repository python3 serchengine software-engineering ubuntu wikipedia
Last synced: 16 May 2026
https://github.com/financejs/discord-bot
A Discord Bot Used In Financejs Discord Server
data discord discord-bot discordjs-bot finance financejs financial
Last synced: 13 Apr 2026
https://github.com/Nazaniiin/EDA_QualityofRedWine
:wine_glass: :chart_with_upwards_trend: (EDA) R - Vizualization / Performed exploratory analysis and visualization on Red Wine Quality dataset; Mainly answering which chemical properties influence the quality of red wines.
charts data data-analyses data-analysis-udacity data-analytics data-mining data-visualization exploratory-data-analysis histogram linear-models prediction-model r r-programming visualization
Last synced: 30 Jul 2025
https://github.com/ozanarkancan/sailx
This repo contains the code for generating artificial navigational instruction following data.
data grounded-language-learning
Last synced: 08 Jan 2026
https://github.com/rikvdh/zabuffer
Zero-Allocation buffer handling in C
buffer c clib data embedded memory string zero-allocation
Last synced: 03 Mar 2025
https://github.com/ashwinpn/visualization
Data Visualization using Matplotlib, Pandas Visualization, Seaborn, ggplot, and Plotly.
analysis data data-analysis data-science data-visualization graphs plots python python3 visualization
Last synced: 13 Apr 2026
https://github.com/mikeintoshsystems/hispmd
HIS Performance Monitoring Dashboard
api dashboard data dhis2 dhis2-api docker docker-compose hispmd mfr rest-api visualization web
Last synced: 08 Apr 2025
https://github.com/OliverHennhoefer/shiny-template-interactive-table
Example of interactively adding rows / deleting rows by selecting directly in a data.table (DT) in Shiny
button data delete dt r select selection server shiny shiny-applications shiny-apps shiny-r shinyapps table ui userinterface
Last synced: 30 Jul 2025
https://github.com/banbord/data-vis-tornados
This repository includes data files, processing scripts, visualization code, and documentation for our tornado data visualization project. It aims to provide insights into tornado patterns across the United States using interactive and informative visual representations.
d3-visualization d3js data javascript json visualization
Last synced: 24 Feb 2026
https://github.com/andrey-tech/data-storage-php
Простое хранилище данных в виде ключ-значение в JSON-файлах с разделяемой блокировкой на чтение и эксклюзивной блокировкой на запись.
data data-storage files json php php7 storage
Last synced: 29 Apr 2026
https://github.com/p32929/use-megamind
A simple react hook for managing asynchronous function calls with ease on the client side
async asynchronous-tasks axios client-side-javascript data data-fetching easy fetch generics hooks javascript npm painless promise query react rest simple small typescript
Last synced: 23 Jan 2026
https://github.com/keosariel/nairagazer-clustered-news
Providing clustered News data specifically Nigeria news. In hindsight this repo contain nigeria news and it's coverage. Data is from Nairagazer
ai data data-science news nigeria nigerian-data python
Last synced: 30 Aug 2025
https://github.com/jimut123/scrapers
All Scrapers that I'll build
bs4 data python3 real-time-visualisations scrapers scrapy wget
Last synced: 16 Jan 2026
https://github.com/yeisonmontoya1815/machine-learning_prediction_can_inflation
we aim to predict trends in the Canadian market basket using sentiment analysis techniques. Sentiment analysis involves analyzing text data to determine the sentiment expressed, whether positive, negative, or neutral.
algorithms-and-data-structures data data-analysis data-science data-visualization feature-engineering machine-learning matplotlib-pyplot numerical-analysis numpy pandas pipelines python sklearn structured-data super unsupervised-learning
Last synced: 05 Feb 2026
https://github.com/georgetdn/syscppcp
Store C++ class data in a file ( persistence ) and manipulate it programmatically or using Small SQL (included)
class data framework object persistence serialize sql windows
Last synced: 04 Apr 2025
https://github.com/dongminlee94/data-visualization-tutorial
A repository for data visualization tutorial
data data-science data-visualization matp matplotlib pca plotly python seaborn t-sne tutorial umap visualization
Last synced: 29 Apr 2026
https://github.com/mollybeach/cherryether
CherryEther: Typescript Staking Deposits Ethereum Transactions
blockchain data data-science ethereum typescripts
Last synced: 21 May 2026
https://github.com/sabujxi/python-scraper-and-data-analysts-admin-panel-in-django
A data scraper from texas govt site and a helping web app for managing, reviewing and editing the data
analyst data data-analysis data-entry data-scraper django django-application python python-scraper real-estate regex scraper texas
Last synced: 30 Apr 2026
https://github.com/stefen-taime/real-time-data-pipeline-snake-game
Dynamic Snake Game: Unleashing Real-Time Streaming Analytics with Redis, Kafka, Flink, ClickHouse & Chart.js in an Online Snake Game via Flask API
chartjs clickhouse confluent-cloud data flask kafka-streams pipeline redis
Last synced: 04 May 2026
https://github.com/amethyst-php/customer
A person or an organization that pays for goods or services
amethyst amethyst-package api customer data laravel
Last synced: 11 May 2026
https://github.com/wamphlett/input-collection
A smarter and stricter way to capture and validate request data
Last synced: 27 May 2026
https://github.com/noklam/blog_archive_fastpage
Nok's data science blog
blog data data-science machine-learning python sceince
Last synced: 01 May 2026
https://github.com/cosmos-loops/cosmos-efcore
Cosmos.EntityFrameworkCore is a part of Cosmos.Data, a inline project of COSMOS LOOPS PROGRAMME. This repository provides a package of Microsoft.EntityFrameworkCore to improve development efficiency.
cosmos-loops data efcore entityframeworkcore
Last synced: 14 Aug 2025
https://github.com/jazeee/dexcom-android-wall-panel
Display data as a Graph on Android, jazeee data plotter
Last synced: 02 May 2026