data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-07-01 00:07:35 UTC
- JSON Representation
https://github.com/andreped/chatbot-streamlit-demo
Develop accessible ChatBot with Azure OpenAI and Streamlit
azure chatbot data data-mining huggingface huggingface-spaces large-language-models llm openai python research streamlit web-application
Last synced: 01 Aug 2025
https://github.com/emrecpp/datapacket-csharp
Send, recv, encrypt, decrypt, compress data as Packet and send it with socket for C#.
compress data deserialization deserialize deserializer encrypt packet send serialization serialize serializer socket
Last synced: 15 Sep 2025
https://github.com/dicook/tutorial_make_better_data_plots
Materials for a workshop in June 2025
data data-analysis data-science data-visualization r statistical-graphics statistics
Last synced: 25 Jun 2025
https://github.com/aa-sikkkk/twitterdatamining
A Simple Script to mine data from X/Twitter
Last synced: 24 Jan 2026
https://github.com/nickmcintyre/processing-netcdf
Simple access to scientific datasets with Processing
Last synced: 11 Apr 2025
https://github.com/biglocalnews/upload-files
Upload comma-delimited files to biglocalnews.org in your GitHub Action
action actions archiving csv data data-journalism github-actions journalism news
Last synced: 27 Apr 2026
https://github.com/themitosan/grpp
GRPP is a simple tool written in TS that helps preserving git repositories.
cli data git grpp linux preservation project repo repository
Last synced: 15 Jul 2025
https://github.com/cttynul/elsoftware
⚽ Vinci al Fantacalcio usando librerie di pandas, facendo credere a tutti che tu stia usando il machine learning
data data-science fantacalcio machine-learning pandas
Last synced: 30 Jun 2026
https://github.com/swaymm7/open-source-prompt-library
Here is where I store all my useful prompts
chatgpt-prompt data data-analytics data-engineering deepseek gpt ios llm macos prompt prompts prompts-template swift-package-manager tracker
Last synced: 16 Jul 2025
https://github.com/gianlucatruda/project_sleep
A Quantified Self project in which I use ±40 nights of data to determine what helps and hinders my sleep.
data experiment matplotlib python quantified science self sleep visualization
Last synced: 03 Apr 2025
https://github.com/intercloud/gotsgen
Golang Time Series Data Generator
data generator golang library timeseries
Last synced: 20 Jun 2025
https://github.com/codenoid/lazy-mongo
Insert data to mongo from text plain or file
crystal crystal-language data database mongoclient mongodb
Last synced: 13 Apr 2026
https://github.com/rcorrero/light-pipe
A high-level syntax for data pipelines, designed to make pipeline development quick and painless.
data data-pipelines data-processing geospatial-analysis geospatial-processing pipeline
Last synced: 14 Dec 2025
https://github.com/ikstream/dns-handler
Data collection server for the dalec user collection system
collection dalec data data-collection dns dns-server python python3
Last synced: 13 Mar 2025
https://github.com/csengupta1101/dig-student-files
This Repository will contain all student submissions at one place.
data datascience education machine-learning python students visualization
Last synced: 17 Jul 2025
https://github.com/louis030195/ega
ai artificial-intelligence data data-science data-visualization
Last synced: 07 Mar 2026
https://github.com/ljharb/define-data-property
Define a data property on an object. Will fall back to assignment in an engine without descriptors.
accessor configurable data define ecmascript enumerable javascript object property writable
Last synced: 13 Apr 2025
https://github.com/techiaith/brawddegau-tagiedig
Corpws o frawddegau CC0 mewn fformat jsonl, gyda rhannau ymadrodd y tocynnau (geiriau etc.) wedi'u tagio â thagiau Universal Dependencies. // A Corpus of CC0 sentences in the jsonl format, tagged with Universal Dependency part-of-speech tags.
annotated cc0 commonvoice data nlp welsh
Last synced: 17 Jan 2026
https://github.com/owsas/open-categories
Open Categorization system, available as a node module
categories categorization categorize data data-structures node open-source typescript yaml
Last synced: 30 Apr 2025
https://github.com/burakboduroglu/data_structures_and_algorithms
This repo contains my sata structures and algorithms codes.
alghorithm data data-structures dynamic-programming graph hash interview interview-questions linked-list structures tree-structure
Last synced: 04 Apr 2025
https://github.com/vijishmadhavan/parse-clip
A simple CLIP based project for combining images from multiple datasets.
clip data datacleaning dataexploration dataset fastai image python
Last synced: 14 May 2026
https://github.com/monfireboose/monfireboose
A lightweight JavaScript library that provides a high level and model based API for interacting with Firebase.
api data database firebase firestore high-level-api interact javascript library model storage
Last synced: 18 Feb 2026
https://github.com/rpidanny/streamline.js
A JavaScript class that reads and processes a stream line-by-line in order.
big-data data data-processing file-stream javascript stream streams typescript
Last synced: 08 Sep 2025
https://github.com/equinor/data-marketplace
Easily find and check out data products
Last synced: 01 May 2025
https://github.com/deveel/deveel.repository
Implementations of the repository pattern for .NET to support the domain-driven modeling
clean-architechture csharp data dotnet-core dotnetcore efcore entity entity-framework entity-manager layered-architecture mongodb repository repository-manager repository-pattern
Last synced: 22 Apr 2025
https://github.com/strmprivacy/docs
With STRM Privacy you can easily build privacy-by-design data pipelines and define data contracts to encode privacy inside your data. Data streams are pseudonymised or anonymised in real-time or batch. These are our docs.
data documentation docusaurus privacy privacy-enhancing-technologies
Last synced: 12 Jul 2025
https://github.com/muneeb1030/finetune-tiny-llama
Fine-tuning the Tiny Llama model to mimic my professor's writing style using the Llama Factory. The project involves data collection, preprocessing, preparation, fine-tuning, and evaluation.
data data-preparation data-preprocessing finetuning llama-factory llm pymupdf selenium-python spacy tinyllama webscraping
Last synced: 08 Apr 2026
https://github.com/imagodata/filter_mate
FilterMate is a Qgis plugin, an everyday companion that allows you to easily filter your vector layers
data exploratory-data-analysis filter geospatial ogr postgis qgis qgis-plugin qgis3 qgis3-plugin spatialite sql vector-database
Last synced: 29 Apr 2026
https://github.com/cmudig/mosaic-profiler
A data profiler built with Mosaic
Last synced: 25 Oct 2025
https://github.com/sneels/parkds
Connect all your Data Sources via 1 process (Cross-Domain + Single-Domain)
cross-domain data database datasource datasources javascript source
Last synced: 24 Feb 2026
https://github.com/0xdir/htcds_dart
Human Trafficking Case Data Standard (HTCDS v0.2) objects, for easy creation, storage and transmission of case data related to human trafficking.
data humanitarian schema standards
Last synced: 24 Oct 2025
https://github.com/ryanmorr/fastmap
Accelerated hash maps
data hashmap javascript map performance
Last synced: 10 Oct 2025
https://github.com/udityamerit/python-librearies-for-data-science
Python libraries for data science enable efficient data manipulation, analysis, and modeling. Key libraries include NumPy for numerical computing, pandas for data handling, Matplotlib for visualization, Scikit-learn for machine learning, TensorFlow for deep learning, and BeautifulSoup/requests for web scraping. These libraries simplify complex data
beautifulsoup data data-science data-science-libraries machine-learning matplotlib numpy pandas requests scikit-learn scikitlearn-machine-learning tensorflow
Last synced: 06 Feb 2026
https://github.com/tomdoestech/website-scraping-example
data node-js nodejs scraping scraping-websites
Last synced: 16 Mar 2025
https://github.com/t3v/t3v_datamapper
The data mapper extension of TYPO3voilà.
data database datamapper extension laravel mapper t3v typo3 typo3-cms-extension typo3-extension typo3voila
Last synced: 27 Jan 2026
https://github.com/legopitstop/addons
All legopitstop's Bedrock add-ons in one place.
add-on assets behaviorpack data hacktoberfest minecraft mods modtoberfest resroucepack vanilla
Last synced: 06 Feb 2026
https://github.com/binarybardakshat/suryanayan
Suryanayan AI is a project aimed at using drone technology and artificial intelligence for monitoring and detecting issues in solar panels. This project is inspired by the Indian government's initiative to promote solar energy by providing subsidies on solar panels.
Last synced: 10 Oct 2025
https://github.com/geopython/pygeoapi-examples
Example pygeoapi deployment patterns and configurations
api data geospatial ogc ogc-api osgeo pygeoapi
Last synced: 11 Oct 2025
https://github.com/fcakyon/earth2-scraper
Up-to-date earth2.io data
data earth earth2 earth2io javascript json json-api prices-per-tile python scraper
Last synced: 09 May 2026
https://github.com/cicerops/monitoring-check-grafana
Monitor a Grafana datasource against data becoming stale to detect data loss or other dropout conditions.
data database freshness grafana grafana-datasource icinga2 icinga2-plugin influxdb monitoring stale
Last synced: 08 May 2026
https://github.com/chompfoods/sdk-csharp
C# SDK for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.
api branded chomp csharp csharp-sdk data database dll food grocery ingredients nuget nutrition raw recipes recipes-api restsharp sdk swagger
Last synced: 06 May 2026
https://github.com/ahmedshahriar/eda_basketball
basketball basketball-stats data data-science data-visualization pandas python python3 streamlit
Last synced: 04 May 2026
https://github.com/jderstd/spec
A standard for JSON responses
data error jder json response specification structure
Last synced: 13 May 2026
https://github.com/deveel/kista
Implementations of the repository pattern for .NET to support the domain-driven modeling
clean-architechture csharp data dotnet-core dotnetcore efcore entity entity-framework entity-manager layered-architecture mongodb repository repository-manager repository-pattern
Last synced: 08 Jun 2026
https://github.com/paladique/azuresample-guestbook
Guestbook using MySQL and Cosmos DB on Azure
cosmosdb data mysql spa websockets
Last synced: 30 Apr 2026
https://github.com/anicolaspp/mapr-data-gen
Data generator for MapR Data Platform
data mapr mapr-db mapr-es mapr-streams maprdb parquet scala spark
Last synced: 29 Apr 2026
https://github.com/tooleks/laravel-presenter
The Laravel Presenter Composer Package
collection composer data entity laravel mapper mapping php presenter representation view
Last synced: 28 Apr 2026
https://github.com/yazaabed/at-who-angular
wrapper for At.js that add mentions autocomplete to your application with angular component for using it on any AngularJS projects
angular-components angularjs autocomplete components data modules webpack wrapper
Last synced: 28 Apr 2026
https://github.com/luminovrym/pbo-biodata
Simulasi Cara Input Data dengan OOP
Last synced: 18 Jun 2026
https://github.com/mongodb-developer/rocket-analytics
Learn how the various components of MongoDB's Developer Data Platform (DDP) can support app-driven and traditional analytics in real-time without duplicating data to other data stores. This demo was created for AWS re:Invent 2022 and presented at the MongoDB booth area at the Venetian expo hall.
data federation lucene lucenesearch mongodb s3 search sql
Last synced: 28 Apr 2026
https://github.com/justjavac/deno_data_dir
Returns the path to the user's data directory.
data deno deno-module deno-modules directory
Last synced: 27 Apr 2026
https://github.com/kanugurajesh/firebase-data
Adding data to firebase store
data firebase firebase-database python
Last synced: 27 Apr 2026
https://github.com/anthonykrivonos/ts-algo-masterclass
👾 Giant TypeScript algorithm and data structure masterclass to be constantly updated with important CS concepts.
algorithm class-project computer concepts data data-structures fundamentals giant library masterclass science structures typescript
Last synced: 11 May 2026
https://github.com/mukhopadhyay/opendata
Open Data ❤️
data data-science datasets deep-learning kaggle kaggle-dataset machine-learning open-source opendata
Last synced: 25 Apr 2026
https://github.com/joamag/pandas
Loads of pandas data from China with awesome data
data data-analysis jupyter notebook pandas
Last synced: 25 Apr 2026
https://github.com/corentinb/txtoredis
:fire: Push each line of a text file, to a Redis set
data datascience dataset go golang redis set
Last synced: 24 Apr 2026
https://github.com/d2hydro/fewspy
A Python API for the Deltares FEWS PI REST Web Service
data geopandas hydrology hydrometrics pandas python
Last synced: 23 Apr 2026
https://github.com/healthyregions/oeps
Opioid Environment Policy Scan - data explorer and backend management
data data-visualization public-health
Last synced: 21 Apr 2026
https://github.com/adanos-software/free-ticker-database
Free global stock & ETF ticker reference database - 50k+ tickers, 66 exchanges, 81 countries
Last synced: 10 May 2026
https://github.com/lilingxi01/bloark
Blocks Architecture (BloArk) project package for building Blocks-0 dataset and way beyond.
architecture bloark data revision-based
Last synced: 05 Apr 2026
https://github.com/metapsy-project/data-gambling-psyctr
Database of psychological interventions for problem gambling and gambling disorder.
Last synced: 02 Apr 2026
https://github.com/d8a-tech/d8a
A data collection service fully compatible with GA4 tracking protocols. Ingest into ClickHouse or BigQuery database while maintaining complete control over your data.
bigquery clickhouse data ga4 tracker
Last synced: 10 Apr 2026
https://github.com/robertmyles/riscobrasil
An R package to download 'Brazil Risk' data :chart_with_upwards_trend:
Last synced: 08 Apr 2025
https://github.com/ahmetfurkandemir/sahibinden-data-engineering-technical-case-study
Sahibinden.com Data Engineering Technical Case Study
case-study data data-engineering debezium docker flink kafka mongodb mysql pyflink pyspark python sahibinden spark
Last synced: 03 Mar 2026
https://github.com/muhammadibrahim313/datavue
"DataVue" is an AI-powered data science platform that simplifies EDA, visualizations, and data cleaning. It offers personalized learning, real-time collaboration, and strong data security for all users.
analytics auto chatbot data data-science data-visualization eda education genai groq groq-api llama3 machine-learning python streamlit
Last synced: 10 Apr 2025
https://github.com/louisbrulenaudet/legalkit-pipeline
Publication pipeline for French legal codes on 🤗 Datasets from LegiFrance with concurrent upload and dynamic REAMDE.md.
data datasets huggingface huggingface-datasets legal legaltech legifrance open-source parquet piste-api python
Last synced: 17 Mar 2025
https://github.com/geo2france/odema-dashboard
Tableaux de bord thématiques Odema
application client-side dashboard data echarts maplibre odema react waste
Last synced: 05 Feb 2026
https://github.com/abrudz/parsing
Dyalog APL expressions to parse common and unusual data formats from text files
apl csv data data-format dyalog-apl dyalogapl parsing
Last synced: 20 Mar 2026
https://github.com/woctezuma/steam-reviews-data
Data available to compute statistics of Steam reviews.
Last synced: 19 Mar 2026
https://github.com/huangcongqing/ranking-list
数据!important | 各种排行,榜单数据汇总 数据为王的时代 Data
Last synced: 15 Feb 2026
https://github.com/platob/yggdrasil
arrow data databricks pandas polars spark sql
Last synced: 02 Jun 2026
https://github.com/gadenbuie/crantrack
Hourly snapshots of CRAN's incoming packages folder
Last synced: 12 Mar 2026
https://github.com/qeeqbox/data-compliance
Data compliance is the process of following various regulations and standards to ensure that sensitive digital assets (data) are guarded against loss, theft, and misuse
compliance data data-compliance infosecsimplified qeeqbox
Last synced: 19 Mar 2026
https://github.com/fforres/webpack-plugin-dx-metrics
Webpack plugin to track webpack behaviour in datadog
data datadog developer-experience typescript visualization webpack
Last synced: 13 Feb 2026
https://github.com/onaio/gisida-react
React Dashboard library for Gisida.
dashboard data gisida map react visualization
Last synced: 28 Apr 2025
https://github.com/yashika-malhotra/exploratory-data-analysis-for-multinational-retail-corporation
Analysis via CLT and Visualization on Multinational Retail Corporation's data to provide insights and recommendations to improve their userbase.
colab-notebook data jupyter-notebook matplotlib numpy pandas python seaborn stats
Last synced: 11 Feb 2026
https://github.com/enes9103/039_react_task_tracker-json_server
api axios-react css3 data javascript json-server react reactjs responsive todoapp
Last synced: 11 Feb 2026
https://github.com/rikurauhala/insights
Visualize your coding journey
cypress data data-visualization github github-api javascript material-ui octokit react statistics typescript vite
Last synced: 11 Feb 2026
https://github.com/bluegreen-labs/oneflux_containers
Containerized (docker) versions of the ONEFlux processing pipeline
data ecosystem fluxes micrometeorology processing
Last synced: 07 Oct 2025