data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-06-30 00:07:50 UTC
- JSON Representation
https://github.com/andygol/yamap
Yamap Ain't Map – deployment of OSM infrastructure project inspired by osm-seed
api data extract geo-data map openstreetmap osm
Last synced: 24 Jun 2025
https://github.com/andreped/chatbot-streamlit-demo
Develop accessible ChatBot with Azure OpenAI and Streamlit
azure chatbot data data-mining huggingface huggingface-spaces large-language-models llm openai python research streamlit web-application
Last synced: 01 Aug 2025
https://github.com/daninet/audio-annotator
Simple app for annotating audio segments
ai annotate annotation artificial audio data intelligence label labeling labeling-tool learning machine ml science wav
Last synced: 04 Apr 2025
https://github.com/fabriquebeweb/dao
Le 'Data Access Object' pour les nuls !
Last synced: 18 Feb 2026
https://github.com/aydinnyunus/dictionary
Dictionary
data data-analysis data-science data-structures data-visualization database dataset dictionaries dictionary dictionary-learning python python-2 python-3 python-3-6 python-library python-script python2 python27 python3 python36
Last synced: 09 May 2025
https://github.com/abdelmajidlh/eportfolio
ePortfolio Abdelmajid EL HOU
bioinformatics data data-analysis data-science data-visualization database datascience genetics
Last synced: 22 Mar 2025
https://github.com/njraladdin/newspapers-com-scraper
A Node.js scraper for extracting article data from Newspapers.com based on keywords, dates, and locations.
archive data newspapers scraper scraper-api scraping
Last synced: 06 Apr 2025
https://github.com/nickmcintyre/processing-netcdf
Simple access to scientific datasets with Processing
Last synced: 11 Apr 2025
https://github.com/umitkaanusta/smol-elt
a smol elt (not etl) pipeline for smol tasks
analytics automation aws aws-sns data data-engineering data-pipeline elt etl google-sheets pandas pipeline python spreadsheet web-scraping
Last synced: 10 May 2026
https://github.com/guiferviz/tuberia
Data engineering meets software engineering
data data-engineering expectations pipeline python spark
Last synced: 08 Mar 2026
https://github.com/rpidanny/streamline.js
A JavaScript class that reads and processes a stream line-by-line in order.
big-data data data-processing file-stream javascript stream streams typescript
Last synced: 08 Sep 2025
https://github.com/codiepp/elykseer-base
cryptographic data archive; written in F#; envisaged to stay another 10 years
archive cli cryptography data distributed-storage dotnet fsharp longterm-storage
Last synced: 19 May 2026
https://github.com/cttynul/elsoftware
⚽ Vinci al Fantacalcio usando librerie di pandas, facendo credere a tutti che tu stia usando il machine learning
data data-science fantacalcio machine-learning pandas
Last synced: 30 Jun 2026
https://github.com/legopitstop/datapacks
All legopitstop's datapacks in one place.
assets data datapack hacktoberfest minecraft mods modtoberfest resroucepack vanilla
Last synced: 03 Jan 2026
https://github.com/newrelic-experimental/newrelic-java-camel
Instrumentation of the New Relic Java Agent for the Camel framework
camel camel-jms data instrumentation java nrlabs nrlabs-data nrlabs-java-verify nrlabs-odp observability-data
Last synced: 10 Apr 2025
https://github.com/thamerh/web-scraper-with-node.js-and-cheerio
used simple exemple how Scraper data from Build a Web Scraper with Node.js and Cheerio
cheer data expressjs nodejs scarper webscraping
Last synced: 08 Apr 2026
https://github.com/intercloud/gotsgen
Golang Time Series Data Generator
data generator golang library timeseries
Last synced: 20 Jun 2025
https://github.com/nix1707/webscrapper-browserextension
Scraper Master is a Chrome extension for effortless web data extraction. Built with React, TypeScript, and the Chrome Scripting API, it ensures efficient, high-quality, and seamless scraping. Utilizing HTML and CSS, ScrapeEase offers a clean, responsive design. Simplify your data collection with Scraper Master.
chrome-extension chrome-extensions css data frontend html html-parser modern parser parsing react scraper scraping typescript ui validation webparser webparsing webscraping
Last synced: 21 Jun 2025
https://github.com/astrid-project/lcp
In each local agent, the control plane is responsible for programmability, i.e., changing the behaviour of the data plane at run-time.
agent beats control data ebpf elasticsearch log logstash management programmability security
Last synced: 06 Apr 2025
https://github.com/charconstpointer/markovbot
PoC markov chain sentence generator, powered by discord for data gathering
bot chain collection data discord markov parsing
Last synced: 16 May 2026
https://github.com/edoardottt/computerphile-pong
Pong game with a little bit of Data Science. Computerphile.
2d-game computerphile csv data data-science datascience game game-2d game-development games pandas pong pong-game pygame pygame-library python python-3 python-library python3
Last synced: 30 Oct 2025
https://github.com/simranjeet97/docker_python_flask-dash_app
Docker Image and Container Build for Python Flask/Dash App
data data-science data-structures data-visualization docker docker-compose docker-container docker-image python python-script uwsgi-nginx
Last synced: 07 May 2026
https://github.com/biglocalnews/upload-files
Upload comma-delimited files to biglocalnews.org in your GitHub Action
action actions archiving csv data data-journalism github-actions journalism news
Last synced: 27 Apr 2026
https://github.com/psfried/dgen
Generate evil test data
csv data data-generation data-generator language testing-tools
Last synced: 18 Mar 2025
https://github.com/rcorrero/light-pipe
A high-level syntax for data pipelines, designed to make pipeline development quick and painless.
data data-pipelines data-processing geospatial-analysis geospatial-processing pipeline
Last synced: 14 Dec 2025
https://github.com/vijishmadhavan/parse-clip
A simple CLIP based project for combining images from multiple datasets.
clip data datacleaning dataexploration dataset fastai image python
Last synced: 14 May 2026
https://github.com/gappeah/apocalypse-food-prep-report
This PowerBI project focuses on visualising data for Apocalypse Food Prep, a company specialising in emergency food supplies. The dataset consists of various CSV files containing information on customers, locations, products, sales, sales teams, and state regions.
data data-visualization powerbi powerbi-report powerbi-visuals
Last synced: 25 Feb 2025
https://github.com/imagodata/filter_mate
FilterMate is a Qgis plugin, an everyday companion that allows you to easily filter your vector layers
data exploratory-data-analysis filter geospatial ogr postgis qgis qgis-plugin qgis3 qgis3-plugin spatialite sql vector-database
Last synced: 29 Apr 2026
https://github.com/cmudig/mosaic-profiler
A data profiler built with Mosaic
Last synced: 25 Oct 2025
https://github.com/sneels/parkds
Connect all your Data Sources via 1 process (Cross-Domain + Single-Domain)
cross-domain data database datasource datasources javascript source
Last synced: 24 Feb 2026
https://github.com/0xdir/htcds_dart
Human Trafficking Case Data Standard (HTCDS v0.2) objects, for easy creation, storage and transmission of case data related to human trafficking.
data humanitarian schema standards
Last synced: 24 Oct 2025
https://github.com/ryanmorr/fastmap
Accelerated hash maps
data hashmap javascript map performance
Last synced: 10 Oct 2025
https://github.com/udityamerit/python-librearies-for-data-science
Python libraries for data science enable efficient data manipulation, analysis, and modeling. Key libraries include NumPy for numerical computing, pandas for data handling, Matplotlib for visualization, Scikit-learn for machine learning, TensorFlow for deep learning, and BeautifulSoup/requests for web scraping. These libraries simplify complex data
beautifulsoup data data-science data-science-libraries machine-learning matplotlib numpy pandas requests scikit-learn scikitlearn-machine-learning tensorflow
Last synced: 06 Feb 2026
https://github.com/tomdoestech/website-scraping-example
data node-js nodejs scraping scraping-websites
Last synced: 16 Mar 2025
https://github.com/t3v/t3v_datamapper
The data mapper extension of TYPO3voilà.
data database datamapper extension laravel mapper t3v typo3 typo3-cms-extension typo3-extension typo3voila
Last synced: 27 Jan 2026
https://github.com/legopitstop/addons
All legopitstop's Bedrock add-ons in one place.
add-on assets behaviorpack data hacktoberfest minecraft mods modtoberfest resroucepack vanilla
Last synced: 06 Feb 2026
https://github.com/binarybardakshat/suryanayan
Suryanayan AI is a project aimed at using drone technology and artificial intelligence for monitoring and detecting issues in solar panels. This project is inspired by the Indian government's initiative to promote solar energy by providing subsidies on solar panels.
Last synced: 10 Oct 2025
https://github.com/geopython/pygeoapi-examples
Example pygeoapi deployment patterns and configurations
api data geospatial ogc ogc-api osgeo pygeoapi
Last synced: 11 Oct 2025
https://github.com/fcakyon/earth2-scraper
Up-to-date earth2.io data
data earth earth2 earth2io javascript json json-api prices-per-tile python scraper
Last synced: 09 May 2026
https://github.com/cicerops/monitoring-check-grafana
Monitor a Grafana datasource against data becoming stale to detect data loss or other dropout conditions.
data database freshness grafana grafana-datasource icinga2 icinga2-plugin influxdb monitoring stale
Last synced: 08 May 2026
https://github.com/chompfoods/sdk-csharp
C# SDK for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.
api branded chomp csharp csharp-sdk data database dll food grocery ingredients nuget nutrition raw recipes recipes-api restsharp sdk swagger
Last synced: 06 May 2026
https://github.com/ahmedshahriar/eda_basketball
basketball basketball-stats data data-science data-visualization pandas python python3 streamlit
Last synced: 04 May 2026
https://github.com/jderstd/spec
A standard for JSON responses
data error jder json response specification structure
Last synced: 13 May 2026
https://github.com/deveel/kista
Implementations of the repository pattern for .NET to support the domain-driven modeling
clean-architechture csharp data dotnet-core dotnetcore efcore entity entity-framework entity-manager layered-architecture mongodb repository repository-manager repository-pattern
Last synced: 08 Jun 2026
https://github.com/paladique/azuresample-guestbook
Guestbook using MySQL and Cosmos DB on Azure
cosmosdb data mysql spa websockets
Last synced: 30 Apr 2026
https://github.com/anicolaspp/mapr-data-gen
Data generator for MapR Data Platform
data mapr mapr-db mapr-es mapr-streams maprdb parquet scala spark
Last synced: 29 Apr 2026
https://github.com/tooleks/laravel-presenter
The Laravel Presenter Composer Package
collection composer data entity laravel mapper mapping php presenter representation view
Last synced: 28 Apr 2026
https://github.com/yazaabed/at-who-angular
wrapper for At.js that add mentions autocomplete to your application with angular component for using it on any AngularJS projects
angular-components angularjs autocomplete components data modules webpack wrapper
Last synced: 28 Apr 2026
https://github.com/luminovrym/pbo-biodata
Simulasi Cara Input Data dengan OOP
Last synced: 18 Jun 2026
https://github.com/mongodb-developer/rocket-analytics
Learn how the various components of MongoDB's Developer Data Platform (DDP) can support app-driven and traditional analytics in real-time without duplicating data to other data stores. This demo was created for AWS re:Invent 2022 and presented at the MongoDB booth area at the Venetian expo hall.
data federation lucene lucenesearch mongodb s3 search sql
Last synced: 28 Apr 2026
https://github.com/justjavac/deno_data_dir
Returns the path to the user's data directory.
data deno deno-module deno-modules directory
Last synced: 27 Apr 2026
https://github.com/kanugurajesh/firebase-data
Adding data to firebase store
data firebase firebase-database python
Last synced: 27 Apr 2026
https://github.com/anthonykrivonos/ts-algo-masterclass
👾 Giant TypeScript algorithm and data structure masterclass to be constantly updated with important CS concepts.
algorithm class-project computer concepts data data-structures fundamentals giant library masterclass science structures typescript
Last synced: 11 May 2026
https://github.com/mukhopadhyay/opendata
Open Data ❤️
data data-science datasets deep-learning kaggle kaggle-dataset machine-learning open-source opendata
Last synced: 25 Apr 2026
https://github.com/joamag/pandas
Loads of pandas data from China with awesome data
data data-analysis jupyter notebook pandas
Last synced: 25 Apr 2026
https://github.com/corentinb/txtoredis
:fire: Push each line of a text file, to a Redis set
data datascience dataset go golang redis set
Last synced: 24 Apr 2026
https://github.com/d2hydro/fewspy
A Python API for the Deltares FEWS PI REST Web Service
data geopandas hydrology hydrometrics pandas python
Last synced: 23 Apr 2026
https://github.com/healthyregions/oeps
Opioid Environment Policy Scan - data explorer and backend management
data data-visualization public-health
Last synced: 21 Apr 2026
https://github.com/adanos-software/free-ticker-database
Free global stock & ETF ticker reference database - 50k+ tickers, 66 exchanges, 81 countries
Last synced: 10 May 2026
https://github.com/lilingxi01/bloark
Blocks Architecture (BloArk) project package for building Blocks-0 dataset and way beyond.
architecture bloark data revision-based
Last synced: 05 Apr 2026
https://github.com/metapsy-project/data-gambling-psyctr
Database of psychological interventions for problem gambling and gambling disorder.
Last synced: 02 Apr 2026
https://github.com/d8a-tech/d8a
A data collection service fully compatible with GA4 tracking protocols. Ingest into ClickHouse or BigQuery database while maintaining complete control over your data.
bigquery clickhouse data ga4 tracker
Last synced: 10 Apr 2026
https://github.com/robertmyles/riscobrasil
An R package to download 'Brazil Risk' data :chart_with_upwards_trend:
Last synced: 08 Apr 2025
https://github.com/ahmetfurkandemir/sahibinden-data-engineering-technical-case-study
Sahibinden.com Data Engineering Technical Case Study
case-study data data-engineering debezium docker flink kafka mongodb mysql pyflink pyspark python sahibinden spark
Last synced: 03 Mar 2026
https://github.com/muhammadibrahim313/datavue
"DataVue" is an AI-powered data science platform that simplifies EDA, visualizations, and data cleaning. It offers personalized learning, real-time collaboration, and strong data security for all users.
analytics auto chatbot data data-science data-visualization eda education genai groq groq-api llama3 machine-learning python streamlit
Last synced: 10 Apr 2025
https://github.com/louisbrulenaudet/legalkit-pipeline
Publication pipeline for French legal codes on 🤗 Datasets from LegiFrance with concurrent upload and dynamic REAMDE.md.
data datasets huggingface huggingface-datasets legal legaltech legifrance open-source parquet piste-api python
Last synced: 17 Mar 2025
https://github.com/geo2france/odema-dashboard
Tableaux de bord thématiques Odema
application client-side dashboard data echarts maplibre odema react waste
Last synced: 05 Feb 2026
https://github.com/abrudz/parsing
Dyalog APL expressions to parse common and unusual data formats from text files
apl csv data data-format dyalog-apl dyalogapl parsing
Last synced: 20 Mar 2026
https://github.com/woctezuma/steam-reviews-data
Data available to compute statistics of Steam reviews.
Last synced: 19 Mar 2026
https://github.com/huangcongqing/ranking-list
数据!important | 各种排行,榜单数据汇总 数据为王的时代 Data
Last synced: 15 Feb 2026
https://github.com/platob/yggdrasil
arrow data databricks pandas polars spark sql
Last synced: 02 Jun 2026
https://github.com/gadenbuie/crantrack
Hourly snapshots of CRAN's incoming packages folder
Last synced: 12 Mar 2026
https://github.com/qeeqbox/data-compliance
Data compliance is the process of following various regulations and standards to ensure that sensitive digital assets (data) are guarded against loss, theft, and misuse
compliance data data-compliance infosecsimplified qeeqbox
Last synced: 19 Mar 2026
https://github.com/fforres/webpack-plugin-dx-metrics
Webpack plugin to track webpack behaviour in datadog
data datadog developer-experience typescript visualization webpack
Last synced: 13 Feb 2026
https://github.com/onaio/gisida-react
React Dashboard library for Gisida.
dashboard data gisida map react visualization
Last synced: 28 Apr 2025
https://github.com/yashika-malhotra/exploratory-data-analysis-for-multinational-retail-corporation
Analysis via CLT and Visualization on Multinational Retail Corporation's data to provide insights and recommendations to improve their userbase.
colab-notebook data jupyter-notebook matplotlib numpy pandas python seaborn stats
Last synced: 11 Feb 2026
https://github.com/enes9103/039_react_task_tracker-json_server
api axios-react css3 data javascript json-server react reactjs responsive todoapp
Last synced: 11 Feb 2026
https://github.com/rikurauhala/insights
Visualize your coding journey
cypress data data-visualization github github-api javascript material-ui octokit react statistics typescript vite
Last synced: 11 Feb 2026
https://github.com/bluegreen-labs/oneflux_containers
Containerized (docker) versions of the ONEFlux processing pipeline
data ecosystem fluxes micrometeorology processing
Last synced: 07 Oct 2025