data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-07-02 00:07:45 UTC
- JSON Representation
https://github.com/umitkaanusta/smol-elt
a smol elt (not etl) pipeline for smol tasks
analytics automation aws aws-sns data data-engineering data-pipeline elt etl google-sheets pandas pipeline python spreadsheet web-scraping
Last synced: 10 May 2026
https://github.com/newrelic-experimental/newrelic-java-camel
Instrumentation of the New Relic Java Agent for the Camel framework
camel camel-jms data instrumentation java nrlabs nrlabs-data nrlabs-java-verify nrlabs-odp observability-data
Last synced: 10 Apr 2025
https://github.com/cttynul/elsoftware
⚽ Vinci al Fantacalcio usando librerie di pandas, facendo credere a tutti che tu stia usando il machine learning
data data-science fantacalcio machine-learning pandas
Last synced: 30 Jun 2026
https://github.com/deveel/deveel.repository
Implementations of the repository pattern for .NET to support the domain-driven modeling
clean-architechture csharp data dotnet-core dotnetcore efcore entity entity-framework entity-manager layered-architecture mongodb repository repository-manager repository-pattern
Last synced: 22 Apr 2025
https://github.com/nix1707/webscrapper-browserextension
Scraper Master is a Chrome extension for effortless web data extraction. Built with React, TypeScript, and the Chrome Scripting API, it ensures efficient, high-quality, and seamless scraping. Utilizing HTML and CSS, ScrapeEase offers a clean, responsive design. Simplify your data collection with Scraper Master.
chrome-extension chrome-extensions css data frontend html html-parser modern parser parsing react scraper scraping typescript ui validation webparser webparsing webscraping
Last synced: 21 Jun 2025
https://github.com/andygol/yamap
Yamap Ain't Map – deployment of OSM infrastructure project inspired by osm-seed
api data extract geo-data map openstreetmap osm
Last synced: 24 Jun 2025
https://github.com/pseudomuto/iceberg-rest-go
A Go client library for working with Iceberg Rest catalogs
Last synced: 25 Jan 2026
https://github.com/themitosan/grpp
GRPP is a simple tool written in TS that helps preserving git repositories.
cli data git grpp linux preservation project repo repository
Last synced: 15 Jul 2025
https://github.com/DataHerb/dataherb-python
Python Package for DataHerb: create, search, and load datasets.
data data-analysis data-mining database dataset python
Last synced: 08 May 2025
https://github.com/lafayettegabe/nlp-resume-extraction
📝 NER (Named Entity Recognition) project aimed at solving the problem of manually shortlisting resumes by automating the process. This project proposes using NLP techniques and NER model to classify and extract relevant entities from resumes such as person name, college name, academics information, relevant experiences, skill set, etc.
big-data data data-analysis data-science eda ner nlp resume-extractor
Last synced: 03 Apr 2025
https://github.com/biglocalnews/upload-files
Upload comma-delimited files to biglocalnews.org in your GitHub Action
action actions archiving csv data data-journalism github-actions journalism news
Last synced: 27 Apr 2026
https://github.com/kawai-senpai/potatodb
PotatoDB is a lightweight, file-based NoSQL database for Python projects, designed for easy setup and use in small-scale applications. Ideal for developers seeking simple data persistence without the complexity of traditional databases.
data database easy-to-use file-based json key-value lightweight nosql nosql-database persistence python simple
Last synced: 23 Oct 2025
https://github.com/simranjeet97/docker_python_flask-dash_app
Docker Image and Container Build for Python Flask/Dash App
data data-science data-structures data-visualization docker docker-compose docker-container docker-image python python-script uwsgi-nginx
Last synced: 07 May 2026
https://github.com/equinor/data-marketplace
Easily find and check out data products
Last synced: 01 May 2025
https://github.com/owsas/open-categories
Open Categorization system, available as a node module
categories categorization categorize data data-structures node open-source typescript yaml
Last synced: 30 Apr 2025
https://github.com/guiferviz/tuberia
Data engineering meets software engineering
data data-engineering expectations pipeline python spark
Last synced: 08 Mar 2026
https://github.com/amethyst-php/invoice
amethyst amethyst-package api data invoice laravel
Last synced: 10 Apr 2025
https://github.com/gianlucatruda/project_sleep
A Quantified Self project in which I use ±40 nights of data to determine what helps and hinders my sleep.
data experiment matplotlib python quantified science self sleep visualization
Last synced: 03 Apr 2025
https://github.com/ikstream/dns-handler
Data collection server for the dalec user collection system
collection dalec data data-collection dns dns-server python python3
Last synced: 13 Mar 2025
https://github.com/muneeb1030/finetune-tiny-llama
Fine-tuning the Tiny Llama model to mimic my professor's writing style using the Llama Factory. The project involves data collection, preprocessing, preparation, fine-tuning, and evaluation.
data data-preparation data-preprocessing finetuning llama-factory llm pymupdf selenium-python spacy tinyllama webscraping
Last synced: 08 Apr 2026
https://github.com/randomgamingdev/mc_block_color_mapper
Python scripts & libraries for generating and mapping the average colors for each of the Minecraft blocks
average average-calculator cli data data-generator documented-api extract extract-data extractor fast minecraft python3 simple small texture texture-pack textures
Last synced: 22 May 2026
https://github.com/abdussattar-70/oop-school-library
The OOP-School-Library project demonstrates the principles of data abstraction, inheritance, encapsulation, and polymorphism, which are fundamental concepts in object-oriented programming(OOP).
abstraction data encapsulation inheritance polymorphism rubocop-configuration ruby
Last synced: 29 Mar 2025
https://github.com/intercloud/gotsgen
Golang Time Series Data Generator
data generator golang library timeseries
Last synced: 20 Jun 2025
https://github.com/yashika-malhotra/data-exploration-and-visualization-for-streaming-platform
Data Analysis and Visualization for streaming platform to provide insights and recommendations to improve their userbase.
colab-notebook data data-visualization jupyter-notebook matplotlib numpy pandas python seaborn
Last synced: 18 Apr 2026
https://github.com/rpidanny/streamline.js
A JavaScript class that reads and processes a stream line-by-line in order.
big-data data data-processing file-stream javascript stream streams typescript
Last synced: 08 Sep 2025
https://github.com/legopitstop/datapacks
All legopitstop's datapacks in one place.
assets data datapack hacktoberfest minecraft mods modtoberfest resroucepack vanilla
Last synced: 03 Jan 2026
https://github.com/jewelzufo/free-tech-learning
A collection of free Tech Courses with Credly Credentials
ai cisco courses credly cybersecurity data data-science ibm ibm-cloud ibm-watson learning-resources tech
Last synced: 16 Feb 2026
https://github.com/dotnet-ad/staticbind
Generated and compiled data binding for .NET (Xamarin.iOS, Xamarin.Android,...)
Last synced: 19 May 2026
https://github.com/v4ss3ur/hierarchicaldatagrid.wpf
A WPF control that mix DataGrid and TreeView functionalities, allowing for hierarchical, recursive data display with expandable nested rows. Ideal for complex data structures in an easy-to-use, MVVM-friendly tabular format.
controls data datagrid hierarchical hierarchical-data mvvm nested nested-objects nested-structures treeview wpf xaml
Last synced: 13 May 2025
https://github.com/charconstpointer/markovbot
PoC markov chain sentence generator, powered by discord for data gathering
bot chain collection data discord markov parsing
Last synced: 16 May 2026
https://github.com/udityamerit/python-librearies-for-data-science
Python libraries for data science enable efficient data manipulation, analysis, and modeling. Key libraries include NumPy for numerical computing, pandas for data handling, Matplotlib for visualization, Scikit-learn for machine learning, TensorFlow for deep learning, and BeautifulSoup/requests for web scraping. These libraries simplify complex data
beautifulsoup data data-science data-science-libraries machine-learning matplotlib numpy pandas requests scikit-learn scikitlearn-machine-learning tensorflow
Last synced: 06 Feb 2026
https://github.com/jmsallan/esdata
A R package to bring Spanish economic databases into the R environment
data datasets ine inflation spain unemployment-data
Last synced: 18 Jan 2026
https://github.com/ahmetfurkandemir/sahibinden-data-engineering-technical-case-study
Sahibinden.com Data Engineering Technical Case Study
case-study data data-engineering debezium docker flink kafka mongodb mysql pyflink pyspark python sahibinden spark
Last synced: 03 Mar 2026
https://github.com/definetlynotai/llm_data
A bunch of very famous repos source code's in python as pure localdocs all in this repo to train CODE AI
c code-examples cpp cuda data data-dum jupyter-notebook llm llm-code llm-datasets programming-data programming-data-sets python3
Last synced: 08 Oct 2025
https://github.com/no-stack-dub-sack/basic-immutable
basic immutable JavaScript objects and arrays, with a small API surface area
data immutable immutable-collections immutable-datastructures immutable-store lodash persistent-data-structure typescript
Last synced: 21 Jan 2026
https://github.com/jderstd/spec
A standard for JSON responses
data error jder json response specification structure
Last synced: 13 May 2026
https://github.com/datalayer/desktop
Ξ 🖥️ Datalayer Destkop.
ai data data-analysis data-science datalayer desktop electron
Last synced: 25 Oct 2025
https://github.com/luminovrym/pbo-biodata
Simulasi Cara Input Data dengan OOP
Last synced: 18 Jun 2026
https://github.com/abrudz/parsing
Dyalog APL expressions to parse common and unusual data formats from text files
apl csv data data-format dyalog-apl dyalogapl parsing
Last synced: 20 Mar 2026
https://github.com/bluegreen-labs/oneflux_containers
Containerized (docker) versions of the ONEFlux processing pipeline
data ecosystem fluxes micrometeorology processing
Last synced: 07 Oct 2025
https://github.com/woctezuma/steam-reviews-data
Data available to compute statistics of Steam reviews.
Last synced: 19 Mar 2026
https://github.com/cnayan/q-server
Gives API for back-end server connectivity; MS SQL Server connector provided.
data database provider q-server query query-engine
Last synced: 09 Oct 2025
https://github.com/anthonykrivonos/ts-algo-masterclass
👾 Giant TypeScript algorithm and data structure masterclass to be constantly updated with important CS concepts.
algorithm class-project computer concepts data data-structures fundamentals giant library masterclass science structures typescript
Last synced: 11 May 2026
https://github.com/onaio/gisida-react
React Dashboard library for Gisida.
dashboard data gisida map react visualization
Last synced: 28 Apr 2025
https://github.com/huangcongqing/ranking-list
数据!important | 各种排行,榜单数据汇总 数据为王的时代 Data
Last synced: 15 Feb 2026
https://github.com/platob/yggdrasil
arrow data databricks pandas polars spark sql
Last synced: 02 Jun 2026
https://github.com/d8a-tech/d8a
A data collection service fully compatible with GA4 tracking protocols. Ingest into ClickHouse or BigQuery database while maintaining complete control over your data.
bigquery clickhouse data ga4 tracker
Last synced: 10 Apr 2026
https://github.com/adanos-software/free-ticker-database
Free global stock & ETF ticker reference database - 50k+ tickers, 66 exchanges, 81 countries
Last synced: 10 May 2026
https://github.com/metapsy-project/data-gambling-psyctr
Database of psychological interventions for problem gambling and gambling disorder.
Last synced: 02 Apr 2026
https://github.com/qeeqbox/data-compliance
Data compliance is the process of following various regulations and standards to ensure that sensitive digital assets (data) are guarded against loss, theft, and misuse
compliance data data-compliance infosecsimplified qeeqbox
Last synced: 19 Mar 2026
https://github.com/ryanmorr/fastmap
Accelerated hash maps
data hashmap javascript map performance
Last synced: 10 Oct 2025
https://github.com/t3v/t3v_datamapper
The data mapper extension of TYPO3voilà.
data database datamapper extension laravel mapper t3v typo3 typo3-cms-extension typo3-extension typo3voila
Last synced: 27 Jan 2026
https://github.com/binarybardakshat/suryanayan
Suryanayan AI is a project aimed at using drone technology and artificial intelligence for monitoring and detecting issues in solar panels. This project is inspired by the Indian government's initiative to promote solar energy by providing subsidies on solar panels.
Last synced: 10 Oct 2025
https://github.com/fforres/webpack-plugin-dx-metrics
Webpack plugin to track webpack behaviour in datadog
data datadog developer-experience typescript visualization webpack
Last synced: 13 Feb 2026
https://github.com/geopython/pygeoapi-examples
Example pygeoapi deployment patterns and configurations
api data geospatial ogc ogc-api osgeo pygeoapi
Last synced: 11 Oct 2025
https://github.com/lilingxi01/bloark
Blocks Architecture (BloArk) project package for building Blocks-0 dataset and way beyond.
architecture bloark data revision-based
Last synced: 05 Apr 2026
https://github.com/tomdoestech/website-scraping-example
data node-js nodejs scraping scraping-websites
Last synced: 16 Mar 2025
https://github.com/wahyudesu/datacamp
data data-science datacamp pandas python
Last synced: 09 May 2026
https://github.com/gadenbuie/crantrack
Hourly snapshots of CRAN's incoming packages folder
Last synced: 12 Mar 2026
https://github.com/yashika-malhotra/exploratory-data-analysis-for-multinational-retail-corporation
Analysis via CLT and Visualization on Multinational Retail Corporation's data to provide insights and recommendations to improve their userbase.
colab-notebook data jupyter-notebook matplotlib numpy pandas python seaborn stats
Last synced: 11 Feb 2026
https://github.com/enes9103/039_react_task_tracker-json_server
api axios-react css3 data javascript json-server react reactjs responsive todoapp
Last synced: 11 Feb 2026
https://github.com/rikurauhala/insights
Visualize your coding journey
cypress data data-visualization github github-api javascript material-ui octokit react statistics typescript vite
Last synced: 11 Feb 2026
https://github.com/healthyregions/oeps
Opioid Environment Policy Scan - data explorer and backend management
data data-visualization public-health
Last synced: 21 Apr 2026
https://github.com/stdlib-js/datasets-spache-revised
A list of simple American-English words (revised Spache).
american complex comprehension data dataset datasets javascript node node-js nodejs primary readability readable reading school simple simplicity spache stdlib words
Last synced: 12 Oct 2025
https://github.com/d2hydro/fewspy
A Python API for the Deltares FEWS PI REST Web Service
data geopandas hydrology hydrometrics pandas python
Last synced: 23 Apr 2026
https://github.com/corentinb/txtoredis
:fire: Push each line of a text file, to a Redis set
data datascience dataset go golang redis set
Last synced: 24 Apr 2026
https://github.com/iondv/metrics
IONDV. Framework application: Metrics is to collect and show the metrics data.
collecting data data-analysis iondv iondv-app metrics
Last synced: 10 Feb 2026
https://github.com/reubano/ckanutils
A Python library for interacting with CKAN instances
Last synced: 10 Feb 2026
https://github.com/andyfratello/dbd
🗄️ Exercicis de Disseny de Bases de Dades (DBD) Q2 - UPC FIB
data database dbd dbd-fib dbeaver fib-upc nosql nosql-database oracle sql sql-database
Last synced: 10 Feb 2026
https://github.com/geo2france/odema-dashboard
Tableaux de bord thématiques Odema
application client-side dashboard data echarts maplibre odema react waste
Last synced: 05 Feb 2026
https://github.com/floriancassayre/nicknames-datasets
Open source nicknames sets with informations about the data origin(s).
Last synced: 08 Feb 2026
https://github.com/joamag/pandas
Loads of pandas data from China with awesome data
data data-analysis jupyter notebook pandas
Last synced: 25 Apr 2026
https://github.com/louisbrulenaudet/legalkit-pipeline
Publication pipeline for French legal codes on 🤗 Datasets from LegiFrance with concurrent upload and dynamic REAMDE.md.
data datasets huggingface huggingface-datasets legal legaltech legifrance open-source parquet piste-api python
Last synced: 17 Mar 2025
https://github.com/muhammadibrahim313/datavue
"DataVue" is an AI-powered data science platform that simplifies EDA, visualizations, and data cleaning. It offers personalized learning, real-time collaboration, and strong data security for all users.
analytics auto chatbot data data-science data-visualization eda education genai groq groq-api llama3 machine-learning python streamlit
Last synced: 10 Apr 2025
https://github.com/mukhopadhyay/opendata
Open Data ❤️
data data-science datasets deep-learning kaggle kaggle-dataset machine-learning open-source opendata
Last synced: 25 Apr 2026
https://github.com/vrm-piyush/python-projects
Open source Python Projects. Feel Free to contribute!
data dataanalysis games open-source pygame-games python python-app
Last synced: 26 Feb 2026
https://github.com/kanugurajesh/firebase-data
Adding data to firebase store
data firebase firebase-database python
Last synced: 27 Apr 2026
https://github.com/tosun-si/world-cup-qatar-team-stats-kotlin-midgard
This application shows a full Apache Beam pipeline with Kotlin and Midgard library. The use case works on the last Qatar FIFA world cup data and calculate players statistics per team. This application will be presented at Beam Summit 2023 in New York
apache-beam beam-summit data kotlin midgard world-cup-2022
Last synced: 01 Feb 2026
https://github.com/justjavac/deno_data_dir
Returns the path to the user's data directory.
data deno deno-module deno-modules directory
Last synced: 27 Apr 2026
https://github.com/mongodb-developer/rocket-analytics
Learn how the various components of MongoDB's Developer Data Platform (DDP) can support app-driven and traditional analytics in real-time without duplicating data to other data stores. This demo was created for AWS re:Invent 2022 and presented at the MongoDB booth area at the Venetian expo hall.
data federation lucene lucenesearch mongodb s3 search sql
Last synced: 28 Apr 2026
https://github.com/yazaabed/at-who-angular
wrapper for At.js that add mentions autocomplete to your application with angular component for using it on any AngularJS projects
angular-components angularjs autocomplete components data modules webpack wrapper
Last synced: 28 Apr 2026
https://github.com/tooleks/laravel-presenter
The Laravel Presenter Composer Package
collection composer data entity laravel mapper mapping php presenter representation view
Last synced: 28 Apr 2026
https://github.com/desultory/pycpio
Python library for CPIO manipulation
cpio cpio-archives data initramfs pypi-package python python-3 python3
Last synced: 04 Feb 2026
https://github.com/anicolaspp/mapr-data-gen
Data generator for MapR Data Platform
data mapr mapr-db mapr-es mapr-streams maprdb parquet scala spark
Last synced: 29 Apr 2026
https://github.com/mutasim77/dbt-analytics
🍉 Repo for analytics engineering with dbt, transforming raw data into actionable insights.
big-query data data-analysis dbt warehouse
Last synced: 25 Feb 2026
https://github.com/paladique/azuresample-guestbook
Guestbook using MySQL and Cosmos DB on Azure
cosmosdb data mysql spa websockets
Last synced: 30 Apr 2026