data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-06-28 00:07:46 UTC
- JSON Representation
https://github.com/huseyincenik/data_science
Data Science materials
data data-science data-structures data-visualization dataanalysis dataengineering datapreparation dataprocessing datascience dataset time-series time-series-analysis timeline timeseries timeseries-analysis timeseriesforecasting
Last synced: 25 Jul 2025
https://github.com/thamerh/web-scraper-with-node.js-and-cheerio
used simple exemple how Scraper data from Build a Web Scraper with Node.js and Cheerio
cheer data expressjs nodejs scarper webscraping
Last synced: 08 Apr 2026
https://github.com/yessasvini23/machine_learning_specialization_deeplearning.ai
Contains all course modules, exercises and notes of ML Specialization by Andrew Ng, Stanford Un. and DeepLearning.ai in Coursera
andrew-ng andrew-ng-course andrew-ng-machine-learning classification data data-science deep-learning machine-learning machine-learning-algorithms neural-network nlp-machine-learning regression rnn-tensorflow
Last synced: 18 May 2026
https://github.com/muneeb1030/finetune-tiny-llama
Fine-tuning the Tiny Llama model to mimic my professor's writing style using the Llama Factory. The project involves data collection, preprocessing, preparation, fine-tuning, and evaluation.
data data-preparation data-preprocessing finetuning llama-factory llm pymupdf selenium-python spacy tinyllama webscraping
Last synced: 08 Apr 2026
https://github.com/intercloud/gotsgen
Golang Time Series Data Generator
data generator golang library timeseries
Last synced: 20 Jun 2025
https://github.com/bohnacker/data-manipulation
Some Javascript and Python scripts to manipulate (large) CSV files and JSON data.
data data-mining data-structures javascript python
Last synced: 18 May 2026
https://github.com/nix1707/webscrapper-browserextension
Scraper Master is a Chrome extension for effortless web data extraction. Built with React, TypeScript, and the Chrome Scripting API, it ensures efficient, high-quality, and seamless scraping. Utilizing HTML and CSS, ScrapeEase offers a clean, responsive design. Simplify your data collection with Scraper Master.
chrome-extension chrome-extensions css data frontend html html-parser modern parser parsing react scraper scraping typescript ui validation webparser webparsing webscraping
Last synced: 21 Jun 2025
https://github.com/simranjeet97/docker_python_flask-dash_app
Docker Image and Container Build for Python Flask/Dash App
data data-science data-structures data-visualization docker docker-compose docker-container docker-image python python-script uwsgi-nginx
Last synced: 07 May 2026
https://github.com/deveripon/assignment-6-assets
This assets is only for Reactive Accelarator Batch 2 - Assignment 6
Last synced: 30 Apr 2025
https://github.com/ultreon/ubo
NBT inspired data I/O. Made for games.
api binary-data data data-storage file-type game-data io library ubo
Last synced: 16 Jun 2025
https://github.com/priyanka7411/dataspark-electronics-retail-analytics
DataSpark is a data analysis project using Python, SQL, and Power BI to analyze global electronics retail sales, focusing on customer behavior, sales performance, product profitability, and store performance to optimize sales strategies.
analytics-providers business-intelligence customer-segmentation data data-analysis electronics-industry global-sales pandas powerbi powerbi-visuals product-profitability python retail-analytics sales-performance sql store-analysis visualization
Last synced: 10 Jul 2025
https://github.com/siongui/gopaliwordvfs
Serve JSON data of Pali words, embedded in Go code
data go golang pali vfs virtual-file-system virtualfilesystem
Last synced: 04 Apr 2025
https://github.com/DataHerb/dataherb-python
Python Package for DataHerb: create, search, and load datasets.
data data-analysis data-mining database dataset python
Last synced: 08 May 2025
https://github.com/oliver021/ecmalinq
The linq runtime and support to typescript/javascript ecosystem
collection data iterable iteration javascript library linq linq-expressions nodejs query stream stream-data structure typescript
Last synced: 13 May 2025
https://github.com/charconstpointer/markovbot
PoC markov chain sentence generator, powered by discord for data gathering
bot chain collection data discord markov parsing
Last synced: 16 May 2026
https://github.com/lafayettegabe/nlp-resume-extraction
📝 NER (Named Entity Recognition) project aimed at solving the problem of manually shortlisting resumes by automating the process. This project proposes using NLP techniques and NER model to classify and extract relevant entities from resumes such as person name, college name, academics information, relevant experiences, skill set, etc.
big-data data data-analysis data-science eda ner nlp resume-extractor
Last synced: 03 Apr 2025
https://github.com/pseudomuto/iceberg-rest-go
A Go client library for working with Iceberg Rest catalogs
Last synced: 25 Jan 2026
https://github.com/ljharb/define-data-property
Define a data property on an object. Will fall back to assignment in an engine without descriptors.
accessor configurable data define ecmascript enumerable javascript object property writable
Last synced: 13 Apr 2025
https://github.com/vasturiano/data-bind-mapper
Bind data arrays with any type of JS objects
bind data digest joins mapper performance
Last synced: 26 Jul 2025
https://github.com/rpidanny/streamline.js
A JavaScript class that reads and processes a stream line-by-line in order.
big-data data data-processing file-stream javascript stream streams typescript
Last synced: 08 Sep 2025
https://github.com/edgardleal/thanos-for-data
A Thanos implementation to restore the balance of your data
Last synced: 15 Jun 2025
https://github.com/no-stack-dub-sack/basic-immutable
basic immutable JavaScript objects and arrays, with a small API surface area
data immutable immutable-collections immutable-datastructures immutable-store lodash persistent-data-structure typescript
Last synced: 21 Jan 2026
https://github.com/onaio/gisida-react
React Dashboard library for Gisida.
dashboard data gisida map react visualization
Last synced: 28 Apr 2025
https://github.com/tomdoestech/website-scraping-example
data node-js nodejs scraping scraping-websites
Last synced: 16 Mar 2025
https://github.com/robertmyles/riscobrasil
An R package to download 'Brazil Risk' data :chart_with_upwards_trend:
Last synced: 08 Apr 2025
https://github.com/muhammadibrahim313/datavue
"DataVue" is an AI-powered data science platform that simplifies EDA, visualizations, and data cleaning. It offers personalized learning, real-time collaboration, and strong data security for all users.
analytics auto chatbot data data-science data-visualization eda education genai groq groq-api llama3 machine-learning python streamlit
Last synced: 10 Apr 2025
https://github.com/louisbrulenaudet/legalkit-pipeline
Publication pipeline for French legal codes on 🤗 Datasets from LegiFrance with concurrent upload and dynamic REAMDE.md.
data datasets huggingface huggingface-datasets legal legaltech legifrance open-source parquet piste-api python
Last synced: 17 Mar 2025
https://github.com/gadenbuie/crantrack
Hourly snapshots of CRAN's incoming packages folder
Last synced: 12 Mar 2026
https://github.com/bluegreen-labs/oneflux_containers
Containerized (docker) versions of the ONEFlux processing pipeline
data ecosystem fluxes micrometeorology processing
Last synced: 07 Oct 2025
https://github.com/datalayer/desktop
Ξ 🖥️ Datalayer Destkop.
ai data data-analysis data-science datalayer desktop electron
Last synced: 25 Oct 2025
https://github.com/jmsallan/esdata
A R package to bring Spanish economic databases into the R environment
data datasets ine inflation spain unemployment-data
Last synced: 18 Jan 2026
https://github.com/definetlynotai/llm_data
A bunch of very famous repos source code's in python as pure localdocs all in this repo to train CODE AI
c code-examples cpp cuda data data-dum jupyter-notebook llm llm-code llm-datasets programming-data programming-data-sets python3
Last synced: 08 Oct 2025
https://github.com/cnayan/q-server
Gives API for back-end server connectivity; MS SQL Server connector provided.
data database provider q-server query query-engine
Last synced: 09 Oct 2025
https://github.com/ryanmorr/fastmap
Accelerated hash maps
data hashmap javascript map performance
Last synced: 10 Oct 2025
https://github.com/t3v/t3v_datamapper
The data mapper extension of TYPO3voilà.
data database datamapper extension laravel mapper t3v typo3 typo3-cms-extension typo3-extension typo3voila
Last synced: 27 Jan 2026
https://github.com/binarybardakshat/suryanayan
Suryanayan AI is a project aimed at using drone technology and artificial intelligence for monitoring and detecting issues in solar panels. This project is inspired by the Indian government's initiative to promote solar energy by providing subsidies on solar panels.
Last synced: 10 Oct 2025
https://github.com/geopython/pygeoapi-examples
Example pygeoapi deployment patterns and configurations
api data geospatial ogc ogc-api osgeo pygeoapi
Last synced: 11 Oct 2025
https://github.com/stdlib-js/datasets-spache-revised
A list of simple American-English words (revised Spache).
american complex comprehension data dataset datasets javascript node node-js nodejs primary readability readable reading school simple simplicity spache stdlib words
Last synced: 12 Oct 2025
https://github.com/geo2france/odema-dashboard
Tableaux de bord thématiques Odema
application client-side dashboard data echarts maplibre odema react waste
Last synced: 05 Feb 2026
https://github.com/critocrito/data-scores-in-the-uk
Investigate the uses of data analytics and algorithms in public services in the UK.
clojure data data-investigation data-preservation javascript social-sciences sugarcube uk
Last synced: 18 Oct 2025
https://github.com/legopitstop/addons
All legopitstop's Bedrock add-ons in one place.
add-on assets behaviorpack data hacktoberfest minecraft mods modtoberfest resroucepack vanilla
Last synced: 06 Feb 2026
https://github.com/udityamerit/python-librearies-for-data-science
Python libraries for data science enable efficient data manipulation, analysis, and modeling. Key libraries include NumPy for numerical computing, pandas for data handling, Matplotlib for visualization, Scikit-learn for machine learning, TensorFlow for deep learning, and BeautifulSoup/requests for web scraping. These libraries simplify complex data
beautifulsoup data data-science data-science-libraries machine-learning matplotlib numpy pandas requests scikit-learn scikitlearn-machine-learning tensorflow
Last synced: 06 Feb 2026
https://github.com/0xdir/htcds_dart
Human Trafficking Case Data Standard (HTCDS v0.2) objects, for easy creation, storage and transmission of case data related to human trafficking.
data humanitarian schema standards
Last synced: 24 Oct 2025
https://github.com/sneels/parkds
Connect all your Data Sources via 1 process (Cross-Domain + Single-Domain)
cross-domain data database datasource datasources javascript source
Last synced: 24 Feb 2026
https://github.com/cmudig/mosaic-profiler
A data profiler built with Mosaic
Last synced: 25 Oct 2025
https://github.com/imagodata/filter_mate
FilterMate is a Qgis plugin, an everyday companion that allows you to easily filter your vector layers
data exploratory-data-analysis filter geospatial ogr postgis qgis qgis-plugin qgis3 qgis3-plugin spatialite sql vector-database
Last synced: 29 Apr 2026
https://github.com/jsdhami/lightning-research-data
Lightning Research 🌩
analysis data lightning python research visualization
Last synced: 28 Jan 2026
https://github.com/jsdhami/python-for-research
"Python-For-Research" Event Organized By Tri-Chandra Research Group, Ghantaghar, Kathmandu
analysis colab data jupyter matplotlib numpy panda physics python research visualization
Last synced: 27 Oct 2025
https://github.com/carpentries-incubator/indigenous-data-sovereignty
Introduces the concepts and framework of Indigenous Data Sovereignty and Governance.
Last synced: 24 Jan 2026
https://github.com/hadro/brewery-guides
The data for guides to breweries across the United States from 1896 to 1918
brewers brewery-guides brewing brewing-history data dataset digital-collections digital-humanities hocr nypl open-data
Last synced: 16 Mar 2026
https://github.com/olgaele/playing-with-julia
Playing with data!
data data-analysis data-science julia statistics
Last synced: 19 Apr 2026
https://github.com/karlyndiary/imdb-data-analysis
Data Analysis on the IMDb Dataset using Python & Power BI.
data data-preprocessing data-visualization datacleaning dataset jupyter-lab power-bi powerbi-dashboards powerbi-report powerbi-visuals python
Last synced: 15 Apr 2026
https://github.com/josechirif/reviews-and-satisfaction-analysis-of-airbnb-brazil-and-mexico-from-june-2010-to-february-2021
This project analyzes the reviews and satisfaction of customers who used AirBnB services. It also studies if there is a relationship between another variables.
data data-analysis data-visualization powerbi sql-server
Last synced: 25 Feb 2026
https://github.com/open-i18n/data-unicode-cldr
Git mirror for Unicode Common Locale Data Repository (CLDR) data
cldr data open-i18n unicode unicode-consortium
Last synced: 07 Feb 2026
https://github.com/mutasim77/dbt-analytics
🍉 Repo for analytics engineering with dbt, transforming raw data into actionable insights.
big-query data data-analysis dbt warehouse
Last synced: 25 Feb 2026
https://github.com/desultory/pycpio
Python library for CPIO manipulation
cpio cpio-archives data initramfs pypi-package python python-3 python3
Last synced: 04 Feb 2026
https://github.com/tosun-si/world-cup-qatar-team-stats-kotlin-midgard
This application shows a full Apache Beam pipeline with Kotlin and Midgard library. The use case works on the last Qatar FIFA world cup data and calculate players statistics per team. This application will be presented at Beam Summit 2023 in New York
apache-beam beam-summit data kotlin midgard world-cup-2022
Last synced: 01 Feb 2026
https://github.com/vrm-piyush/python-projects
Open source Python Projects. Feel Free to contribute!
data dataanalysis games open-source pygame-games python python-app
Last synced: 26 Feb 2026
https://github.com/floriancassayre/nicknames-datasets
Open source nicknames sets with informations about the data origin(s).
Last synced: 08 Feb 2026
https://github.com/andyfratello/dbd
🗄️ Exercicis de Disseny de Bases de Dades (DBD) Q2 - UPC FIB
data database dbd dbd-fib dbeaver fib-upc nosql nosql-database oracle sql sql-database
Last synced: 10 Feb 2026
https://github.com/reubano/ckanutils
A Python library for interacting with CKAN instances
Last synced: 10 Feb 2026
https://github.com/iondv/metrics
IONDV. Framework application: Metrics is to collect and show the metrics data.
collecting data data-analysis iondv iondv-app metrics
Last synced: 10 Feb 2026
https://github.com/rikurauhala/insights
Visualize your coding journey
cypress data data-visualization github github-api javascript material-ui octokit react statistics typescript vite
Last synced: 11 Feb 2026
https://github.com/enes9103/039_react_task_tracker-json_server
api axios-react css3 data javascript json-server react reactjs responsive todoapp
Last synced: 11 Feb 2026
https://github.com/yashika-malhotra/exploratory-data-analysis-for-multinational-retail-corporation
Analysis via CLT and Visualization on Multinational Retail Corporation's data to provide insights and recommendations to improve their userbase.
colab-notebook data jupyter-notebook matplotlib numpy pandas python seaborn stats
Last synced: 11 Feb 2026
https://github.com/fforres/webpack-plugin-dx-metrics
Webpack plugin to track webpack behaviour in datadog
data datadog developer-experience typescript visualization webpack
Last synced: 13 Feb 2026
https://github.com/qeeqbox/data-compliance
Data compliance is the process of following various regulations and standards to ensure that sensitive digital assets (data) are guarded against loss, theft, and misuse
compliance data data-compliance infosecsimplified qeeqbox
Last synced: 19 Mar 2026
https://github.com/platob/yggdrasil
arrow data databricks pandas polars spark sql
Last synced: 02 Jun 2026
https://github.com/huangcongqing/ranking-list
数据!important | 各种排行,榜单数据汇总 数据为王的时代 Data
Last synced: 15 Feb 2026
https://github.com/woctezuma/steam-reviews-data
Data available to compute statistics of Steam reviews.
Last synced: 19 Mar 2026
https://github.com/abrudz/parsing
Dyalog APL expressions to parse common and unusual data formats from text files
apl csv data data-format dyalog-apl dyalogapl parsing
Last synced: 20 Mar 2026
https://github.com/ahmetfurkandemir/sahibinden-data-engineering-technical-case-study
Sahibinden.com Data Engineering Technical Case Study
case-study data data-engineering debezium docker flink kafka mongodb mysql pyflink pyspark python sahibinden spark
Last synced: 03 Mar 2026
https://github.com/d8a-tech/d8a
A data collection service fully compatible with GA4 tracking protocols. Ingest into ClickHouse or BigQuery database while maintaining complete control over your data.
bigquery clickhouse data ga4 tracker
Last synced: 10 Apr 2026
https://github.com/metapsy-project/data-gambling-psyctr
Database of psychological interventions for problem gambling and gambling disorder.
Last synced: 02 Apr 2026
https://github.com/lilingxi01/bloark
Blocks Architecture (BloArk) project package for building Blocks-0 dataset and way beyond.
architecture bloark data revision-based
Last synced: 05 Apr 2026
https://github.com/healthyregions/oeps
Opioid Environment Policy Scan - data explorer and backend management
data data-visualization public-health
Last synced: 21 Apr 2026
https://github.com/d2hydro/fewspy
A Python API for the Deltares FEWS PI REST Web Service
data geopandas hydrology hydrometrics pandas python
Last synced: 23 Apr 2026
https://github.com/corentinb/txtoredis
:fire: Push each line of a text file, to a Redis set
data datascience dataset go golang redis set
Last synced: 24 Apr 2026
https://github.com/joamag/pandas
Loads of pandas data from China with awesome data
data data-analysis jupyter notebook pandas
Last synced: 25 Apr 2026