data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-07-03 00:07:49 UTC
- JSON Representation
https://github.com/nicolasbizzozzero/datagenerator
Randomly generate various commonly used data
data data-generation data-generator data-science
Last synced: 18 Oct 2025
https://github.com/goncaloperes/datavisualization
Here I will share some of my data visualizations using a variety of datasets, technologies and tools.
d3js data dataset datavisualization dataviz ggplot matplotlib rawgraphs seaborn tableau visualization yellowbrick
Last synced: 04 Feb 2026
https://github.com/itrauco/robots
ai, machine learning, and robots...
ai artificial-intelligence automation big-data cloud cloud-engineering data data-engineering data-science data-science-projects m machine machine-learning ml prompts robots
Last synced: 11 Jun 2026
https://github.com/elhariri78/case-study-a-better-smoker-detector
Case Study-A better Smoker Detector
data dataframe evaluation kaggle matplotlib-pyplot numpy pandas pandas-dataframe pandas-python python3 seaborn sklearn
Last synced: 07 Apr 2026
https://github.com/geo-y20/coursera-managment-system
ML and Data Science-based recommendation system
course coursera data data-science data-visualization datacleaning machine-learning mean-square-error recommendation-system
Last synced: 19 Jun 2026
https://github.com/mohnoor94/datasciencefundementalsusingpython
My journey to learn Data Science with Python
data data-analysis data-science data-visualization learning learning-by-doing python python3
Last synced: 19 Jun 2026
https://github.com/sap-samples/security-research-codegraphsmote
Data augmentation strategy that can be applied to code graphs for learning-based vulnerability discovery.
augmentation data detection learning machine research sample security vulnerability
Last synced: 07 Jun 2026
https://github.com/karthikmprakash/github_repos_scraper
A tool to extract names of github repos of any user
automation bs4 data github python repositories requests webscraping
Last synced: 27 Apr 2026
https://github.com/cobluestars/dataherd-raika
"Dataherd-Raika is a library designed to simulate large-scale user behavior datasets. It takes a single user event (like a click or keyword input) and, by applying simple probability distributions and custom variables, expands it into a vast dataset."
big-data data data-generation data-generator data-science front-end javascript machine-learning npm-package simulator statistics typescript user-behavior user-experience
Last synced: 02 Jan 2026
https://github.com/florianwendelborn/metatypes
Monorepo of TypeScript Metadata Definitions (e.g. HTTP Status Codes)
code-generation data datastructures enum http-status-codes jsdoc lerna metadata typescript
Last synced: 27 Jan 2026
https://github.com/gematik/poc-isik-patient-merge
The repository contains a proof of concept (POC). The POC demonstrates how a FHIR subscription can be used to inform about happened merges within the ISIK context.
Last synced: 19 Oct 2025
https://github.com/rubenhortas/python_examples
Examples of Python code and DSA (data structures and algorithms).
algorithm algorithms data dsa examples python python-3 python3 samples snippets structures
Last synced: 03 Oct 2025
https://github.com/labwhatever/leetcode
Collection of LeetCode questions to ace the coding interview!
data data-structures-and-algorithms dsa leetcode-cpp leetcode-solutions structure structure-learning
Last synced: 22 Aug 2025
https://github.com/jtpio/data-playground
Experiments using public APIs and data
Last synced: 28 Apr 2026
https://github.com/asuozzo/medicare-data-analysis
An analysis of Medicare Part D data in Vermont
Last synced: 04 May 2026
https://github.com/joeyism/py-cifar10
This library was created to allow an easy usage of CIFAR 10 DATA. This is a wrapper around the instructions givn on the CIFAR 10 site
cifar cifar-10 cifar10 data machine-learning machinelearning
Last synced: 30 Jul 2025
https://github.com/divithraju/divith-aju-hadoop-pyspark-pipeline
This project demonstrates the creation of a scalable data processing pipeline for handling and analyzing log data from a hypothetical e-commerce platform. Leveraging Hadoop and PySpark, the pipeline is designed to process large volumes of log files, providing meaningful insights into user behavior, system performance, and sales metrics.
apache-hadoop-framework apache-spark bigdata client data database dataengineering dataingestionframework datapreprocessing documentation ecommerce-platform hdfs pipeline project project-repository pyspark python3 software-engineering
Last synced: 27 Jan 2026
https://github.com/oneblack333/pizza_sales_analysis
The project involves transforming raw pizza sales data into actionable business intelligence through analysis and visualization. This enables pizza business owners to make data-driven decisions on inventory, staffing, and marketing, ultimately improving performance and profitability.
data data-structures data-visualization excel mysql powerbi
Last synced: 20 Jun 2026
https://github.com/aidanjuma/ankideckextractor
A CLI tool written in Python that extracts Anki flashcard decks (.apkg) into separate JSON notes and media files. Perfect for developers building custom learning applications or repurposing Anki content programmatically.
anki apkg cli data decompression extraction flashcards learning python zip
Last synced: 29 Apr 2026
https://github.com/olamide100/capstone-project-llm-zoomcamp
Comparative Guide Assistant
argocd data dataengineering docker grafana kubernetes llm-agent mlops-workflow rag strreamlit
Last synced: 14 Feb 2026
https://github.com/v-mayya/python-sales-data-analysis
Group project with another team member held by CFG to conduct spreadsheet data analysis of fake sales data using Python
analysis data matplotlib numpy python
Last synced: 29 Apr 2026
https://github.com/ispyhumanfly/prowler
Query the web, extract data from the results, and transform that data into a format you can use.
ai analytics business cryptocurrency data extract-data machine-learning mining scraping web
Last synced: 06 Sep 2025
https://github.com/jorgeatgu/pqnvl
candi-DATOS
candi-datos data data-viz elections elections-spain poletika political spain
Last synced: 20 Jun 2026
https://github.com/chrnthnkmutt/theartofstatistic_python
This repository is implemented from David Spiegelhalter's The Art of Statistics Book, for making Python Visualization
data data-science data-visualization machine-learning statistics
Last synced: 08 Jun 2026
https://github.com/stdlib-js/ndarray-base-assert-is-complex-floating-point-data-type
Test if an input value is a supported ndarray complex-valued floating-point data type.
array assert base check data dtype is javascript multidimensional ndarray node node-js nodejs stdlib test types util utilities utility utils
Last synced: 08 Mar 2026
https://github.com/jackokring/www
Generic www flask server with phinka module
compression data flask phinka python
Last synced: 16 Jan 2026
https://github.com/avto-dev/static-references-data
Data for static references
Last synced: 05 Oct 2025
https://github.com/lakecountryhuntclub/dnr-map-data-model
Data Model for the 2023 DNR Pheasant Stocking Property Data
data data-model documentation excel gis hunting mapping powerquery vba
Last synced: 29 Jul 2025
https://github.com/jaldekoa/fiscaldataapi
A Python wrapper to easily retrieve data from the Fiscal Data (US Treasury) official API in pandas format.
api api-wrapper banking data finance pandas python united-states
Last synced: 27 Jan 2026
https://github.com/sksubhadeep/airbnb-dashboard-tableau
Airbnb Dashboard Using Tableau
airbnb data tableau tableau-public visualization
Last synced: 19 Mar 2026
https://github.com/bolajiolayinka/graph-api-automation
An End to End Automation from Facebook Business to Data Visualization of Campaigns
Last synced: 07 May 2025
https://github.com/dixslyf/nbparts
Unpack a Jupyter notebook into its sources, outputs and metadata.
data haskell jupyter jupyter-notebook nix nix-flake
Last synced: 05 Oct 2025
https://github.com/6km/islamic-data-repository
مستودع البيانات الإسلامية - قائمة بالموارد التي قد تفيد المبرمجين في تطوير التطبيقات ومواقع الويب.
data fonts hadeeth json quran quran-json
Last synced: 06 May 2026
https://github.com/ginga1402/travego_travellers
MySQL Mini Project
college-project data mysql-database
Last synced: 27 Jul 2025
https://github.com/stefanbohacek/fediverse-account-analyzer
bots botsinspace data dataviz fediverse mastodon
Last synced: 02 May 2026
https://github.com/khalyomede/fetch
Quickly retrieve your PHP data
config configuration data fetch php php7
Last synced: 15 Mar 2025
https://github.com/helins/ex.clj
Java exceptions as clojure data
clojure data exception java java-exceptions
Last synced: 12 Dec 2025
https://github.com/ddeutils/ddedocs
📖 Data Developer & Engineer Documents and Hands-On
blogs data data-engineering documents hands-on
Last synced: 08 Aug 2025
https://github.com/lmuffato/project-job-insights-trybe
Projeto job insights - Projeto avaliativo da Trybe do Bloco 32: Introdução à Python
data data-science data-transformation filter python
Last synced: 12 Jun 2025
https://github.com/stdlib-js/array-one-to
Generate a linearly spaced numeric array whose elements increment by 1 starting from one.
array data float32array float64array int16array int32array javascript matrix ndarray node node-js nodejs stdlib structure typed typed-array types uint32array vector
Last synced: 26 Feb 2026
https://github.com/unicef/magicbox-download-shapefiles
Downloads shapefiles for each country from gadm.org and unzips them.
data data-science docker downloads-shapefiles emergency-response gadm geospatial geospatial-data humanitarian javascript magicbox nodejs shapefile unicef
Last synced: 02 May 2026
https://github.com/lemniscate-world/stratai
This project analyzes financial assets using a Hidden Markov Model (HMM) to identify different market regimes and patterns. The analysis includes calculating daily returns, rolling volatility, and volume changes, and visualizing the hidden states identified by the HMM.
ai assets data data-science data-visualization finance financial-analysis fintech hmm-model hmmlearn machine-learning trading
Last synced: 23 Oct 2025
https://github.com/dominhduy09/my-links
All of my links and websites I have been creating - For saving all of my website's links
data database link linked-list linktree list save storage website
Last synced: 25 Jun 2026
https://github.com/isaac-lal/english-arabic-dictionary
This is a dictionary website that implements a search feature which allows input for a word in either English or Arabic and returns the alternative translation.
data db javascript react web-development
Last synced: 09 Apr 2026
https://github.com/rodekruis/510-data-catalog
The Project is CKAN based Data Catalog Portal for 510
Last synced: 23 Jan 2026
https://github.com/double-o-z/powershell-json-lightweight-serializer-deserializer
Simple powershell functions to convert from and to json. Very lightweight, will be supported with every powershell version. No dependences.
convert converter data data-science deserialize json lightweight powershell serializer
Last synced: 04 May 2026
https://github.com/public-health-scotland/waiting_times_clinical_prioritisation
This repository contains the Reproducible Analytical Pipeline (RAP) to produce the quarterly statistics on clinical prioritisation, part of the Stage of Treatment (SoT) publication.
data healthcare nhs public-health scotland shiny shiny-app treatment waiting-time
Last synced: 26 Jul 2025
https://github.com/atymri/linqsimulator
LINQ Simulator is an interactive C# console application designed to let you experiment with LINQ queries in real time.
console csharp data data-analysis linq query sql
Last synced: 23 Oct 2025
https://github.com/ahmad-ali-rafique/handwritten-digit-recognition-mnist
This project demonstrates a complete pipeline for recognizing handwritten digits using the MNIST dataset. The project is implemented in Python using Jupyter Notebook, and it covers data loading, preprocessing, model training, and performance evaluation of a Fully Connected Neural Network (FCNN).
ai artificial-intelligence data data-analysis datascience deep-learning deep-neural-networks fcnn fully-connected-network machine-learning machine-learning-algorithms ml modeling
Last synced: 09 Jun 2026
https://github.com/simranjeet97/datastructures_algoritms_python
Data Structures and Algorithms using Python
algorithms arrays arrays-and-strings coding data data-science data-structures datastructures-python hashing interview-preparation interview-questions linked-list python stacks stacks-as-an-array
Last synced: 09 Apr 2026
https://github.com/kenmwaura1/nuvo-data-cleaning-functions
Collection of scripts and functions to clean and preprocess data using Nuvo SDK.
Last synced: 04 May 2026
https://github.com/purarue/git_doc_history
copy/track file history in git, with python bindings to traverse and extract history/files/lines at some date
Last synced: 17 May 2026
https://github.com/stdlib-js/array-zero-to
Generate a linearly spaced numeric array whose elements increment by 1 starting from zero.
array data float32array float64array int16array int32array javascript matrix ndarray node node-js nodejs stdlib structure typed typed-array types uint32array vector
Last synced: 08 Jan 2026
https://github.com/stonecharioteer/renfield
Synchronize and Search through Hard Drives
catalogue data search storage synchronization
Last synced: 09 Feb 2026
https://github.com/luminati-io/Twitter-X-dataset-samples
A sample dataset of over 1000 Twitter (X) posts, extracted using the Bright Data API, ideal for trend discovery, brand monitoring, and competitive insights.
api data dataset twitter twitter-api twitter-scraper web-scraping x
Last synced: 09 Apr 2025
https://github.com/dmitriiweb/tr-data-getter
Tool to get market data from bitstamp.ne
Last synced: 14 May 2026
https://github.com/metriccoders/metriccoders_datasets
This is the Metric Coders repository containing all the datasets for machine learning.
data datasets machine-learning natural-language-processing scikit-learn
Last synced: 08 Apr 2025
https://github.com/mbolam/DSWS_OpenRefine
Cleaning and Linking Data with OpenRefine
cleaning data metadata openrefine
Last synced: 07 Apr 2025
https://github.com/n4ze3m/timezone-json
JSON file with more than 1642 cities timezone in UTC format.
Last synced: 19 Jul 2025
https://github.com/williamwutq/mappedpages
A fixed-size page provider backed by memory mapping, intended for building higher-level allocators and storage systems
allocation allocator data data-storage database file memory-mapping mmap page rust rust-crate rust-library storage
Last synced: 25 Jun 2026
https://github.com/yord/klp-dsv
A delimiter-separated values plugin for klp (Kelpie), the small, fast, and magical command-line data processor.
csv data deserializer dsv json kelpie klp marshaller parser serializer ssv tsv
Last synced: 14 May 2026
https://github.com/devsujay19/knowledgebase
My knowledge base built with NextJS 14, Tailwind CSS 3 and Aceternity UI.
data knowledge-base nextjs nextjs-typescript nextjs14 react server-side-rendering tailwindcss vercel
Last synced: 10 Apr 2026
https://github.com/kgryte/talks-sfnode-may-2017
Talk for SFNode (May, 2017).
analysis data javascript machine-learning math nodejs numeric-computing presentation statistics talk
Last synced: 22 May 2026
https://github.com/carlotta94c/sql4datascientistsdemo
Demo material for Microsoft Reactor session "Getting Started with Databases: SQL and Data Visualizations"
analysis data r sqlite tidyverse visualisation
Last synced: 18 Apr 2026
https://github.com/sixarm/sixarm_ruby_fab
SixArm.com → Ruby → Fab gem to fabricate sample data for testing
data fabrication factory fake gem mock ruby
Last synced: 24 Jul 2025
https://github.com/jimut123/web-crawller
A web crawler which crawls through the whole internet
beautifulsoup collector data databases glance internet link links mining python3 scrapping-python web-crawler
Last synced: 16 Jan 2026
https://github.com/qeeqbox/data-security
Safeguarding your personal information (How your info is protected)
data data-security infosecsimplified qeeqbox security
Last synced: 19 Mar 2026
https://github.com/kingabzpro/makefile-actions
GitHub Actions and MakeFile tutorial and project for beginners.
actions analytics automation data data-science makefile
Last synced: 18 Apr 2026
https://github.com/ucd-cws/nitrates-cv
california centralvalley data frep groundwater model nitrates
Last synced: 16 Jan 2026
https://github.com/farzai/geonames-php
This package provides a simple way to download Geonames data and format it for friendly use.
countries country-codes data geography geonames
Last synced: 24 Oct 2025
https://github.com/mitevpi/vue-d3-bar-chart
Reusable, reactive, animated bar chart using D3 + Vue.js. Written in idiomatic Vue, rather than D3 syntax.
d3 data data-visualization frontend interactive svg vue web
Last synced: 18 May 2026
https://github.com/woctezuma/download-steam-screenshots-data
Data consisting of Steam screenshots.
Last synced: 19 Feb 2026
https://github.com/tsvikas/covid-19-israel-data
Unofficial Github with the data published by The Israel Ministry of Health, regarding The Coronavirus disease
coronavirus-disease covid-19 csv daily-reports data health israel
Last synced: 05 Jan 2026
https://github.com/ttitcombe/timekeep
Defensive timeseries analysis in python
data data-science sklearn time-series time-series-analysis timeseries
Last synced: 05 Jan 2026
https://github.com/humbertocg18/pucrs-alest-i-2.3-2023.24
Trabalhos, Projetos, Exercícios e aulas realizados em Java na cadeira de Algoritimos e estrutura de dados 1, matéria do segundo semestre.
beecrowd beecrowd-solution-in-js beecrowd-solutions-in-java data data-structures datastructures-algorithms hashmap hashtable java-8 leetcode leetcode-javascript leetcode-solutions leetcodepra pucrs sorting-algorithms
Last synced: 29 Mar 2025
https://github.com/stdlib-js/array-base-to-accessor-array
Convert an array-like object to a minimal array-like object supporting the accessor protocol.
accessor accessors array array-like convert data javascript node node-js nodejs object protocol stdlib structure types wrap wrapper
Last synced: 04 Jan 2026
https://github.com/connectomicslab/cmtklib-data
Datalad dataset that stores all data resources of the cmtklib module of Connectome Mapper 3 (https://github.com/connectomicslab/connectomemapper3).
brain data parcellation resources software
Last synced: 16 Jan 2026
https://github.com/osiota10/alx-low_level_programming
C Low Level Programming - Data Structures, Linux/Unix System Programming and Algorithms with ALX Software Engineering
algorithms assembly c data data-structures linux shell unix
Last synced: 25 Jun 2025
https://github.com/joocer/data_expectations
Are your data meeting your expectations?
data data-engineering data-quality data-science data-unit-tests observability pipelines quality validation
Last synced: 07 Oct 2025
https://github.com/thechibuzornwachukwu/bluesky-scraper
This is a work of art that enables you to scrape data off BlueSky.
analytics bluesky bluesky-api bluesky-client data datascraper-framework datascraping scraping social-media web webscraping
Last synced: 16 Nov 2025
https://github.com/ahmad-ali-rafique/comment-generation-tool
This repository hosts a Jupyter Notebook-based Comment Generation Tool exploring advanced NLP techniques for automated, contextually relevant comment generation from input data. Ideal for developers and researchers in NLP and automated text generation.
ai aitools artificial-intelligence content-based-recommendation data datascience jupyter-notebook machine-learning
Last synced: 07 Oct 2025
https://github.com/mews-labs/dataframe-memory
This tools aims to provide simple solution to save memory when using pandas' data frame.
data data-science memory-usage pandas-dataframe python3
Last synced: 22 May 2026
https://github.com/keanteng/nextjs-directory
🌐A Draft Website For Data Catalogue Using NextJs
catalogue climate-change css data directory html javascript nextjs website
Last synced: 09 May 2026
https://github.com/dataship/beam
Get collimate'd data into Frame, in Node or the Browser
column-store data data-science
Last synced: 27 Apr 2026
https://github.com/priyanshubiswas-tech/pwc-power-bi-task-1-2
Power BI dashboards analyzing Phonenow's call center performance and customer retention. Task 1 focuses on KPIs like satisfaction rating, call count, and agent efficiency. Task 2 analyzes retention trends and customer behavior to enhance loyalty. Built using Power BI, DAX, and Excel.
dashboard data data-analysis dax-measures excel powerbi powerbidashboard
Last synced: 23 Jan 2026
https://github.com/jayantur13/kountry
Node module variant of the Country API
api data jsdelivr kountry nodejs npm npm-module npm-package unpkg yarn
Last synced: 26 Jan 2026
https://github.com/rambodrahmani/covid19-behind-the-numbers
COVID-19: Behind the Numbers.
apriori-algorithm apriori-algorithm-python clustering clustering-algorithm clustering-analysis covid covid-19 covid19-data data data-mining data-science datamining fpgrowth machine-learning machine-learning-algorithms python python-machine-learning
Last synced: 20 Aug 2025
https://github.com/cdapio/website
CDAP IO website
analytics applications cdap cdapio data data-analytics data-integration hugo integration metadata oss rules-engine
Last synced: 18 Jun 2025