data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-07-03 00:07:49 UTC
- JSON Representation
https://github.com/ahmad-ali-rafique/handwritten-digit-recognition-mnist
This project demonstrates a complete pipeline for recognizing handwritten digits using the MNIST dataset. The project is implemented in Python using Jupyter Notebook, and it covers data loading, preprocessing, model training, and performance evaluation of a Fully Connected Neural Network (FCNN).
ai artificial-intelligence data data-analysis datascience deep-learning deep-neural-networks fcnn fully-connected-network machine-learning machine-learning-algorithms ml modeling
Last synced: 09 Jun 2026
https://github.com/tonykipkemboi/ens_subgraph_data
Query On-Chain Data from Subgraphs by The Graph Protocol using Python
data subgraphs thegraphprotocol web3
Last synced: 17 Sep 2025
https://github.com/stone-zeng/china-infectious-diseases
全国法定传染病疫情概况
analytics covid-19 data healthcare infectious-diseases
Last synced: 31 Dec 2025
https://github.com/kenmwaura1/nuvo-data-cleaning-functions
Collection of scripts and functions to clean and preprocess data using Nuvo SDK.
Last synced: 04 May 2026
https://github.com/grkndev/twitcher
A great library that will allow you to use the Twitch API service. All you need to do is use your Token and Client Id information.
api clip clipr data javascript nodejs npm npm-package npmjs streamers streaming twitch twitch-api twitch-bot twitchtv twtich-clip user
Last synced: 09 Mar 2026
https://github.com/vincentlaucsb/csv-data
A curated repository of real and fake CSV data for use in testing suites
Last synced: 08 Mar 2026
https://github.com/wioniqle-q/tower-modelling
Data science
data data-science ndarray-odeint ndjson science
Last synced: 16 Mar 2025
https://github.com/dkosarevsky/db_cp
DB course project
data database db postgres postgresql postgresql-database postgressql
Last synced: 05 May 2026
https://github.com/woctezuma/download-steam-screenshots-data
Data consisting of Steam screenshots.
Last synced: 19 Feb 2026
https://github.com/alja7dali/swift-bits
A bite sized library for dealing with bytes.
binary bit bits byte bytes comprehension data manipulation swift
Last synced: 09 Jun 2026
https://github.com/tushar2704/insurance-cross-sell
This project harnesses the power of cutting-edge technologies including H2O AutoML, MLflow, FastAPI, and Streamlit to enhance cross-selling campaigns and boost efficiency.
data datascience h20automl machine-learning mlflow python streamlit-tushar2704
Last synced: 08 Oct 2025
https://github.com/nafisalawalidris/buybuy-e-commerce-company
The BuyBuy E-commerce Company repository is a comprehensive hub for the company's e-commerce platform. It includes source code, documentation, and data analysis insights, providing a data-driven approach to improve customer experience, drive revenue, and inform decision-making.
buybuy cleaning-data company customer-experience data data-analysis decision-making documentation e-commerce excel insights postgresql repository revenue source-code sql
Last synced: 16 Mar 2025
https://github.com/6km/islamic-data-repository
مستودع البيانات الإسلامية - قائمة بالموارد التي قد تفيد المبرمجين في تطوير التطبيقات ومواقع الويب.
data fonts hadeeth json quran quran-json
Last synced: 06 May 2026
https://github.com/sourceduty/data_hardware
🖥️ Comparing various hardware configurations needed for different data sizes, from personal laptops to mainframes.
calculation computer-hardware computer-science computers data data-calculation data-hardware data-processing data-project hardware hardware-configuration hardware-requirements hardware-science math process-programming programming python
Last synced: 08 Aug 2025
https://github.com/labwhatever/leetcode
Collection of LeetCode questions to ace the coding interview!
data data-structures-and-algorithms dsa leetcode-cpp leetcode-solutions structure structure-learning
Last synced: 22 Aug 2025
https://github.com/sivas-2/coffee-sales-visualization
This repository contains data visualization scripts and notebooks analyzing coffee sales data from a vending machine, sourced from Kaggle. The visualizations explore sales trends, customer preferences, and product popularity over time.
data data-analysis data-science data-visualization python visualization
Last synced: 07 May 2026
https://github.com/gappeah/global-shipping-analytics-dashboard
This Tableau project provides a comprehensive visual analysis of global sales, shipping costs, and quality metrics across different regions and countries.
data data-analysis data-analyst data-visualization metrics tableau
Last synced: 25 Feb 2025
https://github.com/ilejuxepwaduzd/structured-data-extractor
🛠️ Extract structured data from messy texts using Chain-of-Thought prompting to improve processing of customer support and technical issues.
cdp chrome-fetcher data document-extraction ecommerce golang-library headless metadata-extraction ocr open-source pdf pdf-converter pdf-extractor ruby scraper shopify spider structured-data
Last synced: 10 Apr 2026
https://github.com/stdlib-js/array-base-fancy-slice-assign
Assign element values from a broadcasted input array to corresponding elements in an output array.
array assign assignment copy data fancy generic javascript node node-js nodejs shallow slice stdlib structure subseq subsequence types
Last synced: 06 Oct 2025
https://github.com/antononcube/raku-data-cryptocurrencies
Raku package of cryptocurrency data retrieval.
Last synced: 02 Apr 2025
https://github.com/themuhd/world-cup-analysis
Analysis of The FIFA World cup from its inception to the recently completed tournament in 2023
data data-science data-visualization dataanalysis matplotlib matplotlib-pyplot notebook python
Last synced: 08 May 2026
https://github.com/wangshouh/cryptofinancedata
An ipynb file containing data acquisition of futures, options and other financial derivatives
Last synced: 05 Oct 2025
https://github.com/flowsynx/plugin-json
FlowSynx plugin to loads and parses local JSON files. Supports transformation, extraction, and mapping of hierarchical data structures in workflows.
data data-platform flowsynx json
Last synced: 10 Mar 2026
https://github.com/n0nag0n/flee-intercom
For those of you who like to keep your money after Intercom jacks up the prices year after year, but want to keep an export of your data.
again-and-again api data database export exporter flee high-prices intercom mysql php price run save saver year-over-year
Last synced: 09 May 2026
https://github.com/nitsc/spell-from-threebodytrilogy
Implemented the process of extrapolating from Gaia stellar data, to 3D visualizations, to three-views, to three-view signals, to three-view audio of signals, and even their inversions. This project proves the feasibility of the Logic (Luoji)'s “spell” from “The Three Body Problem” trilogy.
3d 3d-graphics astronomy astronomy-astrophysics audio audio-processing data data-science data-visualization gaia graph information-technology information-visualization numpy python python-3 python3 signal signal-processing visiualization
Last synced: 02 May 2026
https://github.com/cosmos-loops/cosmos-dapper
Cosmos.Dapper is a part of Cosmos.Data, a inline project of COSMOS LOOPS PROGRAMME. This repository provides a package of StackExchange.Dapper to improve development efficiency.
dapper data mysql mysqlconnector oracle postgresql sql-query sqlite sqlkata sqlserver
Last synced: 11 Apr 2026
https://github.com/public-health-scotland/waiting_times_clinical_prioritisation
This repository contains the Reproducible Analytical Pipeline (RAP) to produce the quarterly statistics on clinical prioritisation, part of the Stage of Treatment (SoT) publication.
data healthcare nhs public-health scotland shiny shiny-app treatment waiting-time
Last synced: 26 Jul 2025
https://github.com/v6ntage/sql-sales_data-analytics-project
This repository contains a SQL scripts demonstration analytical techniques.
analytics business-analytics data data-analysis database query sql sql-server
Last synced: 12 Apr 2026
https://github.com/petermartens98/nba-analytics-streamlit-app-with-langchain-agent
Interactive NBA Analytics app with Streamlit and a LangChain conversational agent connected to extracted data. Explore player, team, and game stats, track injuries, run simulations, visualize trends, and get AI-powered insights. Ongoing development, open to collaboration.
agentic-ai analysis data deepseek langchain nba python streamlit visualization
Last synced: 08 May 2026
https://github.com/connectomicslab/cmtklib-data
Datalad dataset that stores all data resources of the cmtklib module of Connectome Mapper 3 (https://github.com/connectomicslab/connectomemapper3).
brain data parcellation resources software
Last synced: 16 Jan 2026
https://github.com/rodekruis/510-data-catalog
The Project is CKAN based Data Catalog Portal for 510
Last synced: 23 Jan 2026
https://github.com/atymri/linqsimulator
LINQ Simulator is an interactive C# console application designed to let you experiment with LINQ queries in real time.
console csharp data data-analysis linq query sql
Last synced: 23 Oct 2025
https://github.com/basemax/buskool.com-data
This repository contains the collected product data from the Buskool website (باسکول). The data is stored in 20k+ JSON files, each containing detailed information about products available on the website.
buskool buskoolcom data farsi information ir iran json persian
Last synced: 03 Apr 2025
https://github.com/jaldekoa/fiscaldataapi
A Python wrapper to easily retrieve data from the Fiscal Data (US Treasury) official API in pandas format.
api api-wrapper banking data finance pandas python united-states
Last synced: 27 Jan 2026
https://github.com/devathul-88/random-fakedata.js
A package to generate random data
data data-generator fake fake-data fake-data-generator javascipt javascript nodejs npm-package package
Last synced: 09 May 2026
https://github.com/prpriesler/covid19-insights-and-analytics
This project delves into the realm of data analytics and programming, focusing on four pivotal datasets related to the COVID-19 pandemic: confirmed global, death global, vaccination & population data, and Twitter data.
covid19 covid19-data data data-science dataanalytics deep-neural-networks machine-learning natural-language-processing
Last synced: 31 Aug 2025
https://github.com/gmersy/data-carbon
Repository accompanying the paper: Toward a Life Cycle Assessment for the Carbon Footprint of Data
carbon-emissions carbon-footprint climate-change data data-science sustainability sustainable-software
Last synced: 31 Mar 2025
https://github.com/farzai/geonames-php
This package provides a simple way to download Geonames data and format it for friendly use.
countries country-codes data geography geonames
Last synced: 24 Oct 2025
https://github.com/lmuffato/project-ting-trybe
Projeto ting - Projeto avaliativo da Trybe do Bloco 37: Estrutura de Dados II: Listas, Filas e Pilhas
data data-analysis python queue read-file stack trybe trybe-projects
Last synced: 12 Jun 2025
https://github.com/dylanhogg/cloud-products
A package for getting cloud products and product descriptions from a cloud provider website.
aws cloud-products crawler data text-processing
Last synced: 05 Oct 2025
https://github.com/nik-kusanagi/bash.sh-treinamento
Versão mais organizada (+ ou -)
data database debian gnome gnome-extension gnu gnu-linux linux shell shell-script
Last synced: 05 May 2026
https://github.com/priyanshubiswas-tech/pwc-power-bi-task-1-2
Power BI dashboards analyzing Phonenow's call center performance and customer retention. Task 1 focuses on KPIs like satisfaction rating, call count, and agent efficiency. Task 2 analyzes retention trends and customer behavior to enhance loyalty. Built using Power BI, DAX, and Excel.
dashboard data data-analysis dax-measures excel powerbi powerbidashboard
Last synced: 23 Jan 2026
https://github.com/mattqdev/koalaz
Why don't use koalas as data mock? With this npm package you can!
data koala lorem-ipsum meme mock placeholder
Last synced: 13 Jan 2026
https://github.com/ayushverma135/sas-health-metrics-analysis-bmi-categorization-and-gender-insights
Using SAS, this project processes Excel data on individual statistics and health metrics. It calculates BMI, categorizes health status, and visualizes distributions through pie charts.
analytics data excel sas sasprogramming statistical-analysis
Last synced: 24 Feb 2026
https://github.com/capire/xtravels-java
Travel booking app using master data from xflights built with CAP Java
cap cds data federation flights java reuse
Last synced: 23 Jan 2026
https://github.com/cqllum/schema2dwh
⚡ Automatically produce a data model on your database using its information schema using GenAI.
ai data data-structures dataengineering datawarehousing dwh gemini gemini-api genai reporting reporting-tool schema-design
Last synced: 13 Mar 2025
https://github.com/rnabla/cuda-des
Bruteforcing DES using CUDA
bruteforce cuda data des encryption gpu parallel standard
Last synced: 27 Oct 2025
https://github.com/spine-tools/metreload
Python application for downloading meteorological reanalysis data
Last synced: 01 Jul 2025
https://github.com/emnetdegafe/allesoverfilm-backend
AllesOverFilm-backend is part of the AllesOverFilm mobile app development project and contains the database structure, server query scripts, and Sequelize-cli database structures.
backend data data-model express postgresql sequelize-cli
Last synced: 11 Apr 2026
https://github.com/data-forge-notebook/javascript-cheat-sheet
Cheat sheet that accompanies my book Data Wrangling with JavaScript
cheatsheet data data-wrangling javascript nodejs
Last synced: 15 Apr 2026
https://github.com/castdrian/kdapi
A TypeScript library that scrapes K-pop idol and group information from online sources to create comprehensive JSON datasets.
api data kpop scraper typescript
Last synced: 15 May 2025
https://github.com/astrid-project/cb-manager
APIs to interact with the Context Broker's database. Through a REST Interface, it exposes data and events stored in the internal storage system in a structured way. It provides uniform access to the capabilities of monitoring agents.
agent beats control data ebpf elasticsearch log logstash management programmability security
Last synced: 30 Jun 2025
https://github.com/danish-foundation-models/dfm-processing
Toolkit for processing data in the danish foundation models project.
Last synced: 02 Jul 2025
https://github.com/garcane/income-prediction-ml
This is a machine learning project aimed at predicting whether an individual's annual income exceeds $50,000 based on their demographic and personal information.
data data-science machine-learning ml numpy pandas python random-forest scikit-learn
Last synced: 08 Apr 2026
https://github.com/mohsinali08000/myportfolio
I’m Mohsin Ali, a passionate software engineer with over 2 years of experience in developing robust software solutions. Currently transitioning into the field of data science.
Last synced: 22 Apr 2026
https://github.com/svelterun/store
Persisted version of svelte/store.
data state state-management store svelte svelte-store sveltekit svelterun typescript
Last synced: 08 Jan 2026
https://github.com/idea2app/public-meta-data
HTTP API for Public Meta Data, written in TypeScript & designed for CDN.
api cdn data http meta public typescript
Last synced: 15 Mar 2025
https://github.com/zediculz/block
Block is a data structure/collection that uses Blockchain principle in managing data.
Last synced: 05 Oct 2025
https://github.com/cleanzr/restaurant
Restaurant data set for entity resolution
Last synced: 11 Mar 2026
https://github.com/alejo1630/titanic_kaggle
This Python Notebook is a proposal to analyse the Titanic dataset for the Kaggle Competition, using several data science techniques and concepts.
data data-science jupyter-notebook notebook python titanic-survival-prediction
Last synced: 03 May 2026
https://github.com/nxank4/loclean
⚡️ The All-in-One Local AI Data Cleaning Library. No GPU or API keys required.
automated-cleaning data data-cleaning data-engineering data-preprocessing data-science data-wrangling etl llm normalization open-source polars privacy-preserving python semantic-analysis slm structured-data
Last synced: 22 Jan 2026
https://github.com/apfirebolt/data-structures-and-algorithms-in-python
Data Structure and Algorithms in Python
algorithms data data-structures python python3 tkinter-gui
Last synced: 15 Mar 2025
https://github.com/sanskaryo/ultimate-dsa-repo
One Stop Solution for DSA Learning and Resources
data data-structures-and-algorithms dsa hacktoberfest hacktoberfest-accepted hacktoberfest2025
Last synced: 15 Oct 2025
https://github.com/suryavamsi-p/conflict-nlp-topic-modeling-sentiment-analysis-using-llms
Extracts insights from 26K+ protest events using BERTopic, Top2Vec, and LLMs for real-world applications like crisis monitoring, policy research, and social unrest analysis.
all-mpnet-base-v2 bertopic conflict-data data data-science lda llama2 llms machine-learning mistral-7b nlp nltk protest-analysis pyldavis python3 top2vec topic-modeling transformers visualization
Last synced: 11 May 2026
https://github.com/kirkalyn13/portfolio-dashboard-site
Portfolio Site; Initially a Service Provider Metrics Dashboard using React.
dashboard data data-visualization react
Last synced: 15 Apr 2026
https://github.com/stdlib-js/array-one-to-like
Generate a linearly spaced numeric array whose elements increment by 1 starting from one and having the same length and data type as a provided input array.
array data float32array float64array int16array int32array javascript matrix ndarray node node-js nodejs stdlib structure typed typed-array types uint32array vector
Last synced: 20 Feb 2026
https://github.com/codeforafrica/ckanext-followy
[ARCHIVED] A CKAN extension to show the datasets a user is following.
ckan ckan-extension ckanext-followy data dataset followy-extension open-data
Last synced: 29 Jun 2026
https://github.com/inphyt/quantitative_single_neuron_modeling_competition_2009
Data for the Quantitative Single-Neuron Modeling Competition (2009).
bayesian-inference bayesian-methods bayesian-optimization bayesian-statistics challenge competition computational-neuroscience data electrophysiological-data electrophysiology-data model-calibration modeling neuronal-models neuroscience neuroscience-competition parameter-estimation simulation simulation-modeling single-neuron-model uncertainty-quantification
Last synced: 25 Feb 2026
https://github.com/ishaansathaye/cpe202-datastructalgos
CPE 202 Data Structures and Algorithms Winter 2022 Freshman at Cal Poly
algorithm binary binary-search-tree data graph hash heap python queue stack structures
Last synced: 12 May 2026
https://github.com/ultrasage-danz/scikit-learn-ml
Machine Learning with scikit-learn by Data School
ai data data-school machine-learning macos ml scikit-learn ultrasage-dan
Last synced: 13 May 2026
https://github.com/stdlib-js/array-one-to
Generate a linearly spaced numeric array whose elements increment by 1 starting from one.
array data float32array float64array int16array int32array javascript matrix ndarray node node-js nodejs stdlib structure typed typed-array types uint32array vector
Last synced: 26 Feb 2026
https://github.com/jcasbin/jcasbin-menu-permission
Casbin Menu Permission Example (Based on jCasbin)
abac acl auth authorization authz casbin data go java jcasbin menu permission rbac spring springboot
Last synced: 11 Jul 2025
https://github.com/matusf/glasgow_wifi
Script that plots wifi access points to map and labels them by their protection
data data-visualization folium python python3
Last synced: 24 Jun 2026
https://github.com/humbertocg18/pucrs-alest-i-2.3-2023.24
Trabalhos, Projetos, Exercícios e aulas realizados em Java na cadeira de Algoritimos e estrutura de dados 1, matéria do segundo semestre.
beecrowd beecrowd-solution-in-js beecrowd-solutions-in-java data data-structures datastructures-algorithms hashmap hashtable java-8 leetcode leetcode-javascript leetcode-solutions leetcodepra pucrs sorting-algorithms
Last synced: 29 Mar 2025
https://github.com/pharo-ai/data-preprocessing
Project including data pre-processing algo. We aim to include scaling, centering, normalization, binarization methods.
data pharo pharo-smalltalk preprocessing smalltalk
Last synced: 09 Feb 2026
https://github.com/melinteflxrin/softserve-bigdata-project
End-to-end data warehousing project integrating APIs, ETL workflows, and PostgreSQL for analytics and reporting.
analytics api bigdata data datawarehousing externalapi pipeline postgres postgresql python warehouse
Last synced: 26 Jan 2026
https://github.com/oefenweb/python-untraceables
Randomizes IDs for a given set of tables making them untraceable across environments
anonymize data database mysql privacy python python2 python3 randomization
Last synced: 03 Feb 2026
https://github.com/prajwalsinha/unveiling-climate-change-dynamics-through-earth-surface-temperature-analysis
Climate change analysis through global surface temperature data. Includes data preprocessing, statistical analysis, visualizations, and forecasting. Python-based project using Pandas, Matplotlib, and Scikit-learn.
data dataanalysis dynamic-mapping pyplot python scikit-learn seaborn
Last synced: 10 Feb 2026
https://github.com/morphaxthedeveloper/yokatlas-dataset-2025
yök atlas detaylı üniversite, bölüm, puan vb. datası..
data database liste scrape universite veri yok-atlas yok-atlas-api yok-atlas-data yokatlas yokatlas-crawler yokatlas-data
Last synced: 14 Oct 2025
https://github.com/jhpoelen/rats
self-replicating data publication related to rat (Rattus sp.) specimen.
biodiversity data natural-history-collections provenance
Last synced: 18 Mar 2026
https://github.com/husna-poyraz/titanic-machine-learning
Use machine learning to create a model that predicts which passengers survived the Titanic shipwreck.
data data-analysis data-science data-visualization deep-learning machine-learning missing-data outlier-detection python titanic
Last synced: 10 May 2026
https://github.com/kingabzpro/makefile-actions
GitHub Actions and MakeFile tutorial and project for beginners.
actions analytics automation data data-science makefile
Last synced: 18 Apr 2026
https://github.com/xpotify/scraper
Scraper designed for Xpotify's client to gather information from websites🌟
axios cheerio data javascript scraper webscraper
Last synced: 07 Jul 2025
https://github.com/yeshunit/walmart-product-customer-sales-sql-analysis
This project aims to explore the Walmart Sales data to understand top performing branches and products, sales trend of of different products, customer behaviour. The aims is to study how sales strategies can be improved and optimized. The dataset was obtained from the Kaggle
data database mysql sql walmart
Last synced: 24 Feb 2026