data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-07-02 00:07:45 UTC
- JSON Representation
https://github.com/aidenellis/connectmp
🍰 ConnectMP - An easy way to share data between Processes in Python.
aidenellis connectmp data data-sharing multiprocessing process sharing
Last synced: 27 Apr 2026
https://github.com/nightroman/farnet.fsharp.data
FSharp.Data package for FarNet.FSharpFar
Last synced: 27 Apr 2026
https://github.com/jtpio/data-playground
Experiments using public APIs and data
Last synced: 28 Apr 2026
https://github.com/rdjarbeng/rdjarbeng
Richard Djarbeng's github profile-computer engineer specializing in web development, machine learning, and IoT devices. New web posts have moved to website below
data jekyll machine-learning ruby website
Last synced: 28 Apr 2026
https://github.com/the-aerospace-corporation/pivt
PIVT is an analytics tool to help software development teams visualize the life cycle and behavior of their software factory.
analytics dashboards data devops jenkins pipeline python splunk visualization
Last synced: 29 Apr 2026
https://github.com/quarylabs/quary_basketball_analysis_duckdb
An example analysis
analytics data duckdb engineering quary
Last synced: 29 Apr 2026
https://github.com/sodascience/open_supply_hub
Processing supply chain data obtained from Open Supply Hub
data global-supply-chain open-supply-hub python
Last synced: 29 Apr 2026
https://github.com/azevedontc/datapulse
DataPulse
automation brazil cli data data-analysis matplotlib meteorology open-meteo pandas prevision pycharm python python3 reports venv weather
Last synced: 29 Apr 2026
https://github.com/iamjuniorb/data_structures_and_algorithms
I'm working on Data Structures and Algorithms I C949 class in school and decided to write up all of these searching algorithms, sorting algorithms, strutures, and so on to get a better understanding. These can be used with large datasets to test their space and time complexities.
data data-analysis data-science data-structures datastructures datastructures-algorithms datastructuresandalgorithm math mathematics programming python python-app python-library python3
Last synced: 08 Jun 2026
https://github.com/wu-rymd/pyobjectify
Bridging the gap across the different file formats and streamlining the process to accessing ingested data via Python objects
Last synced: 08 Jun 2026
https://github.com/freebirdscrew/datascience_crash_course
Data Science Crash Course that Explained about Each and Every Process in Data Science.
dash data datascience datascience-crash-course datascience-machinelearning datascientist datasets freebirdscrew matplotlib numpy numpy-library pandas plotly plotly-python python python3 simranjeet simranjeetsingh
Last synced: 29 Apr 2026
https://github.com/chompfoods/stub-asp-net-core
ASP.NET Core server stub for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.
api asp asp-net-core aspnetcore branded chomp data database food grocery ingredients nutrition raw recipe-api recipes server stub stub-server
Last synced: 30 Apr 2026
https://github.com/lucien-loua/libgn
Manipulate geographical and administrative data about Guinea.
Last synced: 08 Jun 2026
https://github.com/gdhhgnbnvbn/f1-2025-ai-predict
fully generated by claude 3.5 sonnet via Windsurf IDE. Not a single lines wrote.
agent-based-modeling claude csv data f1 gpt machine-learning model prediction predictive-modeling python rainforest streamlit vibe
Last synced: 01 May 2026
https://github.com/syed-bilal-haider-engineer/interview_questions
Interview Questions
data database interview-questions javascript oop operating-system reactjs structure technical
Last synced: 01 May 2026
https://github.com/danielgiljam/orbit-utils
A collection of utility packages for Orbit.js.
data inference orbit orbitjs schema synchronization type typescript validation zod
Last synced: 01 May 2026
https://github.com/henrylin03/china-gdp
Analysis and visualisation of China GDP data using Python.
data data-analysis data-visualisation dataset kaggle pandas
Last synced: 01 May 2026
https://github.com/arif-miad/titanic-analysis
artificial-intelligence data data-science deep-neural-networks
Last synced: 09 Jun 2026
https://github.com/unicef/magicbox-download-shapefiles
Downloads shapefiles for each country from gadm.org and unzips them.
data data-science docker downloads-shapefiles emergency-response gadm geospatial geospatial-data humanitarian javascript magicbox nodejs shapefile unicef
Last synced: 02 May 2026
https://github.com/dominhduy09/my-links
All of my links and websites I have been creating - For saving all of my website's links
data database link linked-list linktree list save storage website
Last synced: 25 Jun 2026
https://github.com/double-o-z/powershell-json-lightweight-serializer-deserializer
Simple powershell functions to convert from and to json. Very lightweight, will be supported with every powershell version. No dependences.
convert converter data data-science deserialize json lightweight powershell serializer
Last synced: 04 May 2026
https://github.com/ishaansathaye/data40x-1_2_3
Fall 2025 Cal Poly Data 401 Data Science Process and Ethics, 402 Mathematical Foundations of Data Science, 403 Projects Lab
capstone-prep data data-science ethics lab python
Last synced: 04 May 2026
https://github.com/perceptronv/miscellaneous
A huge variety of materials, mostly training data for AI. Not a lot of source code yet.
data gan machine-learning nlp text-generation
Last synced: 04 May 2026
https://github.com/dkosarevsky/db_cp
DB course project
data database db postgres postgresql postgresql-database postgressql
Last synced: 05 May 2026
https://github.com/bredalis/numpy
✨ Library to work with arrays ✨
arrays data matrix numpy numpy-arrays numpy-library python
Last synced: 06 May 2026
https://github.com/montanaz0r/imdb-ratings-auto-inserter
A Python script that enables auto-inserting movie ratings into the IMDB profile.
data data-science dataanalysis imdb movies pandas pandas-dataframe python3 selenium selenium-webdriver webscraping
Last synced: 07 May 2026
https://github.com/freebirdscrew/covid-19-data-analysis
Coronavirus Data-Analysis with Live Data Streaming from the Website and Made a DASH Web-App at Last.
coronavirus coronavirus-real-time coronavirus-tracking countryinfo covid-19 covid-19-india covid19 covid19-data dash dash-button dashboard-application data data-analysis data-cleaning data-science data-visualization github jupyter pycountry python
Last synced: 07 May 2026
https://github.com/sivas-2/coffee-sales-visualization
This repository contains data visualization scripts and notebooks analyzing coffee sales data from a vending machine, sourced from Kaggle. The visualizations explore sales trends, customer preferences, and product popularity over time.
data data-analysis data-science data-visualization python visualization
Last synced: 07 May 2026
https://github.com/chompfoods/stub-python-flask
Flask (Python) server stub for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.
api branded chomp data database flask flask-server food grocery ingredients nutrition python raw recipe-api recipes server stub stub-server
Last synced: 07 May 2026
https://github.com/satur-io/estoraje
Estoraje is the simplest distributed system for key-value storage in less than 800 lines of code. It is temporary consistent, high available, lightweight, scalable and gives a good performance.
data database distributed go golang key-value performance training
Last synced: 07 May 2026
https://github.com/augustoarraes/corais
App Python de Monitoramento de vida marinha de Recife de Corais 🪸
coral data iot matplotlib pandas python streamlit
Last synced: 07 May 2026
https://github.com/geo-y20/loan-approval-automation-using-mongodb-and-pymongo
This project demonstrates the implementation of a loan approval system that utilizes MongoDB for distributed data storage and management, and PyMongo for database operations. The project aims to automate the assessment of loan eligibility using customer details from online applications.
crud-application data data-analysis data-science data-visualization deployment jupyter-notebook loan-default-prediction loan-prediction-analysis machine-learning machine-learning-algorithms matplotlib mongodb pymongo streamlit web
Last synced: 08 May 2026
https://github.com/miroslav-reiter/kurz_jazyk_sql_analytici_datovi_vedci
Materiály ku kurzu Jazyk SQL 1 pre Analytikov a Dátových Vedcov
analysis analytics data data-analysis data-science database mysql reiter sql
Last synced: 08 May 2026
https://github.com/ezfe/activityringsexporter
apple-watch applewatch data healthkit ios
Last synced: 08 May 2026
https://github.com/n0nag0n/flee-intercom
For those of you who like to keep your money after Intercom jacks up the prices year after year, but want to keep an export of your data.
again-and-again api data database export exporter flee high-prices intercom mysql php price run save saver year-over-year
Last synced: 09 May 2026
https://github.com/bastianolea/comisarias_chile
Base de datos con las comisarías, retenes, tenencias y otras instalaciones de Carabineros
Last synced: 23 Jun 2025
https://github.com/devathul-88/random-fakedata.js
A package to generate random data
data data-generator fake fake-data fake-data-generator javascipt javascript nodejs npm-package package
Last synced: 09 May 2026
https://github.com/lmuffato/project-mysql-vocabulary-booster-trybe
Projeto mysql vocabulary booster - Projeto avaliativo da Trybe do Bloco 20: Funções SQL, Joins e Subqueries
back-end crud data database mysql mysqlworkbench query sql trybe-projects
Last synced: 10 May 2026
https://github.com/dimitryzub/walmart-stores-coffee-analysis
Walmart Coffee Exploratory Data Analysis. Data Extracted with SerpApi 🧡
analysis analytics data data-visualization matplotlib pandas python pythonanalysis seaborn
Last synced: 10 May 2026
https://github.com/kouisamine/data-uri-to-image
Convert Data URI into Image(png, jpeg, webp, gif, svg, ...) files.
conversion convert converter data datauri datauri-to-image image js online php script source-code tools uri
Last synced: 10 May 2026
https://github.com/scarblase/russian-military-losses-analysis
This repository provides an in-depth analysis of Russian equipment losses using PySpark and data visualization techniques.
data data-science data-visualization jyputer-notebook matplotlib pyspark python3 seaborn seaborn-plots ukraine ukraine-invasion
Last synced: 12 May 2026
https://github.com/dmitriiweb/tr-data-getter
Tool to get market data from bitstamp.ne
Last synced: 14 May 2026
https://github.com/iotchulindrarai/reactlearning
learning react like data passing using usestate and props using fom both child to parent and parent to child
data passing props react usestate-hook
Last synced: 14 May 2026
https://github.com/svetlanam/twitter-ads
Get data about campaigns from Twitter Ads API
api data keboola keboola-extractor twitter twitter-ads twitter-api
Last synced: 12 Jun 2026
https://github.com/erwan-simon/aws-serverless-notebook-platform
A self-hosted, serverless platform offering an intuitive UI to manage, schedule, and execute Jupyter notebooks on AWS.
aws data docker notebook python serverless terraform webapp
Last synced: 13 Jun 2026
https://github.com/fairspec/fairspec-standard
Fairspec is a data exchange format compatible with DataCite for metadata and JSON Schema for structured data
ckan csv data dataset excel fair fairspec json ods polars python quality schema sqlite table typescript validation zenodo
Last synced: 16 Jun 2026
https://github.com/cdcgov/importsurvey
Import survey: Import data into R, with an application to the National Center for Health Statistics (NCHS)
data import r sas survey survey-data
Last synced: 19 Jun 2026
https://github.com/divanny/tiendabackend
Tienda
backend core csharp csharp-code csharp-core data integration webapi
Last synced: 20 Jun 2026
https://github.com/CentralFloridaAttorney/ComfyUI-ZMongo
An Easy-to-Use database framework and parameter library for ComfyUI. Centralize node presets, capture workflow logic, manage structured image collections, and build document-driven text automation pipelines on an offline Local File Store or BusinessProcessApplications.com .
api comfy comfy-ui comfyui comfyui-custom-node comfyui-custom-nodes comfyui-manager comfyui-node comfyui-nodes comfyui-workflow data database
Last synced: 21 Jun 2026
https://github.com/datadotworld/dw-jupyter-contents
Jupyter ContentsManager implementation for data.world
data data-analysis data-science dwstruct-t50-public-projects jupyter jupyter-notebook jupyterlab reference-implementation
Last synced: 22 Jun 2026
https://github.com/michalwols/awesome-data-curation
🗑️ ✨ 📊 Awesome things related to data collection, annotation, cleaning and management.
active-learning annotation cleaning-data data data-science deep-learning machine-learning
Last synced: 24 Jun 2026
https://github.com/williamwutq/bllist
Durable, crash-safe, checksummed block-based linked list allocators stored in a single file
data data-storage data-structure database file-based linkedlist
Last synced: 25 Jun 2026
https://github.com/williamwutq/bblock
Persistent checksummed blocks built on top of bstack's allocators
allocation binary block data data-structures database rust rust-crate rust-library serialization
Last synced: 25 Jun 2026
https://github.com/anuveyatsu/cloudflare-data-fabric
Cloudflare Data Fabric: Use Cloudflare's global infrastructure to build a flexible, resilient framework for data solutions.
cloudflare data data-lake fabric lakehouse mesh
Last synced: 29 Jun 2026
https://github.com/connectaman/deepseek-ocr-multigpu-infer
Efficient multi-GPU OCR inference framework leveraging parallel processes for accelerated token throughput and faster batch processing. Designed for scalable, high-performance optical character recognition workloads using PyTorch. Supports dynamic GPU assignment, optimized resource utilization, and easy integration for large-scale image datasets.
agentic-extraction data deepseek document-parser extraction extractor gpu image-parser llm multigpu nvidia ocr parallel-computing parser pdf-parser vlm
Last synced: 22 Jan 2026
https://github.com/Lemniscate-world/StratAI
This project analyzes financial assets using a Hidden Markov Model (HMM) to identify different market regimes and patterns. The analysis includes calculating daily returns, rolling volatility, and volume changes, and visualizing the hidden states identified by the HMM.
ai assets data data-science data-visualization finance financial-analysis fintech hmm-model hmmlearn machine-learning trading
Last synced: 13 Oct 2025
https://github.com/docusign/extension-app-data-io-reference-implementation
Extension App for Data IO Reference Implementation for the Docusign IAM Platform
Last synced: 02 Mar 2026
https://github.com/6km/islamic-data-repository
مستودع البيانات الإسلامية - قائمة بالموارد التي قد تفيد المبرمجين في تطوير التطبيقات ومواقع الويب.
data fonts hadeeth json quran quran-json
Last synced: 06 May 2026
https://github.com/danish-foundation-models/dfm-processing
Toolkit for processing data in the danish foundation models project.
Last synced: 02 Jul 2025
https://github.com/genert/metis
Asynchronous data sender library
analytics asynchronous data dependency-free typescript
Last synced: 27 Jan 2026
https://github.com/anthonybench/datapeek
Peek summary of datafile in a succinct, opinionated manner.
Last synced: 02 Mar 2026
https://github.com/simonbernarding/ml_project_simonbernarding
This project focuses on predicting flight delays using historical data from a Tunisian airline. We analyzed patterns in airport operations and flight schedules to build a machine learning model that can forecast potential delays.
data data-science flight-delay-prediction machine-learning machinelearning prediction
Last synced: 12 Oct 2025
https://github.com/cont-limno/lagosus-reservoir
Data module classifying lakes as natural lakes or reservoirs in the conterminous U.S.
Last synced: 17 Jan 2026
https://github.com/dbriane208/omdena-apprenticeship-project
This is part of my contribution to the Omdena apprenticeship program .
data data-science feature-engineering machine-learning
Last synced: 14 Mar 2026
https://github.com/hasnocool/war_thunder_data_scraper
A web scraping tool designed to extract valuable data from War Thunder, a popular online game.
data database framework integration multi processing python scraper scraping scrapy sql threaded thunder war
Last synced: 06 May 2026
https://github.com/soulyma/web_crawler
A focused web crawler to extract and structure Arabic content from web pages. Designed for researchers, data analysts, and developers working on Arabic language datasets.
beautifulsoup4 crawler csv data json python structured-data
Last synced: 15 May 2026
https://github.com/jrmedd/emojinal
An experimental API for determining emoji sentiment, based on research from Institut "Jožef Stefan", Slovenia.
data emojis sentiment user-research ux
Last synced: 19 Jan 2026
https://github.com/malvfr/zap
Fill your database with fake data.
cli csv data database generator hacktoberfest mock node populate populate-database seed sql
Last synced: 21 Jan 2026
https://github.com/erictleung/2017-new-coder-survey
:beginner: Code to help clean and format the 2017 New Coder Survey by freeCodeCamp
coder-survey data data-cleaning dplyr freecodecamp
Last synced: 03 Apr 2025
https://github.com/eradical/analytics-unibody
Ansible role that sets up a farm of analytics collectors based on nginx
analytics ansible ansible-role big-data collectors data nginx
Last synced: 06 May 2026
https://github.com/tupizz/data-processing-pipeline-aws
This project is a serverless application built with the Serverless Framework, TypeScript, and AWS services. It provides an enrichment service that processes contact information and enriches it with additional data.
aws data pipeline serverless typescript
Last synced: 13 May 2026
https://github.com/nfaltir/dataxplorer
🔬 A Streamlit app that performs various data exploration operations on an uploaded dataset instantly.
data data-science python streamlit
Last synced: 05 May 2026
https://github.com/agustinmusanti/sqlchallenge-2
This repository contains my solutions to a SQL challenge using MySQL, centered around a fictional retail company called TechMarket. The challenge covers various SQL tasks such as data retrieval, manipulation, and analysis, simulating real-world scenarios within a retail business environment.
Last synced: 03 Apr 2025
https://github.com/famarks/grafarg
Grafarg is an interactive data analytics and graphical data visualization application. Grafarg being a progressive fork of Grafana 7.5.17 continues to be available under open source Apache 2.0 License
analytics charts data data-analysis data-science data-visualization grafana grafarg graph
Last synced: 19 Jan 2026
https://github.com/dev-owdenmag/dataflow-manager
A dynamic and versatile web application for managing, collecting, and presenting data with an integrated printing feature.
data data-management data-management-platform data-visualization python
Last synced: 30 Mar 2025
https://github.com/definetlynotai/vulnscan_data
Logicytics VulnScan Module's Training Data and old model archive
ai data logicytics ml models pytorch sensitive-files text-processing tfidf-text-analysis training-data
Last synced: 11 Oct 2025
https://github.com/metriccoders/metriccoders_datasets
This is the Metric Coders repository containing all the datasets for machine learning.
data datasets machine-learning natural-language-processing scikit-learn
Last synced: 08 Apr 2025
https://github.com/dwidevelopes/database-input-pelanggran-mahasiswa
Menginput data Mahasiswa Yang Melakukan Pelanggran yang siap di data dan di hukum Dan juga siap Terkena Sanksi
aplikasi aplikasi-sekolah data data-analysis database input-method mahasiswa sekolah siswa siswi website
Last synced: 02 May 2026
https://github.com/gher-uliege/bluecloud-plankton
Spatial interpolation of plankton data using a neural network
data data-analysis data-visualization neural-network oceanography
Last synced: 30 Mar 2025
https://github.com/alexandregazagnes/rica-analysis
This repository contains the code to download, analyse, and modelize the RICA dataset from the french ministry of agriculture.
analysis argiculture business data data-analysis data-analytics food python
Last synced: 29 Apr 2026
https://github.com/east-empire-trading-company/eetc-data-client
Client library for retrieving data managed by EETC Data Hub.
client-library data data-science finance library python
Last synced: 31 May 2026