data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-07-01 00:07:35 UTC
- JSON Representation
https://github.com/nafisalawalidris/dr.-semmelweis-and-the-discovery-of-handwashing
Uncover the revolutionary impact of handwashing on mortality rates in healthcare. Explore the story of Dr. Semmelweis and his groundbreaking findings.
data data-analysis handwashing healthcare-analysis medical-breakthrough mortality-rates
Last synced: 13 Jul 2025
https://github.com/novecento99/nuvolino
air cloud data ikea iot pm pm25 sensor vindstyrka
Last synced: 13 Jul 2025
https://github.com/yessasvini23/cisco-data-analytics-essentials_-virtual-_internship
From the CISCO Networking Academy
data dataanalysis database datascience excel relational-databases sql statistics structured-query tableau
Last synced: 17 Jul 2025
https://github.com/newrelic-experimental/newrelic-java-sap-bi
Instrumentation for SAP PI/PO Server
bi data instrumentation java newrelic nrlabs nrlabs-data nrlabs-odp observability-data sap sap-pi sap-po
Last synced: 03 Mar 2025
https://github.com/giscience/measures-rest-oshdb-docker
Scripts for starting measures for geospatial datasets in docker container, using the OSHDB
data dggs docker geospatial mesure openstreetmap rest
Last synced: 18 Apr 2026
https://github.com/m-muecke/isocountry
R package containing ISO codes for countries and currencies
country-codes currency-codes data iso-3166-1 iso-4217 r r-package
Last synced: 20 Mar 2025
https://github.com/saboye/web-scraping-with-python
A web scraping project using Python's "Requests" and "BeautifulSoup" libraries to extract structured data from one or more websites. This project involves sending HTTP requests to the target website(s), retrieving the HTML content of the website(s), and parsing this content to extract the desired data in a usable format.
beautifulsoup csv data data-harvesting data-mining python request web webscraping
Last synced: 18 Jul 2025
https://github.com/chompfoods/sdk-typescript-fetch
Fetch TypeScript SDK for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.
api branded chomp data database fetch food grocery ingredients nutrition raw recipe-api recipes sdk typescript
Last synced: 03 May 2026
https://github.com/vulcalien/vulcdataformat
Simple data storage system for Java.
data data-storage java serialization
Last synced: 25 Feb 2025
https://github.com/nichtich/wikidata-taxonomy-examples
Extract classifications from Wikidata
coli-conc data knowledge-organization wikidata
Last synced: 12 Jul 2025
https://github.com/elazar/pycopyql
Exports a subset of data from a relational database.
data database export relational tool utility
Last synced: 16 May 2026
https://github.com/sstendahl/giscan
Simple tool to read and analyze existing GISAXS data
cbf data diffraction diffraction-analysis gisans gisaxs physics reflectivity scattering xray
Last synced: 30 Jun 2026
https://github.com/flownrecords/flightTracker
A mobile app built to record essential flight data for post-flight review and debriefing.
Last synced: 23 Jun 2025
https://github.com/evoluteur/madeleinology
Playing with data science by taking a look at the proportions of flour, sugar, butter, and eggs in 147 Madeleine recipes (the traditional French sponge cake).
baking cake cooking cooking-recipes data data-science data-visualization dessert exploratory-analysis exploratory-data-analysis exploratory-data-visualizations food histogram longtail madeleine recipe visualization
Last synced: 23 Jun 2025
https://github.com/yernaz-togizbayev/microsoft_store_data-analysis
Microsoft Store
data data-analysis data-visualization jupyter-notebook python3
Last synced: 15 May 2026
https://github.com/am-i-groot/summer-intern-iitguwahati-spml
Developed an automated Water Quality Monitoring System (WQMS) at IIT Guwahati, using the pH-W218 sensor and K-Means Clustering to assess water potability. The project enhances water quality evaluation through machine learning-based classification.
algorithm data data-visualization kmeans-clustering machine-learning python report sensor signal-processing
Last synced: 17 May 2026
https://github.com/bytraembedded/Laptop-Price-Prediction-with-Machine-Learning
The Laptop Price Prediction with Machine Learning project provides a system to predict the price of laptops based on various features such as processor type, RAM size, storage capacity, and more/
airflow data data-science data-visualization fastapi heroku-deployment machine-learning-algorithms matplotlib-pyplot numpy pandas python reactjs seaborn
Last synced: 30 Dec 2025
https://github.com/stdlib-js/array-base-filled4d-by
Create a filled four-dimensional nested array according to a provided callback function.
alloc allocate array callback data fill filled foreach generic javascript map matrix multidimensional node node-js nodejs stdlib strided structure types
Last synced: 07 Sep 2025
https://github.com/nia-cloud-official/datascript
DataScript: A Hypothetical Data Scripting Language, DataScript is designed for simplifying data manipulation and analysis tasks. It serves as a scripting language tailored specifically for handling various data operations efficiently.
data data-scripting scripting-language
Last synced: 22 Jun 2025
https://github.com/dennyglee/open-covid19-public
A collaboration between SCRI and Databricks on the analysis of open COVID-19 datasets.
covid-19 data data-analytics data-engineering data-science nlp
Last synced: 22 Jun 2025
https://github.com/DevAthul-88/random-fakedata.js
A package to generate random data
data data-generator fake fake-data fake-data-generator javascipt javascript nodejs npm-package package
Last synced: 22 Jun 2025
https://github.com/seafloor-geodesy/gnatss-test-data
Repository to host test data for GNATSS software
Last synced: 06 Apr 2026
https://github.com/lunastev/reflectlm
ReflectLM is a self-reflective, language-structure-only AI model that learns exclusively through interaction. It starts with zero factual knowledge but can engage in dialogue, evaluate its own responses, and remember conversations for future learning.
ai data language-model llm model open-source ts web
Last synced: 22 Jun 2025
https://github.com/shukkkur/py_dash
Assignment for ETL Course - Dashbaord (plotly & dash)
dash dashboard data data-visualization plotly
Last synced: 06 Oct 2025
https://github.com/harmonydata/harmony_examples
Example Jupyter notebook and R scripts using Harmony in real research problems
data data-harmonisation data-harmonization harmonisation psychology python r research
Last synced: 11 Jul 2025
https://github.com/stefanbohacek/dataviz-projects
My dataviz projects.
data data-visualization dataviz
Last synced: 08 Jul 2025
https://github.com/prioritizr/prioritizrdata
Conservation planning data sets
Last synced: 19 Jul 2025
https://github.com/alireza29675/goudi
GOUDI is a multi-layer data visualization application, inspired by mind maps and some other thinking and describing methods.
analysis data goudi visualization
Last synced: 11 Jul 2025
https://github.com/utkarshverma439/simple-sms-spam-detector
Built a Python text classification model for spam detection in SMS. Explored data, preprocessed text, utilized TF-IDF, trained a classifier, and addressed visualization challenges, yielding practical insights.
data data-science data-visualization spam-detection
Last synced: 20 Jun 2025
https://github.com/yasir13001/moonai_api
This MoonAI API service built with FastAPI that calculates and provides detailed Moon and Sun astronomical data based on user input such as date, latitude, longitude, elevation, and timezone.
ai almanac api astro-ai astronomy data data-science fastapi fastapi-api gemini groq-api hilal-detection html islamic-calenda llama llm-integration moon python
Last synced: 20 Jun 2025
https://github.com/divithraju/divith-raju-data-mining
This project focuses on customer segmentation using data mining techniques, specifically K-Means clustering, to classify customers into distinct groups based on their purchasing behaviors. The goal is to analyze customer data and segment them into clusters for targeted marketing strategies and better customer relationship management.
algorthims analytics apache business client connector data dataarchitecture database dataengineering datamining datascience hadoop k-means-clustering mysql project project-repository pyspark python3 spark
Last synced: 06 Mar 2026
https://github.com/stdlib-js/array-nans
Create an array filled with NaNs and having a specified length.
array complex128 complex128array complex64array data float32array float64array javascript matrix ndarray node node-js nodejs stdlib structure typed typed-array types vector
Last synced: 06 Mar 2026
https://github.com/rob-med/data-visualizations-for-python
A collection of useful snippets for clean data visualizations in Python (with matplotlib)
academic-publishing data data-science data-visualization dataviz matplotlib python scientific-publications storytelling visualization
Last synced: 08 May 2026
https://github.com/tezcatlipoca0000/ayudante
It's mainly a program for a store to manage the products data
data javascript scraping self-taught web
Last synced: 09 Apr 2025
https://github.com/tezcatlipoca0000/db-helper_sf
A program tailored for my workplace; it analyze, visualize and manipulate a Firebird 2.0 database
data data-visualization fdb firebird jupyter-notebook pandas python3
Last synced: 09 Apr 2025
https://github.com/hamzacham/data_set_projet-3
analysis data project rstudio visualization
Last synced: 29 Oct 2025
https://github.com/ate329/nsl-kdd-feature-extractor
Python-based tool designed to process network traffic packets and extract features compliant with the NSL-KDD dataset format.
cyber-security cybersecurity data data-science extractor feature-extraction machine-learning network-analysis nsl-kdd nsl-kdd-dataset
Last synced: 30 Oct 2025
https://github.com/benji-lewis/archivord
An archival bot for Discord servers designed to retain as much data as possible to show future generations how we communicated.
archive data data-mining discord discord-bot typescript
Last synced: 16 May 2026
https://github.com/2kabhishek/pokemon-stats
Gotta stat 'em all 🖲🐭
d3 data emoji pokemon rollup statistics
Last synced: 14 May 2026
https://github.com/ornella-gigante/wildlife-data-analysis-toolkit-ml
A data-driven exploration of Canis lupus signatus (Iberian) and Canis lupus labradorius (Labrador) subspecies, leveraging Jupyter Notebook and pandas to analyze weight distributions (25-56 kg), geographic patterns, and reproductive behaviors. Features size-weight correlations and NaN-handling workflows for robust ecological insights
analysis data datasets jupyter-notebook pandas-dataframe python
Last synced: 15 May 2026
https://github.com/DataHerb/dataherb-flora
DataHerb Flora: The core of DataHerb
data data-mining data-science datascience dataset datasets
Last synced: 08 May 2025
https://github.com/danieljdufour/fast-bin
Quickly Convert an Array of Numbers into their Minimal Binary Representations
array binarize binary bits data nbits numbers unbinarize
Last synced: 13 Apr 2025
https://github.com/cont-limno/lagosus-reservoir
Data module classifying lakes as natural lakes or reservoirs in the conterminous U.S.
Last synced: 17 Jan 2026
https://github.com/danieljdufour/easy-file-saver
Very Easily Save a File
csv data download file file-saver javascript js json save
Last synced: 21 Apr 2026
https://github.com/redodo/shipper
Hide encrypted data in files.
audio data images python steganography
Last synced: 26 Mar 2025
https://github.com/fjc0k/vue-merge-data
Intelligently merge data for Vue render functions.
data merge-data render-functions vue
Last synced: 17 May 2026
https://github.com/mikebairdrocks/fluky
[floo-kee]: obtained by chance rather than skill.
data framework mock netcore netstandard nuget random vscode
Last synced: 17 May 2026
https://github.com/d-ganchar/thedus
Thedus is a lightweight migration tool for Clickhouse
cli clickhouse data database migration migrations python
Last synced: 12 Apr 2025
https://github.com/elvis-not-presley-one/lostcassowary
LostCassowary is an Minecraft data miner that searches region files/.MCA files for data from the game, this one can search for banners, signs, biomes, blocks
data data-mining data-science dataminer minecraft nbt nbt-parser scraper
Last synced: 12 Apr 2025
https://github.com/conduitio/conduit-site
data data-ingestion data-integration documentation
Last synced: 06 May 2025
https://github.com/concaption/ksa-lawyers-data
scraped data of ksa lawyers and law firms
Last synced: 03 Apr 2025
https://github.com/luminati-io/crunchbase-dataset-samples
A sample of 1001 Crunchbase companies with key data points, extracted using the Bright Data API.
crunchbase crunchbase-api crunchbase-scraper data database datasets webscraper-api webscraping
Last synced: 17 Mar 2025
https://github.com/stdlib-js/ndarray-base-assert-is-integer-data-type
Test if an input value is a supported ndarray integer data type.
array assert base check data dtype is javascript multidimensional ndarray node node-js nodejs stdlib test types util utilities utility utils
Last synced: 12 Apr 2025
https://github.com/inzhenerka/scooters_data_uploader
Загрузка данных в PostgreSQL в рамках курса по dbt от Инженерка.Тех
Last synced: 04 May 2026
https://github.com/alhonaut/quant-assigment
Code for quant analyz Morpho Markets and simulation reallocation process in MetaMorpho
analysis data defi quantitative-finance
Last synced: 16 May 2026
https://github.com/codenoid/storial.co-database
a Storial.co Database, collected by Hofesh Bot (Scrapper)
Last synced: 28 Mar 2025
https://github.com/geo-c/oct-ckan
The Open City Toolkit (more information about the project: http://geo-c.eu)
cities collaboration data open participation transparency
Last synced: 16 May 2026
https://github.com/lennart080/esp8266-tinyconfig
Esp8266 library to store configuration data
arduino arduino-ide arduino-library config configuration credential-storage credentials data data-config esp8266 esp8266-arduino iot platformio platformio-library
Last synced: 03 May 2026
https://github.com/Greatwoman23/Market-Basket-Analysis
Unlock the power of data-driven sales optimization with Market Basket Analysis. Explore frequent itemsets and association rules to strategically enhance product placement, design targeted promotions, and adapt to seasonal trends. Elevate your business strategy with insights tailored for boosting sales and engaging customers effectively.
analysis analytics analytics-product data data-science jupyter medium-articles notebook-jupyter python
Last synced: 04 May 2025
https://github.com/jimbrig/jimstaskviews
CRAN Task Views and Shiny App https://jimstaskviews.jimbrig.com
cran data docs rstats shiny-app submodules task-views
Last synced: 06 Mar 2026
https://github.com/muhammad-fiaz/ason
ASON: Adaptive Structured Object Notation - Python library for dynamic data serialization, providing flexibility and simplicity.
adaptive-structure-object-notation api ason cli client data file file-format file-sharing file-upload json json-data json-parser open-source opensource parser parsing python python3
Last synced: 02 Feb 2026
https://github.com/agustinmusanti/sqlchallenge-2
This repository contains my solutions to a SQL challenge using MySQL, centered around a fictional retail company called TechMarket. The challenge covers various SQL tasks such as data retrieval, manipulation, and analysis, simulating real-world scenarios within a retail business environment.
Last synced: 03 Apr 2025
https://github.com/jvrck/australianpayphones
Get Australian payphone data in GeoJSON format.
australia data geojson geojson-data scraper
Last synced: 04 Apr 2025
https://github.com/suyashkumar/deeplesion-gcp-loader
Get the DeepLesion CT Image data set into a GCP Storage Bucket
bucket data data-loader data-loading data-science deep-learning deep-lesion deeplesion gcp gcp-bucket loader storage
Last synced: 04 Apr 2025
https://github.com/fairdataihub/fair-amd-oct-paper-code
Code associated with the paper on FAIR assessment of AMD-related datasets containing OCT data
amd biomedical data eye fair oct
Last synced: 03 Apr 2025
https://github.com/cainmi/easy-pull-from-repository
A repository to pull code and files from, may be used to store page data links, code etc. mainly used for python for now
data html javascript python schema
Last synced: 04 Apr 2025
https://github.com/denko5/sales-analysis
A complete SQL-based sales analysis project covering Africa, showcasing data cleaning, exploratory analysis, insights, and lessons learned. The project highlights sales trends, regional performances, and marketing effectiveness across multiple platforms.
africa data data-analysis data-science exploratory-data-analysis insights kenya sales sql
Last synced: 24 Jan 2026
https://github.com/discindo/natochak
Analysis of bicycle accidents in Macedonia using Rmarkdown and ggplot2
Last synced: 19 Feb 2026
https://github.com/MikeBairdRocks/Fluky
[floo-kee]: obtained by chance rather than skill.
data framework mock netcore netstandard nuget random vscode
Last synced: 02 Apr 2025
https://github.com/bacross/datamunger
python package for handling nan's and outliers
data data-frame datamunger knn nan outliers python scikit-learn
Last synced: 17 May 2026
https://github.com/hughrawlinson/github-data-scripts
Scripts to grab data about repos of interest to compare
data github-graphql github-repo-organizer graphql scripts typescript
Last synced: 09 Jul 2025
https://github.com/michellepellon/jobx
A modern, powerful job scraper for LinkedIn, Indeed and beyond.
compensation data data-analysis indeed indeed-scraping jobs jobsearch linkedin linkedin-scraper
Last synced: 17 Jan 2026
https://github.com/ginga1402/travego_travellers
MySQL Mini Project
college-project data mysql-database
Last synced: 27 Jul 2025
https://github.com/simranjeet97/gpt4_applications
Applications build using OpenAI API and GPT4
ai ai-applications artificial-intelligence chatgpt data data-science gpt3 gpt4 large-language-models llm machine-learning openai openai-api project python
Last synced: 05 May 2026
https://github.com/incubrain/awesome-maharashtra-data
A collection of datasets specific to Maharashtra, India. WIP
ai artificial-intelligence data data-analysis data-science datasets maharashtra marathi
Last synced: 23 May 2026
https://github.com/public-health-scotland/waiting_times_clinical_prioritisation
This repository contains the Reproducible Analytical Pipeline (RAP) to produce the quarterly statistics on clinical prioritisation, part of the Stage of Treatment (SoT) publication.
data healthcare nhs public-health scotland shiny shiny-app treatment waiting-time
Last synced: 26 Jul 2025
https://github.com/patelabhi574/hotel_reservation_analysis
Analyzing data collected by hotel to make future prediction for the owner of what are the segments they are making most profit & also which are the patterns & trends which have been seen over the past years in the booking in different times throughout the year and price setting on the website in peak time as per availability index.
data data-visualization datamodeling looker-studio powerbi reporting sql-query sql-server
Last synced: 19 Feb 2026
https://github.com/reiiyuki/once-data-manager
Once Data Manager is temporary data management utility kit for Unity.
data manager playerprefs preference scene temporary unity
Last synced: 17 May 2026
https://github.com/stonecharioteer/renfield
Synchronize and Search through Hard Drives
catalogue data search storage synchronization
Last synced: 09 Feb 2026
https://github.com/wamphlett/smart-data-objects
An easy solution for capturing and validating data into usable DTO's
data dto forms php php7 validation
Last synced: 17 May 2026
https://github.com/sixarm/sixarm_ruby_fab
SixArm.com → Ruby → Fab gem to fabricate sample data for testing
data fabrication factory fake gem mock ruby
Last synced: 24 Jul 2025
https://github.com/yoursrijit/data-structure-with-java
A data structure is a named location that can be used to store and organize data. And, an algorithm is a collection of steps to solve a particular problem. Learning data structures and algorithms allow us to write efficient and optimized computer programs.
data datastructures dsa-algorithm java linked-list
Last synced: 13 Mar 2025