data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum
- Last updated: 2026-07-02 00:07:45 UTC
https://github.com/dostuffthatmatters/circadian-scp-upload
Resumable, interruptible, SCP upload client for any files or directories generated day by day
checksum daily data directories files library python scp ssh synchronization time-series upload utilities
Last synced: 24 Jun 2025
https://github.com/gbburleigh/quick-seeders
Generate realistic test data quickly with Quick-Seeders, a Python library offering a wide range of data types and schema definitions. Control data variance, probabilities, and output formats, including SQL. Simplify your data seeding process and improve testing efficiency.
data dataset faker generator python seeder sql test
Last synced: 03 Apr 2025
https://github.com/nafisalawalidris/elfeenah
Configuration files for my GitHub profile. Welcome to my GitHub profile! I'm Nafisa Lawal Idris, a passionate Data Scientist with a strong interest in blockchain technology. Explore my GitHub portfolio to delve into the exciting world where data science and blockchain converge.
artificial-intelligence bitcoin blockchain config data data-science-portfolio data-science-projects datascience datascientist deep-learning github-config machinelearning
Last synced: 11 Sep 2025
https://github.com/lisakey/convert-csv-to-sav
We used Python 🐍 to convert a CSV file into a SAV file with all the modifications needed to open it in IBM SPSS and analyse our data.
analysis chardet convert csv data databases ibm os pandas pyreadstat python sav spss sys transformations
Last synced: 08 May 2026
https://github.com/legopitstop/mcextract
Extract assets and data from the Minecraft jar.
assets customtkinter data jar java minecraft pypi python pythonpackage reports serverjars userfolder
Last synced: 17 May 2026
https://github.com/jub0t/Eso
An application to manage all your Encryption & Decryption keys and other related tools.
data encryption encryption-decryption hacking hacking-tool keys pgp privacy private
Last synced: 10 May 2025
https://github.com/giscience/measures-rest-sparql
A SPARQL endpoint for the Measures REST OSHDB App framework.
data osm quality semantics sparql sparql-endpoints
Last synced: 24 Jun 2025
https://github.com/ayush585/fireducksblog
BLOG: Unlocking AI Efficiency: How FireDucks Revolutionizes Data Preprocessing
Last synced: 28 Apr 2026
https://github.com/uhstray-io/just-dashboards
Light and Easy Rust-Fullstack/WASM application to build dashboards from any data source
analytics data dioxus rust visualization
Last synced: 29 Mar 2025
https://github.com/fbraza/paris_airbnb
Analysis of Paris AirBnB data using R and Shiny
analysis data data-analysis paris-airbnb r shiny
Last synced: 21 Mar 2025
https://github.com/dbrennand/rm-content
A Python 3.7 script to remove a specific string from all files and repos (owned by the user).
content data erase eraser privacy privacy-protection privacy-tools remove remover rm-content
Last synced: 29 Mar 2025
https://github.com/pbinkley/tweets-libraries-covid19
A twarc harvest of tweets related to libraries during the COVID-19 outbreak, starting 2020-03-02
Last synced: 06 Mar 2026
https://github.com/wklee610/de_project
[Data Engineer] Personal Toy Project For Study
Last synced: 31 Mar 2025
https://github.com/whatheheckisthis/pwc_project-
Successfully completed a PwC virtual case, advancing Power BI skills to address cybersecurity and cloud architecture requirements. Developed comprehensive dashboards that effectively communicated key performance indicators (KPIs), showcasing proficiency in data visualization and deliver
case-study data data-science dataanalytics databases datavisualization powerbi virtual
Last synced: 05 Apr 2025
https://github.com/linas/archeo
File Recovery, Integrity and Archive Management
corruption data monitoring recovery
Last synced: 29 Mar 2025
https://github.com/dsietz/datadot
data multicast plugin-manager plugins rust-lang
Last synced: 07 Sep 2025
https://github.com/epsoft/dataset-generator
dataset generator
data dataset dataset-generation matplotlib matplotlib-figures tensorflow tensorflow-datasets
Last synced: 18 May 2026
https://github.com/yash22222/tsf-grip-tasks
The Sparks Foundation Data Science & Business Analytics Internship Tasks
buisness-intelligence business-analytics data data-science data-science-projects data-structures grip gripjune23 internship internship-task machine-learning projects python simple-linear-regression the-sparks-foundation tsf
Last synced: 27 Apr 2026
https://github.com/sottey/shon
SHON (Structured Human-Optimized Notation) is a data serialization format designed for readability, schema support, and practical use in modern systems. Version 0.6 introduces advanced types and syntax improvements.
data golang json spec specification
Last synced: 18 May 2026
https://github.com/rifqanzalbina/libraryjs
A Library js
data data-science database datascience javascript javascript-library
Last synced: 17 Jan 2026
https://github.com/definetlynotai/test_generator
A tool to create datasets based on configurations from a CSV file. This tool can be used as a skeleton for other software.
algorithim csv data development dynamic exam generator huge nirt powerful python skeleton test tools
Last synced: 21 Jul 2025
https://github.com/openfoodfacts/openfoodfacts-corrector
Ruby script to correct and enhance data on OpenFoodFacts
Last synced: 24 Apr 2026
https://github.com/rrwen/slides-covid19-geosocial-db
Presentation titled "A Real-time Geo-social Media Database for Large-scale Coronavirus Disease 2019 (COVID-19) Research" for my second research seminar at Ryerson University
covid covid-19 covid19 data database disease geo gis index media ncov-2019 ncov19 postgres postgresql presentation research seminar slides social virus
Last synced: 18 May 2026
https://github.com/ajitharunai/covid-tracker-using-python
Covid-Tracker-Using-Python
data datavisualization python python3 pythonapplications
Last synced: 25 Jun 2025
https://github.com/junkwaxdata/cardlists
Sports Card set lists in easily consumable JSON Format for databases, apps, websites, and more!
baseball baseball-cards baseball-data bowman data dataset datasets donruss fleer json json-schema panini topps upper-deck
Last synced: 13 Mar 2025
https://github.com/dr-saad-la/r-distilled
R Programming Language distilled
data data-analysis learning programming-language r rlanguage rprogramming statistical-analysis
Last synced: 18 May 2026
https://github.com/ubc-library-rc/ggplot2_intro_workshop
Workshop about data visualization with ggplot2 in R
Last synced: 01 Jul 2026
https://github.com/ubc-library-rc/intro_to_tidyverse
Introductory workshop about the tidyverse package
Last synced: 01 Jul 2026
https://github.com/randomfractals/chicago-transport
Exploratory data analysis of public Chicago transportation datasets.
chicago data data-tools duckdb sql transportation
Last synced: 01 May 2026
https://github.com/abhaysingh71/india-censes-data-analysis
This repo contains an analysis of India census data across many domains
data data-science data-visualization dataanalysis streamlit
Last synced: 15 May 2026
https://github.com/jigyasag18/sonar-rock-vs-mine-prediction-ml-project
This repository contains a machine learning project that classifies SONAR reading data to distinguish between rocks and mines. It implements various classification models, evaluates their performance, and features a user-friendly web application deployed with Streamlit for real-time predictions. The project aims to support safe marine operations.
classification data dataset machine-learning machine-learning-algorithms machinelearning machinelearning-python machinelearningmodel machinelearningproject machinelearningprojects modelevaluation modeltraining prediction-model streamlit streamlit-webapp
Last synced: 18 May 2026
https://github.com/mundra-ankur/msw_ai_pipeline
Municipal solid waste (MSW) characterization: an AI and data pipeline to characterize solid waste in real time into different buckets using YOLO
artificial-intelligence data datapipeline solid-waste-segregation yolo
Last synced: 11 Apr 2025
https://github.com/amyflo/cs448b
Exploring r/LoveLetters
d3-visualization d3js data react reactjs visualization
Last synced: 18 May 2026
https://github.com/pythongiant/data-analytics-wolfram-alpha
A data analysis program using Wolfram Alpha
analytics api data wolfram-alpha
Last synced: 04 Apr 2025
https://github.com/stonecharioteer/renfield
Synchronize and Search through Hard Drives
catalogue data search storage synchronization
Last synced: 09 Feb 2026
https://github.com/iosdec/adstorage
Automatic Data Storage - iOS
data ios objective-c public storage xcode
Last synced: 21 Mar 2025
https://github.com/sambacha/yearn-finance-data
data repo for proposed YIP-DATA
cryptocurrency data erc20 ethereum exchange yearn yip yyip
Last synced: 18 May 2026
https://github.com/sksubhadeep/nashville-housing-data-cleaning-project-using-sql
SQL Data Cleaning Project on Nashville Housing Dataset
Last synced: 19 Mar 2026
https://github.com/newrelic-experimental/newrelic-java-sap-bi
Instrumentation for SAP PI/PO Server
bi data instrumentation java newrelic nrlabs nrlabs-data nrlabs-odp observability-data sap sap-pi sap-po
Last synced: 03 Mar 2025
https://github.com/bredalis/seaborn
📊 Library to create graphics 📊
data graphics-programming librery python seaborn seaborn-plots
Last synced: 04 Mar 2025
https://github.com/davedupplaw/jquery.faceted-browser
Faceted Data Browser for jQuery
browser data database drag-and-drop draggable-elements facet-browser javascript javascript-library jquery jquery-plugin jquery-widgets
Last synced: 29 Apr 2026
https://github.com/erinaldi/bmn2-lattice
Data analysis of lattice Monte Carlo simulations of quantum matrix models.
data data-science data-visualisation lattice
Last synced: 27 Mar 2025
https://github.com/r-mahesh45/hr---resume-text-classification
Text Classification for Resumes: Conducted Exploratory Data Analysis (EDA) on a vast collection of resumes. Organized the data using Bag of Words (BoW) and TF-IDF techniques. Built and evaluated multiple models, with Logistic Regression delivering standout performance. Created Word Clouds and Histograms.
data datacleaning extract-transform-load feature-extraction nlp nltk-tokenizer text-mining text-processing
Last synced: 12 Sep 2025
https://github.com/bastianolea/censo_viviendas
Housing census (Censo de Viviendas) processed with R to make it available with commune and region codes/names and labels for its variables. Provided in the original format (6.5 million rows) and as counts by commune.
chile comunas data poblacion rural
Last synced: 30 Oct 2025
https://github.com/artcc/coredatagenericmodule
Generic Core Data module for persisting encrypted objects
core coredata coredata-model data data-generic database encrypted encrypted-data encryption entity identifier persist protocol swift
Last synced: 08 May 2026
https://github.com/makosai/covid19datachart
A basic chart for checking corona data. Written in a single HTML file for convenience. Grab the single file and run it anywhere. Or visit the webpage.
chart chartjs corona coronavirus coronavirus-analysis covid-19 covid-2019 covid19 covid19-data data data-analysis datasets
Last synced: 23 Feb 2026
https://github.com/denisecase/nw-network-data-analytics
Network for those earning a NW Masters of Applied Data Science
Last synced: 02 Feb 2026
https://github.com/gabrieldim/world-bank-wdi-data-science
Faculty project. World Bank predictions with Data Science.
convolutional-neural-networks data data-science model neural-network neural-networks prediction-model python science
Last synced: 15 May 2026
https://github.com/bayer-group/cmc-ontologies
This is a submodule of cmc-knowledge-graph-setup. It contains ontologies and relevant data graph files
Last synced: 16 Jun 2025
https://github.com/tsiarokhin/student_bsu_by
Tool for parsing various BSU student information from student.bsu.by website.
belarus bsu data grades python students study university
Last synced: 28 May 2026
https://github.com/glassflow/pipelines-push-action
This GitHub Action lets you automate GlassFlow pipeline deployments as code
data data-processing datastreaming deployment github-actions glassflow python real-time stream-processing
Last synced: 19 May 2026
https://github.com/hoaihuongbk/lakeops
A modern data lake operations toolkit working with multiple table formats (Delta, Iceberg, Parquet) and engines (Spark, Polars) via the same APIs.
data data-operations dataengineering datalake
Last synced: 07 Mar 2026
https://github.com/wahyuwsslah/salary_prediction-aiml
Salary Prediction using Machine Learning with 3 Models. Linear Regression, Decision Tree, Random Forest
ai analytics data data-science datascience machine-learning python python3
Last synced: 19 May 2026
https://github.com/yazeed44/reform-api
A platform that harnesses the power of multiple data streams, including satellite imagery and drone photos, to visualize multiple urban planning indices and provide descriptive analytics that will empower local Saudi authorities to make data-driven decisions that contribute to neighborhood quality of life.
Last synced: 18 May 2026
https://github.com/diddypod/crop-data-comparer
A Python script to compare crop data over years
comparison crop data openpyxl python
Last synced: 28 Jun 2026
https://github.com/stdlib-js/ndarray-base-reverse-dimension
Return a view of an input ndarray in which the order of elements along a specified dimension is reversed.
base data flip javascript matrix ndarray node node-js nodejs reverse slice stdlib structure types vector view
Last synced: 07 Mar 2026
https://github.com/vishwagauravin/screener-scraper-pro
Effortlessly scrape comprehensive financial data from screener.in and use it in your projects. No API key required.
data finance finances market-data scraper scrapers screener screener-in screener-plugin stock stock-data stock-market stocks
Last synced: 18 Feb 2026
https://github.com/hmeleiro/r_dataviz
Data visualization projects with R
data dataviz r rmd-files social-science survey-data
Last synced: 21 Jun 2026
https://github.com/greatwoman23/market-basket-analysis
Unlock the power of data-driven sales optimization with Market Basket Analysis. Explore frequent itemsets and association rules to strategically enhance product placement, design targeted promotions, and adapt to seasonal trends. Elevate your business strategy with insights tailored for boosting sales and engaging customers effectively.
analysis analytics analytics-product data data-science jupyter medium-articles notebook-jupyter python
Last synced: 28 Apr 2026
https://github.com/chompfoods/sdk-go
Go SDK for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.
api branded chomp data database food go grocery ingredients nutrition raw recipe-api recipes sdk
Last synced: 19 May 2026
https://github.com/m-muecke/isocountry
R package containing ISO codes for countries and currencies
country-codes currency-codes data iso-3166-1 iso-4217 r r-package
Last synced: 20 Mar 2025
https://github.com/panukatan/senso
An Interface to the Philippine Census of Population and Housing Data
census data philippines r rstats
Last synced: 29 Jun 2026
https://github.com/kingtous/bots_task_result
Result of the Barcelona OpenMP Tasks Suite (BOTS) using ompTG
Last synced: 09 Jul 2025
https://github.com/gcoronelc/ucv_gdi-1_202302-b2
Data and Information Management I workshop with Gustavo Coronel.
data data-science data-structures database databases online oracle query relational-databases security sql sql-server
Last synced: 19 May 2026
https://github.com/sermetpekin/perse
Perse is an experimental Python package that combines some of the most widely used functionality from the powerhouse libraries Pandas, Polars, and DuckDB into a single, unified DataFrame object. The goal of Perse is to provide a streamlined and efficient interface, leveraging the strengths of these libraries to offer versatile data handling.
data data-science data-structures duckdb pandas polars
Last synced: 09 May 2026
https://github.com/cyberaula/edvl
Educational Data Virtual Lab
apache-zeppelin big-data-platform data education fiware fiware-cosmos fiware-draco fiware-keyrock fiware-ngsi fiware-orion human-data-interaction ipynb notebook notebooks spark streaming-data upm zeppelin zeppelin-notebook
Last synced: 19 May 2026
https://github.com/labgua/ilmeteo
Data acquisition from the SHT71 sensor and real-time transmission over the network
acquisition data humidity humidity-sensor iot raspberry-pi real-time realtime rpi sht71 temperatura temperature temperature-sensor umidita web
Last synced: 24 Apr 2026
https://github.com/viveknathani/maketest
A command line tool to generate test data. 📊
command-line data golang testing-tools
Last synced: 08 Jun 2026
https://github.com/yoursrijit/data-structure-with-java
A data structure is a named location that can be used to store and organize data. An algorithm is a collection of steps to solve a particular problem. Learning data structures and algorithms allows us to write efficient and optimized computer programs.
data datastructures dsa-algorithm java linked-list
Last synced: 13 Mar 2025
https://github.com/habedi/adbis-2023-paper
This repository hosts the code and data used for the experiments reported in the paper titled "Diversification of Top-k Geosocial Queries", published in ADBIS 2023
artifacts conference-paper data experiments graphs java research-paper
Last synced: 19 May 2026
https://github.com/uvaio/datasets
Notebooks for data processing, scraping, machine learning
data dataset jupyter jupyter-notebook learning machine ml model ontology
Last synced: 21 Mar 2025
https://github.com/nottherealtar/data_engineering_assesments
assesments data data-engineer interview-questions interview-test
Last synced: 13 Sep 2025
https://github.com/pedro-donoso/productoskotlin
App that loads a list of products with ID, name, description, available, enabled, and stock fields; converts the name to uppercase; replaces the booleans for available and enabled with SI or NO; and sorts the products in descending order by stock
class data fun id kotlin kotlin-android list
Last synced: 19 May 2026
https://github.com/amethyst-php/aggregator
aggregator amethyst amethyst-package api data laravel
Last synced: 19 May 2026
https://github.com/penspanic/datra
Datra is a comprehensive data management system for game development.
data game game-development gamedata unity unity-package unity3d-plugin
Last synced: 19 May 2026
https://github.com/georginapuig/graps-from-csv
📊 Data visualization with c3.js and Papaparse from CSV files.
c3 c3js chart d3 d3js data data-visualization graphs javascript javascript-library visualization
Last synced: 19 May 2026
https://github.com/radeelahmad/data-structure-code
Various code samples in C++
code data data-structures dsa dsa-algorithm
Last synced: 08 May 2026
https://github.com/ushkinaz/cbn-data
Automated game data extraction and processing for Cataclysm: Bright Nights. Provides JSON mirrors, WebP asset conversion, and unified translation data.
Last synced: 07 Mar 2026
https://github.com/coral/ddp
Distributed Display Protocol (DDP) in Go
data ddp distributed golang led pixel protocol wled
Last synced: 26 Jun 2025