data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-07-02 00:07:45 UTC
- JSON Representation
https://github.com/benji-lewis/archivord
An archival bot for Discord servers designed to retain as much data as possible to show future generations how we communicated.
archive data data-mining discord discord-bot typescript
Last synced: 16 May 2026
https://github.com/hamzacham/data_set_projet-3
analysis data project rstudio visualization
Last synced: 29 Oct 2025
https://github.com/stdlib-js/array-nans
Create an array filled with NaNs and having a specified length.
array complex128 complex128array complex64array data float32array float64array javascript matrix ndarray node node-js nodejs stdlib structure typed typed-array types vector
Last synced: 06 Mar 2026
https://github.com/divithraju/divith-raju-data-mining
This project focuses on customer segmentation using data mining techniques, specifically K-Means clustering, to classify customers into distinct groups based on their purchasing behaviors. The goal is to analyze customer data and segment them into clusters for targeted marketing strategies and better customer relationship management.
algorthims analytics apache business client connector data dataarchitecture database dataengineering datamining datascience hadoop k-means-clustering mysql project project-repository pyspark python3 spark
Last synced: 06 Mar 2026
https://github.com/yasir13001/moonai_api
This MoonAI API service built with FastAPI that calculates and provides detailed Moon and Sun astronomical data based on user input such as date, latitude, longitude, elevation, and timezone.
ai almanac api astro-ai astronomy data data-science fastapi fastapi-api gemini groq-api hilal-detection html islamic-calenda llama llm-integration moon python
Last synced: 20 Jun 2025
https://github.com/utkarshverma439/simple-sms-spam-detector
Built a Python text classification model for spam detection in SMS. Explored data, preprocessed text, utilized TF-IDF, trained a classifier, and addressed visualization challenges, yielding practical insights.
data data-science data-visualization spam-detection
Last synced: 20 Jun 2025
https://github.com/alireza29675/goudi
GOUDI is a multi-layer data visualization application, inspired by mind maps and some other thinking and describing methods.
analysis data goudi visualization
Last synced: 11 Jul 2025
https://github.com/harmonydata/harmony_examples
Example Jupyter notebook and R scripts using Harmony in real research problems
data data-harmonisation data-harmonization harmonisation psychology python r research
Last synced: 11 Jul 2025
https://github.com/lunastev/reflectlm
ReflectLM is a self-reflective, language-structure-only AI model that learns exclusively through interaction. It starts with zero factual knowledge but can engage in dialogue, evaluate its own responses, and remember conversations for future learning.
ai data language-model llm model open-source ts web
Last synced: 22 Jun 2025
https://github.com/DevAthul-88/random-fakedata.js
A package to generate random data
data data-generator fake fake-data fake-data-generator javascipt javascript nodejs npm-package package
Last synced: 22 Jun 2025
https://github.com/flownrecords/flightTracker
A mobile app built to record essential flight data for post-flight review and debriefing.
Last synced: 23 Jun 2025
https://github.com/shuklayash02/complete_data_analysis_project
A Full Data Analysis project where a sales data is ask,prepare,process,analyze,share and act through data analysis process
data data-visualization dataanalysis database datacleaning powerbi sql
Last synced: 16 Jul 2025
https://github.com/ouverz/governed_arr
A dbt project showing end-to-end ARR definitions, compute, transformation, validation and governance
arr data data-modeling dbt-core governance semantic-layer snowflake
Last synced: 02 Jul 2026
https://github.com/clabe45/kaz
Minimalistic local storage cli
cli data minimalistic storage utility
Last synced: 17 Jul 2025
https://github.com/andrianllmm/wika-data
Philippine language resources.
data language low-resource-languages parser philippines scraper
Last synced: 17 Jul 2025
https://github.com/giscience/measures-rest-oshdb-docker
Scripts for starting measures for geospatial datasets in docker container, using the OSHDB
data dggs docker geospatial mesure openstreetmap rest
Last synced: 18 Apr 2026
https://github.com/prioritizr/prioritizrdata
Conservation planning data sets
Last synced: 19 Jul 2025
https://github.com/fjc0k/vue-merge-data
Intelligently merge data for Vue render functions.
data merge-data render-functions vue
Last synced: 17 May 2026
https://github.com/muhammad-fiaz/ason
ASON: Adaptive Structured Object Notation - Python library for dynamic data serialization, providing flexibility and simplicity.
adaptive-structure-object-notation api ason cli client data file file-format file-sharing file-upload json json-data json-parser open-source opensource parser parsing python python3
Last synced: 02 Feb 2026
https://github.com/bacross/datamunger
python package for handling nan's and outliers
data data-frame datamunger knn nan outliers python scikit-learn
Last synced: 17 May 2026
https://github.com/simranjeet97/gpt4_applications
Applications build using OpenAI API and GPT4
ai ai-applications artificial-intelligence chatgpt data data-science gpt3 gpt4 large-language-models llm machine-learning openai openai-api project python
Last synced: 05 May 2026
https://github.com/shgysk8zer0/schema
A PHP implementation of schema.org structured data objects
data microdata schema seo structured-data
Last synced: 24 Jun 2025
https://github.com/nafisalawalidris/elfeenah
Configuration files for my GitHub profile. Welcome to my GitHub profile! I'm Nafisa Lawal Idris, a passionate Data Scientist with a strong interest for blockchain technology. Explore my GitHub portfolio to delve into the exciting world where data science and blockchain converge.
artificial-intelligence bitcoin blockchain config data data-science-portfolio data-science-projects datascience datascientist deep-learning github-config machinelearning
Last synced: 11 Sep 2025
https://github.com/legopitstop/mcextract
Extract assets and data from the Minecraft jar.
assets customtkinter data jar java minecraft pypi python pythonpackage reports serverjars userfolder
Last synced: 17 May 2026
https://github.com/giscience/measures-rest-sparql
A SPARQL endpoint for the Measures REST OSHDB App framework.
data osm quality semantics sparql sparql-endpoints
Last synced: 24 Jun 2025
https://github.com/fbraza/paris_airbnb
Analysis of Paris AirBnB data using R and Shiny
analysis data data-analysis paris-airbnb r shiny
Last synced: 21 Mar 2025
https://github.com/linas/archeo
File Recovery, Integrity and Archive Management
corruption data monitoring recovery
Last synced: 29 Mar 2025
https://github.com/sottey/shon
SHON (Structured Human-Optimized Notation) is a data serialization format designed for readability, schema support, and practical use in modern systems. Version 0.6 introduces advanced types and syntax improvements.
data golang json spec specification
Last synced: 18 May 2026
https://github.com/definetlynotai/test_generator
A tool to create datasets based on configurations from a csv file, This tool can be used as a skeleton for other software.
algorithim csv data development dynamic exam generator huge nirt powerful python skeleton test tools
Last synced: 21 Jul 2025
https://github.com/rrwen/slides-covid19-geosocial-db
Presentation titled "A Real-time Geo-social Media Database for Large-scale Coronavirus Disease 2019 (COVID-19) Research" for my second research seminar at Ryerson University
covid covid-19 covid19 data database disease geo gis index media ncov-2019 ncov19 postgres postgresql presentation research seminar slides social virus
Last synced: 18 May 2026
https://github.com/junkwaxdata/cardlists
Sports Card set lists in easily consumable JSON Format for databases, apps, websites, and more!
baseball baseball-cards baseball-data bowman data dataset datasets donruss fleer json json-schema panini topps upper-deck
Last synced: 13 Mar 2025
https://github.com/jigyasag18/sonar-rock-vs-mine-prediction-ml-project
This repository contains a machine learning project that classifies SONAR reading data to distinguish between rocks and mines. It implements various classification models,evaluates their performance,and features a user-friendly web application deployed with Streamlit for real-time predictions. The project is aimed to help in safe marine operations.
classification data dataset machine-learning machine-learning-algorithms machinelearning machinelearning-python machinelearningmodel machinelearningproject machinelearningprojects modelevaluation modeltraining prediction-model streamlit streamlit-webapp
Last synced: 18 May 2026
https://github.com/amyflo/cs448b
Exploring r/LoveLetters
d3-visualization d3js data react reactjs visualization
Last synced: 18 May 2026
https://github.com/iosdec/adstorage
Automatic Data Storage - iOS
data ios objective-c public storage xcode
Last synced: 21 Mar 2025
https://github.com/bredalis/seaborn
π Library to create graphics π
data graphics-programming librery python seaborn seaborn-plots
Last synced: 04 Mar 2025
https://github.com/r-mahesh45/hr---resume-text-classification
Text Classification for Resumes: Conducted Exploratory Data Analysis (EDA) on a vast collection of resumes. Organized the data using Bag of Words (BoW) and TF-IDF techniques. Built and evaluated multiple models, with Logistic Regression delivering standout performance. Created Word Clouds and Histograms.
data datacleaning extract-transform-load feature-extraction nlp nltk-tokenizer text-mining text-processing
Last synced: 12 Sep 2025
https://github.com/glassflow/pipelines-push-action
This Github Action lets you automate GlassFlow pipelines deployments as code
data data-processing datastreaming deployment github-actions glassflow python real-time stream-processing
Last synced: 19 May 2026
https://github.com/wahyuwsslah/salary_prediction-aiml
Salary Prediction using Machine Learning with 3 Models. Linear Regression, Decision Tree, Random Forest
ai analytics data data-science datascience machine-learning python python3
Last synced: 19 May 2026
https://github.com/gcoronelc/ucv_gdi-1_202302-b2
Taller de GestiΓ³n de Datos e InformaciΓ³n I con Gustavo Coronel.
data data-science data-structures database databases online oracle query relational-databases security sql sql-server
Last synced: 19 May 2026
https://github.com/viveknathani/maketest
A command line tool to generate test data. π
command-line data golang testing-tools
Last synced: 08 Jun 2026
https://github.com/habedi/adbis-2023-paper
This repository hosts the code and data used for the experiments reported in the paper titled "Diversification of Top-k Geosocial Queries", published in ADBIS 2023
artifacts conference-paper data experiments graphs java research-paper
Last synced: 19 May 2026
https://github.com/nottherealtar/data_engineering_assesments
assesments data data-engineer interview-questions interview-test
Last synced: 13 Sep 2025
https://github.com/pedro-donoso/productoskotlin
App que carga una lista de Productos con ID, Nombre, DescripciΓ³n, Disponible, Habilitado y Stock, convierte el nombre a mayΓΊsculas, cambia boolean por SI o NO si estΓ‘ disponible y habilitado, los ordena descendente segΓΊn Stock
class data fun id kotlin kotlin-android list
Last synced: 19 May 2026
https://github.com/georginapuig/graps-from-csv
π Data visualization with c3.js and Papaparse from CSV files.
c3 c3js chart d3 d3js data data-visualization graphs javascript javascript-library visualization
Last synced: 19 May 2026
https://github.com/ushkinaz/cbn-data
Automated game data extraction and processing for Cataclysm: Bright Nights. Provides JSON mirrors, WebP asset conversion, and unified translation data.
Last synced: 07 Mar 2026
https://github.com/warlock/tck
Data Type Checker
ajax browser data javascript nodejs type-checking types validation
Last synced: 19 May 2026
https://github.com/aboualine/sql-formation
Library Management System Database: A MySQL project with tables, triggers, stored procedures, and views for managing books, members, and borrowings. Includes sample data for testing. Ideal for learning SQL or building a library app.
data database library-management-system mysql sql system
Last synced: 18 Apr 2026
https://github.com/amethyst-php/contract
amethyst amethyst-package api contract data laravel
Last synced: 20 May 2026
https://github.com/owengombas/genyus
π Lyrics analysis with genius.com, Python and Jupyter Notebooks
api data data-science genius jupyter-notebook lyrics python statistics
Last synced: 20 May 2026
https://github.com/xrahul/android-logs
Get logs of various sensors and events in android 6.0+
Last synced: 20 May 2026
https://github.com/snitkin-lab-umich/prewas_manuscript_analysis
Manuscript in support of prewas software
data data-visualisation manuscript r
Last synced: 08 Jul 2025
https://github.com/gappeah/cookie-company-visual-dashboard
This Excel-based interactive dashboard provides a comprehensive overview of the Cookie Company's sales performance and key metrics.
dashboard data data-visualization excel microsoft-excel
Last synced: 25 Feb 2025
https://github.com/gappeah/british-airways-analysis
This project focuses on analyzing and visualising travel data from British Airways using Tableau. The goal is to extract insights and present them in an interactive and visually appealing manner.
data data-analysis data-visualization tableau
Last synced: 11 Jun 2025
https://github.com/swarchal/morar
Processing phenotypic screening data
biology data data-analysis drug-discovery hts phenotypic
Last synced: 19 Jun 2025
https://github.com/adrian-pasek-prv/data-modeling-with-cassandra
Create a data model in Apache Cassandra for music streaming app
apache-cassandra data data-engineering data-modeling python
Last synced: 02 Jan 2026
https://github.com/beangreen247/osfetch-old.sh
script that fetches system information and displays it to the user
247 bash bean beangreen247 data fetch green information neofetch neofetch-clone os script sh shell storage system tem zsh
Last synced: 02 Nov 2025
https://github.com/ibz-04/data-encryption
Encrypting and Decrypting given data of hospital patients such as: audio & image files
Last synced: 23 Jul 2025
https://github.com/mierune/tinygrib2
(experimental) A tiny toolkit for parsing JMA's GRIB2 files.
data grib grib2 meteorology rust weather
Last synced: 27 Jun 2025
https://github.com/bhpcv252/dda-binapprox-on-fits
Using the binapprox algorithm to efficiently estimate the median of each pixel from a set of astronomy images in FITS files.
Last synced: 22 Mar 2025
https://github.com/jen-uis/loan-status-prediction
This repository contains project materials for the Winter STAT 206 class, University of California, Riverside, A. Gary Anderson School of Management.
data data-analysis data-analytics data-cleaning data-visualization descriptive-analytics julia julia-language jupyter-notebook predictive-analytics predictive-modeling team-collaboration
Last synced: 02 Jan 2026
https://github.com/umbaji/yodi
This is the official repository for Yodi, the speech recognition model for 8 words, in Ewè. The yodi package is also useful for rapid inference inference on speech data, especially on the mini_speech datasets.
data data-visualization keras python3 speech-recognition tensorflow
Last synced: 12 Jan 2026
https://github.com/priyanshubiswas-tech/aws-etl-pipeline-on-cloud-using-glue-athena-lambda-and-redshift
Serverless ETL pipeline on AWS using Glue, Lambda, Athena, and Redshift β automates data ingestion, transformation, and analytics with scalable, event-driven architecture.
athena aws aws-glue data data-engineering etl etl-pipeline lambda redshift
Last synced: 02 May 2026
https://github.com/tushar2704/interview-quest
Interview-Quest is comprehensive collection of interview questions and answers that can help you prepare for technical interviews. Whether you're a seasoned developer looking to brush up on your skills or a job seeker preparing for your next big opportunity, this repository aims to provide valuable resources to enhance your interview readiness.
artificial-intelligence data data-science interview interview-questions machine-learning
Last synced: 23 Jan 2026
https://github.com/tupizz/data-processing-pipeline-aws
This project is a serverless application built with the Serverless Framework, TypeScript, and AWS services. It provides an enrichment service that processes contact information and enriches it with additional data.
aws data pipeline serverless typescript
Last synced: 13 May 2026
https://github.com/jebin1999/livestock-production-monitoring-
Livestock production Monitoring
data datascience livestock livestock-monitor r shiny shiny-apps shiny-r shinydashboard
Last synced: 05 Nov 2025
https://github.com/raigu/ordered-lists-sync
Library for synchronizing ordered data with the minimum of insert and delete operations. Suitable for lage data sets in isolated environments
data lists ordering sync syncrhonization update
Last synced: 12 Jan 2026
https://github.com/kevinsames/spark-fuse
spark-fuse is an open-source toolkit for PySpark β providing utilities, connectors, and tools to fuse your data workflows together.
data databricks fabric pyspark python spark
Last synced: 08 May 2026
https://github.com/thomd/git-scrape-hacker-news
scrape hacker news metadata for data analysis
data data-science git-scraping hacker-news
Last synced: 16 Sep 2025
https://github.com/aruneshbasak/python-dsa-problems-geeksforgeeks-160-days
I will upload my daily Python DSA problems solved on GeeksforGeeks and post it here!
algorithms-and-data-structures and data data-structures dsa python python3 structure
Last synced: 08 May 2025
https://github.com/qeeqbox/data-lifecycle-management
Data Lifecycle Management (DLM) is a policy-based model for managing data in an organization
data data-lifecycle-management infosecsimplified lifecycle management qeeqbox
Last synced: 07 Mar 2026
https://github.com/sixarm/sixarm_ruby_fab
SixArm.com β Ruby β Fab gem to fabricate sample data for testing
data fabrication factory fake gem mock ruby
Last synced: 24 Jul 2025
https://github.com/stonecharioteer/renfield
Synchronize and Search through Hard Drives
catalogue data search storage synchronization
Last synced: 09 Feb 2026
https://github.com/patelabhi574/hotel_reservation_analysis
Analyzing data collected by hotel to make future prediction for the owner of what are the segments they are making most profit & also which are the patterns & trends which have been seen over the past years in the booking in different times throughout the year and price setting on the website in peak time as per availability index.
data data-visualization datamodeling looker-studio powerbi reporting sql-query sql-server
Last synced: 19 Feb 2026
https://github.com/incubrain/awesome-maharashtra-data
A collection of datasets specific to Maharashtra, India. WIP
ai artificial-intelligence data data-analysis data-science datasets maharashtra marathi
Last synced: 23 May 2026
https://github.com/ginga1402/travego_travellers
MySQL Mini Project
college-project data mysql-database
Last synced: 27 Jul 2025
https://github.com/discindo/natochak
Analysis of bicycle accidents in Macedonia using Rmarkdown and ggplot2
Last synced: 19 Feb 2026
https://github.com/velocitatem/cellviz
Cellular Automata inspired by live-data visualization, designed to handle multidimensional and high-throughput data efficiently.
cellular-automata conways-game-of-life data economics
Last synced: 29 Jul 2025
https://github.com/lakecountryhuntclub/dnr-map-data-model
Data Model for the 2023 DNR Pheasant Stocking Property Data
data data-model documentation excel gis hunting mapping powerquery vba
Last synced: 29 Jul 2025
https://github.com/joeyism/py-cifar10
This library was created to allow an easy usage of CIFAR 10 DATA. This is a wrapper around the instructions givn on the CIFAR 10 site
cifar cifar-10 cifar10 data machine-learning machinelearning
Last synced: 30 Jul 2025
https://github.com/asuozzo/medicare-data-analysis
An analysis of Medicare Part D data in Vermont
Last synced: 04 May 2026
https://github.com/gappeah/london-housing-price-dashboard
This Excel-based Housing Visual Dashboard provides a comprehensive view of average house prices across various boroughs in London from 1996 to 2013. The dashboard is designed to offer insights into housing market trends and price variations across different areas of London over time.
data data-analysis data-visualization excel visual
Last synced: 31 Jul 2025
https://github.com/chalk-ai/roadmap
Chalk public roadmap
chalk data data-science mlops pipeline python
Last synced: 17 Jan 2026
https://github.com/BenSFGamer/B.I.O.S.
A biographer
academia agi ai ai-tools artificial-general-intelligence artificial-neural-networks automation data fact-checking information-extraction information-retrieval self-improving self-learning self-referential semi-autonomous software-engineering specialization web-agent web-scraping writer
Last synced: 27 Sep 2025
https://github.com/undistraction/grid-model
A small API for creating a grid and accessing the positions of the cells, rows and columns within it.
2d calculations cells data grid layout model
Last synced: 04 Aug 2025