data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-07-02 00:07:45 UTC
- JSON Representation
https://github.com/stdlib-js/array-nans
Create an array filled with NaNs and having a specified length.
array complex128 complex128array complex64array data float32array float64array javascript matrix ndarray node node-js nodejs stdlib structure typed typed-array types vector
Last synced: 06 Mar 2026
https://github.com/divithraju/divith-raju-data-mining
This project focuses on customer segmentation using data mining techniques, specifically K-Means clustering, to classify customers into distinct groups based on their purchasing behaviors. The goal is to analyze customer data and segment them into clusters for targeted marketing strategies and better customer relationship management.
algorthims analytics apache business client connector data dataarchitecture database dataengineering datamining datascience hadoop k-means-clustering mysql project project-repository pyspark python3 spark
Last synced: 06 Mar 2026
https://github.com/yasir13001/moonai_api
This MoonAI API service built with FastAPI that calculates and provides detailed Moon and Sun astronomical data based on user input such as date, latitude, longitude, elevation, and timezone.
ai almanac api astro-ai astronomy data data-science fastapi fastapi-api gemini groq-api hilal-detection html islamic-calenda llama llm-integration moon python
Last synced: 20 Jun 2025
https://github.com/utkarshverma439/simple-sms-spam-detector
Built a Python text classification model for spam detection in SMS. Explored data, preprocessed text, utilized TF-IDF, trained a classifier, and addressed visualization challenges, yielding practical insights.
data data-science data-visualization spam-detection
Last synced: 20 Jun 2025
https://github.com/alireza29675/goudi
GOUDI is a multi-layer data visualization application, inspired by mind maps and some other thinking and describing methods.
analysis data goudi visualization
Last synced: 11 Jul 2025
https://github.com/harmonydata/harmony_examples
Example Jupyter notebook and R scripts using Harmony in real research problems
data data-harmonisation data-harmonization harmonisation psychology python r research
Last synced: 11 Jul 2025
https://github.com/lunastev/reflectlm
ReflectLM is a self-reflective, language-structure-only AI model that learns exclusively through interaction. It starts with zero factual knowledge but can engage in dialogue, evaluate its own responses, and remember conversations for future learning.
ai data language-model llm model open-source ts web
Last synced: 22 Jun 2025
https://github.com/DevAthul-88/random-fakedata.js
A package to generate random data
data data-generator fake fake-data fake-data-generator javascipt javascript nodejs npm-package package
Last synced: 22 Jun 2025
https://github.com/dennyglee/open-covid19-public
A collaboration between SCRI and Databricks on the analysis of open COVID-19 datasets.
covid-19 data data-analytics data-engineering data-science nlp
Last synced: 22 Jun 2025
https://github.com/nia-cloud-official/datascript
DataScript: A Hypothetical Data Scripting Language, DataScript is designed for simplifying data manipulation and analysis tasks. It serves as a scripting language tailored specifically for handling various data operations efficiently.
data data-scripting scripting-language
Last synced: 22 Jun 2025
https://github.com/artcc/coredatagenericmodule
Core Data generic module for persist encrypted object
core coredata coredata-model data data-generic database encrypted encrypted-data encryption entity identifier persist protocol swift
Last synced: 08 May 2026
https://github.com/evoluteur/madeleinology
Playing with data science by taking a look at the proportions of flour, sugar, butter, and eggs in 147 Madeleine recipes (the traditional French sponge cake).
baking cake cooking cooking-recipes data data-science data-visualization dessert exploratory-analysis exploratory-data-analysis exploratory-data-visualizations food histogram longtail madeleine recipe visualization
Last synced: 23 Jun 2025
https://github.com/flownrecords/flightTracker
A mobile app built to record essential flight data for post-flight review and debriefing.
Last synced: 23 Jun 2025
https://github.com/elazar/pycopyql
Exports a subset of data from a relational database.
data database export relational tool utility
Last synced: 16 May 2026
https://github.com/nichtich/wikidata-taxonomy-examples
Extract classifications from Wikidata
coli-conc data knowledge-organization wikidata
Last synced: 12 Jul 2025
https://github.com/novecento99/nuvolino
air cloud data ikea iot pm pm25 sensor vindstyrka
Last synced: 13 Jul 2025
https://github.com/nafisalawalidris/dr.-semmelweis-and-the-discovery-of-handwashing
Uncover the revolutionary impact of handwashing on mortality rates in healthcare. Explore the story of Dr. Semmelweis and his groundbreaking findings.
data data-analysis handwashing healthcare-analysis medical-breakthrough mortality-rates
Last synced: 13 Jul 2025
https://github.com/davedupplaw/jquery.bargraph
Moving, sliding bargraph display for jQuery
barchart bargraphs data javascript javascript-library jquery jquery-library jquery-plugin jquery-widgets realtime scrolling visualization
Last synced: 17 May 2026
https://github.com/dineshpinto/geist-finance-subgraph
Subgraph for the Geist Finance protocol on the Fantom blockchain.
assemblyscript blockchain data fantom graphql typescript
Last synced: 17 May 2026
https://github.com/shuklayash02/complete_data_analysis_project
A Full Data Analysis project where a sales data is ask,prepare,process,analyze,share and act through data analysis process
data data-visualization dataanalysis database datacleaning powerbi sql
Last synced: 16 Jul 2025
https://github.com/ouverz/governed_arr
A dbt project showing end-to-end ARR definitions, compute, transformation, validation and governance
arr data data-modeling dbt-core governance semantic-layer snowflake
Last synced: 02 Jul 2026
https://github.com/denisecase/nw-network-data-analytics
Network for those earning a NW Masters of Applied Data Science
Last synced: 02 Feb 2026
https://github.com/grycap/cdmi-client-go
A basic Go library to perform CDMI core operations
Last synced: 02 Jul 2026
https://github.com/clabe45/kaz
Minimalistic local storage cli
cli data minimalistic storage utility
Last synced: 17 Jul 2025
https://github.com/mustika-putri-m/-tableu-laporan-data-karyawan-growian
I am currently pursuing a data analysis certification at GROWIA, where I've learned to use tools such as Python, SQL, Google Big Query, Google Data Studio, Advanced Microsoft Excel, and Tableau. This course has enhanced my ability to analyze data using KPIs and business metrics, enabling me to solve business problems more effectively
data data-visualization tableau
Last synced: 17 Feb 2026
https://github.com/andrianllmm/wika-data
Philippine language resources.
data language low-resource-languages parser philippines scraper
Last synced: 17 Jul 2025
https://github.com/yessasvini23/cisco-data-analytics-essentials_-virtual-_internship
From the CISCO Networking Academy
data dataanalysis database datascience excel relational-databases sql statistics structured-query tableau
Last synced: 17 Jul 2025
https://github.com/giscience/measures-rest-oshdb-docker
Scripts for starting measures for geospatial datasets in docker container, using the OSHDB
data dggs docker geospatial mesure openstreetmap rest
Last synced: 18 Apr 2026
https://github.com/saboye/web-scraping-with-python
A web scraping project using Python's "Requests" and "BeautifulSoup" libraries to extract structured data from one or more websites. This project involves sending HTTP requests to the target website(s), retrieving the HTML content of the website(s), and parsing this content to extract the desired data in a usable format.
beautifulsoup csv data data-harvesting data-mining python request web webscraping
Last synced: 18 Jul 2025
https://github.com/sevmardi/data-mining-hacks
Hacks in Data Mining
data data-mining data-mining-algorithms python3
Last synced: 18 Jul 2025
https://github.com/gabrieldim/world-bank-wdi-data-science
Faculty project. World Bank predictions with Data Science.
convolutional-neural-networks data data-science model neural-network neural-networks prediction-model python science
Last synced: 15 May 2026
https://github.com/am-i-groot/summer-intern-iitguwahati-spml
Developed an automated Water Quality Monitoring System (WQMS) at IIT Guwahati, using the pH-W218 sensor and K-Means Clustering to assess water potability. The project enhances water quality evaluation through machine learning-based classification.
algorithm data data-visualization kmeans-clustering machine-learning python report sensor signal-processing
Last synced: 17 May 2026
https://github.com/bytraembedded/Laptop-Price-Prediction-with-Machine-Learning
The Laptop Price Prediction with Machine Learning project provides a system to predict the price of laptops based on various features such as processor type, RAM size, storage capacity, and more/
airflow data data-science data-visualization fastapi heroku-deployment machine-learning-algorithms matplotlib-pyplot numpy pandas python reactjs seaborn
Last synced: 30 Dec 2025
https://github.com/bayer-group/cmc-ontologies
This is a submodule of cmc-knowledge-graph-setup. It contains ontologies and relevant data graph files
Last synced: 16 Jun 2025
https://github.com/patrikmasiar/algorythm-of-the-night
Awesome list of algorithms that help you 🚀 Feel free to contribute 👨🏻💻
algorithms data interview-questions logic logic-programming math mathematics science
Last synced: 02 Jul 2026
https://github.com/tsiarokhin/student_bsu_by
Tool for parsing various BSU student information from student.bsu.by website.
belarus bsu data grades python students study university
Last synced: 28 May 2026
https://github.com/fritzrehde/asciibar
A cli tool to print percentages as ascii bar charts
cli data percentage visualization
Last synced: 02 Jul 2026
https://github.com/prioritizr/prioritizrdata
Conservation planning data sets
Last synced: 19 Jul 2025
https://github.com/timxor/bitcoind-data-ingestion
crypto payments bitcoind data ingestion
Last synced: 02 Jul 2026
https://github.com/yazeed44/reform-api
A platform that harnesses the power of multiple data streams including satellite imagery and drone photos to visualize multiple urban planning indices and provide descriptive analytics that will empower local Saudi authorities to make data-driven decision that contribute to neighborhood quality of life.
Last synced: 18 May 2026
https://github.com/ate329/nsl-kdd-feature-extractor
Python-based tool designed to process network traffic packets and extract features compliant with the NSL-KDD dataset format.
cyber-security cybersecurity data data-science extractor feature-extraction machine-learning network-analysis nsl-kdd nsl-kdd-dataset
Last synced: 30 Oct 2025
https://github.com/DataHerb/dataherb-flora
DataHerb Flora: The core of DataHerb
data data-mining data-science datascience dataset datasets
Last synced: 08 May 2025
https://github.com/public-health-scotland/covid-19-publication-dashboard
Dashboard for weekly COVID-19 publication
coronavirus covid covid-19 covid-testing covid19-data dashboard data hospital-admissions lfd nhs public-health scotland shiny
Last synced: 02 Jul 2026
https://github.com/cont-limno/lagosus-reservoir
Data module classifying lakes as natural lakes or reservoirs in the conterminous U.S.
Last synced: 17 Jan 2026
https://github.com/fjc0k/vue-merge-data
Intelligently merge data for Vue render functions.
data merge-data render-functions vue
Last synced: 17 May 2026
https://github.com/mikebairdrocks/fluky
[floo-kee]: obtained by chance rather than skill.
data framework mock netcore netstandard nuget random vscode
Last synced: 17 May 2026
https://github.com/hmeleiro/r_dataviz
Data visualization projects with R / Proyectos de visualización de datos con R
data dataviz r rmd-files social-science survey-data
Last synced: 21 Jun 2026
https://github.com/greatwoman23/market-basket-analysis
Unlock the power of data-driven sales optimization with Market Basket Analysis. Explore frequent itemsets and association rules to strategically enhance product placement, design targeted promotions, and adapt to seasonal trends. Elevate your business strategy with insights tailored for boosting sales and engaging customers effectively.
analysis analytics analytics-product data data-science jupyter medium-articles notebook-jupyter python
Last synced: 28 Apr 2026
https://github.com/inzhenerka/scooters_data_uploader
Загрузка данных в PostgreSQL в рамках курса по dbt от Инженерка.Тех
Last synced: 04 May 2026
https://github.com/kingtous/bots_task_result
Result of the Barcelona OpenMP Tasks Suite (BOTS) using ompTG
Last synced: 09 Jul 2025
https://github.com/muhammad-fiaz/ason
ASON: Adaptive Structured Object Notation - Python library for dynamic data serialization, providing flexibility and simplicity.
adaptive-structure-object-notation api ason cli client data file file-format file-sharing file-upload json json-data json-parser open-source opensource parser parsing python python3
Last synced: 02 Feb 2026
https://github.com/denko5/sales-analysis
A complete SQL-based sales analysis project covering Africa, showcasing data cleaning, exploratory analysis, insights, and lessons learned. The project highlights sales trends, regional performances, and marketing effectiveness across multiple platforms.
africa data data-analysis data-science exploratory-data-analysis insights kenya sales sql
Last synced: 24 Jan 2026
https://github.com/bacross/datamunger
python package for handling nan's and outliers
data data-frame datamunger knn nan outliers python scikit-learn
Last synced: 17 May 2026
https://github.com/hughrawlinson/github-data-scripts
Scripts to grab data about repos of interest to compare
data github-graphql github-repo-organizer graphql scripts typescript
Last synced: 09 Jul 2025
https://github.com/simranjeet97/gpt4_applications
Applications build using OpenAI API and GPT4
ai ai-applications artificial-intelligence chatgpt data data-science gpt3 gpt4 large-language-models llm machine-learning openai openai-api project python
Last synced: 05 May 2026
https://github.com/reiiyuki/once-data-manager
Once Data Manager is temporary data management utility kit for Unity.
data manager playerprefs preference scene temporary unity
Last synced: 17 May 2026
https://github.com/wamphlett/smart-data-objects
An easy solution for capturing and validating data into usable DTO's
data dto forms php php7 validation
Last synced: 17 May 2026
https://github.com/priyanshubiswas-tech/ev-data-analysis-dashboard
An interactive dashboard analyzing EV trends, including total vehicles, BEV vs. PHEV breakdown, model popularity, state-wise distribution, and CAFV eligibility. Visualizes key insights for data-driven decisions in the EV industry. 📊
dashboard data data-analysis data-science data-visualization tableau tableau-public
Last synced: 17 Feb 2026
https://github.com/marians/tour-tracker
Track the general classification development of the Tour De France, stage over stage
cycling data sports statistics
Last synced: 24 Jun 2025
https://github.com/shgysk8zer0/schema
A PHP implementation of schema.org structured data objects
data microdata schema seo structured-data
Last synced: 24 Jun 2025
https://github.com/dostuffthatmatters/circadian-scp-upload
Resumable, interruptible, SCP upload client for any files or directories generated day by day
checksum daily data directories files library python scp ssh synchronization time-series upload utilities
Last synced: 24 Jun 2025
https://github.com/nafisalawalidris/elfeenah
Configuration files for my GitHub profile. Welcome to my GitHub profile! I'm Nafisa Lawal Idris, a passionate Data Scientist with a strong interest for blockchain technology. Explore my GitHub portfolio to delve into the exciting world where data science and blockchain converge.
artificial-intelligence bitcoin blockchain config data data-science-portfolio data-science-projects datascience datascientist deep-learning github-config machinelearning
Last synced: 11 Sep 2025
https://github.com/agustinmusanti/sqlchallenge-2
This repository contains my solutions to a SQL challenge using MySQL, centered around a fictional retail company called TechMarket. The challenge covers various SQL tasks such as data retrieval, manipulation, and analysis, simulating real-world scenarios within a retail business environment.
Last synced: 03 Apr 2025
https://github.com/legopitstop/mcextract
Extract assets and data from the Minecraft jar.
assets customtkinter data jar java minecraft pypi python pythonpackage reports serverjars userfolder
Last synced: 17 May 2026
https://github.com/Greatwoman23/Market-Basket-Analysis
Unlock the power of data-driven sales optimization with Market Basket Analysis. Explore frequent itemsets and association rules to strategically enhance product placement, design targeted promotions, and adapt to seasonal trends. Elevate your business strategy with insights tailored for boosting sales and engaging customers effectively.
analysis analytics analytics-product data data-science jupyter medium-articles notebook-jupyter python
Last synced: 04 May 2025
https://github.com/jub0t/Eso
An application to manage all your Encryption & Decryption keys and other related tools.
data encryption encryption-decryption hacking hacking-tool keys pgp privacy private
Last synced: 10 May 2025
https://github.com/giscience/measures-rest-sparql
A SPARQL endpoint for the Measures REST OSHDB App framework.
data osm quality semantics sparql sparql-endpoints
Last synced: 24 Jun 2025
https://github.com/ayush585/fireducksblog
BLOG: Unlocking AI Efficiency: How FireDucks Revolutionizes Data Preprocessing
Last synced: 28 Apr 2026
https://github.com/uhstray-io/just-dashboards
Light and Easy Rust-Fullstack/WASM application to build dashboards from any data source
analytics data dioxus rust visualization
Last synced: 29 Mar 2025
https://github.com/fbraza/paris_airbnb
Analysis of Paris AirBnB data using R and Shiny
analysis data data-analysis paris-airbnb r shiny
Last synced: 21 Mar 2025
https://github.com/dbrennand/rm-content
A Python 3.7 script to remove a specific string from all files and repos (owned by the user).
content data erase eraser privacy privacy-protection privacy-tools remove remover rm-content
Last synced: 29 Mar 2025
https://github.com/wklee610/de_project
[Data Engineer] Personal Toy Project For Study
Last synced: 31 Mar 2025
https://github.com/whatheheckisthis/pwc_project-
Successfully completed a PwC virtual case, advancing Power BI skills to address cybersecurity and cloud architecture requirements. Developed comprehensive dashboards that effectively communicated key performance indicators (KPIs), showcasing proficiency in data visualization and deliver
case-study data data-science dataanalytics databases datavisualization powerbi virtual
Last synced: 05 Apr 2025
https://github.com/epsoft/dataset-generator
dataset generator
data dataset dataset-generation matplotlib matplotlib-figures tensorflow tensorflow-datasets
Last synced: 18 May 2026
https://github.com/stefanbohacek/dataviz-projects
My dataviz projects.
data data-visualization dataviz
Last synced: 08 Jul 2025
https://github.com/sottey/shon
SHON (Structured Human-Optimized Notation) is a data serialization format designed for readability, schema support, and practical use in modern systems. Version 0.6 introduces advanced types and syntax improvements.
data golang json spec specification
Last synced: 18 May 2026
https://github.com/rifqanzalbina/libraryjs
A Library js
data data-science database datascience javascript javascript-library
Last synced: 17 Jan 2026
https://github.com/definetlynotai/test_generator
A tool to create datasets based on configurations from a csv file, This tool can be used as a skeleton for other software.
algorithim csv data development dynamic exam generator huge nirt powerful python skeleton test tools
Last synced: 21 Jul 2025
https://github.com/openfoodfacts/openfoodfacts-corrector
Ruby script to correct and enhance data on OpenFoodFacts
Last synced: 24 Apr 2026
https://github.com/rrwen/slides-covid19-geosocial-db
Presentation titled "A Real-time Geo-social Media Database for Large-scale Coronavirus Disease 2019 (COVID-19) Research" for my second research seminar at Ryerson University
covid covid-19 covid19 data database disease geo gis index media ncov-2019 ncov19 postgres postgresql presentation research seminar slides social virus
Last synced: 18 May 2026
https://github.com/dr-saad-la/r-distilled
R Programming Language distilled
data data-analysis learning programming-language r rlanguage rprogramming statistical-analysis
Last synced: 18 May 2026
https://github.com/yernaz-togizbayev/microsoft_store_data-analysis
Microsoft Store
data data-analysis data-visualization jupyter-notebook python3
Last synced: 15 May 2026
https://github.com/chompfoods/sdk-typescript-fetch
Fetch TypeScript SDK for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.
api branded chomp data database fetch food grocery ingredients nutrition raw recipe-api recipes sdk typescript
Last synced: 03 May 2026