data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-06-30 00:07:50 UTC
- JSON Representation
https://github.com/ngambip/diabetes_factors_2024
Exploring BMI Categories and Health Factors.
dashboards data datacleaning dax-languague powerbi sql sqlstudio tsql visualization
Last synced: 03 Mar 2026
https://github.com/oliverhennhoefer/shiny-template-interactive-table
Example of interactively adding rows / deleting rows by selecting directly in a data.table (DT) in Shiny
button data delete dt r select selection server shiny shiny-applications shiny-apps shiny-r shinyapps table ui userinterface
Last synced: 16 Apr 2026
https://github.com/freebirdscrew/datastructures_python
Data Structures Implementation in Python and Explains each Steps.
data data-visualization datascience datastructures datastructures-algorithms datastructures-algorithms-python datastructures-implementation datastructuresandalgorithm freebirdscrew programming python simranjeet simranjeetsingh
Last synced: 16 Apr 2026
https://github.com/parimala24-ds/datascientistmlinterviewprep24
DATASCIENTST ML INTERVIEW PREP24
data decisiontree interviewquestions linear-regression logistic machine-learning matplotlib numpy pandas python seaborn sklearn
Last synced: 12 Apr 2025
https://github.com/bdpedigo/neuropull
A (soon to be) lightweight Python package for accessing single-cell connectome networks with metadata.
connectome connectomes connectomics data dataset networks networks-biology
Last synced: 05 Oct 2025
https://github.com/Duartemartins/dados
Resultados de Eleições Portuguesas por Freguesia
data elections open-data portugal
Last synced: 20 Nov 2025
https://github.com/karashiiro/lodestone-id-time
Data scraper, formula and reference implementation for the estimated creation time of a FFXIV character given its Lodestone ID.
data ffxiv ffxiv-character lodestone
Last synced: 30 Jun 2025
https://github.com/azawawi/perl6-msgpack
Perl 6 Interface to libmsgpack
data messagepack msgpack perl6 wrapper
Last synced: 12 Jun 2025
https://github.com/nop-dev/learning-js
Esse repositório contem todas as anotações que fiz enquanto estudava um módulo da trilha Explorer da Rocketseat sobre JavaScript. 🔰
data data-structures functions javascript js
Last synced: 17 Apr 2026
https://github.com/stdlib-js/array-ones
Create an array filled with ones and having a specified length.
array data float32array float64array int16array int32array javascript matrix ndarray node node-js nodejs stdlib structure typed typed-array types uint32array vector
Last synced: 09 Apr 2025
https://github.com/aisurjyasamantaray/sales-perfomance-analysis-dashboard
A comprehensive sales performance analysis dashboard built using Python, and visualization tools. This project includes data cleaning, descriptive statistics, correlation analysis, and insights into sales trends, profitability, and the impact of discounts. Key features include interactive visualizations using Seaborn, and Matplot
analytics annova data data-analysis data-visualization-project dataproject eda hypothesis-testing pandas-dataframe python sales-performance-analysis statistics
Last synced: 04 Apr 2026
https://github.com/mahmoud-saeed-mahmoud/loading_state_handler
The StateHandlerWidget manages different UI states—loading, error, empty, and normal—allowing you to customize the displayed widgets for each state.
dart data error flutter flutter-package flutter-widget loading state
Last synced: 10 Mar 2026
https://github.com/lafayettegabe/g2m-insight-for-cab-investment-firm
📊 Exploratory Data Analysis (EDA) on multiple datasets related to the cab industry in the US, to provide actionable insights and recommendations to a private firm looking to invest in the market. The analysis includes data cleaning, transformation, visualization, and hypothesis testing.
big-data data data-analysis data-science data-visualization eda gotomarket
Last synced: 13 Jun 2025
https://github.com/slashdotted/pomapure
PoorMan's Pipeline
data json modular module pipeline processing
Last synced: 18 Apr 2026
https://github.com/pitmonticone/covid-italy
References for COVID-19 situation in Italy.
coronavirus covid-19 covid-19-italy data data-analysis documentation testing
Last synced: 05 Apr 2026
https://github.com/datafold/vhol-demo
Get hands-on examples of dbt + Datafold CI/CD workflows
data data-engineering datafold dbt diff
Last synced: 28 Dec 2025
https://github.com/jmbhughes/goes_solar_retriever
Tool to retrieve GOES-R Solar Data
data data-retrieval data-science goes-16 goes-satellite goes16 goes17 solar solar-physics
Last synced: 07 Jan 2026
https://github.com/0xdir/relief_web_dart
A Future-based wrapper around the Relief Web API, to retrieve information on humanitarian news, reports, training, jobs, and disasters
api dart data humanitarian jobs
Last synced: 11 Jun 2026
https://github.com/nafisalawalidris/advanced-fraud-detection-with-anomaly-detection
This repository demonstrates how to build a robust fraud detection system that combines supervised learning techniques with anomaly detection models. It provides end-to-end implementation, from data preprocessing and model training to deploying a real-time fraud detection API using FastAPI.
anomaly-detection creditcardfrauddetection data dataanalytics fastapi fraud-detection machinelearning modeldeployment python supervised-machine-learning unsupervised-machine-learning
Last synced: 20 Apr 2026
https://github.com/pommes-public/pommesdata
A full-featured transparent data preparation routine from raw data to POMMES model inputs
data opensource power raw-data transparent
Last synced: 07 Oct 2025
https://github.com/stdlib-js/array-shared-buffer
SharedArrayBuffer.
array arraybuffer buf buffer concurrency data javascript memory node node-js nodejs parallelism shared stdlib structure threading typed typed-array types
Last synced: 25 Apr 2025
https://github.com/mark-summerfield/uxf
Uniform eXchange Format (uxf) is a plain text human readable optionally typed storage format that supports custom types. It may serve as a convenient alternative to csv, ini, json, sqlite, toml, xml, or yaml.
data ini json parser pretty-printer sqlite storage-engine toml xml yaml
Last synced: 08 Oct 2025
https://github.com/rcourivaud/rcourivaud.github.io
Raphaël Courivaud
data database datascience python
Last synced: 21 Apr 2026
https://github.com/mskian/tamil-words
Tamil words Collections with English Meaning - API and SQL Data.
api data javascript json json-api mysql pdo php sql tamil tamil-language tamil-sms tamilwords translate translator
Last synced: 14 Apr 2026
https://github.com/snandasena/disaster-response-pipeline
Disaster Response Pipeline | Data Engineering
data data-engineering-pipeline etl flask machine-learning nlp nlp-pipeline
Last synced: 24 Apr 2026
https://github.com/mohasarc/treeviz
The best tree data-structures visualization tool
data structures visualization visualization-tools
Last synced: 25 Apr 2026
https://github.com/kefniark/kaaya
JS Library for State management and Data synchronization between Applications
data game kaaya mutation network serialization state-management
Last synced: 06 Jun 2026
https://github.com/ciscorn/tinygrib2
(experimental) A tiny toolkit for parsing JMA's GRIB2 files.
data grib grib2 meteorology rust weather
Last synced: 26 Apr 2026
https://github.com/infinitode/pwlds
A public dataset of over 10 million passwords, with assigned strength levels.
ai classes classification cyber-security data dataset ml open-source password passwords synthetic-data
Last synced: 22 Feb 2026
https://github.com/kylekirkby/cardatasnatch
CarDataSnatch allows you to quickly find information about a car in the uk using a valid number plate. Grab an image of the car in question along with a multitude of other data. Compare two cars' data for fast and easy analysis.
beautifulsoup cars command-line-tool data data-analysis data-mining ethical-hacking python python3 requests scraper social-engineering
Last synced: 15 Apr 2025
https://github.com/dantesc03/uberpool-case-study
This project was designed to understand the statistical effects of longer wait times on uber rides. Particularly on the user and driver experience with the Uber Pool System.
analysis data excel jupyter jupyternotebooks learn python seaborn statistics t-tests uber visualization
Last synced: 16 Apr 2026
https://github.com/andrewrporter/my-analytics
Analyzes FireFox browsing history with modern python3 features and libraries
analytics data firefox matplotlib python python3 sqlite3
Last synced: 28 Apr 2026
https://github.com/alexandregazagnes/unilasalle-public-resources
UniLaSalle-Public-Ressources : This public repository contains the notebooks and the data used for both : 2nd Year - Practical Statistical Tests 4th Year - Data Analysis with Python
data data-analysis data-analytics data-cleaning data-storytelling education educational exploratory-data-analysis python python3 r r-programming rstudio statistics visualization
Last synced: 28 Apr 2026
https://github.com/ismet55555/pdw-asym-2link
Clear and easy way of simulating a passive dynamic walker (PDW) model derived and exectured using MATLAB.
data dynamics inverted-pendulum matlab numerical-simulations passive-dynamic-walker passive-dynamics ramp research robotics simulation slope walking-simulator
Last synced: 29 Apr 2026
https://github.com/14richa/patient-readmission-analysis
This project focuses on predictive modeling to foresee hospital readmissions of diabetic patients within 30 days post-discharge. By leveraging a dataset spanning a decade (1999-2008) and covering records from 130 US hospitals, the aim is to enhance healthcare management and patient outcomes.
analytics data jupyter-notebook numpy
Last synced: 29 Apr 2026
https://github.com/bernard-ng/drc-news-corpus
DRC News Corpus : Towards a scalable and efficient system for Congolese news dataset curation
aggregator data news nlp politics
Last synced: 06 Sep 2025
https://github.com/eyedia/idpe
Eyedia's Integrated Data Processing Environment
csharp data designer development development-environment development-tools development-workflow environment ide no-coding parser processing rehosted workflow
Last synced: 11 Oct 2025
https://github.com/anandchowdhary/health
🫀 @AnandChowdhary's body measurements
csv data fitness github-actions health
Last synced: 29 Apr 2026
https://github.com/norton120/dfmock
Python Pandas DataFrame mock generator. You need mock'd data in a dataframe? this is what you need.
data mock pandas pandas-dataframe python python37
Last synced: 19 Jan 2026
https://github.com/sabujxi/python-scraper-and-data-analysts-admin-panel-in-django
A data scraper from texas govt site and a helping web app for managing, reviewing and editing the data
analyst data data-analysis data-entry data-scraper django django-application python python-scraper real-estate regex scraper texas
Last synced: 30 Apr 2026
https://github.com/yakupzengin/data-structures-and-algortihms
This repo contains implementation of data structures and algorithms using JAVA
algorithms algorithms-and-data-structures data structure
Last synced: 03 Dec 2025
https://github.com/automators-com/datamaker-js
The official Node.js / Typescript library for the DataMaker API
data javascript nodejs typescript
Last synced: 11 Oct 2025
https://github.com/lovethebomb/data-tiles
🍜 Data Tiles is a small website that shows data.
data express javascript nextjs typescript
Last synced: 10 Apr 2026
https://github.com/abuzar-alvi/employee-data-to-info-card-generator-with-python
This Python project is made by me, Python project for improving python skills.
card data data-generator employee python
Last synced: 03 Feb 2026
https://github.com/stdlib-js/datasets-cdc-nchs-us-births-1994-2003
US birth data from 1994 to 2003, as provided by the Center for Disease Control and Prevention's National Center for Health Statistics.
america babies births data dataset datasets javascript node node-js nodejs stdlib time-series timeseries united-states us usa
Last synced: 12 Oct 2025
https://github.com/hasnocool/war_thunder_camouflage_scraper
A concurrent web scraper designed to collect camouflage information from war thunder aircrafts.
asyncio camouflage concurrent data execution handling playwright python scraping signal sqlite3 thunder war web
Last synced: 04 Jan 2026
https://github.com/divithraju/divith-raju-searchengine-wikipedia
search engine optimizationA complete search engine experience built on top of 75 GB Wikipedia corpus with subsecond latency for searches. Results contain wiki pages ordered by TF/IDF relevance based on given search word/s. From an optimized code to the K-Way mergesort algorithm, this project addresses latency, indexing, and big data challenges.
algorithms data dataengineering inverted-index linux merge-sort nlp project project-repository python3 serchengine software-engineering ubuntu wikipedia
Last synced: 16 May 2026
https://github.com/banyan-team/banyan-julia-examples
Adventures in massively parallel cloud computing with Banyan Julia!
banyan data data-analytics data-processing data-science julia
Last synced: 02 May 2026
https://github.com/ozanarkancan/sailx
This repo contains the code for generating artificial navigational instruction following data.
data grounded-language-learning
Last synced: 08 Jan 2026
https://github.com/woctezuma/geforce-leak
Fetch data from the Geforce leak.
data datamining egs epic epic-games epic-games-launcher epic-games-store geforce geforce-experience geforce-leak geforce-now geforce-now-leak geforcenow geforcenow-leak graphql leak leaks nvidia steam steam-games
Last synced: 02 May 2026
https://github.com/yanpitangui/iteminfoconverter
Application that converts ragnarok legacy data files to iteminfo.lua
data itemdbconf iteminfo luafiles ragnarok
Last synced: 12 Oct 2025
https://github.com/rastmob/wordpress-llms-output-plugin
A WordPress plugin to export posts, pages, and custom post types as JSON for training Language Models (LLMs).
ai data llm llms training training-data wordpress wordpress-development wordpress-plugin
Last synced: 03 May 2026
https://github.com/nixinova/nzpolls
New Zealand polling data aggregation
data election-data election-polling graphing new-zealand nixinova polling polling-data
Last synced: 09 Apr 2025
https://github.com/danielbayley/schemas
A collection of useful @JSON-schema-org schemas for data validation.
ajv config configuration data data-science data-structures data-validation json json-schema linter linting schema schema-org validation yaml yaml-configuration
Last synced: 13 Oct 2025
https://github.com/luminati-io/Pinterest-dataset-samples
Two sample datasets of over 1000 Pinterest profiles and posts, extracted using the Bright Data API, ideal for market research, influencer marketing, and product development.
data data-extraction data-mining database datasets pinterest pinterest-api structured-data web-scraping
Last synced: 09 Apr 2025
https://github.com/acaciaman/db-autotest
DB Database test automation. This python package allows to create database object structure and load data from database.
Last synced: 05 May 2026
https://github.com/askaniy/celestialocationsmaker
Tool for making Celestia location files
celestia data geology locations mapping planetary-science space
Last synced: 14 Mar 2025
https://github.com/quin1sue/priceguidesph-bettergov
an economic and financial data platform project under bettergov.ph
bettergovph cloudflare data hacktoberfest nextjs priceguides
Last synced: 05 May 2026
https://github.com/mawburn/across-a-thousand-dead-worlds-data
Across a Thousand Dead Worlds Data
Last synced: 21 Apr 2026
https://github.com/iusztinpaul/airbnb-data-analysis
Airbnb data analysis on the biggest cities in The Netherlands following the CRISP-DM methodology.
airbnb data datanalysis datascience machine-learning numpy pandas python
Last synced: 06 May 2026
https://github.com/jinsyin/datalink
⚡ 数据集成 | DataLink is a lightweight data integration framework build on top of DataX, Spark and Flink
batch big-data bigdata cdc data data-collection data-exchange data-integration data-pipeline data-synchronization datalink etl flink flink-cdc framework integration pipeline spark streaming
Last synced: 19 Jul 2025
https://github.com/jesusgraterol/bitcoin-blockchain-dataset-builder
The dataset builder script extracts all the relevant block information from the Bitcoin Blockchain through Mempool.space's public API. The data is stored in a .csv file, facilitating its use in data science and machine learning projects.
bitcoin blockchain blockchain-technology data datascience datascience-machinelearning dataset dataset-generation machine-learning
Last synced: 06 May 2026
https://github.com/lxcoding06/e-gereja
Website CRUD untuk Gereja, untuk mengatur data jemaat, data kematian, data pernikahan dan data baptis
data data-gereja e-gereja gereja gereja-online jemaat kematian pernikahan
Last synced: 15 May 2025
https://github.com/ayemunhossain/firebase-realtime-db-advance-query
Firebase real time database, query with nodejs.
ayemunhossain data firebase firebase-functions firebase-realtime-database nodejs query
Last synced: 06 May 2026
https://github.com/tayeva/eia-client-python
EIA Open Data API Client - Python
data open-source python python-3 python3
Last synced: 14 Oct 2025
https://github.com/mednour2019/devolap
OLAP Cube Dispatcher Tool
analysis-services csharp data excel excel-export kpi mdx metroframework mvvm-architecture sql wpf
Last synced: 27 Jan 2026
https://github.com/doriclaudino/canarinho_nlp
labels, classify, summarization string for canarinho app
chrome-console classification classifier-model data labels nlp nlu python spacy spacy-models spacy-nlp summarization-string
Last synced: 08 May 2026
https://github.com/manifoldfinance/disco-schema
MEV Auction and Ethereum Network Data Schemas
cryo data dataset ethereum ethereum-builders ethereum-mev evm mev-data pandas schema-registry schemas
Last synced: 08 May 2026
https://github.com/dark-art108/yonk
A cli-utility to streamline data science work by creating templates
Last synced: 08 May 2026
https://github.com/rohan-paul/machine-learning-and-deep-learning-tutorial-notebooks
Various Machine Learning and Deep Learning Tutorial Notebooks in Blog Format
data data-analysis data-science deep-learning deep-learning-tutorial deep-neural-networks machine-learning machine-learning-algorithms machinelearning neural-network pytorch pytorch-implementation pytorch-tutorial tensorflow
Last synced: 09 May 2026
https://github.com/yorkulibraries/vendorpol
URLs for vendor privacy policies and terms of use.
Last synced: 15 Oct 2025
https://github.com/j1sk1ss/dateapppc.exmpl
Простое нативное приложение для Windows с демонстрацией ООП и SQL баз данных на примере приложения для знакомств.
data oop-principles parsing pgadmin4 sql wpf
Last synced: 11 Apr 2026
https://github.com/guslovesmath/top_tech_sp_500_forecasting
Forecasting the stock market is difficult. I sought to observe the relationship between Apple's stock price and others in the S&P500. In doing this, I was able to conclude that stocks in the tech industry can help predict a trend in Apple's Percent change.
arima-forecasting arima-model data data-science forecasting vector-autoregression
Last synced: 14 Mar 2025
https://github.com/codecentric/reedelk-bookingintegrationservice
Example service for the blog post series about Reedelk
api api-gateway data integration integration-flow
Last synced: 16 Oct 2025
https://github.com/natylaza89/covid19-il
Python package which brings a "Facade" interface for the client for using official covid 19 data of israeli data gov. ★19K+ Downloads★
api covid covid19 covid19-data data israel pandas python
Last synced: 13 Apr 2026
https://github.com/secret-guest/file_organizer
Files Organizer is a versatile tool for sorting and organizing files efficiently, ideal for managing recovered data.
c c-development data data-recovery file-management file-manager files sorting sorting-algorithms subdirectories subdirectory
Last synced: 10 Jun 2026
https://github.com/leapfrogtechnology/datamegh
Datamegh - Data Engineering for the cloud.
cloud cloud-native data datamegh docker megha python serverless
Last synced: 14 May 2026
https://github.com/ipstack/finder
Define data by IP Address
composer data geo geoip info ip ip-database ip-search ipstack ipstack-finder php search
Last synced: 14 May 2026
https://github.com/agnosticeng/agx
Query and explore local and remote data with Clickhouse
clickhouse d3 data rust svelte
Last synced: 26 Oct 2025
https://github.com/feltex/datahora-java
Aprenda a trabalhar com Data e Hora em Java com as novas classes LocalDateTime, LocalDate, DateTimeFormatter e outras novidades do pacote java.time.
brasil data date dateformat dateformat-brazil datetime hora java java11 localdatetime locale localization zoneddatetime
Last synced: 18 Jun 2026
https://github.com/nrennie/data
A collection of random datasets, either from web-scraping or processing more complex data.
Last synced: 30 May 2026