data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum
- Last updated: 2026-06-27 00:07:33 UTC
https://github.com/stefen-taime/myubereats_datapipeline
Building a Modern Uber Eats Data Pipeline
airflow api data datawarehouse mongodb pipeline powerbi snowflake
Last synced: 22 Apr 2026
https://github.com/richardwarepam16/hotel_analysis_using_python
Unlocking Insights: Analyzing Hotel Reservation Data to Boost Business Performance
data data-analysis data-visualization hotel-booking hotel-cancellation-solution hotel-management-system jupyter-notebook python python3
Last synced: 22 Aug 2025
https://github.com/ofelipelucca/cdc-kafka-debezium-pipeline
A real-time event-driven social network API built with CDC (Change Data Capture), Kafka, Debezium, PostgreSQL and MongoDB implementing CQRS-style architecture with streaming data pipelines.
cdc data data-engineering data-integration data-pipeline debezium event-driven fastapi kafka kafka-connect microservices mongodb postgresql python sqlalchemy
Last synced: 05 Jun 2026
https://github.com/howtoquitvivek/ai-crop-yeild-prediction
AI-driven crop yield prediction and agricultural optimization system (SIH 2025)
2025 2026 ai crop-yeild data minor-project ml predcition python science sih
Last synced: 23 Apr 2026
https://github.com/mierune/tinygrib2
(experimental) A tiny toolkit for parsing JMA's GRIB2 files.
data grib grib2 meteorology rust weather
Last synced: 27 Jun 2025
https://github.com/vulcalien/vulcdataformat
Simple data storage system for Java.
data data-storage java serialization
Last synced: 25 Feb 2025
https://github.com/vvipjain/ev-data-analysis
EV Data Analysis
data data-analysis data-visualisation tableau tableau-public
Last synced: 16 Feb 2026
https://github.com/snimmagadda1/stack-exchange-dump-to-mysql
Batch pipeline to import Stack Exchange XML data dumps to relational DB
batch data mysql spring-batch stackoverflow
Last synced: 30 Mar 2025
https://github.com/oguzgn/a-case-study-for-a-livestreaming-platform
This project aims to analyze livestream watch times of users across different regions. The goal is to identify the top 5 users with the highest watch time for each region. The analysis involves multiple SQL transformations to extract meaningful insights from the data.
bigquery data data-analysis data-modeling live-streaming sql
Last synced: 23 Jun 2025
https://github.com/melinteflxrin/softserve-bigdata-project
End-to-end data warehousing project integrating APIs, ETL workflows, and PostgreSQL for analytics and reporting.
analytics api bigdata data datawarehousing externalapi pipeline postgres postgresql python warehouse
Last synced: 26 Jan 2026
https://github.com/sebastianbrzustowicz/collision-detection-ai
Python + TensorFlow. Repository for training a machine learning model for collision detection from accelerometer sensor data.
accelerometer accelerometer-data ai artificial-intelligence data dataset imu learning machine-learning microprocessor ml model quadcopter script sensor tensorflow
Last synced: 24 Apr 2026
https://github.com/yord/klp-core
A plugin with basic operations for klp (Kelpie), the small, fast, and magical command-line data processor.
csv data deserializer dsv json kelpie klp marshaller parser serializer ssv tsv
Last synced: 24 Apr 2026
https://github.com/fairspec/fairspec-standard
Fairspec is a data exchange format compatible with DataCite for metadata and JSON Schema for structured data
ckan csv data dataset excel fair fairspec json ods polars python quality schema sqlite table typescript validation zenodo
Last synced: 16 Jun 2026
https://github.com/rubenhortas/python_examples
Examples of Python code and DSA (data structures and algorithms).
algorithm algorithms data dsa examples python python-3 python3 samples snippets structures
Last synced: 03 Oct 2025
https://github.com/stefanbohacek/exploring-the-mapping-police-violence-dataset
Using my Gutenberg Data Visualization plugin to explore police violence against civilians.
data dataviz police police-brutality police-misconduct
Last synced: 03 Dec 2025
https://github.com/mohamedmaher-dev/mena
Middle East and North Africa country data utilities for Dart/Flutter.
api arabic-localization capitals country-codes country-data country-flags currencies dart data flutter internationalization localization mena mena-countries mena-region middle-east north-africa offline-data package
Last synced: 21 Feb 2026
https://github.com/nouman6093/advanced-statistical-models
In this repository I will upload everything I have learned about advanced statistical models in data science. There are over 42 statistical models, each of which works on algorithms, and there are over 32 algorithms. Each library has its own way of writing such statistical models. After learning, I will try to upload as many statistical models as possible
data data-analysis data-science data-visualization
Last synced: 11 Jun 2026
https://github.com/labwhatever/leetcode
Collection of LeetCode questions to ace the coding interview!
data data-structures-and-algorithms dsa leetcode-cpp leetcode-solutions structure structure-learning
Last synced: 22 Aug 2025
https://github.com/aadityatamrakar/futures_spread_chart
Cash Market & Futures Daily Spread Chart - NSE Stocks
data data-analysis data-mining expressjs nodejs requests
Last synced: 10 Apr 2026
https://github.com/fiddlydigital/fastmap
A simple 2D map that is optimized for speed.
Last synced: 23 Oct 2025
https://github.com/thiagopanini/datadelivery
An open-source Terraform module that provides a complete infrastructure toolkit so that users can begin exploring Analytics services on AWS.
analytics athena aws catalog crawler data datamesh glue s3 terraform
Last synced: 29 Nov 2025
https://github.com/dwidevelopes/database-input-pelanggran-mahasiswa
Input data on students who have committed violations, ready to be recorded and disciplined, and who are also subject to sanctions.
aplikasi aplikasi-sekolah data data-analysis database input-method mahasiswa sekolah siswa siswi website
Last synced: 02 May 2026
https://github.com/zalweny26/open_data_unipa
Project for the Algorithms Laboratory (Laboratorio di Algoritmi) exam, 23-24, UniPa, Computer Science (Informatica) L-31
Last synced: 26 Apr 2026
https://github.com/vincentlaucsb/csv-data
A curated repository of real and fake CSV data for use in testing suites
Last synced: 08 Mar 2026
https://github.com/stdlib-js/array-base-fancy-slice-assign
Assign element values from a broadcasted input array to corresponding elements in an output array.
array assign assignment copy data fancy generic javascript node node-js nodejs shallow slice stdlib structure subseq subsequence types
Last synced: 06 Oct 2025
https://github.com/aidenellis/connectmp
🍰 ConnectMP - An easy way to share data between Processes in Python.
aidenellis connectmp data data-sharing multiprocessing process sharing
Last synced: 27 Apr 2026
https://github.com/sourceduty/data_hardware
🖥️ Comparing various hardware configurations needed for different data sizes, from personal laptops to mainframes.
calculation computer-hardware computer-science computers data data-calculation data-hardware data-processing data-project hardware hardware-configuration hardware-requirements hardware-science math process-programming programming python
Last synced: 08 Aug 2025
https://github.com/tylerben/data-spring
Easily generate a dummy dataset based on a provided config
data data-spring datagenerator fake-data generator javascript typescript
Last synced: 27 May 2026
https://github.com/jorgeatgu/apaga-luz
💡 How much does electricity cost? 💶
data data-visualization flat-data
Last synced: 04 Feb 2026
https://github.com/nightroman/farnet.fsharp.data
FSharp.Data package for FarNet.FSharpFar
Last synced: 27 Apr 2026
https://github.com/carlotta94c/sql4datascientistsdemo
Demo material for Microsoft Reactor session "Getting Started with Databases: SQL and Data Visualizations"
analysis data r sqlite tidyverse visualisation
Last synced: 18 Apr 2026
https://github.com/miroslav-reiter/kurz_jazyk_sql_analytici_datovi_vedci
Materials for the course SQL Language 1 for Analysts and Data Scientists
analysis analytics data data-analysis data-science database mysql reiter sql
Last synced: 08 May 2026
https://github.com/petrosdemetrakopoulos/ethairballoons.py
A strictly typed ORM library for Ethereum blockchain.
blockchain dao dapp data database ethereum ethereum-blockchain library orm python smart-contracts web3
Last synced: 11 May 2026
https://github.com/saulojoab/crato-ce-json
In this repository I will store all the neighborhoods (and more information, in the future) of Crato-CE in JSON.
data database geolocation json json-api localization
Last synced: 28 Apr 2026
https://github.com/ahmetcansolak/developer-insights
New project of ClubRockers from Sarıyer Hills
bitbucket data data-science data-visualization github python3
Last synced: 28 Apr 2026
https://github.com/helins/ex.clj
Java exceptions as Clojure data
clojure data exception java java-exceptions
Last synced: 12 Dec 2025
https://github.com/tether/tether-schema
Custom protocol buffer schema for data validation
data protocol schema validation
Last synced: 09 Apr 2025
https://github.com/public-health-scotland/covid-19-publication-dashboard
Dashboard for weekly COVID-19 publication
coronavirus covid covid-19 covid-testing covid19-data dashboard data hospital-admissions lfd nhs public-health scotland shiny
Last synced: 24 Oct 2025
https://github.com/gher-uliege/bluecloud-plankton
Spatial interpolation of plankton data using a neural network
data data-analysis data-visualization neural-network oceanography
Last synced: 30 Mar 2025
https://github.com/suryavamsi-p/conflict-nlp-topic-modeling-sentiment-analysis-using-llms
Extracts insights from 26K+ protest events using BERTopic, Top2Vec, and LLMs for real-world applications like crisis monitoring, policy research, and social unrest analysis.
all-mpnet-base-v2 bertopic conflict-data data data-science lda llama2 llms machine-learning mistral-7b nlp nltk protest-analysis pyldavis python3 top2vec topic-modeling transformers visualization
Last synced: 11 May 2026
https://github.com/horisystems/uk_ev_data_analysis
Analysis of Electric Vehicle charging infrastructure in the United Kingdom.
data data-science electric-vehicles ev python uk united-kingdom
Last synced: 12 Jan 2026
https://github.com/rambodrahmani/covid19-behind-the-numbers
COVID-19: Behind the Numbers.
apriori-algorithm apriori-algorithm-python clustering clustering-algorithm clustering-analysis covid covid-19 covid19-data data data-mining data-science datamining fpgrowth machine-learning machine-learning-algorithms python python-machine-learning
Last synced: 20 Aug 2025
https://github.com/simranjeet97/leetcode_practice
Practicing LeetCode problems for competitive programming
algorithms amazon coding competitive-programming data data-structures facebook google leetcode python
Last synced: 03 Aug 2025
https://github.com/jackokring/www
Generic www Flask server with phinka module
compression data flask phinka python
Last synced: 16 Jan 2026
https://github.com/bilalmehrban/data-log-monitor
A simple yet elegant desktop C# application based on a 3-tier architecture, designed for viewing logs stored in the database by NLog or other logging frameworks.
csharp data desktop-app logging
Last synced: 14 Mar 2025
https://github.com/gusenov/qazaqstan-geography-data
:world_map: Geographic data of Kazakhstan.
data geographic-data geography json kazakhstan qazaqstan regions
Last synced: 20 Feb 2026
https://github.com/iguptashubham/walmart-eda
Imagine diving into the fascinating world of Walmart with just a few lines of code! This project lets you do that using MySQL, a powerful tool for data analysts. You can clean up messy data like a detective, uncovering hidden patterns and trends. Data scientists can take it further.
analysis data dataset eda mysql portfolio-project python sql
Last synced: 10 Apr 2026
https://github.com/alja7dali/swift-bits
A bite sized library for dealing with bytes.
binary bit bits byte bytes comprehension data manipulation swift
Last synced: 09 Jun 2026
https://github.com/doziestar/datavinci
DataVinci enables you to visualize data from various sources, generate insights, analyze data with AI models, and receive real-time updates on anomalies
Last synced: 23 Jan 2026
https://github.com/aidanjuma/ankideckextractor
A CLI tool written in Python that extracts Anki flashcard decks (.apkg) into separate JSON notes and media files. Perfect for developers building custom learning applications or repurposing Anki content programmatically.
anki apkg cli data decompression extraction flashcards learning python zip
Last synced: 29 Apr 2026
https://github.com/capire/xtravels-java
Travel booking app using master data from xflights built with CAP Java
cap cds data federation flights java reuse
Last synced: 23 Jan 2026
https://github.com/cmda-tt/course-24-25
🎓 tech track · 2024-2025 · curriculum and syllabus 📊
d3 data datavis datavisualization es6 functional javascript programming svelte
Last synced: 28 Jan 2026
https://github.com/gbv/cocoda-mappings
concordances, mappings and conversion scripts to create JSKOS mappings
Last synced: 28 Oct 2025
https://github.com/imahdimir/githubdata
A very simple Python package to easily download from and manage a GitHub "Data Repository"
data data-repository python-package
Last synced: 23 Jan 2026
https://github.com/geo-y20/coursera-managment-system
ML and Data Science-based recommendation system
course coursera data data-science data-visualization datacleaning machine-learning mean-square-error recommendation-system
Last synced: 19 Jun 2026
https://github.com/hyperversal-blocks/averveil
Averveil is OpenSea for Data.
blockchain data golang iot privacy zero-knowledge zkp
Last synced: 14 Jan 2026
https://github.com/marcelo-earth/h5n8-data
🔢🦠 Confirmed cases of H5N8 in humans - Feel free to open Pull Requests with new data.
csv data h5n8 h5n8-cases h5n8-virus russia
Last synced: 19 Jan 2026
https://github.com/wu-rymd/pyobjectify
Bridging the gap across different file formats and streamlining the process of accessing ingested data via Python objects
Last synced: 08 Jun 2026
https://github.com/freebirdscrew/datascience_crash_course
A data science crash course that explains each and every process in data science.
dash data datascience datascience-crash-course datascience-machinelearning datascientist datasets freebirdscrew matplotlib numpy numpy-library pandas plotly plotly-python python python3 simranjeet simranjeetsingh
Last synced: 29 Apr 2026
https://github.com/ibz-04/data-encryption
Encrypting and decrypting hospital patient data such as audio and image files
Last synced: 23 Jul 2025
https://github.com/devsujay19/knowledgebase
My knowledge base built with NextJS 14, Tailwind CSS 3 and Aceternity UI.
data knowledge-base nextjs nextjs-typescript nextjs14 react server-side-rendering tailwindcss vercel
Last synced: 10 Apr 2026
https://github.com/rnabla/cuda-des
Bruteforcing DES using CUDA
bruteforce cuda data des encryption gpu parallel standard
Last synced: 27 Oct 2025
https://github.com/stdlib-js/datasets-herndon-venus-semidiameters
Fifteen observations of the vertical semidiameter of Venus, made by Lieutenant Herndon, with the meridian circle at Washington, in the year 1846.
astronomy data dataset datasets grubbs herndon javascript node node-js nodejs outlier outliers sample statistics stats stdlib venus
Last synced: 09 Oct 2025
https://github.com/alrza2003/alrza2003.github.io
This repository contains the source files for my personal portfolio website. It highlights my background as a data analyst and radiology student, and showcases real-world projects, tools I use, and ways to connect with me. The site is based on a pre-built template that I customized to reflect my profile and experience.
data data-analysis data-visualization portfolio portfolio-website python
Last synced: 30 Apr 2026
https://github.com/vincentneo/sgtidetimings
Scraped SG NEA tide timings table into machine-readable JSON files!
data github-actions github-pages gov html-tables-to-json javascript json nodejs sg singapore singapore-data-analysis tide webscraping
Last synced: 10 Apr 2026
https://github.com/danish-foundation-models/dfm-processing
Toolkit for processing data in the Danish Foundation Models project.
Last synced: 02 Jul 2025
https://github.com/shubham14p3/python-word-cloud
Simple Python application to create a word cloud.
data data-analysis data-science data-visualization nbextension python-3 upload-file
Last synced: 01 May 2026
https://github.com/cintia0528/data_cleaning_and_analytics-python
Evaluate if aggressive discounting benefits Eniac long-term, considering differing views on customer acquisition and brand positioning. Focus on data cleaning for informed decision-making.
colab-notebook data data-analysis datacleaning dataquality jupyter-notebook matplotlib pandas python seaborn
Last synced: 08 Jan 2026
https://github.com/osiota10/alx-low_level_programming
C Low Level Programming - Data Structures, Linux/Unix System Programming and Algorithms with ALX Software Engineering
algorithms assembly c data data-structures linux shell unix
Last synced: 25 Jun 2025
https://github.com/adrian-pasek-prv/data-modeling-with-cassandra
Create a data model in Apache Cassandra for a music streaming app
apache-cassandra data data-engineering data-modeling python
Last synced: 02 Jan 2026
https://github.com/timxor/bitcoind-data-ingestion
crypto payments bitcoind data ingestion
Last synced: 27 Oct 2025
https://github.com/leomsgit/extrator-de-parametros-analise-hemograma-e-bioquimico
Python software that scans PDF files and extracts parameters directly into an Excel file
analysis data excel excel-export google-colab hemogram jupyter-notebook pdf pdf-document-processor pdf-viewer python python3
Last synced: 01 May 2026
https://github.com/chandraprakash-bathula/keywords_prediction-machine-learning-integration
Keywords prediction model, built by: data cleaning, removing stopwords, constructing Word2vec, and advancing to TF-IDF weighted Word2vec.
algori artifici data machine-learning tf-idf weighted-word2vec word2vec
Last synced: 08 Nov 2025
https://github.com/n4ze3m/timezone-json
JSON file with the timezones of more than 1642 cities in UTC format.
Last synced: 19 Jul 2025
https://github.com/cdcgov/importsurvey
Import survey: Import data into R, with an application to the National Center for Health Statistics (NCHS)
data import r sas survey survey-data
Last synced: 19 Jun 2026
https://github.com/joocer/data_expectations
Are your data meeting your expectations?
data data-engineering data-quality data-science data-unit-tests observability pipelines quality validation
Last synced: 07 Oct 2025
https://github.com/codenoid/webtoons.com-database
A Webtoons.com database, collected by Hofesh Bot (scraper)
Last synced: 28 Mar 2025
https://github.com/henrylin03/china-gdp
Analysis and visualisation of China GDP data using Python.
data data-analysis data-visualisation dataset kaggle pandas
Last synced: 01 May 2026
https://github.com/milandjurdjevic/discriminalizer
.NET library designed for seamless JSON deserialization of objects with complex discrimination requirements, built on top of System.Text.Json.
data deserialization dotnet json
Last synced: 15 Apr 2025
https://github.com/maccccd/wsoa3029a_2444372
This website serves as an extension of my portfolio work. It focuses specifically on showcasing my understanding of D3.js, a JavaScript library used to create interactive data visualizations. The visualizations here provide insights into two types of cybersecurity attacks: phishing and ransomware.
d3js data hacking visualization
Last synced: 24 Jan 2026
https://github.com/edugmenes/azure-data-engineering
This repository contains my first end-to-end Data Engineering project, built using Microsoft Azure Cloud and Azure Databricks with PySpark.
azure cloud data data-engineering data-lakehouse data-structures databricks delta-lake etl-pipelines lakehouse lakehouse-architectures medallion-architecture microsoft-azure pyspark spark
Last synced: 29 Jan 2026
https://github.com/liuliqiang/laueagle
YAML/JSON Lints and Converters
converter data formater json linter python serialization yaml
Last synced: 02 May 2026
https://github.com/jrcichra/ingestd
HTTP server that easily ingests data into a database
data gin hacktoberfest ingest ingestion restful-api
Last synced: 28 Apr 2026
https://github.com/arif-miad/titanic-analysis
artificial-intelligence data data-science deep-neural-networks
Last synced: 09 Jun 2026
https://github.com/stdlib-js/ndarray-base-to-reversed
Return a new ndarray where the order of elements of an input ndarray is reversed along each dimension.
base data flip javascript matrix ndarray node node-js nodejs reverse slice stdlib structure to-reversed types vector view
Last synced: 12 Apr 2026
https://github.com/zoekelepiri/ota_observatory
A front-end web application that provides detailed information about the boundaries and statistical data of the regions and prefectures of Greece.
backend data database spring-boot
Last synced: 06 Feb 2026
https://github.com/nafisalawalidris/sales-performance-dashboard
Sales Performance Dashboard: Analyze and visualize sales data using Power BI. Gain insights into trends, customer segments, product performance, and geographic distribution. Make data-driven decisions to optimize sales strategies and maximize revenue.
analytics-revenue dashboard-power-bi data data-analysis intelligence-sales optimization performance sales visualization-business
Last synced: 03 Feb 2026
https://github.com/tushar2704/insurance-cross-sell
This project harnesses the power of cutting-edge technologies including H2O AutoML, MLflow, FastAPI, and Streamlit to enhance cross-selling campaigns and boost efficiency.
data datascience h20automl machine-learning mlflow python streamlit-tushar2704
Last synced: 08 Oct 2025
https://github.com/azeemmirza/structures
Structures Applied
data data-structures javascript typescript
Last synced: 14 Feb 2026