data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-07-03 00:07:49 UTC
- JSON Representation
https://github.com/nfaltir/dataxplorer
🔬 A Streamlit app that performs various data exploration operations on an uploaded dataset instantly.
data data-science python streamlit
Last synced: 05 May 2026
https://github.com/dkosarevsky/db_cp
DB course project
data database db postgres postgresql postgresql-database postgressql
Last synced: 05 May 2026
https://github.com/hyperversal-blocks/averveil
Averveil is OpenSea for Data.
blockchain data golang iot privacy zero-knowledge zkp
Last synced: 14 Jan 2026
https://github.com/6km/islamic-data-repository
مستودع البيانات الإسلامية - قائمة بالموارد التي قد تفيد المبرمجين في تطوير التطبيقات ومواقع الويب.
data fonts hadeeth json quran quran-json
Last synced: 06 May 2026
https://github.com/aymane-maghouti/mobile-data-hive-insights
This project demonstrates the process of extracting data from a MySQL database, transferring it using Apache Sqoop, storing it in Hive Data warehouse (the data actually is store in Hadoop Distributed File System (HDFS)), and performing analysis using Hive Query Language (Hive QL) (it is a language close to SQL). Then visualize the data in Power BI,
apache-sqoop data data-integration data-visualization hadoop-hdfs hivedb hiveql powerbi
Last synced: 09 Mar 2026
https://github.com/yukti-09/extracting-data-from-twitter
Data From Twitter!
data data-mining extracting-data timeline tweepy tweets twitter
Last synced: 11 Oct 2025
https://github.com/yash22222/sync-intern-s-ml-tasks
SYNC INTERN'S Machine Learning internship will offer you to enhance your skills by doing real-life example projects. This internship will increase your knowledge in the field of data and algorithms to understand how a machine learns.
bhpp boston-house-datasets boston-house-price-prediction boston-house-pricing data data-structures machine-learning machine-learning-algorithms numpy pandas sync-intern sync-interns
Last synced: 07 May 2026
https://github.com/andyduke/data_processor_cli
Flexible command-line data processor.
command-line command-line-tool converter csv data data-structures json template toml xml yaml
Last synced: 08 May 2026
https://github.com/themuhd/world-cup-analysis
Analysis of The FIFA World cup from its inception to the recently completed tournament in 2023
data data-science data-visualization dataanalysis matplotlib matplotlib-pyplot notebook python
Last synced: 08 May 2026
https://github.com/devathul-88/random-fakedata.js
A package to generate random data
data data-generator fake fake-data fake-data-generator javascipt javascript nodejs npm-package package
Last synced: 09 May 2026
https://github.com/stefanbohacek/exploring-the-mapping-police-violence-dataset
Using my Gutenberg Data Visualization plugin to explore police violence against civilians.
data dataviz police police-brutality police-misconduct
Last synced: 03 Dec 2025
https://github.com/keanteng/nextjs-directory
🌐A Draft Website For Data Catalogue Using NextJs
catalogue climate-change css data directory html javascript nextjs website
Last synced: 09 May 2026
https://github.com/sefakcmn00/tensorflow_machine_learning_simple-
Artificial Neural Network(ANN) Perceptron
data mathplotlib pandas pandas-dataframe pandas-python sklearn tensorflow-examples tensorflow2
Last synced: 06 Feb 2026
https://github.com/cosmos-loops/cosmos-dapper
Cosmos.Dapper is a part of Cosmos.Data, a inline project of COSMOS LOOPS PROGRAMME. This repository provides a package of StackExchange.Dapper to improve development efficiency.
dapper data mysql mysqlconnector oracle postgresql sql-query sqlite sqlkata sqlserver
Last synced: 11 Apr 2026
https://github.com/labwhatever/leetcode
Collection of LeetCode questions to ace the coding interview!
data data-structures-and-algorithms dsa leetcode-cpp leetcode-solutions structure structure-learning
Last synced: 22 Aug 2025
https://github.com/bijx/firestore-data-fetcher
A simple Python script to fetch documents from a Firebase Firestore collection and save them to a local `.json` file.
automation data database downloader exporter fetcher firebase firestore open-source script
Last synced: 12 Apr 2026
https://github.com/jorgeatgu/apaga-luz
💡 ¿Cuánto cuesta la luz? 💶
data data-visualization flat-data
Last synced: 04 Feb 2026
https://github.com/carlotta94c/sql4datascientistsdemo
Demo material for Microsoft Reactor session "Getting Started with Databases: SQL and Data Visualizations"
analysis data r sqlite tidyverse visualisation
Last synced: 18 Apr 2026
https://github.com/dbriane208/omdena-apprenticeship-project
This is part of my contribution to the Omdena apprenticeship program .
data data-science feature-engineering machine-learning
Last synced: 14 Mar 2026
https://github.com/pharo-ai/data-imputers
This project contains transformers for missing value imputation
ai data data-science imputer pharo pharo-smalltalk smalltalk
Last synced: 18 Jan 2026
https://github.com/vvipjain/bike-sales-dashboard
Bike Sales Dashboard
dashboards data data-analysis data-cleaning data-normalisation data-visualization excel pivot-chart pivot-tables
Last synced: 04 Feb 2026
https://github.com/sandk21/etude_eau_potable_monde
Etude sur l'accès à l'eau dans le monde - Tableaux de bord avec Tableau
analysis data tableau tableau-public visualization
Last synced: 19 Mar 2026
https://github.com/gusenov/qazaqstan-geography-data
:world_map: Географические данные Казахстана.
data geographic-data geography json kazakhstan qazaqstan regions
Last synced: 20 Feb 2026
https://github.com/so-cool/uobrain
My solution to the University of Bristol PURE Data Challenge
Last synced: 09 Sep 2025
https://github.com/dev-owdenmag/dataflow-manager
A dynamic and versatile web application for managing, collecting, and presenting data with an integrated printing feature.
data data-management data-management-platform data-visualization python
Last synced: 30 Mar 2025
https://github.com/kirkalyn13/portfolio-dashboard-site
Portfolio Site; Initially a Service Provider Metrics Dashboard using React.
dashboard data data-visualization react
Last synced: 15 Apr 2026
https://github.com/ginga1402/travego_travellers
MySQL Mini Project
college-project data mysql-database
Last synced: 27 Jul 2025
https://github.com/scienxlab/datasets
Some small datasets for demos, courses, testing, etc.
data open-data sample-data teaching-resources
Last synced: 09 Oct 2025
https://github.com/e-panourgia/data-science-projects
Data Science Projects
annotations augmentation data data-preprocessing-and-cleaning hyperparameter-tuning llm logistic-regression nlp random-forest-classifier xboost-classifier
Last synced: 09 Apr 2025
https://github.com/waylonwalker/exceltocsv
A usefull tool to convert excel spreadsheets to csv files without launching excel
csv-converter csv-files data excel python spreadsheet
Last synced: 05 May 2025
https://github.com/vincentneo/sgtidetimings
Scraped SG NEA tide timings table into machine-readable JSON files!
data github-actions github-pages gov html-tables-to-json javascript json nodejs sg singapore singapore-data-analysis tide webscraping
Last synced: 10 Apr 2026
https://github.com/openearth/rws-viewer
This viewer is created by Deltares in cooperation with Voorhoede under OpenEarth GPL License. The viewer can be used via several RWS websites, please visit https://www.informatiehuismarien.nl/, https://waterinfo-extra.rws.nl/ and https://basismonitoringwadden.waddenzee.nl/.
data mapbox-gl-js ogc-services viewer
Last synced: 01 Feb 2026
https://github.com/gsmith257-cyber/bit3434cve
BI T3434 Project on data mining CVEs and Exploits
cve data data-mining exploits research-project
Last synced: 17 Jun 2026
https://github.com/tonykipkemboi/ens_subgraph_data
Query On-Chain Data from Subgraphs by The Graph Protocol using Python
data subgraphs thegraphprotocol web3
Last synced: 17 Sep 2025
https://github.com/ntia/compound_radar_waveforms-data
Data used by NTIA/ITS TR-23-566 Examining the Effects of Resolution Bandwidth when Measuring Compound Radar Waveforms.
bandwidth data measurement p0n q3n radar resolution stepped waveform
Last synced: 27 Jan 2026
https://github.com/devsujay19/knowledgebase
My knowledge base built with NextJS 14, Tailwind CSS 3 and Aceternity UI.
data knowledge-base nextjs nextjs-typescript nextjs14 react server-side-rendering tailwindcss vercel
Last synced: 10 Apr 2026
https://github.com/giladbarnea/to
A simple CLI tool to convert and diff between JSON, YAML, TOML, JSON5 and Python collections.
conversion data data-conversion json json5 parser script terminal toml yaml
Last synced: 08 Feb 2026
https://github.com/spine-tools/metreload
Python application for downloading meteorological reanalysis data
Last synced: 01 Jul 2025
https://github.com/awesomelistsio/awesome-open-data
A curated list of high-quality open data resources, tools, platforms, and projects across domains.
awesome awesome-list awesome-lists data open open-data
Last synced: 29 Jun 2025
https://github.com/jayantur13/kountry
Node module variant of the Country API
api data jsdelivr kountry nodejs npm npm-module npm-package unpkg yarn
Last synced: 26 Jan 2026
https://github.com/ndohvich/ndohvich
Je suis un grand fan de l'analyse des données avev PYTHON
anaconda arduino data github jypyter keras machine-learning machine-learning-algorithms numpy pandas python scikit-learn sql tensorflow visual-studio-code visualization-dashboard
Last synced: 11 Apr 2026
https://github.com/pharo-ai/data-preprocessing
Project including data pre-processing algo. We aim to include scaling, centering, normalization, binarization methods.
data pharo pharo-smalltalk preprocessing smalltalk
Last synced: 09 Feb 2026
https://github.com/fredhutch/gdscnsoilsites
Homepage for BioDIGS Project. Learn about the project and download data.
biodigs data metagenomics student-research
Last synced: 25 Mar 2025
https://github.com/cqllum/schema2dwh
⚡ Automatically produce a data model on your database using its information schema using GenAI.
ai data data-structures dataengineering datawarehousing dwh gemini gemini-api genai reporting reporting-tool schema-design
Last synced: 13 Mar 2025
https://github.com/gappeah/global-shipping-analytics-dashboard
This Tableau project provides a comprehensive visual analysis of global sales, shipping costs, and quality metrics across different regions and countries.
data data-analysis data-analyst data-visualization metrics tableau
Last synced: 25 Feb 2025
https://github.com/famarks/grafarg
Grafarg is an interactive data analytics and graphical data visualization application. Grafarg being a progressive fork of Grafana 7.5.17 continues to be available under open source Apache 2.0 License
analytics charts data data-analysis data-science data-visualization grafana grafarg graph
Last synced: 19 Jan 2026
https://github.com/cdcgov/importsurvey
Import survey: Import data into R, with an application to the National Center for Health Statistics (NCHS)
data import r sas survey survey-data
Last synced: 19 Jun 2026
https://github.com/elhariri78/case-study-a-better-smoker-detector
Case Study-A better Smoker Detector
data dataframe evaluation kaggle matplotlib-pyplot numpy pandas pandas-dataframe pandas-python python3 seaborn sklearn
Last synced: 07 Apr 2026
https://github.com/ewertondrigues02/engenharia-de-dados
Varios Projetos de Engenharia de Dados usando principais ferramentas como: Airflow, Snowflake, dbt, Postrgres, Looker Studio, Power BI
airflow analise-exploratoria analytics aws-ec2 dados data dbt-cloud engenharia-de-dados looker-studio postgres pyspark python3 snowflake spark
Last synced: 16 Apr 2026
https://github.com/lmuffato/project-mongodb-dataflights-trybe
Projeto MongoDB Dataflights - Projeto avaliativo da Trybe do Bloco 23: Introdução ao MongoDB
back-end crud data database filter mongo mongodb query trybe-projects
Last synced: 16 Apr 2026
https://github.com/gematik/app-fhir-snapshots-package-generator
The repository contains a library and a console application to generate snapshots for StructureDefinitions in FHIR-packages.
Last synced: 05 Oct 2025
https://github.com/mohnoor94/datasciencefundementalsusingpython
My journey to learn Data Science with Python
data data-analysis data-science data-visualization learning learning-by-doing python python3
Last synced: 19 Jun 2026
https://github.com/davorg/dmp
Data Munging with Perl
book data hacktoberfest munging perl
Last synced: 21 Jan 2026
https://github.com/arif-miad/heart-attack-risk-prediction
This dataset explores key factors influencing heart attack risk, such as age, cholesterol, blood pressure, and lifestyle habits. Using machine learning models.
classification data data-science matplotlib ml pandas-python seaborn visualization
Last synced: 18 Aug 2025
https://github.com/sakan811/tekken-8-jun-kazama-data-analysis-showcase
Showcase visualizations about Jun Kazama from Tekken 8
data data-analysis data-visualization game games powerbi tekken tekken-8 video-game visualization web-scraping
Last synced: 28 Feb 2026
https://github.com/dannyben/datamix
DSL for manipulating tabular data
csv data data-analysis data-engineering gem ruby tabular-data
Last synced: 31 Jul 2025
https://github.com/sakshisrivastava-2601/credit-card-fraud-detection
Credit Card Fraud Detection Project Using Machine Learning. This project focuses on leveraging advanced Machine learning techniques to identify fraudulent transactions with high accuracy.
advanced-machine data machine-learning numpy project-repository python pytorch random-forest
Last synced: 16 Apr 2026
https://github.com/garcane/london-housing-price-dashboard
This Excel-based Housing Visual Dashboard provides a comprehensive view of average house prices across various boroughs in London from 1996 to 2013. The dashboard is designed to offer insights into housing market trends and price variations across different areas of London over time.
data data-analysis data-visualization excel visual
Last synced: 13 Feb 2026
https://github.com/visenger/prada
Profiling Datasets
cleaning data dataset profiling
Last synced: 24 Aug 2025
https://github.com/frictionlessdata/extensiondp
Extension DP (Data Package Extension Template) is a Git repository template for rapid Data Package extension development
data datapackage exchange extension format
Last synced: 13 Feb 2026
https://github.com/mouneshgouda/learn_dsa
This repository explores fundamental data structures and their implementations. Learn how to organize and manipulate data efficiently for various programming tasks. (Feel free to add your specific focus areas here, e.g., algorithms, interview prep)
c data queue sorting-algorithms stack structured-data
Last synced: 30 Jul 2025
https://github.com/rformassspectrometry/msdatahub
Mass Spectrometry Data on ExperimentHub
bioconductor data mass-spectrometry metabolomics proteomics r r-package
Last synced: 14 Apr 2025
https://github.com/saroshfarhan/kaggle-playground-s4e12
Kaggle competition first attempt
analytics data data-analysis-python data-science
Last synced: 12 Oct 2025
https://github.com/stdlib-js/array-base-assert-is-complex-floating-point-data-type
Test if an input value is a supported array complex-valued floating-point data type.
array assert base check data dtype is javascript node node-js nodejs stdlib test types util utilities utility utils valid validate
Last synced: 14 Feb 2026
https://github.com/freddy03h/immutable-data-structure
Normalize and Merge your application's data store using Immutable.JS objects
Last synced: 05 Oct 2025
https://github.com/mickfrog/uace-analysis
UACE ANALYSIS FOR 2011 - 2015
data data-science data-visualization folium-maps geocoder jupyter-notebook pandas python3
Last synced: 14 Feb 2026
https://github.com/oneblack333/pizza_sales_analysis
The project involves transforming raw pizza sales data into actionable business intelligence through analysis and visualization. This enables pizza business owners to make data-driven decisions on inventory, staffing, and marketing, ultimately improving performance and profitability.
data data-structures data-visualization excel mysql powerbi
Last synced: 20 Jun 2026
https://github.com/serhatderya/tabular-playground-series
This repository contains solutions of monthly Tabular Playground Series in Kaggle.
ai artificial-intelligence data data-preprocessing data-processing data-science data-visualization jupyter-notebook kaggle machine-learning numpy pandas python regression scikit-learn scikitlearn-machine-learning seaborn software statsmodels
Last synced: 11 Apr 2026
https://github.com/mvicens/sporscor
TypeScript API to manage sport data getting scoreboards and statistics
api-client data score scoreboards sport statistics typescript
Last synced: 16 Feb 2026
https://github.com/divanny/tiendabackend
Tienda
backend core csharp csharp-code csharp-core data integration webapi
Last synced: 20 Jun 2026
https://github.com/stdlib-js/array-base-none-by-right
Test whether all elements in an array fail a test implemented by a predicate function, iterating from right to left.
all array data every generic javascript node node-js nodejs none predicate stdlib structure test types validate
Last synced: 01 Mar 2026
https://github.com/garcane/income-prediction-ml
This is a machine learning project aimed at predicting whether an individual's annual income exceeds $50,000 based on their demographic and personal information.
data data-science machine-learning ml numpy pandas python random-forest scikit-learn
Last synced: 08 Apr 2026
https://github.com/efler/microservice-data-bus
Data bus based on Apache Kafka and consisting of separate components [copied from own private repos]
data data-bus deduplication enrichment filtering kafka microservice mongodb postgresql redis
Last synced: 16 Apr 2026
https://github.com/varbrad/mindb
🗄 🔍 ⚡️ Schema-less document-oriented collection model data-store for Node & Browsers.
browser data datastore db document javascript json-schema mongo mongodb nodejs nosql query schema
Last synced: 13 Apr 2026
https://github.com/mohamedhany99/human-voice-identifier-counter
the application developed in (KIVY) it can identify the users imported into the dataset based on the support vector machine training model it has two features ( Importing new voice - Detection to detect the human voices and count them)
android android-app android-application automation automation-framework data data-analysis data-mining data-science data-visualization datascience kivy kivy-framework machine-learning python
Last synced: 27 Mar 2026
https://github.com/lakecountryhuntclub/dnr-map-data-model
Data Model for the 2023 DNR Pheasant Stocking Property Data
data data-model documentation excel gis hunting mapping powerquery vba
Last synced: 29 Jul 2025
https://github.com/apfirebolt/data-structures-and-algorithms-in-python
Data Structure and Algorithms in Python
algorithms data data-structures python python3 tkinter-gui
Last synced: 15 Mar 2025
https://github.com/petrosdemetrakopoulos/ethairballoons.py
A strictly typed ORM library for Ethereum blockchain.
blockchain dao dapp data database ethereum ethereum-blockchain library orm python smart-contracts web3
Last synced: 11 May 2026
https://github.com/ismailarilik/react-covid-maps
A global maps application aims to display COVID-19 statistics by countries, written with React
covid-19 data global maps react statistics
Last synced: 16 Apr 2026
https://github.com/connectaman/deepseek-ocr-multigpu-infer
Efficient multi-GPU OCR inference framework leveraging parallel processes for accelerated token throughput and faster batch processing. Designed for scalable, high-performance optical character recognition workloads using PyTorch. Supports dynamic GPU assignment, optimized resource utilization, and easy integration for large-scale image datasets.
agentic-extraction data deepseek document-parser extraction extractor gpu image-parser llm multigpu nvidia ocr parallel-computing parser pdf-parser vlm
Last synced: 22 Jan 2026
https://github.com/masu-baumgartner/dbsync.net
A c# mysql model sync library
Last synced: 13 May 2026
https://github.com/pferreirafabricio/data-immersion
🏊🏻♂️ Activities and exercises from 'Imersão Dados' event
data data-analysis data-science dataset jupiter-notebook python
Last synced: 14 May 2026
https://github.com/gkapfham/ast2016-paper
Source Code of and Supporting Files for a Paper Published at AST 2016
data latex-document paper research
Last synced: 19 Oct 2025
https://github.com/simranjeet97/datastructures_algoritms_python
Data Structures and Algorithms using Python
algorithms arrays arrays-and-strings coding data data-science data-structures datastructures-python hashing interview-preparation interview-questions linked-list python stacks stacks-as-an-array
Last synced: 09 Apr 2026
https://github.com/svelterun/store
Persisted version of svelte/store.
data state state-management store svelte svelte-store sveltekit svelterun typescript
Last synced: 08 Jan 2026