data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum
- Last updated: 2026-06-23 00:07:41 UTC
https://github.com/dark-art108/yonk
A CLI utility to streamline data science work by creating templates
Last synced: 08 May 2026
https://github.com/snandasena/disaster-response-pipeline
Disaster Response Pipeline | Data Engineering
data data-engineering-pipeline etl flask machine-learning nlp nlp-pipeline
Last synced: 24 Apr 2026
https://github.com/danielbayley/schemas
A collection of useful @JSON-schema-org schemas for data validation.
ajv config configuration data data-science data-structures data-validation json json-schema linter linting schema schema-org validation yaml yaml-configuration
Last synced: 13 Oct 2025
https://github.com/asirihewage/simplest-xpath-web-scraper
Simplest web scraper created using Python3 and MongoDB
data data-mining python3 scraper web webscrping
Last synced: 29 Jan 2026
https://github.com/physio/flatten-ts
Flatten-ts is a lightweight TypeScript library for easily flattening and unflattening nested objects and arrays with customizable options and fast performance.
array conversion data flatten javascript json object typescript
Last synced: 06 May 2026
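The flattening and unflattening described for the entry above can be sketched in a few lines. This is a minimal Python illustration of the general technique, not the flatten-ts API: the names `flatten` and `unflatten`, the `sep` parameter, and the dotted-key convention are all assumptions made for the example.

```python
from typing import Any


def flatten(value: Any, sep: str = ".", prefix: str = "") -> dict[str, Any]:
    """Collapse nested dicts and lists into one level of separator-joined keys.

    Hypothetical helper: list positions become numeric path segments, and
    empty containers produce no keys at all (a simplification).
    """
    if isinstance(value, dict):
        items = value.items()
    elif isinstance(value, list):
        items = enumerate(value)
    else:
        # Leaf value: emit it under the path accumulated so far.
        return {prefix: value}

    flat: dict[str, Any] = {}
    for key, child in items:
        path = f"{prefix}{sep}{key}" if prefix else str(key)
        flat.update(flatten(child, sep, path))
    return flat


def unflatten(flat: dict[str, Any], sep: str = ".") -> dict[str, Any]:
    """Rebuild nested dicts from separator-joined keys.

    Simplification: every path segment becomes a dict key, so lists are
    restored as dicts keyed by "0", "1", ... rather than as lists.
    """
    nested: dict[str, Any] = {}
    for path, value in flat.items():
        *parents, leaf = path.split(sep)
        node = nested
        for part in parents:
            node = node.setdefault(part, {})
        node[leaf] = value
    return nested


record = {"a": {"b": 1, "c": [10, 20]}, "d": "x"}
flat_record = flatten(record)
restored = unflatten(flat_record)
```

Because the sketch does not record which segments were list indices, the round trip is lossy for arrays; a real library would need some way to tell the two apart.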
https://github.com/jaffarabbas/library-management-system-in-java-
GUI base + Database functionality
data database datastructures-algorithms dbms gson java javafx javafx-application javafx-desktop-apps javamail library-management-system mysql sql xammp
Last synced: 05 May 2026
https://github.com/mednour2019/devolap
OLAP Cube Dispatcher Tool
analysis-services csharp data excel excel-export kpi mdx metroframework mvvm-architecture sql wpf
Last synced: 27 Jan 2026
https://github.com/andreaselia/quotes-xd
A plugin for Adobe XD to insert a text element with a random quote and respective author.
adobe adobe-xd data design design-tool design-tools quote random xd
Last synced: 24 Apr 2026
https://github.com/skywarth/fenrir-wolfpack-simulator
Simulating wolfpack behaviours and the future of the pack in an environment, using JavaScript and data trees.
data data-structures javascript max-heap simulation simulations wolfpack
Last synced: 14 Oct 2025
https://github.com/doriclaudino/canarinho_nlp
labels, classify, summarization string for canarinho app
chrome-console classification classifier-model data labels nlp nlu python spacy spacy-models spacy-nlp summarization-string
Last synced: 08 May 2026
https://github.com/manifoldfinance/disco-schema
MEV Auction and Ethereum Network Data Schemas
cryo data dataset ethereum ethereum-builders ethereum-mev evm mev-data pandas schema-registry schemas
Last synced: 08 May 2026
https://github.com/perceptronv/miscellaneous
A huge variety of materials, mostly training data for AI. Not a lot of source code yet.
data gan machine-learning nlp text-generation
Last synced: 04 May 2026
https://github.com/capire/xtravels-java
Travel booking app using master data from xflights built with CAP Java
cap cds data federation flights java reuse
Last synced: 23 Jan 2026
https://github.com/rdjarbeng/rdjarbeng
Richard Djarbeng's GitHub profile. Computer engineer specializing in web development, machine learning, and IoT devices. New web posts have moved to the website below.
data jekyll machine-learning ruby website
Last synced: 28 Apr 2026
https://github.com/ahmetcansolak/developer-insights
New project of ClubRockers from Sarıyer Hills
bitbucket data data-science data-visualization github python3
Last synced: 28 Apr 2026
https://github.com/2kabhishek/pyramen
Data Analysis for Ramen 🍜💹
csv data data-analysis fun python report
Last synced: 26 Oct 2025
https://github.com/mikeintoshsystems/dhis2heat
A comprehensive data management and health equity assessment and analysis platform that fetches data from DHIS2 and optimizes, calculates, cleans, and visualizes inequality data.
analytics data data-science dhis2 equality equity health heat inequality r shiny shinydashboard visualization
Last synced: 28 Apr 2026
https://github.com/mihaiconstantin/lavot
A `React` application that allows users to indicate how votes will be redistributed among candidates for the second round of Romanian presidential elections.
data data-visualization elections react sankey typescript
Last synced: 06 Feb 2026
https://github.com/iamlucianojr/laravel-api-query-handler
:flashlight: This Laravel package helps to handle a query request properly
api collection data eloquent handler l5x laravel query
Last synced: 28 Apr 2026
https://github.com/igorwastaken/math-problems
Solve math problems easily with this utility library.
algorithm area data demography geography javascript math npm package population school typescript util utils
Last synced: 23 Feb 2026
https://github.com/helins/ex.clj
Java exceptions as clojure data
clojure data exception java java-exceptions
Last synced: 12 Dec 2025
https://github.com/karthikmprakash/github_repos_scraper
A tool to extract the names of any user's GitHub repos
automation bs4 data github python repositories requests webscraping
Last synced: 27 Apr 2026
https://github.com/aidenellis/connectmp
🍰 ConnectMP - An easy way to share data between processes in Python.
aidenellis connectmp data data-sharing multiprocessing process sharing
Last synced: 27 Apr 2026
https://github.com/bukalapak/bukadata
Data supplier plugin for populating design with real data.
data plugin sketch sketch-plugin
Last synced: 05 Jul 2025
https://github.com/sap-samples/security-research-codegraphsmote
Data augmentation strategy that can be applied to code graphs for learning-based vulnerability discovery.
augmentation data detection learning machine research sample security vulnerability
Last synced: 07 Jun 2026
https://github.com/mohamedhany99/human-voice-identifier-counter
An application developed in Kivy that identifies users imported into the dataset using a support vector machine model. It has two features: importing new voices, and detection, which detects human voices and counts them.
android android-app android-application automation automation-framework data data-analysis data-mining data-science data-visualization datascience kivy kivy-framework machine-learning python
Last synced: 27 Mar 2026
https://github.com/maccccd/wsoa3029a_2444372
This website serves as an extension of my portfolio work. It focuses specifically on showcasing my understanding of D3.js, a JavaScript library used to create interactive data visualizations. The visualizations here were used to provide insights on two types of cybersecurity attacks: phishing and ransomware.
d3js data hacking visualization
Last synced: 24 Jan 2026
https://github.com/city-of-helsinki/drupal-helfi-tyollisyyspalvelut-manuaali
Manual for the service data repository of the municipal employment trials
data drupal drupal-9 unemployment
Last synced: 24 Jan 2026
https://github.com/zalweny26/open_data_unipa
Project for the Algorithms Laboratory exam, 23-24, UniPa, Computer Science (L-31)
Last synced: 26 Apr 2026
https://github.com/zoekelepiri/ota_observatory
A front-end web application that provides detailed information about the boundaries and statistical data of the regions and prefectures of Greece.
backend data database spring-boot
Last synced: 06 Feb 2026
https://github.com/avto-dev/static-references-data
Data for static references
Last synced: 05 Oct 2025
https://github.com/CheeseWithSauce/HadithsJSONFormat
Free, authentic Hadith data from sunnah.com, organized book by book, especially for Muslim devs. Includes Arabic, English, and gradings. Use freely without credits. Collections: Bukhari, Muslim, Abu Dawud, Tirmidhi, Nasa'i, Ibn Majah, Malik, Riyad as-Salihin. Expanding soon, Inshallah.
api arabic data dev free hadith islam islamic muslim open-source quran sunnah
Last synced: 24 Feb 2026
https://github.com/ahmad-ali-rafique/pyviznotebook
PyVizNotebook is a collection of Matplotlib visualizations demonstrating a wide range of plot types and techniques for data visualization. Whether you're a beginner looking to learn or an experienced developer seeking inspiration, this repository offers a diverse set of examples to explore.
analytics colab-notebook data data-science data-visualization dataanalytics matplotlib-python plots seaborn-python visualization
Last synced: 06 Jun 2026
https://github.com/chriseaton/sample-database
A long-term supported sample dataset for file and database unit testing and validation. Simple, straight-forward, raw data shared across formats.
data database examples flat-file samples schema unit-testing
Last synced: 25 Apr 2026
https://github.com/fairspec/fairspec-typescript
Fairspec TypeScript is a fast data management framework built on top of the Fairspec standard and Polars DataFrames
ckan csv data dataframe dataset excel fair json ods polars quality schema sqlite table typescript validation zenodo
Last synced: 09 Feb 2026
https://github.com/alejo1630/titanic_kaggle
This Python Notebook is a proposal to analyse the Titanic dataset for the Kaggle Competition, using several data science techniques and concepts.
data data-science jupyter-notebook notebook python titanic-survival-prediction
Last synced: 03 May 2026
https://github.com/bredalis/functionalprogrammingpython
💻 Functional Programming in Python
data functional-programming functions programing programming-language python structured-data
Last synced: 06 Jun 2026
https://github.com/yord/klp-core
A plugin with basic operations for klp (Kelpie), the small, fast, and magical command-line data processor.
csv data deserializer dsv json kelpie klp marshaller parser serializer ssv tsv
Last synced: 24 Apr 2026
https://github.com/sebastianbrzustowicz/collision-detection-ai
Python + TensorFlow. Repository for training a machine learning model for collision detection using accelerometer sensor data and TensorFlow.
accelerometer accelerometer-data ai artificial-intelligence data dataset imu learning machine-learning microprocessor ml model quadcopter script sensor tensorflow
Last synced: 24 Apr 2026
https://github.com/jinsyin/datagovernance
WeChat official account: 「数据之道」 (The Way of Data)
data data-governance datagovernance governance
Last synced: 30 Jan 2026
https://github.com/howtoquitvivek/ai-crop-yeild-prediction
AI-driven crop yield prediction and agricultural optimization system (SIH 2025)
2025 2026 ai crop-yeild data minor-project ml predcition python science sih
Last synced: 23 Apr 2026
https://github.com/aiwithqasim/competitive-programming
All the material I have worked through, or will work through in the future, to improve my programming skills and become a competitive programmer.
c-plus-plus code data java programming structured-data
Last synced: 20 May 2026
https://github.com/stefen-taime/myubereats_datapipeline
Building a Modern Uber Eats Data Pipeline
airflow api data datawarehouse mongodb pipeline powerbi snowflake
Last synced: 22 Apr 2026
https://github.com/cintia0528/data_cleaning_and_analytics-python
Evaluate if aggressive discounting benefits Eniac long-term, considering differing views on customer acquisition and brand positioning. Focus on data cleaning for informed decision-making.
colab-notebook data data-analysis datacleaning dataquality jupyter-notebook matplotlib pandas python seaborn
Last synced: 08 Jan 2026
https://github.com/tether/tether-schema
Custom protocol buffer schema for data validation
data protocol schema validation
Last synced: 09 Apr 2025
https://github.com/vvipjain/bike-sales-dashboard
Bike Sales Dashboard
dashboards data data-analysis data-cleaning data-normalisation data-visualization excel pivot-chart pivot-tables
Last synced: 04 Feb 2026
https://github.com/sefakcmn00/tensorflow_car_price_analysis
In this project, after extracting the data sets as CSV, we tried to represent the car prices graphically and schematically using data analysis and data visualization methods. We checked how the car prices we analyzed relate to other data, then created a 4-layer, 12-neuron system.
data datatrain keras machine-learning matplotlib-pyplot pandas seaborn sklearn tensorflow
Last synced: 14 Apr 2026
https://github.com/simranjeet97/quotes-analysis
Kaggle dataset on quotes: analysis and visualization with Python, Pandas, and Matplotlib using Jupyter Notebook.
data data-science datavisualization jupyter-notebook kaggle kaggle-dataset machine-learning matplotlib-pyplot numpy pandas python quotes quotes-application
Last synced: 15 Apr 2026
https://github.com/robson-python/data-analysis-car-price-prediction
This dataset contains 10,000 entries created for the purpose of predicting car prices.
data data-visualization dataanalysis inteligencia-artificial machine-learning matplotlib pandas-dataframe python scikit-learn seaborn vscode
Last synced: 21 Apr 2026
https://github.com/cicerotcv/br-gen
A browser extension for generating Brazilian placeholder data.
chrome data extension generation hacktoberfest
Last synced: 21 Apr 2026
https://github.com/neomutt/sample-data
📚 Lists of things. Useful for developing and testing.
Last synced: 19 Mar 2026
https://github.com/inphyt/quantitative_single_neuron_modeling_competition_2009
Data for the Quantitative Single-Neuron Modeling Competition (2009).
bayesian-inference bayesian-methods bayesian-optimization bayesian-statistics challenge competition computational-neuroscience data electrophysiological-data electrophysiology-data model-calibration modeling neuronal-models neuroscience neuroscience-competition parameter-estimation simulation simulation-modeling single-neuron-model uncertainty-quantification
Last synced: 25 Feb 2026
https://github.com/tee8z/noaa-oracle
NOAA data oracle that is queryable from the browser and can attest to events for a Bitcoin DLC in dlctix style
data duckdb-wasm noaa-weather parquet-files sql weather
Last synced: 17 Feb 2026
https://github.com/dandre3000/matrix
Matrix library
algebra array data data-structure math matrix vector
Last synced: 01 Feb 2026
https://github.com/unownone/spenddy-link
Simple, privacy-friendly Chrome extension to track your spending and more!
Last synced: 12 Mar 2026
https://github.com/openearth/rws-viewer
This viewer is created by Deltares in cooperation with Voorhoede under OpenEarth GPL License. The viewer can be used via several RWS websites, please visit https://www.informatiehuismarien.nl/, https://waterinfo-extra.rws.nl/ and https://basismonitoringwadden.waddenzee.nl/.
data mapbox-gl-js ogc-services viewer
Last synced: 01 Feb 2026
https://github.com/garciparedes/r-examples
Set of awesome R Examples
data data-science garciparedes r statistics university-of-valladolid
Last synced: 20 Apr 2026
https://github.com/ktbarrett/scdil
simple configuration and data interchange language
configuration data json python yaml
Last synced: 20 Apr 2026
https://github.com/jub0t/eso
An application to manage all your Encryption & Decryption keys and other related tools.
data encryption encryption-decryption hacking hacking-tool keys pgp privacy private
Last synced: 07 Feb 2026
https://github.com/whitehathackerpr/data-visualization-tool
This is a Python-based web application that allows users to upload datasets, analyze data, and create visualizations interactively. The tool is designed for ease of use and provides a simple interface to perform basic data analysis and generate visualizations.
data data-analysis data-visualization python python3
Last synced: 05 Sep 2025
https://github.com/marabesi/d3-visualization
Different visualizations using data and d3.js
charts css d3js data html js json timeline-chart visualization
Last synced: 01 May 2026
https://github.com/basemax/buskool.com-data
This repository contains the collected product data from the Buskool website (باسکول). The data is stored in 20k+ JSON files, each containing detailed information about products available on the website.
buskool buskoolcom data farsi information ir iran json persian
Last synced: 03 Apr 2025
https://github.com/adriweb/wsualizer
Some random code to visualize things coming from a websocket (pronounced 'visualizer')
bootstrap data html jquery real-time visualization visualizer websockets
Last synced: 20 Apr 2026
https://github.com/rayyan9477/dep
data data-science machine-learning python visualization web-scraping
Last synced: 08 May 2026
https://github.com/garcane/cookie-company-visual-dashboard
This Excel-based interactive dashboard provides a comprehensive overview of the Cookie Company's sales performance and key metrics.
dashboard data data-visualization excel microsoft-excel
Last synced: 09 Feb 2026
https://github.com/pharo-ai/data-preprocessing
Project including data pre-processing algorithms. We aim to include scaling, centering, normalization, and binarization methods.
data pharo pharo-smalltalk preprocessing smalltalk
Last synced: 09 Feb 2026
https://github.com/programmer-rd-ai/library-management-system-oraclesql
The Library Management System project, part of the CI6320 Advanced Data Modelling coursework, features comprehensive SQL scripts utilizing OracleSQL to facilitate efficient data modeling and management.
adm advanced ci6320 cw data icw library management modelling oracle oraclesql report sql system
Last synced: 29 Oct 2025
https://github.com/danielrosehill/monetised-ghg-emissions
Calculating monetised GHG emissions for various companies based upon disclosure data
data sustainability sustainability-data
Last synced: 07 Sep 2025
https://github.com/quasilyte/phpcorpus
A collection of various PHP code; useful for PHP tool writers to get some insight into what "real-world" PHP code looks like
analysis corpus data php php-corpus
Last synced: 04 Jul 2025
https://github.com/ayushai/salesfoce-hospital-management
A custom Salesforce-based Hospital Management System with powerful dashboards and data analysis tools. It provides real-time insights into patient care, appointment scheduling, and inventory management, optimizing healthcare operations and decision-making.
analytics dashboard data salesforce-developers visualization
Last synced: 22 Feb 2026
https://github.com/colour-science/colour-checker-detection-examples-datasets
Colour - Checker Detection - Examples Datasets
color color-checker color-science color-space color-spaces colorspace colorspaces colour colour-checker colour-science colour-space colour-spaces colourspace colourspaces data dataset datasets raw
Last synced: 19 Mar 2026
https://github.com/prajwalsinha/unveiling-climate-change-dynamics-through-earth-surface-temperature-analysis
Climate change analysis through global surface temperature data. Includes data preprocessing, statistical analysis, visualizations, and forecasting. Python-based project using Pandas, Matplotlib, and Scikit-learn.
data dataanalysis dynamic-mapping pyplot python scikit-learn seaborn
Last synced: 10 Feb 2026
https://github.com/jhpoelen/bats
Self-documenting data publication on bat (Chiroptera) specimens
biodiversity data natural-history-collections provenance specimen
Last synced: 18 Mar 2026
https://github.com/mattythedev01/easydatadb
A quick and easy way to store data!
data database discord-bot discord-js discord-ts discordbot discordjs discordts npm npm-package package quick-db quickdb
Last synced: 13 Apr 2026
https://github.com/scottleechua/data
Public datasets under CC-BY-4.0 license.
Last synced: 18 Mar 2026
https://github.com/m-rishab/stock_trend-analysis-power-bi-project-
In this project, I've harnessed the robust capabilities of Power BI to analyse, visualize, and uncover the story behind HUL's stock performance.
data datavisualization datavisualization-project powerbi
Last synced: 19 Mar 2026
https://github.com/aiwithqasim/recommendationengines
Recommendation Engines with IBM, a project from the Data Scientist Nanodegree on Udacity. This project analyzes the interactions that users have with articles on the IBM Watson Studio platform and recommends new articles they are likely to enjoy.
data data-manging data-science ibm ipython-notebook normalization python3
Last synced: 18 Apr 2026
https://github.com/dbriane208/omdena-apprenticeship-project
This is part of my contribution to the Omdena apprenticeship program.
data data-science feature-engineering machine-learning
Last synced: 14 Mar 2026
https://github.com/vagnerbellacosa/029_analisededadoscompythonpandas
This lab introduces Pandas, an open-source Python library for data analysis. It gives Python the ability to work with spreadsheet-like data, making it possible to load, manipulate, and combine data quickly, among other functions. Python
data digital-innovation-one dio jupiter-notebook labs ms-excel panda python
Last synced: 14 May 2026
https://github.com/luminati-io/twitter-x-dataset-samples
A sample dataset of over 1000 Twitter (X) posts, extracted using the Bright Data API, ideal for trend discovery, brand monitoring, and competitive insights.
api data dataset twitter twitter-api twitter-scraper web-scraping x
Last synced: 19 Mar 2026