data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-06-23 00:07:41 UTC
- JSON Representation
https://github.com/chaitanyac22/hr_policy_query_resolution_with_retrieval_augmented_generation_rag
This repository contains an HR Policy Query Resolution system using Retrieval-Augmented Generation (RAG). It leverages a 4-bit quantized Mistral-7B-Instruct-v0.2 LLM and JP Morgan Chase’s publicly available Code of Conduct documents to generate accurate, contextually relevant responses for HR policy queries.
artificial-intelligence data hr large-language-models llm mistral-7b nlp pipeline prompt-engineering quantization rag retrieval-augmented-generation
Last synced: 12 Feb 2026
https://github.com/a3r0id/lightshot-data-miner
A random idea I had a while back to make a data miner for lightshot. Never released this but after a friend sent me a post about lightshot's transparency I figured it'd be a good time to release this. I've included some output from a run before making the repo. I am not responsible for the imagery or it's contents.
brute-force bruteforce data dataset face-recognition image-processing lightshot mining scraper scraping text-recognition
Last synced: 19 Oct 2025
https://github.com/karashiiro/lodestone-id-time
Data scraper, formula and reference implementation for the estimated creation time of a FFXIV character given its Lodestone ID.
data ffxiv ffxiv-character lodestone
Last synced: 30 Jun 2025
https://github.com/marek-jakub/monitoring
A university project concerning field data management for bird ringers.
bird data fieldwork management ringing
Last synced: 24 Jun 2026
https://github.com/skylinenando/javascript
autocomplete browser data disable events javascript language loop
Last synced: 14 Feb 2026
https://github.com/rulox/faker
A Go library to create Fake Data for your projects
data dummy dummy-data fake fake-data faker go golang
Last synced: 28 May 2026
https://github.com/divithraju/divith-raju-searchengine-wikipedia
search engine optimizationA complete search engine experience built on top of 75 GB Wikipedia corpus with subsecond latency for searches. Results contain wiki pages ordered by TF/IDF relevance based on given search word/s. From an optimized code to the K-Way mergesort algorithm, this project addresses latency, indexing, and big data challenges.
algorithms data dataengineering inverted-index linux merge-sort nlp project project-repository python3 serchengine software-engineering ubuntu wikipedia
Last synced: 16 May 2026
https://github.com/poncoe/passdatatoanotherfragment
Latihan Passing data Ke Fragment Lain
android android-app android-application android-studio data fragment fragments kotlin kotlin-android passing-parameters passingdataintent viewmodel
Last synced: 23 Jun 2026
https://github.com/jongirard/unique_names_generator
A Unique Names Generator built in Elixir
data data-generator elixir elixir-lang fake-data name-generator phoenix seed
Last synced: 21 Oct 2025
https://github.com/kylekirkby/cardatasnatch
CarDataSnatch allows you to quickly find information about a car in the uk using a valid number plate. Grab an image of the car in question along with a multitude of other data. Compare two cars' data for fast and easy analysis.
beautifulsoup cars command-line-tool data data-analysis data-mining ethical-hacking python python3 requests scraper social-engineering
Last synced: 15 Apr 2025
https://github.com/wibosco/modelingformchanges-example
An example project to show how we can implement a model to simplify form validation
data swift unit-testing validator
Last synced: 16 Mar 2025
https://github.com/planarnetwork/feeds.planar.network
GTFS feeds for bus, train and plane
data feeds gtfs transit transportation
Last synced: 11 Feb 2026
https://github.com/utrechtuniversity/dataprivacyproject
This is the repository underlying the landing page for the Data Privacy Project @UtrechtUniversity, the Netherlands.
data gdpr open-science privacy rdm research research-data-management utrecht-university
Last synced: 10 Oct 2025
https://github.com/stdlib-js/array-ones
Create an array filled with ones and having a specified length.
array data float32array float64array int16array int32array javascript matrix ndarray node node-js nodejs stdlib structure typed typed-array types uint32array vector
Last synced: 09 Apr 2025
https://github.com/lafayettegabe/g2m-insight-for-cab-investment-firm
📊 Exploratory Data Analysis (EDA) on multiple datasets related to the cab industry in the US, to provide actionable insights and recommendations to a private firm looking to invest in the market. The analysis includes data cleaning, transformation, visualization, and hypothesis testing.
big-data data data-analysis data-science data-visualization eda gotomarket
Last synced: 13 Jun 2025
https://github.com/oliverhennhoefer/shiny-template-interactive-table
Example of interactively adding rows / deleting rows by selecting directly in a data.table (DT) in Shiny
button data delete dt r select selection server shiny shiny-applications shiny-apps shiny-r shinyapps table ui userinterface
Last synced: 16 Apr 2026
https://github.com/uk-ipop/open-data-pipeline
A pipeline for processing, enhancing, and sharing open datasets.
actions automation data python
Last synced: 25 May 2026
https://github.com/mskian/tamil-words
Tamil words Collections with English Meaning - API and SQL Data.
api data javascript json json-api mysql pdo php sql tamil tamil-language tamil-sms tamilwords translate translator
Last synced: 14 Apr 2026
https://github.com/slashdotted/pomapure
PoorMan's Pipeline
data json modular module pipeline processing
Last synced: 18 Apr 2026
https://github.com/yakupzengin/data-structures-and-algortihms
This repo contains implementation of data structures and algorithms using JAVA
algorithms algorithms-and-data-structures data structure
Last synced: 03 Dec 2025
https://github.com/evoluteur/web-scraper-sitemaps
Sitemaps for the Web Scraper Chrome extension.
chrome-extension data dataset scraper scraping scrapper scrapping scrapy-crawler sitemap web-scraper web-scraping
Last synced: 04 Jun 2026
https://github.com/ssiarhei115/customer-classification
Developing ML model predicting bank' customer inclination to open a deposit
big-data big-data-analytics data data-science data-visualization mashine-learning
Last synced: 09 Apr 2025
https://github.com/rcourivaud/rcourivaud.github.io
Raphaël Courivaud
data database datascience python
Last synced: 21 Apr 2026
https://github.com/zgbjgg/quetzal-examples
Examples using Quetzal :rocket: :bird:
analytics dashboard data data-visualization elixir erlang plotly web-app
Last synced: 24 Apr 2026
https://github.com/openpeeps/zxc-nim
Bindings to the ZXC compression library, a LZ77-based compressor optimized for high decompression speed
archive compression compressor data decompression game-assets lossless lossless-compression lz77 nim nim-bindings nim-package nim-wrapper openpeeps zxc
Last synced: 07 Jun 2026
https://github.com/andrewrporter/my-analytics
Analyzes FireFox browsing history with modern python3 features and libraries
analytics data firefox matplotlib python python3 sqlite3
Last synced: 28 Apr 2026
https://github.com/espoirmur/balobi_nini
An End to End Data Science Project, where I used Tweepy and Airflow to collect tweets related to the DRC and topic modeling technics to discover which topics Congolese are talking about on Twitter.
Last synced: 24 Aug 2025
https://github.com/ismet55555/pdw-asym-2link
Clear and easy way of simulating a passive dynamic walker (PDW) model derived and exectured using MATLAB.
data dynamics inverted-pendulum matlab numerical-simulations passive-dynamic-walker passive-dynamics ramp research robotics simulation slope walking-simulator
Last synced: 29 Apr 2026
https://github.com/14richa/patient-readmission-analysis
This project focuses on predictive modeling to foresee hospital readmissions of diabetic patients within 30 days post-discharge. By leveraging a dataset spanning a decade (1999-2008) and covering records from 130 US hospitals, the aim is to enhance healthcare management and patient outcomes.
analytics data jupyter-notebook numpy
Last synced: 29 Apr 2026
https://github.com/andrey-tech/data-storage-php
Простое хранилище данных в виде ключ-значение в JSON-файлах с разделяемой блокировкой на чтение и эксклюзивной блокировкой на запись.
data data-storage files json php php7 storage
Last synced: 29 Apr 2026
https://github.com/saleh0987/mohamed_saleh
That's my personal website where I show my skills and projects.
aos-animation axios boot data json nextjs portfolio portfolio-website projects react-icons reactjs sass swiper
Last synced: 09 Mar 2026
https://github.com/camara94/data-visualization-with-python
Data visualization and some of the best practices when creating plots and visuals. The history and architecture of Matplotlib, and how to do basic plotting with Matplotlib. Generating different visualization tools using Matplotlib such as line plots, area plots, histograms, bar charts, box plots, and pie charts. Seaborn, another data visualization library in Python, and how to use it to create attractive statistical graphics. Folium, and how to use to create maps and visualize geospatial data.
data data-science data-structures data-visualization python3
Last synced: 16 May 2026
https://github.com/dongminlee94/data-visualization-tutorial
A repository for data visualization tutorial
data data-science data-visualization matp matplotlib pca plotly python seaborn t-sne tutorial umap visualization
Last synced: 29 Apr 2026
https://github.com/sabujxi/python-scraper-and-data-analysts-admin-panel-in-django
A data scraper from texas govt site and a helping web app for managing, reviewing and editing the data
analyst data data-analysis data-entry data-scraper django django-application python python-scraper real-estate regex scraper texas
Last synced: 30 Apr 2026
https://github.com/stdlib-js/ndarray-base-char2dtype
Return the data type string associated with a provided single letter abbreviation.
abbr abbreviation array base c data dtype javascript multidimensional ndarray node node-js nodejs stdlib type types util utilities utility utils
Last synced: 12 Mar 2026
https://github.com/noklam/blog_archive_fastpage
Nok's data science blog
blog data data-science machine-learning python sceince
Last synced: 01 May 2026
https://github.com/woctezuma/geforce-leak
Fetch data from the Geforce leak.
data datamining egs epic epic-games epic-games-launcher epic-games-store geforce geforce-experience geforce-leak geforce-now geforce-now-leak geforcenow geforcenow-leak graphql leak leaks nvidia steam steam-games
Last synced: 02 May 2026
https://github.com/StudyResearchProjects/arrbuffstr
Creates Strings from ArrayBuffers and viceversa in NodeJS and the Browser
arraybuffer browser data node string transform
Last synced: 09 Oct 2025
https://github.com/priyanka7411/customer-segmentation-churn-dashboard
📊 Streamlit + Plotly dashboard for customer segmentation, RFM analysis, and churn prediction using machine learning.
churn data machine-learning pandas prediction python rfm rfm-analysis streamlit visualization
Last synced: 14 Apr 2026
https://github.com/dantesc03/uberpool-case-study
This project was designed to understand the statistical effects of longer wait times on uber rides. Particularly on the user and driver experience with the Uber Pool System.
analysis data excel jupyter jupyternotebooks learn python seaborn statistics t-tests uber visualization
Last synced: 16 Apr 2026
https://github.com/assem-elqersh/creativa-data-science-bootcamp
Jupyter notebooks from the Creativa Data Science Bootcamp, covering key data science concepts and practices across multiple sessions, from data preprocessing to model building and time series analysis.
data data-science eda exploratory-data-analysis machine-learning pandas time-series-analysis xgboost xgboost-classifier
Last synced: 03 May 2026
https://github.com/rastmob/wordpress-llms-output-plugin
A WordPress plugin to export posts, pages, and custom post types as JSON for training Language Models (LLMs).
ai data llm llms training training-data wordpress wordpress-development wordpress-plugin
Last synced: 03 May 2026
https://github.com/stefen-taime/real-time-data-pipeline-snake-game
Dynamic Snake Game: Unleashing Real-Time Streaming Analytics with Redis, Kafka, Flink, ClickHouse & Chart.js in an Online Snake Game via Flask API
chartjs clickhouse confluent-cloud data flask kafka-streams pipeline redis
Last synced: 04 May 2026
https://github.com/eyedia/idpe
Eyedia's Integrated Data Processing Environment
csharp data designer development development-environment development-tools development-workflow environment ide no-coding parser processing rehosted workflow
Last synced: 11 Oct 2025
https://github.com/acaciaman/db-autotest
DB Database test automation. This python package allows to create database object structure and load data from database.
Last synced: 05 May 2026
https://github.com/husna-poyraz/artificial-intelligence-and-data-science
Some studies on Artificial Intelligence and Data Science ...
artificial-intelligence data data-analysis-python data-science matplotlib-pyplot numpy pandas python
Last synced: 05 May 2026
https://github.com/quin1sue/priceguidesph-bettergov
an economic and financial data platform project under bettergov.ph
bettergovph cloudflare data hacktoberfest nextjs priceguides
Last synced: 05 May 2026
https://github.com/automators-com/datamaker-js
The official Node.js / Typescript library for the DataMaker API
data javascript nodejs typescript
Last synced: 11 Oct 2025
https://github.com/iusztinpaul/airbnb-data-analysis
Airbnb data analysis on the biggest cities in The Netherlands following the CRISP-DM methodology.
airbnb data datanalysis datascience machine-learning numpy pandas python
Last synced: 06 May 2026
https://github.com/jesusgraterol/bitcoin-blockchain-dataset-builder
The dataset builder script extracts all the relevant block information from the Bitcoin Blockchain through Mempool.space's public API. The data is stored in a .csv file, facilitating its use in data science and machine learning projects.
bitcoin blockchain blockchain-technology data datascience datascience-machinelearning dataset dataset-generation machine-learning
Last synced: 06 May 2026
https://github.com/erictleung/erictleung.github.io
:memo: Source code for my website, portfolio of projects, and more
bioinformatics blog data data-analysis data-science github-jekyll github-page jekyll lanyon open-science open-source software-engineering
Last synced: 21 Jan 2026
https://github.com/doriclaudino/canarinho_nlp
labels, classify, summarization string for canarinho app
chrome-console classification classifier-model data labels nlp nlu python spacy spacy-models spacy-nlp summarization-string
Last synced: 08 May 2026
https://github.com/yanpitangui/iteminfoconverter
Application that converts ragnarok legacy data files to iteminfo.lua
data itemdbconf iteminfo luafiles ragnarok
Last synced: 12 Oct 2025
https://github.com/hasnocool/war_thunder_camouflage_scraper
A concurrent web scraper designed to collect camouflage information from war thunder aircrafts.
asyncio camouflage concurrent data execution handling playwright python scraping signal sqlite3 thunder war web
Last synced: 04 Jan 2026
https://github.com/rohan-paul/machine-learning-and-deep-learning-tutorial-notebooks
Various Machine Learning and Deep Learning Tutorial Notebooks in Blog Format
data data-analysis data-science deep-learning deep-learning-tutorial deep-neural-networks machine-learning machine-learning-algorithms machinelearning neural-network pytorch pytorch-implementation pytorch-tutorial tensorflow
Last synced: 09 May 2026
https://github.com/eby8zevin/android-pos4122020
The Next Project . . .
android android-app android-application android-database android-studio androidstudio create data database database-sqlite delete point-of-sale pos read search sqlite update
Last synced: 13 Oct 2025
https://github.com/datahub-local/datahub-local
DataHub.local is a powerful data platform designed for edge devices, enabling seamless analytics and insights at home
data data-engineering devops kubernetes raspberrypi
Last synced: 21 Jan 2026
https://github.com/secret-guest/file_organizer
Files Organizer is a versatile tool for sorting and organizing files efficiently, ideal for managing recovered data.
c c-development data data-recovery file-management file-manager files sorting sorting-algorithms subdirectories subdirectory
Last synced: 10 Jun 2026
https://github.com/mahmoud-saeed-mahmoud/loading_state_handler
The StateHandlerWidget manages different UI states—loading, error, empty, and normal—allowing you to customize the displayed widgets for each state.
dart data error flutter flutter-package flutter-widget loading state
Last synced: 10 Mar 2026
https://github.com/tayeva/eia-client-python
EIA Open Data API Client - Python
data open-source python python-3 python3
Last synced: 14 Oct 2025
https://github.com/mednour2019/devolap
OLAP Cube Dispatcher Tool
analysis-services csharp data excel excel-export kpi mdx metroframework mvvm-architecture sql wpf
Last synced: 27 Jan 2026
https://github.com/skywarth/fenrir-wolfpack-simulator
Simulating wolfpack behaviours and future of the pack in an environment using Javascript and data trees.
data data-structures javascript max-heap simulation simulations wolfpack
Last synced: 14 Oct 2025
https://github.com/missiontoscale/bluesky-scraper
This is a work of art that enables you to scrape data off BlueSky.
analytics bluesky bluesky-api bluesky-client data datascraper-framework datascraping scraping social-media web webscraping
Last synced: 19 Jun 2026
https://github.com/yorkulibraries/vendorpol
URLs for vendor privacy policies and terms of use.
Last synced: 15 Oct 2025
https://github.com/codecentric/reedelk-bookingintegrationservice
Example service for the blog post series about Reedelk
api api-gateway data integration integration-flow
Last synced: 16 Oct 2025
https://github.com/audeering/emodb
Publishes Berlin Database of Emotional Speech with audb
Last synced: 19 Oct 2025
https://github.com/plabayo/datapoints.earth
Earth data liberation for and by its citizens.
Last synced: 15 Mar 2026
https://github.com/davemlz/master_of_datascience
Master of Data Science repository
data data-mining data-science database r rmd sql sqlite statistics
Last synced: 14 Apr 2026
https://github.com/mrnazu/eth-data-library
eth-data-library is a Nodejs library that provides tools for accessing and processing data on the Ethereum blockchain.
blockchain data ethereum nodejs smart-contracts web3
Last synced: 28 Jan 2026
https://github.com/banbord/data-vis-tornados
This repository includes data files, processing scripts, visualization code, and documentation for our tornado data visualization project. It aims to provide insights into tornado patterns across the United States using interactive and informative visual representations.
d3-visualization d3js data javascript json visualization
Last synced: 24 Feb 2026
https://github.com/asirihewage/simplest-xpath-web-scraper
Simplest web scraper created using Python3 and MongoDB
data data-mining python3 scraper web webscrping
Last synced: 29 Jan 2026
https://github.com/peterdavehello/nrd-list-archive
🌐📂 A collection of past NRD lists to explore—perfect for fun, research, or just plain curiosity! 🎉🔍✨
Last synced: 17 Mar 2026
https://github.com/eesunmoon/algorithms
[Fall 2020] Algorithms
algorithms algorithms-and-data-structures c data data-structures
Last synced: 01 Feb 2026
https://github.com/junkwaxhero/cardlists
Sports Card set lists in easily consumable JSON Format for databases, apps, websites, and more!
baseball baseball-cards baseball-data bowman data dataset datasets donruss fleer json json-schema panini topps upper-deck
Last synced: 24 Apr 2025
https://github.com/stdlib-js/array-typed-float-ctors
Floating-point typed array constructors.
array constructor constructors ctor ctors data dtype dtypes javascript node node-js nodejs stdlib structure type typed typed-array types utilities
Last synced: 24 Apr 2025
https://github.com/mihasm/arso-scraper
Unofficial Python CLI tool for downloading automated sensor weather data from the Slovenian Environment Agency.
api arso cli data historical-data meteorological python slovenia weather
Last synced: 14 Feb 2026
https://github.com/nixhantb/data-structures-and-algorithms-in-java-
Master Java Programming and Data Structures and Algorithms in Java in an efficient way. Clear concept on Recursion and Sorting
algorithms algorithms-and-data-structures competitive-programming data data-structures java java-8 programming
Last synced: 05 Jul 2025
https://github.com/satyam4229/college-predictor-system
The college predictor system is a Python-based application that utilizes a machine learning model to predict colleges and their corresponding degree programs and branches based on a student's JEE (Joint Entrance Examination) score.
data data-science jupyter-notebook kaggle prediction python
Last synced: 06 Apr 2026
https://github.com/sermetpekin/evdscpp
evdscpp is a C++ library for fast, efficient, and user-friendly interaction with the EVDS API Server. Designed with performance in mind, it provides built-in caching, an Excel export option, and an intuitive user interface for configuring and retrieving data. evdscpp can be extended for integration with other C++ projects and offers options for use
cbrt central-bank cpp data edds evds evds-api evdscpp tcmb tcmb-api
Last synced: 07 Sep 2025
https://github.com/antoineaugusti/purchasing-power
Archive daily data about purchasing power parity: how much goods should cost in various countries
archive data purchasing-power-parity
Last synced: 28 Oct 2025
https://github.com/helixspiral/ndbc
Golang wrapper for the National Data Buoy Center (NDBC)
data data-science golang government-data ndbc ndbc-buoy-data noaa noaa-api noaa-buoys noaa-data noaa-weather wrapper wrapper-api wrapper-library
Last synced: 14 Jun 2025
https://github.com/headless-start/data-augmentation-impact
This repository contains effect of Data Augmentation of Training Set during Model Training.
augmented-images cuda data gpu keras matplotlib mnist opencv-python python3 tensorflow training-data
Last synced: 05 Apr 2026
https://github.com/stimulsoft/samples-dashboards.js-for-react
JavaScript samples for Dashboards.JS data analysis tool for React applications
analyzer chart components constructor dashboard dashboards data designer export expression javascript js library parser react react-dashboard reactjs relation text viewer
Last synced: 09 Aug 2025
https://github.com/amethyst-php/address
The place where a person or organization can be found or communicated with. Contains fields such as: street, postal code, city, country etc... Can be used for example as a shipment address or as an invoice address.
address amethyst amethyst-package api data laravel
Last synced: 13 Aug 2025
https://github.com/thekartikeyamishra/data_cleaning_project
Welcome to the Data Cleaning and Visualization project! This repository demonstrates how to clean messy data and create insightful visualizations using Python with Pandas and Matplotlib.
data dataanalysis matplotlib matplotlib-pyplot pandas python
Last synced: 02 May 2026