data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum
- Last updated: 2026-06-27 00:07:33 UTC
https://github.com/ssiarhei115/customer-classification
Developing an ML model that predicts a bank customer's inclination to open a deposit
big-data big-data-analytics data data-science data-visualization mashine-learning
Last synced: 09 Apr 2025
https://github.com/nixinova/nzpolls
New Zealand polling data aggregation
data election-data election-polling graphing new-zealand nixinova polling polling-data
Last synced: 09 Apr 2025
https://github.com/yakupzengin/data-structures-and-algortihms
This repo contains implementations of data structures and algorithms in Java
algorithms algorithms-and-data-structures data structure
Last synced: 03 Dec 2025
https://github.com/rdmpage/checklist-of-the-freshwater-snails-of-sabah
Data from A preliminary checklist of the freshwater snails of Sabah (Malaysian Borneo) deposited in the BORNEENSIS collection, Universiti Malaysia Sabah https://doi.org/10.3897/zookeys.673.12544
checklist data gbif google-earth kmz sabah
Last synced: 09 Mar 2026
https://github.com/joaocarmo/react-very-simple-data-table
When all you want is a table
Last synced: 06 Mar 2025
https://github.com/simoneas02/data-science
🐍 A planning study to become a data scientist and to improve my current skills. 🤘🏼🌻
data data-analysis data-science data-visualization deep-learning machine-learning pandas python3 r sql
Last synced: 12 Apr 2026
https://github.com/nononoexe/setariaviridis
🌾 Field-collected data of green foxtail
data data-science dataset rpackage
Last synced: 27 Feb 2026
https://github.com/chaitanyac22/hr_policy_query_resolution_with_retrieval_augmented_generation_rag
This repository contains an HR Policy Query Resolution system using Retrieval-Augmented Generation (RAG). It leverages a 4-bit quantized Mistral-7B-Instruct-v0.2 LLM and JP Morgan Chase’s publicly available Code of Conduct documents to generate accurate, contextually relevant responses for HR policy queries.
artificial-intelligence data hr large-language-models llm mistral-7b nlp pipeline prompt-engineering quantization rag retrieval-augmented-generation
Last synced: 12 Feb 2026
https://github.com/perezrd5/publicdataprojects
These are public database and data analysis projects from the portfolio of Doug Perez
data data-model data-modeling data-models data-science data-structure data-structures database microsoft-sql-server mysql olap olap-cube oltp postgresql ssas ssis ssrs t-sql
Last synced: 13 Apr 2026
https://github.com/deepwaterpaladin/statscanpy
Basic package for querying & downloading StatsCan data by table name.
Last synced: 16 Jan 2026
https://github.com/jmbhughes/goes_solar_retriever
Tool to retrieve GOES-R Solar Data
data data-retrieval data-science goes-16 goes-satellite goes16 goes17 solar solar-physics
Last synced: 07 Jan 2026
https://github.com/azawawi/perl6-msgpack
Perl 6 Interface to libmsgpack
data messagepack msgpack perl6 wrapper
Last synced: 12 Jun 2025
https://github.com/tomasoak/datahopper
Python package for data engineering and data wrangling
data data-analysis data-engineering data-mining data-science data-structures data-wrangling datascience pandas python
Last synced: 12 Mar 2026
https://github.com/davorg/dmp
Data Munging with Perl
book data hacktoberfest munging perl
Last synced: 21 Jan 2026
https://github.com/malvfr/zap
Fill your database with fake data.
cli csv data database generator hacktoberfest mock node populate populate-database seed sql
Last synced: 21 Jan 2026
https://github.com/jrmedd/emojinal
An experimental API for determining emoji sentiment, based on research from Institut "Jožef Stefan", Slovenia.
data emojis sentiment user-research ux
Last synced: 19 Jan 2026
https://github.com/rohancyberops/r-language
R Language Projects directory. This repository contains various projects, scripts, and experiments developed using R, a powerful statistical computing and data visualization language.
caret cran data dplyr ggplot2 rlanguage rstudio shiny tidyverse
Last synced: 12 Oct 2025
https://github.com/geocollections/turvas
Database of peat geology
data data-visualization database estonia geology mineral-resources peat
Last synced: 05 Feb 2026
https://github.com/ompreetham/dcn-network-traffic-anomaly-detection
Data Communication Networks - Network Traffic Anomaly Detection
anomaly anomaly-detection communication data dcn keras learning machine machine-learning network pandas presentation project python scikit-learn tensorflow traffic
Last synced: 08 Apr 2026
https://github.com/lahcenezzara/whatsapp-scraping-python
WhatsApp Scraping Python
automation data python scraping selenium whatsapp
Last synced: 05 Feb 2026
https://github.com/souvik09-tech/adventure-works-kpi-dashboard
This repository contains a complete Business Intelligence solution for AdventureWorks, a global manufacturing company specializing in cycling equipment and accessories. Built using Power BI Desktop, this project helps track KPIs, analyze product performance, compare regional data, and identify high-value customers.
analysis data kpi powerbi visualization
Last synced: 27 Jan 2026
https://github.com/sanskaryo/ultimate-dsa-repo
One Stop Solution for DSA Learning and Resources
data data-structures-and-algorithms dsa hacktoberfest hacktoberfest-accepted hacktoberfest2025
Last synced: 15 Oct 2025
https://github.com/akv3sic/cryptocurrency-charts
Cryptocurrency API data visualizations 📈 with Matplotlib.
cryptocurrency data data-visualization matplotlib python
Last synced: 16 Oct 2025
https://github.com/charliecm/meteorite-landings
Data visualization of meteorite landings on Earth.
astronomy d3 data data-visualization mapbox space visualization
Last synced: 18 Apr 2026
https://github.com/gematik/poc-isik-patient-merge
The repository contains a proof of concept (POC). The POC demonstrates how a FHIR subscription can be used to notify about merges that have occurred within the ISIK context.
Last synced: 19 Oct 2025
https://github.com/jaldekoa/fiscaldataapi
A Python wrapper to easily retrieve data from the Fiscal Data (US Treasury) official API in pandas format.
api api-wrapper banking data finance pandas python united-states
Last synced: 27 Jan 2026
https://github.com/rodekruis/510-data-catalog
The project is a CKAN-based data catalog portal for 510
Last synced: 23 Jan 2026
https://github.com/garcane/Income-Prediction-ML
This is a machine learning project aimed at predicting whether an individual's annual income exceeds $50,000 based on their demographic and personal information.
data data-science machine-learning ml numpy pandas python random-forest scikit-learn
Last synced: 24 Oct 2025
https://github.com/doziestar/datavinci
DataVinci enables you to visualize data from various sources, generate insights, analyze data with AI models, and receive real-time updates on anomalies
Last synced: 23 Jan 2026
https://github.com/capire/xtravels-java
Travel booking app using master data from xflights built with CAP Java
cap cds data federation flights java reuse
Last synced: 23 Jan 2026
https://github.com/2kabhishek/pyramen
Data Analysis for Ramen 🍜💹
csv data data-analysis fun python report
Last synced: 26 Oct 2025
https://github.com/patrikmasiar/algorythm-of-the-night
Awesome list of algorithms that help you 🚀 Feel free to contribute 👨🏻‍💻
algorithms data interview-questions logic logic-programming math mathematics science
Last synced: 27 Oct 2025
https://github.com/bredalis/exceptions
Examples of exceptions 🚫
algotithms coding data exceptions language-programing python
Last synced: 04 Mar 2025
https://github.com/city-of-helsinki/drupal-helfi-tyollisyyspalvelut-manuaali
Manual for the service information repository of the municipal employment experiments
data drupal drupal-9 unemployment
Last synced: 24 Jan 2026
https://github.com/CheeseWithSauce/HadithsJSONFormat
Free, authentic Hadith data from sunnah.com organized bookwise specially for Muslim devs. Includes Arabic, English, and gradings. Use freely without credits. Collections: Bukhari, Muslim, Abu Dawud, Tirmidhi, Nasa'i, Ibn Majah, Malik, Riyad as-Salihin. Expanding soon, Inshallah.
api arabic data dev free hadith islam islamic muslim open-source quran sunnah
Last synced: 24 Feb 2026
https://github.com/ariqf1/learn_data
Currently learning and building projects related to data pipelines, ETL processes, and data processing using Python. Passionate about scalable data solutions and modern data stack tools.
Last synced: 15 Apr 2026
https://github.com/stdlib-js/array-one-to
Generate a linearly spaced numeric array whose elements increment by 1 starting from one.
array data float32array float64array int16array int32array javascript matrix ndarray node node-js nodejs stdlib structure typed typed-array types uint32array vector
Last synced: 26 Feb 2026
https://github.com/azeemmirza/structures
Structures Applied
data data-structures javascript typescript
Last synced: 14 Feb 2026
https://github.com/codenoid/webtoons.com-database
A Webtoons.com database, collected by Hofesh Bot (scraper)
Last synced: 28 Mar 2025
https://github.com/sandk21/etude_eau_potable_monde
Study of access to water worldwide - Dashboards built with Tableau
analysis data tableau tableau-public visualization
Last synced: 19 Mar 2026
https://github.com/kirkalyn13/portfolio-dashboard-site
Portfolio Site; Initially a Service Provider Metrics Dashboard using React.
dashboard data data-visualization react
Last synced: 15 Apr 2026
https://github.com/castdrian/kdapi
A TypeScript library that scrapes K-pop idol and group information from online sources to create comprehensive JSON datasets.
api data kpop scraper typescript
Last synced: 15 May 2025
https://github.com/giladbarnea/to
A simple CLI tool to convert and diff between JSON, YAML, TOML, JSON5 and Python collections.
conversion data data-conversion json json5 parser script terminal toml yaml
Last synced: 08 Feb 2026
https://github.com/jeanmanguy/milk-sci-fi
Census of every mention of milk in sci-fi works.
Last synced: 26 Feb 2026
https://github.com/3squared/smoulder
Smoulder is a really good data pipe
composition data facade-pattern forge-framework object-oriented
Last synced: 25 Apr 2026
https://github.com/BenSFGamer/B.I.O.S.
A biographer
academia agi ai ai-tools artificial-general-intelligence artificial-neural-networks automation data fact-checking information-extraction information-retrieval self-improving self-learning self-referential semi-autonomous software-engineering specialization web-agent web-scraping writer
Last synced: 27 Sep 2025
https://github.com/codenoid/alodokter.com-database
An Alodokter.com database, collected by Hofesh Bot (scraper)
alodokter data extraction hofesh
Last synced: 18 Mar 2026
https://github.com/alja7dali/swift-bits
A bite-sized library for dealing with bytes.
binary bit bits byte bytes comprehension data manipulation swift
Last synced: 09 Jun 2026
https://github.com/skygenesisenterprise/aether-account
Your cloud hub to securely manage all Aether services, profiles, and preferences in one unified dashboard. Fully open-source, fully cloud.
account data javascript nextjs platform service sso-service typescript user-interface
Last synced: 16 Apr 2026
https://github.com/tushar2704/applied-ai-playground
This repository serves as a comprehensive collection of resources and projects for Applied Artificial Intelligence (AI). Whether you're an AI enthusiast, a data scientist, or a developer looking to explore practical applications of AI, this repository aims to provide you with valuable materials and hands-on projects to deepen your understanding.
artificial-intelligence data data-science machine-learning machine-learning-algorithms
Last synced: 12 Feb 2026
https://github.com/makepath/medaprep
medaprep is a data preparation and feature engineering toolkit for geospatial applications.
data data-science datacleaning eda exploratory-data-analysis xarray
Last synced: 29 Jun 2025
https://github.com/ismail-mouyahada/lodscroljs-library
LodScrolJS is a lightweight, fast, and secure JavaScript library designed to load any type of content from APIs on scroll, helping to avoid loading too much data at once. It works seamlessly with various JavaScript frameworks
data data-visualization load-on-scroll loading loading-spinner loadonscroll scroll
Last synced: 13 Feb 2026
https://github.com/garcane/london-housing-price-dashboard
This Excel-based Housing Visual Dashboard provides a comprehensive view of average house prices across various boroughs in London from 1996 to 2013. The dashboard is designed to offer insights into housing market trends and price variations across different areas of London over time.
data data-analysis data-visualization excel visual
Last synced: 13 Feb 2026
https://github.com/frictionlessdata/cardealerdp
Cardealer DP (Car Dealer Data Package) is a data exchange format for car dealerships. It is developed on top of the Data Package standard
car data datapackage dealer exchange extension format
Last synced: 13 Feb 2026
https://github.com/stdlib-js/datasets-harrison-boston-house-prices-corrected
A (corrected) dataset derived from information collected by the US Census Service concerning housing in Boston, Massachusetts (1978).
boston data dataset datasets house housing javascript linear-regression node node-js nodejs prediction prices statistics stats stdlib value
Last synced: 15 Feb 2026
https://github.com/m-rishab/stock_trend-analysis-power-bi-project-
In this project, I've harnessed the robust capabilities of Power BI to analyse, visualize, and uncover the story behind HUL's stock performance.
data datavisualization datavisualization-project powerbi
Last synced: 19 Mar 2026
https://github.com/ghonimo/diode-pn-junction-characterization-psu-ece515
A detailed analysis of the I-V characteristics of a PN junction diode (1N4148) under different temperatures, utilizing Excel for graphical analysis and parameter extraction. This study was conducted as part of the ECE 515: Fundamentals of Semiconductor Devices course at Portland State University.
analysis characterization data device diode diodes excel mosfet-transistor pn-junction
Last synced: 28 Feb 2026
https://github.com/docusign/extension-app-data-io-reference-implementation
Extension App for Data IO Reference Implementation for the Docusign IAM Platform
Last synced: 02 Mar 2026
https://github.com/mohamedhany99/human-voice-identifier-counter
An application developed in Kivy that identifies the users imported into the dataset using a support vector machine model. It has two features: importing a new voice, and detection, which detects human voices and counts them.
android android-app android-application automation automation-framework data data-analysis data-mining data-science data-visualization datascience kivy kivy-framework machine-learning python
Last synced: 27 Mar 2026
https://github.com/theryston/db-mycro
A Node module providing a JSON database that saves data in a specific directory, similar to SQLite but in JSON
base crud data database db db-mycro javascript json jsondatabase nodejs nosql typescript
Last synced: 09 Apr 2026
https://github.com/mews-labs/dataframe-memory
This tool aims to provide a simple way to save memory when using pandas DataFrames.
data data-science memory-usage pandas-dataframe python3
Last synced: 22 May 2026
https://github.com/themost-framework/memory
MOST Web Framework in-memory data adapter for testing environments
Last synced: 06 Mar 2025
https://github.com/gallo13/neuralnetworks-deeplearning-stats-classification
Descriptive Statistics, Classification and Analysis Using Python & Python Libraries (Assignment 1)
analysis data datasets deep-learning jupyter-notebook matplotlib neural-networks numpy pandas plotting python seaborn
Last synced: 17 Apr 2026
https://github.com/timmymatten/spikeball-stat-tracker
Spikeball stat tracking web app built with Streamlit and Python, designed to easily log and analyze player performance over multiple games.
data data-analysis data-visualization dataset matplotlib-pyplot multipage python spikeball statistics streamlit
Last synced: 18 Apr 2026
https://github.com/thechibuzornwachukwu/bluesky-scraper
This is a work of art that enables you to scrape data off BlueSky.
analytics bluesky bluesky-api bluesky-client data datascraper-framework datascraping scraping social-media web webscraping
Last synced: 16 Nov 2025
https://github.com/adriweb/wsualizer
Some random code to visualize things coming from a websocket (pronounced 'visualizer')
bootstrap data html jquery real-time visualization visualizer websockets
Last synced: 20 Apr 2026
https://github.com/yukti-09/extracting-data-from-twitter
Data From Twitter!
data data-mining extracting-data timeline tweepy tweets twitter
Last synced: 11 Oct 2025
https://github.com/bredalis/functionalprogrammingpython
💻 Functional Programming in Python
data functional-programming functions programing programming-language python structured-data
Last synced: 06 Jun 2026
https://github.com/avitai/datarax
A Differentiable Data Pipeline Framework for JAX
autograd data data-analysis data-science differentiable flax-nnx jax jit machine-learning xla
Last synced: 25 Apr 2026
https://github.com/rohancyberops/rp1
This project performs an analysis of Starbucks (SBUX) stock returns using R. The analysis includes both simple returns and continuously compounded returns (CC returns) for a period of one month. It also calculates the growth of $1 invested in SBUX and provides visual insights through various plots.
analysis cc data r rlanguage sbux
Last synced: 15 Mar 2025
https://github.com/aidenellis/connectmp
🍰 ConnectMP - An easy way to share data between Processes in Python.
aidenellis connectmp data data-sharing multiprocessing process sharing
Last synced: 27 Apr 2026
https://github.com/apfirebolt/data-structures-and-algorithms-in-python
Data Structure and Algorithms in Python
algorithms data data-structures python python3 tkinter-gui
Last synced: 15 Mar 2025
https://github.com/ucd-cws/nitrates-cv
california centralvalley data frep groundwater model nitrates
Last synced: 16 Jan 2026
https://github.com/mohsinali08000/myportfolio
I’m Mohsin Ali, a passionate software engineer with over 2 years of experience in developing robust software solutions. Currently transitioning into the field of data science.
Last synced: 22 Apr 2026
https://github.com/jackosheadev/databasetechproject
This is a repo for a database project which involves creating tables, populating them, viewing data with selects and finally simulating a transaction
Last synced: 18 May 2026
https://github.com/oefenweb/python-untraceables
Randomizes IDs for a given set of tables making them untraceable across environments
anonymize data database mysql privacy python python2 python3 randomization
Last synced: 03 Feb 2026
https://github.com/sodascience/open_supply_hub
Processing supply chain data obtained from Open Supply Hub
data global-supply-chain open-supply-hub python
Last synced: 29 Apr 2026
https://github.com/chompfoods/stub-asp-net-core
ASP.NET Core server stub for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.
api asp asp-net-core aspnetcore branded chomp data database food grocery ingredients nutrition raw recipe-api recipes server stub stub-server
Last synced: 30 Apr 2026
https://github.com/leomsgit/extrator-de-parametros-analise-hemograma-e-bioquimico
Python software that scans PDF files and extracts parameters directly into an Excel file
analysis data excel excel-export google-colab hemogram jupyter-notebook pdf pdf-document-processor pdf-viewer python python3
Last synced: 01 May 2026
https://github.com/antononcube/raku-data-cryptocurrencies
Raku package for cryptocurrency data retrieval.
Last synced: 02 Apr 2025
https://github.com/athari22/house_sales_in_king_count_usa
The idea of the project is to perform a data analysis for a Real Estate Investment Trust. The Trust would like to start investing in residential real estate.
analysis data data-science data-visualization ibm ibm-watson linearregression machine-learning matplotlib numpy pandas sklearn-library
Last synced: 01 May 2026
https://github.com/henrylin03/china-gdp
Analysis and visualisation of China GDP data using Python.
data data-analysis data-visualisation dataset kaggle pandas
Last synced: 01 May 2026