data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-06-27 00:07:33 UTC
- JSON Representation
https://github.com/imtiaz-emu/exploratory-data-analysis-with-r
Data Transformation, Descriptive statistics, data visualization, Linear regression using R
data dplyr ggplot2 r rstudio visualization
Last synced: 15 Mar 2025
https://github.com/dongminlee94/data-visualization-tutorial
A repository for data visualization tutorial
data data-science data-visualization matp matplotlib pca plotly python seaborn t-sne tutorial umap visualization
Last synced: 29 Apr 2026
https://github.com/divithraju/divith-raju-searchengine-wikipedia
search engine optimizationA complete search engine experience built on top of 75 GB Wikipedia corpus with subsecond latency for searches. Results contain wiki pages ordered by TF/IDF relevance based on given search word/s. From an optimized code to the K-Way mergesort algorithm, this project addresses latency, indexing, and big data challenges.
algorithms data dataengineering inverted-index linux merge-sort nlp project project-repository python3 serchengine software-engineering ubuntu wikipedia
Last synced: 16 May 2026
https://github.com/karashiiro/lodestone-id-time
Data scraper, formula and reference implementation for the estimated creation time of a FFXIV character given its Lodestone ID.
data ffxiv ffxiv-character lodestone
Last synced: 30 Jun 2025
https://github.com/mahmoud-saeed-mahmoud/loading_state_handler
The StateHandlerWidget manages different UI states—loading, error, empty, and normal—allowing you to customize the displayed widgets for each state.
dart data error flutter flutter-package flutter-widget loading state
Last synced: 10 Mar 2026
https://github.com/kylekirkby/cardatasnatch
CarDataSnatch allows you to quickly find information about a car in the uk using a valid number plate. Grab an image of the car in question along with a multitude of other data. Compare two cars' data for fast and easy analysis.
beautifulsoup cars command-line-tool data data-analysis data-mining ethical-hacking python python3 requests scraper social-engineering
Last synced: 15 Apr 2025
https://github.com/andrew-johnson-4/misspeller
Take correctly spelled words and return common spelling mistakes
common-mistakes data language natural nlp processing rust
Last synced: 30 Apr 2025
https://github.com/zalweny26/tools
Just a bunch of tools made in TypeScript.
algorithms data dimensionality distances helpers reduction sortings structures tools utils
Last synced: 03 Feb 2026
https://github.com/hasnocool/war_thunder_camouflage_scraper
A concurrent web scraper designed to collect camouflage information from war thunder aircrafts.
asyncio camouflage concurrent data execution handling playwright python scraping signal sqlite3 thunder war web
Last synced: 04 Jan 2026
https://github.com/thomas-nyanumba/r-programming-air-pollution_disease-project
Personal R Programming Project
aggregate-functions boxplot-visualization data dpylr ggplot2 leftjoin linear-regression patchwork powerquery r readxl scatter-plot tidyr visualization
Last synced: 25 Mar 2025
https://github.com/imranhsayed/programming-in-c
Programming in C
array c c-programming circular-linked-list cprogramming data data-structures-and-algorithms file-handling linked-list pointers
Last synced: 28 Jan 2026
https://github.com/squareslab/probabilisticmodel_saner2018
Paper and supporting materials of the Probabilistic Model paper Accepted to SANER 2018
code data mausotog published replication
Last synced: 26 Oct 2025
https://github.com/guslovesmath/top_tech_sp_500_forecasting
Forecasting the stock market is difficult. I sought to observe the relationship between Apple's stock price and others in the S&P500. In doing this, I was able to conclude that stocks in the tech industry can help predict a trend in Apple's Percent change.
arima-forecasting arima-model data data-science forecasting vector-autoregression
Last synced: 14 Mar 2025
https://github.com/slashdotted/pomapure
PoorMan's Pipeline
data json modular module pipeline processing
Last synced: 18 Apr 2026
https://github.com/rikvdh/zabuffer
Zero-Allocation buffer handling in C
buffer c clib data embedded memory string zero-allocation
Last synced: 03 Mar 2025
https://github.com/joaocarmo/react-very-simple-data-table
When all you want is a table
Last synced: 06 Mar 2025
https://github.com/mo-karbalaee/introduction-to-data-science-sbu
Reports and full documentation of the introduction to data science course held at SBU
data data-science python shahid-beheshti-university
Last synced: 02 Aug 2025
https://github.com/frefrik/covid19norge-data
🦠 COVID-19 Datasets for Norway
covid covid-19 covid19 covid19-data csv data datasets norge norway norwegian smittestopp vaccine
Last synced: 09 Apr 2026
https://github.com/sabujxi/python-scraper-and-data-analysts-admin-panel-in-django
A data scraper from texas govt site and a helping web app for managing, reviewing and editing the data
analyst data data-analysis data-entry data-scraper django django-application python python-scraper real-estate regex scraper texas
Last synced: 30 Apr 2026
https://github.com/yashmistry-24/ytcomment-iq
YTComment-IQ is a web app for analyzing and visualizing YouTube comments, offering insights through sentiment analysis, topic modeling, and interactive charts.
analysis comments data dataanalysis dataanalytics deep-learning machine-learning nlp python streamlit training visualization webapp youtube
Last synced: 15 Feb 2026
https://github.com/eyedia/idpe
Eyedia's Integrated Data Processing Environment
csharp data designer development development-environment development-tools development-workflow environment ide no-coding parser processing rehosted workflow
Last synced: 11 Oct 2025
https://github.com/deepwaterpaladin/statscanpy
Basic package for querying & downloading StatsCan data by table name.
Last synced: 16 Jan 2026
https://github.com/mikeintoshsystems/hispmd
HIS Performance Monitoring Dashboard
api dashboard data dhis2 dhis2-api docker docker-compose hispmd mfr rest-api visualization web
Last synced: 08 Apr 2025
https://github.com/bradlindblad/quotableoffice
Repo for the quotable office R Shiny app
data datascience golem-apps r shiny shiny-apps text text-mining
Last synced: 26 May 2026
https://github.com/mrnazu/eth-data-library
eth-data-library is a Nodejs library that provides tools for accessing and processing data on the Ethereum blockchain.
blockchain data ethereum nodejs smart-contracts web3
Last synced: 28 Jan 2026
https://github.com/yetnt/ump
These utils are useless
area data distance factorization factors gcd-calculator javascript math mean median mode numbers pattern prime range rate ratio temprature temprature-converter volume
Last synced: 03 Feb 2026
https://github.com/vikyw89/usesyncv
a simplistic react global store with pregenerated CRUD, and built in async fetch
data fetch mobx reactjs reactquery redux state state-management store swr zustand
Last synced: 06 Jan 2026
https://github.com/csadorf/pydata-ann-arbor-2018
Slides and notebooks demonstrating signac for PyData Ann Arbor Meetup 2018
data data-management jupyter signac workflow
Last synced: 04 Jun 2026
https://github.com/utrechtuniversity/dataprivacyproject
This is the repository underlying the landing page for the Data Privacy Project @UtrechtUniversity, the Netherlands.
data gdpr open-science privacy rdm research research-data-management utrecht-university
Last synced: 10 Oct 2025
https://github.com/stdlib-js/datasets-suthaharan-multi-hop-sensor-network
Labeled wireless sensor network data set collected from a multi-hop wireless sensor network deployment using TelosB motes.
data dataset datasets javascript labeled machine-learning ml mote motes network node node-js nodejs outlier outliers sample sensor statistics stats stdlib
Last synced: 10 Oct 2025
https://github.com/d3oxy/country-state-data
A comprehensive JSON dataset containing countries, states, cities, regions, and languages with TypeScript support. Perfect for building location-based dropdowns, address forms, and geographical applications.
address cities countries currency data dropdown geographical iso json languages location regions states typescript
Last synced: 24 Jan 2026
https://github.com/quin1sue/priceguidesph-bettergov
an economic and financial data platform project under bettergov.ph
bettergovph cloudflare data hacktoberfest nextjs priceguides
Last synced: 05 May 2026
https://github.com/noklam/blog_archive_fastpage
Nok's data science blog
blog data data-science machine-learning python sceince
Last synced: 01 May 2026
https://github.com/infinitode/pwlds
A public dataset of over 10 million passwords, with assigned strength levels.
ai classes classification cyber-security data dataset ml open-source password passwords synthetic-data
Last synced: 22 Feb 2026
https://github.com/kaos599/apollo-synthetic-data-generator
Apollo is a Python GUI application designed to simplify the complex process of generating random data based on fixed values. It allows users to generate various types of binary datasets, such as Yes/No type questions, by specifying probabilities.
data data-engineering data-generation data-generator data-science faker-library machine-learning tkinter-gui
Last synced: 22 Jul 2025
https://github.com/flrd/standardlastprofile
R Data Package for BDEW Standard Load Profiles in Electricity
Last synced: 16 Mar 2026
https://github.com/vatshayan/final-year-project-image-recognition
Machine Learning project to recognize faces from an Image
btech computerscience data facial final image imageclassification learning machine project recognition science students year
Last synced: 29 May 2026
https://github.com/lastancientone/amd-vs-nvda
Analyzing 2 technology stocks using Master Analyst Program (MAP).
data data-analysis data-structures data-visualization excel forecasting time-series-analysis
Last synced: 15 May 2025
https://github.com/drkenreid/introductory-data-science
Hands-on machine learning tutorials in Google Colab, covering various algorithms and techniques for learners at different levels.
cnn data data-science deep-learning learning-datascience learning-machine-learning learning-python neural-network neural-networks regression rnn science tutorial tutorial-exercises tutorials
Last synced: 28 Jan 2026
https://github.com/gonzalezlrjesus/covid-19API
Convierte la data ofrecida por: the Johns Hopkins University Center en formato CSV al formato JSON sobre los casos confirmados, muertos y recuperados de COVID-19 por paises.
api api-rest api-server coronavirus covid-19 data go golang json
Last synced: 06 May 2025
https://github.com/machu-gwu/constant2-project
provide extensive way of managing your constant variable.
configuration constants data developer-tools python
Last synced: 26 May 2026
https://github.com/dantesc03/uberpool-case-study
This project was designed to understand the statistical effects of longer wait times on uber rides. Particularly on the user and driver experience with the Uber Pool System.
analysis data excel jupyter jupyternotebooks learn python seaborn statistics t-tests uber visualization
Last synced: 16 Apr 2026
https://github.com/ballerina-platform/module-ballerina-data.csv
The Ballerina CSV Data Library is a comprehensive toolkit designed to facilitate the handling and manipulation of CSV data within Ballerina applications. It streamlines the process of converting CSV data to native Ballerina data types, enabling developers to work with CSV content seamlessly and efficiently.
ballerina ballerina-csv csv csv-data data
Last synced: 29 Jan 2026
https://github.com/asirihewage/simplest-xpath-web-scraper
Simplest web scraper created using Python3 and MongoDB
data data-mining python3 scraper web webscrping
Last synced: 29 Jan 2026
https://github.com/evoluteur/web-scraper-sitemaps
Sitemaps for the Web Scraper Chrome extension.
chrome-extension data dataset scraper scraping scrapper scrapping scrapy-crawler sitemap web-scraper web-scraping
Last synced: 04 Jun 2026
https://github.com/farovictor/mongodbextractor
This project is intended to be used as a data extractor to support ELT pipelines or any kind of process that requires a heavy data dump from MongoDb databases.
Last synced: 14 Jan 2026
https://github.com/slipke/eurlex-model-go
This projects implements the EUR-Lex XML data model in Golang. For more information see README.md
data datamodel eur-lex eurlex webservice
Last synced: 09 Mar 2026
https://github.com/muhammadibrahim313/start-your-data-science-journey
In this Repo i will be Sharing all Resources that we will be Learning during December Data Science Workhops on iCode Guru
btajicrew data data-science eda icodeguru machine-learning matplotlib pandas python
Last synced: 03 Feb 2026
https://github.com/rudxain/ideas
A collection of my non-started projects
brain-storms brainstorming broken concepts crap data dreams experiments graphics hardware inspiration lazy mono-repository monorepo pet-project proposals software text unfinished wishes
Last synced: 06 Feb 2026
https://github.com/kocyigitkim/realtime.io
Real time data streaming & socket programming library
data realtime socket streaming
Last synced: 29 Jul 2025
https://github.com/georgetdn/syscppcplinux
Store Linux C++ class data in a file ( persistence ) and manipulate it programmatically or using Small SQL (included)
class data framework linux object persistence serialize sql
Last synced: 12 Feb 2026
https://github.com/sapienzanlp/exploring-srl
Repository for the paper "Exploring Non-Verbal Predicates in Semantic Role Labeling: Challenges and Opportunities"
acl acl2023 conllu data dataset natural-language-processing nlp semantic-role-labeling srl
Last synced: 31 Jan 2026
https://github.com/stdlib-js/utils-compact-adjacency-matrix
Compact adjacency matrix.
adjacency dag data data-structure data-structures graph javascript matrix node node-js nodejs stdlib structure topological toposort tsort util utilities utility utils
Last synced: 15 Apr 2026
https://github.com/eesunmoon/algorithms
[Fall 2020] Algorithms
algorithms algorithms-and-data-structures c data data-structures
Last synced: 01 Feb 2026
https://github.com/macsual/dotgov-jamaica-domains
A listing of .gov.jm domains.
Last synced: 03 Jan 2026
https://github.com/instaclustr/cassandra-parquet-transformer
Transform SSTables from Apache Cassandra to Parquet or Avro files, locally or remotely via Apache Cassandra Sidecar
analytics apache apache-cassandra avro big cassandra data parquet spark sstable transformation
Last synced: 29 Aug 2025
https://github.com/dhimmel/het.io-rep-data
Data from Project Rephetio for the het.io website
browser data datatables drug-repurposing rephetio
Last synced: 07 Feb 2026
https://github.com/StudyResearchProjects/arrbuffstr
Creates Strings from ArrayBuffers and viceversa in NodeJS and the Browser
arraybuffer browser data node string transform
Last synced: 09 Oct 2025
https://github.com/espoirmur/balobi_nini
An End to End Data Science Project, where I used Tweepy and Airflow to collect tweets related to the DRC and topic modeling technics to discover which topics Congolese are talking about on Twitter.
Last synced: 24 Aug 2025
https://github.com/jaldekoa/nyfedapi
A Python wrapper to easily retrieve data from the Federal Reserve Bank of New York (FRBoNY) official API in pandas format.
api api-wrapper banking data finance pandas python united-states
Last synced: 08 Feb 2026
https://github.com/lovethebomb/data-tiles
🍜 Data Tiles is a small website that shows data.
data express javascript nextjs typescript
Last synced: 10 Apr 2026
https://github.com/nononoexe/setariaviridis
🌾 Field-collected data of green foxtail
data data-science dataset rpackage
Last synced: 27 Feb 2026
https://github.com/e-candeloro/data-analysis-code-snippets-for-pandas-and-sklearn
These notebooks are useful to learn how to load, understand, clean and classify data using Pandas and Sklearn with Python
analysis big-data classification data datascience datavisualization machine-learning notebook numpy pandas python sklearn
Last synced: 10 Apr 2026
https://github.com/relintai/ess_data
Godot plugin that helps to create/manage resource files.
addon data data-management godot
Last synced: 18 Aug 2025
https://github.com/divithraju/divith-raju-openmetadata
Open Standard for Metadata. A Single place to Discover, Collaborate and Get your data right.
automation bigdata bigdataanalytics data data-structures dataengineering datascience hacktoberfest2022 metadata metadata-extraction
Last synced: 20 Feb 2026
https://github.com/ctechhindi/auto-fill-form-data
AUTO FILL AND AUTOCOMPLETE USER DATA WITH KEY NAME
autocomplete chrome-extension data extension
Last synced: 17 Apr 2026
https://github.com/qit-tools/unicode-emoji-json-lite
This library provides a lightweight version of the unicode-emoji-json library.
data emoji emojipedia emojis json lite unicode
Last synced: 07 Jan 2026
https://github.com/bkamapantula/india-pc-nfhs4
Parliamentary constituency factsheet for indicators of nutrition, health, and development in India using NFHS4 data.
data government health india nfhs nfhs4
Last synced: 19 Mar 2026
https://github.com/mskian/tamil-words
Tamil words Collections with English Meaning - API and SQL Data.
api data javascript json json-api mysql pdo php sql tamil tamil-language tamil-sms tamilwords translate translator
Last synced: 14 Apr 2026
https://github.com/countervolts/apple-music-stats-calculator
how to get your most streamed songs/artists
apple apple-music applemusic calculator data
Last synced: 11 Feb 2026
https://github.com/cosmos-loops/cosmos-efcore
Cosmos.EntityFrameworkCore is a part of Cosmos.Data, a inline project of COSMOS LOOPS PROGRAMME. This repository provides a package of Microsoft.EntityFrameworkCore to improve development efficiency.
cosmos-loops data efcore entityframeworkcore
Last synced: 14 Aug 2025
https://github.com/rdmpage/checklist-of-the-freshwater-snails-of-sabah
Data from A preliminary checklist of the freshwater snails of Sabah (Malaysian Borneo) deposited in the BORNEENSIS collection, Universiti Malaysia Sabah https://doi.org/10.3897/zookeys.673.12544
checklist data gbif google-earth kmz sabah
Last synced: 09 Mar 2026
https://github.com/0xdir/relief_web_dart
A Future-based wrapper around the Relief Web API, to retrieve information on humanitarian news, reports, training, jobs, and disasters
api dart data humanitarian jobs
Last synced: 11 Jun 2026
https://github.com/banyan-team/banyan-julia-examples
Adventures in massively parallel cloud computing with Banyan Julia!
banyan data data-analytics data-processing data-science julia
Last synced: 02 May 2026
https://github.com/bredalis/kpopnews
A place to see kpop news 📝
backend css data feedparser flask frameworks frontend html jinja2 kpop mongodb mongodb-atlas news newsletter os pages pymongo python requests web
Last synced: 12 Feb 2026
https://github.com/chaitanyac22/hr_policy_query_resolution_with_retrieval_augmented_generation_rag
This repository contains an HR Policy Query Resolution system using Retrieval-Augmented Generation (RAG). It leverages a 4-bit quantized Mistral-7B-Instruct-v0.2 LLM and JP Morgan Chase’s publicly available Code of Conduct documents to generate accurate, contextually relevant responses for HR policy queries.
artificial-intelligence data hr large-language-models llm mistral-7b nlp pipeline prompt-engineering quantization rag retrieval-augmented-generation
Last synced: 12 Feb 2026
https://github.com/perezrd5/publicdataprojects
These are public database and data analysis projects from the portfolio of Doug Perez
data data-model data-modeling data-models data-science data-structure data-structures database microsoft-sql-server mysql olap olap-cube oltp postgresql ssas ssis ssrs t-sql
Last synced: 13 Apr 2026
https://github.com/woo071002/parcel-management-system
A Parcel Delivery Management System streamlining deliveries with features for admin, users, and delivery personnel, including real-time tracking, delivery requests, and personalized dashboards.
cors csharp data dotenv html-css iconfont jkuat land-information-system mongodb python react-router-dom sass tech-expo xaml
Last synced: 08 Oct 2025
https://github.com/woctezuma/geforce-leak
Fetch data from the Geforce leak.
data datamining egs epic epic-games epic-games-launcher epic-games-store geforce geforce-experience geforce-leak geforce-now geforce-now-leak geforcenow geforcenow-leak graphql leak leaks nvidia steam steam-games
Last synced: 02 May 2026
https://github.com/colour-science/colour-demosaicing-tests-datasets
Colour - Demosaicing - Tests Datasets
color color-science color-space color-spaces colorspace colorspaces colour colour-science colour-space colour-spaces colourspace colourspaces data dataset datasets de-mosaicing debayering demosaicing demosaicking raw
Last synced: 19 Mar 2026
https://github.com/mihasm/arso-scraper
Unofficial Python CLI tool for downloading automated sensor weather data from the Slovenian Environment Agency.
api arso cli data historical-data meteorological python slovenia weather
Last synced: 14 Feb 2026