data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-07-02 00:07:45 UTC
- JSON Representation
https://github.com/cbartram/advancedai
AdvancedAI Selection Option for Command and Conquer Generals Zero Hour
Last synced: 30 May 2026
https://github.com/jinsyin/datalink
⚡ 数据集成 | DataLink is a lightweight data integration framework build on top of DataX, Spark and Flink
batch big-data bigdata cdc data data-collection data-exchange data-integration data-pipeline data-synchronization datalink etl flink flink-cdc framework integration pipeline spark streaming
Last synced: 19 Jul 2025
https://github.com/arcticsnow/climatepy
Collection of tools to perform timeseries analysis on climate data (Observation and Downscaled)
climate data era5 meteorological-data noaa-data pandas timeseries weather wmo xarray
Last synced: 05 Feb 2026
https://github.com/secret-guest/file_organizer
Files Organizer is a versatile tool for sorting and organizing files efficiently, ideal for managing recovered data.
c c-development data data-recovery file-management file-manager files sorting sorting-algorithms subdirectories subdirectory
Last synced: 10 Jun 2026
https://github.com/ashwinpn/visualization
Data Visualization using Matplotlib, Pandas Visualization, Seaborn, ggplot, and Plotly.
analysis data data-analysis data-science data-visualization graphs plots python python3 visualization
Last synced: 13 Apr 2026
https://github.com/leapfrogtechnology/datamegh
Datamegh - Data Engineering for the cloud.
cloud cloud-native data datamegh docker megha python serverless
Last synced: 14 May 2026
https://github.com/fabriciopsouza/covid-19-demographic-social-dataset
A social demographic dataset for analysis of the COVID-19 pandemic.
alteryx coronavirus coronavirus-analysis coronavirus-dataset covid-19 covid19 covid19-data data data-science dataset enrichment-analysis timeseries timeseries-analysis timeseries-clustering timeseries-covid-19 timeseries-database timeseries-segmentation timeseriesclassification
Last synced: 31 May 2026
https://github.com/imtiaz-emu/exploratory-data-analysis-with-r
Data Transformation, Descriptive statistics, data visualization, Linear regression using R
data dplyr ggplot2 r rstudio visualization
Last synced: 15 Mar 2025
https://github.com/feltex/datahora-java
Aprenda a trabalhar com Data e Hora em Java com as novas classes LocalDateTime, LocalDate, DateTimeFormatter e outras novidades do pacote java.time.
brasil data date dateformat dateformat-brazil datetime hora java java11 localdatetime locale localization zoneddatetime
Last synced: 18 Jun 2026
https://github.com/bastgau/snow-revoke-privileges
Script designed to simplify the management of permissions in your Snowflake databases.
data database dba dev-container python snowflake
Last synced: 20 Apr 2025
https://github.com/marek-jakub/monitoring
A university project concerning field data management for bird ringers.
bird data fieldwork management ringing
Last synced: 24 Jun 2026
https://github.com/henrylin03/video-games
Using Python and SQL to clean, analyse and visualise video games' data from Metacritic. Includes scraping using BeautifulSoup.
analysis beautifulsoup beautifulsoup4 data data-analysis data-science eda jupyter-notebook pandas python sql sqlite3 video-game video-games
Last synced: 14 Apr 2026
https://github.com/xsolla/data-fast-insights
Xsolla data analytics tool for fast business insights and reporting.
analytics data data-analysis data-science python reporting xsolla
Last synced: 29 Jun 2026
https://github.com/doctorlai/hex-viewer
Simple File Viewer in HEX
application data files hacktoberfest hex-viewer hexeditor hexidecimal web-app
Last synced: 09 Oct 2025
https://github.com/praveenpuglia/css-support
The source of truth for CSS browser support of info
api browser compatibility css data properties selectors support
Last synced: 31 Mar 2025
https://github.com/eyedia/idpe
Eyedia's Integrated Data Processing Environment
csharp data designer development development-environment development-tools development-workflow environment ide no-coding parser processing rehosted workflow
Last synced: 11 Oct 2025
https://github.com/danielbayley/schemas
A collection of useful @JSON-schema-org schemas for data validation.
ajv config configuration data data-science data-structures data-validation json json-schema linter linting schema schema-org validation yaml yaml-configuration
Last synced: 13 Oct 2025
https://github.com/skywarth/fenrir-wolfpack-simulator
Simulating wolfpack behaviours and future of the pack in an environment using Javascript and data trees.
data data-structures javascript max-heap simulation simulations wolfpack
Last synced: 14 Oct 2025
https://github.com/a3r0id/lightshot-data-miner
A random idea I had a while back to make a data miner for lightshot. Never released this but after a friend sent me a post about lightshot's transparency I figured it'd be a good time to release this. I've included some output from a run before making the repo. I am not responsible for the imagery or it's contents.
brute-force bruteforce data dataset face-recognition image-processing lightshot mining scraper scraping text-recognition
Last synced: 19 Oct 2025
https://github.com/amacd31/daily_hydromet_sample_data
This repository contains streamflow, precipitation, and potential-evapotranspiration data for the Twentymile Creek USGS streamflow station.
data dataset hydrology potential-evapotranspiration precipitation public-domain streamflow
Last synced: 16 Jan 2026
https://github.com/natylaza89/covid19-il
Python package which brings a "Facade" interface for the client for using official covid 19 data of israeli data gov. ★19K+ Downloads★
api covid covid19 covid19-data data israel pandas python
Last synced: 13 Apr 2026
https://github.com/audeering/emodb
Publishes Berlin Database of Emotional Speech with audb
Last synced: 19 Oct 2025
https://github.com/freight-trust/edi-onboarding
ESC Guidelines for X12/EDIFACT Messages
b2b data data-interchange edi edi-xml edifact enterprise x12
Last synced: 04 Mar 2026
https://github.com/squareslab/probabilisticmodel_saner2018
Paper and supporting materials of the Probabilistic Model paper Accepted to SANER 2018
code data mausotog published replication
Last synced: 26 Oct 2025
https://github.com/joaocarmo/react-very-simple-data-table
When all you want is a table
Last synced: 06 Mar 2025
https://github.com/blakedrumm/scvmm-scripts-and-sql
The Scripts provided here are compatible with System Center Virtual Machine Manager
collector data powershell scripts scvmm sql
Last synced: 11 May 2025
https://github.com/mlr-org/mlr3data
Data sets used in the book, gallery, or in examples of mlr3.
data data-science data-sets machine-learning mlr3 r r-package
Last synced: 09 Apr 2025
https://github.com/cdcgov/nchsdata
NCHS data: public use files (PUFs) from the National Center for Health Statistics (NCHS)
data public-health r survey survey-data
Last synced: 13 Apr 2026
https://github.com/lmantw/binarion
A simple binary format for storing JavaScript objects.
binary data decoding encoding format javascript
Last synced: 02 Sep 2025
https://github.com/slipke/eurlex-model-go
This projects implements the EUR-Lex XML data model in Golang. For more information see README.md
data datamodel eur-lex eurlex webservice
Last synced: 09 Mar 2026
https://github.com/1sumer/sql
This repository contains SQL scripts and data for various analytical and database management tasks. The project is designed to demonstrate SQL capabilities in handling complex queries, data analysis, and database design. It includes datasets related to e-commerce and streaming services, with a focus on real-world scenarios and use cases.
analytics data data-analysis data-storage sql vscode
Last synced: 19 Jan 2026
https://github.com/e-candeloro/data-analysis-code-snippets-for-pandas-and-sklearn
These notebooks are useful to learn how to load, understand, clean and classify data using Pandas and Sklearn with Python
analysis big-data classification data datascience datavisualization machine-learning notebook numpy pandas python sklearn
Last synced: 10 Apr 2026
https://github.com/camara94/data-visualization-with-python
Data visualization and some of the best practices when creating plots and visuals. The history and architecture of Matplotlib, and how to do basic plotting with Matplotlib. Generating different visualization tools using Matplotlib such as line plots, area plots, histograms, bar charts, box plots, and pie charts. Seaborn, another data visualization library in Python, and how to use it to create attractive statistical graphics. Folium, and how to use to create maps and visualize geospatial data.
data data-science data-structures data-visualization python3
Last synced: 16 May 2026
https://github.com/espoirmur/balobi_nini
An End to End Data Science Project, where I used Tweepy and Airflow to collect tweets related to the DRC and topic modeling technics to discover which topics Congolese are talking about on Twitter.
Last synced: 24 Aug 2025
https://github.com/ngambip/diabetes_factors_2024
Exploring BMI Categories and Health Factors.
dashboards data datacleaning dax-languague powerbi sql sqlstudio tsql visualization
Last synced: 03 Mar 2026
https://github.com/poncoe/passdatatoanotherfragment
Latihan Passing data Ke Fragment Lain
android android-app android-application android-studio data fragment fragments kotlin kotlin-android passing-parameters passingdataintent viewmodel
Last synced: 23 Jun 2026
https://github.com/yanpitangui/iteminfoconverter
Application that converts ragnarok legacy data files to iteminfo.lua
data itemdbconf iteminfo luafiles ragnarok
Last synced: 12 Oct 2025
https://github.com/erwan-simon/aws-data-platform-framework
A unified framework to industrialize data ingestion, transformation and pipeline execution on AWS using Terraform, from infrastructure provisioning to runtime execution, designed as a reusable and standalone data platform.
aws data data-framework datalake docker iceberg python spark step-functions terraform terraform-module
Last synced: 23 May 2026
https://github.com/infinitode/pwlds
A public dataset of over 10 million passwords, with assigned strength levels.
ai classes classification cyber-security data dataset ml open-source password passwords synthetic-data
Last synced: 22 Feb 2026
https://github.com/georgetdn/syscppcplinux
Store Linux C++ class data in a file ( persistence ) and manipulate it programmatically or using Small SQL (included)
class data framework linux object persistence serialize sql
Last synced: 12 Feb 2026
https://github.com/woo071002/parcel-management-system
A Parcel Delivery Management System streamlining deliveries with features for admin, users, and delivery personnel, including real-time tracking, delivery requests, and personalized dashboards.
cors csharp data dotenv html-css iconfont jkuat land-information-system mongodb python react-router-dom sass tech-expo xaml
Last synced: 08 Oct 2025
https://github.com/mystpi/crossings
🌉 A tiny library focused on easily connecting JS to HTML.
connect data frontend html javascript reactive simple small tiny
Last synced: 10 Jun 2026
https://github.com/amethyst-php/customer
A person or an organization that pays for goods or services
amethyst amethyst-package api customer data laravel
Last synced: 11 May 2026
https://github.com/rcourivaud/rcourivaud.github.io
Raphaël Courivaud
data database datascience python
Last synced: 21 Apr 2026
https://github.com/physio/flatten-ts
Flatten-ts is a lightweight TypeScript library for easily flattening and unflattening nested objects and arrays with customizable options and fast performance.
array conversion data flatten javascript json object typescript
Last synced: 06 May 2026
https://github.com/wibosco/modelingformchanges-example
An example project to show how we can implement a model to simplify form validation
data swift unit-testing validator
Last synced: 16 Mar 2025
https://github.com/codecentric/reedelk-bookingintegrationservice
Example service for the blog post series about Reedelk
api api-gateway data integration integration-flow
Last synced: 16 Oct 2025
https://github.com/mrsaeeddev/data-science-roadmap-for-beginners
📈 A minimal and easy road map for beginners who want to dive into the field of Data Science
data data-science datascience python
Last synced: 29 Jun 2025
https://github.com/utrechtuniversity/dataprivacyproject
This is the repository underlying the landing page for the Data Privacy Project @UtrechtUniversity, the Netherlands.
data gdpr open-science privacy rdm research research-data-management utrecht-university
Last synced: 10 Oct 2025
https://github.com/p32929/use-megamind
A simple react hook for managing asynchronous function calls with ease on the client side
async asynchronous-tasks axios client-side-javascript data data-fetching easy fetch generics hooks javascript npm painless promise query react rest simple small typescript
Last synced: 23 Jan 2026
https://github.com/datahub-local/datahub-local
DataHub.local is a powerful data platform designed for edge devices, enabling seamless analytics and insights at home
data data-engineering devops kubernetes raspberrypi
Last synced: 21 Jan 2026
https://github.com/drkenreid/introductory-data-science
Hands-on machine learning tutorials in Google Colab, covering various algorithms and techniques for learners at different levels.
cnn data data-science deep-learning learning-datascience learning-machine-learning learning-python neural-network neural-networks regression rnn science tutorial tutorial-exercises tutorials
Last synced: 28 Jan 2026
https://github.com/tayeva/eia-client-python
EIA Open Data API Client - Python
data open-source python python-3 python3
Last synced: 14 Oct 2025
https://github.com/yashmistry-24/ytcomment-iq
YTComment-IQ is a web app for analyzing and visualizing YouTube comments, offering insights through sentiment analysis, topic modeling, and interactive charts.
analysis comments data dataanalysis dataanalytics deep-learning machine-learning nlp python streamlit training visualization webapp youtube
Last synced: 15 Feb 2026
https://github.com/ipstack/finder
Define data by IP Address
composer data geo geoip info ip ip-database ip-search ipstack ipstack-finder php search
Last synced: 14 May 2026
https://github.com/ymougenel/referencecollector
Helps you gather, store and share references links
ansible data docker keycloak kotlin spring-boot thymeleaf
Last synced: 14 Apr 2026
https://github.com/ballerina-platform/module-ballerina-data.csv
The Ballerina CSV Data Library is a comprehensive toolkit designed to facilitate the handling and manipulation of CSV data within Ballerina applications. It streamlines the process of converting CSV data to native Ballerina data types, enabling developers to work with CSV content seamlessly and efficiently.
ballerina ballerina-csv csv csv-data data
Last synced: 29 Jan 2026
https://github.com/davemlz/master_of_datascience
Master of Data Science repository
data data-mining data-science database r rmd sql sqlite statistics
Last synced: 14 Apr 2026
https://github.com/abuzar-alvi/employee-data-to-info-card-generator-with-python
This Python project is made by me, Python project for improving python skills.
card data data-generator employee python
Last synced: 03 Feb 2026
https://github.com/planarnetwork/feeds.planar.network
GTFS feeds for bus, train and plane
data feeds gtfs transit transportation
Last synced: 11 Feb 2026
https://github.com/mednour2019/devolap
OLAP Cube Dispatcher Tool
analysis-services csharp data excel excel-export kpi mdx metroframework mvvm-architecture sql wpf
Last synced: 27 Jan 2026
https://github.com/automators-com/datamaker-js
The official Node.js / Typescript library for the DataMaker API
data javascript nodejs typescript
Last synced: 11 Oct 2025
https://github.com/banyan-team/banyan-julia-examples
Adventures in massively parallel cloud computing with Banyan Julia!
banyan data data-analytics data-processing data-science julia
Last synced: 02 May 2026
https://github.com/mollybeach/cherryether
CherryEther: Typescript Staking Deposits Ethereum Transactions
blockchain data data-science ethereum typescripts
Last synced: 21 May 2026
https://github.com/ivangrigorov/neutrino-search-engine
Creating Java search engine both for HTML or document type of files
data data-analysis data-knowledge information-extraction information-retrieval java-language search-engine
Last synced: 31 Mar 2025
https://github.com/stdlib-js/datasets-anscombes-quartet
Anscombe's quartet.
anscombe anscombes-quartet data dataset datasets javascript node node-js nodejs quartet sample statistics stats stdlib
Last synced: 13 Oct 2025
https://github.com/StudyResearchProjects/arrbuffstr
Creates Strings from ArrayBuffers and viceversa in NodeJS and the Browser
arraybuffer browser data node string transform
Last synced: 09 Oct 2025
https://github.com/woctezuma/geforce-leak
Fetch data from the Geforce leak.
data datamining egs epic epic-games epic-games-launcher epic-games-store geforce geforce-experience geforce-leak geforce-now geforce-now-leak geforcenow geforcenow-leak graphql leak leaks nvidia steam steam-games
Last synced: 02 May 2026
https://github.com/andrewrporter/my-analytics
Analyzes FireFox browsing history with modern python3 features and libraries
analytics data firefox matplotlib python python3 sqlite3
Last synced: 28 Apr 2026
https://github.com/jimut123/scrapers
All Scrapers that I'll build
bs4 data python3 real-time-visualisations scrapers scrapy wget
Last synced: 16 Jan 2026
https://github.com/missiontoscale/bluesky-scraper
This is a work of art that enables you to scrape data off BlueSky.
analytics bluesky bluesky-api bluesky-client data datascraper-framework datascraping scraping social-media web webscraping
Last synced: 19 Jun 2026
https://github.com/doriclaudino/canarinho_nlp
labels, classify, summarization string for canarinho app
chrome-console classification classifier-model data labels nlp nlu python spacy spacy-models spacy-nlp summarization-string
Last synced: 08 May 2026
https://github.com/manifoldfinance/disco-schema
MEV Auction and Ethereum Network Data Schemas
cryo data dataset ethereum ethereum-builders ethereum-mev evm mev-data pandas schema-registry schemas
Last synced: 08 May 2026
https://github.com/andreaselia/quotes-xd
A plugin for Adobe XD to insert a text element with a random quote and respective author.
adobe adobe-xd data design design-tool design-tools quote random xd
Last synced: 24 Apr 2026
https://github.com/phelipe-sempreboni/data-engineering
Repository for tutorials, information, notes and projects about data engineering.
data dataengineering engine engineering enviroment etl etl-pipeline pipeline project python
Last synced: 04 Oct 2025
https://github.com/chaitanyac22/hr_policy_query_resolution_with_retrieval_augmented_generation_rag
This repository contains an HR Policy Query Resolution system using Retrieval-Augmented Generation (RAG). It leverages a 4-bit quantized Mistral-7B-Instruct-v0.2 LLM and JP Morgan Chase’s publicly available Code of Conduct documents to generate accurate, contextually relevant responses for HR policy queries.
artificial-intelligence data hr large-language-models llm mistral-7b nlp pipeline prompt-engineering quantization rag retrieval-augmented-generation
Last synced: 12 Feb 2026
https://github.com/leechristophermurray/parquetframe
Unlocking the power of Parquets
data data-analysis dataframe entity-framework etl graph interactive python rust workflow worklow zanzibar
Last synced: 28 May 2026
https://github.com/vatshayan/final-year-project-image-recognition
Machine Learning project to recognize faces from an Image
btech computerscience data facial final image imageclassification learning machine project recognition science students year
Last synced: 29 May 2026
https://github.com/peterdavehello/nrd-list-archive
🌐📂 A collection of past NRD lists to explore—perfect for fun, research, or just plain curiosity! 🎉🔍✨
Last synced: 17 Mar 2026
https://github.com/nrennie/data
A collection of random datasets, either from web-scraping or processing more complex data.
Last synced: 30 May 2026