data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-06-30 00:07:50 UTC
- JSON Representation
https://github.com/missiontoscale/bluesky-scraper
This is a work of art that enables you to scrape data off BlueSky.
analytics bluesky bluesky-api bluesky-client data datascraper-framework datascraping scraping social-media web webscraping
Last synced: 19 Jun 2026
https://github.com/sabujxi/python-scraper-and-data-analysts-admin-panel-in-django
A data scraper from texas govt site and a helping web app for managing, reviewing and editing the data
analyst data data-analysis data-entry data-scraper django django-application python python-scraper real-estate regex scraper texas
Last synced: 30 Apr 2026
https://github.com/andrey-tech/data-storage-php
Простое хранилище данных в виде ключ-значение в JSON-файлах с разделяемой блокировкой на чтение и эксклюзивной блокировкой на запись.
data data-storage files json php php7 storage
Last synced: 29 Apr 2026
https://github.com/deepwaterpaladin/statscanpy
Basic package for querying & downloading StatsCan data by table name.
Last synced: 16 Jan 2026
https://github.com/14richa/patient-readmission-analysis
This project focuses on predictive modeling to foresee hospital readmissions of diabetic patients within 30 days post-discharge. By leveraging a dataset spanning a decade (1999-2008) and covering records from 130 US hospitals, the aim is to enhance healthcare management and patient outcomes.
analytics data jupyter-notebook numpy
Last synced: 29 Apr 2026
https://github.com/luminati-io/Pinterest-dataset-samples
Two sample datasets of over 1000 Pinterest profiles and posts, extracted using the Bright Data API, ideal for market research, influencer marketing, and product development.
data data-extraction data-mining database datasets pinterest pinterest-api structured-data web-scraping
Last synced: 09 Apr 2025
https://github.com/andrewrporter/my-analytics
Analyzes FireFox browsing history with modern python3 features and libraries
analytics data firefox matplotlib python python3 sqlite3
Last synced: 28 Apr 2026
https://github.com/datafold/vhol-demo
Get hands-on examples of dbt + Datafold CI/CD workflows
data data-engineering datafold dbt diff
Last synced: 28 Dec 2025
https://github.com/freight-trust/edi-onboarding
ESC Guidelines for X12/EDIFACT Messages
b2b data data-interchange edi edi-xml edifact enterprise x12
Last synced: 04 Mar 2026
https://github.com/openpeeps/zxc-nim
Bindings to the ZXC compression library, a LZ77-based compressor optimized for high decompression speed
archive compression compressor data decompression game-assets lossless lossless-compression lz77 nim nim-bindings nim-package nim-wrapper openpeeps zxc
Last synced: 07 Jun 2026
https://github.com/ciscorn/tinygrib2
(experimental) A tiny toolkit for parsing JMA's GRIB2 files.
data grib grib2 meteorology rust weather
Last synced: 26 Apr 2026
https://github.com/nrennie/data
A collection of random datasets, either from web-scraping or processing more complex data.
Last synced: 30 May 2026
https://github.com/mohasarc/treeviz
The best tree data-structures visualization tool
data structures visualization visualization-tools
Last synced: 25 Apr 2026
https://github.com/andreaselia/quotes-xd
A plugin for Adobe XD to insert a text element with a random quote and respective author.
adobe adobe-xd data design design-tool design-tools quote random xd
Last synced: 24 Apr 2026
https://github.com/oliver021/entity-dock
A superset with libraries, components, tools and more to work with entity on .Net
api asp-net-core controller data database dotnet entity entity-framework-core library model mvc netstandard orm support webapi
Last synced: 09 May 2026
https://github.com/caelean/twittermap
Map of twitter user's influence as defined on by influencetracker
data google-maps maps sparql twitter visualization
Last synced: 14 Jun 2025
https://github.com/wibosco/modelingformchanges-example
An example project to show how we can implement a model to simplify form validation
data swift unit-testing validator
Last synced: 16 Mar 2025
https://github.com/rcourivaud/rcourivaud.github.io
Raphaël Courivaud
data database datascience python
Last synced: 21 Apr 2026
https://github.com/aisurjyasamantaray/sales-perfomance-analysis-dashboard
A comprehensive sales performance analysis dashboard built using Python, and visualization tools. This project includes data cleaning, descriptive statistics, correlation analysis, and insights into sales trends, profitability, and the impact of discounts. Key features include interactive visualizations using Seaborn, and Matplot
analytics annova data data-analysis data-visualization-project dataproject eda hypothesis-testing pandas-dataframe python sales-performance-analysis statistics
Last synced: 04 Apr 2026
https://github.com/nop-dev/learning-js
Esse repositório contem todas as anotações que fiz enquanto estudava um módulo da trilha Explorer da Rocketseat sobre JavaScript. 🔰
data data-structures functions javascript js
Last synced: 17 Apr 2026
https://github.com/saleh0987/mohamed_saleh
That's my personal website where I show my skills and projects.
aos-animation axios boot data json nextjs portfolio portfolio-website projects react-icons reactjs sass swiper
Last synced: 09 Mar 2026
https://github.com/mollybeach/cherryether
CherryEther: Typescript Staking Deposits Ethereum Transactions
blockchain data data-science ethereum typescripts
Last synced: 21 May 2026
https://github.com/sanand0/imdbscrape
A weekly archive of the IMDB Top 250 results. Automatically scraped via GitHub Actions. Useful to see trends on IMDb Top 250
Last synced: 30 May 2026
https://github.com/hasnocool/war_thunder_camouflage_scraper
A concurrent web scraper designed to collect camouflage information from war thunder aircrafts.
asyncio camouflage concurrent data execution handling playwright python scraping signal sqlite3 thunder war web
Last synced: 04 Jan 2026
https://github.com/tusharnankani/analysis-2.0
An Exhaustive WhatsApp Chat Data Analysis 2.0
analysis data data-science plots trends visualization
Last synced: 31 Mar 2025
https://github.com/andrew-johnson-4/misspeller
Take correctly spelled words and return common spelling mistakes
common-mistakes data language natural nlp processing rust
Last synced: 30 Apr 2025
https://github.com/dark-art108/yonk
A cli-utility to streamline data science work by creating templates
Last synced: 08 May 2026
https://github.com/vatshayan/final-year-project-image-recognition
Machine Learning project to recognize faces from an Image
btech computerscience data facial final image imageclassification learning machine project recognition science students year
Last synced: 29 May 2026
https://github.com/mystpi/crossings
🌉 A tiny library focused on easily connecting JS to HTML.
connect data frontend html javascript reactive simple small tiny
Last synced: 10 Jun 2026
https://github.com/fabriciopsouza/covid-19-demographic-social-dataset
A social demographic dataset for analysis of the COVID-19 pandemic.
alteryx coronavirus coronavirus-analysis coronavirus-dataset covid-19 covid19 covid19-data data data-science dataset enrichment-analysis timeseries timeseries-analysis timeseries-clustering timeseries-covid-19 timeseries-database timeseries-segmentation timeseriesclassification
Last synced: 31 May 2026
https://github.com/manifoldfinance/disco-schema
MEV Auction and Ethereum Network Data Schemas
cryo data dataset ethereum ethereum-builders ethereum-mev evm mev-data pandas schema-registry schemas
Last synced: 08 May 2026
https://github.com/rajatt95/python_rs
Programming | Python | PyCharm | Data Types | Tuple | Dictionary | If-Else | Loops - For, While | Functions | OOPS Principles | Constructor | String - SubString, Concatenation, Split, Strip | Read & Write data into files | JSON Parsing | CSV package | Web Scrapping
constructor csv-parser data dictionary functions if-else-statements json json-parser oops parser pycharm-ide python python-programming-language read-write-file strings tuple web-scrapping
Last synced: 15 Feb 2026
https://github.com/ayemunhossain/firebase-realtime-db-advance-query
Firebase real time database, query with nodejs.
ayemunhossain data firebase firebase-functions firebase-realtime-database nodejs query
Last synced: 06 May 2026
https://github.com/kylekirkby/cardatasnatch
CarDataSnatch allows you to quickly find information about a car in the uk using a valid number plate. Grab an image of the car in question along with a multitude of other data. Compare two cars' data for fast and easy analysis.
beautifulsoup cars command-line-tool data data-analysis data-mining ethical-hacking python python3 requests scraper social-engineering
Last synced: 15 Apr 2025
https://github.com/colour-science/colour-demosaicing-tests-datasets
Colour - Demosaicing - Tests Datasets
color color-science color-space color-spaces colorspace colorspaces colour colour-science colour-space colour-spaces colourspace colourspaces data dataset datasets de-mosaicing debayering demosaicing demosaicking raw
Last synced: 19 Mar 2026
https://github.com/doughtnerd/pod
Read and write Excel data with Java
data excel extract poi-library
Last synced: 08 Apr 2025
https://github.com/achraf-oujjir/chatgpt-users-tweets-pipeline
🐦🔵End-to-end ChatGPT Users' Tweets Data Pipeline with Python 🐍, Hive 🐝, and Power BI 📊
bash-script cloudera data data-engineering data-vizualisation datawarehouse hdfs hive networking powerbi python sentiment-analysis sftp shell tweepy twitter-api ubuntu virtualization vmware-workstation
Last synced: 28 Feb 2026
https://github.com/chaitanyac22/hr_policy_query_resolution_with_retrieval_augmented_generation_rag
This repository contains an HR Policy Query Resolution system using Retrieval-Augmented Generation (RAG). It leverages a 4-bit quantized Mistral-7B-Instruct-v0.2 LLM and JP Morgan Chase’s publicly available Code of Conduct documents to generate accurate, contextually relevant responses for HR policy queries.
artificial-intelligence data hr large-language-models llm mistral-7b nlp pipeline prompt-engineering quantization rag retrieval-augmented-generation
Last synced: 12 Feb 2026
https://github.com/sapienzanlp/exploring-srl
Repository for the paper "Exploring Non-Verbal Predicates in Semantic Role Labeling: Challenges and Opportunities"
acl acl2023 conllu data dataset natural-language-processing nlp semantic-role-labeling srl
Last synced: 31 Jan 2026
https://github.com/jesusgraterol/bitcoin-blockchain-dataset-builder
The dataset builder script extracts all the relevant block information from the Bitcoin Blockchain through Mempool.space's public API. The data is stored in a .csv file, facilitating its use in data science and machine learning projects.
bitcoin blockchain blockchain-technology data datascience datascience-machinelearning dataset dataset-generation machine-learning
Last synced: 06 May 2026
https://github.com/rulox/faker
A Go library to create Fake Data for your projects
data dummy dummy-data fake fake-data faker go golang
Last synced: 28 May 2026
https://github.com/physio/flatten-ts
Flatten-ts is a lightweight TypeScript library for easily flattening and unflattening nested objects and arrays with customizable options and fast performance.
array conversion data flatten javascript json object typescript
Last synced: 06 May 2026
https://github.com/quin1sue/priceguidesph-bettergov
an economic and financial data platform project under bettergov.ph
bettergovph cloudflare data hacktoberfest nextjs priceguides
Last synced: 05 May 2026
https://github.com/lmantw/binarion
A simple binary format for storing JavaScript objects.
binary data decoding encoding format javascript
Last synced: 02 Sep 2025
https://github.com/eesunmoon/algorithms
[Fall 2020] Algorithms
algorithms algorithms-and-data-structures c data data-structures
Last synced: 01 Feb 2026
https://github.com/StudyResearchProjects/arrbuffstr
Creates Strings from ArrayBuffers and viceversa in NodeJS and the Browser
arraybuffer browser data node string transform
Last synced: 09 Oct 2025
https://github.com/amethyst-php/customer
A person or an organization that pays for goods or services
amethyst amethyst-package api customer data laravel
Last synced: 11 May 2026
https://github.com/alexandregazagnes/unilasalle-public-resources
UniLaSalle-Public-Ressources : This public repository contains the notebooks and the data used for both : 2nd Year - Practical Statistical Tests 4th Year - Data Analysis with Python
data data-analysis data-analytics data-cleaning data-storytelling education educational exploratory-data-analysis python python3 r r-programming rstudio statistics visualization
Last synced: 28 Apr 2026
https://github.com/stdlib-js/datasets-cdc-nchs-us-births-1994-2003
US birth data from 1994 to 2003, as provided by the Center for Disease Control and Prevention's National Center for Health Statistics.
america babies births data dataset datasets javascript node node-js nodejs stdlib time-series timeseries united-states us usa
Last synced: 12 Oct 2025
https://github.com/ismet55555/pdw-asym-2link
Clear and easy way of simulating a passive dynamic walker (PDW) model derived and exectured using MATLAB.
data dynamics inverted-pendulum matlab numerical-simulations passive-dynamic-walker passive-dynamics ramp research robotics simulation slope walking-simulator
Last synced: 29 Apr 2026
https://github.com/anandchowdhary/health
🫀 @AnandChowdhary's body measurements
csv data fitness github-actions health
Last synced: 29 Apr 2026
https://github.com/dongminlee94/data-visualization-tutorial
A repository for data visualization tutorial
data data-science data-visualization matp matplotlib pca plotly python seaborn t-sne tutorial umap visualization
Last synced: 29 Apr 2026
https://github.com/a3r0id/lightshot-data-miner
A random idea I had a while back to make a data miner for lightshot. Never released this but after a friend sent me a post about lightshot's transparency I figured it'd be a good time to release this. I've included some output from a run before making the repo. I am not responsible for the imagery or it's contents.
brute-force bruteforce data dataset face-recognition image-processing lightshot mining scraper scraping text-recognition
Last synced: 19 Oct 2025
https://github.com/jazeee/dexcom-android-wall-panel
Display data as a Graph on Android, jazeee data plotter
Last synced: 02 May 2026
https://github.com/keosariel/nairagazer-clustered-news
Providing clustered News data specifically Nigeria news. In hindsight this repo contain nigeria news and it's coverage. Data is from Nairagazer
ai data data-science news nigeria nigerian-data python
Last synced: 30 Aug 2025
https://github.com/cbartram/advancedai
AdvancedAI Selection Option for Command and Conquer Generals Zero Hour
Last synced: 30 May 2026
https://github.com/rastmob/wordpress-llms-output-plugin
A WordPress plugin to export posts, pages, and custom post types as JSON for training Language Models (LLMs).
ai data llm llms training training-data wordpress wordpress-development wordpress-plugin
Last synced: 03 May 2026
https://github.com/dantesc03/uberpool-case-study
This project was designed to understand the statistical effects of longer wait times on uber rides. Particularly on the user and driver experience with the Uber Pool System.
analysis data excel jupyter jupyternotebooks learn python seaborn statistics t-tests uber visualization
Last synced: 16 Apr 2026
https://github.com/rohan-paul/machine-learning-and-deep-learning-tutorial-notebooks
Various Machine Learning and Deep Learning Tutorial Notebooks in Blog Format
data data-analysis data-science deep-learning deep-learning-tutorial deep-neural-networks machine-learning machine-learning-algorithms machinelearning neural-network pytorch pytorch-implementation pytorch-tutorial tensorflow
Last synced: 09 May 2026
https://github.com/stefen-taime/real-time-data-pipeline-snake-game
Dynamic Snake Game: Unleashing Real-Time Streaming Analytics with Redis, Kafka, Flink, ClickHouse & Chart.js in an Online Snake Game via Flask API
chartjs clickhouse confluent-cloud data flask kafka-streams pipeline redis
Last synced: 04 May 2026
https://github.com/imtiaz-emu/exploratory-data-analysis-with-r
Data Transformation, Descriptive statistics, data visualization, Linear regression using R
data dplyr ggplot2 r rstudio visualization
Last synced: 15 Mar 2025
https://github.com/nononoexe/setariaviridis
🌾 Field-collected data of green foxtail
data data-science dataset rpackage
Last synced: 27 Feb 2026
https://github.com/azawawi/perl6-msgpack
Perl 6 Interface to libmsgpack
data messagepack msgpack perl6 wrapper
Last synced: 12 Jun 2025
https://github.com/antononcube/raku-data-importers
Various data importing routines with a unified interface (data-import, slurp).
data data-ingestion raku rakulang slurp
Last synced: 23 Feb 2026
https://github.com/bastianolea/sinim_info_municipal
Base de datos del Sistema Nacional de Información Municipal, que incluye datos comunales sobre finanzas municipales, recursos humanos, educación, salud, pensiones, organizaciones sociales, y más.
chile comunas data estado laboral politica social tiempo
Last synced: 26 Oct 2025
https://github.com/colour-science/colour-hdri-examples-datasets
Colour - HDRI - Examples Datasets
color color-science color-space color-spaces colorspace colorspaces colour colour-science colour-space colour-spaces colourspace colourspaces data dataset datasets hdr hdri raw tone-mapping tonemapping
Last synced: 19 Mar 2026
https://github.com/skylinenando/javascript
autocomplete browser data disable events javascript language loop
Last synced: 14 Feb 2026
https://github.com/lxcoding06/e-gereja
Website CRUD untuk Gereja, untuk mengatur data jemaat, data kematian, data pernikahan dan data baptis
data data-gereja e-gereja gereja gereja-online jemaat kematian pernikahan
Last synced: 15 May 2025
https://github.com/datahub-local/datahub-local
DataHub.local is a powerful data platform designed for edge devices, enabling seamless analytics and insights at home
data data-engineering devops kubernetes raspberrypi
Last synced: 21 Jan 2026
https://github.com/husna-poyraz/artificial-intelligence-and-data-science
Some studies on Artificial Intelligence and Data Science ...
artificial-intelligence data data-analysis-python data-science matplotlib-pyplot numpy pandas python
Last synced: 05 May 2026
https://github.com/bastianolea/siedu_indicadores_urbanos
Datos del Sistema de Indicadores y Estándares de Desarrollo Urbano, con datos comunales sobre temas como transporte, urbanismo, servicios básicos, calidad de vida y más.
ambiental app chile ciudad comunas data estado social
Last synced: 19 Feb 2026
https://github.com/cerema/groum
Utilitaire en ligne de commande pour convertir les données d'arrêtés de circulation
Last synced: 06 Feb 2026
https://github.com/ingmarboeschen/jatsdecoderevaluation
Evaluation data and code
Last synced: 04 Feb 2026
https://github.com/lindsaygelle/emojipedia
Go application. Simple program that scrapes unicode.org for Emoji content. Parses out HTML into categorically ordered data subsets. Explored from the command line.
cli data data-mining emoji emojipedia encyclopedia go golang golang-application html-scraping unicode-characters
Last synced: 11 Mar 2026
https://github.com/praveenpuglia/css-support
The source of truth for CSS browser support of info
api browser compatibility css data properties selectors support
Last synced: 31 Mar 2025
https://github.com/iusztinpaul/airbnb-data-analysis
Airbnb data analysis on the biggest cities in The Netherlands following the CRISP-DM methodology.
airbnb data datanalysis datascience machine-learning numpy pandas python
Last synced: 06 May 2026
https://github.com/woo071002/parcel-management-system
A Parcel Delivery Management System streamlining deliveries with features for admin, users, and delivery personnel, including real-time tracking, delivery requests, and personalized dashboards.
cors csharp data dotenv html-css iconfont jkuat land-information-system mongodb python react-router-dom sass tech-expo xaml
Last synced: 08 Oct 2025
https://github.com/blakedrumm/scvmm-scripts-and-sql
The Scripts provided here are compatible with System Center Virtual Machine Manager
collector data powershell scripts scvmm sql
Last synced: 11 May 2025
https://github.com/tayeva/eia-client-python
EIA Open Data API Client - Python
data open-source python python-3 python3
Last synced: 14 Oct 2025
https://github.com/uk-ipop/open-data-pipeline
A pipeline for processing, enhancing, and sharing open datasets.
actions automation data python
Last synced: 25 May 2026
https://github.com/banbord/data-vis-tornados
This repository includes data files, processing scripts, visualization code, and documentation for our tornado data visualization project. It aims to provide insights into tornado patterns across the United States using interactive and informative visual representations.
d3-visualization d3js data javascript json visualization
Last synced: 24 Feb 2026
https://github.com/feltex/datahora-java
Aprenda a trabalhar com Data e Hora em Java com as novas classes LocalDateTime, LocalDate, DateTimeFormatter e outras novidades do pacote java.time.
brasil data date dateformat dateformat-brazil datetime hora java java11 localdatetime locale localization zoneddatetime
Last synced: 18 Jun 2026