data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-06-23 00:07:41 UTC
- JSON Representation
https://github.com/countervolts/apple-music-stats-calculator
how to get your most streamed songs/artists
apple apple-music applemusic calculator data
Last synced: 11 Feb 2026
https://github.com/gonzalezlrjesus/covid-19API
Convierte la data ofrecida por: the Johns Hopkins University Center en formato CSV al formato JSON sobre los casos confirmados, muertos y recuperados de COVID-19 por paises.
api api-rest api-server coronavirus covid-19 data go golang json
Last synced: 06 May 2025
https://github.com/chaitanyac22/hr_policy_query_resolution_with_retrieval_augmented_generation_rag
This repository contains an HR Policy Query Resolution system using Retrieval-Augmented Generation (RAG). It leverages a 4-bit quantized Mistral-7B-Instruct-v0.2 LLM and JP Morgan Chase’s publicly available Code of Conduct documents to generate accurate, contextually relevant responses for HR policy queries.
artificial-intelligence data hr large-language-models llm mistral-7b nlp pipeline prompt-engineering quantization rag retrieval-augmented-generation
Last synced: 12 Feb 2026
https://github.com/achraf-oujjir/chatgpt-users-tweets-pipeline
🐦🔵End-to-end ChatGPT Users' Tweets Data Pipeline with Python 🐍, Hive 🐝, and Power BI 📊
bash-script cloudera data data-engineering data-vizualisation datawarehouse hdfs hive networking powerbi python sentiment-analysis sftp shell tweepy twitter-api ubuntu virtualization vmware-workstation
Last synced: 28 Feb 2026
https://github.com/eby8zevin/android-pos4122020
The Next Project . . .
android android-app android-application android-database android-studio androidstudio create data database database-sqlite delete point-of-sale pos read search sqlite update
Last synced: 13 Oct 2025
https://github.com/1sumer/sql
This repository contains SQL scripts and data for various analytical and database management tasks. The project is designed to demonstrate SQL capabilities in handling complex queries, data analysis, and database design. It includes datasets related to e-commerce and streaming services, with a focus on real-world scenarios and use cases.
analytics data data-analysis data-storage sql vscode
Last synced: 19 Jan 2026
https://github.com/slipke/eurlex-model-go
This projects implements the EUR-Lex XML data model in Golang. For more information see README.md
data datamodel eur-lex eurlex webservice
Last synced: 09 Mar 2026
https://github.com/colour-science/colour-demosaicing-tests-datasets
Colour - Demosaicing - Tests Datasets
color color-science color-space color-spaces colorspace colorspaces colour colour-science colour-space colour-spaces colourspace colourspaces data dataset datasets de-mosaicing debayering demosaicing demosaicking raw
Last synced: 19 Mar 2026
https://github.com/stdlib-js/utils-compact-adjacency-matrix
Compact adjacency matrix.
adjacency dag data data-structure data-structures graph javascript matrix node node-js nodejs stdlib structure topological toposort tsort util utilities utility utils
Last synced: 15 Apr 2026
https://github.com/dongminlee94/data-visualization-tutorial
A repository for data visualization tutorial
data data-science data-visualization matp matplotlib pca plotly python seaborn t-sne tutorial umap visualization
Last synced: 29 Apr 2026
https://github.com/macsual/dotgov-jamaica-domains
A listing of .gov.jm domains.
Last synced: 03 Jan 2026
https://github.com/instaclustr/cassandra-parquet-transformer
Transform SSTables from Apache Cassandra to Parquet or Avro files, locally or remotely via Apache Cassandra Sidecar
analytics apache apache-cassandra avro big cassandra data parquet spark sstable transformation
Last synced: 29 Aug 2025
https://github.com/vatshayan/final-year-project-image-recognition
Machine Learning project to recognize faces from an Image
btech computerscience data facial final image imageclassification learning machine project recognition science students year
Last synced: 29 May 2026
https://github.com/rajatt95/python_rs
Programming | Python | PyCharm | Data Types | Tuple | Dictionary | If-Else | Loops - For, While | Functions | OOPS Principles | Constructor | String - SubString, Concatenation, Split, Strip | Read & Write data into files | JSON Parsing | CSV package | Web Scrapping
constructor csv-parser data dictionary functions if-else-statements json json-parser oops parser pycharm-ide python python-programming-language read-write-file strings tuple web-scrapping
Last synced: 15 Feb 2026
https://github.com/pawelzny/vo
DDD Value Object implementation
data ddd-patterns object python3 value
Last synced: 15 Feb 2026
https://github.com/debdutto/algorhythm
Algorithmic music driven by data and / or algorithms
Last synced: 18 Apr 2026
https://github.com/lovethebomb/data-tiles
🍜 Data Tiles is a small website that shows data.
data express javascript nextjs typescript
Last synced: 10 Apr 2026
https://github.com/ngambip/diabetes_factors_2024
Exploring BMI Categories and Health Factors.
dashboards data datacleaning dax-languague powerbi sql sqlstudio tsql visualization
Last synced: 03 Mar 2026
https://github.com/stdlib-js/datasets-cdc-nchs-us-births-1994-2003
US birth data from 1994 to 2003, as provided by the Center for Disease Control and Prevention's National Center for Health Statistics.
america babies births data dataset datasets javascript node node-js nodejs stdlib time-series timeseries united-states us usa
Last synced: 12 Oct 2025
https://github.com/everythings-gonna-be-alright/amazing-clickhouse-connector
Quick recording of analytics data
analytics clickhouse data k8s kubernetes
Last synced: 04 Jan 2026
https://github.com/vikashpr/18cse301j_ra2011003010737
This website tells the story of a nation's GDP through data visualization, providing insights on global GDP, state-wise GDP, sector-wise GDP, and the vision for India's economy. It includes data sets and sources for further reference.
css3 d3-visualization d3js data data-vizualisation gephi-visualizations html5 indian-economy indian-gdp information-visualization js python-word-cloud python3 storytelling tableau tableau-public threejs wordcloud-visualization
Last synced: 03 May 2026
https://github.com/drkenreid/introductory-data-science
Hands-on machine learning tutorials in Google Colab, covering various algorithms and techniques for learners at different levels.
cnn data data-science deep-learning learning-datascience learning-machine-learning learning-python neural-network neural-networks regression rnn science tutorial tutorial-exercises tutorials
Last synced: 28 Jan 2026
https://github.com/ballerina-platform/module-ballerina-data.csv
The Ballerina CSV Data Library is a comprehensive toolkit designed to facilitate the handling and manipulation of CSV data within Ballerina applications. It streamlines the process of converting CSV data to native Ballerina data types, enabling developers to work with CSV content seamlessly and efficiently.
ballerina ballerina-csv csv csv-data data
Last synced: 29 Jan 2026
https://github.com/banyan-team/banyan-julia-examples
Adventures in massively parallel cloud computing with Banyan Julia!
banyan data data-analytics data-processing data-science julia
Last synced: 02 May 2026
https://github.com/asirihewage/simplest-xpath-web-scraper
Simplest web scraper created using Python3 and MongoDB
data data-mining python3 scraper web webscrping
Last synced: 29 Jan 2026
https://github.com/freight-trust/edi-onboarding
ESC Guidelines for X12/EDIFACT Messages
b2b data data-interchange edi edi-xml edifact enterprise x12
Last synced: 04 Mar 2026
https://github.com/ingmarboeschen/jatsdecoderevaluation
Evaluation data and code
Last synced: 04 Feb 2026
https://github.com/stdlib-js/array-uint16
Uint16Array.
array data int integer javascript node node-js nodejs short stdlib structure typed typed-array types uint uint16 uint16array unsigned
Last synced: 22 Apr 2025
https://github.com/stdlib-js/array-typed-float-ctors
Floating-point typed array constructors.
array constructor constructors ctor ctors data dtype dtypes javascript node node-js nodejs stdlib structure type typed typed-array types utilities
Last synced: 24 Apr 2025
https://github.com/praveenpuglia/css-support
The source of truth for CSS browser support of info
api browser compatibility css data properties selectors support
Last synced: 31 Mar 2025
https://github.com/blakedrumm/scvmm-scripts-and-sql
The Scripts provided here are compatible with System Center Virtual Machine Manager
collector data powershell scripts scvmm sql
Last synced: 11 May 2025
https://github.com/nononoexe/setariaviridis
🌾 Field-collected data of green foxtail
data data-science dataset rpackage
Last synced: 27 Feb 2026
https://github.com/thyringer/cast
CLI tool for reading strings or complex data sets from CSV files to output them in other text formats.
csv-converter data data-preprocessing python python3 sql-builder
Last synced: 02 Feb 2026
https://github.com/ymougenel/referencecollector
Helps you gather, store and share references links
ansible data docker keycloak kotlin spring-boot thymeleaf
Last synced: 14 Apr 2026
https://github.com/stefen-taime/real-time-data-pipeline-snake-game
Dynamic Snake Game: Unleashing Real-Time Streaming Analytics with Redis, Kafka, Flink, ClickHouse & Chart.js in an Online Snake Game via Flask API
chartjs clickhouse confluent-cloud data flask kafka-streams pipeline redis
Last synced: 04 May 2026
https://github.com/nixhantb/data-structures-and-algorithms-in-java-
Master Java Programming and Data Structures and Algorithms in Java in an efficient way. Clear concept on Recursion and Sorting
algorithms algorithms-and-data-structures competitive-programming data data-structures java java-8 programming
Last synced: 05 Jul 2025
https://github.com/stdlib-js/utils-named-typed-tuple
Named typed tuple.
array collection data data-structure data-structures javascript list named node node-js nodejs stdlib structure tuple typed typed-array util utilities utility utils
Last synced: 14 Apr 2025
https://github.com/plabayo/datapoints.earth
Earth data liberation for and by its citizens.
Last synced: 15 Mar 2026
https://github.com/parimala24-ds/datascientistmlinterviewprep24
DATASCIENTST ML INTERVIEW PREP24
data decisiontree interviewquestions linear-regression logistic machine-learning matplotlib numpy pandas python seaborn sklearn
Last synced: 12 Apr 2025
https://github.com/sdhutchins/jxn-open-data-api
Access Jackson, MS open government data using a python API wrapper.
api data jackson jxn mississippi open-gov
Last synced: 08 Apr 2025
https://github.com/doughtnerd/pod
Read and write Excel data with Java
data excel extract poi-library
Last synced: 08 Apr 2025
https://github.com/richardschoen/ibmixmlservicestd
IBM i XMLSERVICE C# and VB.Net Data Access Service Wrapper for .Net 4.6.1 and above and .Net Core 2.0 and above
as400 cl cobol command data database db2 ddm drda ibm ibmi os400 pase program qcmdexc qcmdexec queue rpg service xmlservice
Last synced: 18 Apr 2025
https://github.com/bredalis/kpopnews
A place to see kpop news 📝
backend css data feedparser flask frameworks frontend html jinja2 kpop mongodb mongodb-atlas news newsletter os pages pymongo python requests web
Last synced: 12 Feb 2026
https://github.com/lmantw/binarion
A simple binary format for storing JavaScript objects.
binary data decoding encoding format javascript
Last synced: 02 Sep 2025
https://github.com/e-candeloro/data-analysis-code-snippets-for-pandas-and-sklearn
These notebooks are useful to learn how to load, understand, clean and classify data using Pandas and Sklearn with Python
analysis big-data classification data datascience datavisualization machine-learning notebook numpy pandas python sklearn
Last synced: 10 Apr 2026
https://github.com/stdlib-js/array-ones
Create an array filled with ones and having a specified length.
array data float32array float64array int16array int32array javascript matrix ndarray node node-js nodejs stdlib structure typed typed-array types uint32array vector
Last synced: 09 Apr 2025
https://github.com/lafayettegabe/g2m-insight-for-cab-investment-firm
📊 Exploratory Data Analysis (EDA) on multiple datasets related to the cab industry in the US, to provide actionable insights and recommendations to a private firm looking to invest in the market. The analysis includes data cleaning, transformation, visualization, and hypothesis testing.
big-data data data-analysis data-science data-visualization eda gotomarket
Last synced: 13 Jun 2025
https://github.com/jaffarabbas/library-management-system-in-java-
GUI base + Database functionality
data database datastructures-algorithms dbms gson java javafx javafx-application javafx-desktop-apps javamail library-management-system mysql sql xammp
Last synced: 05 May 2026
https://github.com/mihasm/arso-scraper
Unofficial Python CLI tool for downloading automated sensor weather data from the Slovenian Environment Agency.
api arso cli data historical-data meteorological python slovenia weather
Last synced: 14 Feb 2026
https://github.com/iusztinpaul/airbnb-data-analysis
Airbnb data analysis on the biggest cities in The Netherlands following the CRISP-DM methodology.
airbnb data datanalysis datascience machine-learning numpy pandas python
Last synced: 06 May 2026
https://github.com/jimut123/scrapers
All Scrapers that I'll build
bs4 data python3 real-time-visualisations scrapers scrapy wget
Last synced: 16 Jan 2026
https://github.com/oliverhennhoefer/shiny-template-interactive-table
Example of interactively adding rows / deleting rows by selecting directly in a data.table (DT) in Shiny
button data delete dt r select selection server shiny shiny-applications shiny-apps shiny-r shinyapps table ui userinterface
Last synced: 16 Apr 2026
https://github.com/jesusgraterol/bitcoin-blockchain-dataset-builder
The dataset builder script extracts all the relevant block information from the Bitcoin Blockchain through Mempool.space's public API. The data is stored in a .csv file, facilitating its use in data science and machine learning projects.
bitcoin blockchain blockchain-technology data datascience datascience-machinelearning dataset dataset-generation machine-learning
Last synced: 06 May 2026
https://github.com/georgetdn/syscppcplinux
Store Linux C++ class data in a file ( persistence ) and manipulate it programmatically or using Small SQL (included)
class data framework linux object persistence serialize sql
Last synced: 12 Feb 2026
https://github.com/imtiaz-emu/exploratory-data-analysis-with-r
Data Transformation, Descriptive statistics, data visualization, Linear regression using R
data dplyr ggplot2 r rstudio visualization
Last synced: 15 Mar 2025
https://github.com/freebirdscrew/datastructures_python
Data Structures Implementation in Python and Explains each Steps.
data data-visualization datascience datastructures datastructures-algorithms datastructures-algorithms-python datastructures-implementation datastructuresandalgorithm freebirdscrew programming python simranjeet simranjeetsingh
Last synced: 16 Apr 2026
https://github.com/mahmoud-saeed-mahmoud/loading_state_handler
The StateHandlerWidget manages different UI states—loading, error, empty, and normal—allowing you to customize the displayed widgets for each state.
dart data error flutter flutter-package flutter-widget loading state
Last synced: 10 Mar 2026
https://github.com/manifoldfinance/disco-schema
MEV Auction and Ethereum Network Data Schemas
cryo data dataset ethereum ethereum-builders ethereum-mev evm mev-data pandas schema-registry schemas
Last synced: 08 May 2026
https://github.com/askaniy/celestialocationsmaker
Tool for making Celestia location files
celestia data geology locations mapping planetary-science space
Last synced: 14 Mar 2025
https://github.com/infinitode/pwlds
A public dataset of over 10 million passwords, with assigned strength levels.
ai classes classification cyber-security data dataset ml open-source password passwords synthetic-data
Last synced: 22 Feb 2026
https://github.com/muhammadibrahim313/start-your-data-science-journey
In this Repo i will be Sharing all Resources that we will be Learning during December Data Science Workhops on iCode Guru
btajicrew data data-science eda icodeguru machine-learning matplotlib pandas python
Last synced: 03 Feb 2026
https://github.com/csadorf/pydata-ann-arbor-2018
Slides and notebooks demonstrating signac for PyData Ann Arbor Meetup 2018
data data-management jupyter signac workflow
Last synced: 04 Jun 2026
https://github.com/slashdotted/pomapure
PoorMan's Pipeline
data json modular module pipeline processing
Last synced: 18 Apr 2026
https://github.com/danielbayley/schemas
A collection of useful @JSON-schema-org schemas for data validation.
ajv config configuration data data-science data-structures data-validation json json-schema linter linting schema schema-org validation yaml yaml-configuration
Last synced: 13 Oct 2025
https://github.com/0xdir/relief_web_dart
A Future-based wrapper around the Relief Web API, to retrieve information on humanitarian news, reports, training, jobs, and disasters
api dart data humanitarian jobs
Last synced: 11 Jun 2026
https://github.com/evoluteur/web-scraper-sitemaps
Sitemaps for the Web Scraper Chrome extension.
chrome-extension data dataset scraper scraping scrapper scrapping scrapy-crawler sitemap web-scraper web-scraping
Last synced: 04 Jun 2026
https://github.com/dark-art108/yonk
A cli-utility to streamline data science work by creating templates
Last synced: 08 May 2026
https://github.com/mlr-org/mlr3data
Data sets used in the book, gallery, or in examples of mlr3.
data data-science data-sets machine-learning mlr3 r r-package
Last synced: 09 Apr 2025
https://github.com/yakupzengin/data-structures-and-algortihms
This repo contains implementation of data structures and algorithms using JAVA
algorithms algorithms-and-data-structures data structure
Last synced: 03 Dec 2025
https://github.com/divithraju/divith-raju-searchengine-wikipedia
search engine optimizationA complete search engine experience built on top of 75 GB Wikipedia corpus with subsecond latency for searches. Results contain wiki pages ordered by TF/IDF relevance based on given search word/s. From an optimized code to the K-Way mergesort algorithm, this project addresses latency, indexing, and big data challenges.
algorithms data dataengineering inverted-index linux merge-sort nlp project project-repository python3 serchengine software-engineering ubuntu wikipedia
Last synced: 16 May 2026
https://github.com/mohasarc/treeviz
The best tree data-structures visualization tool
data structures visualization visualization-tools
Last synced: 25 Apr 2026
https://github.com/ciscorn/tinygrib2
(experimental) A tiny toolkit for parsing JMA's GRIB2 files.
data grib grib2 meteorology rust weather
Last synced: 26 Apr 2026
https://github.com/lastancientone/amd-vs-nvda
Analyzing 2 technology stocks using Master Analyst Program (MAP).
data data-analysis data-structures data-visualization excel forecasting time-series-analysis
Last synced: 15 May 2025
https://github.com/openpeeps/zxc-nim
Bindings to the ZXC compression library, a LZ77-based compressor optimized for high decompression speed
archive compression compressor data decompression game-assets lossless lossless-compression lz77 nim nim-bindings nim-package nim-wrapper openpeeps zxc
Last synced: 07 Jun 2026
https://github.com/mark-summerfield/uxf
Uniform eXchange Format (uxf) is a plain text human readable optionally typed storage format that supports custom types. It may serve as a convenient alternative to csv, ini, json, sqlite, toml, xml, or yaml.
data ini json parser pretty-printer sqlite storage-engine toml xml yaml
Last synced: 08 Oct 2025
https://github.com/machu-gwu/constant2-project
provide extensive way of managing your constant variable.
configuration constants data developer-tools python
Last synced: 26 May 2026
https://github.com/ssiarhei115/customer-classification
Developing ML model predicting bank' customer inclination to open a deposit
big-data big-data-analytics data data-science data-visualization mashine-learning
Last synced: 09 Apr 2025
https://github.com/skywarth/fenrir-wolfpack-simulator
Simulating wolfpack behaviours and future of the pack in an environment using Javascript and data trees.
data data-structures javascript max-heap simulation simulations wolfpack
Last synced: 14 Oct 2025