data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-07-02 00:07:45 UTC
- JSON Representation
https://github.com/jderstd/spec
A standard for JSON responses
data error jder json response specification structure
Last synced: 13 May 2026
https://github.com/josephbarbierdarnal/data-matplotlib-journey
datasets for matplotlib-journey.com
Last synced: 11 Mar 2026
https://github.com/jmsallan/esdata
A R package to bring Spanish economic databases into the R environment
data datasets ine inflation spain unemployment-data
Last synced: 18 Jan 2026
https://github.com/definetlynotai/llm_data
A bunch of very famous repos source code's in python as pure localdocs all in this repo to train CODE AI
c code-examples cpp cuda data data-dum jupyter-notebook llm llm-code llm-datasets programming-data programming-data-sets python3
Last synced: 08 Oct 2025
https://github.com/qeeqbox/data-compliance
Data compliance is the process of following various regulations and standards to ensure that sensitive digital assets (data) are guarded against loss, theft, and misuse
compliance data data-compliance infosecsimplified qeeqbox
Last synced: 19 Mar 2026
https://github.com/mongodb-developer/rocket-analytics
Learn how the various components of MongoDB's Developer Data Platform (DDP) can support app-driven and traditional analytics in real-time without duplicating data to other data stores. This demo was created for AWS re:Invent 2022 and presented at the MongoDB booth area at the Venetian expo hall.
data federation lucene lucenesearch mongodb s3 search sql
Last synced: 28 Apr 2026
https://github.com/msampathkumar/fakereceiptimagegenerator
Receipt Generator using PIL, Python
data fake generator image python receipt synthetic-data
Last synced: 06 Sep 2025
https://github.com/ggreen/data-orchestration-with-scdf-showcase
data-orchestration-with-scdf-showcase
data orchestration scdf spring
Last synced: 14 Jan 2026
https://github.com/metapsy-project/data-gambling-psyctr
Database of psychological interventions for problem gambling and gambling disorder.
Last synced: 02 Apr 2026
https://github.com/luminovrym/pbo-biodata
Simulasi Cara Input Data dengan OOP
Last synced: 18 Jun 2026
https://github.com/karlyndiary/imdb-data-analysis
Data Analysis on the IMDb Dataset using Python & Power BI.
data data-preprocessing data-visualization datacleaning dataset jupyter-lab power-bi powerbi-dashboards powerbi-report powerbi-visuals python
Last synced: 15 Apr 2026
https://github.com/haideralipunjabi/harrypotter-analysis
Repository with code to generate visualisations of Harry Potter Fanfiction and Books
analysis data harry-potter python visualization wordcloud
Last synced: 25 Mar 2025
https://github.com/eosdis-nasa/earthdata-pub
core code repository for Earthdata Pub (EDPub)
data earthdata edpub publication
Last synced: 17 Jan 2026
https://github.com/olgaele/playing-with-julia
Playing with data!
data data-analysis data-science julia statistics
Last synced: 19 Apr 2026
https://github.com/geo2france/odema-dashboard
Tableaux de bord thématiques Odema
application client-side dashboard data echarts maplibre odema react waste
Last synced: 05 Feb 2026
https://github.com/mews-labs/crep
This simple module aims at providing some function to tackle tabular data that have a continuous axis. In situations, this index can represent time, but this tool was originally developed to tackle rail way description.
data pandas pandas-dataframe python python3 rails-application time-series
Last synced: 23 Feb 2026
https://github.com/mukhopadhyay/opendata
Open Data ❤️
data data-science datasets deep-learning kaggle kaggle-dataset machine-learning open-source opendata
Last synced: 25 Apr 2026
https://github.com/fforres/webpack-plugin-dx-metrics
Webpack plugin to track webpack behaviour in datadog
data datadog developer-experience typescript visualization webpack
Last synced: 13 Feb 2026
https://github.com/rsn601kri/classifiertoidentifydogbreeds
This project aims to develop an image classification system to identify dog breeds using deep learning models. The classifier leverages pre-trained models from the PyTorch library, including ResNet18, AlexNet, and VGG16, to achieve accurate breed identification from images.
aiml cnn data model python pytorch
Last synced: 26 Sep 2025
https://github.com/huangcongqing/ranking-list
数据!important | 各种排行,榜单数据汇总 数据为王的时代 Data
Last synced: 15 Feb 2026
https://github.com/platob/yggdrasil
arrow data databricks pandas polars spark sql
Last synced: 02 Jun 2026
https://github.com/lewagon/matplotlib
Matplotlib examples for Le Wagon's Data Science bootcamp
Last synced: 13 Jul 2025
https://github.com/0xdir/htcds_dart
Human Trafficking Case Data Standard (HTCDS v0.2) objects, for easy creation, storage and transmission of case data related to human trafficking.
data humanitarian schema standards
Last synced: 24 Oct 2025
https://github.com/msrd0/gotham_formdata
Form data parsing for the gotham web framework
data form gotham html http multipart rust server urlencoded
Last synced: 16 Mar 2025
https://github.com/enes9103/039_react_task_tracker-json_server
api axios-react css3 data javascript json-server react reactjs responsive todoapp
Last synced: 11 Feb 2026
https://github.com/yashika-malhotra/exploratory-data-analysis-for-multinational-retail-corporation
Analysis via CLT and Visualization on Multinational Retail Corporation's data to provide insights and recommendations to improve their userbase.
colab-notebook data jupyter-notebook matplotlib numpy pandas python seaborn stats
Last synced: 11 Feb 2026
https://github.com/perezrd5/publicdataprojects
These are public database and data analysis projects from the portfolio of Doug Perez
data data-model data-modeling data-models data-science data-structure data-structures database microsoft-sql-server mysql olap olap-cube oltp postgresql ssas ssis ssrs t-sql
Last synced: 13 Apr 2026
https://github.com/wibosco/modelingformchanges-example
An example project to show how we can implement a model to simplify form validation
data swift unit-testing validator
Last synced: 16 Mar 2025
https://github.com/henrylin03/video-games
Using Python and SQL to clean, analyse and visualise video games' data from Metacritic. Includes scraping using BeautifulSoup.
analysis beautifulsoup beautifulsoup4 data data-analysis data-science eda jupyter-notebook pandas python sql sqlite3 video-game video-games
Last synced: 14 Apr 2026
https://github.com/financejs/discord-bot
A Discord Bot Used In Financejs Discord Server
data discord discord-bot discordjs-bot finance financejs financial
Last synced: 13 Apr 2026
https://github.com/mskian/tamil-words
Tamil words Collections with English Meaning - API and SQL Data.
api data javascript json json-api mysql pdo php sql tamil tamil-language tamil-sms tamilwords translate translator
Last synced: 14 Apr 2026
https://github.com/skylinenando/javascript
autocomplete browser data disable events javascript language loop
Last synced: 14 Feb 2026
https://github.com/mednour2019/devolap
OLAP Cube Dispatcher Tool
analysis-services csharp data excel excel-export kpi mdx metroframework mvvm-architecture sql wpf
Last synced: 27 Jan 2026
https://github.com/sapienzanlp/exploring-srl
Repository for the paper "Exploring Non-Verbal Predicates in Semantic Role Labeling: Challenges and Opportunities"
acl acl2023 conllu data dataset natural-language-processing nlp semantic-role-labeling srl
Last synced: 31 Jan 2026
https://github.com/joelllllll/up-sync
Sync account and transaction data from up bank to your local environment
accounts bank data postgres sync transactions up upbank
Last synced: 06 Jul 2025
https://github.com/everythings-gonna-be-alright/amazing-clickhouse-connector
Quick recording of analytics data
analytics clickhouse data k8s kubernetes
Last synced: 04 Jan 2026
https://github.com/doctorlai/hex-viewer
Simple File Viewer in HEX
application data files hacktoberfest hex-viewer hexeditor hexidecimal web-app
Last synced: 09 Oct 2025
https://github.com/natylaza89/covid19-il
Python package which brings a "Facade" interface for the client for using official covid 19 data of israeli data gov. ★19K+ Downloads★
api covid covid19 covid19-data data israel pandas python
Last synced: 13 Apr 2026
https://github.com/georgetdn/syscppcplinux
Store Linux C++ class data in a file ( persistence ) and manipulate it programmatically or using Small SQL (included)
class data framework linux object persistence serialize sql
Last synced: 12 Feb 2026
https://github.com/uk-ipop/open-data-pipeline
A pipeline for processing, enhancing, and sharing open datasets.
actions automation data python
Last synced: 25 May 2026
https://github.com/saleh0987/mohamed_saleh
That's my personal website where I show my skills and projects.
aos-animation axios boot data json nextjs portfolio portfolio-website projects react-icons reactjs sass swiper
Last synced: 09 Mar 2026
https://github.com/rcourivaud/rcourivaud.github.io
Raphaël Courivaud
data database datascience python
Last synced: 21 Apr 2026
https://github.com/colour-science/colour-hdri-examples-datasets
Colour - HDRI - Examples Datasets
color color-science color-space color-spaces colorspace colorspaces colour colour-science colour-space colour-spaces colourspace colourspaces data dataset datasets hdr hdri raw tone-mapping tonemapping
Last synced: 19 Mar 2026
https://github.com/divithraju/divith-raju-searchengine-wikipedia
search engine optimizationA complete search engine experience built on top of 75 GB Wikipedia corpus with subsecond latency for searches. Results contain wiki pages ordered by TF/IDF relevance based on given search word/s. From an optimized code to the K-Way mergesort algorithm, this project addresses latency, indexing, and big data challenges.
algorithms data dataengineering inverted-index linux merge-sort nlp project project-repository python3 serchengine software-engineering ubuntu wikipedia
Last synced: 16 May 2026
https://github.com/bastianolea/sinim_info_municipal
Base de datos del Sistema Nacional de Información Municipal, que incluye datos comunales sobre finanzas municipales, recursos humanos, educación, salud, pensiones, organizaciones sociales, y más.
chile comunas data estado laboral politica social tiempo
Last synced: 26 Oct 2025
https://github.com/yeisonmontoya1815/machine-learning_prediction_can_inflation
we aim to predict trends in the Canadian market basket using sentiment analysis techniques. Sentiment analysis involves analyzing text data to determine the sentiment expressed, whether positive, negative, or neutral.
algorithms-and-data-structures data data-analysis data-science data-visualization feature-engineering machine-learning matplotlib-pyplot numerical-analysis numpy pandas pipelines python sklearn structured-data super unsupervised-learning
Last synced: 05 Feb 2026
https://github.com/bastianolea/siedu_indicadores_urbanos
Datos del Sistema de Indicadores y Estándares de Desarrollo Urbano, con datos comunales sobre temas como transporte, urbanismo, servicios básicos, calidad de vida y más.
ambiental app chile ciudad comunas data estado social
Last synced: 19 Feb 2026
https://github.com/oliverhennhoefer/shiny-template-interactive-table
Example of interactively adding rows / deleting rows by selecting directly in a data.table (DT) in Shiny
button data delete dt r select selection server shiny shiny-applications shiny-apps shiny-r shinyapps table ui userinterface
Last synced: 16 Apr 2026
https://github.com/StudyResearchProjects/arrbuffstr
Creates Strings from ArrayBuffers and viceversa in NodeJS and the Browser
arraybuffer browser data node string transform
Last synced: 09 Oct 2025
https://github.com/nononoexe/setariaviridis
🌾 Field-collected data of green foxtail
data data-science dataset rpackage
Last synced: 27 Feb 2026
https://github.com/feltex/datahora-java
Aprenda a trabalhar com Data e Hora em Java com as novas classes LocalDateTime, LocalDate, DateTimeFormatter e outras novidades do pacote java.time.
brasil data date dateformat dateformat-brazil datetime hora java java11 localdatetime locale localization zoneddatetime
Last synced: 18 Jun 2026
https://github.com/missiontoscale/bluesky-scraper
This is a work of art that enables you to scrape data off BlueSky.
analytics bluesky bluesky-api bluesky-client data datascraper-framework datascraping scraping social-media web webscraping
Last synced: 19 Jun 2026
https://github.com/stefen-taime/real-time-data-pipeline-snake-game
Dynamic Snake Game: Unleashing Real-Time Streaming Analytics with Redis, Kafka, Flink, ClickHouse & Chart.js in an Online Snake Game via Flask API
chartjs clickhouse confluent-cloud data flask kafka-streams pipeline redis
Last synced: 04 May 2026
https://github.com/farovictor/mongodbextractor
This project is intended to be used as a data extractor to support ELT pipelines or any kind of process that requires a heavy data dump from MongoDb databases.
Last synced: 14 Jan 2026
https://github.com/ballerina-platform/module-ballerina-data.csv
The Ballerina CSV Data Library is a comprehensive toolkit designed to facilitate the handling and manipulation of CSV data within Ballerina applications. It streamlines the process of converting CSV data to native Ballerina data types, enabling developers to work with CSV content seamlessly and efficiently.
ballerina ballerina-csv csv csv-data data
Last synced: 29 Jan 2026
https://github.com/nop-dev/learning-js
Esse repositório contem todas as anotações que fiz enquanto estudava um módulo da trilha Explorer da Rocketseat sobre JavaScript. 🔰
data data-structures functions javascript js
Last synced: 17 Apr 2026
https://github.com/yakupzengin/data-structures-and-algortihms
This repo contains implementation of data structures and algorithms using JAVA
algorithms algorithms-and-data-structures data structure
Last synced: 03 Dec 2025
https://github.com/camara94/data-visualization-with-python
Data visualization and some of the best practices when creating plots and visuals. The history and architecture of Matplotlib, and how to do basic plotting with Matplotlib. Generating different visualization tools using Matplotlib such as line plots, area plots, histograms, bar charts, box plots, and pie charts. Seaborn, another data visualization library in Python, and how to use it to create attractive statistical graphics. Folium, and how to use to create maps and visualize geospatial data.
data data-science data-structures data-visualization python3
Last synced: 16 May 2026
https://github.com/freight-trust/edi-onboarding
ESC Guidelines for X12/EDIFACT Messages
b2b data data-interchange edi edi-xml edifact enterprise x12
Last synced: 04 Mar 2026
https://github.com/gauravkoradiya/tensorflow-data-and-deployement
This repository contains usage of data and deployment pipline in tensorflow.
data deployment machine-learning-algorithms pipline tensorflowjs
Last synced: 06 Oct 2025
https://github.com/eesunmoon/algorithms
[Fall 2020] Algorithms
algorithms algorithms-and-data-structures c data data-structures
Last synced: 01 Feb 2026
https://github.com/colour-science/colour-demosaicing-tests-datasets
Colour - Demosaicing - Tests Datasets
color color-science color-space color-spaces colorspace colorspaces colour colour-science colour-space colour-spaces colourspace colourspaces data dataset datasets de-mosaicing debayering demosaicing demosaicking raw
Last synced: 19 Mar 2026
https://github.com/rohan-paul/machine-learning-and-deep-learning-tutorial-notebooks
Various Machine Learning and Deep Learning Tutorial Notebooks in Blog Format
data data-analysis data-science deep-learning deep-learning-tutorial deep-neural-networks machine-learning machine-learning-algorithms machinelearning neural-network pytorch pytorch-implementation pytorch-tutorial tensorflow
Last synced: 09 May 2026
https://github.com/stdlib-js/datasets-cdc-nchs-us-births-1994-2003
US birth data from 1994 to 2003, as provided by the Center for Disease Control and Prevention's National Center for Health Statistics.
america babies births data dataset datasets javascript node node-js nodejs stdlib time-series timeseries united-states us usa
Last synced: 12 Oct 2025
https://github.com/rajatt95/python_rs
Programming | Python | PyCharm | Data Types | Tuple | Dictionary | If-Else | Loops - For, While | Functions | OOPS Principles | Constructor | String - SubString, Concatenation, Split, Strip | Read & Write data into files | JSON Parsing | CSV package | Web Scrapping
constructor csv-parser data dictionary functions if-else-statements json json-parser oops parser pycharm-ide python python-programming-language read-write-file strings tuple web-scrapping
Last synced: 15 Feb 2026
https://github.com/banbord/data-vis-tornados
This repository includes data files, processing scripts, visualization code, and documentation for our tornado data visualization project. It aims to provide insights into tornado patterns across the United States using interactive and informative visual representations.
d3-visualization d3js data javascript json visualization
Last synced: 24 Feb 2026
https://github.com/junkwaxhero/cardlists
Sports Card set lists in easily consumable JSON Format for databases, apps, websites, and more!
baseball baseball-cards baseball-data bowman data dataset datasets donruss fleer json json-schema panini topps upper-deck
Last synced: 24 Apr 2025
https://github.com/csadorf/pydata-ann-arbor-2018
Slides and notebooks demonstrating signac for PyData Ann Arbor Meetup 2018
data data-management jupyter signac workflow
Last synced: 04 Jun 2026
https://github.com/drkenreid/introductory-data-science
Hands-on machine learning tutorials in Google Colab, covering various algorithms and techniques for learners at different levels.
cnn data data-science deep-learning learning-datascience learning-machine-learning learning-python neural-network neural-networks regression rnn science tutorial tutorial-exercises tutorials
Last synced: 28 Jan 2026
https://github.com/anandchowdhary/health
🫀 @AnandChowdhary's body measurements
csv data fitness github-actions health
Last synced: 29 Apr 2026
https://github.com/jmbhughes/goes_solar_retriever
Tool to retrieve GOES-R Solar Data
data data-retrieval data-science goes-16 goes-satellite goes16 goes17 solar solar-physics
Last synced: 07 Jan 2026
https://github.com/chaitanyac22/hr_policy_query_resolution_with_retrieval_augmented_generation_rag
This repository contains an HR Policy Query Resolution system using Retrieval-Augmented Generation (RAG). It leverages a 4-bit quantized Mistral-7B-Instruct-v0.2 LLM and JP Morgan Chase’s publicly available Code of Conduct documents to generate accurate, contextually relevant responses for HR policy queries.
artificial-intelligence data hr large-language-models llm mistral-7b nlp pipeline prompt-engineering quantization rag retrieval-augmented-generation
Last synced: 12 Feb 2026
https://github.com/pitmonticone/covid-italy
References for COVID-19 situation in Italy.
coronavirus covid-19 covid-19-italy data data-analysis documentation testing
Last synced: 05 Apr 2026
https://github.com/oliver021/entity-dock
A superset with libraries, components, tools and more to work with entity on .Net
api asp-net-core controller data database dotnet entity entity-framework-core library model mvc netstandard orm support webapi
Last synced: 09 May 2026
https://github.com/ssiarhei115/customer-classification
Developing ML model predicting bank' customer inclination to open a deposit
big-data big-data-analytics data data-science data-visualization mashine-learning
Last synced: 09 Apr 2025
https://github.com/flrd/standardlastprofile
R Data Package for BDEW Standard Load Profiles in Electricity
Last synced: 16 Mar 2026
https://github.com/codecentric/reedelk-bookingintegrationservice
Example service for the blog post series about Reedelk
api api-gateway data integration integration-flow
Last synced: 16 Oct 2025
https://github.com/poncoe/passdatatoanotherfragment
Latihan Passing data Ke Fragment Lain
android android-app android-application android-studio data fragment fragments kotlin kotlin-android passing-parameters passingdataintent viewmodel
Last synced: 23 Jun 2026
https://github.com/dhimmel/het.io-rep-data
Data from Project Rephetio for the het.io website
browser data datatables drug-repurposing rephetio
Last synced: 07 Feb 2026
https://github.com/caelean/twittermap
Map of twitter user's influence as defined on by influencetracker
data google-maps maps sparql twitter visualization
Last synced: 14 Jun 2025
https://github.com/marek-jakub/monitoring
A university project concerning field data management for bird ringers.
bird data fieldwork management ringing
Last synced: 24 Jun 2026
https://github.com/dark-art108/yonk
A cli-utility to streamline data science work by creating templates
Last synced: 08 May 2026