data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum
- Last updated: 2026-07-02 00:07:45 UTC
https://github.com/missiontoscale/bluesky-scraper
This is a work of art that enables you to scrape data off BlueSky.
analytics bluesky bluesky-api bluesky-client data datascraper-framework datascraping scraping social-media web webscraping
Last synced: 19 Jun 2026
https://github.com/bernard-ng/drc-news-corpus
DRC News Corpus: Towards a scalable and efficient system for Congolese news dataset curation
aggregator data news nlp politics
Last synced: 06 Sep 2025
https://github.com/fabriciopsouza/covid-19-demographic-social-dataset
A social demographic dataset for analysis of the COVID-19 pandemic.
alteryx coronavirus coronavirus-analysis coronavirus-dataset covid-19 covid19 covid19-data data data-science dataset enrichment-analysis timeseries timeseries-analysis timeseries-clustering timeseries-covid-19 timeseries-database timeseries-segmentation timeseriesclassification
Last synced: 31 May 2026
https://github.com/marcuwynu23/phaddress
Data API of regions, provinces, cities/municipalities, and barangays of the Philippines
address address-data-api api barangay city data geolocation municipalities provinces
Last synced: 14 Feb 2026
https://github.com/azawawi/perl6-msgpack
Perl 6 Interface to libmsgpack
data messagepack msgpack perl6 wrapper
Last synced: 12 Jun 2025
https://github.com/georgetdn/syscppcp
Store C++ class data in a file (persistence) and manipulate it programmatically or using Small SQL (included)
class data framework object persistence serialize sql windows
Last synced: 04 Apr 2025
https://github.com/praveenpuglia/css-support
The source of truth for CSS browser support info
api browser compatibility css data properties selectors support
Last synced: 31 Mar 2025
https://github.com/datafold/vhol-demo
Get hands-on examples of dbt + Datafold CI/CD workflows
data data-engineering datafold dbt diff
Last synced: 28 Dec 2025
https://github.com/yorkulibraries/vendorpol
URLs for vendor privacy policies and terms of use.
Last synced: 15 Oct 2025
https://github.com/keosariel/nairagazer-clustered-news
Provides clustered news data, specifically Nigerian news. This repo contains Nigerian news and its coverage. Data is from Nairagazer.
ai data data-science news nigeria nigerian-data python
Last synced: 30 Aug 2025
https://github.com/debdutto/algorhythm
Algorithmic music driven by data and/or algorithms
Last synced: 18 Apr 2026
https://github.com/doughtnerd/pod
Read and write Excel data with Java
data excel extract poi-library
Last synced: 08 Apr 2025
https://github.com/pawelzny/vo
DDD Value Object implementation
data ddd-patterns object python3 value
Last synced: 15 Feb 2026
https://github.com/sdhutchins/jxn-open-data-api
Access Jackson, MS open government data using a python API wrapper.
api data jackson jxn mississippi open-gov
Last synced: 08 Apr 2025
https://github.com/mihasm/arso-scraper
Unofficial Python CLI tool for downloading automated sensor weather data from the Slovenian Environment Agency.
api arso cli data historical-data meteorological python slovenia weather
Last synced: 14 Feb 2026
https://github.com/leeper/mcode
Functions to merge and recode across multiple variables
data data-transformation r recode recoding
Last synced: 16 May 2025
https://github.com/lmantw/binarion
A simple binary format for storing JavaScript objects.
binary data decoding encoding format javascript
Last synced: 02 Sep 2025
https://github.com/ymougenel/referencecollector
Helps you gather, store, and share reference links
ansible data docker keycloak kotlin spring-boot thymeleaf
Last synced: 14 Apr 2026
https://github.com/chaitanyac22/hr_policy_query_resolution_with_retrieval_augmented_generation_rag
This repository contains an HR Policy Query Resolution system using Retrieval-Augmented Generation (RAG). It leverages a 4-bit quantized Mistral-7B-Instruct-v0.2 LLM and JP Morgan Chase’s publicly available Code of Conduct documents to generate accurate, contextually relevant responses for HR policy queries.
artificial-intelligence data hr large-language-models llm mistral-7b nlp pipeline prompt-engineering quantization rag retrieval-augmented-generation
Last synced: 12 Feb 2026
https://github.com/poncoe/passdatatoanotherfragment
Exercise: passing data to another Fragment
android android-app android-application android-studio data fragment fragments kotlin kotlin-android passing-parameters passingdataintent viewmodel
Last synced: 23 Jun 2026
https://github.com/lastancientone/amd-vs-nvda
Analyzing 2 technology stocks using Master Analyst Program (MAP).
data data-analysis data-structures data-visualization excel forecasting time-series-analysis
Last synced: 15 May 2025
https://github.com/bastgau/snow-revoke-privileges
Script designed to simplify the management of permissions in your Snowflake databases.
data database dba dev-container python snowflake
Last synced: 20 Apr 2025
https://github.com/jaldekoa/nyfedapi
A Python wrapper to easily retrieve data from the Federal Reserve Bank of New York (FRBoNY) official API in pandas format.
api api-wrapper banking data finance pandas python united-states
Last synced: 08 Feb 2026
https://github.com/ingmarboeschen/jatsdecoderevaluation
Evaluation data and code
Last synced: 04 Feb 2026
https://github.com/uk-ipop/open-data-pipeline
A pipeline for processing, enhancing, and sharing open datasets.
actions automation data python
Last synced: 25 May 2026
https://github.com/cerema/groum
Command-line utility for converting traffic order data
Last synced: 06 Feb 2026
https://github.com/vatshayan/final-year-project-image-recognition
Machine Learning project to recognize faces from an Image
btech computerscience data facial final image imageclassification learning machine project recognition science students year
Last synced: 29 May 2026
https://github.com/ballerina-platform/module-ballerina-data.csv
The Ballerina CSV Data Library is a comprehensive toolkit designed to facilitate the handling and manipulation of CSV data within Ballerina applications. It streamlines the process of converting CSV data to native Ballerina data types, enabling developers to work with CSV content seamlessly and efficiently.
ballerina ballerina-csv csv csv-data data
Last synced: 29 Jan 2026
https://github.com/deepwaterpaladin/statscanpy
Basic package for querying & downloading StatsCan data by table name.
Last synced: 16 Jan 2026
https://github.com/erwan-simon/aws-data-platform-framework
A unified framework to industrialize data ingestion, transformation and pipeline execution on AWS using Terraform, from infrastructure provisioning to runtime execution, designed as a reusable and standalone data platform.
aws data data-framework datalake docker iceberg python spark step-functions terraform terraform-module
Last synced: 23 May 2026
https://github.com/drkenreid/introductory-data-science
Hands-on machine learning tutorials in Google Colab, covering various algorithms and techniques for learners at different levels.
cnn data data-science deep-learning learning-datascience learning-machine-learning learning-python neural-network neural-networks regression rnn science tutorial tutorial-exercises tutorials
Last synced: 28 Jan 2026
https://github.com/gonzalezlrjesus/covid-19API
Converts the data provided by the Johns Hopkins University Center on confirmed cases, deaths, and recoveries of COVID-19 by country from CSV format to JSON format.
api api-rest api-server coronavirus covid-19 data go golang json
Last synced: 06 May 2025
https://github.com/flrd/standardlastprofile
R Data Package for BDEW Standard Load Profiles in Electricity
Last synced: 16 Mar 2026
https://github.com/cosmos-loops/cosmos-efcore
Cosmos.EntityFrameworkCore is a part of Cosmos.Data, an inline project of COSMOS LOOPS PROGRAMME. This repository provides a package of Microsoft.EntityFrameworkCore to improve development efficiency.
cosmos-loops data efcore entityframeworkcore
Last synced: 14 Aug 2025
https://github.com/0xdir/relief_web_dart
A Future-based wrapper around the Relief Web API, to retrieve information on humanitarian news, reports, training, jobs, and disasters
api dart data humanitarian jobs
Last synced: 11 Jun 2026
https://github.com/tomwhite/chernoff
A visual mood indicator. One of the first Java programs I ever wrote.
chernoff-faces data visualization
Last synced: 20 Apr 2026
https://github.com/imtiaz-emu/exploratory-data-analysis-with-r
Data Transformation, Descriptive statistics, data visualization, Linear regression using R
data dplyr ggplot2 r rstudio visualization
Last synced: 15 Mar 2025
https://github.com/zgbjgg/quetzal-examples
Examples using Quetzal :rocket: :bird:
analytics dashboard data data-visualization elixir erlang plotly web-app
Last synced: 24 Apr 2026
https://github.com/rohan-paul/machine-learning-and-deep-learning-tutorial-notebooks
Various Machine Learning and Deep Learning Tutorial Notebooks in Blog Format
data data-analysis data-science deep-learning deep-learning-tutorial deep-neural-networks machine-learning machine-learning-algorithms machinelearning neural-network pytorch pytorch-implementation pytorch-tutorial tensorflow
Last synced: 09 May 2026
https://github.com/karashiiro/lodestone-id-time
Data scraper, formula and reference implementation for the estimated creation time of a FFXIV character given its Lodestone ID.
data ffxiv ffxiv-character lodestone
Last synced: 30 Jun 2025
https://github.com/yashika-malhotra/cardioflex-treadmill-analysis-using-descriptive-statistics-probability
Descriptive analysis and visualization of CardioFlex treadmill data to provide insights and recommendations to improve their userbase.
colab-notebook data data-visualization jupyter-notebook matplotlib numpy pandas python seaborn
Last synced: 12 Apr 2026
https://github.com/mohasarc/treeviz
The best tree data-structures visualization tool
data structures visualization visualization-tools
Last synced: 25 Apr 2026
https://github.com/kvstore-io/sdk-java
api data java sdk sdk-java serverless storage
Last synced: 14 Jan 2026
https://github.com/mujadded/facebook_scrapper
The Facebook scraper gem that doesn't need the API
data data-mining facebook ruby-gem scrapper selenium-webdriver
Last synced: 28 Oct 2025
https://github.com/iusztinpaul/airbnb-data-analysis
Airbnb data analysis on the biggest cities in The Netherlands following the CRISP-DM methodology.
airbnb data datanalysis datascience machine-learning numpy pandas python
Last synced: 06 May 2026
https://github.com/kefniark/kaaya
JS Library for State management and Data synchronization between Applications
data game kaaya mutation network serialization state-management
Last synced: 06 Jun 2026
https://github.com/davemlz/master_of_datascience
Master of Data Science repository
data data-mining data-science database r rmd sql sqlite statistics
Last synced: 14 Apr 2026
https://github.com/OliverHennhoefer/shiny-template-interactive-table
Example of interactively adding rows / deleting rows by selecting directly in a data.table (DT) in Shiny
button data delete dt r select selection server shiny shiny-applications shiny-apps shiny-r shinyapps table ui userinterface
Last synced: 30 Jul 2025
https://github.com/guslovesmath/top_tech_sp_500_forecasting
Forecasting the stock market is difficult. I sought to observe the relationship between Apple's stock price and others in the S&P 500. In doing this, I was able to conclude that stocks in the tech industry can help predict a trend in Apple's percent change.
arima-forecasting arima-model data data-science forecasting vector-autoregression
Last synced: 14 Mar 2025
https://github.com/doctorlai/hex-viewer
Simple File Viewer in HEX
application data files hacktoberfest hex-viewer hexeditor hexidecimal web-app
Last synced: 09 Oct 2025
https://github.com/ivangrigorov/neutrino-search-engine
A Java search engine for both HTML and document-type files
data data-analysis data-knowledge information-extraction information-retrieval java-language search-engine
Last synced: 31 Mar 2025
https://github.com/ium101/files-and-folders-lister-z
Files and Folders Lister Z is a utility for listing the contents of directories on your computer. It provides both a command-line and a graphical user interface (GUI) for easy use.
application application-code brasil brazil cmd command data database databases exe filemanagement filesystem linux lowcode macos python sh tool utility windows
Last synced: 09 Oct 2025
https://github.com/henrylin03/video-games
Using Python and SQL to clean, analyse and visualise video games' data from Metacritic. Includes scraping using BeautifulSoup.
analysis beautifulsoup beautifulsoup4 data data-analysis data-science eda jupyter-notebook pandas python sql sqlite3 video-game video-games
Last synced: 14 Apr 2026
https://github.com/antononcube/raku-data-importers
Various data importing routines with a unified interface (data-import, slurp).
data data-ingestion raku rakulang slurp
Last synced: 23 Feb 2026
https://github.com/ismet55555/pdw-asym-2link
Clear and easy way of simulating a passive dynamic walker (PDW) model derived and executed using MATLAB.
data dynamics inverted-pendulum matlab numerical-simulations passive-dynamic-walker passive-dynamics ramp research robotics simulation slope walking-simulator
Last synced: 29 Apr 2026
https://github.com/rastmob/wordpress-llms-output-plugin
A WordPress plugin to export posts, pages, and custom post types as JSON for training Language Models (LLMs).
ai data llm llms training training-data wordpress wordpress-development wordpress-plugin
Last synced: 03 May 2026
https://github.com/lafayettegabe/g2m-insight-for-cab-investment-firm
📊 Exploratory Data Analysis (EDA) on multiple datasets related to the cab industry in the US, to provide actionable insights and recommendations to a private firm looking to invest in the market. The analysis includes data cleaning, transformation, visualization, and hypothesis testing.
big-data data data-analysis data-science data-visualization eda gotomarket
Last synced: 13 Jun 2025
https://github.com/a3r0id/lightshot-data-miner
A random idea I had a while back to make a data miner for Lightshot. Never released this, but after a friend sent me a post about Lightshot's transparency I figured it'd be a good time to release this. I've included some output from a run before making the repo. I am not responsible for the imagery or its contents.
brute-force bruteforce data dataset face-recognition image-processing lightshot mining scraper scraping text-recognition
Last synced: 19 Oct 2025
https://github.com/sanand0/imdbscrape
A weekly archive of the IMDb Top 250 results. Automatically scraped via GitHub Actions. Useful for seeing trends in the IMDb Top 250.
Last synced: 30 May 2026
https://github.com/mystpi/crossings
🌉 A tiny library focused on easily connecting JS to HTML.
connect data frontend html javascript reactive simple small tiny
Last synced: 10 Jun 2026
https://github.com/noklam/blog_archive_fastpage
Nok's data science blog
blog data data-science machine-learning python sceince
Last synced: 01 May 2026
https://github.com/joelllllll/up-sync
Sync account and transaction data from up bank to your local environment
accounts bank data postgres sync transactions up upbank
Last synced: 06 Jul 2025
https://github.com/yeisonmontoya1815/machine-learning_prediction_can_inflation
We aim to predict trends in the Canadian market basket using sentiment analysis techniques. Sentiment analysis involves analyzing text data to determine the sentiment expressed, whether positive, negative, or neutral.
algorithms-and-data-structures data data-analysis data-science data-visualization feature-engineering machine-learning matplotlib-pyplot numerical-analysis numpy pandas pipelines python sklearn structured-data super unsupervised-learning
Last synced: 05 Feb 2026
https://github.com/eby8zevin/android-pos4122020
The Next Project . . .
android android-app android-application android-database android-studio androidstudio create data database database-sqlite delete point-of-sale pos read search sqlite update
Last synced: 13 Oct 2025
https://github.com/mednour2019/devolap
OLAP Cube Dispatcher Tool
analysis-services csharp data excel excel-export kpi mdx metroframework mvvm-architecture sql wpf
Last synced: 27 Jan 2026
https://github.com/tayeva/eia-client-python
EIA Open Data API Client - Python
data open-source python python-3 python3
Last synced: 14 Oct 2025
https://github.com/14richa/patient-readmission-analysis
This project focuses on predictive modeling to foresee hospital readmissions of diabetic patients within 30 days post-discharge. By leveraging a dataset spanning a decade (1999-2008) and covering records from 130 US hospitals, the aim is to enhance healthcare management and patient outcomes.
analytics data jupyter-notebook numpy
Last synced: 29 Apr 2026
https://github.com/mrsaeeddev/data-science-roadmap-for-beginners
📈 A minimal and easy road map for beginners who want to dive into the field of Data Science
data data-science datascience python
Last synced: 29 Jun 2025
https://github.com/d2hydro/hydrodashboards
Open Source Dashboards for hydro data
bokeh data geopandas hydrology hydrometrics pandas sheetjs
Last synced: 26 Jul 2025
https://github.com/longzheng/southeastwater-usage-scraper
Extract hourly water usage data from South East Water portal website for digital water meters
australia data iot playwright southeastwater victoria water
Last synced: 06 Feb 2026
https://github.com/kom-senapati/ghw-data-hacks
🌍 Global Hack Week data projects, 📊 focused on exploration, manipulation, and analysis...
Last synced: 12 Mar 2025
https://github.com/zarr-developers/cookiecutter-zarr-store
Cookiecutter for Zarr store implementations
chunked data n-dimensional zarr
Last synced: 16 Jun 2025
https://github.com/mickeyshi-syd/actuarial-hackathon-2019
2019 Actuarial Hackathon
actuarial actuaries analytics data data-science hackathon
Last synced: 15 Jul 2025
https://github.com/laurensius/covid-19-info
data datatabel grafik peta visualisasi
Last synced: 21 May 2026
https://github.com/maskedsyntax/covid-tracker
Qt app to keep a track of Covid-19 records of different countries.
coronavirus coronavirus-tracking covid-19 data parsing scraping scraping-websites tracker web-scraping
Last synced: 29 Mar 2025
https://github.com/joisino/twinpaper
Code for "Twin Papers: A Simple Framework of Causal Inference for Citations via Coupling" (CIKM 2022)
causal-inference data research science-of-science
Last synced: 21 Mar 2025
https://github.com/pjmagee/starwars-data
A Star Wars Web app with Charts and entire Timeline events!
aspire blazor blazor-webassembly data database dataset docker dotnet json starwars starwars-data starwars-fandom
Last synced: 07 Mar 2026
https://github.com/andrewjbateman/mevn-stack-data
:clipboard: MEVN Info & Full stack MEVN app with CRUD functions
data database express expressjs full-stack info mevn mevn-stack middleware mongodb mongodb-atlas nodejs typescript vue vue3 vue3-typescript
Last synced: 07 Apr 2026
https://github.com/rawsashimi1604/jobextract
Scrapes LinkedIn data. Conducts sentiment analysis on what traits and qualifications employers are looking for.
data data-analysis data-analytics data-cleaning linkedin mvc python webscraper
Last synced: 06 Nov 2025
https://github.com/cherylisabella/statistics--caret
Training Regression and Classification Models using caret
data data-analysis data-mining data-science datascience dataset r statistics
Last synced: 24 Jun 2025
https://github.com/stdlib-js/ndarray-base-buffer-ctors
ndarray data buffer constructors.
array base buffer constructor constructors ctor ctors data dtype dtypes javascript multidimensional ndarray node node-js nodejs stdlib types utilities utility
Last synced: 06 Apr 2025
https://github.com/horizom/dto
Data Transfer Objects for all PHP applications.
Last synced: 14 Sep 2025
https://github.com/rrighart/rrighart.github.io
A webpage about data science, programming, statistics and related topics
analyses data data-mining programming statistics
Last synced: 20 Jan 2026
https://github.com/stdlib-js/array-bool
BooleanArray.
array binary bool boolean booleanarray data javascript mask node node-js nodejs stdlib structure typed typed-array types
Last synced: 13 May 2025