data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-06-23 00:07:41 UTC
- JSON Representation
https://github.com/defano/chicago-oasis
A visualization of Chicago business accessibility by neighborhood or census tract.
census chicago data data-science javascript neighborhood
Last synced: 11 Mar 2026
https://github.com/quetz-al/quetzal-client
Python client for the Quetzal API
client data data-science openapi-client openapi3 python quetzal
Last synced: 28 Jul 2025
https://github.com/sanand0/imdbscrape
A weekly archive of the IMDB Top 250 results. Automatically scraped via GitHub Actions. Useful to see trends on IMDb Top 250
Last synced: 30 May 2026
https://github.com/Duartemartins/dados
Resultados de Eleições Portuguesas por Freguesia
data elections open-data portugal
Last synced: 20 Nov 2025
https://github.com/rn0x/aliexpress_product_data
استخراج بيانات المنتج من موقع علي إكسبريس
aliexpress aliexpress-api aliexpress-bot aliexpress-data aliexpress-json api data dropshipping express json nodejs
Last synced: 03 Oct 2025
https://github.com/lmantw/binarion
A simple binary format for storing JavaScript objects.
binary data decoding encoding format javascript
Last synced: 02 Sep 2025
https://github.com/oliver021/entity-dock
A superset with libraries, components, tools and more to work with entity on .Net
api asp-net-core controller data database dotnet entity entity-framework-core library model mvc netstandard orm support webapi
Last synced: 09 May 2026
https://github.com/divithraju/divith-raju-openmetadata
Open Standard for Metadata. A Single place to Discover, Collaborate and Get your data right.
automation bigdata bigdataanalytics data data-structures dataengineering datascience hacktoberfest2022 metadata metadata-extraction
Last synced: 20 Feb 2026
https://github.com/simoneas02/data-science
🐍 A planning study to become a data scientist and to improve my current skills. 🤘🏼🌻
data data-analysis data-science data-visualization deep-learning machine-learning pandas python3 r sql
Last synced: 12 Apr 2026
https://github.com/squareslab/probabilisticmodel_saner2018
Paper and supporting materials of the Probabilistic Model paper Accepted to SANER 2018
code data mausotog published replication
Last synced: 26 Oct 2025
https://github.com/ballerina-platform/module-ballerina-data.csv
The Ballerina CSV Data Library is a comprehensive toolkit designed to facilitate the handling and manipulation of CSV data within Ballerina applications. It streamlines the process of converting CSV data to native Ballerina data types, enabling developers to work with CSV content seamlessly and efficiently.
ballerina ballerina-csv csv csv-data data
Last synced: 29 Jan 2026
https://github.com/audeering/emodb
Publishes Berlin Database of Emotional Speech with audb
Last synced: 19 Oct 2025
https://github.com/phelipe-sempreboni/data-engineering
Repository for tutorials, information, notes and projects about data engineering.
data dataengineering engine engineering enviroment etl etl-pipeline pipeline project python
Last synced: 04 Oct 2025
https://github.com/yakupzengin/data-structures-and-algortihms
This repo contains implementation of data structures and algorithms using JAVA
algorithms algorithms-and-data-structures data structure
Last synced: 03 Dec 2025
https://github.com/financejs/discord-bot
A Discord Bot Used In Financejs Discord Server
data discord discord-bot discordjs-bot finance financejs financial
Last synced: 13 Apr 2026
https://github.com/a3r0id/lightshot-data-miner
A random idea I had a while back to make a data miner for lightshot. Never released this but after a friend sent me a post about lightshot's transparency I figured it'd be a good time to release this. I've included some output from a run before making the repo. I am not responsible for the imagery or it's contents.
brute-force bruteforce data dataset face-recognition image-processing lightshot mining scraper scraping text-recognition
Last synced: 19 Oct 2025
https://github.com/ingmarboeschen/jatsdecoderevaluation
Evaluation data and code
Last synced: 04 Feb 2026
https://github.com/mmaithani/loan-approvel-ml-model-with-insights
This project will approved or reject the loan applications. Public api, data insights and predictive models for loan prediction project are also provided
data data-science loan-prediction-analysis machine-learning visualization
Last synced: 16 Aug 2025
https://github.com/mmabiaa/data-structure-and-algorithms-java
Data structures and algorithms in java
algorithms algorithms-and-data-structures data data-structure-and-algorithm data-structures data-structures-algorithms data-structures-and-algorithms datastructures dsa dsa-learning-series dsa-practice java
Last synced: 09 Apr 2026
https://github.com/priyanka7411/customer-segmentation-churn-dashboard
📊 Streamlit + Plotly dashboard for customer segmentation, RFM analysis, and churn prediction using machine learning.
churn data machine-learning pandas prediction python rfm rfm-analysis streamlit visualization
Last synced: 14 Apr 2026
https://github.com/rajatt95/python_rs
Programming | Python | PyCharm | Data Types | Tuple | Dictionary | If-Else | Loops - For, While | Functions | OOPS Principles | Constructor | String - SubString, Concatenation, Split, Strip | Read & Write data into files | JSON Parsing | CSV package | Web Scrapping
constructor csv-parser data dictionary functions if-else-statements json json-parser oops parser pycharm-ide python python-programming-language read-write-file strings tuple web-scrapping
Last synced: 15 Feb 2026
https://github.com/windwalker-io/data
[READ ONLY] A library contains data/collection objects with null-object pattern.
collection collections data data-object iterator nullobject value-object
Last synced: 12 Mar 2026
https://github.com/frefrik/covid19norge-data
🦠 COVID-19 Datasets for Norway
covid covid-19 covid19 covid19-data csv data datasets norge norway norwegian smittestopp vaccine
Last synced: 09 Apr 2026
https://github.com/sdhutchins/jxn-open-data-api
Access Jackson, MS open government data using a python API wrapper.
api data jackson jxn mississippi open-gov
Last synced: 08 Apr 2025
https://github.com/programmer-rd-ai/open-images-v6
Open-Images-V6
ai data dataset dl images ml object-detection open open-images programming python v6
Last synced: 03 Aug 2025
https://github.com/parimala24-ds/datascientistmlinterviewprep24
DATASCIENTST ML INTERVIEW PREP24
data decisiontree interviewquestions linear-regression logistic machine-learning matplotlib numpy pandas python seaborn sklearn
Last synced: 12 Apr 2025
https://github.com/lastancientone/amd-vs-nvda
Analyzing 2 technology stocks using Master Analyst Program (MAP).
data data-analysis data-structures data-visualization excel forecasting time-series-analysis
Last synced: 15 May 2025
https://github.com/steelcake/cherry-pipelines
A collection of pipelines built with cherry
blockchain clickhouse data pipeline pyhton
Last synced: 09 Mar 2026
https://github.com/sadcenter/messenger
Data messaging system between servers using popular messaging brokers
Last synced: 06 Aug 2025
https://github.com/missiontoscale/bluesky-scraper
This is a work of art that enables you to scrape data off BlueSky.
analytics bluesky bluesky-api bluesky-client data datascraper-framework datascraping scraping social-media web webscraping
Last synced: 19 Jun 2026
https://github.com/mihasm/arso-scraper
Unofficial Python CLI tool for downloading automated sensor weather data from the Slovenian Environment Agency.
api arso cli data historical-data meteorological python slovenia weather
Last synced: 14 Feb 2026
https://github.com/bradlindblad/quotableoffice
Repo for the quotable office R Shiny app
data datascience golem-apps r shiny shiny-apps text text-mining
Last synced: 26 May 2026
https://github.com/colour-science/colour-demosaicing-tests-datasets
Colour - Demosaicing - Tests Datasets
color color-science color-space color-spaces colorspace colorspaces colour colour-science colour-space colour-spaces colourspace colourspaces data dataset datasets de-mosaicing debayering demosaicing demosaicking raw
Last synced: 19 Mar 2026
https://github.com/stdlib-js/array-uint16
Uint16Array.
array data int integer javascript node node-js nodejs short stdlib structure typed typed-array types uint uint16 uint16array unsigned
Last synced: 22 Apr 2025
https://github.com/achraf-oujjir/chatgpt-users-tweets-pipeline
🐦🔵End-to-end ChatGPT Users' Tweets Data Pipeline with Python 🐍, Hive 🐝, and Power BI 📊
bash-script cloudera data data-engineering data-vizualisation datawarehouse hdfs hive networking powerbi python sentiment-analysis sftp shell tweepy twitter-api ubuntu virtualization vmware-workstation
Last synced: 28 Feb 2026
https://github.com/xxczaki/parsify-plugin-covid19
Parsify plugin, that adds COVID 19-related variables 🦠
confirmed coronavirus covid19 data deaths fun math parser parsify parsify-plugin plugin variable variables
Last synced: 13 Mar 2026
https://github.com/chaitanyac22/hr_policy_query_resolution_with_retrieval_augmented_generation_rag
This repository contains an HR Policy Query Resolution system using Retrieval-Augmented Generation (RAG). It leverages a 4-bit quantized Mistral-7B-Instruct-v0.2 LLM and JP Morgan Chase’s publicly available Code of Conduct documents to generate accurate, contextually relevant responses for HR policy queries.
artificial-intelligence data hr large-language-models llm mistral-7b nlp pipeline prompt-engineering quantization rag retrieval-augmented-generation
Last synced: 12 Feb 2026
https://github.com/henrylin03/video-games
Using Python and SQL to clean, analyse and visualise video games' data from Metacritic. Includes scraping using BeautifulSoup.
analysis beautifulsoup beautifulsoup4 data data-analysis data-science eda jupyter-notebook pandas python sql sqlite3 video-game video-games
Last synced: 14 Apr 2026
https://github.com/bredalis/kpopnews
A place to see kpop news 📝
backend css data feedparser flask frameworks frontend html jinja2 kpop mongodb mongodb-atlas news newsletter os pages pymongo python requests web
Last synced: 12 Feb 2026
https://github.com/guslovesmath/top_tech_sp_500_forecasting
Forecasting the stock market is difficult. I sought to observe the relationship between Apple's stock price and others in the S&P500. In doing this, I was able to conclude that stocks in the tech industry can help predict a trend in Apple's Percent change.
arima-forecasting arima-model data data-science forecasting vector-autoregression
Last synced: 14 Mar 2025
https://github.com/datafold/vhol-demo
Get hands-on examples of dbt + Datafold CI/CD workflows
data data-engineering datafold dbt diff
Last synced: 28 Dec 2025
https://github.com/oliverhennhoefer/shiny-template-interactive-table
Example of interactively adding rows / deleting rows by selecting directly in a data.table (DT) in Shiny
button data delete dt r select selection server shiny shiny-applications shiny-apps shiny-r shinyapps table ui userinterface
Last synced: 16 Apr 2026
https://github.com/lxcoding06/e-gereja
Website CRUD untuk Gereja, untuk mengatur data jemaat, data kematian, data pernikahan dan data baptis
data data-gereja e-gereja gereja gereja-online jemaat kematian pernikahan
Last synced: 15 May 2025
https://github.com/agnosticeng/agx
Query and explore local and remote data with Clickhouse
clickhouse d3 data rust svelte
Last synced: 26 Oct 2025
https://github.com/bkamapantula/india-pc-nfhs4
Parliamentary constituency factsheet for indicators of nutrition, health, and development in India using NFHS4 data.
data government health india nfhs nfhs4
Last synced: 19 Mar 2026
https://github.com/mawburn/across-a-thousand-dead-worlds-data
Across a Thousand Dead Worlds Data
Last synced: 21 Apr 2026
https://github.com/askaniy/celestialocationsmaker
Tool for making Celestia location files
celestia data geology locations mapping planetary-science space
Last synced: 14 Mar 2025
https://github.com/ctechhindi/auto-fill-form-data
AUTO FILL AND AUTOCOMPLETE USER DATA WITH KEY NAME
autocomplete chrome-extension data extension
Last synced: 17 Apr 2026
https://github.com/leeper/mcode
Functions to merge and recode across multiple variables
data data-transformation r recode recoding
Last synced: 16 May 2025
https://github.com/kocyigitkim/realtime.io
Real time data streaming & socket programming library
data realtime socket streaming
Last synced: 29 Jul 2025
https://github.com/bdpedigo/neuropull
A (soon to be) lightweight Python package for accessing single-cell connectome networks with metadata.
connectome connectomes connectomics data dataset networks networks-biology
Last synced: 05 Oct 2025
https://github.com/csadorf/pydata-ann-arbor-2018
Slides and notebooks demonstrating signac for PyData Ann Arbor Meetup 2018
data data-management jupyter signac workflow
Last synced: 04 Jun 2026
https://github.com/stdlib-js/utils-named-typed-tuple
Named typed tuple.
array collection data data-structure data-structures javascript list named node node-js nodejs stdlib structure tuple typed typed-array util utilities utility utils
Last synced: 14 Apr 2025
https://github.com/zalweny26/tools
Just a bunch of tools made in TypeScript.
algorithms data dimensionality distances helpers reduction sortings structures tools utils
Last synced: 03 Feb 2026
https://github.com/mo-karbalaee/introduction-to-data-science-sbu
Reports and full documentation of the introduction to data science course held at SBU
data data-science python shahid-beheshti-university
Last synced: 02 Aug 2025
https://github.com/rcourivaud/rcourivaud.github.io
Raphaël Courivaud
data database datascience python
Last synced: 21 Apr 2026
https://github.com/zgbjgg/quetzal-examples
Examples using Quetzal :rocket: :bird:
analytics dashboard data data-visualization elixir erlang plotly web-app
Last synced: 24 Apr 2026
https://github.com/stdlib-js/datasets-cdc-nchs-us-births-1994-2003
US birth data from 1994 to 2003, as provided by the Center for Disease Control and Prevention's National Center for Health Statistics.
america babies births data dataset datasets javascript node node-js nodejs stdlib time-series timeseries united-states us usa
Last synced: 12 Oct 2025
https://github.com/noklam/blog_archive_fastpage
Nok's data science blog
blog data data-science machine-learning python sceince
Last synced: 01 May 2026
https://github.com/macsual/dotgov-jamaica-domains
A listing of .gov.jm domains.
Last synced: 03 Jan 2026
https://github.com/mohasarc/treeviz
The best tree data-structures visualization tool
data structures visualization visualization-tools
Last synced: 25 Apr 2026
https://github.com/nrennie/londonmarathon
R package containing data relating to London Marathon.
Last synced: 02 Apr 2025
https://github.com/bastgau/snow-revoke-privileges
Script designed to simplify the management of permissions in your Snowflake databases.
data database dba dev-container python snowflake
Last synced: 20 Apr 2025
https://github.com/kefniark/kaaya
JS Library for State management and Data synchronization between Applications
data game kaaya mutation network serialization state-management
Last synced: 06 Jun 2026
https://github.com/purarue/listenbrainz_export
Export your scrobbling history from ListenBrainz
data data-export music scrobbling
Last synced: 24 Jan 2026
https://github.com/andrey-tech/data-storage-php
Простое хранилище данных в виде ключ-значение в JSON-файлах с разделяемой блокировкой на чтение и эксклюзивной блокировкой на запись.
data data-storage files json php php7 storage
Last synced: 29 Apr 2026
https://github.com/yash22222/data-analysis-with-python
This repository provides a practical introduction to data acquisition and analysis using Pandas. It covers loading datasets, exploring data, manipulating data, and gaining insights through statistical summaries. Ideal for beginners, it offers code examples and explanations to enhance your data manipulation skills using Pandas for Python.
binning data data-acquisition data-analysis data-binning data-cleaning data-formatting data-integration data-normalization data-preprocessing data-science data-transformation data-wrangling dataframe description numpy pandas pandas-dataframe python python3
Last synced: 09 Apr 2026
https://github.com/imtiaz-emu/exploratory-data-analysis-with-r
Data Transformation, Descriptive statistics, data visualization, Linear regression using R
data dplyr ggplot2 r rstudio visualization
Last synced: 15 Mar 2025
https://github.com/ayemunhossain/firebase-realtime-db-advance-query
Firebase real time database, query with nodejs.
ayemunhossain data firebase firebase-functions firebase-realtime-database nodejs query
Last synced: 06 May 2026
https://github.com/marcuwynu23/phaddress
Data API of Regions,Provinces, CityMunicipalities, and Barangay of the Philippines
address address-data-api api barangay city data geolocation municipalities provinces
Last synced: 14 Feb 2026
https://github.com/erictleung/erictleung.github.io
:memo: Source code for my website, portfolio of projects, and more
bioinformatics blog data data-analysis data-science github-jekyll github-page jekyll lanyon open-science open-source software-engineering
Last synced: 21 Jan 2026
https://github.com/kaos599/apollo-synthetic-data-generator
Apollo is a Python GUI application designed to simplify the complex process of generating random data based on fixed values. It allows users to generate various types of binary datasets, such as Yes/No type questions, by specifying probabilities.
data data-engineering data-generation data-generator data-science faker-library machine-learning tkinter-gui
Last synced: 22 Jul 2025
https://github.com/1sumer/sql
This repository contains SQL scripts and data for various analytical and database management tasks. The project is designed to demonstrate SQL capabilities in handling complex queries, data analysis, and database design. It includes datasets related to e-commerce and streaming services, with a focus on real-world scenarios and use cases.
analytics data data-analysis data-storage sql vscode
Last synced: 19 Jan 2026
https://github.com/OliverHennhoefer/shiny-template-interactive-table
Example of interactively adding rows / deleting rows by selecting directly in a data.table (DT) in Shiny
button data delete dt r select selection server shiny shiny-applications shiny-apps shiny-r shinyapps table ui userinterface
Last synced: 30 Jul 2025
https://github.com/frnt-end/weather-app-react
:atom_symbol: React project - Fetch and Toggle display of current weather in Berlin, Paris, New York & London (tabs) - using axios for API fetch. Watch DEMO 🌞 https://Frnt-End.github.io/Weather-App-React 👈
api axios axios-react background card current-weather data fetch gh-pages react reactjs tabs toggle ui usestate usestate-hook weather weather-app weather-information weatherapp
Last synced: 18 Feb 2026
https://github.com/erwan-simon/aws-data-platform-framework
A unified framework to industrialize data ingestion, transformation and pipeline execution on AWS using Terraform, from infrastructure provisioning to runtime execution, designed as a reusable and standalone data platform.
aws data data-framework datalake docker iceberg python spark step-functions terraform terraform-module
Last synced: 23 May 2026
https://github.com/cgossain/genericresultscontroller
A generic NSFetchedResultsController replacement for iOS, written in Swift.
api client connector controller coredata data database fetch firebase firebase-firestore firebase-realtime-database generic ios mongodb nsfetchedresultscontroller results source swift-generics tableview ui
Last synced: 19 Feb 2026
https://github.com/fabriciopsouza/covid-19-demographic-social-dataset
A social demographic dataset for analysis of the COVID-19 pandemic.
alteryx coronavirus coronavirus-analysis coronavirus-dataset covid-19 covid19 covid19-data data data-science dataset enrichment-analysis timeseries timeseries-analysis timeseries-clustering timeseries-covid-19 timeseries-database timeseries-segmentation timeseriesclassification
Last synced: 31 May 2026
https://github.com/marek-jakub/monitoring
A university project concerning field data management for bird ringers.
bird data fieldwork management ringing
Last synced: 24 Jun 2026
https://github.com/lovethebomb/data-tiles
🍜 Data Tiles is a small website that shows data.
data express javascript nextjs typescript
Last synced: 10 Apr 2026
https://github.com/ashwinpn/visualization
Data Visualization using Matplotlib, Pandas Visualization, Seaborn, ggplot, and Plotly.
analysis data data-analysis data-science data-visualization graphs plots python python3 visualization
Last synced: 13 Apr 2026
https://github.com/joelllllll/up-sync
Sync account and transaction data from up bank to your local environment
accounts bank data postgres sync transactions up upbank
Last synced: 06 Jul 2025
https://github.com/arcticsnow/climatepy
Collection of tools to perform timeseries analysis on climate data (Observation and Downscaled)
climate data era5 meteorological-data noaa-data pandas timeseries weather wmo xarray
Last synced: 05 Feb 2026
https://github.com/tayeva/eia-client-python
EIA Open Data API Client - Python
data open-source python python-3 python3
Last synced: 14 Oct 2025
https://github.com/amacd31/daily_hydromet_sample_data
This repository contains streamflow, precipitation, and potential-evapotranspiration data for the Twentymile Creek USGS streamflow station.
data dataset hydrology potential-evapotranspiration precipitation public-domain streamflow
Last synced: 16 Jan 2026