data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-06-23 00:07:41 UTC
- JSON Representation
https://github.com/wzbsocialsciencecenter/gemeindeverzeichnis
Python-Modul zum Einlesen von Gemeindeverzeichnisdaten des Statistischen Bundesamts als pandas DataFrame
administrative-data data germany pandas pandas-dataframe python
Last synced: 12 Apr 2025
https://github.com/haroldeustaquio/analysis-of-oncological-diseases-in-peru
This repository presents an analysis of care related to the 7 most frequent cancers in 2022, compiling open data provided by FISSAL. This report includes a breakdown of consultations by department, a patient profile by age, insurance and sex, as well as the most requested types of services. The findings are complemented by an interactive dashboard
analysis data diseases powerbi r sql sqlite
Last synced: 03 Apr 2025
https://github.com/wzbsocialsciencecenter/mdb-twitter-network
Twitter network of members of the 19th German Bundestag
bundestag data network-analysis r scraping twitter
Last synced: 12 Apr 2025
https://github.com/nasa-pds/naif-pds4-bundler
Package to generate PDS4 SPICE Kernels Archives
archive data geometry geometry-processing navigation planetary-data planetary-science python spice
Last synced: 06 Jan 2026
https://github.com/arindal1/striversdsasheet
Solutions of all the problems in Striver's A2Z DSA Sheet
cpp data datastructures datastructures-algorithms striver strivers-sde-sheet
Last synced: 04 Apr 2025
https://github.com/Scetrov/FrontierSharp
C# / .NET API Clients for EVE Frontier — API client for the static data exposed by CCPs HTTP API plus a HTTP Client tuned to the specific API design patterns implemented by CCP.
api data eve-frontier static-data
Last synced: 30 May 2026
https://github.com/sabyasachi-seal/summer-olympics-data-analysis
Analyzing 2020 Summer Olympic Dataset
analysis colab-notebook data data-science dataset jypyternotebook olympics
Last synced: 08 May 2025
https://github.com/juliaearth/geoartifacts.jl
Artifacts (e.g., datasets) for Geospatial Data Science
Last synced: 10 Apr 2026
https://github.com/tomaztk/datasetR
Generate datasets for R projects
data data-frame data-science r-language r-programming sample sample-data sample-data-generator
Last synced: 29 Jul 2025
https://github.com/sjefvanleeuwen/rqlite-dotnet
A lightweight database HTTP API client for rqlite. rqlite is a lightweight, distributed relational database, which uses RAFT and SQLite as its storage engine.
cluster data database distributed distributed-computing distributed-database distributed-systems dotnet raft rqlite
Last synced: 12 May 2025
https://github.com/flintsh/outlier-tools
A collection of free open-source tools to help you better understand your Outlier account, entirely handled in-browser.
Last synced: 27 Feb 2025
https://github.com/RDeconomist/RDeconomist.github.io
RapidCharts - a site for teaching and demonstrating Data Science and Visualisation techniques
data data-science data-visualization economics politics sports
Last synced: 08 Apr 2025
https://github.com/effect-deprecated/morphic
Domain Modelling and Structural Derivation (port of morphic-ts)
data domain functional typeclasses
Last synced: 29 Jun 2025
https://github.com/aircloud/use-groot
React Hooks for Data Fetching
cache data data-fetching fetch hook hooks react react-native stale-while-revalidate suspense swr
Last synced: 30 Jul 2025
https://github.com/unicef/magasin
Cloud native open-source end-to-end data / AI / ML platform
cloud dagster data data-pipelines data-science data-visualization helm-charts kubernetes magasin
Last synced: 21 Apr 2025
https://github.com/stefen-taime/kafka-pipeline
In the following post, we will learn how to build a data pipeline using a combination of open-source software (OSS), including Debezium, Apache Kafka, Kafka Connect.
bash data docker elasticsearch etl-pipeline k kafka kafka-connect kafka-streams kafka-topic kibana ksqldb masking mongodb mysql pii pipeline postgresql
Last synced: 15 Apr 2025
https://github.com/extratone/routinehubreport
Maintaining a git-tracked record of analytics-esque data for my RoutineHub Library generated by Martin de Boer's exceptional Routinehub Report Siri Shortcut.
analytics automation data routinehub siri-shortcuts
Last synced: 03 Sep 2025
https://github.com/navchandar/file-convertor-utils
Set of custom Python Utilities to convert one file format into another. Filetypes supported: Excel, Images, PDF, GIF, MP4, XML, etc.
conversions convertor-utils data dataconversion excel file-conversion fileconversion fileformats image pdf python video xml
Last synced: 21 Sep 2025
https://github.com/gregyjames/mapperic
Automatically generate DTO Classes and AutoMapper Configurations.
automapper automapper-profiles code-generation csharp data dotnet dotnet-core dto dto-entity-mapper dto-generator dto-mapper dto-pattern object-oriented rosyln syntax-analysis syntax-tree
Last synced: 07 May 2025
https://github.com/samashi47/ml-toolkit-project
A general-purpose toolkit for data preprocessing, machine learning modeling, and visualization.
classification data data-preprocessing machine-learning python3 visualization
Last synced: 30 Jul 2025
https://github.com/pottekkat/bulldozer-prize-predictions
Predict the auction sale price for a piece of heavy equipment to create a "blue book" for bulldozers.
bluebook bulldozer data data-science jupyter-notebook kaggle-competition machine-learning
Last synced: 20 Jun 2026
https://github.com/r-js/mangos
🥭's is monorepo collecting data wrangling and data validation utilities
counterculture data data-wrangling fold functional isomorphism javascript json lens optics schema traversal validation
Last synced: 22 Feb 2026
https://github.com/ahmetfurkandemir/iot
Internet of Things (IoT)
accelerometer arduino data dht22sensor esp32-arduino gyroscope iot iot-device lcd-display sensor wokwii
Last synced: 15 Apr 2025
https://github.com/apache/incubator-devlake-playground
Apache DevLake is an open-source dev data platform to ingest, analyze, and visualize the fragmented data from DevOps tools, extracting insights for engineering excellence, developer experience, and community growth.
dashboard-friendly data data-analysis data-engineering data-integration data-transfers devops domain-layer dora etl hacktoberfest integration jira open-source python user-friendly
Last synced: 19 Oct 2025
https://github.com/simranjeet97/financial_analysis_python
This repository contains code and videos related to financial data analysis using python.
data data-analysis data-science finance financial-data-analysis indian-stock-market machine-learning nifty50 numpy pandas portfolio-optimization python stock-analysis stock-market stock-price-prediction stock-price-simulation stock-simulation
Last synced: 22 Sep 2025
https://github.com/datadistillr/datadistillr-python-sdk
A Python SDK for Programmatically Interacting with DataDistillr
apache-drill data data-science datadistillr jupyter sql
Last synced: 01 Jul 2025
https://github.com/mozahran/data-mapper
A data mapping tool that helps you map JSON with configuration files (JSON structure transformation). It also supports if conditions, casting, and mutators (custom or built-in functions).
data json mapper mappings mutator transformer
Last synced: 13 Jan 2026
https://github.com/maelgangloff/sfrmobile-api
Support non-officiel de l'API mobile du FAI français SFR/RED
consommation data giga mobile phone sfr
Last synced: 10 Mar 2026
https://github.com/wazzabeee/twitter-sentiment-analysis-pyspark
Comparative study of classification algorithms implemented in PySpark on the Sentiment 140 dataset.
apache-spark data data-science gcp google-cloud logistic-regression naive-bayes-classifier natural-language-processing nlp nlp-machine-learning pyspark python python3 sentiment-analysis sentiment-classification sentiment140-dataset sentimental-analysis spark tweet twitter
Last synced: 06 May 2025
https://github.com/olamide100/data-engineering-project
This Project summarizes the data set of Tweets related to the forth coming 2023 general election in Nigeria targeted at the two leading presidential aspirants in the country
airflow data data-engineering datatalks docker election nigeria politics twitter
Last synced: 05 May 2025
https://github.com/firelink-sh/evolve-py
A highly efficient, composable, and lightweight ETL and data integration framework.
analytics arrow big-data data data-engineering data-integration data-science duckdb elt etl ingestion ingress ml olap pipeline polars postgresql python s3
Last synced: 10 Mar 2026
https://github.com/theadam/pivoter
A library to create pivot tables that you can plug into any view library
Last synced: 16 Dec 2025
https://github.com/russss/coviddata
Yet another python package for accessing COVID-19 data
Last synced: 13 May 2025
https://github.com/The-Swarm-Corporation/BackTesterAgent
An enterprise-grade AI-powered backtesting framework built on the Swarms framework for automated trading strategy validation and optimization.
ai backtesting data finance finance-agents jpmorgan yahoofinance
Last synced: 11 Sep 2025
https://github.com/guilt/neuralinkcompression
Neuralink Compression Submission
Last synced: 14 Apr 2025
https://github.com/nrennie/shakespeare
CSV files of the works of William Shakespeare.
Last synced: 20 Mar 2025
https://github.com/bullet-db/bullet-record
The generic AVRO record container for plugging in your data into Bullet
avro bullet bullet-record container data
Last synced: 15 Jul 2025
https://github.com/piotrlaczkowski/keras-data-processor
Data Preprocessing model based on Keras preprocessing layers that can be used as a standalone model or incorporated to Keras model as first layers.
data keras layers preprocessing tensorflow
Last synced: 06 May 2025
https://github.com/lacerbi/changeprob
Human Online Adaptation to Changes in Prior Probability
bayesian-inference change-point-detection data modeling psychophysics
Last synced: 13 May 2025
https://github.com/makomweb/curious-moon-exercise
My notes about: A Curious Moon by Rob Conery
data data-science docker make postgres postgresql sql
Last synced: 06 Apr 2025
https://github.com/stdlib-js/array-pool
Typed array pool.
array data float32array float64array int16array int32array javascript matrix ndarray node node-js nodejs stdlib structure typed typed-array types uint32array vector
Last synced: 05 May 2025
https://github.com/tillathehun0/tilla
Permission based object transformation made easy.
asynchronous data datatransferobject dto mapper mapping node permissions promise-api transform transformer
Last synced: 12 May 2025
https://github.com/katilingban/ppitables
Poverty Probability Index (PPI) Lookup Tables
data poverty poverty-likelihoods poverty-probability ppi
Last synced: 04 Oct 2025
https://github.com/thiagodnf/qwood
Data structure visualization tool based on Qt
data data-structure qt5 tree visualization
Last synced: 15 Jul 2025
https://github.com/tjpalanca/education-analytics
Data-driven analysis of the Philippine education system
Last synced: 20 Oct 2025
https://github.com/lykmapipo/nyc-tlc-trip-data
Python scripts to download, process, and analyze the New York City Taxi and Limousine Commission (TLC) Trip Record Data dataset
apache-arrow apache-spark data data-engineering data-extraction data-transformation etl fsspec geopandas joblib jupyterlab lykmapipo metadata nyc nyc-taxi-dataset pandas pyarrow python s3
Last synced: 17 Sep 2025
https://github.com/aeternalis-ingenium/anomalytics
The ultimate anomaly detection and its analytics.
analysis analytics anomaly-detection data detection engineering inference mathematics numpy open-source pandas prediction python3 research scientific scipy statistics
Last synced: 12 Apr 2025
https://github.com/femtotrader/dukascopyticksreader.jl
A Julia library to download tick data from Dukascopy https://www.dukascopy.com/swiss/english/marketwatch/historical/
data data-analysis dataset dukascopy html julia stock-data
Last synced: 10 Apr 2025
https://github.com/strmprivacy/data-plane-helm-chart
Care about your data leaving your VPC/environment in SaaS mode? With our self-hosted option you can run our privacy focused Data Plane in your own Kubernetes Cluster. Just (1) sign-up, (2) request a self-hosted installation, (3) use our values.yaml on your own k8s clusters and (4) run your (customer) data inside your own cloud like 🪄
charts data helm kubernetes privacy security
Last synced: 23 Jun 2025
https://github.com/woctezuma/steam-api-data
Data consisting of Steam app details.
Last synced: 13 Apr 2025
https://github.com/blockception/bc-minecraft-bedrock-vanilla-data
A Typescript library for dealing provides vanilla minecraft data
bedrock data hacktoberfest hacktoberfest2022 minecraft ts typescript vanilla
Last synced: 08 Sep 2025
https://github.com/ygor-j/sql-guard
A small package for data quality rules using Standard SQL
assertions bigquery data data-cleaning data-quality data-quality-checks duckdb ducklake gcp in-memory pure-sql sql testing-tools
Last synced: 26 Oct 2025
https://github.com/lgatto/rpx
R Interface to the ProteomeXchange Repository
bioconductor data mass-spectrometry proteomexchange proteomics
Last synced: 21 Mar 2025
https://github.com/iguptashubham/online-retail-sales
This Power BI dashboard, designed for marketing strategists, analyzes sales trends and customer behavior. It provides key insights empowering them to identify sales opportunities and optimize marketing campaigns, ultimately boosting business sales.
dashboard data data-analysis data-analysis-project data-analysis-project-powerbi data-analysis-python data-project data-science powerbi project
Last synced: 19 Mar 2026
https://github.com/lynnlangit/sample-data
Small datasets and files in many formats, used for testing cloud SQL, NoSQL or Machine Learning Services
Last synced: 24 Mar 2025
https://github.com/ifttt/stripe-webhook-event-ingester
An AWS CDK-based project for ingesting Stripe webhook events into an SQS queue
aws cdk data lambda-functions pipeline python stripe
Last synced: 24 Dec 2025
https://github.com/ahmet-dedeler/concussify-csya-hacks
AI Powered Concussion Detection ⚡ On your browser, in one minute.
ai data github javascript learn web
Last synced: 08 Jul 2025
https://github.com/dhershman1/simply_valid
A simple to use data driven validation system
data easy-to-use plug-and-play simple validation
Last synced: 03 May 2025
https://github.com/FCC/pirateaction2
pirate action version 2
data data-visualization gis map
Last synced: 27 Jul 2025
https://github.com/bbc/verify-it
Randomised test property/data generation for NodeJS
data node nodejs property random randomization test testing typescript
Last synced: 17 Jun 2025
https://github.com/tim-crisp/javascript.lol
An in-browser javascript repl with flow based prototyping and dynamic module imports. https://javascript.lol
browser data javascript js json node playground repl
Last synced: 14 May 2025
https://github.com/gsantiago/extract-data-options
Extract `data-(namespace)-*` options from a HTML element
attributes data element extract html options
Last synced: 10 Apr 2025
https://github.com/hadiazt/medias-info
Scrap Media Info Such As Information & Download Link
data downloader media movie music
Last synced: 17 Jul 2025
https://github.com/navchandar/wifi_usage_tracker
Used to track WiFi usage from ISP website and track network speed using fast.com
data fast isp isp-website python scraper speedtest task-scheduler track-wifi tracker wifi
Last synced: 10 Apr 2025
https://github.com/lmacken/covidmon
A tool to monitor and visualize COVID-19 case status across many locations
covid covid-19 data dataviz global matplotlib monitoring pandas pandemic python virus
Last synced: 10 Apr 2025
https://github.com/ritiek/infrared-serial-comm
Transmits data using an IR led to an IR receiver on a Raspberry Pi
Last synced: 10 Apr 2025
https://github.com/aerisweather/mapsgl-ios-webview
Displays a MapsGL map view in a WKWebView with native controls using a Swift and Javascript bridge.
data forecast googlemaps leaflet map mapbox maplibre mapsgl visualization weather webgl
Last synced: 13 Apr 2025
https://github.com/hevalhazalkurt/exploring_the_data_of_lego_history
A data exploration project on LEGO history in Python with pandas, matplotlib etc. (WIP)
data data-analysis data-science data-visualization datascience datasets lego lego-history matplotlib pandas python python3
Last synced: 13 Apr 2025
https://github.com/xfhy/datastructure
数据结构,语言:C++
cpp data data-structures structure
Last synced: 13 Apr 2025
https://github.com/nashory/loader-torch
An Multi-threaded Data Loader Module for Torch.
data dataloader loader module multi-thread preprocessing toolbox torch
Last synced: 14 May 2025
https://github.com/maltyxx/images-generator
Generator of placeholder-type images using GD for FakerPHP/Faker
data faker fixtures image-generator imagesgenerator maltyxx php-faker
Last synced: 15 Jan 2026
https://github.com/astrapi69/data-api
Defines interfaces for data beans like jpa entities or DTO(Data Transfer Object) objects
active data dto identity java jpa modification tree-structure versioning
Last synced: 01 Apr 2025
https://github.com/trostalski/fhan
A simple client to query FHIR servers
data fhir fhir-client healthcare interoperability
Last synced: 17 Jan 2026
https://github.com/octoenergy/s3migrate
Bulk delete/copy/move files or modify Hive/Drill/Athena partitions using pythonic pattern matching
Last synced: 24 Jun 2025
https://github.com/navchandar/Wifi_Usage_Tracker
Used to track WiFi usage from ISP website and track network speed using fast.com
data fast isp isp-website python scraper speedtest task-scheduler track-wifi tracker wifi
Last synced: 15 Jul 2025
https://github.com/contributte/dag
:running: Dag is opinionated expression data generator written in PHP to fake your API.
alice api data expression faker generator ninjify php serverless
Last synced: 11 Jan 2026
https://github.com/weecology/retriever-recipes
data data-retrieval data-science dataset datasets hacktobefest
Last synced: 17 Feb 2026
https://github.com/anasfik/claude-code-prompts-reference
Leaked Claude code research material (how it does everything)
claude claude-code code data leak leaks prompts research skills system-prompts tools
Last synced: 08 Apr 2026
https://github.com/scottgriv/river-charts
🌊 📉 A Python, Django, Plotly, and Pandas web application that visualizes river data in real time, pulled using an API from the United States Geological Survey (USGS).
api charts data data-analysis data-visualization dataset django pandas plotly python usgs usgs-api visualization webapp
Last synced: 12 Aug 2025
https://github.com/fahim1049/data-structure-and-algorithm
Data Structure and Algorithm
array array-manipulations binary data data-science linear peek pop push search-algorithm stack structures
Last synced: 24 Apr 2025
https://github.com/smilyorg/tinygpkg-data
Small geographic datasets based on open data + tools
data dataset geo geodata geography geojson geopackage golang naturalearth naturalearthdata
Last synced: 27 Jun 2025