data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-07-02 00:07:45 UTC
- JSON Representation
https://github.com/canelmas/data-producer
Fake data producer for Kafka, console and http endpoints
data fake-content fake-data fakerjs kafka kafka-producer
Last synced: 05 Apr 2025
https://github.com/priyanshubiswas-tech/aws-etl-pipeline-on-cloud-using-glue-athena-lambda-and-redshift
Serverless ETL pipeline on AWS using Glue, Lambda, Athena, and Redshift — automates data ingestion, transformation, and analytics with scalable, event-driven architecture.
athena aws aws-glue data data-engineering etl etl-pipeline lambda redshift
Last synced: 02 May 2026
https://github.com/davidgamero/gatech-covid-chart
Line chart showing COVID19 cases per day at Georgia Tech
Last synced: 28 Oct 2025
https://github.com/nitsc/spell-from-threebodytrilogy
Implemented the process of extrapolating from Gaia stellar data, to 3D visualizations, to three-views, to three-view signals, to three-view audio of signals, and even their inversions. This project proves the feasibility of the Logic (Luoji)'s “spell” from “The Three Body Problem” trilogy.
3d 3d-graphics astronomy astronomy-astrophysics audio audio-processing data data-science data-visualization gaia graph information-technology information-visualization numpy python python-3 python3 signal signal-processing visiualization
Last synced: 02 May 2026
https://github.com/r-mahesh45/hr---resume-text-classification
Text Classification for Resumes: Conducted Exploratory Data Analysis (EDA) on a vast collection of resumes. Organized the data using Bag of Words (BoW) and TF-IDF techniques. Built and evaluated multiple models, with Logistic Regression delivering standout performance. Created Word Clouds and Histograms.
data datacleaning extract-transform-load feature-extraction nlp nltk-tokenizer text-mining text-processing
Last synced: 12 Sep 2025
https://github.com/priyanka7411/customer-flight-prediction-app-mlflow
A comprehensive project predicting flight prices and customer satisfaction using machine learning models, deployed through interactive Streamlit apps.
classification customer-satisfaction data data-cleaning data-visualization feature-engineering flight-price-prediction machine-learning mlflow python regression streamlit
Last synced: 12 May 2026
https://github.com/wamphlett/smart-data-objects
An easy solution for capturing and validating data into usable DTO's
data dto forms php php7 validation
Last synced: 17 May 2026
https://github.com/stone-zeng/china-infectious-diseases
全国法定传染病疫情概况
analytics covid-19 data healthcare infectious-diseases
Last synced: 31 Dec 2025
https://github.com/tushar2704/interview-quest
Interview-Quest is comprehensive collection of interview questions and answers that can help you prepare for technical interviews. Whether you're a seasoned developer looking to brush up on your skills or a job seeker preparing for your next big opportunity, this repository aims to provide valuable resources to enhance your interview readiness.
artificial-intelligence data data-science interview interview-questions machine-learning
Last synced: 23 Jan 2026
https://github.com/mvuorre/psyarxivdb
Datasette serving PsyArXiv preprint metadata
data datasette open-science preprints psyarxiv
Last synced: 14 May 2026
https://github.com/novecento99/nuvolino
air cloud data ikea iot pm pm25 sensor vindstyrka
Last synced: 13 Jul 2025
https://github.com/tillahoffmann/idxhound
🐶 Track indices across one or more numpy selections.
data numpy scientific-computing
Last synced: 14 May 2026
https://github.com/parzibyte/cifrar-descifrar-php
Cifrar y descifrar datos con PHP usando la librería php-encryption; cifrar con clave general o con claves generadas por contraseñas de usuarios
crypto data decrypt encryption password php security
Last synced: 20 May 2026
https://github.com/dennyglee/open-covid19-public
A collaboration between SCRI and Databricks on the analysis of open COVID-19 datasets.
covid-19 data data-analytics data-engineering data-science nlp
Last synced: 22 Jun 2025
https://github.com/bastianolea/censo_viviendas
Censo de Viviendas procesado con R para disponibilizarlo con códigos/nombres de comunas, regiones, y etiquetas de sus variables. En formato original (6,5 millones de filas) y en conteo por comunas.
chile comunas data poblacion rural
Last synced: 30 Oct 2025
https://github.com/eddybrando/peru-year-names
Directory of Peru's official year names
Last synced: 23 Jul 2025
https://github.com/nafisalawalidris/dr.-semmelweis-and-the-discovery-of-handwashing
Uncover the revolutionary impact of handwashing on mortality rates in healthcare. Explore the story of Dr. Semmelweis and his groundbreaking findings.
data data-analysis handwashing healthcare-analysis medical-breakthrough mortality-rates
Last synced: 13 Jul 2025
https://github.com/artcc/coredatagenericmodule
Core Data generic module for persist encrypted object
core coredata coredata-model data data-generic database encrypted encrypted-data encryption entity identifier persist protocol swift
Last synced: 08 May 2026
https://github.com/michellepellon/jobx
A modern, powerful job scraper for LinkedIn, Indeed and beyond.
compensation data data-analysis indeed indeed-scraping jobs jobsearch linkedin linkedin-scraper
Last synced: 17 Jan 2026
https://github.com/conduitio/conduit-site
data data-ingestion data-integration documentation
Last synced: 06 May 2025
https://github.com/MikeBairdRocks/Fluky
[floo-kee]: obtained by chance rather than skill.
data framework mock netcore netstandard nuget random vscode
Last synced: 02 Apr 2025
https://github.com/emomaxd/flog
header-only logging library
c-plus-plus data files formatting logging stdout
Last synced: 20 Mar 2025
https://github.com/dhimmel/erc
Processing human Evolutionary Rate Covariation data
data erc evolution evolutionary-rate-covariation genes hetionet human rephetio
Last synced: 23 Jul 2025
https://github.com/cyberoctane29/cyclistic-bike-share--analyzing-rider-behavior
Analyzed Cyclistic's bike-share data to uncover usage differences between casual riders and annual members. Utilized SQL and MySQL for data processing, R for visualisation, and Kaggle for collaboration. Insights will guide marketing strategies to convert casual riders into annual members.
data dataanalysis dataanalytics database rlanguage rmarkdown spreadsheet sql
Last synced: 22 May 2026
https://github.com/2kabhishek/pokemon-stats
Gotta stat 'em all 🖲🐭
d3 data emoji pokemon rollup statistics
Last synced: 14 May 2026
https://github.com/nia-cloud-official/datascript
DataScript: A Hypothetical Data Scripting Language, DataScript is designed for simplifying data manipulation and analysis tasks. It serves as a scripting language tailored specifically for handling various data operations efficiently.
data data-scripting scripting-language
Last synced: 22 Jun 2025
https://github.com/codedotjs/indiaartfairy
:beetle: Data & More - India Art Fair • 2018 - 2024
Last synced: 16 Jun 2025
https://github.com/stdlib-js/array-base-filled4d-by
Create a filled four-dimensional nested array according to a provided callback function.
alloc allocate array callback data fill filled foreach generic javascript map matrix multidimensional node node-js nodejs stdlib strided structure types
Last synced: 07 Sep 2025
https://github.com/tupizz/data-processing-pipeline-aws
This project is a serverless application built with the Serverless Framework, TypeScript, and AWS services. It provides an enrichment service that processes contact information and enriches it with additional data.
aws data pipeline serverless typescript
Last synced: 13 May 2026
https://github.com/phatdev12/diem-thi-tuyen-sinh-10-da-nang
Danh sách điểm thi tuyển sinh 10 Đà Nẵng 2023-2024
data data-science dataanalytics dataset json
Last synced: 28 Jun 2025
https://github.com/elvis-not-presley-one/lostcassowary
LostCassowary is an Minecraft data miner that searches region files/.MCA files for data from the game, this one can search for banners, signs, biomes, blocks
data data-mining data-science dataminer minecraft nbt nbt-parser scraper
Last synced: 12 Apr 2025
https://github.com/priyanshubiswas-tech/ev-data-analysis-dashboard
An interactive dashboard analyzing EV trends, including total vehicles, BEV vs. PHEV breakdown, model popularity, state-wise distribution, and CAFV eligibility. Visualizes key insights for data-driven decisions in the EV industry. 📊
dashboard data data-analysis data-science data-visualization tableau tableau-public
Last synced: 17 Feb 2026
https://github.com/tbrowder/classfactory
Provides tools to create a data collection with classes to manipulate the persistent data.
Last synced: 04 Apr 2025
https://github.com/camilajaviera91/dbt-transformations-sql-mock-data
This repository contains the transformations and documentation for the data model generated in sql-mock-data.
Last synced: 02 Feb 2026
https://github.com/technicalguru/php-database
A PHP library for accessing databases easily
data database database-access datamodel datamodels mariadb mariadb-client mariadb-database mariadb-mysql mysql mysql-client mysql-database
Last synced: 20 Mar 2025
https://github.com/luminati-io/pinterest-dataset-samples
Two sample datasets of over 1000 Pinterest profiles and posts, extracted using the Bright Data API, ideal for market research, influencer marketing, and product development.
data data-extraction data-mining database datasets pinterest pinterest-api structured-data web-scraping
Last synced: 17 Mar 2025
https://github.com/sarincr/basics-of-julia-programming-language
Julia is a high-level, high-performance, dynamic programming language. While it is a general purpose language and can be used to write any application, many of its features are well-suited for high-performance numerical analysis and computational science.
data data-analysis data-mining data-science data-visualization dataanalysis dataanalytics datascience julia julia-language julia-library julia-package julialang machine-learning
Last synced: 19 May 2026
https://github.com/denisecase/nw-network-data-analytics
Network for those earning a NW Masters of Applied Data Science
Last synced: 02 Feb 2026
https://github.com/makosai/covid19datachart
A basic chart for checking corona data. Written in a single HTML file for convenience. Grab the single file and run it anywhere. Or visit the webpage.
chart chartjs corona coronavirus coronavirus-analysis covid-19 covid-2019 covid19 covid19-data data data-analysis datasets
Last synced: 23 Feb 2026
https://github.com/yogaprasadk/dbms_course_a_to_z
it is a repository for complete lecture of Database Management Systems taught by riti kumari
acidproperties btree data database dbms filesystem normalform normalization sql
Last synced: 20 Mar 2025
https://github.com/gabrieldim/world-bank-wdi-data-science
Faculty project. World Bank predictions with Data Science.
convolutional-neural-networks data data-science model neural-network neural-networks prediction-model python science
Last synced: 15 May 2026
https://github.com/FAIMS/OpenDataPresentation
Brian Ballsun-Stanton's presentation
Last synced: 03 Apr 2025
https://github.com/jebin1999/livestock-production-monitoring-
Livestock production Monitoring
data datascience livestock livestock-monitor r shiny shiny-apps shiny-r shinydashboard
Last synced: 05 Nov 2025
https://github.com/bayer-group/cmc-ontologies
This is a submodule of cmc-knowledge-graph-setup. It contains ontologies and relevant data graph files
Last synced: 16 Jun 2025
https://github.com/raigu/ordered-lists-sync
Library for synchronizing ordered data with the minimum of insert and delete operations. Suitable for lage data sets in isolated environments
data lists ordering sync syncrhonization update
Last synced: 12 Jan 2026
https://github.com/millengustavo/salarios-data-science
Aplicativo Streamlit de exploração dos dados da Pesquisa de mercado de Data Science feita pelo Data Hackers
brasil brazil ciencia-de-dados data data-science heroku salarios salary
Last synced: 07 Oct 2025
https://github.com/marians/tour-tracker
Track the general classification development of the Tour De France, stage over stage
cycling data sports statistics
Last synced: 24 Jun 2025
https://github.com/nicolau-369/bash.sh-treinamento
Versão mais organizada (+ ou -)
data database debian gnome gnome-extension gnu gnu-linux linux shell shell-script
Last synced: 19 Mar 2025
https://github.com/tsiarokhin/student_bsu_by
Tool for parsing various BSU student information from student.bsu.by website.
belarus bsu data grades python students study university
Last synced: 28 May 2026
https://github.com/stonecharioteer/renfield
Synchronize and Search through Hard Drives
catalogue data search storage synchronization
Last synced: 09 Feb 2026
https://github.com/eugenedakin/steganography-pictures
Add and remove a picture-in-a-picture with steganography
compare data steganography steganography-tools xojo
Last synced: 12 Feb 2026
https://github.com/real-veersandhu/cia-country-comparison
Data analysis system on the CIA World Factbook
Last synced: 25 Feb 2025
https://github.com/simranjeet97/datascience_crashcourse
Data Science Crash Course that Explained about Each and Every Process in Data Science.
dash data data-science data-science-crash-course data-structures data-visualization datascience-machinelearning datasciencecoursera datascienceproject instagram matplotlib numpy pandas telegram tutorials youtube
Last synced: 08 Apr 2026
https://github.com/glassflow/pipelines-push-action
This Github Action lets you automate GlassFlow pipelines deployments as code
data data-processing datastreaming deployment github-actions glassflow python real-time stream-processing
Last synced: 19 May 2026
https://github.com/hoaihuongbk/lakeops
A modern data lake operations toolkit working with multiple table formats (Delta, Iceberg, Parquet) and engines (Spark, Polars) via the same APIs.
data data-operations dataengineering datalake
Last synced: 07 Mar 2026
https://github.com/wahyuwsslah/salary_prediction-aiml
Salary Prediction using Machine Learning with 3 Models. Linear Regression, Decision Tree, Random Forest
ai analytics data data-science datascience machine-learning python python3
Last synced: 19 May 2026
https://github.com/sandravizz/global_inequality_story
Dataviz Project about Global Inequality
data data-visualization inequality
Last synced: 03 Jul 2025
https://github.com/yazeed44/reform-api
A platform that harnesses the power of multiple data streams including satellite imagery and drone photos to visualize multiple urban planning indices and provide descriptive analytics that will empower local Saudi authorities to make data-driven decision that contribute to neighborhood quality of life.
Last synced: 18 May 2026
https://github.com/jimbrig/jimstaskviews
CRAN Task Views and Shiny App https://jimstaskviews.jimbrig.com
cran data docs rstats shiny-app submodules task-views
Last synced: 06 Mar 2026
https://github.com/kevinsames/spark-fuse
spark-fuse is an open-source toolkit for PySpark — providing utilities, connectors, and tools to fuse your data workflows together.
data databricks fabric pyspark python spark
Last synced: 08 May 2026
https://github.com/thomd/git-scrape-hacker-news
scrape hacker news metadata for data analysis
data data-science git-scraping hacker-news
Last synced: 16 Sep 2025
https://github.com/stdlib-js/array-base-any-by-right
Test whether at least one element in an array passes a test implemented by a predicate function, while iterating from right to left.
any array data generic javascript node node-js nodejs predicate some stdlib structure test types validate
Last synced: 14 Apr 2025
https://github.com/diddypod/crop-data-comparer
A Python script to compare crop data over years
comparison crop data openpyxl python
Last synced: 28 Jun 2026
https://github.com/shgysk8zer0/schema
A PHP implementation of schema.org structured data objects
data microdata schema seo structured-data
Last synced: 24 Jun 2025
https://github.com/jonsafari/toy-data
Embeddable submodule of parallel/monolingual text data, for use in testing code and sanity checks
data language-data machine-translation nlp sanity-checks toy-data
Last synced: 06 Nov 2025
https://github.com/stdlib-js/ndarray-base-reverse-dimension
Return a view of an input ndarray in which the order of elements along a specified dimension is reversed.
base data flip javascript matrix ndarray node node-js nodejs reverse slice stdlib structure types vector view
Last synced: 07 Mar 2026
https://github.com/vishwagauravin/screener-scraper-pro
Effortlessly scrape comprehensive financial data from screener.in and use it in your projects. No API key required.
data finance finances market-data scraper scrapers screener screener-in screener-plugin stock stock-data stock-market stocks
Last synced: 18 Feb 2026
https://github.com/epogrebnyak/business-conditions-digest-2017
Replicate illustration from Business Conditions Digest
Last synced: 22 Mar 2025
https://github.com/hmeleiro/r_dataviz
Data visualization projects with R / Proyectos de visualización de datos con R
data dataviz r rmd-files social-science survey-data
Last synced: 21 Jun 2026
https://github.com/redodo/shipper
Hide encrypted data in files.
audio data images python steganography
Last synced: 26 Mar 2025
https://github.com/greatwoman23/market-basket-analysis
Unlock the power of data-driven sales optimization with Market Basket Analysis. Explore frequent itemsets and association rules to strategically enhance product placement, design targeted promotions, and adapt to seasonal trends. Elevate your business strategy with insights tailored for boosting sales and engaging customers effectively.
analysis analytics analytics-product data data-science jupyter medium-articles notebook-jupyter python
Last synced: 28 Apr 2026
https://github.com/dostuffthatmatters/circadian-scp-upload
Resumable, interruptible, SCP upload client for any files or directories generated day by day
checksum daily data directories files library python scp ssh synchronization time-series upload utilities
Last synced: 24 Jun 2025
https://github.com/chompfoods/sdk-go
Go SDK for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.
api branded chomp data database food go grocery ingredients nutrition raw recipe-api recipes sdk
Last synced: 19 May 2026
https://github.com/aruneshbasak/python-dsa-problems-geeksforgeeks-160-days
I will upload my daily Python DSA problems solved on GeeksforGeeks and post it here!
algorithms-and-data-structures and data data-structures dsa python python3 structure
Last synced: 08 May 2025
https://github.com/stdlib-js/ndarray-base-assert-is-integer-data-type
Test if an input value is a supported ndarray integer data type.
array assert base check data dtype is javascript multidimensional ndarray node node-js nodejs stdlib test types util utilities utility utils
Last synced: 12 Apr 2025
https://github.com/qeeqbox/data-security
Safeguarding your personal information (How your info is protected)
data data-security infosecsimplified qeeqbox security
Last synced: 19 Mar 2026
https://github.com/qeeqbox/data-lifecycle-management
Data Lifecycle Management (DLM) is a policy-based model for managing data in an organization
data data-lifecycle-management infosecsimplified lifecycle management qeeqbox
Last synced: 07 Mar 2026
https://github.com/jvrck/australianpayphones
Get Australian payphone data in GeoJSON format.
australia data geojson geojson-data scraper
Last synced: 04 Apr 2025
https://github.com/kerlossony/nested-formdata
Nested-FormData is a Function designed to handle nested form data structures in a simplified and efficient way. It helps in managing complex form data, making it easier to work with forms that require hierarchical data
data forms javascript nested-structures nextjs reactjs typescript
Last synced: 08 Mar 2026
https://github.com/stimulsoft/samples-reports.js-for-python
JavaScript samples for Reports.JS reporting components for Python applications
client-side components data designer django document flask javascript js native python python3 report reporting source templates tornado viewer vscode web-server
Last synced: 14 Oct 2025
https://github.com/sixarm/sixarm_ruby_fab
SixArm.com → Ruby → Fab gem to fabricate sample data for testing
data fabrication factory fake gem mock ruby
Last synced: 24 Jul 2025
https://github.com/gbburleigh/quick-seeders
Generate realistic test data quickly with Quick-Seeders, a Python library offering a wide range of data types and schema definitions. Control data variance, probabilities, and output formats, including SQL. Simplify your data seeding process and improve testing efficiency.
data dataset faker generator python seeder sql test
Last synced: 03 Apr 2025
https://github.com/kingtous/bots_task_result
Result of the Barcelona OpenMP Tasks Suite (BOTS) using ompTG
Last synced: 09 Jul 2025
https://github.com/hamzacham/data_set_projet-3
analysis data project rstudio visualization
Last synced: 29 Oct 2025