data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-06-23 00:07:41 UTC
- JSON Representation
https://github.com/cpanse/tartare
raw file collection recorded on Thermo Fisher Scientific mass spectrometers for extented unit testing
bioconductor blob data r unittesting
Last synced: 03 Apr 2025
https://github.com/tobinchilongo/oop-school-library
This project consists of Ruby script for the school library app. I implemented encapsulation and inheritance with Ruby by creating classes to represent students and teachers in the school.
data database gemfile input-output preserve rspec-testing rubocop unit-test
Last synced: 02 May 2026
https://github.com/bhpcv252/dda-binapprox-on-fits
Using the binapprox algorithm to efficiently estimate the median of each pixel from a set of astronomy images in FITS files.
Last synced: 22 Mar 2025
https://github.com/seafloor-geodesy/gnatss-test-data
Repository to host test data for GNATSS software
Last synced: 06 Apr 2026
https://github.com/maxnowack/elastic-sync
Connector to sync mongodb documents into a elasticsearch index
data elasticsearch mongodb sync
Last synced: 20 Jan 2026
https://github.com/shukkkur/py_dash
Assignment for ETL Course - Dashbaord (plotly & dash)
dash dashboard data data-visualization plotly
Last synced: 06 Oct 2025
https://github.com/rob-med/data-visualizations-for-python
A collection of useful snippets for clean data visualizations in Python (with matplotlib)
academic-publishing data data-science data-visualization dataviz matplotlib python scientific-publications storytelling visualization
Last synced: 08 May 2026
https://github.com/tezcatlipoca0000/ayudante
It's mainly a program for a store to manage the products data
data javascript scraping self-taught web
Last synced: 09 Apr 2025
https://github.com/tezcatlipoca0000/db-helper_sf
A program tailored for my workplace; it analyze, visualize and manipulate a Firebird 2.0 database
data data-visualization fdb firebird jupyter-notebook pandas python3
Last synced: 09 Apr 2025
https://github.com/ornella-gigante/wildlife-data-analysis-toolkit-ml
A data-driven exploration of Canis lupus signatus (Iberian) and Canis lupus labradorius (Labrador) subspecies, leveraging Jupyter Notebook and pandas to analyze weight distributions (25-56 kg), geographic patterns, and reproductive behaviors. Features size-weight correlations and NaN-handling workflows for robust ecological insights
analysis data datasets jupyter-notebook pandas-dataframe python
Last synced: 15 May 2026
https://github.com/jen-uis/loan-status-prediction
This repository contains project materials for the Winter STAT 206 class, University of California, Riverside, A. Gary Anderson School of Management.
data data-analysis data-analytics data-cleaning data-visualization descriptive-analytics julia julia-language jupyter-notebook predictive-analytics predictive-modeling team-collaboration
Last synced: 02 Jan 2026
https://github.com/concaption/ksa-lawyers-data
scraped data of ksa lawyers and law firms
Last synced: 03 Apr 2025
https://github.com/stefanpietrusky/facts
Repository for the article in the online magazine Data Science Collective.
ai arxiv-papers beautifulsoup data flask-application gensim llama matplotlib ollama plotly pyldavis python selenium webdriver
Last synced: 09 May 2026
https://github.com/umbaji/yodi
This is the official repository for Yodi, the speech recognition model for 8 words, in Ewè. The yodi package is also useful for rapid inference inference on speech data, especially on the mini_speech datasets.
data data-visualization keras python3 speech-recognition tensorflow
Last synced: 12 Jan 2026
https://github.com/kingsley-ezenwaka/app-profile-data-analysis
A Python data analysis project that aims to propose an app profile based on analysis of Google Playstore dataset.
analysis data jupyter-notebook matplotlib pandas python seaborn
Last synced: 29 Apr 2026
https://github.com/fairdataihub/fair-amd-oct-paper-code
Code associated with the paper on FAIR assessment of AMD-related datasets containing OCT data
amd biomedical data eye fair oct
Last synced: 03 Apr 2025
https://github.com/canelmas/data-producer
Fake data producer for Kafka, console and http endpoints
data fake-content fake-data fakerjs kafka kafka-producer
Last synced: 05 Apr 2025
https://github.com/priyanshubiswas-tech/aws-etl-pipeline-on-cloud-using-glue-athena-lambda-and-redshift
Serverless ETL pipeline on AWS using Glue, Lambda, Athena, and Redshift — automates data ingestion, transformation, and analytics with scalable, event-driven architecture.
athena aws aws-glue data data-engineering etl etl-pipeline lambda redshift
Last synced: 02 May 2026
https://github.com/davidgamero/gatech-covid-chart
Line chart showing COVID19 cases per day at Georgia Tech
Last synced: 28 Oct 2025
https://github.com/nitsc/spell-from-threebodytrilogy
Implemented the process of extrapolating from Gaia stellar data, to 3D visualizations, to three-views, to three-view signals, to three-view audio of signals, and even their inversions. This project proves the feasibility of the Logic (Luoji)'s “spell” from “The Three Body Problem” trilogy.
3d 3d-graphics astronomy astronomy-astrophysics audio audio-processing data data-science data-visualization gaia graph information-technology information-visualization numpy python python-3 python3 signal signal-processing visiualization
Last synced: 02 May 2026
https://github.com/priyanka7411/customer-flight-prediction-app-mlflow
A comprehensive project predicting flight prices and customer satisfaction using machine learning models, deployed through interactive Streamlit apps.
classification customer-satisfaction data data-cleaning data-visualization feature-engineering flight-price-prediction machine-learning mlflow python regression streamlit
Last synced: 12 May 2026
https://github.com/glaucopater/covid19-vaccinations
Covid19 Vaccination Statistics
charts covid-19 data echarts italia react statistics vaccini
Last synced: 27 Mar 2025
https://github.com/realabbas/instagram-user-meta-data
Instagram User Meta Data 📷 can be fetched using this script in an easy to use JSON Object for displaying Instagram Cards.
data instagram javascript metadata nodejs profile user xray
Last synced: 10 May 2026
https://github.com/stone-zeng/china-infectious-diseases
全国法定传染病疫情概况
analytics covid-19 data healthcare infectious-diseases
Last synced: 31 Dec 2025
https://github.com/tushar2704/interview-quest
Interview-Quest is comprehensive collection of interview questions and answers that can help you prepare for technical interviews. Whether you're a seasoned developer looking to brush up on your skills or a job seeker preparing for your next big opportunity, this repository aims to provide valuable resources to enhance your interview readiness.
artificial-intelligence data data-science interview interview-questions machine-learning
Last synced: 23 Jan 2026
https://github.com/mheadd/SamDotNet
:office: A C# wrapper for the SAM.gov API.
api business client data gov-api government
Last synced: 30 Apr 2025
https://github.com/avto-dev/data-migrations-laravel
Package for database data migrations
data database laravel migrations package
Last synced: 12 Jul 2025
https://github.com/katerynazakharova/common-ml
Creating this lib for ML tasks, because I'm bored of copy-pasting the same functions for different projects.
data data-processing deep-learning lib machi
Last synced: 26 Mar 2025
https://github.com/alpheustangs/jder
A standardized structure for JSON responses
api data error json response specification structure
Last synced: 26 Mar 2025
https://github.com/williamzebrowski/assistant-api
OpenAI Assistant API integrated with Elasticsearch, Logstash & Kibana
ai chatapp chatgpt conversational-ai data elasticsearch kibana llm-inference llms openai rag
Last synced: 16 Feb 2026
https://github.com/soulyma/web_crawler
A focused web crawler to extract and structure Arabic content from web pages. Designed for researchers, data analysts, and developers working on Arabic language datasets.
beautifulsoup4 crawler csv data json python structured-data
Last synced: 15 May 2026
https://github.com/eddybrando/peru-year-names
Directory of Peru's official year names
Last synced: 23 Jul 2025
https://github.com/dhimmel/erc
Processing human Evolutionary Rate Covariation data
data erc evolution evolutionary-rate-covariation genes hetionet human rephetio
Last synced: 23 Jul 2025
https://github.com/cyberoctane29/cyclistic-bike-share--analyzing-rider-behavior
Analyzed Cyclistic's bike-share data to uncover usage differences between casual riders and annual members. Utilized SQL and MySQL for data processing, R for visualisation, and Kaggle for collaboration. Insights will guide marketing strategies to convert casual riders into annual members.
data dataanalysis dataanalytics database rlanguage rmarkdown spreadsheet sql
Last synced: 22 May 2026
https://github.com/tupizz/data-processing-pipeline-aws
This project is a serverless application built with the Serverless Framework, TypeScript, and AWS services. It provides an enrichment service that processes contact information and enriches it with additional data.
aws data pipeline serverless typescript
Last synced: 13 May 2026
https://github.com/phatdev12/diem-thi-tuyen-sinh-10-da-nang
Danh sách điểm thi tuyển sinh 10 Đà Nẵng 2023-2024
data data-science dataanalytics dataset json
Last synced: 28 Jun 2025
https://github.com/qetdr/names-genders
Surnames, genders, and gender probabilities data extraction script and dataset
Last synced: 01 May 2026
https://github.com/tbrowder/classfactory
Provides tools to create a data collection with classes to manipulate the persistent data.
Last synced: 04 Apr 2025
https://github.com/hoangsonww/fred-banking-data-analysis
💸 AI-powered banking data explorer that combines FRED API insights with vector search, regression analysis, and interactive chat via OpenAI, Claude, and Gemini. Built with TypeScript, React, and Express for seamless full-stack performance.
anthropic chartjs claude-ai data data-analysis data-analytics data-science data-visualization fred fred-api gemini google-generative-ai logistic-regression multiple-regression openai pinecone react regression typescript vector-database
Last synced: 09 Apr 2025
https://github.com/benjaminr/udacity-data-engineering
Data Engineering
data dataengineering python udacity
Last synced: 14 May 2026
https://github.com/sarincr/basics-of-julia-programming-language
Julia is a high-level, high-performance, dynamic programming language. While it is a general purpose language and can be used to write any application, many of its features are well-suited for high-performance numerical analysis and computational science.
data data-analysis data-mining data-science data-visualization dataanalysis dataanalytics datascience julia julia-language julia-library julia-package julialang machine-learning
Last synced: 19 May 2026
https://github.com/jebin1999/livestock-production-monitoring-
Livestock production Monitoring
data datascience livestock livestock-monitor r shiny shiny-apps shiny-r shinydashboard
Last synced: 05 Nov 2025
https://github.com/mvuorre/psyarxivdb
Datasette serving PsyArXiv preprint metadata
data datasette open-science preprints psyarxiv
Last synced: 14 May 2026
https://github.com/raigu/ordered-lists-sync
Library for synchronizing ordered data with the minimum of insert and delete operations. Suitable for lage data sets in isolated environments
data lists ordering sync syncrhonization update
Last synced: 12 Jan 2026
https://github.com/tillahoffmann/idxhound
🐶 Track indices across one or more numpy selections.
data numpy scientific-computing
Last synced: 14 May 2026
https://github.com/ginga1402/travego_travellers
MySQL Mini Project
college-project data mysql-database
Last synced: 27 Jul 2025
https://github.com/m-muecke/isocountry
R package containing ISO codes for countries and currencies
country-codes currency-codes data iso-3166-1 iso-4217 r r-package
Last synced: 20 Mar 2025
https://github.com/real-veersandhu/cia-country-comparison
Data analysis system on the CIA World Factbook
Last synced: 25 Feb 2025
https://github.com/michellepellon/jobx
A modern, powerful job scraper for LinkedIn, Indeed and beyond.
compensation data data-analysis indeed indeed-scraping jobs jobsearch linkedin linkedin-scraper
Last synced: 17 Jan 2026
https://github.com/simranjeet97/datascience_crashcourse
Data Science Crash Course that Explained about Each and Every Process in Data Science.
dash data data-science data-science-crash-course data-structures data-visualization datascience-machinelearning datasciencecoursera datascienceproject instagram matplotlib numpy pandas telegram tutorials youtube
Last synced: 08 Apr 2026
https://github.com/MikeBairdRocks/Fluky
[floo-kee]: obtained by chance rather than skill.
data framework mock netcore netstandard nuget random vscode
Last synced: 02 Apr 2025
https://github.com/2kabhishek/pokemon-stats
Gotta stat 'em all 🖲🐭
d3 data emoji pokemon rollup statistics
Last synced: 14 May 2026
https://github.com/stdlib-js/array-base-filled4d-by
Create a filled four-dimensional nested array according to a provided callback function.
alloc allocate array callback data fill filled foreach generic javascript map matrix multidimensional node node-js nodejs stdlib strided structure types
Last synced: 07 Sep 2025
https://github.com/sandravizz/global_inequality_story
Dataviz Project about Global Inequality
data data-visualization inequality
Last synced: 03 Jul 2025
https://github.com/kevinsames/spark-fuse
spark-fuse is an open-source toolkit for PySpark — providing utilities, connectors, and tools to fuse your data workflows together.
data databricks fabric pyspark python spark
Last synced: 08 May 2026
https://github.com/thomd/git-scrape-hacker-news
scrape hacker news metadata for data analysis
data data-science git-scraping hacker-news
Last synced: 16 Sep 2025
https://github.com/stdlib-js/array-base-any-by-right
Test whether at least one element in an array passes a test implemented by a predicate function, while iterating from right to left.
any array data generic javascript node node-js nodejs predicate some stdlib structure test types validate
Last synced: 14 Apr 2025
https://github.com/jonsafari/toy-data
Embeddable submodule of parallel/monolingual text data, for use in testing code and sanity checks
data language-data machine-translation nlp sanity-checks toy-data
Last synced: 06 Nov 2025
https://github.com/epogrebnyak/business-conditions-digest-2017
Replicate illustration from Business Conditions Digest
Last synced: 22 Mar 2025
https://github.com/sermetpekin/perse
Perse is an experimental Python package that combines some of the most widely-used functionalities from the powerhouse libraries Pandas, Polars, and DuckDB into a single, unified DataFrame object. The goal of Perse is to provide a streamlined and efficient interface, leveraging the strengths of these libraries to create a versatile data handling.
data data-science data-structures duckdb pandas polars
Last synced: 09 May 2026
https://github.com/millengustavo/salarios-data-science
Aplicativo Streamlit de exploração dos dados da Pesquisa de mercado de Data Science feita pelo Data Hackers
brasil brazil ciencia-de-dados data data-science heroku salarios salary
Last synced: 07 Oct 2025
https://github.com/aruneshbasak/python-dsa-problems-geeksforgeeks-160-days
I will upload my daily Python DSA problems solved on GeeksforGeeks and post it here!
algorithms-and-data-structures and data data-structures dsa python python3 structure
Last synced: 08 May 2025
https://github.com/qeeqbox/data-security
Safeguarding your personal information (How your info is protected)
data data-security infosecsimplified qeeqbox security
Last synced: 19 Mar 2026
https://github.com/qeeqbox/data-lifecycle-management
Data Lifecycle Management (DLM) is a policy-based model for managing data in an organization
data data-lifecycle-management infosecsimplified lifecycle management qeeqbox
Last synced: 07 Mar 2026
https://github.com/kerlossony/nested-formdata
Nested-FormData is a Function designed to handle nested form data structures in a simplified and efficient way. It helps in managing complex form data, making it easier to work with forms that require hierarchical data
data forms javascript nested-structures nextjs reactjs typescript
Last synced: 08 Mar 2026
https://github.com/luminati-io/crunchbase-dataset-samples
A sample of 1001 Crunchbase companies with key data points, extracted using the Bright Data API.
crunchbase crunchbase-api crunchbase-scraper data database datasets webscraper-api webscraping
Last synced: 17 Mar 2025
https://github.com/sixarm/sixarm_ruby_fab
SixArm.com → Ruby → Fab gem to fabricate sample data for testing
data fabrication factory fake gem mock ruby
Last synced: 24 Jul 2025
https://github.com/jimbrig/jimstaskviews
CRAN Task Views and Shiny App https://jimstaskviews.jimbrig.com
cran data docs rstats shiny-app submodules task-views
Last synced: 06 Mar 2026
https://github.com/cont-limno/lagosus-reservoir
Data module classifying lakes as natural lakes or reservoirs in the conterminous U.S.
Last synced: 17 Jan 2026
https://github.com/DataHerb/dataherb-flora
DataHerb Flora: The core of DataHerb
data data-mining data-science datascience dataset datasets
Last synced: 08 May 2025
https://github.com/fjc0k/vue-merge-data
Intelligently merge data for Vue render functions.
data merge-data render-functions vue
Last synced: 17 May 2026
https://github.com/mikebairdrocks/fluky
[floo-kee]: obtained by chance rather than skill.
data framework mock netcore netstandard nuget random vscode
Last synced: 17 May 2026
https://github.com/ate329/nsl-kdd-feature-extractor
Python-based tool designed to process network traffic packets and extract features compliant with the NSL-KDD dataset format.
cyber-security cybersecurity data data-science extractor feature-extraction machine-learning network-analysis nsl-kdd nsl-kdd-dataset
Last synced: 30 Oct 2025
https://github.com/prioritizr/prioritizrdata
Conservation planning data sets
Last synced: 19 Jul 2025
https://github.com/inzhenerka/scooters_data_uploader
Загрузка данных в PostgreSQL в рамках курса по dbt от Инженерка.Тех
Last synced: 04 May 2026
https://github.com/bytraembedded/Laptop-Price-Prediction-with-Machine-Learning
The Laptop Price Prediction with Machine Learning project provides a system to predict the price of laptops based on various features such as processor type, RAM size, storage capacity, and more/
airflow data data-science data-visualization fastapi heroku-deployment machine-learning-algorithms matplotlib-pyplot numpy pandas python reactjs seaborn
Last synced: 30 Dec 2025
https://github.com/redodo/shipper
Hide encrypted data in files.
audio data images python steganography
Last synced: 26 Mar 2025
https://github.com/am-i-groot/summer-intern-iitguwahati-spml
Developed an automated Water Quality Monitoring System (WQMS) at IIT Guwahati, using the pH-W218 sensor and K-Means Clustering to assess water potability. The project enhances water quality evaluation through machine learning-based classification.
algorithm data data-visualization kmeans-clustering machine-learning python report sensor signal-processing
Last synced: 17 May 2026
https://github.com/muhammad-fiaz/ason
ASON: Adaptive Structured Object Notation - Python library for dynamic data serialization, providing flexibility and simplicity.
adaptive-structure-object-notation api ason cli client data file file-format file-sharing file-upload json json-data json-parser open-source opensource parser parsing python python3
Last synced: 02 Feb 2026
https://github.com/sevmardi/data-mining-hacks
Hacks in Data Mining
data data-mining data-mining-algorithms python3
Last synced: 18 Jul 2025
https://github.com/saboye/web-scraping-with-python
A web scraping project using Python's "Requests" and "BeautifulSoup" libraries to extract structured data from one or more websites. This project involves sending HTTP requests to the target website(s), retrieving the HTML content of the website(s), and parsing this content to extract the desired data in a usable format.
beautifulsoup csv data data-harvesting data-mining python request web webscraping
Last synced: 18 Jul 2025
https://github.com/giscience/measures-rest-oshdb-docker
Scripts for starting measures for geospatial datasets in docker container, using the OSHDB
data dggs docker geospatial mesure openstreetmap rest
Last synced: 18 Apr 2026
https://github.com/tkonopka/makealive
Dynamic web content through controlled javascript
conversion-functions d3 data data-science javascript visualization
Last synced: 22 Apr 2026
https://github.com/cdcgov/importsurvey
Import survey: Import data into R, with an application to the National Center for Health Statistics (NCHS)
data import r sas survey survey-data
Last synced: 19 Jun 2026
https://github.com/mohnoor94/datasciencefundementalsusingpython
My journey to learn Data Science with Python
data data-analysis data-science data-visualization learning learning-by-doing python python3
Last synced: 19 Jun 2026
https://github.com/sodascience/open_supply_hub
Processing supply chain data obtained from Open Supply Hub
data global-supply-chain open-supply-hub python
Last synced: 29 Apr 2026