data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-06-27 00:07:33 UTC
- JSON Representation
https://github.com/jrdnbradford/google-sheet-color-sort
Google Sheet-bound script that assists with sorting Google Sheet rows by background fill color
data excel google-apps google-apps-script google-sheet google-sheets javascript microsoft-excel sort-rows
Last synced: 14 Apr 2025
https://github.com/warlock/tck
Data Type Checker
ajax browser data javascript nodejs type-checking types validation
Last synced: 19 May 2026
https://github.com/lmuffato/project-mysql-one-for-all-trybe
Projeto mysql one for all - Projeto avaliativo da Trybe do Bloco 21: Normalização e Modelagem de Banco de Dados
back-end data database database-modeling mysql mysqlworkbench query sql trybe-projects
Last synced: 08 May 2026
https://github.com/aboualine/sql-formation
Library Management System Database: A MySQL project with tables, triggers, stored procedures, and views for managing books, members, and borrowings. Includes sample data for testing. Ideal for learning SQL or building a library app.
data database library-management-system mysql sql system
Last synced: 18 Apr 2026
https://github.com/marxmit7/kaggle
Kaggle competitions
data kaggle kaggle-competition
Last synced: 19 May 2026
https://github.com/aditya172926/blockchain_indexers
Indexers to fetch data from blockchain events and transactions data with their parameters
Last synced: 02 Aug 2025
https://github.com/lmuffato/project-restaurant-orders-trybe
Projeto restaurant orders - Projeto avaliativo da Trybe do Bloco 36: Estrutura de Dados I: Arrays, Hashmaps e Sets
array array-set csv data data-analysis hashmap python set trybe trybe-projects
Last synced: 13 Sep 2025
https://github.com/luminati-io/pinterest-dataset-samples
Two sample datasets of over 1000 Pinterest profiles and posts, extracted using the Bright Data API, ideal for market research, influencer marketing, and product development.
data data-extraction data-mining database datasets pinterest pinterest-api structured-data web-scraping
Last synced: 17 Mar 2025
https://github.com/aleklukanen/chapterhousedb-example-app
An example application using the ChapterhouseDB processing engine
arrow data database event golang parquet processing stream
Last synced: 18 Apr 2026
https://github.com/agustinmusanti/sqlchallenge-2
This repository contains my solutions to a SQL challenge using MySQL, centered around a fictional retail company called TechMarket. The challenge covers various SQL tasks such as data retrieval, manipulation, and analysis, simulating real-world scenarios within a retail business environment.
Last synced: 03 Apr 2025
https://github.com/Greatwoman23/Market-Basket-Analysis
Unlock the power of data-driven sales optimization with Market Basket Analysis. Explore frequent itemsets and association rules to strategically enhance product placement, design targeted promotions, and adapt to seasonal trends. Elevate your business strategy with insights tailored for boosting sales and engaging customers effectively.
analysis analytics analytics-product data data-science jupyter medium-articles notebook-jupyter python
Last synced: 04 May 2025
https://github.com/charliecm/meteorite-landings
Data visualization of meteorite landings on Earth.
astronomy d3 data data-visualization mapbox space visualization
Last synced: 18 Apr 2026
https://github.com/cliffano/volothamp
Random D&D stuffs my son and I dabble with
data dungeons-and-dragons info little-godzilla
Last synced: 06 Apr 2025
https://github.com/amethyst-php/contract
amethyst amethyst-package api contract data laravel
Last synced: 20 May 2026
https://github.com/chibuzordev/bluesky-scraper
This is a work of art that enables you to scrape data off BlueSky.
analytics bluesky bluesky-api bluesky-client data datascraper-framework datascraping scraping social-media web webscraping
Last synced: 31 Oct 2025
https://github.com/tomasfarias/pipeline
A simple data pipeline done as a challenge project
Last synced: 29 Mar 2025
https://github.com/lamiaaali/depi-graduation-project
SkinCare Sentiment Analysis Reviews
analytics azure azure-data-factory azure-data-lake azure-databricks azure-synapse-analytics data data-analytics data-engineering machine-learning pyspark python sql ssms unsupervised-learning
Last synced: 03 Feb 2026
https://github.com/owengombas/genyus
🐍 Lyrics analysis with genius.com, Python and Jupyter Notebooks
api data data-science genius jupyter-notebook lyrics python statistics
Last synced: 20 May 2026
https://github.com/stefanbohacek/dataviz-projects
My dataviz projects.
data data-visualization dataviz
Last synced: 08 Jul 2025
https://github.com/yernaz-togizbayev/microsoft_store_data-analysis
Microsoft Store
data data-analysis data-visualization jupyter-notebook python3
Last synced: 15 May 2026
https://github.com/chompfoods/sdk-typescript-fetch
Fetch TypeScript SDK for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.
api branded chomp data database fetch food grocery ingredients nutrition raw recipe-api recipes sdk typescript
Last synced: 03 May 2026
https://github.com/xrahul/android-logs
Get logs of various sensors and events in android 6.0+
Last synced: 20 May 2026
https://github.com/cpanse/tartare
raw file collection recorded on Thermo Fisher Scientific mass spectrometers for extented unit testing
bioconductor blob data r unittesting
Last synced: 03 Apr 2025
https://github.com/snitkin-lab-umich/prewas_manuscript_analysis
Manuscript in support of prewas software
data data-visualisation manuscript r
Last synced: 08 Jul 2025
https://github.com/vvipjain/ev-data-analysis
EV Data Analysis
data data-analysis data-visualisation tableau tableau-public
Last synced: 16 Feb 2026
https://github.com/seafloor-geodesy/gnatss-test-data
Repository to host test data for GNATSS software
Last synced: 06 Apr 2026
https://github.com/nia-cloud-official/influx
Influx is a powerful search engine application designed to provide access to personal information of individuals from anywhere in the world. With Influx, users can search for and retrieve personal details of people, enabling them to find and connect with individuals across the globe.
data find people-search search-engine
Last synced: 27 Jun 2025
https://github.com/shukkkur/py_dash
Assignment for ETL Course - Dashbaord (plotly & dash)
dash dashboard data data-visualization plotly
Last synced: 06 Oct 2025
https://github.com/gappeah/cookie-company-visual-dashboard
This Excel-based interactive dashboard provides a comprehensive overview of the Cookie Company's sales performance and key metrics.
dashboard data data-visualization excel microsoft-excel
Last synced: 25 Feb 2025
https://github.com/gappeah/beverage-sales-analytics
This project provides an in-depth analysis of beverage sales and delivery across different states using Power BI.
data data-visualization powerbi powerbi-report powerbi-visuals
Last synced: 25 Feb 2025
https://github.com/gappeah/british-airways-analysis
This project focuses on analyzing and visualising travel data from British Airways using Tableau. The goal is to extract insights and present them in an interactive and visually appealing manner.
data data-analysis data-visualization tableau
Last synced: 11 Jun 2025
https://github.com/speakeasy-sdks/fivetran-python-sdk
Python SDK for accessing Fivetran API.
api connector data fivetran fivetran-connector python sdk
Last synced: 01 Jul 2025
https://github.com/rob-med/data-visualizations-for-python
A collection of useful snippets for clean data visualizations in Python (with matplotlib)
academic-publishing data data-science data-visualization dataviz matplotlib python scientific-publications storytelling visualization
Last synced: 08 May 2026
https://github.com/tezcatlipoca0000/ayudante
It's mainly a program for a store to manage the products data
data javascript scraping self-taught web
Last synced: 09 Apr 2025
https://github.com/tezcatlipoca0000/db-helper_sf
A program tailored for my workplace; it analyze, visualize and manipulate a Firebird 2.0 database
data data-visualization fdb firebird jupyter-notebook pandas python3
Last synced: 09 Apr 2025
https://github.com/rafalwrzeszcz-wrzasqpl/pl.wrzasq.commons
General-purpose data structures and routines.
aws data data-structures library rust
Last synced: 10 Apr 2025
https://github.com/amazingtest/data4test
测试数据构造生成器,you can get useful data here for software testing
data test-automation testdata testdatabuilder testing testing-tools
Last synced: 16 Jan 2026
https://github.com/ornella-gigante/wildlife-data-analysis-toolkit-ml
A data-driven exploration of Canis lupus signatus (Iberian) and Canis lupus labradorius (Labrador) subspecies, leveraging Jupyter Notebook and pandas to analyze weight distributions (25-56 kg), geographic patterns, and reproductive behaviors. Features size-weight correlations and NaN-handling workflows for robust ecological insights
analysis data datasets jupyter-notebook pandas-dataframe python
Last synced: 15 May 2026
https://github.com/antoineaugusti/antennes-free
Historique des antennes relais Free Mobile en maintenance ou en panne
data free-mobile free-mobile-operator mobile-networks
Last synced: 30 Jul 2025
https://github.com/swarchal/morar
Processing phenotypic screening data
biology data data-analysis drug-discovery hts phenotypic
Last synced: 19 Jun 2025
https://github.com/concaption/ksa-lawyers-data
scraped data of ksa lawyers and law firms
Last synced: 03 Apr 2025
https://github.com/cobluestars/dataherd-raika
"Dataherd-Raika is a library designed to simulate large-scale user behavior datasets. It takes a single user event (like a click or keyword input) and, by applying simple probability distributions and custom variables, expands it into a vast dataset."
big-data data data-generation data-generator data-science front-end javascript machine-learning npm-package simulator statistics typescript user-behavior user-experience
Last synced: 02 Jan 2026
https://github.com/fairdataihub/fair-amd-oct-paper-code
Code associated with the paper on FAIR assessment of AMD-related datasets containing OCT data
amd biomedical data eye fair oct
Last synced: 03 Apr 2025
https://github.com/alexandregazagnes/ghisa
ghisa - Github Import Statistic Analyzer is a free and open-source software, app and python package that helps you to analyze the import statistics of your github repositories.
analytics data dependencies git github github-api import package pypi python skills tool
Last synced: 27 Jun 2025
https://github.com/adrian-pasek-prv/data-modeling-with-cassandra
Create a data model in Apache Cassandra for music streaming app
apache-cassandra data data-engineering data-modeling python
Last synced: 02 Jan 2026
https://github.com/beangreen247/osfetch-old.sh
script that fetches system information and displays it to the user
247 bash bean beangreen247 data fetch green information neofetch neofetch-clone os script sh shell storage system tem zsh
Last synced: 02 Nov 2025
https://github.com/ibz-04/data-encryption
Encrypting and Decrypting given data of hospital patients such as: audio & image files
Last synced: 23 Jul 2025
https://github.com/glaucopater/covid19-vaccinations
Covid19 Vaccination Statistics
charts covid-19 data echarts italia react statistics vaccini
Last synced: 27 Mar 2025
https://github.com/realabbas/instagram-user-meta-data
Instagram User Meta Data 📷 can be fetched using this script in an easy to use JSON Object for displaying Instagram Cards.
data instagram javascript metadata nodejs profile user xray
Last synced: 10 May 2026
https://github.com/mheadd/SamDotNet
:office: A C# wrapper for the SAM.gov API.
api business client data gov-api government
Last synced: 30 Apr 2025
https://github.com/oguzgn/a-case-study-for-a-livestreaming-platform
This project aims to analyze livestream watch times of users across different regions. The goal is to identify the top 5 users with the highest watch time for each region. The analysis involves multiple SQL transformations to extract meaningful insights from the data.
bigquery data data-analysis data-modeling live-streaming sql
Last synced: 23 Jun 2025
https://github.com/bredalis/matplotlib
📊 Library to create graphs in Python 📊
data graphics librery matplotlib matplotlib-pyplot python
Last synced: 30 Mar 2025
https://github.com/vulcalien/vulcdataformat
Simple data storage system for Java.
data data-storage java serialization
Last synced: 25 Feb 2025
https://github.com/avto-dev/data-migrations-laravel
Package for database data migrations
data database laravel migrations package
Last synced: 12 Jul 2025
https://github.com/e-kotov/mapineqr
Access Mapineq inequality indicators via API
data demogrpahy r rstats socio-economic-indicators
Last synced: 06 Apr 2025
https://github.com/mierune/tinygrib2
(experimental) A tiny toolkit for parsing JMA's GRIB2 files.
data grib grib2 meteorology rust weather
Last synced: 27 Jun 2025
https://github.com/katerynazakharova/common-ml
Creating this lib for ML tasks, because I'm bored of copy-pasting the same functions for different projects.
data data-processing deep-learning lib machi
Last synced: 26 Mar 2025
https://github.com/alpheustangs/jder
A standardized structure for JSON responses
api data error json response specification structure
Last synced: 26 Mar 2025
https://github.com/tobinchilongo/oop-school-library
This project consists of Ruby script for the school library app. I implemented encapsulation and inheritance with Ruby by creating classes to represent students and teachers in the school.
data database gemfile input-output preserve rspec-testing rubocop unit-test
Last synced: 02 May 2026
https://github.com/bhpcv252/dda-binapprox-on-fits
Using the binapprox algorithm to efficiently estimate the median of each pixel from a set of astronomy images in FITS files.
Last synced: 22 Mar 2025
https://github.com/williamzebrowski/assistant-api
OpenAI Assistant API integrated with Elasticsearch, Logstash & Kibana
ai chatapp chatgpt conversational-ai data elasticsearch kibana llm-inference llms openai rag
Last synced: 16 Feb 2026
https://github.com/maxnowack/elastic-sync
Connector to sync mongodb documents into a elasticsearch index
data elasticsearch mongodb sync
Last synced: 20 Jan 2026
https://github.com/soulyma/web_crawler
A focused web crawler to extract and structure Arabic content from web pages. Designed for researchers, data analysts, and developers working on Arabic language datasets.
beautifulsoup4 crawler csv data json python structured-data
Last synced: 15 May 2026
https://github.com/jen-uis/loan-status-prediction
This repository contains project materials for the Winter STAT 206 class, University of California, Riverside, A. Gary Anderson School of Management.
data data-analysis data-analytics data-cleaning data-visualization descriptive-analytics julia julia-language jupyter-notebook predictive-analytics predictive-modeling team-collaboration
Last synced: 02 Jan 2026
https://github.com/stefanpietrusky/facts
Repository for the article in the online magazine Data Science Collective.
ai arxiv-papers beautifulsoup data flask-application gensim llama matplotlib ollama plotly pyldavis python selenium webdriver
Last synced: 09 May 2026
https://github.com/umbaji/yodi
This is the official repository for Yodi, the speech recognition model for 8 words, in Ewè. The yodi package is also useful for rapid inference inference on speech data, especially on the mini_speech datasets.
data data-visualization keras python3 speech-recognition tensorflow
Last synced: 12 Jan 2026
https://github.com/kingsley-ezenwaka/app-profile-data-analysis
A Python data analysis project that aims to propose an app profile based on analysis of Google Playstore dataset.
analysis data jupyter-notebook matplotlib pandas python seaborn
Last synced: 29 Apr 2026
https://github.com/canelmas/data-producer
Fake data producer for Kafka, console and http endpoints
data fake-content fake-data fakerjs kafka kafka-producer
Last synced: 05 Apr 2025
https://github.com/priyanshubiswas-tech/aws-etl-pipeline-on-cloud-using-glue-athena-lambda-and-redshift
Serverless ETL pipeline on AWS using Glue, Lambda, Athena, and Redshift — automates data ingestion, transformation, and analytics with scalable, event-driven architecture.
athena aws aws-glue data data-engineering etl etl-pipeline lambda redshift
Last synced: 02 May 2026
https://github.com/davidgamero/gatech-covid-chart
Line chart showing COVID19 cases per day at Georgia Tech
Last synced: 28 Oct 2025
https://github.com/nitsc/spell-from-threebodytrilogy
Implemented the process of extrapolating from Gaia stellar data, to 3D visualizations, to three-views, to three-view signals, to three-view audio of signals, and even their inversions. This project proves the feasibility of the Logic (Luoji)'s “spell” from “The Three Body Problem” trilogy.
3d 3d-graphics astronomy astronomy-astrophysics audio audio-processing data data-science data-visualization gaia graph information-technology information-visualization numpy python python-3 python3 signal signal-processing visiualization
Last synced: 02 May 2026
https://github.com/qetdr/names-genders
Surnames, genders, and gender probabilities data extraction script and dataset
Last synced: 01 May 2026
https://github.com/priyanka7411/customer-flight-prediction-app-mlflow
A comprehensive project predicting flight prices and customer satisfaction using machine learning models, deployed through interactive Streamlit apps.
classification customer-satisfaction data data-cleaning data-visualization feature-engineering flight-price-prediction machine-learning mlflow python regression streamlit
Last synced: 12 May 2026
https://github.com/hoangsonww/fred-banking-data-analysis
💸 AI-powered banking data explorer that combines FRED API insights with vector search, regression analysis, and interactive chat via OpenAI, Claude, and Gemini. Built with TypeScript, React, and Express for seamless full-stack performance.
anthropic chartjs claude-ai data data-analysis data-analytics data-science data-visualization fred fred-api gemini google-generative-ai logistic-regression multiple-regression openai pinecone react regression typescript vector-database
Last synced: 09 Apr 2025
https://github.com/benjaminr/udacity-data-engineering
Data Engineering
data dataengineering python udacity
Last synced: 14 May 2026
https://github.com/stone-zeng/china-infectious-diseases
全国法定传染病疫情概况
analytics covid-19 data healthcare infectious-diseases
Last synced: 31 Dec 2025
https://github.com/tushar2704/interview-quest
Interview-Quest is comprehensive collection of interview questions and answers that can help you prepare for technical interviews. Whether you're a seasoned developer looking to brush up on your skills or a job seeker preparing for your next big opportunity, this repository aims to provide valuable resources to enhance your interview readiness.
artificial-intelligence data data-science interview interview-questions machine-learning
Last synced: 23 Jan 2026
https://github.com/m-muecke/isocountry
R package containing ISO codes for countries and currencies
country-codes currency-codes data iso-3166-1 iso-4217 r r-package
Last synced: 20 Mar 2025
https://github.com/eddybrando/peru-year-names
Directory of Peru's official year names
Last synced: 23 Jul 2025