data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-06-23 00:07:41 UTC
- JSON Representation
https://github.com/panukatan/senso
An Interface to the Philippine Census of Population and Housing Data
census data philippines r rstats
Last synced: 31 Oct 2025
https://github.com/gcoronelc/ucv_gdi-1_202302-b2
Taller de Gestión de Datos e Información I con Gustavo Coronel.
data data-science data-structures database databases online oracle query relational-databases security sql sql-server
Last synced: 19 May 2026
https://github.com/cyberaula/edvl
Educational Data Virtual Lab
apache-zeppelin big-data-platform data education fiware fiware-cosmos fiware-draco fiware-keyrock fiware-ngsi fiware-orion human-data-interaction ipynb notebook notebooks spark streaming-data upm zeppelin zeppelin-notebook
Last synced: 19 May 2026
https://github.com/cliffano/volothamp
Random D&D stuffs my son and I dabble with
data dungeons-and-dragons info little-godzilla
Last synced: 06 Apr 2025
https://github.com/tomasfarias/pipeline
A simple data pipeline done as a challenge project
Last synced: 29 Mar 2025
https://github.com/lamiaaali/depi-graduation-project
SkinCare Sentiment Analysis Reviews
analytics azure azure-data-factory azure-data-lake azure-databricks azure-synapse-analytics data data-analytics data-engineering machine-learning pyspark python sql ssms unsupervised-learning
Last synced: 03 Feb 2026
https://github.com/xrahul/android-logs
Get logs of various sensors and events in android 6.0+
Last synced: 20 May 2026
https://github.com/vvipjain/ev-data-analysis
EV Data Analysis
data data-analysis data-visualisation tableau tableau-public
Last synced: 16 Feb 2026
https://github.com/nia-cloud-official/influx
Influx is a powerful search engine application designed to provide access to personal information of individuals from anywhere in the world. With Influx, users can search for and retrieve personal details of people, enabling them to find and connect with individuals across the globe.
data find people-search search-engine
Last synced: 27 Jun 2025
https://github.com/gappeah/cookie-company-visual-dashboard
This Excel-based interactive dashboard provides a comprehensive overview of the Cookie Company's sales performance and key metrics.
dashboard data data-visualization excel microsoft-excel
Last synced: 25 Feb 2025
https://github.com/amazingtest/data4test
测试数据构造生成器,you can get useful data here for software testing
data test-automation testdata testdatabuilder testing testing-tools
Last synced: 16 Jan 2026
https://github.com/cobluestars/dataherd-raika
"Dataherd-Raika is a library designed to simulate large-scale user behavior datasets. It takes a single user event (like a click or keyword input) and, by applying simple probability distributions and custom variables, expands it into a vast dataset."
big-data data data-generation data-generator data-science front-end javascript machine-learning npm-package simulator statistics typescript user-behavior user-experience
Last synced: 02 Jan 2026
https://github.com/alexandregazagnes/ghisa
ghisa - Github Import Statistic Analyzer is a free and open-source software, app and python package that helps you to analyze the import statistics of your github repositories.
analytics data dependencies git github github-api import package pypi python skills tool
Last synced: 27 Jun 2025
https://github.com/ibz-04/data-encryption
Encrypting and Decrypting given data of hospital patients such as: audio & image files
Last synced: 23 Jul 2025
https://github.com/oguzgn/a-case-study-for-a-livestreaming-platform
This project aims to analyze livestream watch times of users across different regions. The goal is to identify the top 5 users with the highest watch time for each region. The analysis involves multiple SQL transformations to extract meaningful insights from the data.
bigquery data data-analysis data-modeling live-streaming sql
Last synced: 23 Jun 2025
https://github.com/vulcalien/vulcdataformat
Simple data storage system for Java.
data data-storage java serialization
Last synced: 25 Feb 2025
https://github.com/e-kotov/mapineqr
Access Mapineq inequality indicators via API
data demogrpahy r rstats socio-economic-indicators
Last synced: 06 Apr 2025
https://github.com/mierune/tinygrib2
(experimental) A tiny toolkit for parsing JMA's GRIB2 files.
data grib grib2 meteorology rust weather
Last synced: 27 Jun 2025
https://github.com/bhpcv252/dda-binapprox-on-fits
Using the binapprox algorithm to efficiently estimate the median of each pixel from a set of astronomy images in FITS files.
Last synced: 22 Mar 2025
https://github.com/maxnowack/elastic-sync
Connector to sync mongodb documents into a elasticsearch index
data elasticsearch mongodb sync
Last synced: 20 Jan 2026
https://github.com/jen-uis/loan-status-prediction
This repository contains project materials for the Winter STAT 206 class, University of California, Riverside, A. Gary Anderson School of Management.
data data-analysis data-analytics data-cleaning data-visualization descriptive-analytics julia julia-language jupyter-notebook predictive-analytics predictive-modeling team-collaboration
Last synced: 02 Jan 2026
https://github.com/umbaji/yodi
This is the official repository for Yodi, the speech recognition model for 8 words, in Ewè. The yodi package is also useful for rapid inference inference on speech data, especially on the mini_speech datasets.
data data-visualization keras python3 speech-recognition tensorflow
Last synced: 12 Jan 2026
https://github.com/canelmas/data-producer
Fake data producer for Kafka, console and http endpoints
data fake-content fake-data fakerjs kafka kafka-producer
Last synced: 05 Apr 2025
https://github.com/priyanshubiswas-tech/aws-etl-pipeline-on-cloud-using-glue-athena-lambda-and-redshift
Serverless ETL pipeline on AWS using Glue, Lambda, Athena, and Redshift — automates data ingestion, transformation, and analytics with scalable, event-driven architecture.
athena aws aws-glue data data-engineering etl etl-pipeline lambda redshift
Last synced: 02 May 2026
https://github.com/nitsc/spell-from-threebodytrilogy
Implemented the process of extrapolating from Gaia stellar data, to 3D visualizations, to three-views, to three-view signals, to three-view audio of signals, and even their inversions. This project proves the feasibility of the Logic (Luoji)'s “spell” from “The Three Body Problem” trilogy.
3d 3d-graphics astronomy astronomy-astrophysics audio audio-processing data data-science data-visualization gaia graph information-technology information-visualization numpy python python-3 python3 signal signal-processing visiualization
Last synced: 02 May 2026
https://github.com/stone-zeng/china-infectious-diseases
全国法定传染病疫情概况
analytics covid-19 data healthcare infectious-diseases
Last synced: 31 Dec 2025
https://github.com/tushar2704/interview-quest
Interview-Quest is comprehensive collection of interview questions and answers that can help you prepare for technical interviews. Whether you're a seasoned developer looking to brush up on your skills or a job seeker preparing for your next big opportunity, this repository aims to provide valuable resources to enhance your interview readiness.
artificial-intelligence data data-science interview interview-questions machine-learning
Last synced: 23 Jan 2026
https://github.com/dhimmel/erc
Processing human Evolutionary Rate Covariation data
data erc evolution evolutionary-rate-covariation genes hetionet human rephetio
Last synced: 23 Jul 2025
https://github.com/cyberoctane29/cyclistic-bike-share--analyzing-rider-behavior
Analyzed Cyclistic's bike-share data to uncover usage differences between casual riders and annual members. Utilized SQL and MySQL for data processing, R for visualisation, and Kaggle for collaboration. Insights will guide marketing strategies to convert casual riders into annual members.
data dataanalysis dataanalytics database rlanguage rmarkdown spreadsheet sql
Last synced: 22 May 2026
https://github.com/phatdev12/diem-thi-tuyen-sinh-10-da-nang
Danh sách điểm thi tuyển sinh 10 Đà Nẵng 2023-2024
data data-science dataanalytics dataset json
Last synced: 28 Jun 2025
https://github.com/tbrowder/classfactory
Provides tools to create a data collection with classes to manipulate the persistent data.
Last synced: 04 Apr 2025
https://github.com/sarincr/basics-of-julia-programming-language
Julia is a high-level, high-performance, dynamic programming language. While it is a general purpose language and can be used to write any application, many of its features are well-suited for high-performance numerical analysis and computational science.
data data-analysis data-mining data-science data-visualization dataanalysis dataanalytics datascience julia julia-language julia-library julia-package julialang machine-learning
Last synced: 19 May 2026
https://github.com/jebin1999/livestock-production-monitoring-
Livestock production Monitoring
data datascience livestock livestock-monitor r shiny shiny-apps shiny-r shinydashboard
Last synced: 05 Nov 2025
https://github.com/real-veersandhu/cia-country-comparison
Data analysis system on the CIA World Factbook
Last synced: 25 Feb 2025
https://github.com/simranjeet97/datascience_crashcourse
Data Science Crash Course that Explained about Each and Every Process in Data Science.
dash data data-science data-science-crash-course data-structures data-visualization datascience-machinelearning datasciencecoursera datascienceproject instagram matplotlib numpy pandas telegram tutorials youtube
Last synced: 08 Apr 2026
https://github.com/mouneshgouda/learn_dsa
This repository explores fundamental data structures and their implementations. Learn how to organize and manipulate data efficiently for various programming tasks. (Feel free to add your specific focus areas here, e.g., algorithms, interview prep)
c data queue sorting-algorithms stack structured-data
Last synced: 30 Jul 2025
https://github.com/visenger/prada
Profiling Datasets
cleaning data dataset profiling
Last synced: 24 Aug 2025
https://github.com/tonykipkemboi/ens_subgraph_data
Query On-Chain Data from Subgraphs by The Graph Protocol using Python
data subgraphs thegraphprotocol web3
Last synced: 17 Sep 2025
https://github.com/v6ntage/sql-sales_data-analytics-project
This repository contains a SQL scripts demonstration analytical techniques.
analytics business-analytics data data-analysis database query sql sql-server
Last synced: 12 Apr 2026
https://github.com/chalk-ai/roadmap
Chalk public roadmap
chalk data data-science mlops pipeline python
Last synced: 17 Jan 2026
https://github.com/BenSFGamer/B.I.O.S.
A biographer
academia agi ai ai-tools artificial-general-intelligence artificial-neural-networks automation data fact-checking information-extraction information-retrieval self-improving self-learning self-referential semi-autonomous software-engineering specialization web-agent web-scraping writer
Last synced: 27 Sep 2025
https://github.com/theryston/db-mycro
A node module with a json database that saves data in a specific directory, similar to sqlite, but in JSON
base crud data database db db-mycro javascript json jsondatabase nodejs nosql typescript
Last synced: 09 Apr 2026
https://github.com/tiaanduplessis/country-currency-data
Data about currencies of countries
countries currencies data symbols
Last synced: 08 Aug 2025
https://github.com/woctezuma/download-steam-screenshots-data
Data consisting of Steam screenshots.
Last synced: 19 Feb 2026
https://github.com/isaac-lal/english-arabic-dictionary
This is a dictionary website that implements a search feature which allows input for a word in either English or Arabic and returns the alternative translation.
data db javascript react web-development
Last synced: 09 Apr 2026
https://github.com/rubenhortas/python_examples
Examples of Python code and DSA (data structures and algorithms).
algorithm algorithms data dsa examples python python-3 python3 samples snippets structures
Last synced: 03 Oct 2025
https://github.com/semibran/img-data
Easily read from and write to ImageData instances
Last synced: 11 Aug 2025
https://github.com/jorgeatgu/casa-caida-bot
Twitter-bot sobre la despoblación en Aragón
aragon bot data data-viz despoblacion twitter-bot
Last synced: 11 Aug 2025
https://github.com/r12habh/datacamp.com-micro_projects
data data-analysis data-science datascience python python3
Last synced: 23 May 2026
https://github.com/helosantosdesousa/analise-previsao-de-rotatividade-ml
Projeto final do Bootcamp Data Girls 2025 que analisa a rotatividade de funcionários usando Machine Learning. Com base no dataset IBM HR Analytics Attrition, o projeto identifica os principais fatores de risco e cria modelos preditivos (SVC e Random Forest) com até 89% de acurácia para antecipar saídas e apoiar decisões estratégicas de RH.
analise-de-dados analise-exploratoria bootcamp ciencia-de-dados colab-notebook dados data data-analysis data-science dataanalytics dataframe eda machine-learning machine-learning-algorithms pandas python random-forest svc
Last synced: 16 Apr 2026
https://github.com/davidteather/scrape-crossfit-gyms
Scrapes crossfit gym data
cross-fit crossfit data data-scraping python python-requests python3 scraping
Last synced: 13 Aug 2025
https://github.com/soenneker/soenneker.quark.table
A native Blazor table component.
blazor blazorlibrary csharp data dotnet html quark quarktable table tables
Last synced: 13 Aug 2025
https://github.com/simranjeet97/datastructures_algoritms_python
Data Structures and Algorithms using Python
algorithms arrays arrays-and-strings coding data data-science data-structures datastructures-python hashing interview-preparation interview-questions linked-list python stacks stacks-as-an-array
Last synced: 09 Apr 2026
https://github.com/stdlib-js/array-one-to-like
Generate a linearly spaced numeric array whose elements increment by 1 starting from one and having the same length and data type as a provided input array.
array data float32array float64array int16array int32array javascript matrix ndarray node node-js nodejs stdlib structure typed typed-array types uint32array vector
Last synced: 20 Feb 2026
https://github.com/rishabh-agarwal/datastructuremachineproblem
Data Structure MP - Clemson University (Language C)
273 alogrithms clemson data ece structure university
Last synced: 26 Oct 2025
https://github.com/pradeep221b/turbofan_predictive_maintenance
An R project for predicting turbofan engine RUL using {targets} and {tidymodels}.
data data-science-portfolio machine-learning nasa preditive-maintaince r rstats targets-pipeline tidymodels
Last synced: 04 Oct 2025
https://github.com/zediculz/block
Block is a data structure/collection that uses Blockchain principle in managing data.
Last synced: 05 Oct 2025
https://github.com/garcane/income-prediction-ml
This is a machine learning project aimed at predicting whether an individual's annual income exceeds $50,000 based on their demographic and personal information.
data data-science machine-learning ml numpy pandas python random-forest scikit-learn
Last synced: 08 Apr 2026
https://github.com/dylanhogg/cloud-products
A package for getting cloud products and product descriptions from a cloud provider website.
aws cloud-products crawler data text-processing
Last synced: 05 Oct 2025
https://github.com/DefinetlyNotAI/VulnScan_Data
Logicytics VulnScan Module's Training Data and old model archive
ai data logicytics ml models pytorch sensitive-files text-processing tfidf-text-analysis training-data
Last synced: 17 Aug 2025
https://github.com/freddy03h/immutable-data-structure
Normalize and Merge your application's data store using Immutable.JS objects
Last synced: 05 Oct 2025
https://github.com/arif-miad/heart-attack-risk-prediction
This dataset explores key factors influencing heart attack risk, such as age, cholesterol, blood pressure, and lifestyle habits. Using machine learning models.
classification data data-science matplotlib ml pandas-python seaborn visualization
Last synced: 18 Aug 2025
https://github.com/gematik/app-fhir-snapshots-package-generator
The repository contains a library and a console application to generate snapshots for StructureDefinitions in FHIR-packages.
Last synced: 05 Oct 2025
https://github.com/giorgiosavastano/process
processing-chain provides a convenient way to seamlessly set up processing chains for large amounts of data.
big-data data data-science parallel parallel-computing process processing processing-chain rust
Last synced: 05 Oct 2025
https://github.com/vincentneo/sgtidetimings
Scraped SG NEA tide timings table into machine-readable JSON files!
data github-actions github-pages gov html-tables-to-json javascript json nodejs sg singapore singapore-data-analysis tide webscraping
Last synced: 10 Apr 2026
https://github.com/gusenov/qazaqstan-geography-data
:world_map: Географические данные Казахстана.
data geographic-data geography json kazakhstan qazaqstan regions
Last synced: 20 Feb 2026
https://github.com/rambodrahmani/covid19-behind-the-numbers
COVID-19: Behind the Numbers.
apriori-algorithm apriori-algorithm-python clustering clustering-algorithm clustering-analysis covid covid-19 covid19-data data data-mining data-science datamining fpgrowth machine-learning machine-learning-algorithms python python-machine-learning
Last synced: 20 Aug 2025
https://github.com/carlotta94c/sql4datascientistsdemo
Demo material for Microsoft Reactor session "Getting Started with Databases: SQL and Data Visualizations"
analysis data r sqlite tidyverse visualisation
Last synced: 18 Apr 2026
https://github.com/aadityatamrakar/futures_spread_chart
Cash Market & Futures Daily Spread Chart - NSE Stocks
data data-analysis data-mining expressjs nodejs requests
Last synced: 10 Apr 2026
https://github.com/labwhatever/leetcode
Collection of LeetCode questions to ace the coding interview!
data data-structures-and-algorithms dsa leetcode-cpp leetcode-solutions structure structure-learning
Last synced: 22 Aug 2025
https://github.com/richardwarepam16/hotel_analysis_using_python
Unlocking Insights: Analyzing Hotel Reservation Data to Boost Business Performance
data data-analysis data-visualization hotel-booking hotel-cancellation-solution hotel-management-system jupyter-notebook python python3
Last synced: 22 Aug 2025
https://github.com/jerryfzhang/rockets
A Node + React App that displays space launch missions around the world.
bootstrap data expressjs less momentjs nodejs react reactjs reactstrap
Last synced: 10 Apr 2026
https://github.com/petermartens98/nba-analytics-streamlit-app-with-langchain-agent
Interactive NBA Analytics app with Streamlit and a LangChain conversational agent connected to extracted data. Explore player, team, and game stats, track injuries, run simulations, visualize trends, and get AI-powered insights. Ongoing development, open to collaboration.
agentic-ai analysis data deepseek langchain nba python streamlit visualization
Last synced: 08 May 2026
https://github.com/grkndev/twitcher
A great library that will allow you to use the Twitch API service. All you need to do is use your Token and Client Id information.
api clip clipr data javascript nodejs npm npm-package npmjs streamers streaming twitch twitch-api twitch-bot twitchtv twtich-clip user
Last synced: 09 Mar 2026
https://github.com/miniql/miniql-express-mongodb-example
A MiniQL example for querying a MongoDB database through an Express REST API.
data database mongodb query query-language
Last synced: 19 Apr 2026
https://github.com/sstendahl/giscan
Simple tool to read and analyze existing GISAXS data
cbf data diffraction diffraction-analysis gisans gisaxs physics reflectivity scattering xray
Last synced: 11 Nov 2025
https://github.com/aymane-maghouti/mobile-data-hive-insights
This project demonstrates the process of extracting data from a MySQL database, transferring it using Apache Sqoop, storing it in Hive Data warehouse (the data actually is store in Hadoop Distributed File System (HDFS)), and performing analysis using Hive Query Language (Hive QL) (it is a language close to SQL). Then visualize the data in Power BI,
apache-sqoop data data-integration data-visualization hadoop-hdfs hivedb hiveql powerbi
Last synced: 09 Mar 2026
https://github.com/wildanmujjahid29/books-sales-analytics-python
Books Sales Analytics With Pyhton
data data-analysis data-science data-visualization
Last synced: 12 Jun 2026
https://github.com/stdlib-js/array-filled-by
Create a filled array according to a provided callback function.
array data float32array float64array int16array int32array javascript matrix ndarray node node-js nodejs stdlib structure typed typed-array types uint32array vector
Last synced: 09 Mar 2026
https://github.com/kunalshelke90/predict-bank-credit-risk-using-south-german-credit-data
This is an end-to-end ML project, which aims at developing a classification model for the problem of classifying a given customer profile into either of the risk category (safe or not safe). The final classifier used for this project is CatBoost classifier. Deployed in AWS.
aws cassandra catboost-classifier classification credit-risk data data-science dataanalysis dockerfile finance financial-analysis flask github-actions logging machine-learning mlflow numpy pandas python
Last synced: 03 Jan 2026
https://github.com/xdrokra/road-accident-analytics
A data visualization project that maps and analyzes road accidents across major Italian municipalities in 2023
analytics data design italy javascript
Last synced: 30 Aug 2025
https://github.com/harmonydata/harmonyapi
This is the source code for the Harmony project REST API
anxiety data data-harmonisation data-harmonization data-science deep-learning depression first-timers-only gad-7 harmonisation harmonization harmony mental-health natural-language-processing neural-network nlp psychology research social-sciences wellcome
Last synced: 31 Aug 2025
https://github.com/tatey/list_of_baby_names
A list of baby names given to tiny humans in Ruby
Last synced: 11 Nov 2025
https://github.com/ukplab/pragtag2023
Code and data for the PragTag-2023 Shared Task
argument-mining data peer-review pragmatics shared-task
Last synced: 18 Jun 2025