data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-06-23 00:07:41 UTC
- JSON Representation
https://github.com/SAP-archive/signavio-qualtrics-di
Setup an SAP Data Intelligence data pipeline to connect Qualtrics surveys data to SAP Signavio Process Intelligence via Ingestion API.
data intelligence process-intelligence qualtrics sample sap-data-intelligence sap-signavio-process-intelligence signavio
Last synced: 09 May 2025
https://github.com/svelterun/store
Persisted version of svelte/store.
data state state-management store svelte svelte-store sveltekit svelterun typescript
Last synced: 08 Jan 2026
https://github.com/themost-framework/jspa
JavaScript Persistent API
api data database-schema jspa object-relational-mapping orm orm-framework
Last synced: 31 Aug 2025
https://github.com/yash22222/sync-intern-s-ml-tasks
SYNC INTERN'S Machine Learning internship will offer you to enhance your skills by doing real-life example projects. This internship will increase your knowledge in the field of data and algorithms to understand how a machine learns.
bhpp boston-house-datasets boston-house-price-prediction boston-house-pricing data data-structures machine-learning machine-learning-algorithms numpy pandas sync-intern sync-interns
Last synced: 07 May 2026
https://github.com/izam-mohammed/data-source
🌐 A source directory for the data of my projects and experiments.📂 This curated collection simplifies access to diverse data that used in various projects💡
csv-files data data-source zip-files
Last synced: 03 Jun 2026
https://github.com/jigyasag18/gold-price-prediction-project-using-machine-learning
This repository contains a machine learning project focused on predicting gold prices (GLD) using historical stock market data, including indicators such as SPX, USO, SLV, and EUR/USD. The project implements a Random Forest Regressor for accurate price forecasting, complete with data visualization, correlation analysis, and model evaluation metrics
data dataset jupyter-notebook jupyter-notebooks machine-learning machinelearing machinelearningalgorithms machinelearningmodel machinelearningprojects matplotlib mlproject numpy pandas randomforestregressor seaborn
Last synced: 23 Jul 2025
https://github.com/gallo13/neuralnetworks-deeplearning-stats-classification
Descriptive Statistics, Classification and Analysis Using Python & Python Libraries (Assignment 1)
analysis data datasets deep-learning jupyter-notebook matplotlib neural-networks numpy pandas plotting python seaborn
Last synced: 17 Apr 2026
https://github.com/nitrosh/nitro-validate
A powerful, standalone, dependency-free data validation library for Python with extensible rules and a clean, intuitive API.
data python3 validation validation-library
Last synced: 17 Apr 2026
https://github.com/capire/xtravels-java
Travel booking app using master data from xflights built with CAP Java
cap cds data federation flights java reuse
Last synced: 23 Jan 2026
https://github.com/karthikmprakash/github_repos_scraper
A tool to extract names of github repos of any user
automation bs4 data github python repositories requests webscraping
Last synced: 27 Apr 2026
https://github.com/flowsynx/plugin-csv
FlowSynx plugin to reads and writes CSV files, enabling easy batch data import/export operations and integration with spreadsheet-based data workflows.
comma-separated-values csv data data-platform flowsynx
Last synced: 10 Mar 2026
https://github.com/alexscigalszky/palabras-aleatorias-data
This package have a set of datasets of random words, animals, colors, jokes, onomatopeias and types
aleatorias data palabras random words
Last synced: 04 Oct 2025
https://github.com/edugmenes/azure-data-engineering
This repository contains my first end-to-end Data Engineering project, built using Microsoft Azure Cloud and Azure Databricks with PySpark.
azure cloud data data-engineering data-lakehouse data-structures databricks delta-lake etl-pipelines lakehouse lakehouse-architectures medallion-architecture microsoft-azure pyspark spark
Last synced: 29 Jan 2026
https://github.com/san089/black-friday-sales-analysis
This Project gives an insight into few statistics related to black Friday Sale.
custom data dataanalysis insights sales statistics
Last synced: 13 Jul 2025
https://github.com/nikhilash45/live_ipl_report
This repository hosts the source code for an interactive IPL (Indian Premier League) Dashboard built using PowerBI. The dashboard provides real-time updates on ongoing matches, including live scores, batting and bowling statistics for both teams, and the points table.
analysts cleaning-data cricket-data dashboard data data-analysis data-visualization dax powerbi
Last synced: 19 Mar 2026
https://github.com/camara94/introduction-to-data-engineering
Describe the different entities that form a modern data ecosystem. Describe and differentiate between the role and responsibilities of Data Engineers, Data Scientists, Data Analysts, Business Analysts, and Business Intelligence Analysts. Explain what Data Engineering is. List the tasks that need to be performed in a typical data engineering lifecycle. Describe what a day in the life of a Data Engineer looks like.
business-analytics business-intelligence data dataingestion dataintegration datascience machinelearning python statistical-analysis
Last synced: 09 Apr 2025
https://github.com/chompfoods/stub-go-server
Go server stub for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.
api branded chomp data database food go-server go-swagger grocery ingredients nutrition raw recipe-api recipes
Last synced: 17 Apr 2026
https://github.com/apfirebolt/data-structures-and-algorithms-in-python
Data Structure and Algorithms in Python
algorithms data data-structures python python3 tkinter-gui
Last synced: 15 Mar 2025
https://github.com/nouman6093/advanced-statistical-models
in this repository i will upload everything i have learned about data science advanced statistical models. there are over 42 statistical models. each of them work on algorithms. and there are over 32 algorithms. each library has its own way of writing such statistical models. after learning i will try to upload as much statistical models as possibl
data data-analysis data-science data-visualization
Last synced: 11 Jun 2026
https://github.com/stdlib-js/ndarray-base
Base ndarray.
array base buffer data javascript matrix multidimensional namespace ndarray node node-js nodejs ns stdlib structures types vector
Last synced: 09 Apr 2025
https://github.com/alrza2003/alrza2003.github.io
This repository contains the source files for my personal portfolio website. It highlights my background as a data analyst and radiology student, and showcases real-world projects, tools I use, and ways to connect with me. The site is based on a pre-built template that I customized to reflect my profile and experience.
data data-analysis data-visualization portfolio portfolio-website python
Last synced: 30 Apr 2026
https://github.com/stdlib-js/array-zero-to-like
Generate a linearly spaced numeric array whose elements increment by 1 starting from zero and having the same length and data type as a provided input array.
array data float32array float64array int16array int32array javascript matrix ndarray node node-js nodejs stdlib structure typed typed-array types uint32array vector
Last synced: 07 Jan 2026
https://github.com/garcane/income-prediction-ml
This is a machine learning project aimed at predicting whether an individual's annual income exceeds $50,000 based on their demographic and personal information.
data data-science machine-learning ml numpy pandas python random-forest scikit-learn
Last synced: 08 Apr 2026
https://github.com/DefinetlyNotAI/VulnScan_Data
Logicytics VulnScan Module's Training Data and old model archive
ai data logicytics ml models pytorch sensitive-files text-processing tfidf-text-analysis training-data
Last synced: 17 Aug 2025
https://github.com/ayushverma135/sas-health-metrics-analysis-bmi-categorization-and-gender-insights
Using SAS, this project processes Excel data on individual statistics and health metrics. It calculates BMI, categorizes health status, and visualizes distributions through pie charts.
analytics data excel sas sasprogramming statistical-analysis
Last synced: 24 Feb 2026
https://github.com/eradical/analytics-unibody
Ansible role that sets up a farm of analytics collectors based on nginx
analytics ansible ansible-role big-data collectors data nginx
Last synced: 06 May 2026
https://github.com/gusenov/qazaqstan-geography-data
:world_map: Географические данные Казахстана.
data geographic-data geography json kazakhstan qazaqstan regions
Last synced: 20 Feb 2026
https://github.com/rafaelfloressouza/Covid-19-Dashboard
Python web application to display COVID19 data from the world using Plotly and Dash
bootstrap covid-19 css data datavisualization plotly-dash python3
Last synced: 10 Mar 2025
https://github.com/ukplab/pragtag2023
Code and data for the PragTag-2023 Shared Task
argument-mining data peer-review pragmatics shared-task
Last synced: 18 Jun 2025
https://github.com/varbrad/mindb
🗄 🔍 ⚡️ Schema-less document-oriented collection model data-store for Node & Browsers.
browser data datastore db document javascript json-schema mongo mongodb nodejs nosql query schema
Last synced: 13 Apr 2026
https://github.com/aymane-maghouti/mobile-data-hive-insights
This project demonstrates the process of extracting data from a MySQL database, transferring it using Apache Sqoop, storing it in Hive Data warehouse (the data actually is store in Hadoop Distributed File System (HDFS)), and performing analysis using Hive Query Language (Hive QL) (it is a language close to SQL). Then visualize the data in Power BI,
apache-sqoop data data-integration data-visualization hadoop-hdfs hivedb hiveql powerbi
Last synced: 09 Mar 2026
https://github.com/gbowne1/jsonhelix
This is a X11 GUI JSON application for editing, debugging and converting JSON and schemas and API data.
api data gui gui-application json x11
Last synced: 10 Jun 2025
https://github.com/serhatderya/tabular-playground-series
This repository contains solutions of monthly Tabular Playground Series in Kaggle.
ai artificial-intelligence data data-preprocessing data-processing data-science data-visualization jupyter-notebook kaggle machine-learning numpy pandas python regression scikit-learn scikitlearn-machine-learning seaborn software statsmodels
Last synced: 11 Apr 2026
https://github.com/denko5/sales-analysis
A complete SQL-based sales analysis project covering Africa, showcasing data cleaning, exploratory analysis, insights, and lessons learned. The project highlights sales trends, regional performances, and marketing effectiveness across multiple platforms.
africa data data-analysis data-science exploratory-data-analysis insights kenya sales sql
Last synced: 24 Jan 2026
https://github.com/yessasvini23/cisco-data-analytics-essentials_-virtual-_internship
From the CISCO Networking Academy
data dataanalysis database datascience excel relational-databases sql statistics structured-query tableau
Last synced: 17 Jul 2025
https://github.com/andrianllmm/wika-data
Philippine language resources.
data language low-resource-languages parser philippines scraper
Last synced: 17 Jul 2025
https://github.com/bacross/datamunger
python package for handling nan's and outliers
data data-frame datamunger knn nan outliers python scikit-learn
Last synced: 17 May 2026
https://github.com/hughrawlinson/github-data-scripts
Scripts to grab data about repos of interest to compare
data github-graphql github-repo-organizer graphql scripts typescript
Last synced: 09 Jul 2025
https://github.com/mustika-putri-m/-tableu-laporan-data-karyawan-growian
I am currently pursuing a data analysis certification at GROWIA, where I've learned to use tools such as Python, SQL, Google Big Query, Google Data Studio, Advanced Microsoft Excel, and Tableau. This course has enhanced my ability to analyze data using KPIs and business metrics, enabling me to solve business problems more effectively
data data-visualization tableau
Last synced: 17 Feb 2026
https://github.com/clabe45/kaz
Minimalistic local storage cli
cli data minimalistic storage utility
Last synced: 17 Jul 2025
https://github.com/simranjeet97/gpt4_applications
Applications build using OpenAI API and GPT4
ai ai-applications artificial-intelligence chatgpt data data-science gpt3 gpt4 large-language-models llm machine-learning openai openai-api project python
Last synced: 05 May 2026
https://github.com/shuklayash02/complete_data_analysis_project
A Full Data Analysis project where a sales data is ask,prepare,process,analyze,share and act through data analysis process
data data-visualization dataanalysis database datacleaning powerbi sql
Last synced: 16 Jul 2025
https://github.com/dineshpinto/geist-finance-subgraph
Subgraph for the Geist Finance protocol on the Fantom blockchain.
assemblyscript blockchain data fantom graphql typescript
Last synced: 17 May 2026
https://github.com/davedupplaw/jquery.bargraph
Moving, sliding bargraph display for jQuery
barchart bargraphs data javascript javascript-library jquery jquery-library jquery-plugin jquery-widgets realtime scrolling visualization
Last synced: 17 May 2026
https://github.com/reiiyuki/once-data-manager
Once Data Manager is temporary data management utility kit for Unity.
data manager playerprefs preference scene temporary unity
Last synced: 17 May 2026
https://github.com/wamphlett/smart-data-objects
An easy solution for capturing and validating data into usable DTO's
data dto forms php php7 validation
Last synced: 17 May 2026
https://github.com/nafisalawalidris/dr.-semmelweis-and-the-discovery-of-handwashing
Uncover the revolutionary impact of handwashing on mortality rates in healthcare. Explore the story of Dr. Semmelweis and his groundbreaking findings.
data data-analysis handwashing healthcare-analysis medical-breakthrough mortality-rates
Last synced: 13 Jul 2025
https://github.com/novecento99/nuvolino
air cloud data ikea iot pm pm25 sensor vindstyrka
Last synced: 13 Jul 2025
https://github.com/nichtich/wikidata-taxonomy-examples
Extract classifications from Wikidata
coli-conc data knowledge-organization wikidata
Last synced: 12 Jul 2025
https://github.com/elazar/pycopyql
Exports a subset of data from a relational database.
data database export relational tool utility
Last synced: 16 May 2026
https://github.com/flownrecords/flightTracker
A mobile app built to record essential flight data for post-flight review and debriefing.
Last synced: 23 Jun 2025
https://github.com/evoluteur/madeleinology
Playing with data science by taking a look at the proportions of flour, sugar, butter, and eggs in 147 Madeleine recipes (the traditional French sponge cake).
baking cake cooking cooking-recipes data data-science data-visualization dessert exploratory-analysis exploratory-data-analysis exploratory-data-visualizations food histogram longtail madeleine recipe visualization
Last synced: 23 Jun 2025
https://github.com/priyanshubiswas-tech/ev-data-analysis-dashboard
An interactive dashboard analyzing EV trends, including total vehicles, BEV vs. PHEV breakdown, model popularity, state-wise distribution, and CAFV eligibility. Visualizes key insights for data-driven decisions in the EV industry. 📊
dashboard data data-analysis data-science data-visualization tableau tableau-public
Last synced: 17 Feb 2026
https://github.com/marians/tour-tracker
Track the general classification development of the Tour De France, stage over stage
cycling data sports statistics
Last synced: 24 Jun 2025
https://github.com/nia-cloud-official/datascript
DataScript: A Hypothetical Data Scripting Language, DataScript is designed for simplifying data manipulation and analysis tasks. It serves as a scripting language tailored specifically for handling various data operations efficiently.
data data-scripting scripting-language
Last synced: 22 Jun 2025
https://github.com/dennyglee/open-covid19-public
A collaboration between SCRI and Databricks on the analysis of open COVID-19 datasets.
covid-19 data data-analytics data-engineering data-science nlp
Last synced: 22 Jun 2025
https://github.com/DevAthul-88/random-fakedata.js
A package to generate random data
data data-generator fake fake-data fake-data-generator javascipt javascript nodejs npm-package package
Last synced: 22 Jun 2025
https://github.com/shgysk8zer0/schema
A PHP implementation of schema.org structured data objects
data microdata schema seo structured-data
Last synced: 24 Jun 2025
https://github.com/dostuffthatmatters/circadian-scp-upload
Resumable, interruptible, SCP upload client for any files or directories generated day by day
checksum daily data directories files library python scp ssh synchronization time-series upload utilities
Last synced: 24 Jun 2025
https://github.com/lunastev/reflectlm
ReflectLM is a self-reflective, language-structure-only AI model that learns exclusively through interaction. It starts with zero factual knowledge but can engage in dialogue, evaluate its own responses, and remember conversations for future learning.
ai data language-model llm model open-source ts web
Last synced: 22 Jun 2025
https://github.com/harmonydata/harmony_examples
Example Jupyter notebook and R scripts using Harmony in real research problems
data data-harmonisation data-harmonization harmonisation psychology python r research
Last synced: 11 Jul 2025
https://github.com/nafisalawalidris/elfeenah
Configuration files for my GitHub profile. Welcome to my GitHub profile! I'm Nafisa Lawal Idris, a passionate Data Scientist with a strong interest for blockchain technology. Explore my GitHub portfolio to delve into the exciting world where data science and blockchain converge.
artificial-intelligence bitcoin blockchain config data data-science-portfolio data-science-projects datascience datascientist deep-learning github-config machinelearning
Last synced: 11 Sep 2025
https://github.com/legopitstop/mcextract
Extract assets and data from the Minecraft jar.
assets customtkinter data jar java minecraft pypi python pythonpackage reports serverjars userfolder
Last synced: 17 May 2026
https://github.com/alireza29675/goudi
GOUDI is a multi-layer data visualization application, inspired by mind maps and some other thinking and describing methods.
analysis data goudi visualization
Last synced: 11 Jul 2025
https://github.com/utkarshverma439/simple-sms-spam-detector
Built a Python text classification model for spam detection in SMS. Explored data, preprocessed text, utilized TF-IDF, trained a classifier, and addressed visualization challenges, yielding practical insights.
data data-science data-visualization spam-detection
Last synced: 20 Jun 2025
https://github.com/yasir13001/moonai_api
This MoonAI API service built with FastAPI that calculates and provides detailed Moon and Sun astronomical data based on user input such as date, latitude, longitude, elevation, and timezone.
ai almanac api astro-ai astronomy data data-science fastapi fastapi-api gemini groq-api hilal-detection html islamic-calenda llama llm-integration moon python
Last synced: 20 Jun 2025
https://github.com/divithraju/divith-raju-data-mining
This project focuses on customer segmentation using data mining techniques, specifically K-Means clustering, to classify customers into distinct groups based on their purchasing behaviors. The goal is to analyze customer data and segment them into clusters for targeted marketing strategies and better customer relationship management.
algorthims analytics apache business client connector data dataarchitecture database dataengineering datamining datascience hadoop k-means-clustering mysql project project-repository pyspark python3 spark
Last synced: 06 Mar 2026
https://github.com/jub0t/Eso
An application to manage all your Encryption & Decryption keys and other related tools.
data encryption encryption-decryption hacking hacking-tool keys pgp privacy private
Last synced: 10 May 2025
https://github.com/giscience/measures-rest-sparql
A SPARQL endpoint for the Measures REST OSHDB App framework.
data osm quality semantics sparql sparql-endpoints
Last synced: 24 Jun 2025
https://github.com/ayush585/fireducksblog
BLOG: Unlocking AI Efficiency: How FireDucks Revolutionizes Data Preprocessing
Last synced: 28 Apr 2026
https://github.com/stdlib-js/array-nans
Create an array filled with NaNs and having a specified length.
array complex128 complex128array complex64array data float32array float64array javascript matrix ndarray node node-js nodejs stdlib structure typed typed-array types vector
Last synced: 06 Mar 2026
https://github.com/uhstray-io/just-dashboards
Light and Easy Rust-Fullstack/WASM application to build dashboards from any data source
analytics data dioxus rust visualization
Last synced: 29 Mar 2025
https://github.com/fbraza/paris_airbnb
Analysis of Paris AirBnB data using R and Shiny
analysis data data-analysis paris-airbnb r shiny
Last synced: 21 Mar 2025
https://github.com/dbrennand/rm-content
A Python 3.7 script to remove a specific string from all files and repos (owned by the user).
content data erase eraser privacy privacy-protection privacy-tools remove remover rm-content
Last synced: 29 Mar 2025
https://github.com/hamzacham/data_set_projet-3
analysis data project rstudio visualization
Last synced: 29 Oct 2025
https://github.com/benji-lewis/archivord
An archival bot for Discord servers designed to retain as much data as possible to show future generations how we communicated.
archive data data-mining discord discord-bot typescript
Last synced: 16 May 2026
https://github.com/wklee610/de_project
[Data Engineer] Personal Toy Project For Study
Last synced: 31 Mar 2025
https://github.com/danieljdufour/fast-bin
Quickly Convert an Array of Numbers into their Minimal Binary Representations
array binarize binary bits data nbits numbers unbinarize
Last synced: 13 Apr 2025
https://github.com/whatheheckisthis/pwc_project-
Successfully completed a PwC virtual case, advancing Power BI skills to address cybersecurity and cloud architecture requirements. Developed comprehensive dashboards that effectively communicated key performance indicators (KPIs), showcasing proficiency in data visualization and deliver
case-study data data-science dataanalytics databases datavisualization powerbi virtual
Last synced: 05 Apr 2025
https://github.com/danieljdufour/easy-file-saver
Very Easily Save a File
csv data download file file-saver javascript js json save
Last synced: 21 Apr 2026