data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum
- Last updated: 2026-07-01 00:07:35 UTC
https://github.com/thedevreda/jadaerospace
A real-life project showing how to improve aircraft parts sales and help sellers focus on more effective products at JadAero
data data-analysis data-cleaning data-visualization jupyter-notebook powerbi python
Last synced: 02 Aug 2025
https://github.com/anct-cartographie-nationale/mednum-cli
✨ Command-line interface for transforming data on digital mediation locations, collected in a non-standard format, into the mednum schema and publishing it on data.gouv
anct betagouv data donnees gouvernement mediation-numerique nodejs open-data transformation
Last synced: 02 Aug 2025
https://github.com/smeltier/data-structures-c
This repository contains C language implementations of the main data structures covered in the Algorithms and Data Structures course. The implementations were developed as part of my hands-on learning process and include sequential lists, linked lists, and other fundamental structures.
algorithms algorithms-and-data-structures c c-language c-programming data data-structures data-structures-c structures-c
Last synced: 16 May 2025
https://github.com/plurid/defocus
Apophatic User Content Resolution [Desearch Concept]
Last synced: 08 Nov 2025
https://github.com/sebastianhochreiter/sql-projects
business-intelligence data datascience microsoft microsoft-sql-server sql
Last synced: 22 Feb 2026
https://github.com/flyconnectome/hnf
Documentation for the hierarchical neuron format
annotations data dotprops hdf5 mesh neurons skeleton storage
Last synced: 17 Jan 2026
https://github.com/bdr-pro/streamlint
Ultra-cool Streamlit app, where you can interact with widgets, see data in action, and even upload and download files
Last synced: 14 Apr 2026
https://github.com/fatihilhan42/nba-players-data-1950-to-2021
This project examines data on NBA players from 1950 to 2021. The players' seasons, heights, performance, point averages, teams, and positions were obtained from CSV files, and key tables and graphs were then created using data cleaning and data visualization algorithms.
data data-analysis data-engineering data-science data-visualization
Last synced: 16 Oct 2025
https://github.com/vuurvos1/functional-programming
HVA functional-programming
data formatting functional nodejs programming
Last synced: 03 Oct 2025
https://github.com/nisanth2004/springboot-kafka-real-world-project-wikimedia
Creating a project about Wikimedia using Kafka involves building a system that leverages Apache Kafka for data streaming and processing related to Wikimedia data.
async broker communication data java kafka message real-time real-time-analytics springboot wikimedia
Last synced: 14 May 2026
https://github.com/dug22/jjournal
A Jupyter-like notebook application for Java
data data-analysis data-science java jshell jshell-repl notebook swing swing-application
Last synced: 11 Apr 2026
https://github.com/bhojpur/dlm
The Bhojpur DLM is a software-as-a-service product used for Data Lifecycle Management, based on the Bhojpur.NET Platform for data delivery.
Last synced: 19 Feb 2026
https://github.com/abdullahashfaqvirk/earth-engine-data-scraper
A Python-based web scraper designed to extract and organize dataset metadata from the Google Earth Engine Datasets Catalog for research and analysis purposes.
beautifulsoup data data-science python requests scraper web-scraping
Last synced: 10 May 2026
https://github.com/vim89/flowforge
Let's be honest - most data pipeline frameworks treat types as suggestions. Config files are strings. Schemas are "validated" at runtime. Data quality is an afterthought. So, let's do it differently.
archetype data data-contracts data-engineering data-pipelines data-quality data-science database dataengineering datapipeline etl etl-framework pipelines scala scalability spark spark-sql spark-streaming
Last synced: 14 Apr 2026
https://github.com/plurid/datasign
Single Source of Truth Data Contract Specifier
Last synced: 08 Nov 2025
https://github.com/nushratjabenaurnima/cse_477_data_mining
A collection of labs, reports, Jupyter notebooks, and project outputs for the CSE 477 Data Mining course. This repository tracks my learning journey through data preprocessing, association rules, clustering, classification, and real-world data analysis with Python.
data data-analysis data-mining data-science google-colab-notebook jupyter-notebook machine-learning python python-3
Last synced: 09 Apr 2026
https://github.com/diegoperea20/datos-secuenciales-con-ia
One-dimensional signal processing with autoregressive models, 1D convolution, 2D convolution using the spectrogram, and recurrent networks
ai artificial-intelligence convolutional-neural-networks data ia secuential-data spectrogram uao
Last synced: 06 Feb 2026
https://github.com/edjoukou/human_resources
A data analysis project using MySQL Server database
analysis data mysql powerbi sql visualization
Last synced: 25 Sep 2025
https://github.com/sanchittechnogeek/overscripted-analysis
Geolocation and user language extraction analysis from Mozilla Overscripted dataset
analysis data data-analysis mozilla
Last synced: 23 Mar 2025
https://github.com/mendel5/wifi
Information about Wi-Fi (wifi, WLAN, wireless LAN)
bitrate data data-transmission ethernet internet latency speed throughput transfer transmission wi-fi wifi wireless wireless-lan wlan
Last synced: 02 Aug 2025
https://github.com/pythoncoderunicorn/tool-discography
Music Band TOOL albums and songs dataset
data data-science metal-music music r songs
Last synced: 26 May 2026
https://github.com/ryanga09/digitalent_fundamentaldatascience-selfpractice
A repository of hands-on projects from DigiTalent’s Fundamental Data Science training, covering web scraping, data exploration, data cleaning, and data annotation. Includes Jupyter notebooks and example code for practical learning.
data data-analysis data-science data-visualization dataset digitalent komdigi notebook-jupyter notebooks
Last synced: 02 Aug 2025
https://github.com/abdellah-laassairi/thyroid-disease-analysis
Thyroid dataset visualization dashboard in R
dashboard data flexdashboard imputation-methods rshiny visualization
Last synced: 18 Jan 2026
https://github.com/climate-resource/input4mips_validation
Validation of input4MIPs data
cmip data forcing input4mips validation
Last synced: 20 Jan 2026
https://github.com/kalaspuff/ready
🎟 [not yet built] Take control of the event loop with simplified task management, queueing and data loading.
asyncio data dataloading event futures python python3 resolver tasks
Last synced: 10 May 2026
https://github.com/ohspc89/better_call_jin
A repository containing mentoring materials for a Ph.D. student in Neuroscience
data matlab spss-statistics visualization visualization-tools wrangling-data
Last synced: 08 Oct 2025
https://github.com/gianlucatruda/titanic
An exhibition of my experience in data processing and visualisation. Python script to process and visualise the Titanic survivor data.
data database flask info matplotlib python science scrape server titanic visualisation web
Last synced: 10 Apr 2026
https://github.com/jacob-pitsenberger/python-electronics-inventory-management-system-object-oriented-programming-project
Welcome to the Python Electronics Inventory Management System project repository! This project is a demonstration of Object-Oriented Programming (OOP) principles in Python for managing an electronic parts inventory.
data data-structures dictionary exception-handling file-io filesystem input-output inventory-management-system management-system modules oop pickle python user-interface
Last synced: 08 Oct 2025
https://github.com/jigyasag18/airline-performance-and-passenger-satisfaction-project-using-big-data-analytics
This project analyzes 10 years of U.S. domestic airline data (~3GB) using Hadoop (Cloudera) and Hive for data processing. Power BI dashboards visualize key metrics like delays, on-time rates, air time, and diversions. The solution includes Hive queries, DAX measures, HDFS ingestion scripts, and year-wise insights with recommendations.
big-data big-data-analytics bigdata cloudera cloudera-hadoop cloudera-hadoop-framework data data-analysis data-visualization database hadoop hive power-bi powerbi powerbi-dashboard powerbi-dashboards powerbi-report powerbi-visuals powerbi-visuals-tools powerbidashboard
Last synced: 01 Aug 2025
https://github.com/danieljdufour/fast-b64
Quickly Convert between B64 and Binary Strings
b64 base64 base64-decoding base64-encoding binary bits compression data
Last synced: 08 Oct 2025
https://github.com/dpbm/depencies-sets
Append multiple dependencies to your Python project quickly
data dependencies dependencies-list dependencies-manager dependencies-set frameworks libraries lists pip py python python3 web
Last synced: 17 Oct 2025
https://github.com/mwiatrzyk/modelity
Data parsing and validation library for Python
data library model parsing python tool validation
Last synced: 18 Jan 2026
https://github.com/rahul1582/bank-loan-classification
Classifying whether or not a person will take a personal loan, using multiple classification algorithms.
algorithm analysis classi data
Last synced: 08 Oct 2025
https://github.com/jigyasag18/global-terrorism-1970-2017-analysis-using-big-data
This repository explores over 180,000 terrorist incidents across 205 countries using Hadoop and Power BI. The project identifies global and regional patterns in terrorism, analyzes the impact on civilians, and highlights high-risk areas. Key insights include attack trends, weapon usage, top terror groups, and country-specific risks like those in India.
big-data big-data-analytics data data-analysis data-visualization dataanalytics dataset hadoop hive hive-database hive-db hivedb power-bi powerbi powerbi-dashboards powerbi-desktop powerbi-report powerbi-report-validation powerbi-visuals powerbidashboard
Last synced: 19 Feb 2026
https://github.com/djdhairya/whatsapp-chat-analysis
WhatsApp chat analysis is a multidimensional process that delves into the content, structure, and dynamics of conversations within the platform. It provides valuable insights for personal reflection, organizational decision-making, and improving communication strategies.
data data-science dataanalytics datapreprocessing machine-learning ml
Last synced: 08 Oct 2025
https://github.com/jigyasag18/ai-ml-salaries-and-ai-tools-usage-trends
This repository presents an in-depth Power BI analytics report on AI job market trends and student AI tool usage from 2020 to 2025. It combines structured datasets (job postings, salaries, surveys) with custom DAX measures to uncover key patterns in salaries, remote work, industry demand, and student engagement. It includes 5 interactive dashboards.
analysis data data-analysis data-visualization dataanalysis dataanalytics dataset datavisualization power-bi powerbi powerbi-dashboards powerbi-desktop powerbi-report powerbi-visuals powerbidashboard visualization
Last synced: 16 Feb 2026
https://github.com/udofia2/crudwithdatabase
A simple Node.js app that connects to a database.
Last synced: 08 Oct 2025
https://github.com/thicclatka/tetration
New file format for tensors
cli data fileformat mmap tensors
Last synced: 26 May 2026
https://github.com/leevilaukka/alkometriikka
Tool to search Alko database and see some fun stats about different beverages
data gh-pages svelte typescript xlsx
Last synced: 18 May 2026
https://github.com/athari22/analyzing-the-yelp-dataset
SQL for Data Science
analytics data data-science data-structures er sql
Last synced: 27 Jan 2026
https://github.com/neptun-software/neptun.data.generators
Send scraped data from neptun-scraper to ChatGPT to generate training data for NEPTUN.AI.
Last synced: 30 Jul 2025
https://github.com/bastianolea/cut_comunas
Updated version of the unique territorial codes (CUT) of the communes and regions of the country.
Last synced: 24 Jun 2026
https://github.com/mat06mat/matbot
My discord bot code
data discord-bot discord-py py-cord
Last synced: 17 Oct 2025
https://github.com/huseyincenik/power_bi
PowerBI Files
dashboard data data-science data-structures data-visualization dataset datavisualization dataworld microsoft-power-bi powerbi powerbi-custom-visuals powerbi-dashboards powerbi-desktop powerbi-reports powerbi-visuals powerbidashboard powerquery querying user-interface workout-wednesday
Last synced: 05 Jan 2026
https://github.com/jacopodl/jcollections
Common data structures for the C language
c collections data data-structures jcollections
Last synced: 30 Jul 2025
https://github.com/gunjanmimo/d3-visualization
D3.js is a JavaScript library for producing dynamic, interactive data visualizations in web browsers. It makes use of Scalable Vector Graphics, HTML5, and Cascading Style Sheets standards. It is the successor to the earlier Protovis framework
d3js data data-science data-visualization reactjs
Last synced: 29 Apr 2026
https://github.com/anarya22/e-commerce_analysis
E-Commerce_Analysis is a data analysis project performed on the Superstore_USA dataset. It explores various aspects of e-commerce performance, including sales trends, customer demographics, product categories, and regional performance. The analysis includes data cleaning, visualizations, and insights on factors influencing sales and profitability.
analysis analytics cleaning-data data
Last synced: 09 Oct 2025
https://github.com/schoolsquirrel/holiday-data
Automatically updated holiday data for SchoolSquirrel
data holidays schoolsquirrel scripts vacation
Last synced: 03 Oct 2025
https://github.com/coderixc/rforai
Learn R Programming Language for Statistics & Data Science
artificial-neural-networks data data-science deep-neural-networks machine-learning probability quant-analyst r science
Last synced: 09 Oct 2025
https://github.com/ronknight/user-data-dashboard
📈 A data visualization tool for analyzing user data using an Excel-based data source.
dashboard data excel ga4 screenshot
Last synced: 17 Oct 2025
https://github.com/darshjasani/insurance-claim-analysis
This dataset contains insightful information related to insurance claims, giving us an in-depth look into the demographic patterns of those receiving them.
Last synced: 27 Aug 2025
https://github.com/ssiarhei115/countryhouse-price-prediction
ML modeling for house price prediction in Belarus
big-data data data-science fullstack fullstack-development mashine-learning parsing parsing-engine
Last synced: 28 Aug 2025
https://github.com/rahult18/atmo-flow
AtmoFlow is a robust data engineering pipeline built on Google Cloud Platform (GCP) that processes and analyzes weather and air quality data in both batch and streaming modes
airflow data data-modeling data-science data-visualization dataengineering gcp-bigquery gcp-cloud-composer gcp-cloud-functions pyspark
Last synced: 23 Jun 2026
https://github.com/cburmeister/disc-golf-courses
All the disc golf courses I've played at. Maintained with http://geojson.io/.
Last synced: 21 Jan 2026
https://github.com/miozilla/fraudfinder
fraudfinder: Historical Payment Transactions # Fraud Detection # EDA # Feature Store # Model Registry
analysis data exploratory feature-store fraud-detection
Last synced: 29 Aug 2025
https://github.com/smaug6739/data-bit
This project is a module for converting a structured dataset into a number that can be stored in a database taking up little space.
Last synced: 14 May 2026
https://github.com/faster-games/dynamic-components
Dynamic Runtime Components for Unity3D
Last synced: 11 Apr 2026
https://github.com/lucasnbsb/data-structures-and-algorithms
Studying data structures and algorithms, mostly on leetcode
Last synced: 29 Aug 2025
https://github.com/psyteachr/sdg-data
Data relevant to the UN Sustainable Development Goals
Last synced: 09 Oct 2025
https://github.com/ashamethedestroyer/data-structures
Dedication of all Data Structures Creation 🛠
cpp data data-structures implementation implementation-of-data-structures structure structured-data
Last synced: 23 May 2026
https://github.com/debjyotisaha/tableau-projects-phase-2
Published interactive dashboards on Tableau Public, highlighting expertise in data visualization and storytelling through analyses of transportation patterns, sales trends, and demographic studies. These projects showcase the ability to transform complex datasets into actionable, intuitive visuals for decision-making.
dashboards data data-analysis data-visualisation tableau
Last synced: 26 Aug 2025
https://github.com/mateuszskoczek/generatorcsv
GeneratorCSV is a student and teacher data converter for Microsoft 365 Admin Center. The project was implemented for Sobolew High School.
admin converter data microsoft365 python school tkinter
Last synced: 26 Aug 2025
https://github.com/analyst-amitbisht/pizza-sales-report-
It's a guided project to practice tools like SSMS and Power BI, as well as skills like data cleaning, data exploration, data analysis, and data visualization.
analytics data data-visualization powerbi sql-server
Last synced: 18 Oct 2025
https://github.com/tomcardoso/journalism-data-intersection
A talk on working at the intersection of journalism and data science
data data-journalism journalism
Last synced: 15 May 2025
https://github.com/n4en/python-for-data-engineers
Python for data engineers
data data-engineer data-engineering dataengineering python python-notebooks python3 tutorial
Last synced: 26 Aug 2025
https://github.com/0xnu/data-analyst-training
The repository contains training materials for data analysts.
data data-analysis data-analyst
Last synced: 25 Aug 2025
https://github.com/awpala/udemy-my-courses-data-parser
Download Udemy list and course metadata for an authenticated student user
Last synced: 07 May 2026
https://github.com/franckalbinet/maris-crawlers
Automated data harvesting of MARIS data sources
automation data marine-radioactivity
Last synced: 25 Aug 2025
https://github.com/ssiarhei115/shop-customers-segmentation
Shop customers segmentation
data data-analysis data-science data-visualization
Last synced: 24 Aug 2025
https://github.com/luminati-io/google-maps-dataset-samples
A sample dataset of over 1000 Google Maps businesses, extracted using the Bright Data API, ideal for competitor analysis, location-based marketing, and market strategies.
api data dataset google-maps maps web-scraping
Last synced: 03 Jan 2026
https://github.com/bscript07/softuni-javascript-applications
JavaScript for Applications course at SoftUni - Oct 2023
architecture-component authentication client-side-rendering-seo data lit-html-template routing
Last synced: 15 Mar 2025
https://github.com/0xHericles/SpamDetector
:email: A Simple Python Spam Detector with Scikit-Learn
data ham machine-learning python sklearn spam
Last synced: 24 Mar 2025
https://github.com/xp-forge/marshalling
Marshalling
data marshalling object-mapping xp-framework
Last synced: 02 Jul 2025
https://github.com/vidushibhadana/eda-on-nyc-taxi-data
Conducting an exploratory data analysis (EDA) on New York City taxi data and visualizing it through count plots, distribution plots (displot), and histograms using Python and its libraries.
data data-visualization jupyter-notebook matplotlib numpy pandas python seaborn
Last synced: 11 Apr 2026
https://github.com/meokullu/colorizenumber
ColorizeNumber - Bodrum Papatya, visualizes numeric data as colors to create an image.
color colorize colors data data-visualization visualization vizualize-data
Last synced: 01 Jun 2026
https://github.com/dvaser/heart-attact-analysis-prediction
DATA ANALYSIS
classification data data-analysis data-visualization jupyter jupyter-notebook lineer-regresyon machine-learning python regression
Last synced: 20 Jan 2026
https://github.com/ehvenga/data.driven.modeling
Repository to practice data-driven modelling
Last synced: 23 Mar 2025
https://github.com/turner-kendall/turner-kendall
Turner Kendall - dev, opps, sec.
config data github-config go rust security
Last synced: 31 Oct 2025
https://github.com/wlgs/got-dialogues-data-stats
Game of Thrones dialogues data statistics processed with R and SQLite. Project for Probability and Statistics course 21/22 at AGH UST. The project was about manipulating data and getting many pieces of information from it in addition to visualizing these results.
data game-of-thrones got r statistics stats
Last synced: 22 May 2026
https://github.com/45harry/potato_disease_classification
Potato Disease Classification - Training, REST API, and frontend for testing
cnn-classification data data-science datapreprocessing deep-learning fastapi flaskapi frontend keras restapi tensorflow
Last synced: 12 Apr 2026
https://github.com/anuragagarwal96/hospital-mortality-rate-sql-analysis
In this project, I took a hospital dataset from Kaggle, analysed it, and predicted the mortality rate of patients admitted to hospitals. I used a combination of SQL, Tableau, and Microsoft Excel for this project.
data data-visualization dataanalysis dataanalysisusingsql excel msexcel mssqlserver sql tableau tableau-public
Last synced: 09 Mar 2026
https://github.com/0xHericles/ufcg-geojson
GeoJSON file containing the blocks and buildings of the Federal University of Campina Grande.
data data-visualization geojson map open-source ufcg university
Last synced: 24 Mar 2025