data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum
- Last updated: 2026-07-01 00:07:35 UTC
https://github.com/raphaellaude/usaschooldata
Cleaned and accessible school enrollment data for US schools
data duckdb duckdb-wasm education object-storage oss wasm
Last synced: 12 May 2026
https://github.com/vlamug/ratibor
Ratibor is a service for making metrics from data
Last synced: 10 Mar 2026
https://github.com/csoren66/financial-budget-analysis
Financial budget for 2021
Last synced: 03 Mar 2025
https://github.com/ashu3291/blinkit-app-store-
Conducted a comprehensive analysis of Blinkit's sales performance, customer satisfaction, and inventory distribution to improve sales.
cleaning-data data dataanalysis-projects powerbi-visuals powerbidashboard sql
Last synced: 05 Jan 2026
https://github.com/yorkearwaker/data
Data things; representation, transformation, pipelines, governance,
actuality data epistemology information knowledge ontology
Last synced: 07 Apr 2025
https://github.com/dms-codes/scrape-kesaintblanc-id
Kesaintblanc Data Scraper. This Python script scrapes product data from the Kesaintblanc website. It collects information about products, including product name, URL, price, image URLs, status, stock, and more. The scraped data is saved to a CSV file for further analysis.
data kesaintblanc python webscraper
Last synced: 27 May 2026
https://github.com/abirsaha111/ipl-2022-analysis
The IPL 2022 Analysis project is a data-driven exploration of the Indian Premier League (IPL) 2022 cricket tournament. The analysis focuses on utilizing Python programming and various libraries to analyze and visualize the performance of teams, players, and key metrics in the IPL 2022 season.
data dataana dataanalytics datavi matplotlib python
Last synced: 07 Jun 2026
https://github.com/charon25/weatherdata
17,000 weather measurements collected by a weather station built for a college project.
csv data dataset datasets json measurements strasbourg weather weather-data
Last synced: 16 Jan 2026
https://github.com/dineshkumarkotha/impact-of-alcohol-consumption-on-public-health
Impact of Alcohol Consumption on Public Health
analyzation data datapreprocessing datavisualization tableau
Last synced: 05 Jan 2026
https://github.com/jpcadena/palmer-penguins
Palmer Penguins
analytics csv data data-analytics data-science exploratory-data-analysis matplotlib numpy palmer-penguin pandas plotly pylint python seaborn visualization
Last synced: 11 Apr 2026
https://github.com/karo23361/toy-store-kpi-power-bi
PowerBI Portfolio Project
csv data data-visualization powerbi
Last synced: 03 Feb 2026
https://github.com/cognitixe/metamask-wallet-recovery-funds-phrase-data-seed-token
This repository provides tools and guidelines for securely recovering MetaMask Wallet funds using recovery phrases, seed data, and tokens. It ensures safe and reliable methods for recovering access to your wallet and managing your cryptocurrency assets.
bitcoin blockchain cryptocurrencies cryptocurrency data ethereum funds metamask metamask-bot metamask-desktop metamask-extension metamask-plugin metamask-snap metamask-wallet phrase recovery seed token wallet wallet-security
Last synced: 13 May 2026
https://github.com/equinor/fmu-sumo
Interaction with Sumo in the FMU context
analytics data fmu python subsurface sumo visualization
Last synced: 01 May 2025
https://github.com/gunjanmimo/d3-visualization
D3.js is a JavaScript library for producing dynamic, interactive data visualizations in web browsers. It makes use of Scalable Vector Graphics, HTML5, and Cascading Style Sheets standards. It is the successor to the earlier Protovis framework.
d3js data data-science data-visualization reactjs
Last synced: 29 Apr 2026
https://github.com/bastianolea/cut_comunas
Updated version of the unique territorial codes (CUT) for the communes and regions of the country.
Last synced: 24 Jun 2026
https://github.com/kalaspuff/ready
🎟 [not yet built] Take control of the event loop with simplified task management, queueing and data loading.
asyncio data dataloading event futures python python3 resolver tasks
Last synced: 10 May 2026
https://github.com/smeltier/data-structures-c
This repository contains C language implementations of the main data structures covered in the Algorithms and Data Structures course. The implementations were developed as part of my hands-on learning process and include sequential lists, linked lists, and other fundamental structures.
algorithms algorithms-and-data-structures c c-language c-programming data data-structures data-structures-c structures-c
Last synced: 16 May 2025
https://github.com/eslamdyab21/data-visualization-using-matplotlib-and-seaborn
This is the last project in the Udacity nanodegree program. It's about data visualization.
data data-analysis data-visualization matplotlib pandas python seaborn udacity udacity-data-analyst-nanodegree
Last synced: 09 May 2026
https://github.com/suryadev99/stream_processing_website_click_data
Stream processing of website click data using Kafka, monitored and visualized with Prometheus and Grafana.
clickdata data dataengineering docker flink-kafka flink-metrics flink-stream-processing git grafana kafka kafka-streams kafka-topic prometheus psql python
Last synced: 10 Mar 2026
https://github.com/2022-04-11588/data-fakes
🔍 Generate realistic fake data for testing and development, enhancing your projects with simple, customizable data solutions.
data dataset developer-tools fake-content faker fakery groovy java mock phoenix python random ruby seeding struct swift-framework test-data testing
Last synced: 11 Apr 2026
https://github.com/halyusa16/mysql-employee-analysis
This project focuses on analyzing employee data through querying, performing table joins to connect related information, aggregating salary statistics, and using subqueries to extract meaningful insights.
data data-analytics data-exploration database mysql self-project sql
Last synced: 20 Jan 2026
https://github.com/avestura/shell-dads
❓ Show a random tip from NIST DADS (https://xlinux.nist.gov/dads) every time you open your terminal
algorithms dads data data-structures ds nist
Last synced: 23 Oct 2025
https://github.com/lotfiferaga/instagram-reach-analysis
The Instagram Reach Analysis project aims to develop a Python-based tool to analyze the reach and engagement metrics of Instagram posts.
analytics data data-science datavisualization python
Last synced: 18 Jun 2026
https://github.com/nel-zi/insighthire_agency
Built a web scraping solution using BeautifulSoup to extract job listings from MyJobMag, cleaned the data, and loaded it into PostgreSQL with SQLAlchemy for better job data management.
data dataloading datatransformation sql webscraping
Last synced: 16 May 2025
https://github.com/dansalahi/query-builder-experiment
Customized Query Builder for creating Rules and Groups
data data-structures jsonlogic query-builder reactjs typescript validation
Last synced: 11 Apr 2026
https://github.com/adamouization/python-machine-learning-data-science-notes
📙 Jupyter notebooks containing useful Python code and notes for general Machine Learning and Data Science projects.
data data-science data-visualization guide jupyter jupyter-notebook machine-learning matplotlib notes numpy pandas pandas-dataframe python seaborn
Last synced: 11 Apr 2026
https://github.com/boratechlife/tensorflow-questions-datasets
TensorFlow question datasets to help you practice machine learning and train models.
data datapreprocessing datasets machinelearning modeltrain questions tensorflow
Last synced: 23 Mar 2025
https://github.com/rezapace/newbash
This project involves managing various application shortcuts and configurations primarily for a Linux environment. It includes scripts for creating .desktop entries for applications, managing system configurations, and handling application processes.
automation backup bash data dekstop linux newbash ohmyzsh script testing zsh
Last synced: 11 Apr 2026
https://github.com/roovedot/unet-cnn-for-road-segmentation
(In Progress) Unet architecture with CNNs (Convolutional Neural Networks) aimed at Road Segmentation
cnn cnn-for-visual-recognition cnn-pytorch computer-vision data data-engineering data-science unet unet-image-segmentation unet-pytorch
Last synced: 01 Jul 2025
https://github.com/thirza258/country-sdg
VOX ASTRA Submission : Country SDG
css d3 d3-visualization data django html python sdg social social-good un visualization
Last synced: 11 Apr 2026
https://github.com/miniql/miniql-inline
A MiniQL query resolver for inline data.
Last synced: 27 May 2026
https://github.com/vishwas-chakilam/hr-dashboard
This project involves creating an interactive HR Dashboard using Power BI for visualization and MySQL for data cleaning and analysis. It provides insights into employee performance, attrition, salary distribution, and hiring trends.
dashboard data datac datacleaning datavisualization mysql powerbi
Last synced: 23 Mar 2025
https://github.com/michaelschoenburg/rapidfiretools-computerdatacollector-automation
Automation for RapidFire Tools Computer Data Collector.
automation collector computer data fire powershell powershell-script rapid rapidfire-tools tools
Last synced: 01 Jul 2025
https://github.com/hemangsharma/dataanalysis
This repo contains analyses of NASDAQ data, such as a dashboard and a time series forecast.
analysis data data-analysis data-visualization python
Last synced: 10 Mar 2026
https://github.com/foufou-exe/finegourmet
Project Business Intelligence Architecture
business-intelligence data docker docker-compose etl mysql powerbi pyspark python workflow
Last synced: 11 Apr 2026
https://github.com/lablnet/alibaba_scraper
This is a robust web scraper that extracts data from the Alibaba website. It's multi-threaded and utilizes Playwright to efficiently scrape data from the website. This script is capable of scraping the entire Alibaba site, which would take approximately 4-6 months to complete.
alibaba data ecom mit-license open-source products scraper
Last synced: 15 Mar 2025
https://github.com/wittyicon29/kritika-iit-b-2023
Selection task for the summer projects of Kritika IIT-B
data data-analysis data-science
Last synced: 15 Mar 2025
https://github.com/jamiew/void-runners-analysis
basic data analysis for the Void Runners Genesis Fleet spaceships
Last synced: 29 Mar 2025
https://github.com/amir76717/healthai-pro
HealthAI Pro revolutionizes the healthcare experience by leveraging cutting-edge AI technologies to provide intelligent, personalized healthcare solutions to patients and medical professionals alike. This platform incorporates machine learning, natural language processing, and robust data management to enhance the quality of healthcare services.
Last synced: 31 Mar 2025
https://github.com/mohammad-malik/covid-visualizations-d3
This project provides a dashboard with five different perspectives on the pandemic, from patient-infection relationships to regional trends and hierarchical distributions. This was developed as part of a project for the course Data Analysis and Visualization (DS3001).
covid-19 d3 d3-visualization d3js data data-analysis data-analytics data-science visualization
Last synced: 28 May 2026
https://github.com/meizuflux/cion
Python minimal data validation library
data minimal python validation
Last synced: 28 May 2026
https://github.com/pranjaldhamane/social-media-sentiment-analysis
This project aims to analyze sentiment in Twitter data to understand attitudes towards specific topics or entities. It seeks to uncover positive and negative sentiment patterns, detect potential cyberbullying or hate speech, and provide insights into Twitter's overall sentiment landscape.
data dataanalysis logistic-regression nlp-machine-learning python sentiment-analysis twitter
Last synced: 18 Apr 2026
https://github.com/stdlib-js/ndarray-vector-int8
Create a signed 8-bit integer vector (i.e., a one-dimensional ndarray).
constructor ctor data int8 javascript ndarray node node-js nodejs stdlib structure types vec vector
Last synced: 24 Apr 2026
https://github.com/parablelab/parable
Work in progress...
data data-management data-platform data-validation database pipelines
Last synced: 28 May 2026
https://github.com/rubyonworld/ldpath
This is a Ruby implementation of LDPath, a language for selecting values from linked data resources.
Last synced: 12 Nov 2025
https://github.com/mubashirsidiki/certifications_work
This repository contains my work, projects, and solutions from various professional certification programs.
analysis coursera data data-science google ibm john-hopkins machine-learning michigan udemy
Last synced: 01 Jul 2025
https://github.com/juanandres-montero/dataanalysis
Dedicated to data analysis.
Last synced: 10 Aug 2025
https://github.com/bertrand31/one-billion-rows-challenge
🌪️ Pushing Scala to its limits to aggregate a billion rows' worth of data in 2.42 seconds
competitive-programming competitive-programming-contests data data-engineering data-processing performance scala
Last synced: 05 Sep 2025
https://github.com/plnech/never2late
Never 2 Late - a reinterpretation of Everest Pipkin's 'i've never picked a protected flower'
dada dada-science data generative-art glitch-art installation nlp poetry spacy vector-similarity wallpaper
Last synced: 10 Jun 2025
https://github.com/bablukumarjha/startup-funding-revenue-analysis-by-sql-and-pandas
SQL project analyzing startup funding, revenue, and founder data to extract business insights using Python and MySQL.
data data-analysis data-platform data-science dataanalysisusingpython dataanalytics pandas-dataframe pandas-library python sql sql-server sqlalchemy sqldatabase
Last synced: 18 May 2026
https://github.com/moeabbas6/bq_data_loader
A Python script for executing and logging batch SQL commands in Google BigQuery. Includes tracking of execution times, unique job and statement IDs, and automated logging to a specified BigQuery table.
Last synced: 24 Mar 2025
https://github.com/mikeqfu/network-rail-track-fixity-layer
This project develops a data mining tool for analysing and predicting track movements using asset data, environmental factors and track design knowledge to model key parameters and generate fixity values for the GB rail network.
data data-integration data-mining data-science information-management knowledge-discovery point-cloud rail rail-alignment rail-track track-fixity
Last synced: 02 Sep 2025
https://github.com/murshidazher/client-side-data-storage
🚌 A workspace containing client-side data storage implementations
cache cache-storage client-side data indexeddb localstorage sessionstorage storage websql
Last synced: 02 Sep 2025
https://github.com/heyimsteve/solnftdatadash
This is a React-based web application that provides detailed information about NFT collections on the Solana blockchain. It uses the HelloMoon API to fetch and display data about NFT collections, including statistics, loan summaries, ownership information, and floor prices.
dashboard data hellomoon nft react solana solana-nft
Last synced: 30 Jan 2026
https://github.com/bdr-pro/graphyml
A powerful, interactive Streamlit application to explore, edit, visualize, and query a graph-based database of YAML nodes — ideal for movie metadata, research articles, or structured knowledge graphs.
Last synced: 23 Jul 2025
https://github.com/amethyst-php/owner
amethyst amethyst-package api data laravel owner
Last synced: 28 Apr 2026
https://github.com/shudhanshusaurabh001/super_market-data-analysis-using-python
This project focuses on analyzing supermarket sales data using Python. The goal is to extract meaningful insights from the dataset, such as sales trends, customer purchasing behavior, and product performance.
analysis csv data insights matplotlib numpy pandas project python seaborn
Last synced: 06 Apr 2026
https://github.com/jameshenderson12/chatbot-utils
Generic data and elements that can be reused or repurposed for chatbot development.
boilerplate chatbot data development elements intents template utterances
Last synced: 04 Mar 2026
https://github.com/flowsynx/plugin-base64
FlowSynx plugin that provides encoding and decoding of Base64 strings, allowing workflows to handle Base64 content transformations efficiently.
base64 base64-decoding base64-encoding data data-platform decoding encoding flowsynx flowsynx-plugins
Last synced: 10 Mar 2026
https://github.com/dvaser/heart-attact-analysis-prediction
DATA ANALYSIS
classification data data-analysis data-visualization jupyter jupyter-notebook lineer-regresyon machine-learning python regression
Last synced: 20 Jan 2026
https://github.com/awpala/udemy-my-courses-data-parser
Download Udemy list and course metadata for an authenticated student user
Last synced: 07 May 2026
https://github.com/faster-games/dynamic-components
Dynamic Runtime Components for Unity3D
Last synced: 11 Apr 2026
https://github.com/smaug6739/data-bit
This project is a module for converting a structured dataset into a number that can be stored in a database taking up little space.
Last synced: 14 May 2026
https://github.com/jigyasag18/aircraft-data-management
This repository offers a comprehensive simulation of global military air deployments involving 10 countries, aircraft models, mission types, and strategic zones. It analyzes air power distribution, mission intent (offensive, defensive, support), and geopolitical positioning. The project provides structured insights into regional & zone level threat
aircraft-data aircraft-performance data data-analysis data-visualization database database-management dataset datavisualisation mysql powerbi powerbi-report powerbi-visuals sql
Last synced: 04 Feb 2026
https://github.com/ailixter/gears-dictionary
The project, which Gears Dictionary
arrays data dictionaries dictionary php struct utilities
Last synced: 19 Jul 2025
https://github.com/abhijeetdasbakshi/ecommerce-insights
A Dockerized end-to-end project that combines unsupervised machine learning for customer segmentation with scalable data pipelines. It uses MongoDB for data ingestion, Scikit-learn for clustering, Airflow for orchestration, and Streamlit for interactive visualization — enabling actionable insights into e-commerce
airflow airflow-dags ci-cd-pipeline clustering dags data data-pipelines docker docker-compose docker-container dockerfile git great-expectations kafka mongodb pca-analysis postgresql pyspark t-sne umap-learn
Last synced: 04 Apr 2026
https://github.com/0xHericles/SpamDetector
✉️ A Simple Python Spam Detector with Scikit-Learn
data ham machine-learning python sklearn spam
Last synced: 24 Mar 2025
https://github.com/xp-forge/marshalling
Marshalling
data marshalling object-mapping xp-framework
Last synced: 02 Jul 2025
https://github.com/jpcadena/ventas-facturas
Sales with invoices
data data-analysis data-exploration data-extraction data-science excel feature-engineering matplotlib microsoft numpy pandas powerbi product-sales pylint python receipts sales
Last synced: 12 Apr 2026
https://github.com/45harry/potato_disease_classification
Potato Disease Classification - Training, REST API, and front end for testing
cnn-classification data data-science datapreprocessing deep-learning fastapi flaskapi frontend keras restapi tensorflow
Last synced: 12 Apr 2026
https://github.com/0xHericles/ufcg-geojson
GeoJSON file containing the blocks and buildings of the Federal University of Campina Grande.
data data-visualization geojson map open-source ufcg university
Last synced: 24 Mar 2025
https://github.com/ragibasif/bobdylan
Bob Dylan
bob-dylan csv data data-science data-visualization lyrics music python
Last synced: 03 Sep 2025
https://github.com/noedemange/orderedheatmapanalysis
OrderedHeatMapAnalysis (OHMA) is a direct data analysis framework for simultaneously visualizing and analyzing the structure of complex datasets. An optimized seriation of the rows and columns of the input data table is performed, mapping the whole dataset into an ordered heatmap.
analysis bi-seriation data dataanalysis heatmap r rstats seriation shiny shiny-apps
Last synced: 27 Feb 2025
https://github.com/petzi53/repair
R Datasets of the Open Repair Alliance (ORA).
Last synced: 19 May 2026
https://github.com/cpietsch/breitband
developer repo of breitband-berlin
d3js data threejs visualization
Last synced: 02 May 2026
https://github.com/shahsuvarli/election-voters-data-analysis-pandas
Educational project analyzing Azerbaijan voter demographics with pandas, focusing on data cleaning, grouping, and visualization.
cleaning data grouping matplotlib numpy pandas python visualization
Last synced: 12 Apr 2026
https://github.com/pew-pew-team/hydrator
Hydrator kernel component
data deserializer dto hydrator kernel mapper mapping serializer structure
Last synced: 24 Mar 2025
https://github.com/arthurdanjou/studies
💼 This is the repository containing all my projects done during my studies in Python and R.
ai data data-science data-visualization jupyter jupyter-notebook ml python r
Last synced: 08 Apr 2025