data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-06-23 00:07:41 UTC
- JSON Representation
https://github.com/cqllum/schema2dwh
⚡ Automatically produce a data model on your database using its information schema using GenAI.
ai data data-structures dataengineering datawarehousing dwh gemini gemini-api genai reporting reporting-tool schema-design
Last synced: 13 Mar 2025
https://github.com/nik-kusanagi/bash.sh-treinamento
Versão mais organizada (+ ou -)
data database debian gnome gnome-extension gnu gnu-linux linux shell shell-script
Last synced: 05 May 2026
https://github.com/castelao/bufr
BUFR binary data format from WMO
binary data format meteorology oceanography wmo
Last synced: 13 Jul 2025
https://github.com/lane-romuald/iot-irrigation-data-collection-system
An IoT-based data collection system using the ESP32 microcontroller programmed with Arduino to monitor environmental conditions for smart irrigation. The system measures soil moisture, temperature, air temperature, humidity, and rain probability. Data is stored locally on an SD card and uploaded to the ThingSpeak platform.
arduino cloud data data-collection esp32 openweather openweathermap thingspeak wi-fi
Last synced: 12 Apr 2026
https://github.com/svelterun/store
Persisted version of svelte/store.
data state state-management store svelte svelte-store sveltekit svelterun typescript
Last synced: 08 Jan 2026
https://github.com/cleanzr/restaurant
Restaurant data set for entity resolution
Last synced: 11 Mar 2026
https://github.com/fiskeben/meetjescraper
HTTP proxy for Meet je stad project
api data go iot meetjestad proxy scraper weather
Last synced: 29 May 2026
https://github.com/grycap/cdmi-client-go
A basic Go library to perform CDMI core operations
Last synced: 21 Jan 2026
https://github.com/codeforafrica/ckanext-followy
[ARCHIVED] A CKAN extension to show the datasets a user is following.
ckan ckan-extension ckanext-followy data dataset followy-extension open-data
Last synced: 16 Mar 2025
https://github.com/e-panourgia/data-science-projects
Data Science Projects
annotations augmentation data data-preprocessing-and-cleaning hyperparameter-tuning llm logistic-regression nlp random-forest-classifier xboost-classifier
Last synced: 09 Apr 2025
https://github.com/bolajiolayinka/graph-api-automation
An End to End Automation from Facebook Business to Data Visualization of Campaigns
Last synced: 07 May 2025
https://github.com/desininja/data-engineer-interview-questions
This repository contains all the Data Engineer Interview Questions asked by interviewers.
data data-engineer-interview-questions
Last synced: 31 Mar 2025
https://github.com/devlive-community/mockaroo
一个轻量级的 HTTP Mock 服务器,用于快速构建模拟数据接口,适用于前后端开发和接口测试场景。
Last synced: 08 Jul 2025
https://github.com/sefakcmn00/tensorflow_car_price_analysis
In this project, after extracting the data sets as csv, we tried to represent the car prices graphically and schematically by using data analysis and data visualization methods. We checked the connection of the car prices we analyzed with other data, then we created a 4-layer and 12-neuron system.
data datatrain keras machine-learning matplotlib-pyplot pandas seaborn sklearn tensorflow
Last synced: 14 Apr 2026
https://github.com/alexscigalszky/palabras-aleatorias-data
This package have a set of datasets of random words, animals, colors, jokes, onomatopeias and types
aleatorias data palabras random words
Last synced: 04 Oct 2025
https://github.com/san089/black-friday-sales-analysis
This Project gives an insight into few statistics related to black Friday Sale.
custom data dataanalysis insights sales statistics
Last synced: 13 Jul 2025
https://github.com/nikhilash45/live_ipl_report
This repository hosts the source code for an interactive IPL (Indian Premier League) Dashboard built using PowerBI. The dashboard provides real-time updates on ongoing matches, including live scores, batting and bowling statistics for both teams, and the points table.
analysts cleaning-data cricket-data dashboard data data-analysis data-visualization dax powerbi
Last synced: 19 Mar 2026
https://github.com/stdlib-js/ndarray-base
Base ndarray.
array base buffer data javascript matrix multidimensional namespace ndarray node node-js nodejs ns stdlib structures types vector
Last synced: 09 Apr 2025
https://github.com/stdlib-js/array-zero-to-like
Generate a linearly spaced numeric array whose elements increment by 1 starting from zero and having the same length and data type as a provided input array.
array data float32array float64array int16array int32array javascript matrix ndarray node node-js nodejs stdlib structure typed typed-array types uint32array vector
Last synced: 07 Jan 2026
https://github.com/varbrad/mindb
🗄 🔍 ⚡️ Schema-less document-oriented collection model data-store for Node & Browsers.
browser data datastore db document javascript json-schema mongo mongodb nodejs nosql query schema
Last synced: 13 Apr 2026
https://github.com/dbriane208/omdena-apprenticeship-project
This is part of my contribution to the Omdena apprenticeship program .
data data-science feature-engineering machine-learning
Last synced: 14 Mar 2026
https://github.com/spiceai/datasets
Spice AI curated dataset definitions for Spice.ai
ai bitcoin blockchain data ethereum polygon
Last synced: 20 Apr 2026
https://github.com/garcane/global-shipping-analytics-dashboard
This Tableau project provides a comprehensive visual analysis of global sales, shipping costs, and quality metrics across different regions and countries.
data data-analysis data-analyst data-visualization metrics tableau
Last synced: 01 Mar 2026
https://github.com/vagnerbellacosa/029_analisededadoscompythonpandas
Neste Labs será apresentada a biblioteca Pandas, uma biblioteca Python de código aberto para análise de dados. Ela dá ao Python a capacidade de trabalhar com dados do tipo planilha, permitindo carregar, manipular e combinar dados rapidamente, entre outras funções. Python
data digital-innovation-one dio jupiter-notebook labs ms-excel panda python
Last synced: 14 May 2026
https://github.com/gagolews/clustering-results-v1
A framework for benchmarking clustering algorithms – Benchmark results (for version 1 of the Suite)
benchmark benchmark-datasets clustering data dataset datasets machine-learning
Last synced: 16 Mar 2025
https://github.com/programmer-rd-ai/moviedatascraper
Explore the cinematic universe with our IMDb web scraping project! Dive into movie data with ease, uncovering insights from cast to critical reviews. With dynamic visualizations and reliable data, let's journey through the world of movies like never before. Lights, camera, analysis!
beautifulsoup beautifulsoup4 data data-analysis jupyter-notebook matplotlib numpy pandas programming python python3 scraping seaborn software web
Last synced: 01 Mar 2025
https://github.com/bunnysunny24/bluepulse
A Smart Water Management System
data data-processing data-visualization firebase iot machine-learning mysql-database reactjs
Last synced: 17 Mar 2025
https://github.com/stdlib-js/array-base-fancy-slice-assign
Assign element values from a broadcasted input array to corresponding elements in an output array.
array assign assignment copy data fancy generic javascript node node-js nodejs shallow slice stdlib structure subseq subsequence types
Last synced: 06 Oct 2025
https://github.com/igorwastaken/math-problems
Solve math problems easily with this utility library.
algorithm area data demography geography javascript math npm package population school typescript util utils
Last synced: 23 Feb 2026
https://github.com/tushar2704/insurance-cross-sell
This project harnesses the power of cutting-edge technologies including H2O AutoML, MLflow, FastAPI, and Streamlit to enhance cross-selling campaigns and boost efficiency.
data datascience h20automl machine-learning mlflow python streamlit-tushar2704
Last synced: 08 Oct 2025
https://github.com/jakakokosar/bioinformatics-serverfiles
Knowledge base for Orange3-bioinformatics add-on
bioinformatics data dictybase gene genesets go homologene markergenes ncbi serverfiles
Last synced: 16 Apr 2026
https://github.com/varun-khorgade/sentimentscope-e-commerce-review-analyzer
Analyzed customer reviews and purchase data to extract sentiment and behavioral insights. Built SQL-based ETL for data preparation and visualized results using Python and Power BI dashboards for actionable business decisions.
analytics customer-beheviour dashboard data data-visualization dataextraction natural-language-processing nlp pandas powerbi python sentiment-analysis sql textblob
Last synced: 17 Apr 2026
https://github.com/yessasvini23/accenture_-social-buzz-data-analytics-virtual-programme-forage
Accenture Data Analytics and Visualization - Virtual Internship
accenture content data dataanalytics excel forge socialbuzz
Last synced: 18 Jan 2026
https://github.com/strata/data
Tools to help you read data from a range of different data providers.
Last synced: 27 Jan 2026
https://github.com/saroshfarhan/kaggle-playground-s4e12
Kaggle competition first attempt
analytics data data-analysis-python data-science
Last synced: 12 Oct 2025
https://github.com/rohancyberops/r-language
R Language Projects directory. This repository contains various projects, scripts, and experiments developed using R, a powerful statistical computing and data visualization language.
caret cran data dplyr ggplot2 rlanguage rstudio shiny tidyverse
Last synced: 12 Oct 2025
https://github.com/anobaka/insidecollector
这是一个介于Excel和纯记录工具之间的软件,您可以自由创建各种列表,然后将其以各种规则关联起来,并且可以创建自定义视图帮助您更好地理解数据。
collection data excel-like list list-manager table
Last synced: 19 Jan 2026
https://github.com/tberey/social-stocks
A Graphical Data and Analysis Tool
data data-analysis data-science data-stream data-visualization database javascript mysql mysql-database node nodejs rest rest-api social-stocks stock-market stocks ticker-data tickers trends typescript
Last synced: 21 Jan 2026
https://github.com/twistezo/ts-dto-mapper
DTO (Data Transfer Object) to Object Model transformer
data dto map mapper model object transfer transform transformer typescript
Last synced: 05 Feb 2026
https://github.com/athul64/powerbi
Financial Reports Dashboard This repository showcases a Financial Reporting Dashboard that visualizes key financial metrics and performance insights. The dashboard contains Monthly and Annual reports, allowing users to switch between the two views to analyze data at different intervals.
data data-an data-visualization dax dax-expression powerbi
Last synced: 23 Feb 2026
https://github.com/lisakey/datacamp-data-analyst-python-sql-projects
Several projects completed during my Data Analyst 📊 training on the DataCamp platform with Python 🐍 and SQL 🗃️. Each project addresses real-world challenges using modern analytical tools and techniques.
analysis cleaning-data data dataanalysis dataanalyst matplotlib pandas python seaborn sql transformation visuali
Last synced: 19 Apr 2026
https://github.com/yeshunit/walmart-product-customer-sales-sql-analysis
This project aims to explore the Walmart Sales data to understand top performing branches and products, sales trend of of different products, customer behaviour. The aims is to study how sales strategies can be improved and optimized. The dataset was obtained from the Kaggle
data database mysql sql walmart
Last synced: 24 Feb 2026
https://github.com/morphaxthedeveloper/yokatlas-dataset-2025
yök atlas detaylı üniversite, bölüm, puan vb. datası..
data database liste scrape universite veri yok-atlas yok-atlas-api yok-atlas-data yokatlas yokatlas-crawler yokatlas-data
Last synced: 14 Oct 2025
https://github.com/souvik09-tech/adventure-works-kpi-dashboard
This repository contains a complete Business Intelligence solution for AdventureWorks, a global manufacturing company specializing in cycling equipment and accessories. Built using Power BI Desktop, this project helps track KPIs, analyze product performance, compare regional data, and identify high-value customers.
analysis data kpi powerbi visualization
Last synced: 27 Jan 2026
https://github.com/kledenai/jsonweaver
A powerful and easy-to-use library for transforming JSON data into popular formats such as CSV, XML, Markdown tables, YAML, and JSONLines (NDJSON).
csv data data-transform format json jsonlines jsonweaver markdown markdown-tables xml yaml
Last synced: 24 Feb 2026
https://github.com/nxank4/loclean
⚡️ The All-in-One Local AI Data Cleaning Library. No GPU or API keys required.
automated-cleaning data data-cleaning data-engineering data-preprocessing data-science data-wrangling etl llm normalization open-source polars privacy-preserving python semantic-analysis slm structured-data
Last synced: 22 Jan 2026
https://github.com/potreic/etl-fashion-trend-analysis
✨ Automate fashion trend analysis with Apache Airflow! Extract data from X & Pinterest, transform into insights, and load into PostgreSQL. Predict seasonal styles & visualize trends. 💃📊
airflow airflow-dags data data-engineering etl etl-automation etl-pipeline fashion-trends
Last synced: 27 Jan 2026
https://github.com/data-forge-notebook/javascript-cheat-sheet
Cheat sheet that accompanies my book Data Wrangling with JavaScript
cheatsheet data data-wrangling javascript nodejs
Last synced: 15 Apr 2026
https://github.com/gematik/poc-isik-patient-merge
The repository contains a proof of concept (POC). The POC demonstrates how a FHIR subscription can be used to inform about happened merges within the ISIK context.
Last synced: 19 Oct 2025
https://github.com/divithraju/divith-aju-hadoop-pyspark-pipeline
This project demonstrates the creation of a scalable data processing pipeline for handling and analyzing log data from a hypothetical e-commerce platform. Leveraging Hadoop and PySpark, the pipeline is designed to process large volumes of log files, providing meaningful insights into user behavior, system performance, and sales metrics.
apache-hadoop-framework apache-spark bigdata client data database dataengineering dataingestionframework datapreprocessing documentation ecommerce-platform hdfs pipeline project project-repository pyspark python3 software-engineering
Last synced: 27 Jan 2026
https://github.com/marcelo-earth/H5N8-Data
🔢🦠 Confirmed cases of H5N8 in humans - Feel free to open Pull Requests with new data.
csv data h5n8 h5n8-cases h5n8-virus russia
Last synced: 20 Oct 2025
https://github.com/sksubhadeep/airbnb-dashboard-tableau
Airbnb Dashboard Using Tableau
airbnb data tableau tableau-public visualization
Last synced: 19 Mar 2026
https://github.com/mustika-putri-m/analysis-of-sales-transactions-in-an-online-shop---london
Crucial Question 1. How was the sales trend over the months? 2. What are the most frequently purchased products? 3. How many products does the customer purchase in each transaction? 4. What are the most profitable segment customers? 5. Based on your findings, what strategy could you recommend to the business to gain more profit?
data data-analysis-python data-analytics data-visualization ecommerce
Last synced: 24 Oct 2025
https://github.com/imahdimir/githubdata
A very simple Python package to easily download from and manage a GitHub "Data Repository"
data data-repository python-package
Last synced: 23 Jan 2026
https://github.com/2kabhishek/pyramen
Data Analysis for Ramen 🍜💹
csv data data-analysis fun python report
Last synced: 26 Oct 2025
https://github.com/rnabla/cuda-des
Bruteforcing DES using CUDA
bruteforce cuda data des encryption gpu parallel standard
Last synced: 27 Oct 2025
https://github.com/patrikmasiar/algorythm-of-the-night
Awesome list of algorithms that help you 🚀 Feel free to contribute 👨🏻💻
algorithms data interview-questions logic logic-programming math mathematics science
Last synced: 27 Oct 2025
https://github.com/itu-helper/data-updater
Periodically scrapes data related to ITU to be used by anyone. This data powers the ITU Helper web sites.
data istanbul-technical-university scraper selenium-python
Last synced: 29 Jan 2026
https://github.com/stdlib-js/ndarray-base-assert-is-real-data-type
Test if an input value is a supported ndarray real-valued data type.
array assert base check data dtype is javascript multidimensional ndarray node node-js nodejs stdlib test types util utilities utility utils
Last synced: 31 Jan 2026
https://github.com/jhpoelen/rats
self-replicating data publication related to rat (Rattus sp.) specimen.
biodiversity data natural-history-collections provenance
Last synced: 18 Mar 2026
https://github.com/scottleechua/data
Public datasets under CC-BY-4.0 license.
Last synced: 18 Mar 2026
https://github.com/sakshisrivastava-2601/credit-card-fraud-detection
Credit Card Fraud Detection Project Using Machine Learning. This project focuses on leveraging advanced Machine learning techniques to identify fraudulent transactions with high accuracy.
advanced-machine data machine-learning numpy project-repository python pytorch random-forest
Last synced: 16 Apr 2026
https://github.com/colour-science/colour-checker-detection-tests-datasets
Colour - Checker Detection - Tests Datasets
color color-checker color-science color-space color-spaces colorspace colorspaces colour colour-checker colour-science colour-space colour-spaces colourspace colourspaces data dataset datasets raw
Last synced: 19 Mar 2026
https://github.com/garcane/beverage-sales-analytics
This project provides an in-depth analysis of beverage sales and delivery across different states using Power BI.
data data-visualization powerbi powerbi-report powerbi-visuals
Last synced: 19 Mar 2026
https://github.com/nikhilash45/power-bi-vsualisation-of-joins
In This Power Bi Report User Can Visualis Join By Themselves , and it is easy to understand joins now.
business-analytics business-intelligence data data-analysis data-visualization joins powerbi sql visualization
Last synced: 19 Mar 2026
https://github.com/colour-science/colour-checker-detection-examples-datasets
Colour - Checker Detection - Examples Datasets
color color-checker color-science color-space color-spaces colorspace colorspaces colour colour-checker colour-science colour-space colour-spaces colourspace colourspaces data dataset datasets raw
Last synced: 19 Mar 2026
https://github.com/stdlib-js/array-base-none-by-right
Test whether all elements in an array fail a test implemented by a predicate function, iterating from right to left.
all array data every generic javascript node node-js nodejs none predicate stdlib structure test types validate
Last synced: 01 Mar 2026
https://github.com/docusign/extension-app-data-io-reference-implementation
Extension App for Data IO Reference Implementation for the Docusign IAM Platform
Last synced: 02 Mar 2026
https://github.com/stdlib-js/utils-fifo
First-in-first-out (FIFO) queue.
collection data data-structure data-structures fifo first-in first-out javascript node node-js nodejs queue stdlib structure util utilities utility utils
Last synced: 16 Apr 2026
https://github.com/gallo13/neuralnetworks-deeplearning-stats-classification
Descriptive Statistics, Classification and Analysis Using Python & Python Libraries (Assignment 1)
analysis data datasets deep-learning jupyter-notebook matplotlib neural-networks numpy pandas plotting python seaborn
Last synced: 17 Apr 2026
https://github.com/csheldonhess/reporting-on-congress
What has Congress passed and not passed, lately?
civic-data congress data government government-data propublica propublica-congress-api
Last synced: 20 Apr 2026
https://github.com/garciparedes/r-examples
Set of awesome R Examples
data data-science garciparedes r statistics university-of-valladolid
Last synced: 20 Apr 2026
https://github.com/mishra-krishna/analysis-and-optimization-of-supply-chain-operations
Analyzed supply chain data to identify trends and key factors. Visualized sales, defect rates, lead times, and costs. Used Decision Tree Regressor to find top features impacting product costs and lead times.
data dataanalytics datavisualization supplychain supplychainanalytics
Last synced: 20 Apr 2026
https://github.com/jinsyin/dataorigin
数据之源 | A data source management framework
Last synced: 21 Apr 2026
https://github.com/stefen-taime/myubereats_datapipeline
Building a Modern Uber Eats Data Pipeline
airflow api data datawarehouse mongodb pipeline powerbi snowflake
Last synced: 22 Apr 2026
https://github.com/sebastianbrzustowicz/collision-detection-ai
Python + TensorFlow. Repository for training a machine learning model for collision detection with an accelerometer sensor data and TensorFlow.
accelerometer accelerometer-data ai artificial-intelligence data dataset imu learning machine-learning microprocessor ml model quadcopter script sensor tensorflow
Last synced: 24 Apr 2026
https://github.com/yord/klp-core
A plugin with basic operations for klp (Kelpie), the small, fast, and magical command-line data processor.
csv data deserializer dsv json kelpie klp marshaller parser serializer ssv tsv
Last synced: 24 Apr 2026
https://github.com/zalweny26/open_data_unipa
Progetto per l'esame di Laboratorio di Algoritmi 23-24, UniPa, Informatica L-31
Last synced: 26 Apr 2026
https://github.com/sap-samples/security-research-codegraphsmote
Data augmentation strategy that can be applied to code graphs for learning-based vulnerability discovery.
augmentation data detection learning machine research sample security vulnerability
Last synced: 07 Jun 2026
https://github.com/karthikmprakash/github_repos_scraper
A tool to extract names of github repos of any user
automation bs4 data github python repositories requests webscraping
Last synced: 27 Apr 2026
https://github.com/the-aerospace-corporation/pivt
PIVT is an analytics tool to help software development teams visualize the life cycle and behavior of their software factory.
analytics dashboards data devops jenkins pipeline python splunk visualization
Last synced: 29 Apr 2026
https://github.com/chompfoods/sdk-php
PHP SDK for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.
api branded chomp data database food grocery ingredients php raw recipe-api recipes sdk
Last synced: 30 Apr 2026
https://github.com/alrza2003/alrza2003.github.io
This repository contains the source files for my personal portfolio website. It highlights my background as a data analyst and radiology student, and showcases real-world projects, tools I use, and ways to connect with me. The site is based on a pre-built template that I customized to reflect my profile and experience.
data data-analysis data-visualization portfolio portfolio-website python
Last synced: 30 Apr 2026
https://github.com/leomsgit/extrator-de-parametros-analise-hemograma-e-bioquimico
Software em Python para varrer arquivos PDF e extrair parâmetros diretamente para arquivo Excel
analysis data excel excel-export google-colab hemogram jupyter-notebook pdf pdf-document-processor pdf-viewer python python3
Last synced: 01 May 2026
https://github.com/divanny/academixbackend
🧑🎓 Academix is a comprehensive academic management system designed to streamline and enhance the educational experience for both students and professors. This repository contains the backend codebase for the Academix system, responsible for handling data processing, authentication, and API endpoints.
backend csharp data net webapi
Last synced: 07 Jun 2026
https://github.com/gdhhgnbnvbn/f1-2025-ai-predict
fully generated by claude 3.5 sonnet via Windsurf IDE. Not a single lines wrote.
agent-based-modeling claude csv data f1 gpt machine-learning model prediction predictive-modeling python rainforest streamlit vibe
Last synced: 01 May 2026
https://github.com/ggeop/multiple-fields-management
Fields management from/to different data sources. :bulb:
data data-engineering data-organization data-retrieval data-science pandas python
Last synced: 01 May 2026
https://github.com/y-india/project-road-accident-severity-prediction-system
see README below , please.
application data data-analysis data-classification data-cleaning data-science data-visualization data-visualization-project machine-learning ml pandas project real-world-problem-solving real-world-project road-project streamlit-webapp
Last synced: 02 May 2026