data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-07-03 00:07:49 UTC
- JSON Representation
https://github.com/mohsinali08000/myportfolio
I’m Mohsin Ali, a passionate software engineer with over 2 years of experience in developing robust software solutions. Currently transitioning into the field of data science.
Last synced: 22 Apr 2026
https://github.com/oefenweb/python-untraceables
Randomizes IDs for a given set of tables making them untraceable across environments
anonymize data database mysql privacy python python2 python3 randomization
Last synced: 03 Feb 2026
https://github.com/antononcube/raku-data-cryptocurrencies
Raku package of cryptocurrency data retrieval.
Last synced: 02 Apr 2025
https://github.com/ishanoshada/matplot3dex
A Matplotlib 3D Extension package for enhanced data visualization
data data-science matplotlib python-packages scikit-learn
Last synced: 05 Jan 2026
https://github.com/mbolam/DSWS_OpenRefine
Cleaning and Linking Data with OpenRefine
cleaning data metadata openrefine
Last synced: 07 Apr 2025
https://github.com/emnetdegafe/allesoverfilm-backend
AllesOverFilm-backend is part of the AllesOverFilm mobile app development project and contains the database structure, server query scripts, and Sequelize-cli database structures.
backend data data-model express postgresql sequelize-cli
Last synced: 11 Apr 2026
https://github.com/cosmos-loops/cosmos-dapper
Cosmos.Dapper is a part of Cosmos.Data, a inline project of COSMOS LOOPS PROGRAMME. This repository provides a package of StackExchange.Dapper to improve development efficiency.
dapper data mysql mysqlconnector oracle postgresql sql-query sqlite sqlkata sqlserver
Last synced: 11 Apr 2026
https://github.com/idea2app/public-meta-data
HTTP API for Public Meta Data, written in TypeScript & designed for CDN.
api cdn data http meta public typescript
Last synced: 15 Mar 2025
https://github.com/tsvikas/covid-19-israel-data
Unofficial Github with the data published by The Israel Ministry of Health, regarding The Coronavirus disease
coronavirus-disease covid-19 csv daily-reports data health israel
Last synced: 05 Jan 2026
https://github.com/humbertocg18/pucrs-alest-i-2.3-2023.24
Trabalhos, Projetos, Exercícios e aulas realizados em Java na cadeira de Algoritimos e estrutura de dados 1, matéria do segundo semestre.
beecrowd beecrowd-solution-in-js beecrowd-solutions-in-java data data-structures datastructures-algorithms hashmap hashtable java-8 leetcode leetcode-javascript leetcode-solutions leetcodepra pucrs sorting-algorithms
Last synced: 29 Mar 2025
https://github.com/mtingers/opacify
Opacify reads a file and builds a manifest of external sources to rebuild said file.
backup data obfuscation python
Last synced: 18 May 2026
https://github.com/andygeiss/pipeline
Build your own data pipeline to gather, organize and transform data by using protobuf as an intermediate format.
data data-pipeline data-science go golang machine-learning protobuf protobuf-compiler
Last synced: 31 Mar 2025
https://github.com/serhatderya/tabular-playground-series
This repository contains solutions of monthly Tabular Playground Series in Kaggle.
ai artificial-intelligence data data-preprocessing data-processing data-science data-visualization jupyter-notebook kaggle machine-learning numpy pandas python regression scikit-learn scikitlearn-machine-learning seaborn software statsmodels
Last synced: 11 Apr 2026
https://github.com/hamzacham/data_set_projet-4
analysis analytics data data-science datawarehouse sas sql sql-server
Last synced: 24 Mar 2025
https://github.com/benmaier/boarding_school_sir
Fit SIR dynamics to the prevalence curve of an H1N1 outbreak of a British boarding school in 1978.
boarding data disease epidemiology modeling school spreading
Last synced: 31 Mar 2025
https://github.com/richardschoen/sshnetibmi
This .Net/.Net Core class library is used to interface with existing IBM i database, program calls, CL commands, service programs and data queues via the PASE based xmlservice-cli PASE command program or regular qsh/bash commands. qsh/bash commands can be used to interface with any qsh/pase based utilities such as the IBM i db2util utility
as400 cl command csharp data db2 ddm dotnet drda ibm ibmi os400 pase program qcmdexc qcmdexec queue rpg xmlservice xmlservice-cli
Last synced: 04 Feb 2026
https://github.com/flowsynx/plugin-json
FlowSynx plugin to loads and parses local JSON files. Supports transformation, extraction, and mapping of hierarchical data structures in workflows.
data data-platform flowsynx json
Last synced: 10 Mar 2026
https://github.com/spectrochempy/spectrochempy_data
Test and examples data repository for SpectroChemPy
Last synced: 04 Apr 2025
https://github.com/katiesaund/dresden_maps
Contains a data file with locations from The Dresden Files. The data file is to be used for my map tutorial in R.
Last synced: 05 Jan 2026
https://github.com/dev-owdenmag/dataflow-manager
A dynamic and versatile web application for managing, collecting, and presenting data with an integrated printing feature.
data data-management data-management-platform data-visualization python
Last synced: 30 Mar 2025
https://github.com/mftnakrsu/crm-rfm-analysis
CRM-RFM-Analysis
ai crm data data-analysis data-science deep-learning machine-learning python rfm rfm-analysis
Last synced: 16 Mar 2025
https://github.com/rayenfathallah/students_analysis
This projects contains an analysis of the different fadtors affecting students performance in their final exams. The project uses D3.js to create interactive dashboards that are compelling and easy to interpret.
analysis d3 data education javascript python students
Last synced: 12 Apr 2026
https://github.com/metriccoders/metriccoders_datasets
This is the Metric Coders repository containing all the datasets for machine learning.
data datasets machine-learning natural-language-processing scikit-learn
Last synced: 08 Apr 2025
https://github.com/dwidevelopes/database-input-pelanggran-mahasiswa
Menginput data Mahasiswa Yang Melakukan Pelanggran yang siap di data dan di hukum Dan juga siap Terkena Sanksi
aplikasi aplikasi-sekolah data data-analysis database input-method mahasiswa sekolah siswa siswi website
Last synced: 02 May 2026
https://github.com/gher-uliege/bluecloud-plankton
Spatial interpolation of plankton data using a neural network
data data-analysis data-visualization neural-network oceanography
Last synced: 30 Mar 2025
https://github.com/open-i18n/data-unicode-math
Git mirror for Unicode Support for Mathematics data
data i18n internationalization math mathematics open-i18n unicode unicode-consortium unicode-data
Last synced: 11 Mar 2026
https://github.com/GiveMePseudonyms/PiVisualisations
A way to visualise millions of digits of Pi. Written in Python using Pygame and Tkinter.
data data-visualization pi pygame python self-organising-criticality tkinter
Last synced: 08 Apr 2025
https://github.com/stefanbohacek/exploring-the-mapping-police-violence-dataset
Using my Gutenberg Data Visualization plugin to explore police violence against civilians.
data dataviz police police-brutality police-misconduct
Last synced: 03 Dec 2025
https://github.com/bijx/firestore-data-fetcher
A simple Python script to fetch documents from a Firebase Firestore collection and save them to a local `.json` file.
automation data database downloader exporter fetcher firebase firestore open-source script
Last synced: 12 Apr 2026
https://github.com/cqllum/schema2dwh
⚡ Automatically produce a data model on your database using its information schema using GenAI.
ai data data-structures dataengineering datawarehousing dwh gemini gemini-api genai reporting reporting-tool schema-design
Last synced: 13 Mar 2025
https://github.com/toransahu/excel-implementation-of-regression-clustering
B.Tech. Major Project
btech-project-proposal clustering data kmeans-clustering machine-learning mining regression
Last synced: 25 Mar 2025
https://github.com/nik-kusanagi/bash.sh-treinamento
Versão mais organizada (+ ou -)
data database debian gnome gnome-extension gnu gnu-linux linux shell shell-script
Last synced: 05 May 2026
https://github.com/castelao/bufr
BUFR binary data format from WMO
binary data format meteorology oceanography wmo
Last synced: 13 Jul 2025
https://github.com/shivam1808/data-cleaning-project
We take raw housing data and transform it in SQL Server to make it more usable for analysis.
analysis data datacleaning sql sqlserver
Last synced: 29 May 2026
https://github.com/ntia/compound_radar_waveforms-data
Data used by NTIA/ITS TR-23-566 Examining the Effects of Resolution Bandwidth when Measuring Compound Radar Waveforms.
bandwidth data measurement p0n q3n radar resolution stepped waveform
Last synced: 27 Jan 2026
https://github.com/fredhutch/gdscnsoilsites
Homepage for BioDIGS Project. Learn about the project and download data.
biodigs data metagenomics student-research
Last synced: 25 Mar 2025
https://github.com/agahkarakuzu/datavis_edu
Presented in BrainHack School 2019-2020, QBIN SciComm 2021
binder dashboard data notebooks repo2docker visualization
Last synced: 01 Apr 2025
https://github.com/datenoio/internacia-db
Public registry of the intergovernmental organizations, country groups and countries. Available as JSONl, Parquet, YAML and DuckDB database datasets
countries data datasets international international-trade reference
Last synced: 29 May 2026
https://github.com/lmuffato/project-ting-trybe
Projeto ting - Projeto avaliativo da Trybe do Bloco 37: Estrutura de Dados II: Listas, Filas e Pilhas
data data-analysis python queue read-file stack trybe trybe-projects
Last synced: 12 Jun 2025
https://github.com/lmuffato/project-job-insights-trybe
Projeto job insights - Projeto avaliativo da Trybe do Bloco 32: Introdução à Python
data data-science data-transformation filter python
Last synced: 12 Jun 2025
https://github.com/azrunguraya/kabyle-corpus-dataset
Dans l'univers du Traitement Automatique des Langues , l'accès à des datasets diversifiés et bien annotés est essentiel pour développer des modèles performants. Ce projet vise à combler cette lacune spécifique pour la langue taqbaylit, une langue berbère parlée principalement en Kabylie
ber berber berber-dataset corpus data dataset ia kabyle kabyle-art kb machine-learning nlp nlp-machine-learning python taqbaylit text words
Last synced: 31 Jul 2025
https://github.com/toransahu/metoffice
Data visualisation - MetOffice
data metoffice uk visualization weather
Last synced: 25 Mar 2025
https://github.com/gbv/cocoda-mappings
concordances, mappings and conversion scripts to create JSKOS mappings
Last synced: 28 Oct 2025
https://github.com/lane-romuald/iot-irrigation-data-collection-system
An IoT-based data collection system using the ESP32 microcontroller programmed with Arduino to monitor environmental conditions for smart irrigation. The system measures soil moisture, temperature, air temperature, humidity, and rain probability. Data is stored locally on an SD card and uploaded to the ThingSpeak platform.
arduino cloud data data-collection esp32 openweather openweathermap thingspeak wi-fi
Last synced: 12 Apr 2026
https://github.com/osiota10/alx-low_level_programming
C Low Level Programming - Data Structures, Linux/Unix System Programming and Algorithms with ALX Software Engineering
algorithms assembly c data data-structures linux shell unix
Last synced: 25 Jun 2025
https://github.com/edugmenes/azure-data-engineering
This repository contains my first end-to-end Data Engineering project, built using Microsoft Azure Cloud and Azure Databricks with PySpark.
azure cloud data data-engineering data-lakehouse data-structures databricks delta-lake etl-pipelines lakehouse lakehouse-architectures medallion-architecture microsoft-azure pyspark spark
Last synced: 29 Jan 2026
https://github.com/themost-framework/jspa
JavaScript Persistent API
api data database-schema jspa object-relational-mapping orm orm-framework
Last synced: 31 Aug 2025
https://github.com/vapourismo/binary-io
Read and write values of types that implement Binary from and to Handles
data haskell haskell-library io parsing
Last synced: 28 Mar 2025
https://github.com/quasilyte/phpcorpus
A collection of various PHP code; useful for PHP tools writers to get some insights on how "real-world" PHP code looks like
analysis corpus data php php-corpus
Last synced: 04 Jul 2025
https://github.com/vvipjain/bike-sales-dashboard
Bike Sales Dashboard
dashboards data data-analysis data-cleaning data-normalisation data-visualization excel pivot-chart pivot-tables
Last synced: 04 Feb 2026
https://github.com/e-panourgia/data-science-projects
Data Science Projects
annotations augmentation data data-preprocessing-and-cleaning hyperparameter-tuning llm logistic-regression nlp random-forest-classifier xboost-classifier
Last synced: 09 Apr 2025
https://github.com/avahoffman/dataplay
🤸♂️ Load data to play with
data data-package r r-package rstats
Last synced: 25 Mar 2025
https://github.com/stdlib-js/ndarray-base-assert-is-complex-floating-point-data-type
Test if an input value is a supported ndarray complex-valued floating-point data type.
array assert base check data dtype is javascript multidimensional ndarray node node-js nodejs stdlib test types util utilities utility utils
Last synced: 08 Mar 2026
https://github.com/melinteflxrin/softserve-bigdata-project
End-to-end data warehousing project integrating APIs, ETL workflows, and PostgreSQL for analytics and reporting.
analytics api bigdata data datawarehousing externalapi pipeline postgres postgresql python warehouse
Last synced: 26 Jan 2026
https://github.com/tether/tether-schema
Custom protocol buffer schema for data validation
data protocol schema validation
Last synced: 09 Apr 2025
https://github.com/whitehathackerpr/data-visualization-tool
This is a Python-based web application that allows users to upload datasets, analyze data, and create visualizations interactively. The tool is designed for ease of use and provides a simple interface to perform basic data analysis and generate visualizations
data data-analysis data-visualization python python3
Last synced: 05 Sep 2025
https://github.com/xpotify/scraper
Scraper designed for Xpotify's client to gather information from websites🌟
axios cheerio data javascript scraper webscraper
Last synced: 07 Jul 2025
https://github.com/cainmi/data-page-project
A repository to pull code and files from, may be used to store page data links, code etc. mainly used for python for now
data html javascript python schema
Last synced: 21 Oct 2025
https://github.com/desininja/data-engineer-interview-questions
This repository contains all the Data Engineer Interview Questions asked by interviewers.
data data-engineer-interview-questions
Last synced: 31 Mar 2025
https://github.com/bredalis/datastructure
📚 Estructuras de Datos en Python
algorithms data data-structure python
Last synced: 12 Apr 2026
https://github.com/gkapfham/ast2016-paper
Source Code of and Supporting Files for a Paper Published at AST 2016
data latex-document paper research
Last synced: 19 Oct 2025
https://github.com/nodef/infoods
Kit for International Network of Food Data Systems (INFOODS).
component data food identifier infoods international network systems tagnames
Last synced: 11 Mar 2026
https://github.com/danreynolds/data_batcher
Data batcher batches and de-dupes data fetched in the same task of the event loop.
batching data flutter hacktoberfest
Last synced: 19 May 2026
https://github.com/harmanveer-2546/supply-chain
Supply chain analytics is a valuable part of data-driven decision-making in various industries such as manufacturing, retail, healthcare, and logistics. It is the process of collecting, analyzing and interpreting data related to the movement of products and services from suppliers to customers.
customer-segmentation-analysis data data-analysis data-cleaning data-insights ggplot2 numpy pandas performance-evaluation predictive-analytics-for-business python risk-assessment sales-analysis statistical-analysis supply-chain tidyverse trend-analysis
Last synced: 10 Apr 2026
https://github.com/goncaloperes/datavisualization
Here I will share some of my data visualizations using a variety of datasets, technologies and tools.
d3js data dataset datavisualization dataviz ggplot matplotlib rawgraphs seaborn tableau visualization yellowbrick
Last synced: 04 Feb 2026
https://github.com/ispyhumanfly/prowler
Query the web, extract data from the results, and transform that data into a format you can use.
ai analytics business cryptocurrency data extract-data machine-learning mining scraping web
Last synced: 06 Sep 2025
https://github.com/tylerben/data-spring
Easily generate a dummy dataset based on a provided config
data data-spring datagenerator fake-data generator javascript typescript
Last synced: 27 May 2026
https://github.com/stdlib-js/array-zero-to
Generate a linearly spaced numeric array whose elements increment by 1 starting from zero.
array data float32array float64array int16array int32array javascript matrix ndarray node node-js nodejs stdlib structure typed typed-array types uint32array vector
Last synced: 08 Jan 2026
https://github.com/luminati-io/Crunchbase-dataset-samples
A sample of 1001 Crunchbase companies with key data points, extracted using the Bright Data API.
crunchbase crunchbase-api crunchbase-scraper data database datasets webscraper-api webscraping
Last synced: 09 Apr 2025
https://github.com/luminati-io/Twitter-X-dataset-samples
A sample dataset of over 1000 Twitter (X) posts, extracted using the Bright Data API, ideal for trend discovery, brand monitoring, and competitive insights.
api data dataset twitter twitter-api twitter-scraper web-scraping x
Last synced: 09 Apr 2025
https://github.com/cintia0528/data_cleaning_and_analytics-python
Evaluate if aggressive discounting benefits Eniac long-term, considering differing views on customer acquisition and brand positioning. Focus on data cleaning for informed decision-making.
colab-notebook data data-analysis datacleaning dataquality jupyter-notebook matplotlib pandas python seaborn
Last synced: 08 Jan 2026
https://github.com/jrcichra/ingestd
HTTP server that easily ingests data into a database
data gin hacktoberfest ingest ingestion restful-api
Last synced: 28 Apr 2026
https://github.com/marabesi/d3-visualization
Different visualizations using data and d3.js
charts css d3js data html js json timeline-chart visualization
Last synced: 01 May 2026
https://github.com/rayyan9477/dep
data data-science machine-learning python visualization web-scraping
Last synced: 08 May 2026
https://github.com/ayushai/salesfoce-hospital-management
A custom Salesforce-based Hospital Management System with powerful dashboards and data analysis tools. It provides real-time insights into patient care, appointment scheduling, and inventory management, optimizing healthcare operations and decision-making.
analytics dashboard data salesforce-developers visualization
Last synced: 22 Feb 2026
https://github.com/dbriane208/omdena-apprenticeship-project
This is part of my contribution to the Omdena apprenticeship program .
data data-science feature-engineering machine-learning
Last synced: 14 Mar 2026
https://github.com/ronaldkanyepi/python-streamlit-covid-19-dashboard
This is a responsive streamlit covid 19 Dashboard
analytics data data-analysis data-visualization datascience python streamlit
Last synced: 18 May 2026
https://github.com/spiceai/datasets
Spice AI curated dataset definitions for Spice.ai
ai bitcoin blockchain data ethereum polygon
Last synced: 20 Apr 2026
https://github.com/stdlib-js/array-base-to-deduped
Copy elements to a new generic array after removing consecutive duplicated values.
array compress copy data dedupe deduplicate deduplication duplicate generic javascript node node-js nodejs stdlib structure types uniq unique
Last synced: 14 Jun 2025
https://github.com/neelravi/fairtool
A CLI tool for FAIR processing of computational materials science data.
computational data data-analytics fair management materials physics python science
Last synced: 14 Jan 2026
https://github.com/bredalis/scikitlearn
🤖 Library to create ML models 🤖
data ia learning-python librery ml python
Last synced: 30 May 2026
https://github.com/nafisalawalidris/buybuy-e-commerce-company
The BuyBuy E-commerce Company repository is a comprehensive hub for the company's e-commerce platform. It includes source code, documentation, and data analysis insights, providing a data-driven approach to improve customer experience, drive revenue, and inform decision-making.
buybuy cleaning-data company customer-experience data data-analysis decision-making documentation e-commerce excel insights postgresql repository revenue source-code sql
Last synced: 16 Mar 2025
https://github.com/vagnerbellacosa/029_analisededadoscompythonpandas
Neste Labs será apresentada a biblioteca Pandas, uma biblioteca Python de código aberto para análise de dados. Ela dá ao Python a capacidade de trabalhar com dados do tipo planilha, permitindo carregar, manipular e combinar dados rapidamente, entre outras funções. Python
data digital-innovation-one dio jupiter-notebook labs ms-excel panda python
Last synced: 14 May 2026
https://github.com/jmcanterafonseca/leaflet-context-information
A Leaflet plugin + infrastructure for getting access to Context Information (i.e. data) exposed through FIWARE NGSIv2
context data fiware information leaflet map open visualization web
Last synced: 21 Apr 2026
https://github.com/mini-ware/mini-ware
Just some very simple markdown for my GitHub profile
codewars ctf data hackthebox javascript markdown minimalistic profile-readme python readme-profile simple stattistics svg
Last synced: 13 Apr 2026
https://github.com/danielrosehill/monetised-ghg-emissions
Calculating monetised GHG emissions for various companies based upon disclosure data
data sustainability sustainability-data
Last synced: 07 Sep 2025
https://github.com/programmer-rd-ai/library-management-system-oraclesql
The Library Management System project, part of the CI6320 Advanced Data Modelling coursework, features comprehensive SQL scripts utilizing OracleSQL to facilitate efficient data modeling and management.
adm advanced ci6320 cw data icw library management modelling oracle oraclesql report sql system
Last synced: 29 Oct 2025
https://github.com/sandipbera35/blogapp.spring.boot
A proof-of-concept Project Of Blog application In Java Spring Boot, Spring Data JPA with mysql Minio Object Storage , it is an Integration with JWT authservice project(written in golang) .
data java jpa jpa-entity-manager jpa-hibernate mysql mysql-server postman postmanapi spring-boot
Last synced: 13 Apr 2026
https://github.com/ncgl-git/eriparse
Python code to parse the cost-of-living HTML from erieri.com, i.e. https://www.erieri.com/cost-of-living/united-states/illinois/chicago
cost-of-living crime crime-data data economic-research-institute erieri webscraper
Last synced: 14 Jan 2026