data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-07-01 00:07:35 UTC
- JSON Representation
https://github.com/clinton-mwachia/data-analysis-in-r
Various Analysis in R
data data-science machine-learning machine-learning-algorithms r random-forest rstats
Last synced: 30 Nov 2025
https://github.com/devlive-community/mockaroo
一个轻量级的 HTTP Mock 服务器,用于快速构建模拟数据接口,适用于前后端开发和接口测试场景。
Last synced: 08 Jul 2025
https://github.com/jigyasag18/gold-price-prediction-project-using-machine-learning
This repository contains a machine learning project focused on predicting gold prices (GLD) using historical stock market data, including indicators such as SPX, USO, SLV, and EUR/USD. The project implements a Random Forest Regressor for accurate price forecasting, complete with data visualization, correlation analysis, and model evaluation metrics
data dataset jupyter-notebook jupyter-notebooks machine-learning machinelearing machinelearningalgorithms machinelearningmodel machinelearningprojects matplotlib mlproject numpy pandas randomforestregressor seaborn
Last synced: 23 Jul 2025
https://github.com/alexscigalszky/palabras-aleatorias-data
This package have a set of datasets of random words, animals, colors, jokes, onomatopeias and types
aleatorias data palabras random words
Last synced: 04 Oct 2025
https://github.com/san089/black-friday-sales-analysis
This Project gives an insight into few statistics related to black Friday Sale.
custom data dataanalysis insights sales statistics
Last synced: 13 Jul 2025
https://github.com/stdlib-js/array-zero-to-like
Generate a linearly spaced numeric array whose elements increment by 1 starting from zero and having the same length and data type as a provided input array.
array data float32array float64array int16array int32array javascript matrix ndarray node node-js nodejs stdlib structure typed typed-array types uint32array vector
Last synced: 07 Jan 2026
https://github.com/goncaloperes/datavisualization
Here I will share some of my data visualizations using a variety of datasets, technologies and tools.
d3js data dataset datavisualization dataviz ggplot matplotlib rawgraphs seaborn tableau visualization yellowbrick
Last synced: 04 Feb 2026
https://github.com/jrcichra/ingestd
HTTP server that easily ingests data into a database
data gin hacktoberfest ingest ingestion restful-api
Last synced: 28 Apr 2026
https://github.com/marabesi/d3-visualization
Different visualizations using data and d3.js
charts css d3js data html js json timeline-chart visualization
Last synced: 01 May 2026
https://github.com/spiceai/datasets
Spice AI curated dataset definitions for Spice.ai
ai bitcoin blockchain data ethereum polygon
Last synced: 20 Apr 2026
https://github.com/vagnerbellacosa/029_analisededadoscompythonpandas
Neste Labs será apresentada a biblioteca Pandas, uma biblioteca Python de código aberto para análise de dados. Ela dá ao Python a capacidade de trabalhar com dados do tipo planilha, permitindo carregar, manipular e combinar dados rapidamente, entre outras funções. Python
data digital-innovation-one dio jupiter-notebook labs ms-excel panda python
Last synced: 14 May 2026
https://github.com/jmcanterafonseca/leaflet-context-information
A Leaflet plugin + infrastructure for getting access to Context Information (i.e. data) exposed through FIWARE NGSIv2
context data fiware information leaflet map open visualization web
Last synced: 21 Apr 2026
https://github.com/mini-ware/mini-ware
Just some very simple markdown for my GitHub profile
codewars ctf data hackthebox javascript markdown minimalistic profile-readme python readme-profile simple stattistics svg
Last synced: 13 Apr 2026
https://github.com/bunnysunny24/bluepulse
A Smart Water Management System
data data-processing data-visualization firebase iot machine-learning mysql-database reactjs
Last synced: 17 Mar 2025
https://github.com/avto-dev/static-references-data
Data for static references
Last synced: 05 Oct 2025
https://github.com/dixslyf/nbparts
Unpack a Jupyter notebook into its sources, outputs and metadata.
data haskell jupyter jupyter-notebook nix nix-flake
Last synced: 05 Oct 2025
https://github.com/jahilldev/immutable-parsejs
Parse a JS object or array/map into an Immutable collection. Makes use of ImmutableJs List, and Record primitives.
data immutablejs javascript json nodejs parse typescript
Last synced: 13 Apr 2026
https://github.com/joocer/data_expectations
Are your data meeting your expectations?
data data-engineering data-quality data-science data-unit-tests observability pipelines quality validation
Last synced: 07 Oct 2025
https://github.com/ahmad-ali-rafique/comment-generation-tool
This repository hosts a Jupyter Notebook-based Comment Generation Tool exploring advanced NLP techniques for automated, contextually relevant comment generation from input data. Ideal for developers and researchers in NLP and automated text generation.
ai aitools artificial-intelligence content-based-recommendation data datascience jupyter-notebook machine-learning
Last synced: 07 Oct 2025
https://github.com/patrickdavies100/datapipeline37
Some Data Science practice using datasets available online. Currently test data is similar to this dataset: https://www.kaggle.com/datasets/asaniczka/amazon-uk-products-dataset-2023 but the plan is to expand.
data data-science pandas-dataframe python3
Last synced: 08 Oct 2025
https://github.com/davorg/dmp
Data Munging with Perl
book data hacktoberfest munging perl
Last synced: 21 Jan 2026
https://github.com/jrmedd/emojinal
An experimental API for determining emoji sentiment, based on research from Institut "Jožef Stefan", Slovenia.
data emojis sentiment user-research ux
Last synced: 19 Jan 2026
https://github.com/rohancyberops/r-language
R Language Projects directory. This repository contains various projects, scripts, and experiments developed using R, a powerful statistical computing and data visualization language.
caret cran data dplyr ggplot2 rlanguage rstudio shiny tidyverse
Last synced: 12 Oct 2025
https://github.com/genert/metis
Asynchronous data sender library
analytics asynchronous data dependency-free typescript
Last synced: 27 Jan 2026
https://github.com/anobaka/insidecollector
这是一个介于Excel和纯记录工具之间的软件,您可以自由创建各种列表,然后将其以各种规则关联起来,并且可以创建自定义视图帮助您更好地理解数据。
collection data excel-like list list-manager table
Last synced: 19 Jan 2026
https://github.com/ompreetham/dcn-network-traffic-anomaly-detection
Data Communication Networks - Network Traffic Anomaly Detection
anomaly anomaly-detection communication data dcn keras learning machine machine-learning network pandas presentation project python scikit-learn tensorflow traffic
Last synced: 08 Apr 2026
https://github.com/morphaxthedeveloper/yokatlas-dataset-2025
yök atlas detaylı üniversite, bölüm, puan vb. datası..
data database liste scrape universite veri yok-atlas yok-atlas-api yok-atlas-data yokatlas yokatlas-crawler yokatlas-data
Last synced: 14 Oct 2025
https://github.com/souvik09-tech/adventure-works-kpi-dashboard
This repository contains a complete Business Intelligence solution for AdventureWorks, a global manufacturing company specializing in cycling equipment and accessories. Built using Power BI Desktop, this project helps track KPIs, analyze product performance, compare regional data, and identify high-value customers.
analysis data kpi powerbi visualization
Last synced: 27 Jan 2026
https://github.com/sanskaryo/ultimate-dsa-repo
One Stop Solution for DSA Learning and Resources
data data-structures-and-algorithms dsa hacktoberfest hacktoberfest-accepted hacktoberfest2025
Last synced: 15 Oct 2025
https://github.com/nxank4/loclean
⚡️ The All-in-One Local AI Data Cleaning Library. No GPU or API keys required.
automated-cleaning data data-cleaning data-engineering data-preprocessing data-science data-wrangling etl llm normalization open-source polars privacy-preserving python semantic-analysis slm structured-data
Last synced: 22 Jan 2026
https://github.com/potreic/etl-fashion-trend-analysis
✨ Automate fashion trend analysis with Apache Airflow! Extract data from X & Pinterest, transform into insights, and load into PostgreSQL. Predict seasonal styles & visualize trends. 💃📊
airflow airflow-dags data data-engineering etl etl-automation etl-pipeline fashion-trends
Last synced: 27 Jan 2026
https://github.com/data-forge-notebook/javascript-cheat-sheet
Cheat sheet that accompanies my book Data Wrangling with JavaScript
cheatsheet data data-wrangling javascript nodejs
Last synced: 15 Apr 2026
https://github.com/florianwendelborn/metatypes
Monorepo of TypeScript Metadata Definitions (e.g. HTTP Status Codes)
code-generation data datastructures enum http-status-codes jsdoc lerna metadata typescript
Last synced: 27 Jan 2026
https://github.com/rodekruis/510-data-catalog
The Project is CKAN based Data Catalog Portal for 510
Last synced: 23 Jan 2026
https://github.com/atymri/linqsimulator
LINQ Simulator is an interactive C# console application designed to let you experiment with LINQ queries in real time.
console csharp data data-analysis linq query sql
Last synced: 23 Oct 2025
https://github.com/capire/xtravels-java
Travel booking app using master data from xflights built with CAP Java
cap cds data federation flights java reuse
Last synced: 23 Jan 2026
https://github.com/cmda-tt/course-24-25
🎓 tech track · 2024-2025 · curriculum and syllabus 📊
d3 data datavis datavisualization es6 functional javascript programming svelte
Last synced: 28 Jan 2026
https://github.com/2kabhishek/pyramen
Data Analysis for Ramen 🍜💹
csv data data-analysis fun python report
Last synced: 26 Oct 2025
https://github.com/aleenprd/docbt
Documentation Build Tool - Generate YAML documentation for dbt models with optional AI assistance. Built with Streamlit for an intuitive and familiar web interface.
ai analytics-engineering bigquery data data-modeling data-science dbt docker llm lmstudio ollama openai snowflake sql streamlit
Last synced: 11 Nov 2025
https://github.com/city-of-helsinki/drupal-helfi-tyollisyyspalvelut-manuaali
Työllisyyden kuntakokeilujen palvelutietovarannon manuaali
data drupal drupal-9 unemployment
Last synced: 24 Jan 2026
https://gitlab.com/Native-Coder/d3-react-component
This is a dead-simple React component that makes D3 implementation a breeze.
chart component d3 data react vis visualization viz
Last synced: 24 Jan 2026
https://github.com/zoekelepiri/ota_observatory
A front-end web application that provides detailed information about the boundaries and statistical data of the regions and prefectures of Greece.
backend data database spring-boot
Last synced: 06 Feb 2026
https://github.com/qbicsoftware/research-data-management
Documentation about the life science research data management at QBiC
data data-management data-stewardship documentation hacktoberfest life-science management metadata rdm reasearch-data-management
Last synced: 30 Jan 2026
https://github.com/sandk21/etude_eau_potable_monde
Etude sur l'accès à l'eau dans le monde - Tableaux de bord avec Tableau
analysis data tableau tableau-public visualization
Last synced: 19 Mar 2026
https://github.com/kirkalyn13/portfolio-dashboard-site
Portfolio Site; Initially a Service Provider Metrics Dashboard using React.
dashboard data data-visualization react
Last synced: 15 Apr 2026
https://github.com/simranjeet97/quotes-analysis
Kaggle Dataset on Quotes Analysis and Visualization With Python, Pandas and MatplotLib Using Jupyter Notebook.
data data-science datavisualization jupyter-notebook kaggle kaggle-dataset machine-learning matplotlib-pyplot numpy pandas python quotes quotes-application
Last synced: 15 Apr 2026
https://github.com/stdlib-js/ndarray-base-assert-is-real-data-type
Test if an input value is a supported ndarray real-valued data type.
array assert base check data dtype is javascript multidimensional ndarray node node-js nodejs stdlib test types util utilities utility utils
Last synced: 31 Jan 2026
https://github.com/dandre3000/matrix
Matrix library
algebra array data data-structure math matrix vector
Last synced: 01 Feb 2026
https://github.com/raymondcm/strawberrydata
Tool suite for fast multi-camera strawberry data collection project. The standards document houses cross compatibility/purpose implementation details.
camera cpp data intel multi-camera
Last synced: 08 Feb 2026
https://github.com/jeanmanguy/milk-sci-fi
Census of every mention of milk in sci-fi works.
Last synced: 26 Feb 2026
https://github.com/3squared/smoulder
Smoulder is a really good data pipe
composition data facade-pattern forge-framework object-oriented
Last synced: 25 Apr 2026
https://github.com/codenoid/alodokter.com-database
a Alodokter.com Database, collected by Hofesh Bot (Scrapper)
alodokter data extraction hofesh
Last synced: 18 Mar 2026
https://github.com/jhpoelen/bats
self-documenting data publication on Bat (Chiroptera) specimen
biodiversity data natural-history-collections provenance specimen
Last synced: 18 Mar 2026
https://github.com/ewertondrigues02/engenharia-de-dados
Varios Projetos de Engenharia de Dados usando principais ferramentas como: Airflow, Snowflake, dbt, Postrgres, Looker Studio, Power BI
airflow analise-exploratoria analytics aws-ec2 dados data dbt-cloud engenharia-de-dados looker-studio postgres pyspark python3 snowflake spark
Last synced: 16 Apr 2026
https://github.com/seabbs/estzoonotictb
Explore, Visualise and Estimate the Global Zoonotic Tuberculosis Burden
bovine-tb data estimation package rstats tuberculosis visualisation zoonotic-tb
Last synced: 28 Feb 2026
https://github.com/tushard48/analyzing-usa-market-trends-a-financial-overview
In-depth analysis of US market trends, encompassing economic indicators, industry performance, and financial data
data data-visualization powerbi
Last synced: 19 Mar 2026
https://github.com/m0nica/datalogues-outdated
Programming blog focused on data with an emphasis on exploration in Python. Has been migrated from Pelican to Jekyll
data pelican pelican-blog pelican-theme
Last synced: 28 Feb 2026
https://github.com/stdlib-js/datasets-harrison-boston-house-prices-corrected
A (corrected) dataset derived from information collected by the US Census Service concerning housing in Boston, Massachusetts (1978).
boston data dataset datasets house housing javascript linear-regression node node-js nodejs prediction prices statistics stats stdlib value
Last synced: 15 Feb 2026
https://github.com/luminati-io/twitter-x-dataset-samples
A sample dataset of over 1000 Twitter (X) posts, extracted using the Bright Data API, ideal for trend discovery, brand monitoring, and competitive insights.
api data dataset twitter twitter-api twitter-scraper web-scraping x
Last synced: 19 Mar 2026
https://github.com/huseyincenik/tableau
This repository contains Tableau visualizations and related resources for my project.
analytics api bianalyst business-analytics business-intelligence business-solutions dashboard data data-analysis data-science data-structures dataanalysis dataset datavisualization drilldown interactive-visualizations tableau tableau-dashboards viz
Last synced: 19 Mar 2026
https://github.com/eugenedakin/polyalphabeticcipher
PolyAlphabetic Cipher
data decryption encryption polyalphabetic polyalphabetic-cipher polyalphabetic-crypto polyalphabetic-substitution xojo
Last synced: 19 Mar 2026
https://github.com/efler/microservice-data-bus
Data bus based on Apache Kafka and consisting of separate components [copied from own private repos]
data data-bus deduplication enrichment filtering kafka microservice mongodb postgresql redis
Last synced: 16 Apr 2026
https://github.com/timmymatten/spikeball-stat-tracker
Spikeball stat tracking web app built with Streamlit and Python, designed to easily log and analyze player performance over multiple games.
data data-analysis data-visualization dataset matplotlib-pyplot multipage python spikeball statistics streamlit
Last synced: 18 Apr 2026
https://github.com/buchananja/dpyp
A convenience tool for small-scale data pipelines in Python
data data-analysis data-cleaning data-engineering data-pipeline data-preprocessing data-processing data-science pandas pipeline
Last synced: 18 Apr 2026
https://github.com/arif-miad/data-visualization
A Comprehensive Guide to Data Visualization
analytics data data-science machine machine-learning-algorithms model python visualization
Last synced: 20 Apr 2026
https://github.com/ktbarrett/scdil
simple configuration and data interchange language
configuration data json python yaml
Last synced: 20 Apr 2026
https://github.com/allianz/yukimi
Self-service Snowflake provisioning with built-in security and policy enforcement.
Last synced: 05 Jun 2026
https://github.com/mishra-krishna/analysis-and-optimization-of-supply-chain-operations
Analyzed supply chain data to identify trends and key factors. Visualized sales, defect rates, lead times, and costs. Used Decision Tree Regressor to find top features impacting product costs and lead times.
data dataanalytics datavisualization supplychain supplychainanalytics
Last synced: 20 Apr 2026
https://github.com/stefen-taime/myubereats_datapipeline
Building a Modern Uber Eats Data Pipeline
airflow api data datawarehouse mongodb pipeline powerbi snowflake
Last synced: 22 Apr 2026
https://github.com/howtoquitvivek/ai-crop-yeild-prediction
AI-driven crop yield prediction and agricultural optimization system (SIH 2025)
2025 2026 ai crop-yeild data minor-project ml predcition python science sih
Last synced: 23 Apr 2026
https://github.com/andygol/osm-diff-state
CLI tool to search OSM diff state files
custom data openstreetmap planet replication
Last synced: 24 Apr 2026
https://github.com/chriseaton/sample-database
A long-term supported sample dataset for file and database unit testing and validation. Simple, straight-forward, raw data shared across formats.
data database examples flat-file samples schema unit-testing
Last synced: 25 Apr 2026
https://github.com/ahmad-ali-rafique/pyviznotebook
PyVizNotebook is a collection of Matplotlib visualizations demonstrating a wide range of plot types and techniques for data visualization. Whether you're a beginner looking to learn or an experienced developer seeking inspiration, this repository offers a diverse set of examples to explore.
analytics colab-notebook data data-science data-visualization dataanalytics matplotlib-python plots seaborn-python visualization
Last synced: 06 Jun 2026
https://github.com/sap-samples/security-research-codegraphsmote
Data augmentation strategy that can be applied to code graphs for learning-based vulnerability discovery.
augmentation data detection learning machine research sample security vulnerability
Last synced: 07 Jun 2026
https://github.com/nightroman/farnet.fsharp.data
FSharp.Data package for FarNet.FSharpFar
Last synced: 27 Apr 2026
https://github.com/sodascience/open_supply_hub
Processing supply chain data obtained from Open Supply Hub
data global-supply-chain open-supply-hub python
Last synced: 29 Apr 2026
https://github.com/wu-rymd/pyobjectify
Bridging the gap across the different file formats and streamlining the process to accessing ingested data via Python objects
Last synced: 08 Jun 2026
https://github.com/timclicks/dataclerk
zero fuss data logging over HTTP
actix-web command-line data logging rust sqlite sqlite3 utility
Last synced: 30 Apr 2026
https://github.com/shubham14p3/python-word-cloud
Simple python application to create word cloud.
data data-analysis data-science data-visualization nbextension python-3 upload-file
Last synced: 01 May 2026
https://github.com/lucien-loua/libgn
Manipulate geographical and administrative data about Guinea.
Last synced: 08 Jun 2026
https://github.com/gdhhgnbnvbn/f1-2025-ai-predict
fully generated by claude 3.5 sonnet via Windsurf IDE. Not a single lines wrote.
agent-based-modeling claude csv data f1 gpt machine-learning model prediction predictive-modeling python rainforest streamlit vibe
Last synced: 01 May 2026
https://github.com/syed-bilal-haider-engineer/interview_questions
Interview Questions
data database interview-questions javascript oop operating-system reactjs structure technical
Last synced: 01 May 2026
https://github.com/ggeop/multiple-fields-management
Fields management from/to different data sources. :bulb:
data data-engineering data-organization data-retrieval data-science pandas python
Last synced: 01 May 2026