data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum
- Last updated: 2026-07-03 00:07:49 UTC
https://github.com/sandipbera35/blogapp.spring.boot
A proof-of-concept blog application in Java using Spring Boot and Spring Data JPA, with MySQL and MinIO object storage. It integrates with a JWT authservice project (written in Go).
data java jpa jpa-entity-manager jpa-hibernate mysql mysql-server postman postmanapi spring-boot
Last synced: 13 Apr 2026
https://github.com/unownone/spenddy-link
Simple, privacy-friendly Chrome extension to track your spending and more!
Last synced: 12 Mar 2026
https://github.com/s-raza/csvio
Wrapper for conveniently processing CSV files
csv data file processing wrapper
Last synced: 14 Jan 2026
https://github.com/avto-dev/static-references-data
Data for static references
Last synced: 05 Oct 2025
https://github.com/wangshouh/cryptofinancedata
A Jupyter notebook (.ipynb) for acquiring data on futures, options, and other financial derivatives
Last synced: 05 Oct 2025
https://github.com/helins/ex.clj
Java exceptions as Clojure data
clojure data exception java java-exceptions
Last synced: 12 Dec 2025
https://github.com/hyperversal-blocks/averveil
Averveil is OpenSea for Data.
blockchain data golang iot privacy zero-knowledge zkp
Last synced: 14 Jan 2026
https://github.com/patrickdavies100/datapipeline37
Some Data Science practice using datasets available online. Currently test data is similar to this dataset: https://www.kaggle.com/datasets/asaniczka/amazon-uk-products-dataset-2023 but the plan is to expand.
data data-science pandas-dataframe python3
Last synced: 08 Oct 2025
https://github.com/pharo-ai/data-imputers
This project contains transformers for missing value imputation
ai data data-science imputer pharo pharo-smalltalk smalltalk
Last synced: 18 Jan 2026
https://github.com/scienxlab/datasets
Some small datasets for demos, courses, testing, etc.
data open-data sample-data teaching-resources
Last synced: 09 Oct 2025
https://github.com/varun-khorgade/sentimentscope-e-commerce-review-analyzer
Analyzed customer reviews and purchase data to extract sentiment and behavioral insights. Built SQL-based ETL for data preparation and visualized results using Python and Power BI dashboards for actionable business decisions.
analytics customer-beheviour dashboard data data-visualization dataextraction natural-language-processing nlp pandas powerbi python sentiment-analysis sql textblob
Last synced: 17 Apr 2026
https://github.com/east-empire-trading-company/eetc-data-client
Client library for retrieving data managed by EETC Data Hub.
client-library data data-science finance library python
Last synced: 31 May 2026
https://github.com/alexandregazagnes/rica-analysis
This repository contains the code to download, analyse, and model the RICA dataset from the French Ministry of Agriculture.
analysis argiculture business data data-analysis data-analytics food python
Last synced: 29 Apr 2026
https://github.com/definetlynotai/vulnscan_data
Logicytics VulnScan Module's Training Data and old model archive
ai data logicytics ml models pytorch sensitive-files text-processing tfidf-text-analysis training-data
Last synced: 11 Oct 2025
https://github.com/strata/data
Tools to help you read data from a range of different data providers.
Last synced: 27 Jan 2026
https://github.com/famarks/grafarg
Grafarg is an interactive data analytics and graphical data visualization application. A progressive fork of Grafana 7.5.17, Grafarg continues to be available under the open-source Apache 2.0 License.
analytics charts data data-analysis data-science data-visualization grafana grafarg graph
Last synced: 19 Jan 2026
https://github.com/jrmedd/emojinal
An experimental API for determining emoji sentiment, based on research from Institut "Jožef Stefan", Slovenia.
data emojis sentiment user-research ux
Last synced: 19 Jan 2026
https://github.com/saroshfarhan/kaggle-playground-s4e12
Kaggle competition first attempt
analytics data data-analysis-python data-science
Last synced: 12 Oct 2025
https://github.com/R-Mahesh45/HR---Resume-Text-Classification
Text Classification for Resumes: Conducted Exploratory Data Analysis (EDA) on a vast collection of resumes. Organized the data using Bag of Words (BoW) and TF-IDF techniques. Built and evaluated multiple models, with Logistic Regression delivering standout performance. Created Word Clouds and Histograms.
data datacleaning extract-transform-load feature-extraction nlp nltk-tokenizer text-mining text-processing
Last synced: 13 Oct 2025
https://github.com/iamgmujtaba/github-python-daily-trending
This repository provides an automated, daily-updated list of the top trending Python repositories on GitHub. Using a GitHub Actions workflow, it scrapes data from GitHub's trending page, sorts the results by total stars, and generates a clean, well-structured README file
data data-scraping github-actions tranding tranding-bot
Last synced: 13 Oct 2025
https://github.com/tberey/social-stocks
A Graphical Data and Analysis Tool
data data-analysis data-science data-stream data-visualization database javascript mysql mysql-database node nodejs rest rest-api social-stocks stock-market stocks ticker-data tickers trends typescript
Last synced: 21 Jan 2026
https://github.com/connectaman/deepseek-ocr-multigpu-infer
Efficient multi-GPU OCR inference framework leveraging parallel processes for accelerated token throughput and faster batch processing. Designed for scalable, high-performance optical character recognition workloads using PyTorch. Supports dynamic GPU assignment, optimized resource utilization, and easy integration for large-scale image datasets.
agentic-extraction data deepseek document-parser extraction extractor gpu image-parser llm multigpu nvidia ocr parallel-computing parser pdf-parser vlm
Last synced: 22 Jan 2026
https://github.com/player29879/neum-ai
Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.
ai chatgpt data data-engineering database embeddings etl llm llmops mlops ops pipeline python rag retrieval vector-database vectors
Last synced: 18 Apr 2026
https://github.com/lisakey/datacamp-data-analyst-python-sql-projects
Several projects completed during my Data Analyst 📊 training on the DataCamp platform with Python 🐍 and SQL 🗃️. Each project addresses real-world challenges using modern analytical tools and techniques.
analysis cleaning-data data dataanalysis dataanalyst matplotlib pandas python seaborn sql transformation visuali
Last synced: 19 Apr 2026
https://github.com/lahcenezzara/whatsapp-scraping-python
WhatsApp Scraping Python
automation data python scraping selenium whatsapp
Last synced: 05 Feb 2026
https://github.com/yeshunit/walmart-product-customer-sales-sql-analysis
This project explores Walmart sales data to understand the top-performing branches and products, the sales trends of different products, and customer behaviour. The aim is to study how sales strategies can be improved and optimized. The dataset was obtained from Kaggle.
data database mysql sql walmart
Last synced: 24 Feb 2026
https://github.com/morphaxthedeveloper/yokatlas-dataset-2025
Detailed YÖK Atlas data on universities, departments, scores, etc.
data database liste scrape universite veri yok-atlas yok-atlas-api yok-atlas-data yokatlas yokatlas-crawler yokatlas-data
Last synced: 14 Oct 2025
https://github.com/souvik09-tech/adventure-works-kpi-dashboard
This repository contains a complete Business Intelligence solution for AdventureWorks, a global manufacturing company specializing in cycling equipment and accessories. Built using Power BI Desktop, this project helps track KPIs, analyze product performance, compare regional data, and identify high-value customers.
analysis data kpi powerbi visualization
Last synced: 27 Jan 2026
https://github.com/ibilalkayy/covid-tracking-app
This repository contains the code of a COVID-19 tracking app that shows COVID-19 data on Google Maps.
Last synced: 14 Oct 2025
https://github.com/nnavales/desafios-data-engineer
In this project we tackle common challenges in the Data Engineer role using modern technologies.
data data-engineering database dataengineering docker minio scrapping spark
Last synced: 01 Jun 2026
https://github.com/intersystems-ib/workshop-healthcare-interop
Learn the basics of healthcare interoperability using InterSystems IRIS for Health
data fhir health hl7 interoperability
Last synced: 14 Apr 2026
https://github.com/open-i18n/data-iso-15924
Git mirror of the data for ISO 15924, Codes for the representation of names of scripts
data iso iso-15924 iso15924 open-i18n scripts unicode unicode-data writing-systems
Last synced: 14 Mar 2026
https://github.com/kledenai/jsonweaver
A powerful and easy-to-use library for transforming JSON data into popular formats such as CSV, XML, Markdown tables, YAML, and JSONLines (NDJSON).
csv data data-transform format json jsonlines jsonweaver markdown markdown-tables xml yaml
Last synced: 24 Feb 2026
https://github.com/sanskaryo/ultimate-dsa-repo
One Stop Solution for DSA Learning and Resources
data data-structures-and-algorithms dsa hacktoberfest hacktoberfest-accepted hacktoberfest2025
Last synced: 15 Oct 2025
https://github.com/bishtrishu/pizza_sales_data_analysis_sql
This project is a comprehensive data analysis of pizza sales, aimed at uncovering key insights and trends to inform business decisions. Using a combination of SQL, Python, and data visualization tools, the project analyzes sales data to understand customer preferences, peak sales periods, and the most popular pizza types.
cloud data data-analysis data-science data-visualization dataanalytics database mysql oracle-database
Last synced: 14 Apr 2026
https://github.com/potreic/etl-fashion-trend-analysis
✨ Automate fashion trend analysis with Apache Airflow! Extract data from X & Pinterest, transform into insights, and load into PostgreSQL. Predict seasonal styles & visualize trends. 💃📊
airflow airflow-dags data data-engineering etl etl-automation etl-pipeline fashion-trends
Last synced: 27 Jan 2026
https://github.com/data-forge-notebook/javascript-cheat-sheet
Cheat sheet that accompanies my book Data Wrangling with JavaScript
cheatsheet data data-wrangling javascript nodejs
Last synced: 15 Apr 2026
https://github.com/florianwendelborn/metatypes
Monorepo of TypeScript Metadata Definitions (e.g. HTTP Status Codes)
code-generation data datastructures enum http-status-codes jsdoc lerna metadata typescript
Last synced: 27 Jan 2026
https://github.com/mscbuild/analysis
🎢 This collection of data analysis projects demonstrates techniques for extracting, transforming, analyzing, and visualizing data. Data Analytics Projects for Beginners 📈 ⚡
anallysis analysis chart csv dashboard data data-science data-science-projects excel google html5 mashine-learning portfolio pyton
Last synced: 19 Oct 2025
https://github.com/sksubhadeep/airbnb-dashboard-tableau
Airbnb Dashboard Using Tableau
airbnb data tableau tableau-public visualization
Last synced: 19 Mar 2026
https://github.com/lemniscate-world/stratai
This project analyzes financial assets using a Hidden Markov Model (HMM) to identify different market regimes and patterns. The analysis includes calculating daily returns, rolling volatility, and volume changes, and visualizing the hidden states identified by the HMM.
ai assets data data-science data-visualization finance financial-analysis fintech hmm-model hmmlearn machine-learning trading
Last synced: 23 Oct 2025
https://github.com/cisagov/cyhy-feeds
Tools to create and retrieve Cyber Hygiene (CyHy) data extracts
Last synced: 23 Oct 2025
https://github.com/atymri/linqsimulator
LINQ Simulator is an interactive C# console application designed to let you experiment with LINQ queries in real time.
console csharp data data-analysis linq query sql
Last synced: 23 Oct 2025
https://github.com/eshaagarwa/hr-analytics-project
Explore our HR Analytics Dashboard, a powerful Power BI project designed for HR managers and leaders. It analyzes essential KPIs such as Employee Count, Attrition Rate, and Job Satisfaction across various demographics.
dashboard data data-visualization dataanylasis ms-excel ms-excel-data-analytics powerbi statistics
Last synced: 23 Jan 2026
https://github.com/garcane/Income-Prediction-ML
This is a machine learning project aimed at predicting whether an individual's annual income exceeds $50,000 based on their demographic and personal information.
data data-science machine-learning ml numpy pandas python random-forest scikit-learn
Last synced: 24 Oct 2025
https://github.com/farzai/geonames-php
This package provides a simple way to download Geonames data and format it for friendly use.
countries country-codes data geography geonames
Last synced: 24 Oct 2025
https://github.com/capire/xtravels-java
Travel booking app built with CAP Java, using master data from xflights
cap cds data federation flights java reuse
Last synced: 23 Jan 2026
https://github.com/cmda-tt/course-24-25
🎓 tech track · 2024-2025 · curriculum and syllabus 📊
d3 data datavis datavisualization es6 functional javascript programming svelte
Last synced: 28 Jan 2026
https://github.com/aleenprd/docbt
Documentation Build Tool - Generate YAML documentation for dbt models with optional AI assistance. Built with Streamlit for an intuitive and familiar web interface.
ai analytics-engineering bigquery data data-modeling data-science dbt docker llm lmstudio ollama openai snowflake sql streamlit
Last synced: 11 Nov 2025
https://github.com/city-of-helsinki/drupal-helfi-tyollisyyspalvelut-manuaali
Manual for the service information repository of the municipal employment trials
data drupal drupal-9 unemployment
Last synced: 24 Jan 2026
https://gitlab.com/Native-Coder/d3-react-component
This is a dead-simple React component that makes D3 implementation a breeze.
chart component d3 data react vis visualization viz
Last synced: 24 Jan 2026
https://github.com/sefakcmn00/tensorflow_machine_learning_simple-
Artificial Neural Network (ANN) Perceptron
data mathplotlib pandas pandas-dataframe pandas-python sklearn tensorflow-examples tensorflow2
Last synced: 06 Feb 2026
https://github.com/CheeseWithSauce/HadithsJSONFormat
Free, authentic Hadith data from sunnah.com, organized book by book, especially for Muslim developers. Includes Arabic, English, and gradings. Use freely without credits. Collections: Bukhari, Muslim, Abu Dawud, Tirmidhi, Nasa'i, Ibn Majah, Malik, Riyad as-Salihin. Expanding soon, Inshallah.
api arabic data dev free hadith islam islamic muslim open-source quran sunnah
Last synced: 24 Feb 2026
https://github.com/desktopcleaner/naturemagazinescraper
Scrapes open-access Nature magazine articles and stores them as .txt files.
data nature-magazine python scrapper word-frequency
Last synced: 06 Feb 2026
https://github.com/stdlib-js/ndarray-base-output-policy-str2enum
Return the enumeration constant associated with an output ndarray data type policy string.
array data dtype dtypes enum javascript multidimensional ndarray node node-js nodejs policy stdlib types util utilities utility utils
Last synced: 15 Apr 2026
https://github.com/alejo1630/titanic_kaggle
This Python Notebook is a proposal to analyse the Titanic dataset for the Kaggle Competition, using several data science techniques and concepts.
data data-science jupyter-notebook notebook python titanic-survival-prediction
Last synced: 03 May 2026
https://github.com/itu-helper/data-updater
Periodically scrapes data related to ITU for anyone to use. This data powers the ITU Helper websites.
data istanbul-technical-university scraper selenium-python
Last synced: 29 Jan 2026
https://github.com/jinsyin/datagovernance
WeChat official account: 「数据之道」 ("The Way of Data")
data data-governance datagovernance governance
Last synced: 30 Jan 2026
https://github.com/sandk21/etude_eau_potable_monde
Study of access to water around the world - dashboards built with Tableau
analysis data tableau tableau-public visualization
Last synced: 19 Mar 2026
https://github.com/simranjeet97/quotes-analysis
Analysis and visualization of a Kaggle quotes dataset with Python, pandas, and Matplotlib in a Jupyter Notebook.
data data-science datavisualization jupyter-notebook kaggle kaggle-dataset machine-learning matplotlib-pyplot numpy pandas python quotes quotes-application
Last synced: 15 Apr 2026
https://github.com/pablolec/sb_querydsl_criteria_builder
Complex and dynamic frontend-to-backend queries using querydsl
api data design dynamic-queries hibernate java jpa json query query-builder querydsl querydsl-generator rest-api rsql spring spring-boot sql vue web
Last synced: 07 Feb 2026
https://github.com/elissorokin/data-analyst-portfolio-rus
This is a repository where I showcase my skills, share projects, and track my progress in data analysis and Data Science.
ab-testing data data-analysis datalense matplotlib numpy pandas plotly portfolio postgresql python scipy seaborn sql statistical-analysis
Last synced: 25 Feb 2026
https://github.com/aniketkkajania/wassupanalyzer
WhatsAnalyzer is a powerful statistical analysis tool designed for analyzing WhatsApp chats. With the ability to process chat files exported from WhatsApp, this tool provides valuable insights by generating various plots and statistics.
data data-science datavisualization streamlit streamlit-webapp webapp whatsapp whatsapp-chat
Last synced: 25 Feb 2026
https://github.com/giladbarnea/to
A simple CLI tool to convert and diff between JSON, YAML, TOML, JSON5 and Python collections.
conversion data data-conversion json json5 parser script terminal toml yaml
Last synced: 08 Feb 2026
https://github.com/garcane/cookie-company-visual-dashboard
This Excel-based interactive dashboard provides a comprehensive overview of the Cookie Company's sales performance and key metrics.
dashboard data data-visualization excel microsoft-excel
Last synced: 09 Feb 2026
https://github.com/ajityadav2621/datadoom
Currently working on the backend. User interaction is done, so the project has been updated and deployed for reference. Many more things will be added.
Last synced: 09 Feb 2026
https://github.com/jhpoelen/bats
Self-documenting data publication on bat (Chiroptera) specimens
biodiversity data natural-history-collections provenance specimen
Last synced: 18 Mar 2026
https://github.com/scottleechua/data
Public datasets under CC-BY-4.0 license.
Last synced: 18 Mar 2026
https://github.com/mchenryspagg/hng-hire-data-model
The project involves creating a data model for HNG Hire, implementing it in MySQL, and building a Power BI dashboard to display hiring statistics.
dashboard data database datamodeling dimensional-modeling mysql mysql-database powerbi starschema
Last synced: 11 Feb 2026
https://github.com/skygenesisenterprise/aether-account
Your cloud hub to securely manage all Aether services, profiles, and preferences in one unified dashboard. Fully open-source, fully cloud.
account data javascript nextjs platform service sso-service typescript user-interface
Last synced: 16 Apr 2026
https://github.com/lmuffato/project-mongodb-dataflights-trybe
MongoDB Dataflights project - graded Trybe project for Block 23: Introduction to MongoDB
back-end crud data database filter mongo mongodb query trybe-projects
Last synced: 16 Apr 2026
https://github.com/shuklayash02/excel_complete_vrindastore_dataanalysis
Complete analysis: data cleaning, processing, and data analysis with an interactive dashboard
analysis data data-visualization datacleaning excel excel-vba
Last synced: 19 Mar 2026
https://github.com/seabbs/estzoonotictb
Explore, Visualise and Estimate the Global Zoonotic Tuberculosis Burden
bovine-tb data estimation package rstats tuberculosis visualisation zoonotic-tb
Last synced: 28 Feb 2026
https://github.com/tushard48/analyzing-usa-market-trends-a-financial-overview
In-depth analysis of US market trends, encompassing economic indicators, industry performance, and financial data
data data-visualization powerbi
Last synced: 19 Mar 2026
https://github.com/m0nica/datalogues-outdated
Programming blog focused on data with an emphasis on exploration in Python. Has been migrated from Pelican to Jekyll
data pelican pelican-blog pelican-theme
Last synced: 28 Feb 2026
https://github.com/sakshisrivastava-2601/credit-card-fraud-detection
Credit card fraud detection project using machine learning. This project focuses on leveraging advanced machine learning techniques to identify fraudulent transactions with high accuracy.
advanced-machine data machine-learning numpy project-repository python pytorch random-forest
Last synced: 16 Apr 2026
https://github.com/garcane/london-housing-price-dashboard
This Excel-based Housing Visual Dashboard provides a comprehensive view of average house prices across various boroughs in London from 1996 to 2013. The dashboard is designed to offer insights into housing market trends and price variations across different areas of London over time.
data data-analysis data-visualization excel visual
Last synced: 13 Feb 2026
https://github.com/stdlib-js/array-base-every-by-right
Test whether all elements in an array pass a test implemented by a predicate function, iterating from right to left.
all array data every generic javascript node node-js nodejs predicate stdlib structure test types validate
Last synced: 13 Feb 2026
https://github.com/frictionlessdata/extensiondp
Extension DP (Data Package Extension Template) is a Git repository template for rapid Data Package extension development
data datapackage exchange extension format
Last synced: 13 Feb 2026
https://github.com/saisriramkamineni/e-commerce-sales-analysis-excel-
Conducted an in-depth sales analysis for an e-commerce platform, leveraging Excel for data preprocessing and Power BI for visualization. Identified key sales trends, customer purchasing behavior, and revenue growth patterns to optimize business performance.
analysis analytics data excel sales
Last synced: 14 Feb 2026
https://github.com/stdlib-js/array-base-assert-is-complex-floating-point-data-type
Test if an input value is a supported array complex-valued floating-point data type.
array assert base check data dtype is javascript node node-js nodejs stdlib test types util utilities utility utils valid validate
Last synced: 14 Feb 2026
https://github.com/luminati-io/twitter-x-dataset-samples
A sample dataset of over 1000 Twitter (X) posts, extracted using the Bright Data API, ideal for trend discovery, brand monitoring, and competitive insights.
api data dataset twitter twitter-api twitter-scraper web-scraping x
Last synced: 19 Mar 2026
https://github.com/ghonimo/diode-pn-junction-characterization-psu-ece515
A detailed analysis of the I-V characteristics of a PN junction diode (1N4148) under different temperatures, utilizing Excel for graphical analysis and parameter extraction. This study was conducted as part of the ECE 515: Fundamentals of Semiconductor Devices course at Portland State University.
analysis characterization data device diode diodes excel mosfet-transistor pn-junction
Last synced: 28 Feb 2026
https://github.com/linx-software/file-import-to-rest-api
Import a CSV file and make the data available via a REST API.
Last synced: 19 Mar 2026
https://github.com/theonlybeardedbeast/exercise-data
Datasets for workout exercises
data dataset fitness health healthcare
Last synced: 20 Mar 2026