data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-07-01 00:07:35 UTC
- JSON Representation
https://github.com/olamide100/capstone-project-llm-zoomcamp
Comparative Guide Assistant
argocd data dataengineering docker grafana kubernetes llm-agent mlops-workflow rag strreamlit
Last synced: 14 Feb 2026
https://github.com/palewire/nyc-hpd-bronx-lead-paint-violations
Download and process housing code lead paint violations in the Bronx from NYC Open Data
bronx data data-journalism news nyc python
Last synced: 02 Apr 2026
https://github.com/2kabhishek/pyramen
Data Analysis for Ramen 🍜💹
csv data data-analysis fun python report
Last synced: 26 Oct 2025
https://github.com/codeforafrica/ckanext-followy
[ARCHIVED] A CKAN extension to show the datasets a user is following.
ckan ckan-extension ckanext-followy data dataset followy-extension open-data
Last synced: 29 Jun 2026
https://github.com/abdul-rafay19/youngdevinterns_machine-learning_tasks
This internship offers hands-on exposure to real-world Machine Learning applications — from data visualization and preprocessing to model development, evaluation, and deployment. It focuses on real ML workflows, problem-solving, neural networks, and hyperparameter tuning — all within a collaborative, remote, and growth-oriented environment.
ai artificial-intelligence artificial-intelligence-algorithms artificial-neural-networks data data-visualization internship machine-learning machine-learning-algorithms machinelearning ml model model-development neural-network preprocessing programming-language python task tasks youngdevintern
Last synced: 29 Apr 2026
https://github.com/nitrosh/nitro-validate
A powerful, standalone, dependency-free data validation library for Python with extensible rules and a clean, intuitive API.
data python3 validation validation-library
Last synced: 17 Apr 2026
https://github.com/aleenprd/docbt
Documentation Build Tool - Generate YAML documentation for dbt models with optional AI assistance. Built with Streamlit for an intuitive and familiar web interface.
ai analytics-engineering bigquery data data-modeling data-science dbt docker llm lmstudio ollama openai snowflake sql streamlit
Last synced: 11 Nov 2025
https://github.com/aranfononi/h4x0r-news-section-17-project
A SwiftUI-powered app that displays top stories from Hacker News. Users can open articles directly within the app, utilizing SwiftUI’s NavigationLink and custom WebView integration.
app-development data data-binding data-binding-library ios swift swiftui xcode
Last synced: 18 May 2026
https://github.com/svetlanam/twitter-ads
Get data about campaigns from Twitter Ads API
api data keboola keboola-extractor twitter twitter-ads twitter-api
Last synced: 12 Jun 2026
https://github.com/vagnerbellacosa/029_analisededadoscompythonpandas
Neste Labs será apresentada a biblioteca Pandas, uma biblioteca Python de código aberto para análise de dados. Ela dá ao Python a capacidade de trabalhar com dados do tipo planilha, permitindo carregar, manipular e combinar dados rapidamente, entre outras funções. Python
data digital-innovation-one dio jupiter-notebook labs ms-excel panda python
Last synced: 14 May 2026
https://github.com/brandonhimpfen/data-size-parser
A tiny, practical parser for human-readable data sizes.
data data-size data-sizes npm open-source web-design web-development
Last synced: 12 Jun 2026
https://github.com/giorgiosavastano/process
processing-chain provides a convenient way to seamlessly set up processing chains for large amounts of data.
big-data data data-science parallel parallel-computing process processing processing-chain rust
Last synced: 05 Oct 2025
https://github.com/ilejuxepwaduzd/structured-data-extractor
🛠️ Extract structured data from messy texts using Chain-of-Thought prompting to improve processing of customer support and technical issues.
cdp chrome-fetcher data document-extraction ecommerce golang-library headless metadata-extraction ocr open-source pdf pdf-converter pdf-extractor ruby scraper shopify spider structured-data
Last synced: 10 Apr 2026
https://github.com/patrikmasiar/algorythm-of-the-night
Awesome list of algorithms that help you 🚀 Feel free to contribute 👨🏻💻
algorithms data interview-questions logic logic-programming math mathematics science
Last synced: 27 Oct 2025
https://github.com/arif-miad/heart-attack-risk-prediction
This dataset explores key factors influencing heart attack risk, such as age, cholesterol, blood pressure, and lifestyle habits. Using machine learning models.
classification data data-science matplotlib ml pandas-python seaborn visualization
Last synced: 18 Aug 2025
https://github.com/freddy03h/immutable-data-structure
Normalize and Merge your application's data store using Immutable.JS objects
Last synced: 05 Oct 2025
https://github.com/stdlib-js/array-base-to-deduped
Copy elements to a new generic array after removing consecutive duplicated values.
array compress copy data dedupe deduplicate deduplication duplicate generic javascript node node-js nodejs stdlib structure types uniq unique
Last synced: 14 Jun 2025
https://github.com/simranjeet97/python-flask-crud-web-app
Python Flask CRUD Web App Performing Operations Using MySQL.
bootstrap3 data data-science flask flask-web machine-learning natural-language-processing python templates webapp
Last synced: 19 Apr 2026
https://github.com/arif-miad/data-visualization
A Comprehensive Guide to Data Visualization
analytics data data-science machine machine-learning-algorithms model python visualization
Last synced: 20 Apr 2026
https://github.com/dylanhogg/cloud-products
A package for getting cloud products and product descriptions from a cloud provider website.
aws cloud-products crawler data text-processing
Last synced: 05 Oct 2025
https://github.com/stdlib-js/datasets-herndon-venus-semidiameters
Fifteen observations of the vertical semidiameter of Venus, made by Lieutenant Herndon, with the meridian circle at Washington, in the year 1846.
astronomy data dataset datasets grubbs herndon javascript node node-js nodejs outlier outliers sample statistics stats stdlib venus
Last synced: 09 Oct 2025
https://github.com/maccccd/wsoa3029a_2444372
This website serves an extension of my portfolio work. It focuses specifically on showcasing my understanding of D3.js , a JavaScript library used to create interactive data visualizations. The visualizations in here were used to provide insights on two types of cybersecurity attacks: Phishing & Ransomware.
d3js data hacking visualization
Last synced: 24 Jan 2026
https://github.com/allianz/yukimi
Self-service Snowflake provisioning with built-in security and policy enforcement.
Last synced: 05 Jun 2026
https://github.com/mohsinali08000/myportfolio
I’m Mohsin Ali, a passionate software engineer with over 2 years of experience in developing robust software solutions. Currently transitioning into the field of data science.
Last synced: 22 Apr 2026
https://github.com/mishra-krishna/analysis-and-optimization-of-supply-chain-operations
Analyzed supply chain data to identify trends and key factors. Visualized sales, defect rates, lead times, and costs. Used Decision Tree Regressor to find top features impacting product costs and lead times.
data dataanalytics datavisualization supplychain supplychainanalytics
Last synced: 20 Apr 2026
https://github.com/jinsyin/dataorigin
数据之源 | A data source management framework
Last synced: 21 Apr 2026
https://github.com/zoekelepiri/ota_observatory
A front-end web application that provides detailed information about the boundaries and statistical data of the regions and prefectures of Greece.
backend data database spring-boot
Last synced: 06 Feb 2026
https://github.com/ezfe/activityringsexporter
apple-watch applewatch data healthkit ios
Last synced: 08 May 2026
https://github.com/hyperversal-blocks/averveil
Averveil is OpenSea for Data.
blockchain data golang iot privacy zero-knowledge zkp
Last synced: 14 Jan 2026
https://github.com/sefakcmn00/tensorflow_machine_learning_simple-
Artificial Neural Network(ANN) Perceptron
data mathplotlib pandas pandas-dataframe pandas-python sklearn tensorflow-examples tensorflow2
Last synced: 06 Feb 2026
https://github.com/CheeseWithSauce/HadithsJSONFormat
Free, authentic Hadith data from sunnah.com organized bookwise specially for Muslim devs. Includes Arabic, English, and gradings. Use freely without credits. Collections: Bukhari, Muslim, Abu Dawud, Tirmidhi, Nasa'i, Ibn Majah, Malik, Riyad as-Salihin. Expanding soon, Inshallah.
api arabic data dev free hadith islam islamic muslim open-source quran sunnah
Last synced: 24 Feb 2026
https://github.com/ariqf1/learn_data
Currently learning and building projects related to data pipelines, ETL processes, and data processing using Python. Passionate about scalable data solutions and modern data stack tools.
Last synced: 15 Apr 2026
https://github.com/aravind-selvam/bikeshare-company-analysis
Google Data Analytics Professional Certificate program's Capstone project, of a bike sharing company
analytics business-analytics business-intelligence data data-analysis data-visualization dataanalytics google-data-analytics postgresql sql sql-server
Last synced: 22 Apr 2026
https://github.com/tkonopka/makealive
Dynamic web content through controlled javascript
conversion-functions d3 data data-science javascript visualization
Last synced: 22 Apr 2026
https://github.com/neelravi/data-management
A data management plan for computational chemists/physicists and material scientists for a FAIR storage of raw data
data dmp fair management workflows
Last synced: 16 Jan 2026
https://github.com/howtoquitvivek/ai-crop-yeild-prediction
AI-driven crop yield prediction and agricultural optimization system (SIH 2025)
2025 2026 ai crop-yeild data minor-project ml predcition python science sih
Last synced: 23 Apr 2026
https://github.com/jmcanterafonseca/leaflet-context-information
A Leaflet plugin + infrastructure for getting access to Context Information (i.e. data) exposed through FIWARE NGSIv2
context data fiware information leaflet map open visualization web
Last synced: 21 Apr 2026
https://github.com/stephaniehicks/flowsorted.blood.wgbs.blueprint
A Bioconductor ExperimentHub data package for flow sorted purified whole blood cell types measured using DNA methylation on WGBS platform from BLUEPRINT
bioconductor bioconductor-package bisulfite-sequencing blood data dna-methylation flowsort wgbs
Last synced: 25 Sep 2025
https://github.com/sathyasris27/data-analysis-on-adult-smoking-patterns-in-the-uk
The aim of this analysis is to understand the smoking patterns among adults in the UK.
data data-analysis data-visualization python3
Last synced: 09 May 2026
https://github.com/fairspec/fairspec-typescript
Fairspec TypeScript is a fast data management framework built on top of the Fairspec standard and Polars DataFrames
ckan csv data dataframe dataset excel fair json ods polars quality schema sqlite table typescript validation zenodo
Last synced: 09 Feb 2026
https://github.com/alejo1630/titanic_kaggle
This Python Notebook is a proposal to analyse the Titanic dataset for the Kaggle Competition, using several data science techniques and concepts.
data data-science jupyter-notebook notebook python titanic-survival-prediction
Last synced: 03 May 2026
https://github.com/dataship/beam
Get collimate'd data into Frame, in Node or the Browser
column-store data data-science
Last synced: 27 Apr 2026
https://github.com/scarblase/russian-military-losses-analysis
This repository provides an in-depth analysis of Russian equipment losses using PySpark and data visualization techniques.
data data-science data-visualization jyputer-notebook matplotlib pyspark python3 seaborn seaborn-plots ukraine ukraine-invasion
Last synced: 12 May 2026
https://github.com/bredalis/functionalprogrammingpython
💻 Programación Funcional en Python
data functional-programming functions programing programming-language python structured-data
Last synced: 06 Jun 2026
https://github.com/R-Mahesh45/HR---Resume-Text-Classification
Text Classification for Resumes: Conducted Exploratory Data Analysis (EDA) on a vast collection of resumes. Organized the data using Bag of Words (BoW) and TF-IDF techniques. Built and evaluated multiple models, with Logistic Regression delivering standout performance. Created Word Clouds and Histograms.
data datacleaning extract-transform-load feature-extraction nlp nltk-tokenizer text-mining text-processing
Last synced: 13 Oct 2025
https://github.com/qbicsoftware/research-data-management
Documentation about the life science research data management at QBiC
data data-management data-stewardship documentation hacktoberfest life-science management metadata rdm reasearch-data-management
Last synced: 30 Jan 2026
https://github.com/ucd-cws/nitrates-cv
california centralvalley data frep groundwater model nitrates
Last synced: 16 Jan 2026
https://github.com/iamgmujtaba/github-python-daily-trending
This repository provides an automated, daily-updated list of the top trending Python repositories on GitHub. Using a GitHub Actions workflow, it scrapes data from GitHub's trending page, sorts the results by total stars, and generates a clean, well-structured README file
data data-scraping github-actions tranding tranding-bot
Last synced: 13 Oct 2025
https://github.com/ahmad-ali-rafique/pyviznotebook
PyVizNotebook is a collection of Matplotlib visualizations demonstrating a wide range of plot types and techniques for data visualization. Whether you're a beginner looking to learn or an experienced developer seeking inspiration, this repository offers a diverse set of examples to explore.
analytics colab-notebook data data-science data-visualization dataanalytics matplotlib-python plots seaborn-python visualization
Last synced: 06 Jun 2026
https://github.com/danieljdufour/rle-serializers
Serialize and Deserialize Run Length Encoding
cloud-optimized compression csv data deserializer run-length run-length-decoding run-length-encoding serializer
Last synced: 24 Sep 2025
https://github.com/tberey/social-stocks
A Graphical Data and Analysis Tool
data data-analysis data-science data-stream data-visualization database javascript mysql mysql-database node nodejs rest rest-api social-stocks stock-market stocks ticker-data tickers trends typescript
Last synced: 21 Jan 2026
https://github.com/stdlib-js/array-one-to
Generate a linearly spaced numeric array whose elements increment by 1 starting from one.
array data float32array float64array int16array int32array javascript matrix ndarray node node-js nodejs stdlib structure typed typed-array types uint32array vector
Last synced: 26 Feb 2026
https://github.com/zalweny26/open_data_unipa
Progetto per l'esame di Laboratorio di Algoritmi 23-24, UniPa, Informatica L-31
Last synced: 26 Apr 2026
https://github.com/connectaman/c-and-data-structure
Program,Notes,Explanation on Data Structure using C++
cpp data data-structures sorting-algorithms
Last synced: 14 Mar 2025
https://github.com/sap-samples/security-research-codegraphsmote
Data augmentation strategy that can be applied to code graphs for learning-based vulnerability discovery.
augmentation data detection learning machine research sample security vulnerability
Last synced: 07 Jun 2026
https://github.com/rishabh-agarwal/datastructuremachineproblem
Data Structure MP - Clemson University (Language C)
273 alogrithms clemson data ece structure university
Last synced: 26 Oct 2025
https://github.com/williamwutq/bblock
Persistent checksummed blocks built on top of bstack's allocators
allocation binary block data data-structures database rust rust-crate rust-library serialization
Last synced: 25 Jun 2026
https://github.com/elhariri78/case-study-a-better-smoker-detector
Case Study-A better Smoker Detector
data dataframe evaluation kaggle matplotlib-pyplot numpy pandas pandas-dataframe pandas-python python3 seaborn sklearn
Last synced: 07 Apr 2026
https://github.com/stdlib-js/array-base-to-accessor-array
Convert an array-like object to a minimal array-like object supporting the accessor protocol.
accessor accessors array array-like convert data javascript node node-js nodejs object protocol stdlib structure types wrap wrapper
Last synced: 04 Jan 2026
https://github.com/nodef/infoods
Kit for International Network of Food Data Systems (INFOODS).
component data food identifier infoods international network systems tagnames
Last synced: 11 Mar 2026
https://github.com/nightroman/farnet.fsharp.data
FSharp.Data package for FarNet.FSharpFar
Last synced: 27 Apr 2026
https://github.com/athul64/powerbi
Financial Reports Dashboard This repository showcases a Financial Reporting Dashboard that visualizes key financial metrics and performance insights. The dashboard contains Monthly and Annual reports, allowing users to switch between the two views to analyze data at different intervals.
data data-an data-visualization dax dax-expression powerbi
Last synced: 23 Feb 2026
https://github.com/mini-ware/mini-ware
Just some very simple markdown for my GitHub profile
codewars ctf data hackthebox javascript markdown minimalistic profile-readme python readme-profile simple stattistics svg
Last synced: 13 Apr 2026
https://github.com/devsujay19/knowledgebase
My knowledge base built with NextJS 14, Tailwind CSS 3 and Aceternity UI.
data knowledge-base nextjs nextjs-typescript nextjs14 react server-side-rendering tailwindcss vercel
Last synced: 10 Apr 2026
https://github.com/rdjarbeng/rdjarbeng
Richard Djarbeng's github profile-computer engineer specializing in web development, machine learning, and IoT devices. New web posts have moved to website below
data jekyll machine-learning ruby website
Last synced: 28 Apr 2026
https://github.com/awesomelistsio/awesome-open-data
A curated list of high-quality open data resources, tools, platforms, and projects across domains.
awesome awesome-list awesome-lists data open open-data
Last synced: 29 Jun 2025
https://github.com/soenneker/soenneker.quark.table
A native Blazor table component.
blazor blazorlibrary csharp data dotnet html quark quarktable table tables
Last synced: 13 Aug 2025
https://github.com/tee8z/noaa-oracle
NOAA data oracle, queryable from the browser and can attest to events for a Bitcoin DLC in dlctix style
data duckdb-wasm noaa-weather parquet-files sql weather
Last synced: 17 Feb 2026
https://github.com/flowsynx/plugin-postgresql
FlowSynx plugin to interfaces with PostgreSQL for CRUD operations. Supports JSONB, full-text search, and advanced query features.
data database flowsynx postgresql postgresql-database sql
Last synced: 09 May 2026
https://github.com/the-aerospace-corporation/pivt
PIVT is an analytics tool to help software development teams visualize the life cycle and behavior of their software factory.
analytics dashboards data devops jenkins pipeline python splunk visualization
Last synced: 29 Apr 2026
https://github.com/garcane/global-shipping-analytics-dashboard
This Tableau project provides a comprehensive visual analysis of global sales, shipping costs, and quality metrics across different regions and countries.
data data-analysis data-analyst data-visualization metrics tableau
Last synced: 01 Mar 2026
https://github.com/joocer/data_expectations
Are your data meeting your expectations?
data data-engineering data-quality data-science data-unit-tests observability pipelines quality validation
Last synced: 07 Oct 2025
https://github.com/elissorokin/data-analyst-portfolio-rus
Это репозиторий, в котором я демонстрирую свои навыки, делюсь проектами и отслеживаю прогресс в области анализа данных и Data Science.
ab-testing data data-analysis datalense matplotlib numpy pandas plotly portfolio postgresql python scipy seaborn sql statistical-analysis
Last synced: 25 Feb 2026
https://github.com/aniketkkajania/wassupanalyzer
WhatsAnalyzer is a powerful statistical analysis tool designed for analyzing WhatsApp chats. With the ability to process chat files exported from WhatsApp, this tool provides valuable insights by generating various plots and statistics.
data data-science datavisualization streamlit streamlit-webapp webapp whatsapp whatsapp-chat
Last synced: 25 Feb 2026
https://github.com/cworld1/novel-data
The data repository of novel analysis
Last synced: 01 Feb 2026
https://github.com/SAP-archive/signavio-qualtrics-di
Setup an SAP Data Intelligence data pipeline to connect Qualtrics surveys data to SAP Signavio Process Intelligence via Ingestion API.
data intelligence process-intelligence qualtrics sample sap-data-intelligence sap-signavio-process-intelligence signavio
Last synced: 09 May 2025
https://github.com/helosantosdesousa/analise-previsao-de-rotatividade-ml
Projeto final do Bootcamp Data Girls 2025 que analisa a rotatividade de funcionários usando Machine Learning. Com base no dataset IBM HR Analytics Attrition, o projeto identifica os principais fatores de risco e cria modelos preditivos (SVC e Random Forest) com até 89% de acurácia para antecipar saídas e apoiar decisões estratégicas de RH.
analise-de-dados analise-exploratoria bootcamp ciencia-de-dados colab-notebook dados data data-analysis data-science dataanalytics dataframe eda machine-learning machine-learning-algorithms pandas python random-forest svc
Last synced: 16 Apr 2026
https://github.com/vvipjain/hockey-tournament-analysis
Hockey Tournament Analysis
beautifulsoup data data-analysis data-visualization databases pandas pandas-dataframe powerbi python python-library python-script requests-library-python sql sql-server sqlalchemy
Last synced: 27 Jan 2026
https://github.com/rformassspectrometry/msdatahub
Mass Spectrometry Data on ExperimentHub
bioconductor data mass-spectrometry metabolomics proteomics r r-package
Last synced: 14 Apr 2025
https://github.com/v-mayya/python-sales-data-analysis
Group project with another team member held by CFG to conduct spreadsheet data analysis of fake sales data using Python
analysis data matplotlib numpy python
Last synced: 29 Apr 2026
https://github.com/velocitatem/cellviz
Cellular Automata inspired by live-data visualization, designed to handle multidimensional and high-throughput data efficiently.
cellular-automata conways-game-of-life data economics
Last synced: 29 Jul 2025
https://github.com/mohnoor94/datasciencefundementalsusingpython
My journey to learn Data Science with Python
data data-analysis data-science data-visualization learning learning-by-doing python python3
Last synced: 19 Jun 2026
https://github.com/spine-tools/metreload
Python application for downloading meteorological reanalysis data
Last synced: 01 Jul 2025
https://github.com/wu-rymd/pyobjectify
Bridging the gap across the different file formats and streamlining the process to accessing ingested data via Python objects
Last synced: 08 Jun 2026
https://github.com/mrbisquit/weathercollector
Open-Source weather station data collector
collector customisable data modular opensource weather weather-forecast weather-station
Last synced: 16 Jan 2026
https://github.com/oneblack333/pizza_sales_analysis
The project involves transforming raw pizza sales data into actionable business intelligence through analysis and visualization. This enables pizza business owners to make data-driven decisions on inventory, staffing, and marketing, ultimately improving performance and profitability.
data data-structures data-visualization excel mysql powerbi
Last synced: 20 Jun 2026
https://github.com/chompfoods/stub-asp-net-core
ASP.NET Core server stub for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.
api asp asp-net-core aspnetcore branded chomp data database food grocery ingredients nutrition raw recipe-api recipes server stub stub-server
Last synced: 30 Apr 2026
https://github.com/jorgeatgu/casa-caida-bot
Twitter-bot sobre la despoblación en Aragón
aragon bot data data-viz despoblacion twitter-bot
Last synced: 11 Aug 2025
https://github.com/qeeqbox/data-classification
Data classification defines and categorizes data according to its type, sensitivity, and value
classification data data-classification infosecsimplified qeeqbox
Last synced: 09 Mar 2026
https://github.com/souvik09-tech/adventure-works-kpi-dashboard
This repository contains a complete Business Intelligence solution for AdventureWorks, a global manufacturing company specializing in cycling equipment and accessories. Built using Power BI Desktop, this project helps track KPIs, analyze product performance, compare regional data, and identify high-value customers.
analysis data kpi powerbi visualization
Last synced: 27 Jan 2026