data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum
- Last updated: 2026-06-23 00:07:41 UTC
https://github.com/petermartens98/nba-analytics-streamlit-app-with-langchain-agent
Interactive NBA Analytics app with Streamlit and a LangChain conversational agent connected to extracted data. Explore player, team, and game stats, track injuries, run simulations, visualize trends, and get AI-powered insights. Ongoing development, open to collaboration.
agentic-ai analysis data deepseek langchain nba python streamlit visualization
Last synced: 08 May 2026
https://github.com/vagnerbellacosa/029_analisededadoscompythonpandas
This lab introduces Pandas, an open-source Python library for data analysis. It gives Python the ability to work with spreadsheet-like data, letting you load, manipulate, and combine data quickly, among other functions. Python
data digital-innovation-one dio jupiter-notebook labs ms-excel panda python
Last synced: 14 May 2026
https://github.com/jmcanterafonseca/leaflet-context-information
A Leaflet plugin + infrastructure for getting access to Context Information (i.e. data) exposed through FIWARE NGSIv2
context data fiware information leaflet map open visualization web
Last synced: 21 Apr 2026
https://github.com/mattythedev01/easydatadb
A quick and easy way to store data!
data database discord-bot discord-js discord-ts discordbot discordjs discordts npm npm-package package quick-db quickdb
Last synced: 13 Apr 2026
https://github.com/robertopatino1/oscars2023_data_analysis
A deep data science analysis involving tweets regarding the upcoming Academy Awards
data data-analysis-python data-science data-visualization html jupyter-notebook lda-model machine-learning python trends tweepy twitter
Last synced: 24 Apr 2026
https://github.com/jerryfzhang/rockets
A Node + React App that displays space launch missions around the world.
bootstrap data expressjs less momentjs nodejs react reactjs reactstrap
Last synced: 10 Apr 2026
https://github.com/sandipbera35/blogapp.spring.boot
A proof-of-concept blog application built with Java Spring Boot and Spring Data JPA, using MySQL and MinIO object storage. It integrates with a JWT authservice project (written in Go).
data java jpa jpa-entity-manager jpa-hibernate mysql mysql-server postman postmanapi spring-boot
Last synced: 13 Apr 2026
https://github.com/richardwarepam16/hotel_analysis_using_python
Unlocking Insights: Analyzing Hotel Reservation Data to Boost Business Performance
data data-analysis data-visualization hotel-booking hotel-cancellation-solution hotel-management-system jupyter-notebook python python3
Last synced: 22 Aug 2025
https://github.com/aadityatamrakar/futures_spread_chart
Cash Market & Futures Daily Spread Chart - NSE Stocks
data data-analysis data-mining expressjs nodejs requests
Last synced: 10 Apr 2026
https://github.com/ncgl-git/eriparse
Python code to parse the cost-of-living HTML from erieri.com, i.e. https://www.erieri.com/cost-of-living/united-states/illinois/chicago
cost-of-living crime crime-data data economic-research-institute erieri webscraper
Last synced: 14 Jan 2026
https://github.com/rambodrahmani/covid19-behind-the-numbers
COVID-19: Behind the Numbers.
apriori-algorithm apriori-algorithm-python clustering clustering-algorithm clustering-analysis covid covid-19 covid19-data data data-mining data-science datamining fpgrowth machine-learning machine-learning-algorithms python python-machine-learning
Last synced: 20 Aug 2025
https://github.com/bunnysunny24/bluepulse
A Smart Water Management System
data data-processing data-visualization firebase iot machine-learning mysql-database reactjs
Last synced: 17 Mar 2025
https://github.com/gusenov/qazaqstan-geography-data
:world_map: Geographic data of Kazakhstan.
data geographic-data geography json kazakhstan qazaqstan regions
Last synced: 20 Feb 2026
https://github.com/akhi07rx/f1-statistics-dashboard
A comprehensive command-line tool for analyzing Formula 1 race data using the FastF1 library.
akhi07rx cli cli-tools data f1 f1-score f1cli f1dashboard f1stats fastf1 formula1 opensource race race-analytics
Last synced: 23 May 2026
https://github.com/s-raza/csvio
Wrapper for conveniently processing CSV files
csv data file processing wrapper
Last synced: 14 Jan 2026
https://github.com/avto-dev/static-references-data
Data for static references
Last synced: 05 Oct 2025
https://github.com/giorgiosavastano/process
processing-chain provides a convenient way to seamlessly set up processing chains for large amounts of data.
big-data data data-science parallel parallel-computing process processing processing-chain rust
Last synced: 05 Oct 2025
https://github.com/gematik/app-fhir-snapshots-package-generator
The repository contains a library and a console application to generate snapshots for StructureDefinitions in FHIR-packages.
Last synced: 05 Oct 2025
https://github.com/igorwastaken/math-problems
Solve math problems easily with this utility library.
algorithm area data demography geography javascript math npm package population school typescript util utils
Last synced: 23 Feb 2026
https://github.com/arif-miad/heart-attack-risk-prediction
This dataset explores key factors influencing heart attack risk, such as age, cholesterol, blood pressure, and lifestyle habits, using machine learning models.
classification data data-science matplotlib ml pandas-python seaborn visualization
Last synced: 18 Aug 2025
https://github.com/freddy03h/immutable-data-structure
Normalize and Merge your application's data store using Immutable.JS objects
Last synced: 05 Oct 2025
https://github.com/DefinetlyNotAI/VulnScan_Data
Logicytics VulnScan Module's Training Data and old model archive
ai data logicytics ml models pytorch sensitive-files text-processing tfidf-text-analysis training-data
Last synced: 17 Aug 2025
https://github.com/dylanhogg/cloud-products
A package for getting cloud products and product descriptions from a cloud provider website.
aws cloud-products crawler data text-processing
Last synced: 05 Oct 2025
https://github.com/joocer/data_expectations
Are your data meeting your expectations?
data data-engineering data-quality data-science data-unit-tests observability pipelines quality validation
Last synced: 07 Oct 2025
https://github.com/ahmad-ali-rafique/comment-generation-tool
This repository hosts a Jupyter Notebook-based Comment Generation Tool exploring advanced NLP techniques for automated, contextually relevant comment generation from input data. Ideal for developers and researchers in NLP and automated text generation.
ai aitools artificial-intelligence content-based-recommendation data datascience jupyter-notebook machine-learning
Last synced: 07 Oct 2025
https://github.com/garcane/income-prediction-ml
This is a machine learning project aimed at predicting whether an individual's annual income exceeds $50,000 based on their demographic and personal information.
data data-science machine-learning ml numpy pandas python random-forest scikit-learn
Last synced: 08 Apr 2026
https://github.com/tushar2704/insurance-cross-sell
This project harnesses the power of cutting-edge technologies including H2O AutoML, MLflow, FastAPI, and Streamlit to enhance cross-selling campaigns and boost efficiency.
data datascience h20automl machine-learning mlflow python streamlit-tushar2704
Last synced: 08 Oct 2025
https://github.com/zediculz/block
Block is a data structure/collection that applies blockchain principles to managing data.
Last synced: 05 Oct 2025
https://github.com/nikoshet/rust-dms-cdc-operator
The rust-dms-cdc-operator is a Rust-based utility for comparing the state of a list of tables in an Amazon RDS database with data stored in Parquet files on Amazon S3, particularly useful for change data capture (CDC) scenarios.
aws cdc data dms parquet pgdatadiff polars postgres rds rust s3 validation
Last synced: 18 Jan 2026
https://github.com/ryanjoy0000/yt-notifier
YouTube Notifier (Telegram Bot) - A real-time data processing pipeline
data go kafka-streams real-time telegram-api youtube-api
Last synced: 14 Jan 2026
https://github.com/pharo-ai/data-imputers
This project contains transformers for missing value imputation
ai data data-science imputer pharo pharo-smalltalk smalltalk
Last synced: 18 Jan 2026
https://github.com/varun-khorgade/sentimentscope-e-commerce-review-analyzer
Analyzed customer reviews and purchase data to extract sentiment and behavioral insights. Built SQL-based ETL for data preparation and visualized results using Python and Power BI dashboards for actionable business decisions.
analytics customer-beheviour dashboard data data-visualization dataextraction natural-language-processing nlp pandas powerbi python sentiment-analysis sql textblob
Last synced: 17 Apr 2026
https://github.com/stdlib-js/array-one-to-like
Generate a linearly spaced numeric array whose elements increment by 1 starting from one and having the same length and data type as a provided input array.
array data float32array float64array int16array int32array javascript matrix ndarray node node-js nodejs stdlib structure typed typed-array types uint32array vector
Last synced: 20 Feb 2026
https://github.com/alexandregazagnes/rica-analysis
This repository contains the code to download, analyse, and model the RICA dataset from the French Ministry of Agriculture.
analysis argiculture business data data-analysis data-analytics food python
Last synced: 29 Apr 2026
https://github.com/simranjeet97/datastructures_algoritms_python
Data Structures and Algorithms using Python
algorithms arrays arrays-and-strings coding data data-science data-structures datastructures-python hashing interview-preparation interview-questions linked-list python stacks stacks-as-an-array
Last synced: 09 Apr 2026
https://github.com/famarks/grafarg
Grafarg is an interactive data analytics and graphical data visualization application. Grafarg, a progressive fork of Grafana 7.5.17, continues to be available under the open-source Apache 2.0 License.
analytics charts data data-analysis data-science data-visualization grafana grafarg graph
Last synced: 19 Jan 2026
https://github.com/davorg/dmp
Data Munging with Perl
book data hacktoberfest munging perl
Last synced: 21 Jan 2026
https://github.com/stdlib-js/array-base-assert-is-real-floating-point-data-type
Test if an input value is a supported array real-valued floating-point data type.
array assert base check data dtype is javascript node node-js nodejs stdlib test types util utilities utility utils valid validate
Last synced: 12 Oct 2025
https://github.com/malvfr/zap
Fill your database with fake data.
cli csv data database generator hacktoberfest mock node populate populate-database seed sql
Last synced: 21 Jan 2026
https://github.com/saroshfarhan/kaggle-playground-s4e12
Kaggle competition first attempt
analytics data data-analysis-python data-science
Last synced: 12 Oct 2025
https://github.com/soenneker/soenneker.quark.table
A native Blazor table component.
blazor blazorlibrary csharp data dotnet html quark quarktable table tables
Last synced: 13 Aug 2025
https://github.com/simonbernarding/ml_project_simonbernarding
This project focuses on predicting flight delays using historical data from a Tunisian airline. We analyzed patterns in airport operations and flight schedules to build a machine learning model that can forecast potential delays.
data data-science flight-delay-prediction machine-learning machinelearning prediction
Last synced: 12 Oct 2025
https://github.com/genert/metis
Asynchronous data sender library
analytics asynchronous data dependency-free typescript
Last synced: 27 Jan 2026
https://github.com/anobaka/insidecollector
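This software sits between Excel and a plain record-keeping tool: you can freely create all kinds of lists, link them together using various rules, and create custom views to help you better understand your data.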
collection data excel-like list list-manager table
Last synced: 19 Jan 2026
https://github.com/R-Mahesh45/HR---Resume-Text-Classification
Text Classification for Resumes: Conducted Exploratory Data Analysis (EDA) on a vast collection of resumes. Organized the data using Bag of Words (BoW) and TF-IDF techniques. Built and evaluated multiple models, with Logistic Regression delivering standout performance. Created Word Clouds and Histograms.
data datacleaning extract-transform-load feature-extraction nlp nltk-tokenizer text-mining text-processing
Last synced: 13 Oct 2025
https://github.com/r12habh/datacamp.com-micro_projects
data data-analysis data-science datascience python python3
Last synced: 23 May 2026
https://github.com/player29879/neum-ai
Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.
ai chatgpt data data-engineering database embeddings etl llm llmops mlops ops pipeline python rag retrieval vector-database vectors
Last synced: 18 Apr 2026
https://github.com/ompreetham/dcn-network-traffic-anomaly-detection
Data Communication Networks - Network Traffic Anomaly Detection
anomaly anomaly-detection communication data dcn keras learning machine machine-learning network pandas presentation project python scikit-learn tensorflow traffic
Last synced: 08 Apr 2026
https://github.com/athul64/powerbi
Financial Reports Dashboard: this repository showcases a Financial Reporting Dashboard that visualizes key financial metrics and performance insights. The dashboard contains Monthly and Annual reports, allowing users to switch between the two views to analyze data at different intervals.
data data-an data-visualization dax dax-expression powerbi
Last synced: 23 Feb 2026
https://github.com/lisakey/datacamp-data-analyst-python-sql-projects
Several projects completed during my Data Analyst 📊 training on the DataCamp platform with Python 🐍 and SQL 🗃️. Each project addresses real-world challenges using modern analytical tools and techniques.
analysis cleaning-data data dataanalysis dataanalyst matplotlib pandas python seaborn sql transformation visuali
Last synced: 19 Apr 2026
https://github.com/vikjam/ui-policy
Unemployment policy at the state level
data government government-data
Last synced: 13 Feb 2026
https://github.com/semibran/img-data
Easily read from and write to ImageData instances
Last synced: 11 Aug 2025
https://github.com/stdlib-js/ndarray-base-dtypes2signatures
Transform a list of array argument data types into a list of signatures.
api array base data dtype dtypes interface javascript multidimensional ndarray node node-js nodejs sig signatures stdlib types utilities utility utils
Last synced: 14 Apr 2026
https://github.com/souvik09-tech/adventure-works-kpi-dashboard
This repository contains a complete Business Intelligence solution for AdventureWorks, a global manufacturing company specializing in cycling equipment and accessories. Built using Power BI Desktop, this project helps track KPIs, analyze product performance, compare regional data, and identify high-value customers.
analysis data kpi powerbi visualization
Last synced: 27 Jan 2026
https://github.com/ibilalkayy/covid-tracking-app
This repository contains the code for a COVID-19 tracking app that shows COVID-19 data on Google Maps.
Last synced: 14 Oct 2025
https://github.com/intersystems-ib/workshop-healthcare-interop
Learn the basics in HealthCare Interoperability using InterSystems IRIS for Health
data fhir health hl7 interoperability
Last synced: 14 Apr 2026
https://github.com/open-i18n/data-iso-15924
Git mirror for ISO 15924, Codes for the representation of names of scripts data
data iso iso-15924 iso15924 open-i18n scripts unicode unicode-data writing-systems
Last synced: 14 Mar 2026
https://github.com/rubenhortas/python_examples
Examples of Python code and DSA (data structures and algorithms).
algorithm algorithms data dsa examples python python-3 python3 samples snippets structures
Last synced: 03 Oct 2025
https://github.com/akv3sic/cryptocurrency-charts
Cryptocurrency API data visualizations 📈 with Matplotlib.
cryptocurrency data data-visualization matplotlib python
Last synced: 16 Oct 2025
https://github.com/ddeutils/ddedocs
📖 Data Developer & Engineer Documents and Hands-On
blogs data data-engineering documents hands-on
Last synced: 08 Aug 2025
https://github.com/bishtrishu/pizza_sales_data_analysis_sql
This project is a comprehensive data analysis of pizza sales, aimed at uncovering key insights and trends to inform business decisions. Using a combination of SQL, Python, and data visualization tools, the project analyzes sales data to understand customer preferences, peak sales periods, and the most popular pizza types.
cloud data data-analysis data-science data-visualization dataanalytics database mysql oracle-database
Last synced: 14 Apr 2026
https://github.com/nicolasbizzozzero/datagenerator
Randomly generate various commonly used data
data data-generation data-generator data-science
Last synced: 18 Oct 2025
https://github.com/woctezuma/download-steam-screenshots-data
Data consisting of Steam screenshots.
Last synced: 19 Feb 2026
https://github.com/mscbuild/analysis
🎢 This collection of data analysis projects demonstrates techniques for extracting, transforming, analyzing, and visualizing data. Data Analytics Projects for Beginners 📈 ⚡
anallysis analysis chart csv dashboard data data-science data-science-projects excel google html5 mashine-learning portfolio pyton
Last synced: 19 Oct 2025
https://github.com/gematik/poc-isik-patient-merge
The repository contains a proof of concept (POC). The POC demonstrates how a FHIR subscription can be used to notify about merges that have occurred within the ISIK context.
Last synced: 19 Oct 2025
https://github.com/sourceduty/data_hardware
🖥️ Comparing various hardware configurations needed for different data sizes, from personal laptops to mainframes.
calculation computer-hardware computer-science computers data data-calculation data-hardware data-processing data-project hardware hardware-configuration hardware-requirements hardware-science math process-programming programming python
Last synced: 08 Aug 2025
https://github.com/jaldekoa/fiscaldataapi
A Python wrapper to easily retrieve data from the Fiscal Data (US Treasury) official API in pandas format.
api api-wrapper banking data finance pandas python united-states
Last synced: 27 Jan 2026
https://github.com/tiaanduplessis/country-currency-data
Data about currencies of countries
countries currencies data symbols
Last synced: 08 Aug 2025
https://github.com/lemniscate-world/stratai
This project analyzes financial assets using a Hidden Markov Model (HMM) to identify different market regimes and patterns. The analysis includes calculating daily returns, rolling volatility, and volume changes, and visualizing the hidden states identified by the HMM.
ai assets data data-science data-visualization finance financial-analysis fintech hmm-model hmmlearn machine-learning trading
Last synced: 23 Oct 2025
https://github.com/cisagov/cyhy-feeds
Tools to create and retrieve Cyber Hygiene (CyHy) data extracts
Last synced: 23 Oct 2025
https://github.com/dicook/tutorial_effective_data_plots
Materials for WOMBAT 2024 tutorial
data graphics inference statistics tidyverse visualisation
Last synced: 23 Jan 2026
https://github.com/purarue/git_doc_history
copy/track file history in git, with python bindings to traverse and extract history/files/lines at some date
Last synced: 17 May 2026
https://github.com/mustika-putri-m/analysis-of-sales-transactions-in-an-online-shop---london
Crucial questions: 1. How did sales trend over the months? 2. What are the most frequently purchased products? 3. How many products does the customer purchase in each transaction? 4. Which customer segments are the most profitable? 5. Based on your findings, what strategy could you recommend to the business to gain more profit?
data data-analysis-python data-analytics data-visualization ecommerce
Last synced: 24 Oct 2025
https://github.com/eshaagarwa/hr-analytics-project
Explore our HR Analytics Dashboard, a powerful Power BI project designed for HR managers and leaders. It analyzes essential KPIs such as Employee Count, Attrition Rate, and Job Satisfaction across various demographics.
dashboard data data-visualization dataanylasis ms-excel ms-excel-data-analytics powerbi statistics
Last synced: 23 Jan 2026
https://github.com/public-health-scotland/covid-19-publication-dashboard
Dashboard for weekly COVID-19 publication
coronavirus covid covid-19 covid-testing covid19-data dashboard data hospital-admissions lfd nhs public-health scotland shiny
Last synced: 24 Oct 2025
https://github.com/undistraction/grid-model
A small API for creating a grid and accessing the positions of the cells, rows and columns within it.
2d calculations cells data grid layout model
Last synced: 04 Aug 2025
https://github.com/simranjeet97/leetcode_practice
Practice solutions to LeetCode problems for competitive programming
algorithms amazon coding competitive-programming data data-structures facebook google leetcode python
Last synced: 03 Aug 2025
https://github.com/priyanshubiswas-tech/pwc-power-bi-task-1-2
Power BI dashboards analyzing Phonenow's call center performance and customer retention. Task 1 focuses on KPIs like satisfaction rating, call count, and agent efficiency. Task 2 analyzes retention trends and customer behavior to enhance loyalty. Built using Power BI, DAX, and Excel.
dashboard data data-analysis dax-measures excel powerbi powerbidashboard
Last synced: 23 Jan 2026
https://github.com/BenSFGamer/B.I.O.S.
A biographer
academia agi ai ai-tools artificial-general-intelligence artificial-neural-networks automation data fact-checking information-extraction information-retrieval self-improving self-learning self-referential semi-autonomous software-engineering specialization web-agent web-scraping writer
Last synced: 27 Sep 2025
https://github.com/doziestar/datavinci
DataVinci enables you to visualize data from various sources, generate insights, analyze data with AI models, and receive real-time updates on anomalies
Last synced: 23 Jan 2026
https://github.com/capire/xtravels-java
Travel booking app using master data from xflights built with CAP Java
cap cds data federation flights java reuse
Last synced: 23 Jan 2026