data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum
- Last updated: 2026-06-29 00:07:49 UTC
https://github.com/flyconnectome/hnf
Documentation for the hierarchical neuron format
annotations data dotprops hdf5 mesh neurons skeleton storage
Last synced: 17 Jan 2026
https://github.com/eshan-sud/secureit
A Blockchain-based Data Sovereignty Platform
blockchain data decentralised-application platform sovereignty
Last synced: 21 Jan 2026
https://github.com/vdoninav/real_estate_analysis
real estate analysis
data data-analysis data-analysis-python data-science pandas pandas-dataframe pandas-python plotly plotly-express scipy seaborn streamlit streamlit-application streamlit-dashboard streamlit-webapp
Last synced: 12 Apr 2026
https://github.com/amethyst-php/courier
amethyst amethyst-package api courier data laravel
Last synced: 17 May 2026
https://github.com/tabarzin/dh
A collection of links to various resources on Digital Humanities
data digitalhumanities opensource
Last synced: 24 Jan 2026
https://github.com/anilanadella/facebook_user_data_analysis_project
Jupyter project for analyzing Facebook user data
data data-visualization dataanalysis dataanalysis-projects dataanalysisusingpython jupyter jupyter-notebook pandas-library pandas-python python python3
Last synced: 13 May 2026
https://github.com/blueheron786/quranic-universal-library-mushaf-layouts
The Quranic Universal Library (QUL)'s Qur'an mushaf 15-line layouts (madini, uthmani)
data database layout mushaf quran sqlite uthmani uthmani-quran
Last synced: 13 Apr 2026
https://github.com/sakshamarora07/whatsapp-chat-analyser
This repository contains code for a WhatsApp Chat Analyzer that uses Python libraries to extract insights from chat messages.
chat data dataanalytics datascience matplotlib pandas python seaborn statistics streamlit whatsapp
Last synced: 04 Jan 2026
https://github.com/isandyawan/simplelinearregression
An application for analyzing data using simple linear regression. It can build a regression model from the chosen variables and advise the user if the model violates the regression assumptions.
data linear r regression rstudio shiny statistic
Last synced: 14 Oct 2025
https://github.com/darshjasani/insurance-claim-analysis
This dataset contains insightful information related to insurance claims, giving us an in-depth look into the demographic patterns of those receiving them.
Last synced: 27 Aug 2025
https://github.com/brandonzylstra/essence
🧘🏼‍♂️ Relaxed Rails Modeling & Migrations
active-record data database gem hcl modeling rails ruby ruby-on-rails yaml
Last synced: 14 Apr 2026
https://github.com/gagolews/datafusion
Data Fusion (open-access research monograph, 2015)
aggregation data fusion fuzzy-logic mean multidimensional-analysis multidimensional-data spread statistics strings variance
Last synced: 16 Mar 2025
https://github.com/writetome51/page-load-access
A TypeScript/JavaScript class that loads a batch (array) of data from a larger set that is too big to be loaded all at once.
batch class data javascript load loader typescript
Last synced: 16 May 2026
https://github.com/desininja/food-delivery-realtime-data-analysis
ETL Pipeline in AWS for Real Time Data Analysis
airflow data data-engineering emr-cluster etl kinesis kinesis-strea real-time redshift
Last synced: 15 Oct 2025
https://github.com/anisimov-anthony/data_forest
Implementation of various types of trees
algorithms-and-data-structures data lib rust tree
Last synced: 28 Apr 2025
https://github.com/science-analyse/clv_model
customer lifetime value prediction
banking banking-applications clv clv-analysis data data-science machine-learning
Last synced: 15 Oct 2025
https://github.com/robthree/cfnreader
Provides a simple way to read FNIRSI's CFN files (*.cfn) produced by the FNIRSI UsbMeter tool
cfn csv data fnirsi usb usb-tester
Last synced: 01 Mar 2025
https://github.com/fehmitahsindemirkan/web-scrapper
Professional and high performance web scraping project.
data ecommerce emailsender fileexplorer logging python web webscraping
Last synced: 10 Jan 2026
https://github.com/davitshahnazaryan3/data-management-web
Explore datasets with ease using taxonomy filtering, allowing you to quickly identify the specific experimental datasets you need and download them effortlessly
data environmental experiments filtering-data seismic taxonomy
Last synced: 17 Jan 2026
https://github.com/jigyasag18/project-diwali-sales-analysis
This project analyzes retail sales data during the Diwali festival using exploratory data analysis (EDA) to identify buyer demographics and product preferences. The findings reveal that the primary purchasers are married women aged 26-35 from Uttar Pradesh, Maharashtra, and Karnataka, working in IT, Healthcare, and Aviation.
analysis data datapr datapro eda jupyter-notebook python realtimedata
Last synced: 01 Jun 2026
https://github.com/dcmox/algorithms
General purpose data structures and algorithms
algorithms binary data hash linked list structures tree
Last synced: 10 Jun 2026
https://github.com/bdr-pro/streamlint
Ultra-cool Streamlit app where you can interact with widgets, see data in action, and even upload and download files
Last synced: 14 Apr 2026
https://github.com/writetome51/public-data-container-interface
Just a TypeScript interface with 1 property: 'data'
container data interface typescript
Last synced: 15 May 2026
https://github.com/saboye/sales-performance-analysis
A dashboard that presents monthly sales performance by product segment and product category to help clients identify the segments and categories that have met or exceeded their sales targets, as well as those that have not.
dashboard data data-science eda tableau visualization
Last synced: 27 Jan 2026
https://github.com/mat06mat/matbot
My discord bot code
data discord-bot discord-py py-cord
Last synced: 17 Oct 2025
https://github.com/ronknight/user-data-dashboard
📈 A data visualization tool for analyzing user data using an Excel-based data source.
dashboard data excel ga4 screenshot
Last synced: 17 Oct 2025
https://github.com/miniql/miniql-csv
A MiniQL query resolver that loads data from CSV files.
comma-separated-values csv data query query-language
Last synced: 08 May 2026
https://github.com/muhammadadilnaeem/student-performance-indicater-end-to-end-data-science-project
This project leverages data science techniques to build a predictive model that estimates a student's exam performance. The project follows a structured data science workflow, including data collection, preprocessing, model building, evaluation, and deployment.
data machine-learning-algorithms pandas pymysql python sql
Last synced: 11 Apr 2026
https://github.com/luminati-io/jupyter-notebooks-web-scraping
Perform web scraping interactively using Jupyter Notebooks, integrating coding, data analysis, and visualization into one seamless workflow.
beautifulsoup4 data jupyter jupyter-notebook pandas python requests seaborn virtual-environment web-scraper web-scraping
Last synced: 13 Apr 2026
https://github.com/anand-sony/mttr-dashboard
Streamlit dashboard for MTTR analysis with shift-wise loss insights and machine-level downtime tracking.
analytics business-analytics dashboard data python statistical-analysis
Last synced: 30 May 2026
https://github.com/natarizkie2/neurochain-airdrop-bot
🍋 — A smart bot designed to complete data tasks like true/false selections automatically, with multi-account support for extra convenience.
airdrop automated bot data multi-account natarizkie neurochain nodejs web3
Last synced: 10 Jun 2026
https://github.com/yeti-robotics/past-scouting-data
❄️ Scouting Data from Previous Events/Seasons ❄️
Last synced: 06 Jan 2026
https://github.com/newrelic-experimental/newrelic-java-atomikos
Gives status of Atomikos Data Sources since this information is unavailable via JMX
atomikos data instrumentation java nrlabs nrlabs-data nrlabs-java-verify nrlabs-odp observability-data
Last synced: 30 May 2026
https://github.com/woctezuma/recent-sales-data
Data available to estimate sales of Steam games during release week.
Last synced: 05 Feb 2026
https://github.com/meokullu/prefill
PreFill adds desired characters onto output values to increase their legibility.
alignment data data-analysis data-engineering data-science legibility
Last synced: 17 Jan 2026
https://github.com/rosette-api/mock-data
Mock data that is used for unit testing of the Babel Street Analytics bindings
data entity-extraction entity-level-sentiment entity-linking entity-relationship entity-resolution language-detection machine-learning mock-data morphology natural-language-processing nlp relation-extraction sentiment-analysis test-framework testing text-mining text-processing tokenization
Last synced: 04 Mar 2026
https://github.com/taquece/goals-per-match
Basic script to calculate average football goals per match from a .csv file
beginner csv data football nodejs python sports-analytics
Last synced: 09 May 2026
https://github.com/blackroad-os-inc/blackroad-portal
BlackRoad Portal — unified search routing to 30+ BlackRoad services.
blackroad cloudflare-workers data search
Last synced: 04 Apr 2026
https://github.com/chompfoods/sdk-typescript-angular
Angular TypeScript SDK for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.
angular api branded chomp data database food grocery ingredients nutrition raw recipe-api recipes sdk typescript
Last synced: 09 May 2026
https://github.com/otoneko1102/roulette-base
A collection of roulette colors and numbers in JSON format. Use it when making a casino-style roulette.
casino casino-games data json require roulette
Last synced: 16 Mar 2025
https://github.com/desoga10/nety-form
In this tutorial, I show you how to send data from a form to the Netlify dashboard. I also show you how to create a form using Materialize.
contact-form css css3 data form forms html html5 materialize materialize-css materializecss-framework netlify
Last synced: 03 Jan 2026
https://github.com/athari22/analyzing-the-yelp-dataset
SQL for Data Science
analytics data data-science data-structures er sql
Last synced: 27 Jan 2026
https://github.com/gabrieldim/complete-analysis-covid-19
Analysis of COVID-19.
analysis covid-19 covid19 data data-science science virus
Last synced: 23 Jan 2026
https://github.com/andrewl/danelaw
Geopackage containing the boundary of the Danelaw
data geospatial medieval viking
Last synced: 23 Jan 2026
https://github.com/olekscode/datageneration
Exploring the methods of data generation for different Machine Learning algorithms
data javascript machine-learning
Last synced: 05 Apr 2025
https://github.com/itrauco/data-dirtying-tool
a simple command line tool to generate dirty data and do common data things in google cloud
data data-analysis data-engineering data-ops data-pipeline data-science data-visualization data-wrangling dirty-data google-cloud machine-learning
Last synced: 24 Feb 2025
https://github.com/tupizz/python-data-manipulation
Data manipulation and visualization with Python 2.x
Last synced: 09 May 2026
https://github.com/mattjesc/ddo-semiconductor
Data-Driven Optimization of Semiconductor Processes and Forecasting
ai artificial-intelligence data data-science data-visualization deep-learning keras machine-learning manufacturing ml prophet python pytorch semiconductor semiconductor-manufacturing semiconductors tensorflow
Last synced: 23 Feb 2026
https://github.com/luminati-io/LinkedIn-dataset-samples
Sample dataset of 1001 LinkedIn companies, extracted via Bright Data API, featuring essential data points for competitive analysis and market insights.
data database dataset linkedin linkedin-api linkedin-data linkedin-dataset linkedin-scraper sample web-scraping
Last synced: 09 Apr 2025
https://github.com/deliprofesor/breast-cancer-detection-using-svm-with-smote-and-model-optimization
This project analyzes health and lifestyle factors influencing heart attack risk using statistical methods and machine learning, with Ridge Regression identified as the best predictive model.
classification data data-preprocessing data-science data-visualization gridsearchcv machine-learning python roc-curve smote svm
Last synced: 10 Apr 2025
https://github.com/mukhlishga/data-engineering
all about data engineering
airflow beam data data-engineering pyspark python
Last synced: 13 Apr 2026
https://github.com/remcostoeten/github-and-vercel-api-showcase-dashboard
Showcases data fetched from the GitHub and Vercel APIs, built entirely in vanilla JS.
api-rest da data express-js github-api nodejs vercel-api
Last synced: 07 Mar 2026
https://github.com/knowcnu12/metamask-wallet-recovery-funds-phrase-data-seed-token
This repository provides tools and guidelines for securely recovering MetaMask Wallet funds using recovery phrases, seed data, and tokens. It ensures safe and reliable methods for recovering access to your wallet and managing your cryptocurrency assets.
bitcoin blockchain cryptocurrencies cryptocurrency data ethereum funds metamask metamask-bot metamask-desktop metamask-extension metamask-plugin metamask-snap metamask-wallet phrase recovery seed token wallet wallet-security
Last synced: 08 Mar 2026
https://github.com/abhroroy365/market_analysis
This project explores customer segmentation and market analysis in the context of online retail using an online retail dataset. By applying advanced analytics, we aim to uncover insights that can drive strategic decisions and enhance business performance.
clustering data data-analysis data-visualization kmeans-clustering machine-learning market-analysis python silhouette-analysis
Last synced: 09 May 2026
https://github.com/musamairshad/dsa-python
This repository contains all the material related to Data Structures and Algorithms implemented in Python.
algorithms data datastructures efficiency python searching-algorithms sorting-algorithms
Last synced: 25 Mar 2025
https://github.com/farhashaad/farhashaad98
This is a repository to showcase my skills, share projects and track my progress in Data Science related projects.
data data-visualization dataanalysis matplotlib pandas python seaborn sql tableau
Last synced: 24 Apr 2026
https://github.com/mikeasilva/api_data
API Data makes working with open data APIs easy.
Last synced: 23 Jan 2026
https://github.com/sakshamarora07/blinkit-sales-report-power-bi
This dashboard provides Blinkit with insights to optimize its grocery delivery operations and understand customer preferences. It evaluates sales trends, outlet performance, and item categories to identify key areas for improvement. The interactive visuals allow detailed exploration of sales distribution, customer ratings, and product popularity.
data data-science dataanalytics datavisualization excel powerbi sql
Last synced: 08 Jan 2026
https://github.com/iqbalmind/learn-python-data-scientist
IqbalMind Playground for python data scientist
data data-analysis data-visualization datascience datascientist datascientisttraining python python-playground
Last synced: 16 Mar 2025
https://github.com/pdoup/enegry
Time-Series dataset combining multiple sources to explain the broader Greek energy market
data dataset day-ahead-auction energy-markets exploratory-data-analysis forecasting futures-market greek-energy-market renewable-energy time-series-data weather-data
Last synced: 07 May 2025
https://github.com/quangandrei1003/france_air_pollution_pipeline
End-to-end air pollution data pipeline for French metropolitan cities using Airflow, Python, dbt, BigQuery.
airflow bigquery data data-analytics data-engineering data-modeling data-visualization dbt docker etl pandas python terraform
Last synced: 13 Apr 2026
https://github.com/white-gecko/lineage-dump
RDF dump of the device information from the lineage wiki
Last synced: 28 May 2026
https://github.com/etmendz/mendz.data.oracle
Provides a generic Mendz.Data-aware context for ADO.Net-compatible access to Oracle databases.
ado-net context data database datasettings mendz oracle
Last synced: 13 Apr 2026
https://github.com/so-cool/junction
My solution to the University of Bristol "Bristol Journey Time" Data Challenge https://So-Cool.github.io/junction
competition data modelling timeseries
Last synced: 02 Apr 2025
https://github.com/flexthink/matricize
A convenience library to convert between pure Python objects and their vectorized representations
data machine-learning numpy python
Last synced: 09 May 2026
https://github.com/maximiliancw/completely
Measure your data completeness
data data-cleaning data-quality data-science missing-data
Last synced: 25 Jun 2025
https://github.com/stefanpietrusky/factsv2
Repository for the article in the online magazine TDS.
ai arxiv-papers beautifulsoup data flask-application gensim llama matplotlib ollama plotly pyldavis python selenium webdriver
Last synced: 09 Apr 2025
https://github.com/ismailhakkii/digital_vault
This project can be used for securing data, similar to a real vault.
data digital security-data vault
Last synced: 25 Mar 2025
https://github.com/naliferopoulos/datamining
Bring your own pickaxe.
aueb aueb-students data data-mining machine-learning machine-learning-algorithms mining random-forest
Last synced: 25 Jan 2026
https://github.com/naitiknayak196/tech-layoffs-cleaning-sql-vs-python
This project cleans and analyzes a tech layoffs dataset using MySQL and Python (Pandas) to compare their efficiency in data processing. It provides business insights into workforce trends, industry stability, and economic impacts to support data-driven decision-making.
data datacleaning dataset jyputer-notebook layoffdata layoffs mysql python sql
Last synced: 09 May 2026
https://github.com/bcongdon/nid-data
National Inventory of Dams Data
data datasette government-data
Last synced: 21 Apr 2026
https://github.com/ahmad-ali-rafique/linear-regression-modeling
In-depth exploration of linear regression models, including data cleaning, model building, and performance evaluation on various datasets.
artificial-intelligence data dataanalytics linear-models linear-regression model multilinear-regression regression regression-models
Last synced: 19 Apr 2026
https://github.com/eng-gabrielscardoso/data-science-formation
Data science course walkthrough
data data-science data-visualisation google-colab google-colaboratory google-colaboratory-notebooks python r r-lang
Last synced: 28 Feb 2025
https://github.com/mfurmanczyk/wh-sales
E-commerce analytics data warehouse ETL made with Apache Spark.
airflow data data-engineering data-warehouse kotlin python spark
Last synced: 24 Jan 2026
https://github.com/robertoostenveld/dccn.dsc_3015055.00_583_v1
The FieldTrip-SimBio Pipeline for EEG Forward Solutions [Data set].
Last synced: 24 Jan 2026
https://github.com/yash-rewalia/airbnb_eda_pandas
The goal of the project is to gather and analyze detailed information about the different listings in order to provide insights into the host and price of a property in a particular area, according to your preferences, the type of room, and the number of reviews.
data data-cleaning data-insights data-preprocessing data-visualization matplotlib numpy pandas python seaborn
Last synced: 11 Apr 2026
https://github.com/wlgs/got-dialogues-data-stats
Game of Thrones dialogues data statistics processed with R and SQLite. Project for Probability and Statistics course 21/22 at AGH UST. The project was about manipulating data and getting many pieces of information from it in addition to visualizing these results.
data game-of-thrones got r statistics stats
Last synced: 22 May 2026
https://github.com/jpb06/kubot-dal
data data-access-layer gulp-tasks mongodb typescript
Last synced: 12 Apr 2026
https://github.com/muhammed-fazal/student-success-and-early-intervention-analytics-system
Aims to consolidate scattered student performance records into a unified data warehouse in SQL Server, build interactive Power BI dashboards that visualize academic trends and student performance, and implement predictive analytics.
analysis analytics dashboard data data-analysis data-engineering data-science data-visualization database etl etl-pipeline power-bi powerbi python sql sql-server
Last synced: 29 May 2026
https://github.com/woctezuma/steamspy-data
Data snapshot from SteamSpy.
data data-dump data-dumps steam steam-data steamspy steamspy-api
Last synced: 07 Jan 2026
https://github.com/sanchittechnogeek/overscripted-analysis
Geolocation and user language extraction analysis from Mozilla Overscripted dataset
analysis data data-analysis mozilla
Last synced: 23 Mar 2025
https://github.com/arnavk-09/phishing-detection
🎣 Detect Phishing URLs with Data Pre-fitted... API & Web UI
csv data fastapi flask python scikit-learn
Last synced: 03 May 2026
https://github.com/qrailibs/dataflow
✨ Data processing in Node.js made multithreaded and type-safe.
data dataprocessing multithread node
Last synced: 04 May 2026
https://github.com/fallaciousreasoning/nz-mountains
A list of mountains in NZ, scraped from https://climbnz.org.nz
alpine climbing climbnz data json json-api maps mountaineering scraping
Last synced: 04 May 2026
https://github.com/dimitryzub/russo-ukraine-war-prediction-losses
Highlights Russian losses with predictions based on historical data from the Ministry of Defence of Ukraine 🐱‍👤
data dataanalysis dataanalytics matplotlib pandas prophet python
Last synced: 04 May 2026
https://github.com/gabya06/twitter_models
Repository used for twitter impression models
data data-science impressions machinelearning python ridge-regression sklearn twitter
Last synced: 04 May 2026
https://github.com/chanchalsoorma/web-scraping
This repo aims to provide a straightforward, easy-to-use scraping code written in Python.
beautifulsoup beautifulsoup4 data python request selenium webscraping
Last synced: 05 May 2026
https://github.com/sohomm/predict-insurance-charges
A predictive model that estimates insurance charges based on a client's attributes, such as age and health factors. It offers a practical application of ML in business, enabling more accurate pricing models and helping companies manage risk while delivering personalized pricing strategies to clients.
administration algorithm bot data decision-trees download easy finance github java machine-learning management model neural-network nlp prediction project science trading university
Last synced: 05 May 2026
https://github.com/lexz-08/sharpdata
Easily manage DataGridViews or create one with the struct 'DataGridManager' provided.
csharp data datagridview ui user-interface windows windows-forms winforms
Last synced: 06 May 2026