data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-07-03 00:07:49 UTC
- JSON Representation
https://github.com/kaijagahm/2023-10-20-stlzoo
Data Carpentry workshop, hosted at the St. Louis Zoo. Beta testing the new ecology data lesson.
data data-science ecology r rstudio
Last synced: 05 Feb 2026
https://github.com/steventhompson6460-stack/octoparse-government-listings-scraper
Octoparse workflow for structured government data
data extraction government listings octoparse public-records python scraper scrapy structured web-crawling workflow
Last synced: 31 May 2026
https://github.com/theopenwebjp/theopenweb-data-loader
Package for loading data to local project
data downloader import javascript typings
Last synced: 10 Oct 2025
https://github.com/amirreza81/kaggle-pandas-course-solutions
Kaggle Pandas Course - Solved exercises in another way of sample solution
data data-analysis data-cleaning data-manipulation data-science dataframe jupyter-notebook kaggle machine-learning open-source pandas
Last synced: 14 Apr 2026
https://github.com/loggdme/kyro
Collection of utilities and examples for creating efficient data pipelines in go with parallel queues and, rate limitiers and much more.
Last synced: 14 Jan 2026
https://github.com/dumkydewilde/mcp-memory-layer
A template for building your own BI MCP with dbt, LLMs and multi-user corrections
Last synced: 13 Mar 2026
https://github.com/jatin-mehra119/paris_housing_price-kaggle-
Paris Housing Price Kaggle Competiton
data data-visualization kaggle-competition machine-learning numpy pandas predictive-modeling scikit-learn
Last synced: 29 Apr 2026
https://github.com/aldro61/mmit-data
The data used in the Maximum Margin Interval Trees paper
data machine-learning machine-learning-algorithms reproducible-research
Last synced: 19 Feb 2026
https://github.com/madhuresh2011/daily-sql-from-hackerrank
Welcome to my SQL Series, where I tackle SQL problems from HackerRank on a daily basis.
data dataanalysis database question-answering sql
Last synced: 19 Jan 2026
https://github.com/ckongala/data-warehouse-concepts
Data Warehouse Basics
data data-engineering data-warehouse data-warehouse-architecture data-warehouse-construction data-warehousing
Last synced: 13 Oct 2025
https://github.com/jhpoelen/bees
Content-based iDigBio prototype
biodiversity data ecololgical informatics provenance
Last synced: 18 Mar 2026
https://github.com/luminati-io/httpx-web-scraping
Web scraping using HTTPX in Python, covering setup, advanced features, comparisons with Requests, and more.
beautifulsoup data html httpx python web-scraper web-scraping
Last synced: 13 Oct 2025
https://github.com/flowsta/ods-educacion-aporta
ODS para educación, iniciativa APORTA 2021
data data-visualization ods sdg
Last synced: 27 Jan 2026
https://github.com/fnu-ankit/8-week-sql-challenge
My attempt on solving Case studies from #8WeeksSQLChallenge
8-week-sql-challenge 8-weeks-sql-challenge 8weeksqlchallenge case-study data data-analysis data-analysis-sql data-analytics database datawithdanny sql sqlserver
Last synced: 19 Apr 2026
https://github.com/digital-media/cv_data
Datasets used for courses/tutorials at the Digital Media Department
computer-vision data image-processing images
Last synced: 14 Oct 2025
https://github.com/isandyawan/simplelinearregression
A application to analyze data using simple linear regression. This application can make regression model from variable and give advice to user if the model break regression assumsion
data linear r regression rstudio shiny statistic
Last synced: 14 Oct 2025
https://github.com/mominurr/fire-gas-leak-detection-system
A real-time fire prevention system integrating IoT sensors and computer vision to trigger evacuations.
ai computer-vision data datascience machine-learning ml python yolo
Last synced: 27 Jan 2026
https://github.com/yagoluiz/enem-analise-extracao
[PT-BR] Extração e análise de dados do desempenho da região Centro-Oeste
analysis data extraction python3 r
Last synced: 17 Apr 2026
https://github.com/jigyasag18/project-diwali-sales-analysis
This project analyzes retail sales data during the Diwali festival using exploratory data analysis (EDA) to identify buyer demographics and product preferences. The findings reveal that the primary purchasers are married women aged 26-35 from Uttar Pradesh, Maharashtra, and Karnataka, working in IT, Healthcare, and Aviation.
analysis data datapr datapro eda jupyter-notebook python realtimedata
Last synced: 01 Jun 2026
https://github.com/bdr-pro/streamlint
ltra-cool Streamlit app, where you can interact with widgets, see data in action, and even upload and download files
Last synced: 14 Apr 2026
https://github.com/dpbm/depencies-sets
append multiple depencies to your python project quickly
data dependencies dependencies-list dependencies-manager dependencies-set frameworks libraries lists pip py python python3 web
Last synced: 17 Oct 2025
https://github.com/enoch208/eventmaster
A user-friendly application that helps you easily record and play back your keyboard and mouse actions. With its modern design using `tkinter` and `ttkthemes`, it provides a smooth and easy-to-use interface. The app combines reliable technical features to give you a great experience.
automation data key keylogging-python replay spy tools
Last synced: 01 Jun 2026
https://github.com/robertoostenveld/dcn.dsc_62002071_01_114_v1
Simon task M/EEG data [Data set].
Last synced: 23 Jan 2026
https://github.com/athari22/analyzing-the-yelp-dataset
SQL for Data Science
analytics data data-science data-structures er sql
Last synced: 27 Jan 2026
https://github.com/shubhamsoni98/prediction-with-binomial-logistic-regression
To predict client subscription to term deposits and optimize marketing strategies by identifying potential subscribers.
binomial data data-science eda machine-learning matplotlib pipeline python scikit-learn seaborn sklearn sql visualization
Last synced: 06 Feb 2026
https://github.com/andrewl/danelaw
Geopackage containing the boundary of the Danelaw
data geospatial medieval viking
Last synced: 23 Jan 2026
https://github.com/kenjyco/libs
Easily install kenjyco libs
api cli command-line data helper kenjyco libs python
Last synced: 16 May 2026
https://github.com/harmanveer-2546/reducing-data-entries
Way to delete data entries from csv/excel file using. For excel file, use excel instead of csv in the code.
csv data data-entry delete-data excel numpy pandas python
Last synced: 05 May 2026
https://github.com/prajjwol09/power-bi-project
The Data Survey Breakdown is an interactive Power BI dashboard designed to present insights gathered from a survey of professionals and enthusiasts in the data industry.
dashboard data interactive powerbi survey
Last synced: 15 Mar 2026
https://github.com/miriswisdom/coral.bells
Guiding and Reassuring Safety, Holistically and Empathetically
civic community data engagement govhack open safety
Last synced: 28 Jan 2026
https://github.com/tomquirk/sunshine-coast-council-rates-data
Rates data for the Sunshine Coast, Australia
australia data property rates real-estate
Last synced: 24 Feb 2026
https://github.com/alsult/alsult
Aliia Sultanova Portfolio
data datascience programming python
Last synced: 23 Jan 2026
https://github.com/prateekmaj21/tableau-public-links
Tableau work as part of Data Visualization [AI&DS_205]
data data-visualization dataanalytics tableau-public
Last synced: 24 Jan 2026
https://github.com/sahraiidle/email-spam-detector
Email/SMS spam detector with a Flask UI/API, tuned ML models (TF‑IDF + SVM/LogReg/NB), and a ready-to-run web form plus JSON endpoint for predictions.
data machine-learning numpy pandas python randomforest scikit-learn spam-classifier spam-detection svm
Last synced: 24 Jan 2026
https://github.com/semcod/code2llm
Python Code Flow Analysis Tool - Static analysis for control flow graphs (CFG), data flow graphs (DFG), and call graph extraction
ast cfg code code2data code2logic code2process data dfg diagram flow graphs llm
Last synced: 01 Jun 2026
https://github.com/cmdrvl/rvl
rvl reveals the smallest set of numeric changes that explain what actually changed between two datasets — or confidently tells you nothing changed.
cli csv data data-quality data-validation diff finance numerical-analysis open-source ops rust tooling
Last synced: 25 Feb 2026
https://github.com/spatialcurrent/go-flat
Recursively flatten a slice of slices.
Last synced: 29 Jan 2026
https://github.com/aimin-nur/data-analyst-model-predictive
Sebuah Project data analyst yang bertujuan untuk mengindentifikasi karakteristik customer untuk menerima penawaran campaign marketing.
analyst data mechine-learning visualization
Last synced: 29 Jan 2026
https://github.com/tpltnt/wir_vs_virus_hackathon_projects
A list of all projects / challenges for the WirVsVirus hackathon as CSV
coronavirus csv data hackathon raw-data
Last synced: 29 Jan 2026
https://github.com/chenxingqiang/modeling_tabular_data
# modeling_tabular_data | Keywords: modeling_tabular_data focusing on modeling_tabular_data.
Last synced: 30 Jan 2026
https://github.com/opendatach/alds
a colaborative list of resources and ideas to enable "Amt Local Data Stewards" to manage the (open) data of their respective federal office
awesome-list data datagovernance dataliteracy datamanagement datastewardship opendata opengovernmentdata
Last synced: 31 Jan 2026
https://github.com/ms140569/loki-example-store
Testdata for loki password manager
Last synced: 26 Feb 2026
https://github.com/drostlab/biodbretrievr
Retrieve and efficiently index entire biological sequence databases
biological-data biological-sequences data databasestoring retrieval
Last synced: 26 Feb 2026
https://github.com/ymorsi7/quranicvisualization
A visual exploration tool for the Holy Quran using D3.js treemaps.
css d3 d3js data data-visualization html islam islamic javascript js quran quranic treemaps visualization
Last synced: 15 Apr 2026
https://github.com/matt-dray/draytasets
:1234::disguised_face: Miscellaneous datasets I've collected or prepared
Last synced: 09 Feb 2026
https://github.com/samaalharbi2/project-recommendation-system
This project focuses on building a Recommendation System using real interaction data from IBM's Watson Studio platform.
clustering data ibm-watson kmeans nlp python rec svd udacity-nanodegree
Last synced: 09 Feb 2026
https://github.com/neurazum-ai-department/tumor-stages-dataset---v1
Synthetic MRI data generated by the ‘HF’ and 'Vbai' models based on real data.
brain data dataset datasets image mri neuroscience tumor tumor-segmentation
Last synced: 18 Mar 2026
https://github.com/javdomgom/nifi-custom-processors
Apache NiFi custom processors
apache-nifi bigdata data data-engineering datascience flowfile nifi nifi-custom-processor
Last synced: 27 Feb 2026
https://github.com/paladini/aa-daily-reflections-database
Alcoholics Anonymous (AA) Daily Reflections in English, Spanish, French and Brazilian Portuguese
aa alcoholics-anonymous daily-reflections data database reflections
Last synced: 16 Apr 2026
https://github.com/abhinavrobinson/mc-community-world
Minecraft community world data.
Last synced: 27 Feb 2026
https://github.com/miozilla/snowden
snowden :snowman::video_game: : VR Game # Snowflake # Data Engineering # ELT
data elt engineering snowflake sql vr-game
Last synced: 11 Feb 2026
https://github.com/anandanraju/power_bi_dashboard_projects
The goal of this project is to provide insights into consumer behavior and purchasing trends across different platforms. By analyzing data from Amazon and other sources, we aim to uncover valuable insights that can inform marketing strategies, product development, and decision-making processes.
amazon dashboard data data-visualization healthcare powerbi project
Last synced: 11 Feb 2026
https://github.com/kunalthakur204/visualization-on-flower
🌸 Flower Dataset Visualization Visualizing patterns and relationships in flower data through charts and plots. Perfect for exploring floral characteristics and trends! 📊
data data-visualization dataanalysis flowerdataset python
Last synced: 16 Apr 2026
https://github.com/kirillsemyonkin/lsd
LSD (Less Syntax Data) configuration/data transfer format.
configuration data java parsing rust
Last synced: 27 Feb 2026
https://github.com/soenneker/soenneker.dtos.requestdataoptions
A flexible request options object for paging, sorting, and filtering queryable data, similar to OData-style parameters.
controller coordinator csharp data dotnet dto dtos http manager object odata options request requestdataoptions
Last synced: 12 Mar 2026
https://github.com/imartinezl/madrid-challenge
Madrid Route Optimization Challenge 🚚♻️🚚
challenge city data optimization routing-algorithm traffic
Last synced: 28 Feb 2026
https://github.com/lijesh010/roadaccidentanalysisproject
This data analysis project was completed using MS Excel, and includes the creation of a dashboard.
data data-analytics data-exploration data-visualization msexcel
Last synced: 15 Feb 2026
https://github.com/j2kun/terrorism-usa-post-9-11
A copy of the terror data published by NewAmerica
data politics terrorism transparency
Last synced: 02 Mar 2026
https://github.com/inzhenerka/scooters_data_generator
Generate data of scooter trips for analysis
Last synced: 02 Jun 2026
https://github.com/ineelhere/langchain-chat-with-your-data
LangChain Chat with Your Data course from DeepLearning.AI and LangChain
chatapplication chatgpt data deeplearning-ai deeplearning-notebooks jupyter-notebooks langchain langchain-python openai-api opensource personalised-learning python3
Last synced: 16 Apr 2026
https://github.com/colesmcintosh/colesmcintosh.github.io
My portfolio site :)
ai automation data llms open-source
Last synced: 04 Mar 2026
https://github.com/erickpeirson/jhb-data
Data from the forthcoming paper: Quantitative Perspectives on Fifty Years of the Journal of the History of Biology
data geolocation history-of-biology named-entity-recognition topic-modeling
Last synced: 04 Mar 2026
https://github.com/amethyst-php/collection
Simple as the name, this package allow you to create collection of other models.
amethyst amethyst-package api collection data laravel
Last synced: 17 Apr 2026
https://github.com/ashfaqalizardariofficial/databasehelper
A C# database helper library to connect with the database server and perform actions insert, update, delete, select data and select multiple data from the database.
ashfaq-ali-zardari ashfaq-ali-zardari-official data database delete helper insert ms-sql-server multiple select-data server sql-server update
Last synced: 02 Apr 2026
https://github.com/cnr-ibba/smarter-repository
SMARTER Data Repository
bootstrap5 data django repository smarter
Last synced: 03 Apr 2026
https://github.com/cloud-shuttle/drover-sqlforge
The Data Automation Engine. A blazing-fast, pure Go alternative to dbt for data transformations.
ast data drover sql transformation
Last synced: 03 Jun 2026
https://github.com/awhipp/forex-api-export
API Service that pulls forex data and returns CSV file based on the parameters
data forex forex-trading oanda oanda-api-v20 trading
Last synced: 04 Jun 2026
https://github.com/yuvrajsaraogi/sales-prediction-using-python
Sales prediction involves estimating future product sales based on factors like advertising spend, target audience, and platform. Businesses rely on data scientists to forecast sales and optimize advertising costs. Machine learning in Python can be used for this task.
data data-analysis data-science data-visualization machine-learning matplotlib natural-language-processing numpy pandas prediction python sales-prediction-using-python sql
Last synced: 19 Apr 2026
https://github.com/mksingh431/free-data-science-courses
Data science is a rapidly growing tech field that’s transforming business decision-making. To break into this field, you need the right skills. Fortunately, top institutions like Harvard and IBM offer free online courses. These courses cover everything from basic programming to advanced machine learning.
course data data-analysis data-science data-visualization free freecou python
Last synced: 19 Apr 2026
https://github.com/montanaz0r/suicide-rate-analysis
Testing a significance of the correlation between a suicide rate and a number of psychiatrists and psychologists working in the mental health sector
analysis correlation data data-analysis data-science jupyter-notebook jupyter-notebooks matplotlib numpy pandas psychology python python-3 seaborn statistics suicide-rate
Last synced: 20 Apr 2026
https://github.com/omers/sre-devops-tools
Tools and useful sources for SRE and DevOps
awsome awsome-list data devops monitoring sre tools
Last synced: 20 Apr 2026
https://github.com/nikoheikkila/maps
A TypeScript collection of specialized map implementations
data javascript maps typescript
Last synced: 20 Apr 2026
https://github.com/fastpix/android-data-kaltura
This SDK enables seamless integration with Kaltura Player, offering advanced video analytics via the FastPix Dashboard
analytics android-sdk data fastpix kaltura kaltura-player metrics sdk video video-metrics
Last synced: 21 Apr 2026
https://github.com/critocrito/data-scores-map
Data scores in the UK web app.
algorithmic-decision-making data data-investigation data-scores investigation
Last synced: 21 Apr 2026
https://github.com/snickerdoodlelabs/whitepaper
LaTex files for protocol whitepaper.
data latex pdf self-custody snickerdoodle whitepaper zero-knowledge
Last synced: 21 Apr 2026
https://github.com/ppatrzyk/heatmap
Display CSV as a heatmap in terminal
csv data data-visualization terminal
Last synced: 24 Apr 2026
https://github.com/howwohmm/fetchgram
era-adjusted Instagram content intelligence — scrape any public profile, OCR every image, measure what actually works. free, local, no API keys.
analytics cli content-strategy data instagram ocr python scraper
Last synced: 06 Jun 2026
https://github.com/hruth-vik/sales-analysis-report
SalesScope is a powerful sales analytics dashboard that extracts insights, reveals trends, and drives strategy from raw data.
analytics data powerbi-report powerbi-visuals python
Last synced: 24 Apr 2026
https://github.com/desininja/weather-data-etl-pipeline
ETL pipeline using Apache Airflow
apache-airflow aws cicd dags data data-engineering etl glue-job mwaa pyspark redshift
Last synced: 25 Apr 2026
https://github.com/carlos-levi/twitterbots_analise_redesneurais
Projeto para a disciplina de IA - análise exploratória e aplicação de técnicas de aprendizado de máquina para detectar contas automatizadas (bots) na plataforma 𝕏 (Twitter)
data machine-learning twitter-bot
Last synced: 06 Jun 2026
https://github.com/shwetajanwekar/prediction-with-regression
prediction with regression for salary_hike and delivery time dataset
data data-science datset exploratory-data-analysis matplotlib pandas plot prediction r2-score seaborn sns
Last synced: 25 Apr 2026
https://github.com/jigyasag18/multiple-disease-detection-app
This repository contains the implementation of a Multiple Disease Detection System, which employs advanced machine learning techniques for early detection and prediction of prevalent diseases, including diabetes, heart disease, and Parkinson's disease. The system utilizes a variety of patient health metrics such as demographics and medical history.
data datapreprocessing machine-learning machine-learning-algorithms machinelearningmodel prediction python streamlit streamlit-webapp
Last synced: 07 Jun 2026
https://github.com/fatihemres/africa
Africa app by SwiftUI. Using AVFoundation, MapKit, data, models, animations, stickers.
animations avfoundation data mapkit models swift swift-animations swiftui
Last synced: 27 Apr 2026
https://github.com/dmoayad/tuberculosis-classification-ai
Tuberculosis X-ray Classification with training a computer vision model
artificial-intelligence computer-vision data data-science machine-learning medical-image-processing python tuberculosis tuberculosis-classification tuberculosis-detection
Last synced: 27 Apr 2026
https://github.com/mohamedezzeldeenhassanmohamed/data-mining-project
Data minnig GUI project to predict laptop prices,I uses most of ML algorithmes here
data data-mining-assignments datamining-algorithms datapreprocessing decision-trees entropy gini k-means-clustering knn-classification laptop-dataset laptop-price-prediction linear-regression logistic-regression ml mlalgotithms naive-bayes-classifier pca python svm-classifier visualization
Last synced: 27 Apr 2026
https://github.com/leonardomusini/mbe-growth-nexus-converter
Python tool to convert laboratory text files into NeXus files for Molecular Beam Epitaxy (MBE) data.
data data-engineering nexus python
Last synced: 28 Apr 2026