data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum
- Last updated: 2026-07-03 00:07:49 UTC
https://github.com/frer0t/userverse
An API for data analysis
data data-analytics spring-boot users
Last synced: 12 Apr 2026
https://github.com/jigyasag18/multiple-disease-detection-app
This repository contains the implementation of a Multiple Disease Detection System, which employs advanced machine learning techniques for early detection and prediction of prevalent diseases, including diabetes, heart disease, and Parkinson's disease. The system utilizes a variety of patient health metrics such as demographics and medical history.
data datapreprocessing machine-learning machine-learning-algorithms machinelearningmodel prediction python streamlit streamlit-webapp
Last synced: 07 Jun 2026
https://github.com/wisdom-osborn/data-analytics-course-online-
🔍 Data Analytics with Python — Hands-on Course Materials Jupyter notebooks, projects, and datasets based on the freeCodeCamp Data Analysis with Python certification. Learn NumPy, Pandas, data cleaning, and visualization through real-world examples
data data-analysis data-science data-visualization freecodecamp numpy pandas pandas-dataframe project python
Last synced: 19 Apr 2026
https://github.com/twilighty-abhi/locust-data-visualiser
Locust Data Visualiser
Last synced: 15 Aug 2025
https://github.com/getconversio/dig-the-data
Data visualizations for the Conversio blog
Last synced: 12 Apr 2026
https://github.com/vedantwalia/mymusicvisualisationproject
data datavisualisation json jupyter-notebook pandas python xml xml-parser
Last synced: 09 Apr 2026
https://github.com/eva-kaushik/data-clustering
Clustering Accelerators for hard and soft clustering, including implementations of K-means, K-medoids, hierarchical clustering, fuzzy C-means, and Gaussian mixture models. Demonstrates text clustering using both hard and soft clustering algorithms.
clustering clustering-algorithm data datascience machine-learning-algorithms
Last synced: 09 Apr 2025
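The distinction between hard and soft clustering named in the entry above can be sketched in a few lines. This is an illustrative Python sketch, not code from the repository: the function names, the one-dimensional centroids, and the fuzzifier value `m=2.0` are all assumptions. Hard assignment picks the single nearest centroid (as K-means does), while the soft version computes the standard fuzzy C-means membership degrees, which sum to 1.

```python
import math


def hard_assign(point, centroids):
    """Return the index of the nearest centroid (K-means-style hard assignment)."""
    distances = [math.dist(point, c) for c in centroids]
    return distances.index(min(distances))


def soft_memberships(point, centroids, m=2.0):
    """Return fuzzy C-means membership degrees of `point` in every cluster.

    `m` > 1 is the fuzzifier; the returned degrees sum to 1.
    """
    distances = [math.dist(point, c) for c in centroids]
    zeros = distances.count(0.0)
    if zeros:
        # A point lying exactly on a centroid belongs fully to it
        # (shared equally if several centroids coincide with the point).
        return [1.0 / zeros if d == 0.0 else 0.0 for d in distances]
    exponent = 2.0 / (m - 1.0)
    return [
        1.0 / sum((d_j / d_k) ** exponent for d_k in distances)
        for d_j in distances
    ]


# Hypothetical 1-D cluster centres and a sample point, for illustration only.
centroids = [(0.0,), (10.0,)]
point = (2.0,)
print(hard_assign(point, centroids))
print(soft_memberships(point, centroids))
```

With a hard assignment the point belongs only to the first cluster; the soft memberships keep a small, non-zero degree for the second cluster as well.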
https://github.com/zeptosec/bpscrapper
Shows history of oil prices
data data-visualization database nodejs scraper
Last synced: 13 Apr 2026
https://github.com/prishabhanot/facial_recognition_pca
A face recognition system using Principal Component Analysis (PCA) for dimensionality reduction and a Support Vector Machine (SVM) classifier for classification. PCA extracts essential features (eigenfaces) from facial images, significantly reducing computational complexity while retaining critical information for accurate recognition.
data eigenfaces facial-recognition pca python reducing-computational-complexity reducing-data-dimensions svm-classifier
Last synced: 01 Mar 2025
https://github.com/elimu-ai/ml-event-simulator
🤖 Simulation of learning events and assessment events
data learning-analytics machine-learning ml
Last synced: 28 Feb 2025
https://github.com/pawal/tldmonitor-ui-go
Web UI for TLDMonitor
analysis data dns go golang mongodb statistics webapp website
Last synced: 16 Jan 2026
https://github.com/cosmos-loops/cosmos-data
Cosmos.Data is an inline project of COSMOS LOOPS PROGRAMME to provide several SQL-Query, RMDB/ORM and No-SQL components' extensions.
connection-pool data mysql mysqlconnector oracle postgresql sqlite sqlkata sqlserver transaction uow
Last synced: 12 Apr 2026
https://github.com/badawy403/egy.list
A Node.js package providing access to official Egyptian data including universities, governorates, cities, and more. This package makes it easy for developers to integrate Egypt-specific information into their applications.
city data egypt javascript nodejs npm package
Last synced: 08 Mar 2026
https://github.com/ioanzicu/batch_loading_one-to-many_data_model
Unesco Batch Loading One-to-Many Data using Django
Last synced: 27 Apr 2026
https://github.com/yuweaec/project-scidatapipeline
A comprehensive toolkit for processing, simulating, and analyzing scientific data, integrating Python, Fortran, and Jupyter notebooks for seamless workflows.
analysis data pipeline processing scientific simulation
Last synced: 27 Apr 2026
https://github.com/luminati-io/LinkedIn-dataset-samples
Sample dataset of 1001 LinkedIn companies, extracted via Bright Data API, featuring essential data points for competitive analysis and market insights.
data database dataset linkedin linkedin-api linkedin-data linkedin-dataset linkedin-scraper sample web-scraping
Last synced: 09 Apr 2025
https://github.com/desoga10/nety-form
In this tutorial, I show you how to send data from a form to the Netlify dashboard. I also show you how to create a form using Materialize.
contact-form css css3 data form forms html html5 materialize materialize-css materializecss-framework netlify
Last synced: 03 Jan 2026
https://github.com/analyticslover/sales-python-dashboard
Japan Sales Dashboard 2023
dashboards data data-analysis jupyter-notebook python3 sales streamlit
Last synced: 09 Apr 2026
https://github.com/otoneko1102/roulette-base
A collection of roulette colors and numbers in JSON format. Use it when making a casino-style roulette.
casino casino-games data json require roulette
Last synced: 16 Mar 2025
https://github.com/robson-python/academic-performance
Project to evaluate students' academic performance.
csv-import data data-analysis data-science jupyter-notebook machine-learning matplotlib pandas python scikit-learn seaborn vscode
Last synced: 12 Apr 2026
https://github.com/karosi12/ng-data-share
Angular communication with input and output properties
angular communication data data-binding input output sharing typescript
Last synced: 16 Jan 2026
https://github.com/paul-henryp/simulate-investment-strategies
This Java program simulates different investment strategies using historical stock market data. It allows users to test various strategies such as buy and hold, moving average, buying when the stock price is lower than the last purchase, and dollar-cost averaging.
data data-science investing-java java plots plotting simulated-data simulated-investments sp500 sp500-data-analysis
Last synced: 21 May 2026
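Two of the strategies named in the entry above, buy and hold and dollar-cost averaging, can be compared with a small calculation. The following is a minimal Python sketch, not the repository's Java code; the function names and the price series are made up for illustration, and fees, dividends, and fractional-share limits are ignored.

```python
def buy_and_hold(prices, budget):
    """Spend the whole budget at the first price; return the final portfolio value."""
    shares = budget / prices[0]
    return shares * prices[-1]


def dollar_cost_average(prices, budget):
    """Split the budget evenly across all periods; return the final portfolio value."""
    per_period = budget / len(prices)
    shares = sum(per_period / price for price in prices)
    return shares * prices[-1]


# Made-up price series, one price per period (not real market data).
rising = [10.0, 20.0, 40.0]
falling = [40.0, 20.0, 10.0]

for label, prices in (("rising", rising), ("falling", falling)):
    print(label, buy_and_hold(prices, 300.0), dollar_cost_average(prices, 300.0))
```

In this toy data, buy and hold ends higher when prices only rise, while dollar-cost averaging ends higher when prices only fall, because later purchases are made at lower prices.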
https://github.com/demkeys/lazydatatransfer
Lazy method to transfer up to 64 KB of data over the network using UDP
data data-trans network python transfer udp
Last synced: 07 Jun 2026
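The roughly 64 KB ceiling in the entry above follows from UDP itself: over IPv4 a single datagram can carry at most 65,507 payload bytes (65,535 minus a 20-byte IPv4 header and an 8-byte UDP header). The sketch below uses Python's standard `socket` module to send one datagram and read it back over the loopback interface. It is an illustration under those assumptions, not the repository's implementation; the helper names are hypothetical, and operating-system settings may impose a lower practical limit.

```python
import socket

# Largest UDP payload over IPv4: 65,535 - 20 (IPv4 header) - 8 (UDP header).
MAX_UDP_PAYLOAD = 65_507


def send_datagram(payload: bytes, address) -> None:
    """Send `payload` as a single UDP datagram to `address` (host, port)."""
    if len(payload) > MAX_UDP_PAYLOAD:
        raise ValueError(
            f"payload of {len(payload)} bytes exceeds {MAX_UDP_PAYLOAD} bytes"
        )
    with socket.socket(socket.AF_INET, socket.SOCK_DGRAM) as sock:
        sock.sendto(payload, address)


def loopback_round_trip(payload: bytes) -> bytes:
    """Send `payload` to a receiver bound on 127.0.0.1 and return what arrived."""
    with socket.socket(socket.AF_INET, socket.SOCK_DGRAM) as receiver:
        receiver.bind(("127.0.0.1", 0))  # port 0 lets the OS pick a free port
        receiver.settimeout(2.0)         # do not block forever if nothing arrives
        send_datagram(payload, receiver.getsockname())
        data, _sender = receiver.recvfrom(MAX_UDP_PAYLOAD)
        return data


print(loopback_round_trip(b"hello over UDP"))
```

UDP gives no delivery or ordering guarantees, so anything larger than one datagram, or anything sent over an unreliable link, would need chunking and acknowledgements on top of this.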
https://github.com/sirmaxx/log_manager
log manager services for microservices
data fastapi logging microservice mongodb
Last synced: 09 Apr 2026
https://github.com/bmcollier/contiguous
Provides COBOL-style contiguous data structures in Python
Last synced: 14 Jan 2026
https://github.com/mchenryspagg/wrangle-and-analyze-data
This project, known as 'wrangle and analyze data', wrangles WeRateDogs Twitter archive data from 2015 to 2017.
api data dataanalysis datacollection datawrangling datetime json numpy os pandas pil python requests tweepy-api visualization
Last synced: 09 Apr 2026
https://github.com/dmoayad/tuberculosis-classification-ai
Tuberculosis X-ray classification by training a computer vision model
artificial-intelligence computer-vision data data-science machine-learning medical-image-processing python tuberculosis tuberculosis-classification tuberculosis-detection
Last synced: 27 Apr 2026
https://github.com/afnanenayet/kaggle-titanic
The classic Kaggle Titanic data science challenge
backprop backpropagation classification classifier data forest kaggle layer learn mlp multi numpy pandas perceptron random science scikit sklearn titanic
Last synced: 12 Apr 2026
https://github.com/sourceduty/text_file_metadata
📄 Extract metadata from .txt files and record the metadata in .txt files.
data datascience metadata metafile practice sourceduty
Last synced: 08 Aug 2025
https://github.com/sourceduty/language_barriers
🔤 Language barriers between the world's 7,000 languages.
communication concept data idea info information language language-barrier language-barriers languages project research
Last synced: 11 Feb 2026
https://github.com/etmendz/mendz.data
Provides tools and guidance for creating data access contexts and repositories.
context data datasettings entity-framework mendz paginginfo repository resultinfo
Last synced: 11 Jun 2025
https://github.com/rishitabansal9/adult-census-income-prediction
This is a project made for data analysis and income prediction using a random forest classifier with 91% accuracy.
data data-analysis data-science feature-engineering random-forest-classifier
Last synced: 25 Mar 2025
https://github.com/sourceduty/digital_brand_footprint
🔗 Expert in finding and analyzing branded websites and social media links.
analytics artificial-intelligence business business-footprint businesses chatgpt company concept data link openai social-media tool url website
Last synced: 16 Aug 2025
https://github.com/tacticalnuclearraccoon/dataviz_with_js
Sample data visualisation as part of a training on JavaScript frameworks for dataviz
d3 data datawrapper echarts javascript visualization
Last synced: 27 Apr 2026
https://github.com/drkane/area-profiles
Produce UK area profiles based on various data sources
dash-plotly data flask statistics uk
Last synced: 27 Apr 2026
https://github.com/rajlabmssm/echodata
echoverse module: Example data.
data echoverse fine-mapping genomics gwas qtl
Last synced: 17 Jan 2026
https://github.com/theprodigyleague/d1g174lx534f00d
react/node bootstrapped project for a digi(company){["SEAFOOD"]}
bootstrap companies data data-conduit digital digital-seafood java javascript node project react seafood
Last synced: 01 Oct 2025
https://github.com/machinecyc/lotteryinsight
Use crawler to collect Taiwan Lotto data, and save data into local MySQL server.
crawler data docker lottery mysql-database python3 taiwan
Last synced: 09 May 2026
https://github.com/soenneker/soenneker.cloudflare.origincerts.thumbprints
The current Cloudflare origin certificate thumbprints
cloudflare csharp data dotnet origincerts thumbprint thumbprints
Last synced: 23 Apr 2026
https://github.com/soenneker/soenneker.datatables.attributes.column
A C# attribute for Datatables.js column building
attributes column columns csharp data datatablecolumnattribute datatables dotnet mapping object
Last synced: 12 Mar 2026
https://github.com/zazza123/hamana
A python library for seamless data extraction, storage, and SQL-based analysis using pandas and SQLite.
Last synced: 14 Jan 2026
https://github.com/filiprokita/foldertoiso
Python script that converts a specified folder into an ISO.
automation command-line-interface command-line-tool compression cross-platform data file-system folder-to-iso iso iso-image iso-tool python python-cli python-script python3 shutil utility
Last synced: 24 Mar 2025
https://github.com/purarue/scramble-history
Parses Rubik's Cube scramble history and solve times from cstimer.net, cubers.io, and twistytimer, then merges them to give uniform averages, data, and graphs
cstimer cubing data rubiks-cube speedsolving
Last synced: 11 Jun 2025
https://github.com/newrelic-experimental/newrelic-java-atomikos
Gives status of Atomikos Data Sources since this information is unavailable via JMX
atomikos data instrumentation java nrlabs nrlabs-data nrlabs-java-verify nrlabs-odp observability-data
Last synced: 30 May 2026
https://github.com/oniani/miniframe
Minimal data frames with relational algebra
data dataframe-library haskell haskell-library library
Last synced: 04 Mar 2025
https://github.com/shubhamsoni98/project_using_knn
This project applies the K-Nearest Neighbors (KNN) algorithm to predict iPhone purchases based on customer data. Using features like age, salary, and previous purchase behavior, the KNN model classifies customers into buyers and non-buyers.
anaconda analytics data data-science eda knn knn-classification machine-learning-algorithms predict project python scikit-learn tableau
Last synced: 03 Jan 2026
https://github.com/rodrigojunqueiradev/curso-python-3-do-basico-ao-avancado
Python 3 course from basic to advanced, with real projects
data data-analysis data-science python python-3 python-library python-script python3
Last synced: 27 Jun 2026
https://github.com/yeti-robotics/past-scouting-data
❄️ Scouting Data from Previous Events/Seasons ❄️
Last synced: 06 Jan 2026
https://github.com/diegoperea20/datos-secuenciales-con-ia
One-dimensional signal processing with autoregressive models, 1D convolution, 2D convolution using the spectrogram, and recurrent networks
ai artificial-intelligence convolutional-neural-networks data ia secuential-data spectrogram uao
Last synced: 06 Feb 2026
https://github.com/kingsley-ezenwaka/medical-data-visualizer
A data analysis project that investigates a dataset of anonymous patients' medical information, and explores the relationship between cardiac disease, body measurements, blood markers, and lifestyle choices.
analysis data matplotlib numpy pandas seaborn
Last synced: 28 Apr 2026
https://github.com/kunalshelke90/kunalshelke90
💻 Machine Learning Enthusiast | Data Science Explorer | Eager to solve problems with the help of data.
data data-science dataanalysis database machine-learning mlops
Last synced: 06 Jul 2025
https://github.com/adilsaid64/real-time-data-monitoring
Exploring what a real-time data drift monitoring solution could look like within MLOps
data datadrift grafana machine-learning mlops mlops-workflow prometheus python software-engineering
Last synced: 04 Aug 2025
https://github.com/priyanshubiswas-tech/e-commerce_data_analysis
Analyzes 9,994 e-commerce transactions to uncover insights on sales trends, customer behavior, profitability, and logistics using EDA and visualization. Identifies top products, customer segments, and shipping efficiencies to optimize marketing, inventory, and operations, making it valuable for retail, finance, and logistics.
data data-analysis data-visualization pandas pandas-dataframe plotly-analytics-projects plotly-express python
Last synced: 28 Apr 2026
https://github.com/davitshahnazaryan3/data-management-web
Explore datasets with ease using taxonomy filtering, allowing you to quickly identify the specific experimental datasets you need and download them effortlessly
data environmental experiments filtering-data seismic taxonomy
Last synced: 17 Jan 2026
https://github.com/pchaparro/search-engine
Full-stack search engine built from YouTube videos obtained using web scraping
data opensearch python python3 react scraper scraping scraping-websites search search-engine semantic-search sentence-transformers typescript website
Last synced: 17 Apr 2026
https://github.com/robthree/cfnreader
Provides a simple way to read FNIRSI's CFN files (*.cfn) produced by the FNIRSI UsbMeter tool
cfn csv data fnirsi usb usb-tester
Last synced: 01 Mar 2025
https://github.com/h-sutiwas/r2de-2025
This repository is related to the Road To Data Engineer Bootcamp by DataTH. It contains all related coursework, some mini projects and other resources within the field of Data Engineering.
data data-engineering data-visualization docker gcp pipeline spark
Last synced: 30 Apr 2026
https://github.com/armand-sauzay/datasets
Datasets for machine learning
ai data datasets machine-learning ml
Last synced: 18 Jan 2026
https://github.com/natanast/euroleaguebasketball
An R package providing data on Euroleague Basketball
Last synced: 01 Apr 2025
https://github.com/codegeekr/test_datasciencestarter
test Data Science Starter
analytics data data-science data-visualization machine-learning python science starter-kit statistics test
Last synced: 28 Apr 2026
https://github.com/gagolews/datafusion
Data Fusion (open-access research monograph, 2015)
aggregation data fusion fuzzy-logic mean multidimensional-analysis multidimensional-data spread statistics strings variance
Last synced: 16 Mar 2025
https://github.com/mrlynn/sizing-exercise-data-generator
Data Generator for December 2017 Sizing Exercise
Last synced: 28 Apr 2026
https://github.com/paezha/bsantiago
A data package with the results of a travel and well-being survey conducted in Santiago in 2016
data equity package r santiago survey travel well-being
Last synced: 18 Mar 2025
https://github.com/sadratehranian/data-collection-and-machine-learning
First, creates a logistic regression model to predict whether the fire alarm of a smoke detector should sound. Second, predicts whether an electric drive in a production plant may be faulty.
data data-analysis data-science datacollection logistic-regression machine-learning ml nn
Last synced: 05 Jan 2026
https://github.com/servierhub/adsv
Analyze delimiter-separated values files
csv csv-converter csv-format csv-parser csv-parsing csv-reader csv-reading data data-analysis data-engineering data-mining
Last synced: 28 Sep 2025
https://github.com/yashaswitir28/yashaswitir28.github.io
This is my Portfolio Website
data data-analysis-python data-analyst data-cleaning data-science data-visualization excel html-css ms office365 portfolio-website powerbi python sql
Last synced: 29 May 2026
https://github.com/abdullahashfaqvirk/Earth-Engine-Data-Scraper
A Python-based web scraper designed to extract and organize dataset metadata from the Google Earth Engine Datasets Catalog for research and analysis purposes.
beautifulsoup data data-science python requests scraper web-scraping
Last synced: 27 Sep 2025
https://github.com/entorb/analyze-ha-energy
Analyze Home Assistant Solar Production Data
data home-assistant pandas photovoltaic pv python
Last synced: 08 May 2026
https://github.com/anct-cartographie-nationale/mednum-cli
✨ Command-line interface for transforming data on digital mediation locations, collected in a non-standard format, into the mednum schema and publishing it on data.gouv
anct betagouv data donnees gouvernement mediation-numerique nodejs open-data transformation
Last synced: 02 Aug 2025
https://github.com/matthewgferrari/covid-contextualizer
A Coronavirus Contextualizer for the USA
Last synced: 26 Jun 2026
https://github.com/vedikasnehil/my-data-science-projects
This repository is a comprehensive collection of resources and implementations dedicated to the field of Data Science. It serves as a platform for exploring various aspects of data science, ranging from data preprocessing and exploratory data analysis (EDA) to machine learning and deep learning.
data data-science deep-learning machine-learning matplotlib numpy python sql visualization
Last synced: 10 Apr 2026
https://github.com/quonverbat/ordner
A simple, customizable and cross-platform data tracker.
data datatracker javafx management
Last synced: 07 Jul 2025
https://github.com/progati00/marketing-mix-modeling-mmm-for-marketing-budget-optimization
A Marketing Mix Modeling (MMM) project using Python to analyze channel performance, calculate ROI, and simulate marketing budget changes for better business decisions. Includes a trained Linear Regression model, ROI analytics, and a Flask API for revenue prediction.
api budget-optimization data data-analysis data-science ecommerce eda flask jupyter-notebook linear-regression machine-learning marketing-analytics marketing-mix-modeling python roi-analysis vscode
Last synced: 14 Apr 2026
https://github.com/ankitrai259/sales_insight_dashboard
Sales Insight: Using SQL for data cleaning and Power BI for building an interactive dashboard
dashboard data data-visualization datacleaning postgresql powerbi sql
Last synced: 17 Mar 2025
https://github.com/cityofnewyork/nyco-wp-open-data-transients
Interface for saving Open Data endpoints as WordPress Transients. Maintained by @NYCOpportunity
civic-tech composer data nycopportunity open-data plugin transients wordpress
Last synced: 10 Apr 2026
https://github.com/shef4793/hackerrank-sql-challenges-solutions
Solutions to all SQL challenges on HackerRank, executed in either a MySQL or an MS SQL environment.
data data-engineering hackerrank hackerrank-challenges hackerrank-solutions mssql mssql-server mysql problem-solving solutions sql sql-challenges sql-query
Last synced: 11 Mar 2026
https://github.com/plurid/datasign
Single Source of Truth Data Contract Specifier
Last synced: 08 Nov 2025
https://github.com/mendel5/wifi
Information about Wi-Fi (wifi, WLAN, wireless LAN)
bitrate data data-transmission ethernet internet latency speed throughput transfer transmission wi-fi wifi wireless wireless-lan wlan
Last synced: 02 Aug 2025
https://github.com/khushi-sabarad/data_analysis
LinkedIn Learning capstone project
data data-engineering matplotlib pandas python
Last synced: 10 May 2026
https://github.com/deliprofesor/health-score-prediction-model-the-impact-of-lifestyle-and-demographic-factors
A machine learning project predicting health scores based on lifestyle and demographic factors like age, BMI, diet, and exercise. Techniques include Random Forest, Polynomial Regression, and Linear Regression, with a focus on model performance and actionable health insights.
cross-validation data data-science data-visualization feature-engineering linear-regression machine-learning polynomial-regression random-forest
Last synced: 10 Apr 2025
https://github.com/infinitode/pyautoplot
PyAutoPlot is an open-source Python library designed to make dataset analysis much easier by generating helpful detailed plots using matplotlib. It automatically generates appropriate plots based on the dataset you feed it.
analysis automatic csv data dataset dataset-analysis generation matplotlib pandas plots plotting-in-python plotting-library python
Last synced: 16 Mar 2025
https://github.com/zevio/acl
ACL Anthology corpus sample
data dataset scholarly-articles
Last synced: 01 Mar 2026
https://github.com/ryanga09/digitalent_fundamentaldatascience-selfpractice
A repository of hands-on projects from DigiTalent’s Fundamental Data Science training, covering web scraping, data exploration, data cleaning, and data annotation. Includes Jupyter notebooks and example code for practical learning.
data data-analysis data-science data-visualization dataset digitalent komdigi notebook-jupyter notebooks
Last synced: 02 Aug 2025
https://github.com/hakusaro/facts
A fact based knowledge system (FBKS) experiment.
Last synced: 03 Jan 2026
https://github.com/arthurdanjou/studies
💼 This is the repository containing all my projects done during my studies in Python and R.
ai data data-science data-visualization jupyter jupyter-notebook ml python r
Last synced: 08 Apr 2025
https://github.com/shahsuvarli/election-voters-data-analysis-pandas
Educational project analyzing Azerbaijan voter demographics with pandas, focusing on data cleaning, grouping, and visualization.
cleaning data grouping matplotlib numpy pandas python visualization
Last synced: 12 Apr 2026
https://github.com/samhollings/nhs_data_cleansing
A repo of reusable functions for cleansing data
cleansing data data-cleaning data-cleansing preprocessing pyspark python python3
Last synced: 05 Oct 2025