data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum
- Last updated: 2026-06-29 00:07:49 UTC
https://github.com/0xHericles/ufcg-geojson
GeoJSON file containing the blocks and buildings of the Federal University of Campina Grande.
data data-visualization geojson map open-source ufcg university
Last synced: 24 Mar 2025
https://github.com/doppelgunner/baby
A program for storing data just for fun
data doppelgunner java note storing
Last synced: 12 Jun 2026
https://github.com/janakajain/Joshua_Project
christianity data proselytizing religion
Last synced: 10 Mar 2025
https://github.com/petzi53/repair
R Datasets of the Open Repair Alliance (ORA).
Last synced: 19 May 2026
https://github.com/cpietsch/breitband
developer repo of breitband-berlin
d3js data threejs visualization
Last synced: 02 May 2026
https://github.com/infinitode/pyautoplot
PyAutoPlot is an open-source Python library designed to make dataset analysis much easier by generating helpful detailed plots using matplotlib. It automatically generates appropriate plots based on the dataset you feed it.
analysis automatic csv data dataset dataset-analysis generation matplotlib pandas plots plotting-in-python plotting-library python
Last synced: 16 Mar 2025
https://github.com/jigyasag18/movie-recommendation-system-project
This repository features a personalized movie recommendation system that offers tailored suggestions to users. It leverages a dataset of 5,000 English-language films and utilizes data processing, feature engineering, and a cosine similarity algorithm to analyze user preferences. The system includes an intuitive user interface for easy navigation.
data datacleaning datapreprocessing machine-learning machine-learning-algorithms python streamlit streamlit-webapp
Last synced: 28 May 2026
https://github.com/beriberikix/senml-zephyr
A codec for encoding and decoding Sensor Measurement Lists (SenML) for Zephyr
codec data iot senml sensor zephyr-rtos
Last synced: 24 Mar 2025
https://github.com/giuleo129/dataanalysis
This folder contains two projects focused on data analysis and statistical learning using R, covering exploratory data analysis, modeling, and predictive techniques.
data data-analysis data-science statistical-learning
Last synced: 25 Jan 2026
https://github.com/darkogamerz/dhis2heat
A comprehensive data management and health equity assessment and analysis platform that fetches data from DHIS2, then optimizes, calculates, cleans, and visualizes inequality data.
analytics data data-science dhis2 equality equity health heat inequality r shiny shinydashboard visualization
Last synced: 01 Apr 2025
https://github.com/eudesgccunha/automated-management-panel
Automated management panel using Power BI
data data-analysis data-visualization database excel powerbi
Last synced: 04 Feb 2026
https://github.com/dhruvil-26/powerbi-projects
This repository contains Power BI projects showcasing data analysis and interactive dashboards. Each project includes detailed visualizations and insights on diverse topics such as loan analysis, sales performance, and customer behavior.
customer-behavior-analysis data data-analysis interactive-dashboards loan-analysis powerbi sales-performance visualization
Last synced: 04 Feb 2026
https://github.com/trollmii/bunnybase
An efficient data managing system
bunnybase data data-science data-structures database datascience python python3
Last synced: 22 Apr 2025
https://github.com/inist-cnrs/ws-data
Models and data for web services
Last synced: 03 Sep 2025
https://github.com/sushmashreeps/data-science-with-python
This repository showcases a comprehensive data science project utilizing Python, demonstrating expertise in data analysis, visualization, and machine learning. Built with Python 3.x, the project leverages popular libraries like Pandas, NumPy, Matplotlib, Seaborn, Scikit-learn, and TensorFlow. The project features data preprocessing, feature engine
cnn data dataanalysis datascience keras linear-regression matplotlib python python3 regression rnn visualization
Last synced: 14 Apr 2026
https://github.com/lohithgsk/dynamic-qr-generator
A Python-based QR generator application was developed using the qrcode and Pillow libraries, dynamically generating QR codes for custom data inputs. Designed for a college grievance management system, the application creates QR codes containing block, floor, room, and machine numbers, allowing easy placement and identification on each floor.
data pillow python qrcode qrcode-generator
Last synced: 16 Mar 2025
https://github.com/zulfachafidz/telco_churn_insight_customer_loss_prediction_with_random_forest_and_decision_tree-algorithms
The main problem in the business world is customer churn, or losing customers, especially in the telecommunications industry, which experiences very tight competition. To overcome this problem, an analysis was carried out to help the company understand how many customers have the potential to switch providers.
data data-science data-visualization dataanalysis dataanalyst dataanalytics datadrivenwithdataprovider decision-tree decision-tree-classifier decision-trees random-forest random-forest-classifier
Last synced: 01 May 2026
https://github.com/purarue/scramble-history
Parses Rubik's Cube scramble history and solve times from cstimer.net, cubers.io, and twistytimer, then merges them to give you uniform averages, data, and graphs
cstimer cubing data rubiks-cube speedsolving
Last synced: 11 Jun 2025
https://github.com/srvanderplas/statistical_atlas
Framed Charts and the Statistical Atlas of 1870
census data ggplot2 graphics r statistics visualization
Last synced: 29 May 2026
https://github.com/etmendz/mendz.data
Provides tools and guidance for creating data access contexts and repositories.
context data datasettings entity-framework mendz paginginfo repository resultinfo
Last synced: 11 Jun 2025
https://github.com/powersyang/visualization
Data visualization templates
Last synced: 24 Mar 2025
https://github.com/filipnet/infoscreen
Arduino subscribes to values over MQTT and displays the information on an OLED I2C display
arduino data display i2c mqtt oled-display-ssd1306 visualization weather weatherstation
Last synced: 12 Apr 2026
https://github.com/entropyorg/p5-data-testimage
📓📷 Interface for retrieving test images
Last synced: 29 May 2026
https://github.com/jneidel/nationalities
Dataset of 100 common nationalities
data dataset json nationalities nationality opendata
Last synced: 25 Mar 2025
https://github.com/cosmos-loops/cosmos-data
Cosmos.Data is an inline project of COSMOS LOOPS PROGRAMME that provides extensions for several SQL-query, RMDB/ORM, and NoSQL components.
connection-pool data mysql mysqlconnector oracle postgresql sqlite sqlkata sqlserver transaction uow
Last synced: 12 Apr 2026
https://github.com/elimu-ai/ml-event-simulator
🤖 Simulation of learning events and assessment events
data learning-analytics machine-learning ml
Last synced: 28 Feb 2025
https://github.com/denisecase/620-mod6-web-scraping
Notes on how to get started scraping content from the web
beautifulsoup4 data mining python
Last synced: 11 Apr 2025
https://github.com/coko7/vegapull-records
Cards dataset for One Piece TCG
data dataset one-piece one-piece-card-game one-piece-tcg tcg
Last synced: 26 Feb 2025
https://github.com/survi218/angular-http-service
Client-server communication using the HTTP service in Angular
angularjs client-server communication data get http-client http-requests http-response http-server post
Last synced: 16 Mar 2025
https://github.com/plandes/datdesc
Describe and optimize data
data hyperparameter-optimization hyperparameter-tuning latex table
Last synced: 04 Sep 2025
https://github.com/yash-chauhan-dev/sf_analytics
Business teams often rely on data analysts to extract insights using SQL. This tool eliminates that dependency by bridging the gap between humans and data using AI.
aiml analytics data dbt langchain llm python snowflake streamlit
Last synced: 07 May 2026
https://github.com/luciarevaliente/shell_script_data_cleaning
This project focuses on cleaning and processing datasets using Shell scripts. It is part of the Fundamentals of Informatics course (2022-23) and involves handling movie and show data to create cleaned and filtered datasets for further analysis.
data data-cleaning shell-script
Last synced: 04 Feb 2026
https://github.com/getconversio/dig-the-data
Data visualizations for the Conversio blog
Last synced: 12 Apr 2026
https://github.com/thewillyhuman/willyos-java
willyOS for java developers
collections data data-structures java os structures
Last synced: 12 Jun 2025
https://github.com/frer0t/userverse
Creating an API for data analysis
data data-analytics spring-boot users
Last synced: 12 Apr 2026
https://github.com/matheusafonseca/deploy-ml-models-with-streamlit-udemy
This repository is dedicated to storing the code developed during the "Machine Learning Model Deployment with Streamlit" course on Udemy. The course covers basic to advanced techniques for deploying machine learning models using Streamlit.
data data-science data-visualization interface joblib layout machine-learning optimization-algorithms python python3 sklearn sklearn-datasets sklearn-library sklearn-pipeline streamlit
Last synced: 19 Apr 2026
https://github.com/trissim/polystore
Framework-agnostic multi-backend storage abstraction for ML and scientific computing
backend data io jax ml multi-framework numpy pytorch scientific-computing storage tensorflow zarr
Last synced: 12 Apr 2026
https://github.com/ginga1402/data_visualization_on_honey_production_dataset
Data Visualization using Matplotlib & Seaborn Libraries
college-project data data-visualization
Last synced: 25 Aug 2025
https://github.com/davorg/towerbridge
When is Tower Bridge lifting?
data hacktoberfest london perl web-scraping
Last synced: 25 Oct 2025
https://github.com/syed-bakhtawar-fahim/dsa_algorithm_code
Assalam o Alikum, everyone. This repo covers data structures and algorithms in the C programming language. I hope it helps you learn them. I'm also learning data structures and algorithms in Python in a better and easier way, which you can explore as well.
algorithm algorithms-and-data-structures c data data-structures-and-algorithms dsa-algorithm dsa-learning-series dsa-practice
Last synced: 12 Apr 2025
https://github.com/pietrapaz/bootcamp_dio_ciencia_de_dados
Potência Tech Bootcamp powered by iFood | Data Science - Dio ⚠️
cienciadedados dados data datascience python
Last synced: 09 Apr 2025
https://github.com/woctezuma/steamspy-data
Data snapshot from SteamSpy.
data data-dump data-dumps steam steam-data steamspy steamspy-api
Last synced: 07 Jan 2026
https://github.com/publici/state-integrity-data
Data from a comprehensive assessment of state government accountability and transparency
Last synced: 04 Feb 2026
https://github.com/jpb06/kubot-dal
data data-access-layer gulp-tasks mongodb typescript
Last synced: 12 Apr 2026
https://github.com/theanujsinha01/mcdonalds-customer-analysis
This project analyzes customer feedback data to understand what drives people to like or dislike McDonald’s. Using Python and data visualization tools in a Jupyter Notebook, we explore how different factors—such as taste, price, health, and visit frequency—affect customer satisfaction.
case-study data data-visualization dataanalysis
Last synced: 05 Sep 2025
https://github.com/ayushverma135/dbms-labfile
Created for practical learning, this DBMS lab file offers hands-on exercises covering SQL queries, normalization, indexing, and more. With clear instructions and sample datasets, students gain invaluable experience in database design and management.
Last synced: 04 Feb 2026
https://github.com/kahlery/my-jupyter-notebook-projects
🐊 A collection of my data science analyses. I actually store most of my data science projects in Google Drive because I work in Google Colab.
Last synced: 12 Apr 2026
https://github.com/veivel/f1-sentiment-analysis
A sentiment analysis project on tweets about Formula 1. To be reworked.
data f1 nlp-library nlp-machine-learning
Last synced: 04 Jul 2025
https://github.com/vara-co/tech-certifications
These are the certifications that back up some of my skills.
certificates certifications data data-analytics skills
Last synced: 07 Jan 2026
https://github.com/afnanenayet/ds-a
Some interview prep I've been doing. This repo contains reimplementations of algorithms and data structures in Python 3.
algorithms data interview prep python structures
Last synced: 05 Apr 2025
https://github.com/miraclx/split-merge
Efficient, flexible data stream chunker and merger
chunk data efficient merge middleware nodejs pipeline split stream
Last synced: 07 May 2026
https://github.com/eng-gabrielscardoso/data-science-formation
Data science course walkthrough
data data-science data-visualisation google-colab google-colaboratory google-colaboratory-notebooks python r r-lang
Last synced: 28 Feb 2025
https://github.com/doughtnerd/pod-old
Read and write Excel data
data data-analysis excel poi-library workbook
Last synced: 21 Jan 2026
https://github.com/jerboaburrow/uk-counties-and-unitary-authorities-may-2023-geojson
UK "Counties" Extracted from Office for National Statistics data
Last synced: 29 Mar 2025
https://github.com/naliferopoulos/datamining
Bring your own pickaxe.
aueb aueb-students data data-mining machine-learning machine-learning-algorithms mining random-forest
Last synced: 25 Jan 2026
https://github.com/didier/frontend-data
Functional Programming subject of @CMDA-TT
convenience d3 d3-visualization d3js data datavis datavisualization dataviz front-end functional-programming interactive jsdoc node nodejs parking-spots svelte sveltejs
Last synced: 13 Apr 2026
https://github.com/ismailhakkii/digital_vault
This project can be used for securing data, similar to a real vault.
data digital security-data vault
Last synced: 25 Mar 2025
https://github.com/maximiliancw/completely
Measure your data completeness
data data-cleaning data-quality data-science missing-data
Last synced: 25 Jun 2025
https://github.com/unknownsoup/budget_tracker
A personal budget tracker to build my knowledge of working with databases and data analysis. It uses SQL and Python for the analysis.
data data-science databases python sql
Last synced: 26 Jan 2026
https://github.com/shadmanshaikh/data-analysis-and-ml-work
All of my work in Data Analysis and Machine learning
analytics artificial-intelligence data machine-learning
Last synced: 05 Jul 2025
https://github.com/rorylshanks/devdb-client
This is the repository for the official command line client for DevDB (https://devdb.cloud)
cloud data database-management development
Last synced: 29 May 2026
https://github.com/pdoup/enegry
Time-Series dataset combining multiple sources to explain the broader Greek energy market
data dataset day-ahead-auction energy-markets exploratory-data-analysis forecasting futures-market greek-energy-market renewable-energy time-series-data weather-data
Last synced: 07 May 2025
https://github.com/grace-mengke-hu/redditpushshiftapi
This package collects Reddit datasets and organizes the data in a MongoDB database
Last synced: 13 Jun 2025
https://github.com/ompreetham/fylo-data-storage-component
Fylo Data Storage Component challenge on Frontend Mentor.
component css data front-end front-end-development frontend frontend-mentor frontendmentor-challenge fylo html react render scss storage vite website
Last synced: 11 Apr 2026
https://github.com/farhashaad/farhashaad98
This is a repository to showcase my skills, share projects and track my progress in Data Science related projects.
data data-visualization dataanalysis matplotlib pandas python seaborn sql tableau
Last synced: 24 Apr 2026
https://github.com/musamairshad/dsa-python
This repository contains all the material related to Data Structures and Algorithms implemented in Python.
algorithms data datastructures efficiency python searching-algorithms sorting-algorithms
Last synced: 25 Mar 2025
https://github.com/prishabhanot/facial_recognition_pca
A face recognition system using Principal Component Analysis (PCA) for dimensionality reduction and a Support Vector Machine (SVM) classifier for classification. PCA extracts essential features (eigenfaces) from facial images, significantly reducing computational complexity while retaining critical information for accurate recognition.
data eigenfaces facial-recognition pca python reducing-computational-complexity reducing-data-dimensions svm-classifier
Last synced: 01 Mar 2025
https://github.com/vatshayan/youtube-user-analysis
Analysis of YouTube users' choices and preferences
data data-analysis data-mining data-science data-visualization dataset machine-learning machine-learning-algorithms
Last synced: 05 Feb 2026
https://github.com/deliprofesor/breast-cancer-detection-using-svm-with-smote-and-model-optimization
This project analyzes health and lifestyle factors influencing heart attack risk using statistical methods and machine learning, with Ridge Regression identified as the best predictive model.
classification data data-preprocessing data-science data-visualization gridsearchcv machine-learning python roc-curve smote svm
Last synced: 10 Apr 2025
https://github.com/luminati-io/ZoomInfo-dataset-samples
A sample dataset of over 1000 ZoomInfo companies, extracted using the Bright Data API, ideal for market growth, lead generation, and market analysis.
b2b business companies data data-extraction database dataset datasets web-scraping zoominfo
Last synced: 09 Apr 2025
https://github.com/mukul273/spring-data-rest-jpa-demo
Spring Data Rest JPA Demo
data jpa rest spring spring-boot spring-mvc
Last synced: 20 Apr 2026
https://github.com/otoneko1102/roulette-base
A collection of roulette colors and numbers in JSON format. Use it when making a casino-style roulette.
casino casino-games data json require roulette
Last synced: 16 Mar 2025
https://github.com/vishwas-chakilam/twitter-sentiment-analysis
Twitter Sentiment Analysis is a Python project that analyzes the sentiment of tweets based on a user-defined keyword. It uses Tweepy to fetch tweets from the Twitter API and TextBlob for sentiment analysis. The application features a user-friendly GUI with Tkinter, displaying tweet sentiment as positive, negative, or neutral.
api data data-science dataanalysis python3 textblob-sentiment-analysis tkinter tweepy-api
Last synced: 11 Mar 2025
https://github.com/jstafford5380/provausio.testing.generators
Generate fake data for testing and/or mocking
data fake-data generator testing
Last synced: 14 Jan 2026
https://github.com/rosette-api/mock-data
Mock data that is used for unit testing of the Babel Street Analytics bindings
data entity-extraction entity-level-sentiment entity-linking entity-relationship entity-resolution language-detection machine-learning mock-data morphology natural-language-processing nlp relation-extraction sentiment-analysis test-framework testing text-mining text-processing tokenization
Last synced: 04 Mar 2026
https://github.com/bmcollier/contiguous
Provides COBOL-style contiguous data structures in Python
Last synced: 14 Jan 2026
https://github.com/allanotieno254/powerbi-dax-filter-context
This repository contains a Power BI project that explores **DAX Filter Context**, a crucial concept in DAX calculations. The project focuses on **Bank Loan Analysis**, demonstrating how different filter contexts affect DAX formulas.
business-intelligence data data-analysis dax dax-functions powerbi powerbi-visuals visualization
Last synced: 08 Jan 2026
https://github.com/shadeglare/genum
The ES Next tools to process data in a LINQ manner
data linq processing typescript
Last synced: 13 Apr 2026
https://github.com/rishitabansal9/adult-census-income-prediction
This is a project made for data analysis and income prediction using random forest classifier with 91% accuracy.
data data-analysis data-science feature-engineering random-forest-classifier
Last synced: 25 Mar 2025
https://github.com/juangesino/research-project
Course files for Research Project @ University of Amsterdam
data data-science economics stata
Last synced: 02 Jan 2026
https://github.com/meokullu/prefill
PreFill adds desired characters onto output values to increase their legibility.
alignment data data-analysis data-engineering data-science legibility
Last synced: 17 Jan 2026
https://github.com/bkestelman/dasy-ml
DaSy DataSynthesizer - Create synthetic data with desired statistical properties for machine learning research.
data data-science machine-learning
Last synced: 14 Jan 2026
https://github.com/fiedsch/data_util
Miscellaneous utilities for data files, such as variable name lists
Last synced: 14 Jun 2025
https://github.com/woctezuma/recent-sales-data
Data available to estimate sales of Steam games during release week.
Last synced: 05 Feb 2026
https://github.com/soenneker/soenneker.cloudflare.origincerts.thumbprints
The current Cloudflare origin certificate thumbprints
cloudflare csharp data dotnet origincerts thumbprint thumbprints
Last synced: 23 Apr 2026
https://github.com/soenneker/soenneker.datatables.attributes.column
A C# attribute for Datatables.js column building
attributes column columns csharp data datatablecolumnattribute datatables dotnet mapping object
Last synced: 12 Mar 2026