data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-06-29 00:07:49 UTC
- JSON Representation
https://github.com/revolutionarybukhari/datawarehouse_meshjoin_superstore
A dataware house is generated for streaming data of a superstore using extended mesh join by Syed Husnain Haider Bukhari
data data-science data-warehousing meshjoin
Last synced: 23 May 2026
https://github.com/danielrosehill/value-factors-data-vis
Streamlit app containing visualisations of the Global Value Factors Database (GVFD) released by the IFVI in 2024
data data-visualization sustainability sustainability-data
Last synced: 29 Jul 2025
https://github.com/bastianolea/servel_elecciones_core
Resultados electorales desde Servel (2024)
chile comunas data elecciones genero
Last synced: 01 Aug 2025
https://github.com/cunfuu/network-bubbles
For Easier to manage organizations and keeping notes about them to organize events and easy access their needs
data data-visualization organizations organizations-volunteer
Last synced: 31 Jul 2025
https://github.com/johndelatto/automate-your-job-search-ai-applies-to-1000-positions
Automate Your Job Search: AI Applies to 1000 Positions Overnight & Get 100+ Interviews! In today’s fast-paced and highly competitive job market, finding and securing your dream job can be both time-consuming and exhausting.
ai data non-profit open-ai open-source
Last synced: 28 Jan 2026
https://github.com/pew-pew-team/hydrator
Hydrator kernel component
data deserializer dto hydrator kernel mapper mapping serializer structure
Last synced: 24 Mar 2025
https://github.com/shahsuvarli/election-voters-data-analysis-pandas
Educational project analyzing Azerbaijan voter demographics with pandas, focusing on data cleaning, grouping, and visualization.
cleaning data grouping matplotlib numpy pandas python visualization
Last synced: 12 Apr 2026
https://github.com/arthurdanjou/studies
💼 This is the repository containing all my projects done during my studies in Python and R.
ai data data-science data-visualization jupyter jupyter-notebook ml python r
Last synced: 08 Apr 2025
https://github.com/farrelfaricaf/exploratorydataanalyst---titanic
This project analyzes the Titanic dataset using exploratory data analysis (EDA) and visualization techniques to identify survival patterns. The goal is to understand how demographic factors like gender and age influenced survival rates during the 1912 disaster.
data data-analysis data-science data-visualization eda python titanic-dataset
Last synced: 31 Jul 2025
https://github.com/mumtaz4118/nlp-course
Programming Assignments and Lectures for Stanford's CS 224: Natural Language Processing with Deep Learning
course data data-analysis data-analytics data-science data-visualization deep-learning education machine-learning natural-language-processing neural-network transfer-learning
Last synced: 24 Nov 2025
https://github.com/cpietsch/breitband
developer repo of breitband-berlin
d3js data threejs visualization
Last synced: 02 May 2026
https://github.com/infinitode/pyautoplot
PyAutoPlot is an open-source Python library designed to make dataset analysis much easier by generating helpful detailed plots using matplotlib. It automatically generates appropriate plots based on the dataset you feed it.
analysis automatic csv data dataset dataset-analysis generation matplotlib pandas plots plotting-in-python plotting-library python
Last synced: 16 Mar 2025
https://github.com/jigyasag18/movie-recommendation-system-project
This repository features a personalized movie recommendation system that offers tailored suggestions to users. It leverages a dataset of 5,000 English-language films and utilizes data processing, feature engineering, and a cosine similarity algorithm to analyze user preferences. The system includes an intuitive user interface for easy navigation.
data datacleaning datapreprocessing machine-learning machine-learning-algorithms python streamlit streamlit-webapp
Last synced: 28 May 2026
https://github.com/ot-code/sql-sabor-y-tradicion
A SQL-driven project that integrates menu and order data to reveal insights on dish performance, customer preferences, and spending trends. It informs pricing strategies, menu adjustments, and targeted promotions, ultimately enhancing the overall customer experience and driving business growth.
analytical-queries data data-aggregation data-analysis database-design join-queries mysql order-analytics relational-databases restaurant-data sql sql-script
Last synced: 08 Apr 2025
https://github.com/petzi53/repair
R Datasets of the Open Repair Alliance (ORA).
Last synced: 19 May 2026
https://github.com/fatihilhan42/hollywood-theatrical-market-synopsis-1995-to-2021
In this project, the data of hollywood film production companies from 1995 to 2021 were examined. Significant tables and graphs were created using data visualization algorithms, with the tickets sold divided into categories.
data data-analysis data-science data-visualization
Last synced: 23 Mar 2025
https://github.com/anuraganalog/twitter-data-analysis
My internship work during the 2020 summer
analysis data eda exploratory-data-analysis jupyter-notebook nlp spotle textblob twitter wordcloud
Last synced: 20 May 2026
https://github.com/janakajain/Joshua_Project
christianity data proselytizing religion
Last synced: 10 Mar 2025
https://github.com/ragibasif/bobdylan
Bob Dylan
bob-dylan csv data data-science data-visualization lyrics music python
Last synced: 03 Sep 2025
https://github.com/doppelgunner/baby
A program for storing data just for fun
data doppelgunner java note storing
Last synced: 12 Jun 2026
https://github.com/0xHericles/ufcg-geojson
GeoJSON file containing the blocks and buildings of the Federal University of Campina Grande.
data data-visualization geojson map open-source ufcg university
Last synced: 24 Mar 2025
https://github.com/beastbytes/n6l-phone-number-data-php
NationalPhoneNumerInterface implementation using PHP for storage
data itu-t0202 phone-number php yii3
Last synced: 08 Feb 2026
https://github.com/aniruddha-biswas/shield-insurance-business-insights
Shield Insurance Business Insights
data data-visualization dataanalysis excel mysql powerbi sql
Last synced: 01 Apr 2025
https://github.com/45harry/potato_disease_classification
Potato Disease Classification - Traning, Rest Api and FrontEnd to Test
cnn-classification data data-science datapreprocessing deep-learning fastapi flaskapi frontend keras restapi tensorflow
Last synced: 12 Apr 2026
https://github.com/giuleo129/dataanalysis
This folder contains two projects focused on data analysis and statistical learning using R, covering exploratory data analysis, modeling, and predictive techniques.
data data-analysis data-science statistical-learning
Last synced: 25 Jan 2026
https://github.com/jpcadena/ventas-facturas
Ventas con facturas
data data-analysis data-exploration data-extraction data-science excel feature-engineering matplotlib microsoft numpy pandas powerbi product-sales pylint python receipts sales
Last synced: 12 Apr 2026
https://github.com/rrwen/r-reference
Quick reference to learning R
analysis beginner data guide introduction learn r reference statistics stats syntax
Last synced: 02 Jul 2025
https://github.com/yashaswitir28/yashaswitir28.github.io
This is my Portfolio Website
data data-analysis-python data-analyst data-cleaning data-science data-visualization excel html-css ms office365 portfolio-website powerbi python sql
Last synced: 29 May 2026
https://github.com/0xHericles/SpamDetector
:email: A Simple Python Spam Detector with Scikit-Learn
data ham machine-learning python sklearn spam
Last synced: 24 Mar 2025
https://github.com/natanast/euroleaguebasketball
An R package providing data on Euroleague Basketball
Last synced: 01 Apr 2025
https://github.com/v-mayya/quantitative-analysis-data-dashboard
Quantitative survey data analysis using R
data data-analysis data-visualization flourish r
Last synced: 01 Apr 2025
https://github.com/ludwing-mj/manipulacion_ej
Ejercicio utilizado en la seccion numero ocho del manual para ejemplificar las herramientas proporcionadas por el tydyverse para la manipulacion de datos.
data manipulate-data package r
Last synced: 01 Apr 2025
https://github.com/suchi25sathavara/data-wrangling-with-r
Analyzing Road Accidents in Victoria, Australia
data r reporting rstudio wrangling-data
Last synced: 01 Apr 2025
https://github.com/suchi25sathavara/r-projects
R projects in Real world Scenerios for Data Analysis
data data-analysis datavisualization r
Last synced: 01 Apr 2025
https://github.com/seif-elkateb/dataset-analysis-r
cu-boulder data data-analysis datamodeling datascience ms-ds msds434 r
Last synced: 01 Apr 2025
https://github.com/wraith13/systematic-metasyntactic-variables
This is a list for that you can express the existence of different serieses when using metasyntax variables.
Last synced: 14 Jun 2025
https://github.com/armand-sauzay/datasets
Datasets for machine learning
ai data datasets machine-learning ml
Last synced: 18 Jan 2026
https://github.com/abhijeetdasbakshi/ecommerce-insights
A Dockerized end-to-end project that combines unsupervised machine learning for customer segmentation with scalable data pipelines. It uses MongoDB for data ingestion, Scikit-learn for clustering, Airflow for orchestration, and Streamlit for interactive visualization — enabling actionable insights into e-commerce
airflow airflow-dags ci-cd-pipeline clustering dags data data-pipelines docker docker-compose docker-container dockerfile git great-expectations kafka mongodb pca-analysis postgresql pyspark t-sne umap-learn
Last synced: 04 Apr 2026
https://github.com/h-sutiwas/r2de-2025
This repository is related to the Road To Data Engineer Bootcamp by DataTH. It contains all related coursework, some mini projects and other resources within the field of Data Engineering.
data data-engineering data-visualization docker gcp pipeline spark
Last synced: 30 Apr 2026
https://github.com/pchaparro/search-engine
Full stack search-engine created from youtube videos obtained using "web-scraping"
data opensearch python python3 react scraper scraping scraping-websites search search-engine semantic-search sentence-transformers typescript website
Last synced: 17 Apr 2026
https://github.com/smaug6739/data-bit
This project is a module for converting a structured dataset into a number that can be stored in a database taking up little space.
Last synced: 14 May 2026
https://github.com/rahult18/atmo-flow
AtmoFlow is a robust data engineering pipeline built on Google Cloud Platform (GCP) that processes and analyzes weather and air quality data in both batch and streaming modes
airflow data data-modeling data-science data-visualization dataengineering gcp-bigquery gcp-cloud-composer gcp-cloud-functions pyspark
Last synced: 23 Jun 2026
https://github.com/makcymal/silvera
My researches on ML and statistics, optimization methods, CS algoritms and numerical methods
algorithms data data-structures machine-learning numerical-methods statistics
Last synced: 01 Apr 2025
https://github.com/faster-games/dynamic-components
Dynamic Runtime Components for Unity3D
Last synced: 11 Apr 2026
https://github.com/johnelliott/wb-web
Moved —> https://github.com/johnelliott/waybot
arduino browser data iot raspberry-pi web
Last synced: 12 Apr 2026
https://github.com/awpala/udemy-my-courses-data-parser
Download Udemy lists and courses metadata for authenticated student user
Last synced: 07 May 2026
https://github.com/dvaser/heart-attact-analysis-prediction
DATA ANALYSIS
classification data data-analysis data-visualization jupyter jupyter-notebook lineer-regresyon machine-learning python regression
Last synced: 20 Jan 2026
https://github.com/turner-kendall/turner-kendall
Turner Kendall - dev, opps, sec.
config data github-config go rust security
Last synced: 31 Oct 2025
https://github.com/flowsynx/plugin-base64
FlowSynx plugin to provides encoding and decoding of Base64 strings, allowing workflows to handle Base64 content transformations efficiently.
base64 base64-decoding base64-encoding data data-platform decoding encoding flowsynx flowsynx-plugins
Last synced: 10 Mar 2026
https://github.com/shubhamsoni98/project_using_knn
This project applies the K-Nearest Neighbors (KNN) algorithm to predict iPhone purchases based on customer data. Using features like age, salary, and previous purchase behavior, the KNN model classifies customers into buyers and non-buyers.
anaconda analytics data data-science eda knn knn-classification machine-learning-algorithms predict project python scikit-learn tableau
Last synced: 03 Jan 2026
https://github.com/lananolana/test_data_generator
Generate test data with Telegram bot in one click: random users, files, texts and credit cards.
credit-card data data-generation fake-data random telegram-bot test-data test-data-generator test-file-generator testing testing-tools text-generation user-generator
Last synced: 18 Jan 2026
https://github.com/lohithgsk/dynamic-qr-generator
A Python-based QR generator application was developed using the qrcode and Pillow libraries, dynamically generating QR codes for custom data inputs. Designed for a college grievance management system, the application creates QR codes containing block, floor, room, and machine numbers, allowing easy placement and identification on each floor.
data pillow python qrcode qrcode-generator
Last synced: 16 Mar 2025
https://github.com/alextanhongpin/node-github-api
:page_with_curl: sample github api queries with nodejs for scraping purposes
Last synced: 06 May 2026
https://github.com/gaemapiracicaba/norma_dec_8468-76
Padrões de qualidade e lançamento de efluentes de águas interiores
Last synced: 19 Apr 2026
https://github.com/jameshenderson12/chatbot-utils
Generic data and elements that can be reused or repurposed for chatbot development.
boilerplate chatbot data development elements intents template utterances
Last synced: 04 Mar 2026
https://github.com/shudhanshusaurabh001/super_market-data-analysis-using-python
This project focuses on analyzing supermarket sales data using Python. The goal is to extract meaningful insights from the dataset, such as sales trends, customer purchasing behavior, and product performance.
analysis csv data insights matplotlib numpy pandas project python seaborn
Last synced: 06 Apr 2026
https://github.com/amethyst-php/activity
Someone just did something, should we save who did this and when?
activity amethyst amethyst-package api data laravel
Last synced: 17 May 2026
https://github.com/alsult/alsult
Aliia Sultanova Portfolio
data datascience programming python
Last synced: 23 Jan 2026
https://github.com/amethyst-php/owner
amethyst amethyst-package api data laravel owner
Last synced: 28 Apr 2026
https://github.com/steveanik/kestra
Infinitely scalable, event-driven, language-agnostic orchestration and scheduling platform to manage millions of workflows declaratively in code.
data data-engineering data-integration data-pipeline data-quality elt etl low-code orchestration pipelines scheduler workflow workflow-engine
Last synced: 06 Jan 2026
https://github.com/zainea-bogdan/data_engineer_project_wowcinema
WoWCinema is a project based on a fictional scenario where I stepped into the role of a Data Engineer, designing and building an end-to-end Data Infrastructure. A ETL pipeline ingests data from multiple sources, transforms it, and loads it into a centralized PostgreSQL data warehouse to power analytics, KPI tracking, and reporting
analytics big-data data datawarehousing etl-pipeline postgres python sql
Last synced: 19 May 2026
https://github.com/murshidazher/client-side-data-storage
🚌 A workspace containing client-side data storage implementations
cache cache-storage client-side data indexeddb localstorage sessionstorage storage websql
Last synced: 02 Sep 2025
https://github.com/mikeqfu/network-rail-track-fixity-layer
This project develops a data mining tool for analysing and predicting track movements using asset data, environmental factors and track design knowledge to model key parameters and generate fixity values for the GB rail network.
data data-integration data-mining data-science information-management knowledge-discovery point-cloud rail rail-alignment rail-track track-fixity
Last synced: 02 Sep 2025
https://github.com/purarue/scramble-history
parses rubiks cube scramble history/solve time from cstimer.net, cubers.io, twistytimer -- merges them together giving you uniform averages/data/graphs
cstimer cubing data rubiks-cube speedsolving
Last synced: 11 Jun 2025
https://github.com/bablukumarjha/startup-funding-revenue-analysis-by-sql-and-pandas
SQL project analyzing startup funding, revenue, and founder data to extract business insights using Python and MySQL.
data data-analysis data-platform data-science dataanalysisusingpython dataanalytics pandas-dataframe pandas-library python sql sql-server sqlalchemy sqldatabase
Last synced: 18 May 2026
https://github.com/plnech/never2late
Never 2 Late - a reinterpretation of Everest Pipkin's 'i've never picked a protected flower'
dada dada-science data generative-art glitch-art installation nlp poetry spacy vector-similarity wallpaper
Last synced: 10 Jun 2025
https://github.com/prajakta1321/streetml-a-cityscape-traffic-volume-prognostication
StreetML leverages ML learning techniques to revolutionize urban traffic prediction through precise volume prognostication, aiming to enhance cityscape mobility through data-driven insights.
catboostregressor data datavisualisation exploratory-data-analysis lightgbm-regressor linearregression machine-learning machine-learning-algorithms predictive-analytics random-forest-regression xgboost-regression
Last synced: 08 Apr 2025
https://github.com/solrikk/bluemoon
This project is a Go language tool designed to automatically download, process, and save product data from a remote server into a CSV file.
analyze converter data go golang xml-parser
Last synced: 31 Jul 2025
https://github.com/srvanderplas/statistical_atlas
Framed Charts and the Statistical Atlas of 1870
census data ggplot2 graphics r statistics visualization
Last synced: 29 May 2026
https://github.com/juanandres-montero/dataanalysis
Dedicado al análisis de datos.
Last synced: 10 Aug 2025
https://github.com/sasanthns/sql_data_warehouse_project
A comprehensive guide to building a modern data warehouse with SQL Server, including ETL processes, data modeling, and analytics.
data data-analysis data-science data-warehouse datacleaning etl etlpipeline sql sqlserver
Last synced: 24 Mar 2025
https://github.com/nolanbconaway/rollercoaster-tycoon-data
Every roller coaster I have built in RCT2 for iPad
Last synced: 24 Mar 2025
https://github.com/rubyonworld/ldpath
This is a ruby implementation of LDPath, a language for selecting values linked data resources.
Last synced: 12 Nov 2025
https://github.com/plateformeio/docs
The official documentation of the Plateforme framework
api app asgi async data db docs fastapi plateforme pydantic python restx services sqlalchemy
Last synced: 11 Apr 2026
https://github.com/powersyang/visualization
data visualization templates 数据可视化模板
Last synced: 24 Mar 2025
https://github.com/team-hydrogen/nasa-adc-data
All files relating to the computation of the data provided
data jupyter-notebook nasa-app-development-challenge
Last synced: 25 Mar 2025
https://github.com/parablelab/parable
Work in progress...
data data-management data-platform data-validation database pipelines
Last synced: 28 May 2026
https://github.com/igor-starostenko/sabre
Slice your files like a champ with **sabre**
Last synced: 28 Mar 2025
https://github.com/robson-python/academic-performance
Project to evaluate students' academic performance.
csv-import data data-analysis data-science jupyter-notebook machine-learning matplotlib pandas python scikit-learn seaborn vscode
Last synced: 12 Apr 2026
https://github.com/stdlib-js/ndarray-vector-int8
Create a signed 8-bit integer vector (i.e., a one-dimensional ndarray).
constructor ctor data int8 javascript ndarray node node-js nodejs stdlib structure types vec vector
Last synced: 24 Apr 2026
https://github.com/filipnet/infoscreen
Arduino subscribes values by MQTT and view info on an OLED I2C display
arduino data display i2c mqtt oled-display-ssd1306 visualization weather weatherstation
Last synced: 12 Apr 2026
https://github.com/meizuflux/cion
Python minimal data validation library
data minimal python validation
Last synced: 28 May 2026
https://github.com/mevlutcelik/turkey-cities-data
📍 Türkiye şehirlerine ait şehir verisi paketi: Plaka, koordinat (lat/lon), nüfus (2024 ADNKS) ve coğrafi bölge bilgilerini içerir.
cities coordinates data json nufus plaka turkey turkiye typescript
Last synced: 10 Mar 2026
https://github.com/zoekelepiri/winedataprediction
A machine learning application in wine quality prediction
data descriptive-statistics machine-learning-algorithms
Last synced: 05 Jan 2026
https://github.com/entropyorg/p5-data-testimage
:notebook::camera: interface for retrieving test images
Last synced: 29 May 2026