data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-07-02 00:07:45 UTC
- JSON Representation
https://github.com/pixlcrashr/stwhh-mensa
Better STWHH Mensa menu data / interface / notifier
api crawler data food studierendenwerk-hamburg university website
Last synced: 07 Aug 2025
https://github.com/sjg/my-search-story
My Search Story is a demo application developed for the Data Portability API Workshop and the #AISprint2025 events. #BuildwithAI
data docker generative-ai google-cloud-platform google-cloud-run nodejs
Last synced: 04 May 2026
https://github.com/analyst-amitbisht/Pizza-Sales-Report-
Its a guided project to practice tools like SSMS + Power BI & also skills like data cleaning, data exploration, data analysis, data visualization, etc.
analytics data data-visualization powerbi sql-server
Last synced: 01 Oct 2025
https://github.com/theprodigyleague/d1g174lx534f00d
react/node bootstrapped project for a digi(company){["SEAFOOD"]}
bootstrap companies data data-conduit digital digital-seafood java javascript node project react seafood
Last synced: 01 Oct 2025
https://github.com/natanast/euroleaguebasketball
An R package providing data on Euroleague Basketball
Last synced: 01 Apr 2025
https://github.com/v-mayya/quantitative-analysis-data-dashboard
Quantitative survey data analysis using R
data data-analysis data-visualization flourish r
Last synced: 01 Apr 2025
https://github.com/suchi25sathavara/data-wrangling-with-r
Analyzing Road Accidents in Victoria, Australia
data r reporting rstudio wrangling-data
Last synced: 01 Apr 2025
https://github.com/suchi25sathavara/r-projects
R projects in Real world Scenerios for Data Analysis
data data-analysis datavisualization r
Last synced: 01 Apr 2025
https://github.com/wraith13/systematic-metasyntactic-variables
This is a list for that you can express the existence of different serieses when using metasyntax variables.
Last synced: 14 Jun 2025
https://github.com/apostolissiampanis/weather-app-api
WeatherApp is a Java-based console application that retrieves and processes weather data using the wttr.in web service.
api data hibernate java json lombok objected-orientated-programing oop spring-boot spring-data-jpa sqlite webflux
Last synced: 05 May 2026
https://github.com/victorowinoke/custmer-segmentation-using-rfm-python-
Customer Segmentation using the Recency, Frequency and Monetary Values
customer-segmentation data data-visualization python3 science time-series-analysis
Last synced: 26 May 2026
https://github.com/elissorokin/data-analyst-portfolio
Это репозиторий, в котором я демонстрирую свои навыки, делюсь проектами и отслеживаю прогресс в области анализа данных и Data Science.
ab-testing data data-analysis datalense matplotlib numpy pandas plotly portfolio postgresql python scipy seaborn sql statistical-analysis
Last synced: 09 Apr 2026
https://github.com/inist-cnrs/ws-data
Modèles et données pour les web services
Last synced: 03 Sep 2025
https://github.com/ashita-ai/ashita-ai.github.io
Ashita AI - The island of misfit data tools
Last synced: 19 Feb 2026
https://github.com/lohithgsk/dynamic-qr-generator
A Python-based QR generator application was developed using the qrcode and Pillow libraries, dynamically generating QR codes for custom data inputs. Designed for a college grievance management system, the application creates QR codes containing block, floor, room, and machine numbers, allowing easy placement and identification on each floor.
data pillow python qrcode qrcode-generator
Last synced: 16 Mar 2025
https://github.com/e22m4u/ts-data-schema
Валидация данных и приведение типов для TypeScript
data schema typescript validation
Last synced: 05 Aug 2025
https://github.com/muhamedlabs/muhamed_onedrive
Muhamed_OneDrive - це надійне і зручне хмарне сховище для файлів, розроблене для безпечного зберігання і легкого обміну даними.
data html5 onedrive programming style
Last synced: 04 Jan 2026
https://github.com/ompreetham/data-structures
binary-search-tree c data data-structures datastructures graph linked-list list stack structures tree
Last synced: 25 Mar 2025
https://github.com/steveanik/kestra
Infinitely scalable, event-driven, language-agnostic orchestration and scheduling platform to manage millions of workflows declaratively in code.
data data-engineering data-integration data-pipeline data-quality elt etl low-code orchestration pipelines scheduler workflow workflow-engine
Last synced: 06 Jan 2026
https://github.com/purarue/scramble-history
parses rubiks cube scramble history/solve time from cstimer.net, cubers.io, twistytimer -- merges them together giving you uniform averages/data/graphs
cstimer cubing data rubiks-cube speedsolving
Last synced: 11 Jun 2025
https://github.com/filiprokita/tobase64
This Python program encodes a file in base64 format and saves the result to a new file with a ".b64" extension. It is a command-line tool that can be used to automate file encoding tasks.
base64 command-line data data-conversion data-manipulation data-privacy data-prottection data-security encoding file file-conversion file-handling python python-script python3 tobase64
Last synced: 30 Jun 2025
https://github.com/srvanderplas/statistical_atlas
Framed Charts and the Statistical Atlas of 1870
census data ggplot2 graphics r statistics visualization
Last synced: 29 May 2026
https://github.com/dug22/jjournal
A Jupyter like notebook software for Java
data data-analysis data-science java jshell jshell-repl notebook swing swing-application
Last synced: 11 Apr 2026
https://github.com/0xhericles/ufcg-geojson
GeoJSON file containing the blocks and buildings of the Federal University of Campina Grande.
data data-visualization geojson map open-source ufcg university
Last synced: 09 Feb 2026
https://github.com/team-hydrogen/nasa-adc-data
All files relating to the computation of the data provided
data jupyter-notebook nasa-app-development-challenge
Last synced: 25 Mar 2025
https://github.com/robson-python/academic-performance
Project to evaluate students' academic performance.
csv-import data data-analysis data-science jupyter-notebook machine-learning matplotlib pandas python scikit-learn seaborn vscode
Last synced: 12 Apr 2026
https://github.com/haimonmon/j3mify
Convert your jejemon word into a formal sentence or word
data jejemon nlp normalization python regex tagalog tokenization
Last synced: 12 Oct 2025
https://github.com/gabya06/twitter_models
Repository used for twitter impression models
data data-science impressions machinelearning python ridge-regression sklearn twitter
Last synced: 04 May 2026
https://github.com/fuzzt/location-analyzer
The Location Data Analyzer is a Spring Boot application that offers insights on location data, such as counting locations by type, calculating average ratings, and identifying the most reviewed and incomplete entries. It features a simple frontend (HTML, CSS, JavaScript) and is deployed on Render.
analysis api average css data deployment docker fetch-api frontend html javascript location maven ratings render restful-api reviews spring-boot techstack
Last synced: 11 Apr 2026
https://github.com/karensaraimoralesmontiel/8-week-sql-challenge
Case Studies Solutions for the 8-Week-SQL-Challenge.
Last synced: 02 Jan 2026
https://github.com/keminghe/osu
Unofficial and publicly-available NPM data-package about The Ohio State University.
college data majors ohio-state organizations public students university unofficial
Last synced: 06 Jan 2026
https://github.com/andrii04/andreamonforte-bi-assignment
Automated Data Pipeline that ingests daily GA4-formatted CSV files from a private Google Cloud Storage bucket, validates and loads them into BigQuery, and prepares analysis-ready views. The solution is built for deployment as a Cloud Function triggered by Cloud Scheduler and uses Python with the Google Cloud Storage and BigQuery client libraries.
automation bigquery cloud cloudfunctions data data-analysis data-engineering etl etlpipeline gcp google googlecloudplatform pipeline python sql
Last synced: 09 Nov 2025
https://github.com/adilsaid64/real-time-data-monitoring
Exploring what a real-time data drift monitoring solution could look like within MLOps
data datadrift grafana machine-learning mlops mlops-workflow prometheus python software-engineering
Last synced: 04 Aug 2025
https://github.com/mamskie/visdat
google collab
colab-notebook data visualization
Last synced: 03 Aug 2025
https://github.com/ddeepanshu-997/support_vector_regression--svr-
In this repository i performed a support vector regression on real life data , initially i performed some data preprocessing technique in order to filter out the data flaws then undergoes the process of model building i.e SVM regression in order to make a machine learning regression model.
data data-science regression-analysis regression-models svm-model svm-regression
Last synced: 03 Aug 2025
https://github.com/farhad2415/job_scraper
Job Site Based Job Scraping with python
automation bash-script data data-scraping data-structures python selenium selenium-python
Last synced: 05 May 2026
https://github.com/luciarevaliente/shell_script_data_cleaning
This project focuses on cleaning and processing datasets using Shell scripts. It is part of the Fundamentals of Informatics course (2022-23) and involves handling movie and show data to create cleaned and filtered datasets for further analysis.
data data-cleaning shell-script
Last synced: 04 Feb 2026
https://github.com/getconversio/dig-the-data
Data visualizations for the Conversio blog
Last synced: 12 Apr 2026
https://github.com/a-poor/datatransform.jl
A package for defining (and performing) tabular-data transformations with JSON.
data data-science data-transformation etl feature-engineering json julia julia-package tabular-data
Last synced: 05 May 2026
https://github.com/guilyx/airplane-booking
Simple airline ticket reservation program.
Last synced: 25 Jun 2025
https://github.com/matheusafonseca/deploy-ml-models-with-streamlit-udemy
This repository is dedicated to storing the code developed during the "Machine Learning Model Deployment with Streamlit" course on Udemy. The course covers basic to advanced techniques for deploying machine learning models using Streamlit.
data data-science data-visualization interface joblib layout machine-learning optimization-algorithms python python3 sklearn sklearn-datasets sklearn-library sklearn-pipeline streamlit
Last synced: 19 Apr 2026
https://github.com/trissim/polystore
Framework-agnostic multi-backend storage abstraction for ML and scientific computing
backend data io jax ml multi-framework numpy pytorch scientific-computing storage tensorflow zarr
Last synced: 12 Apr 2026
https://github.com/jk-oster/laravel-collection-trend
Generate trends from collections. Easily generate charts or reports.
charts collections data laravel php reports trends
Last synced: 03 Aug 2025
https://github.com/contawo/travel-journal
This is a travel journal application for storing all the places that you have visited. I was learning by doing react when creating this project. I learnt a lot with it and upgraded my reactjs skills.
data learning-by-doing props reactjs
Last synced: 05 May 2026
https://github.com/servierhub/adsv
Analyze delimiter-separated values files
csv csv-converter csv-format csv-parser csv-parsing csv-reader csv-reading data data-analysis data-engineering data-mining
Last synced: 28 Sep 2025
https://github.com/vidushibhadana/eda-on-nyc-taxi-data
About Conducting an Exploratory Data Analysis (EDA) on New York City taxi data and visualizing it through countplots, distribution plots (displot), and histograms using Python and it's libraries.
data data-visualization jupyter-notebook matplotlib numpy pandas python seaborn
Last synced: 11 Apr 2026
https://github.com/climate-resource/input4mips_validation
Validation of input4MIPs data
cmip data forcing input4mips validation
Last synced: 20 Jan 2026
https://github.com/unkaktus/pktconn
wrapper around io.ReadWriteCloser that implements gopacket's 'device'
connection data gopacket packet
Last synced: 29 May 2026
https://github.com/dahsie/machine_learning_from_scratch
This project aims to implement some machine learning basic techniques(e.g. MinMaxScaler, StandardScaler, TD-IDF, PCA, Logistic Regression, LDA, KNN, Naive Bayes Classifier) using only pyton, numpy and pandas. This will enable me to have hone my data scientist skills
classification clustering data data-processing datascience machienlearning nlp nltk numpy pandas python regression
Last synced: 04 May 2026
https://github.com/theanujsinha01/mcdonalds-customer-analysis
This project analyzes customer feedback data to understand what drives people to like or dislike McDonald’s. Using Python and data visualization tools in a Jupyter Notebook, we explore how different factors—such as taste, price, health, and visit frequency—affect customer satisfaction.
case-study data data-visualization dataanalysis
Last synced: 05 Sep 2025
https://github.com/kahlery/my-jupyter-notebook-projects
🐊 collection of my data science analysis, actually I store most of my data science projects in my google drive because of google colab
Last synced: 12 Apr 2026
https://github.com/alecxcode/table-parser
Python Table Parser (data extraction)
automation data extraction python robotic-process-automation
Last synced: 04 May 2026
https://github.com/jigyasag18/fake-news-prediction-app
The Fake News Prediction App Repository offers a machine learning project that focuses on identifying the authenticity of news articles as fake or real. It uses a dataset of 20,000 articles and employs methods such as TF-IDF vectorization and the Lemmatization algorithm, achieving ~95% classification accuracy with random forest classifier model
data datapreprocessing logistic-regression machine-learning machine-learning-algorithms numpy pandas prediction stemming streamlit streamlit-webapp vectorization
Last synced: 11 Apr 2026
https://github.com/vara-co/tech-certifications
These are the certifications that back-up some of my skills.
certificates certifications data data-analytics skills
Last synced: 07 Jan 2026
https://github.com/inphyt/quantitative_single_neuron_modeling_competition_2007
Data for the Quantitative Single-Neuron Modeling Competition (2007).
bayesian-inference bayesian-methods bayesian-optimization bayesian-statistics challenge competition computational-neuroscience data electrophysiological-data electrophysiology model-calibration modeling neuronal-models neuroscience neuroscience-competition parameter-estimation simulation simulation-modeling single-neuron-model uncertainty-quantification
Last synced: 26 Jul 2025
https://github.com/abdullahashfaqvirk/Earth-Engine-Data-Scraper
A Python based web scraper designed to extract and organize dataset metadata from the Google Earth Engine Datasets Catalog for research, and analysis purposes.
beautifulsoup data data-science python requests scraper web-scraping
Last synced: 27 Sep 2025
https://github.com/mcraiha/datagensharp
C# managed library for generating data
Last synced: 11 Aug 2025
https://github.com/ournet/topics-data
Ournet topics data package
data ournet storage topic topics topics-data topics-storage
Last synced: 12 Jun 2025
https://github.com/sirmaxx/log_manager
log manager services for microservices
data fastapi logging microservice mongodb
Last synced: 09 Apr 2026
https://github.com/naliferopoulos/datamining
Bring your own pickaxe.
aueb aueb-students data data-mining machine-learning machine-learning-algorithms mining random-forest
Last synced: 25 Jan 2026
https://github.com/illustratien/toolphd
Make your analysis simple and reproducible
academic analysis data phd publications r r-package reproducible-research scientific
Last synced: 26 Jan 2026
https://github.com/yash-rewalia/airbnb_eda_pandas
The goal of the project is to gather information and analyze the detailed information of the different entries in order to provide insights about the host and price of the property in a particular area as per your preference , type of rooms and number of reviews accordingly.
data data-cleaning data-insights data-preprocessing data-visualization matplotlib numpy pandas python seaborn
Last synced: 11 Apr 2026
https://github.com/teragrep/rsm_01
Teragrep record schema mapper library for Java
data data-mining data-science datascience java-library liblognorm log-analysis log-management schema-mapper structured-data structured-logging teragrep unstructured-data
Last synced: 09 Apr 2026
https://github.com/ismailhakkii/digital_vault
This project can be used for securing data, similar to a real vault.
data digital security-data vault
Last synced: 25 Mar 2025
https://github.com/entorb/analyze-ha-energy
Analyze Home Assistant Solar Production Data
data home-assistant pandas photovoltaic pv python
Last synced: 08 May 2026
https://github.com/badranalyst/covid-deaths-and-vaccinations-sql-data-exploration
This project involves exploratory data analysis on COVID-19 deaths and vaccinations data using SQL. It aims to uncover trends, patterns, and insights related to vaccination rates and their impact on mortality. The analysis provides a clearer understanding of the pandemic's dynamics, facilitating data-driven decisions in public health.
covid-19 data data-exploration dataset sql
Last synced: 19 Feb 2026
https://github.com/thedevreda/jadaerospace
A Real life project showing how to improve selling aircraftparts and helping salers to focus more on effective products at JadAero
data data-analysis data-cleaning data-visualization jupyter-notebook powerbi python
Last synced: 02 Aug 2025
https://github.com/anct-cartographie-nationale/mednum-cli
✨ Interface en ligne de commande pour la transformation des données de lieux de médiation numériques collectées dans un format non standard vers le schéma de la mednum et leur publication sur data.gouv
anct betagouv data donnees gouvernement mediation-numerique nodejs open-data transformation
Last synced: 02 Aug 2025
https://github.com/so-cool/junction
My solution to the University of Bristol "Bristol Journey Time" Data Challenge https://So-Cool.github.io/junction
competition data modelling timeseries
Last synced: 02 Apr 2025
https://github.com/oroszgy/hunlp-resources
Scripts and resources for making spaCy understand Hungarian.
corpus-linguistics data hungarian hungarian-language hunlp magyarlanc model natural-language-processing nlp resources script spacy wikipedia
Last synced: 18 May 2026
https://github.com/etmendz/mendz.data.oracle
Provides a generic Mendz.Data-aware context for ADO.Net-compatible access to Oracle databases.
ado-net context data database datasettings mendz oracle
Last synced: 13 Apr 2026
https://github.com/supunlakmal/coronavirus-covid-19-status
Covid 19 cases and death count for each country in a json file.
coronavirus count country covid-19 covid-data covid19 data data-science data-visualization geographical geographical-information-system json
Last synced: 21 Jun 2026
https://github.com/plurid/defocus
Apophatic User Content Resolution [Desearch Concept]
Last synced: 08 Nov 2025
https://github.com/cljoly/data
📊 Data sets to populate some parts of my website (mostly https://cj.rs/open-source/).
Last synced: 03 May 2026
https://github.com/abdiasarsene/edusight-data-driven-insights-for-smarter-education
EduSight transforms educational data into actionable insights, helping NGOs, schools, and policymakers improve academic performance, optimize resources, and evaluate learning programs for better outcomes.
Last synced: 26 Jan 2026
https://github.com/sohomm/predict-insurance-charges
A predictive model to estimate the insurance charges based on a client's attributes, such as age and health factors. It offers a practical application of ml in business, enabling more accurate pricing models and helping companies manage risk while delivering personalized pricing strategies to clients.
administration algorithm bot data decision-trees download easy finance github java machine-learning management model neural-network nlp prediction project science trading university
Last synced: 05 May 2026
https://github.com/agustinmusanti/sqlchallenge-7
Resolución de un extenso desafío de SQL propuesto por el profesor Diego Moisset De Espanes, quien comparte ejercicios para aprender y practicar SQL Server a través de su canal de YouTube.
challenge data learning sqlserver
Last synced: 15 Apr 2025
https://github.com/reubano/devcraft-workshop
Materials for the DevCraft workshop on stream processing
data functional-programming meza python riko stream-processing tutorial
Last synced: 04 May 2026
https://github.com/donmaruko/python-eda-toolkit
CLI-runned EDA with 30 commands utilizing text-related functions, statistical calculations, data visualization, and data manipulation.
data data-analysis data-science data-visualization matplotlib pandas scipy seaborn statistical-analysis statistics wordcloud
Last synced: 06 May 2026
https://github.com/codegouvfr/codegouvfr-sources
🧢 Static web frontend for code.gouv.fr
bluehats codegouvfr data frontend
Last synced: 28 Feb 2025
https://github.com/bhojpur/dlm
The Bhojpur DLM is a software-as-a-service product used for Data Lifecycle Management based on Bhojpur.NET Platform for data delivery.
Last synced: 19 Feb 2026
https://github.com/abhibisht89/data-visualization
data matplotlib pandas ploty python visualization
Last synced: 06 May 2026