data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-07-03 00:07:49 UTC
- JSON Representation
https://github.com/bilgehangecici/datatypeconverter
Converting integer and floating numbers to appropriate bit-level representation.
data datatypeconverter java machine-level variables
Last synced: 30 Mar 2025
https://github.com/passly-nl/data
Source code of the data layer.
data passly ticketing typescript
Last synced: 27 May 2026
https://github.com/sungchun12/demotron
CLI to delight real people with live demos
Last synced: 26 Feb 2025
https://github.com/agdturner/ccg-data
A modularised Java library for processing data sets with classes for: data records; collections of data records; and identifiers.
Last synced: 12 Jan 2026
https://github.com/gman-au/white-knight-neo4j
Neo4j implementation of White Knight data abstraction library
abstractions data datastore dotnet neo4j repository-pattern specification-pattern
Last synced: 20 Jan 2026
https://github.com/koppalexander/flightdelaychallenge
This project focuses on predicting flight delays using historical data from a Tunisian airline. We analyzed patterns in airport operations and flight schedules to build a machine learning model that can forecast potential delays.
data data-science machine-learning machine-learning-algorithms machinelearning prediction predictive-modeling
Last synced: 19 Jun 2026
https://github.com/shubhammittal-data/hr_dashboard_tableau
An interactive HR Analytics Dashboard built using Tableau. Provides insights into workforce demographics, hiring trends, salary analysis, and employee records for data-driven decision-making.
chatgpt4 data data-analysis data-visualization drawio-tools faker-generator hr-analytics hr-analytics-dashboard human-resources numpy python tableau tableau-public
Last synced: 17 May 2026
https://github.com/badranalyst/data-cleaning-and-exploratory-data-analysis-project
This project uses SQL to clean and analyze a layoffs dataset. Data cleaning tasks include removing duplicates, standardizing values, and handling missing data. Exploratory analysis is performed to identify trends in layoffs across companies, industries, and time periods.
cleaning-data data database dataset mysql mysql-database sql
Last synced: 07 Apr 2025
https://github.com/piyushkumar2025/india-general-elections-2024_data-analyst
Analyzed election data for 540+ constituencies and 100+ parties using SQL. Calculated state-wise seat distributions, classified 30+ parties into alliances, identified top 10 candidates by EVM votes, calculated victory margins, and analyzed voting patterns for 300+ candidates to uncover key insights.
analytics data database mysql sql statistics
Last synced: 22 May 2026
https://github.com/aminnairi/node-decode
Check that your data meet your expectations
check data decode expectations schema
Last synced: 22 Apr 2026
https://github.com/lancewalk87/cls-cloud-sync-ruby-on-rails
Software | SQL Database with automated Cloud Sync for mitigating lost data across dist. servers. Managed by Ruby on Rails.
cloud-computing cloud-storage data database ruby ruby-application ruby-on-rails server sql
Last synced: 24 Jul 2025
https://github.com/team-hydrogen/2025-adc-data
All files relating to the computation of the data provided
data jupyter-notebook nasa-app-development-challenge
Last synced: 11 Apr 2025
https://github.com/tks18/xl-pq-handler
A Pythonic Power Query (.pq) File Manager for Excel & Power BI Automation
analytics automation data excel power-query powerbi python xlwings
Last synced: 20 Jan 2026
https://github.com/snimmagadda1/luigi-etl-example
🔍 Example of an ETL pipeline using Spotify's Luigi
data luigi luigi-pipeline python spotify
Last synced: 30 Mar 2025
https://github.com/carlosrs14/parallel-data-preprocessig-system
A parallel data preprocessing system using threads and synchronization mechanisms (barrier, busy-waiting, condition variables) to clean and prepare data for AI training.
barrier-method c condition-variable data operative-systems parallel-computing posix preprocessing synchronization threads
Last synced: 24 Jul 2025
https://github.com/nmelgar/birthday_sports_dataviz
We will analyze how the Matthew Effect has influenced in professional sports players.
analysis csv data data-analysis data-science data-visualization datavisualization dataviz probability research tableau
Last synced: 08 Jan 2026
https://github.com/als8446/tripleten-data-science-projects
Projects Overview Projects made in the Data Scientist course from TripleTen LatAm
data data-analysis hypothesis-tests machine matplotlib numpy pandas python scipy sklearn
Last synced: 10 Apr 2026
https://github.com/rubyonworld/ruby-cldr
The Unicode Consortium's Common Locale Data Repository (CLDR) contains tons of high-quality locale data such as formatting rules for dates, times, numbers, currencies as well as language, country, calendar-specific names etc.
common data locale repository ruby unicode
Last synced: 12 Nov 2025
https://github.com/ttozatto/sparkify
Churn Prediction for music streaming app with PySpark
analysis churn data learning machine predictive pyspark science spark
Last synced: 16 Jan 2026
https://github.com/nevoland/unchangeable
🧊 Tools for immutable values.
data datastructure functional immutable persistent pure stateless
Last synced: 24 Jul 2025
https://github.com/goutamhegde002/dsa-roadmap-for-beginners
The "DSA Roadmap for Beginners" repository is a comprehensive guide designed to help beginners learn Data Structures and Algorithms (DSA) efficiently. It provides structured content covering fundamental and advanced topics in DSA, with practical examples, exercises, and coding problems.
beginner beginner-friendly beginner-guide coding-practice data data-science data-structures data-structures-and-algorithms dsa dsa-algorithm dsa-learning-series dsa-practice dsa-roadmap interview-preparation interview-resources programming-fundamentals
Last synced: 28 Feb 2026
https://github.com/priyapuranik/data-analytics-using_python
Analyzed data of Hotels and find out meaningful insights from it including booking patterns and seasonal trends and many more.
data pandas python sql visualization
Last synced: 06 Apr 2026
https://github.com/justinyahin/wpdf
Create, filter, sort and display users data on your WordPress site.
Last synced: 18 Apr 2026
https://github.com/itrauco/streaming-data-platform
skeleton streaming data platform on gcp...
big-data data data-engineering data-infrastructure data-science engineering google-cloud platform-engineering python streaming-data
Last synced: 13 Jun 2026
https://github.com/fatihemres/Africa
Africa app by SwiftUI. Using AVFoundation, MapKit, data, models, animations, stickers.
animations avfoundation data mapkit models swift swift-animations swiftui
Last synced: 31 Aug 2025
https://github.com/fatihemres/Fruits
Fruit Details app by SwiftUI. Using data, models, animation and practically onboarding usage.
animations data models onboarding swift swiftui
Last synced: 31 Aug 2025
https://github.com/greatwoman23/car_insurance_analysis
The Car Insurance Analysis project aims to provide a comprehensive examination of a car insurance portfolio using advanced data analytics tools. The analysis offers valuable insights into policy demographics, claims patterns, and financial metrics, helping stakeholders make informed decisions.
bigquery data data-science dataanalytics insurance-claims looker-studio tableau
Last synced: 03 Feb 2026
https://github.com/team810/frcs
FRCS is an online international crowd sources data collection software written for the FRC Competitions. It was created by team 810, The Mechanical Bulls.
Last synced: 14 Mar 2025
https://github.com/srgchrksv/stream-crypto
Crypto trades streaming with azure services
azure binance crypto data databricks dataengineering pyspark python streaming websocket
Last synced: 30 Apr 2026
https://github.com/cmda-tt/course-25-26
🎓 tech track · 2025-2026 · curriculum and syllabus 📊
d3 data datavis functional javascript programming research svelte visualization
Last synced: 20 Jan 2026
https://github.com/anuppm9917/super-store-sales-analysis-power-bi-project
My drive to know which products, regions, categories and customer segments a company should target or avoid, I search and selected an appropriate dataset on kaggle which will match a standard superstore requirement.
data data-analysis data-visualization datacleansing excel exploratory-data-analysis jupyter-notebook numpy pandas plotly powerbi python3
Last synced: 10 Apr 2026
https://github.com/sidneyarcidiacono/data-parser
A node module designed to make reading in large files as easy as calling one function.
Last synced: 05 May 2026
https://github.com/haideratgh/sql-data-analytics-project
This repository contains a collection of SQL scripts demonstrating various analytical techniques, such as changes over time, cumulative, performance, data segmentation, part-to-whole analysis
analytics business-analytics business-intelligence data data-analysis data-analyst data-analytics data-engineering data-science data-scientist database datascience query reporting sql sql-query sql-server window-functions-in-sql
Last synced: 29 Jun 2025
https://github.com/equinor/fmu-sumo-uploader
Upload to Sumo in the FMU context
data fmu python subsurface sumo
Last synced: 06 May 2026
https://github.com/reshmaaiman/fifa
FIFA20
data data-science data-visualization dataanalysisusingpython github jupyter-notebook matplotlib numpy pandas python seaborn-python
Last synced: 10 Apr 2026
https://github.com/hit07/fitgpt-hacksc
AI-Powered Fitness Coach; 🥈 Runner up at HackSC's SoCal Tech Week hackathon
data elasticsearch gpt-4o-mini llm pipeline
Last synced: 28 Feb 2025
https://github.com/cityofnewyork/nyco-wp-open-data-transients
Interface for saving Open Data endpoints as WordPress Transients. Maintained by @NYCOpportunity
civic-tech composer data nycopportunity open-data plugin transients wordpress
Last synced: 10 Apr 2026
https://github.com/maluscat/reactive-storage
[MIRROR] Register, observe and intercept deeply reactive data without the need for proxies
data javascript reactive typescript
Last synced: 10 Mar 2026
https://github.com/mnkanout/patients_medication_prediction
The aim of the project is to create a model that can help medical professionals select the proper medication for patients based on their symptoms. The model uses historical data of other patients to predict what could be the most suitable medication based on the patient's symptoms.
data data-analysis data-science data-visualization decision-tree-classifier machine-learning python3
Last synced: 29 Jun 2025
https://github.com/sharoonjoseph321/insurance_fraud_detection
Fraud Detection using machine learning algorithm-KN Neighbors .Data exploration using Pyspark and matplotlib.
analytics data data-science eda high-performance knn-algorithm knn-classification machine-learning matplotlib-pyplot pyspark python seaborn spark statistics
Last synced: 23 Mar 2025
https://github.com/skysign/dat
데이터분석을 함께 공부하는 스터디입니다.
data data-analysis data-science
Last synced: 02 Jan 2026
https://github.com/ccworld1000/cccomposition
CCComposition for code style, Accept code style conversion business(接受code style转换业务)
cccomposition composit construction data structure visual
Last synced: 04 Jan 2026
https://github.com/andrii04/ga4-gcs-to-bigquery-etl
Automated Data Pipeline that ingests daily GA4-formatted CSV files from a private Google Cloud Storage bucket, validates and loads them into BigQuery, and prepares analysis-ready views. The solution is built for deployment as a Cloud Function triggered by Cloud Scheduler and uses Python with the Google Cloud Storage and BigQuery client libraries.
automation bigquery cloud cloudfunctions data data-analysis data-engineering etl etlpipeline gcp google googlecloudplatform pipeline python sql
Last synced: 18 May 2026
https://github.com/diordany/spicemill
Tool for plotting Ngspice simulation results with Pyplot.
analysis data electrical-engineering electronics frontend integrated-circuit integrated-circuits ngspice plot plotting post-processing pyplot python raw simulation spice
Last synced: 13 Jan 2026
https://github.com/karashiiro/lodestone-character-data-scraper
Lodestone character data scraper.
data ffxiv ffxiv-character lodestone
Last synced: 23 Apr 2026
https://github.com/checco9811/data-engineering-bootcamp-homework
Homework solutions for DataExpert.io data engineering bootcamp
apache-spark data data-engineering sql
Last synced: 14 Mar 2025
https://github.com/iamfrerot/userverse
creating api for data analysis
data data-analytics spring-boot users
Last synced: 23 Mar 2025
https://github.com/nicholas-owen/rdm-calendar
A small utility to manage conference and event information
calendar conference data event research
Last synced: 26 May 2026
https://github.com/open-geodata/sp_bh_pcj-2020-2035
Dados Espaciais da Agência das Bacias PCJ, com informações apresentadas no Plano de Bacias 2020-2035
Last synced: 16 Jan 2026
https://github.com/sanchittechnogeek/overscripted-analysis
Geolocation and user language extraction analysis from Mozilla Overscripted dataset
analysis data data-analysis mozilla
Last synced: 23 Mar 2025
https://github.com/pythoncoderunicorn/tool-discography
Music Band TOOL albums and songs dataset
data data-science metal-music music r songs
Last synced: 26 May 2026
https://github.com/gianlucatruda/titanic
An exhibition of my experience in data processing and visualisation. Python script to process and visualise the Titanic survivor data.
data database flask info matplotlib python science scrape server titanic visualisation web
Last synced: 10 Apr 2026
https://github.com/thicclatka/tetration
New file format for tensors
cli data fileformat mmap tensors
Last synced: 26 May 2026
https://github.com/tomcardoso/journalism-data-intersection
A talk on working at the intersection of journalism and data science
data data-journalism journalism
Last synced: 15 May 2025
https://github.com/wlgs/got-dialogues-data-stats
Game of Thrones dialogues data statistics processed with R and SQLite. Project for Probability and Statistics course 21/22 at AGH UST. The project was about manipulating data and getting many pieces of information from it in addition to visualizing these results.
data game-of-thrones got r statistics stats
Last synced: 22 May 2026
https://github.com/j-sephb-lt-n/plotly-dash-dashboard-template
A data dashboard template
dash data data-visualisation data-vizualization dataviz google-cloud google-cloud-platform plotly plotly-dash python responsive-design responsive-web-design
Last synced: 18 Jun 2025
https://github.com/reubano/devcraft-workshop
Materials for the DevCraft workshop on stream processing
data functional-programming meza python riko stream-processing tutorial
Last synced: 04 May 2026
https://github.com/agustinmusanti/sqlchallenge-7
Resolución de un extenso desafío de SQL propuesto por el profesor Diego Moisset De Espanes, quien comparte ejercicios para aprender y practicar SQL Server a través de su canal de YouTube.
challenge data learning sqlserver
Last synced: 15 Apr 2025
https://github.com/mnazlukhanyan/da-projects
Портфолио с работами по аналитике данных, показывающие мои навыки, умения и опыт
data data-vizualisation hypothesis-tests matplotlib pandas plotly postgresql product-metrics python scipy seaborn sql visualization
Last synced: 11 Apr 2026
https://github.com/oniani/miniframe
Minimal data frames with relational algebra
data dataframe-library haskell haskell-library library
Last synced: 04 Mar 2025
https://github.com/yash-rewalia/airbnb_eda_pandas
The goal of the project is to gather information and analyze the detailed information of the different entries in order to provide insights about the host and price of the property in a particular area as per your preference , type of rooms and number of reviews accordingly.
data data-cleaning data-insights data-preprocessing data-visualization matplotlib numpy pandas python seaborn
Last synced: 11 Apr 2026
https://github.com/illustratien/toolphd
Make your analysis simple and reproducible
academic analysis data phd publications r r-package reproducible-research scientific
Last synced: 26 Jan 2026
https://github.com/itrauco/data-dirtying-tool
a simple command line tool to generate dirty data and do common data things in google cloud
data data-analysis data-engineering data-ops data-pipeline data-science data-visualization data-wrangling dirty-data google-cloud machine-learning
Last synced: 24 Feb 2025
https://github.com/muhammadadilnaeem/student-performance-indicater-end-to-end-data-science-project
This project leverages data science techniques to build a predictive model that estimates a student's exam performance. The project follows a structured data science workflow, including data collection, preprocessing, model building, evaluation, and deployment.
data machine-learning-algorithms pandas pymysql python sql
Last synced: 11 Apr 2026
https://github.com/mierune/tinybufr
[WIP] A Rust library for decoding BUFR (Binary Universal Form for the Representation of meteorological data) files.
bufr data meteorology rust weather wmo
Last synced: 15 May 2025
https://github.com/inphyt/quantitative_single_neuron_modeling_competition_2007
Data for the Quantitative Single-Neuron Modeling Competition (2007).
bayesian-inference bayesian-methods bayesian-optimization bayesian-statistics challenge competition computational-neuroscience data electrophysiological-data electrophysiology model-calibration modeling neuronal-models neuroscience neuroscience-competition parameter-estimation simulation simulation-modeling single-neuron-model uncertainty-quantification
Last synced: 26 Jul 2025
https://github.com/sakshamarora07/whatsapp-chat-analyser
This repository contains code for a WhatsApp Chat Analyzer that uses Python libraries to extract insights from chat messages.
chat data dataanalytics datascience matplotlib pandas python seaborn statistics streamlit whatsapp
Last synced: 04 Jan 2026
https://github.com/jigyasag18/fake-news-prediction-app
The Fake News Prediction App Repository offers a machine learning project that focuses on identifying the authenticity of news articles as fake or real. It uses a dataset of 20,000 articles and employs methods such as TF-IDF vectorization and the Lemmatization algorithm, achieving ~95% classification accuracy with random forest classifier model
data datapreprocessing logistic-regression machine-learning machine-learning-algorithms numpy pandas prediction stemming streamlit streamlit-webapp vectorization
Last synced: 11 Apr 2026
https://github.com/sanogotech/open-source-data-stack
modern open source data stack
airbyte airflow data data-science dbt docker postgresql python
Last synced: 11 Apr 2026
https://github.com/climate-resource/input4mips_validation
Validation of input4MIPs data
cmip data forcing input4mips validation
Last synced: 20 Jan 2026
https://github.com/vidushibhadana/eda-on-nyc-taxi-data
About Conducting an Exploratory Data Analysis (EDA) on New York City taxi data and visualizing it through countplots, distribution plots (displot), and histograms using Python and it's libraries.
data data-visualization jupyter-notebook matplotlib numpy pandas python seaborn
Last synced: 11 Apr 2026
https://github.com/quantumudit/test-store-data-analysis
This repository showcases a web scraper with a pipeline structure for efficient data extraction and transformation from websites. The tool can be tailored to leverage its capabilities for insightful data analysis, providing valuable insights and informed decision-making.
data data-visualization dataanalytics python python-webscraping webscraper webscraping-data
Last synced: 11 Apr 2026
https://github.com/mnz1365/saving-record-time-text
date saving in text file with python
data python txt-files writefile
Last synced: 18 Jul 2025
https://github.com/guilyx/airplane-booking
Simple airline ticket reservation program.
Last synced: 25 Jun 2025
https://github.com/oliver021/helppad-net
Versatile .NET Toolkit: A Comprehensive Set of Miscellaneous Helpers, Classes, and Utilities
assert async checks cryptographic-algorithms data date dotnet fluent functional functional-programming hash helpers parallel pipe pipeline pointers review supports tasks
Last synced: 15 Jun 2026
https://github.com/taeefnajib/ibm-applied-data-science-capstone
This repository is for my IBM Applied Data Science Capstone Project. All the notebooks and other files are uploaded. If you are benefited by this repository by any means, please feel free to "Star" it and follow me. Thanks.
advance capstone capstone-project data data-science ibm ibm-watson jupyter jupyter-notebook notebook notebook-jupyter project science spacex spacex-api
Last synced: 14 Mar 2025
https://github.com/sushmashreeps/python
This repository showcases a comprehensive Python project, demonstrating expertise in backend development, data analysis, and machine learning. Built with Python 3.x, the project utilizes popular libraries like Django, Flask, NumPy, pandas, and scikit-learn. The project features efficient data processing, robust API integration, and scalable archite
api data data-science dataanalysis datavisualization game gamedeveloment python
Last synced: 12 May 2026
https://github.com/fuzzt/location-analyzer
The Location Data Analyzer is a Spring Boot application that offers insights on location data, such as counting locations by type, calculating average ratings, and identifying the most reviewed and incomplete entries. It features a simple frontend (HTML, CSS, JavaScript) and is deployed on Render.
analysis api average css data deployment docker fetch-api frontend html javascript location maven ratings render restful-api reviews spring-boot techstack
Last synced: 11 Apr 2026
https://github.com/nisanth2004/springboot-kafka-real-world-project-wikimedia
Creating a project about Wikimedia using Kafka involves building a system that leverages Apache Kafka for data streaming and processing related to Wikimedia data.
async broker communication data java kafka message real-time real-time-analytics springboot wikimedia
Last synced: 14 May 2026
https://github.com/martinius96/meteostanica-odosielacie-scripty
Meteostanica - Arduino, ESP8266, ESP32 - odosielanie sketche pre reprezentáciu dát vo webovom rozhraní.
arduino bme280 bmp280 data dht22 ds18b20 esp32 esp8266 espressif html meteo meteostanica mysel nodemcu php stanica teplota tlak vlhkost webstranka
Last synced: 11 Apr 2026
https://github.com/justinjjlee/simulation-discrete
Employing data transformations and simulations to answer random questions
analytics data data-science julia python simulation spark
Last synced: 30 Apr 2026
https://github.com/dug22/jjournal
A Jupyter like notebook software for Java
data data-analysis data-science java jshell jshell-repl notebook swing swing-application
Last synced: 11 Apr 2026
https://github.com/filiprokita/tobase64
This Python program encodes a file in base64 format and saves the result to a new file with a ".b64" extension. It is a command-line tool that can be used to automate file encoding tasks.
base64 command-line data data-conversion data-manipulation data-privacy data-prottection data-security encoding file file-conversion file-handling python python-script python3 tobase64
Last synced: 30 Jun 2025
https://github.com/neha-adnani/sql_music-store-analysis
SQL-based data analysis of a digital music store's sales and customer data.
business-analysis data data-analysis database follow-along-projects pgadmin4 portfolio-project postgres queries sql
Last synced: 18 Jun 2025
https://github.com/jhermsmeier/node-leybold-xps
Parse & write the Leybold XPS data format
analysis data esca format leybold parser photoelectron-spectroscopy spectroscopy x-ray xps xpspeak xray
Last synced: 09 Jul 2025
https://github.com/muhamedlabs/muhamed_onedrive
Muhamed_OneDrive - це надійне і зручне хмарне сховище для файлів, розроблене для безпечного зберігання і легкого обміну даними.
data html5 onedrive programming style
Last synced: 04 Jan 2026
https://github.com/diegoperea20/datos-secuenciales-con-ia
Realizacion de procesamiento de señales unidimensionales con modelos auto regresivos, convolución 1d, convolución 2d usando el espectrograma y redes recurrentes
ai artificial-intelligence convolutional-neural-networks data ia secuential-data spectrogram uao
Last synced: 06 Feb 2026
https://github.com/victorowinoke/custmer-segmentation-using-rfm-python-
Customer Segmentation using the Recency, Frequency and Monetary Values
customer-segmentation data data-visualization python3 science time-series-analysis
Last synced: 26 May 2026
https://github.com/apostolissiampanis/weather-app-api
WeatherApp is a Java-based console application that retrieves and processes weather data using the wttr.in web service.
api data hibernate java json lombok objected-orientated-programing oop spring-boot spring-data-jpa sqlite webflux
Last synced: 05 May 2026
https://github.com/zulfachafidz/titanic_explorer_predicting_survival_with_classification_using_knn_algorithm
Tracking Life Safety with the KNN Predictive Analysis Approach. Leveraging the Titanic Dataset, we apply classification analysis to predict the fate of passengers based on a variety of features.
algorithm algorithms data data-analysis data-mining data-science datamodeling datapreprocessing dataset knn-algorithm knn-classification machine-learning machine-learning-algorithms prediction-model
Last synced: 01 Sep 2025