data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-07-03 00:07:49 UTC
- JSON Representation
https://github.com/prajakta1321/streetml-a-cityscape-traffic-volume-prognostication
StreetML leverages ML learning techniques to revolutionize urban traffic prediction through precise volume prognostication, aiming to enhance cityscape mobility through data-driven insights.
catboostregressor data datavisualisation exploratory-data-analysis lightgbm-regressor linearregression machine-learning machine-learning-algorithms predictive-analytics random-forest-regression xgboost-regression
Last synced: 08 Apr 2025
https://github.com/adadalshabab/data-engineering-gcp-project
An end-to-end modern data engineering project, including deployment of ETL pipeline on Google Cloud Platform, using BigQuery for data analysis and leveraging Looker to generate an insight dashboard.
bigquery data data-science data-visualization databases dataengineering-a engineering etl-pipeline looker-studio powerbi
Last synced: 19 Jan 2026
https://github.com/tyriek-cloud/nyc-dca-etl
Created an ETL pipeline to merge two CSV files (converted to JSON) into a parquet file using Azure Data Factory, The data was extracted from NYC Open Data: https://opendata.cityofnewyork.us/ and I created a Blob Container within an existing storage account.
azure azure-data-factory blob-storage data data-engineering etl-pipeline
Last synced: 21 Jan 2026
https://github.com/shubhamsoni98/project_using_knn
This project applies the K-Nearest Neighbors (KNN) algorithm to predict iPhone purchases based on customer data. Using features like age, salary, and previous purchase behavior, the KNN model classifies customers into buyers and non-buyers.
anaconda analytics data data-science eda knn knn-classification machine-learning-algorithms predict project python scikit-learn tableau
Last synced: 03 Jan 2026
https://github.com/johnelliott/wb-web
Moved —> https://github.com/johnelliott/waybot
arduino browser data iot raspberry-pi web
Last synced: 12 Apr 2026
https://github.com/anburocky3/cbse-schools-data
Fetch CBSE Schools in seconds and use it for your data projects
cbse data data-analysis data-science grabber nextjs
Last synced: 24 Jun 2026
https://github.com/sorairolake/japanese-era-dataset
日本の元号のデータセット / Dataset of the Japanese era
data dataset date japanese-calendar japanese-era json toml wareki yaml
Last synced: 01 May 2026
https://github.com/pchaparro/search-engine
Full stack search-engine created from youtube videos obtained using "web-scraping"
data opensearch python python3 react scraper scraping scraping-websites search search-engine semantic-search sentence-transformers typescript website
Last synced: 17 Apr 2026
https://github.com/dhruvil-26/powerbi-projects
This repository contains Power BI projects showcasing data analysis and interactive dashboards. Each project includes detailed visualizations and insights on diverse topics such as loan analysis, sales performance, and customer behavior.
customer-behavior-analysis data data-analysis interactive-dashboards loan-analysis powerbi sales-performance visualization
Last synced: 04 Feb 2026
https://github.com/eudesgccunha/automated-management-panel
Automated management panel using Power BI
data data-analysis data-visualization database excel powerbi
Last synced: 04 Feb 2026
https://github.com/odiegosilva1/flask-github-style
Página de login usando Jinja no Flask.
data flask jinja2-templates orm python
Last synced: 31 May 2026
https://github.com/polyee99/kaggle-titanic-data-analytics
Jupiter notebook to predict the outcome of passengers who died or not in the tragical Titanic event.
data eda jupiter-notebook matplotlib numpy pandas python regression-analysis test-train-split visualization
Last synced: 05 Feb 2026
https://github.com/isandyawan/simplelinearregression
A application to analyze data using simple linear regression. This application can make regression model from variable and give advice to user if the model break regression assumsion
data linear r regression rstudio shiny statistic
Last synced: 14 Oct 2025
https://github.com/poojaharihar03/wellness-cities-case-study
A case study for dats analysis of city health centers
Last synced: 11 Jun 2026
https://github.com/darkogamerz/dhis2heat
A Comprehensive data management and Health Equity Assessment and Analysis platform that fetches data from DHIS2, optimize, calculate, clean and visualize inequality data.
analytics data data-science dhis2 equality equity health heat inequality r shiny shinydashboard visualization
Last synced: 01 Apr 2025
https://github.com/giuleo129/dataanalysis
This folder contains two projects focused on data analysis and statistical learning using R, covering exploratory data analysis, modeling, and predictive techniques.
data data-analysis data-science statistical-learning
Last synced: 25 Jan 2026
https://github.com/infinitode/pyautoplot
PyAutoPlot is an open-source Python library designed to make dataset analysis much easier by generating helpful detailed plots using matplotlib. It automatically generates appropriate plots based on the dataset you feed it.
analysis automatic csv data dataset dataset-analysis generation matplotlib pandas plots plotting-in-python plotting-library python
Last synced: 16 Mar 2025
https://github.com/gcoronelc/cepsuni-disbd-64505
Taller de Modelamiento de de Base de Datos con Gustavo Coronel
data database databases db2 db2-database modeling oracle oracle-database relational-database relational-database-design relational-databases relationships sql sql-server
Last synced: 02 May 2026
https://github.com/prakhargpt/sql-data-warehouse-project
Building Data Warehouse project using SQL Server, including ETL processes, data modelling and analytics.
analytics data data-analysis data-cleaning data-engineering data-engineering-pipeline data-lakehouse data-science data-warehouse etl etl-job etl-pipeline medallion-architecture sql sql-server
Last synced: 12 Jun 2026
https://github.com/pew-pew-team/hydrator
Hydrator kernel component
data deserializer dto hydrator kernel mapper mapping serializer structure
Last synced: 24 Mar 2025
https://github.com/petzi53/repair
R Datasets of the Open Repair Alliance (ORA).
Last synced: 19 May 2026
https://github.com/rbreeze/dashboard
My personal health dashboard, with daily stats on food and sleep. Undergone several redesigns since 2015.
css dashboard data data-visualization design front-end google-sheets google-sheets-api health html javascript personal-health-record personal-website running static static-site visualization
Last synced: 02 May 2026
https://github.com/ragibasif/bobdylan
Bob Dylan
bob-dylan csv data data-science data-visualization lyrics music python
Last synced: 03 Sep 2025
https://github.com/huseyincenik/power_bi
PowerBI Files
dashboard data data-science data-structures data-visualization dataset datavisualization dataworld microsoft-power-bi powerbi powerbi-custom-visuals powerbi-dashboards powerbi-desktop powerbi-reports powerbi-visuals powerbidashboard powerquery querying user-interface workout-wednesday
Last synced: 05 Jan 2026
https://github.com/vuurvos1/functional-programming
HVA functional-programming
data formatting functional nodejs programming
Last synced: 03 Oct 2025
https://github.com/smeltier/data-structures-c
This repository contains C language implementations of the main data structures covered in the Algorithms and Data Structures course. The implementations were developed as part of my hands-on learning process and include sequential lists, linked lists, and other fundamental structures.
algorithms algorithms-and-data-structures c c-language c-programming data data-structures data-structures-c structures-c
Last synced: 16 May 2025
https://github.com/suryadev99/stream_processing_website_click_data
Stream Processing of website click data using Kafka and monitored and visualised using Prometheus and Grafana
clickdata data dataengineering docker flink-kafka flink-metrics flink-stream-processing git grafana kafka kafka-streams kafka-topic prometheus psql python
Last synced: 10 Mar 2026
https://github.com/2022-04-11588/data-fakes
🔍 Generate realistic fake data for testing and development, enhancing your projects with simple, customizable data solutions.
data dataset developer-tools fake-content faker fakery groovy java mock phoenix python random ruby seeding struct swift-framework test-data testing
Last synced: 11 Apr 2026
https://github.com/ashu3291/blinkit-app-store-
conducted a comprehensive analysis of Blinkit's sales performance, customer satisfaction and inventory distribution to improve the sales performance.
cleaning-data data dataanalysis-projects powerbi-visuals powerbidashboard sql
Last synced: 05 Jan 2026
https://github.com/nel-zi/nuga_bank
Developed an automated data exploration and cleaning pipeline for Nuga Bank to streamline data preparation, ensure consistent data quality, and normalize datasets into structured databases for efficient analysis and reporting.
data data-automation data-visualization datacleaning datatransformation etl-automation etl-pipeline
Last synced: 16 May 2025
https://github.com/vlamug/ratibor
Ratibor is a service for making metrics from data
Last synced: 10 Mar 2026
https://github.com/roovedot/unet-cnn-for-road-segmentation
(In Progress) Unet architecture with CNNs (Convolutional Neural Networks) aimed at Road Segmentation
cnn cnn-for-visual-recognition cnn-pytorch computer-vision data data-engineering data-science unet unet-image-segmentation unet-pytorch
Last synced: 01 Jul 2025
https://github.com/sakan811/show-leaving-soon-tracker-website
This is a Vue.js application that displays shows that are leaving each platform soon, featuring a countdown timer for each title based on the user's local timezone.
data hbo hbomax netflix shows streaming tv-shows vue vuejs web webapp website
Last synced: 18 Mar 2025
https://github.com/vaxdata22/foresight-institution
This is a Data Analysis case study done on the Foresight Institution dataset.
actionable-insights business-analytics business-intelligence data data-analytics data-cleaning data-mining data-processing data-visualization data-wrangling exploratory-data-analysis spreadsheets sql sql-server sql-server-management-studio statistical-analysis t-sql transact-sql
Last synced: 28 May 2026
https://github.com/jamiew/void-runners-analysis
basic data analysis for the Void Runners Genesis Fleet spaceships
Last synced: 29 Mar 2025
https://github.com/satyam4229/iit-and-nit-college-dataset
The dataset for IITs and NITs typically includes information related to these premier engineering institutions in India, such as their names, locations, rankings, academic programs offered, faculty details, student information, admission process, infrastructure and facilities, placements.
college-data csv data excel iit nit
Last synced: 04 Jan 2026
https://github.com/apostolissiampanis/weather-app-api
WeatherApp is a Java-based console application that retrieves and processes weather data using the wttr.in web service.
api data hibernate java json lombok objected-orientated-programing oop spring-boot spring-data-jpa sqlite webflux
Last synced: 05 May 2026
https://github.com/victorowinoke/custmer-segmentation-using-rfm-python-
Customer Segmentation using the Recency, Frequency and Monetary Values
customer-segmentation data data-visualization python3 science time-series-analysis
Last synced: 26 May 2026
https://github.com/diegoperea20/datos-secuenciales-con-ia
Realizacion de procesamiento de señales unidimensionales con modelos auto regresivos, convolución 1d, convolución 2d usando el espectrograma y redes recurrentes
ai artificial-intelligence convolutional-neural-networks data ia secuential-data spectrogram uao
Last synced: 06 Feb 2026
https://github.com/muhamedlabs/muhamed_onedrive
Muhamed_OneDrive - це надійне і зручне хмарне сховище для файлів, розроблене для безпечного зберігання і легкого обміну даними.
data html5 onedrive programming style
Last synced: 04 Jan 2026
https://github.com/pranjaldhamane/social-media-sentiment-analysis
This project aims to analyze sentiment in Twitter data to understand attitudes towards specific topics or entities. It seeks to uncover positive and negative sentiment patterns, detect potential cyberbullying or hate speech, and provide insights into Twitter's overall sentiment landscape.
data dataanalysis logistic-regression nlp-machine-learning python sentiment-analysis twitter
Last synced: 18 Apr 2026
https://github.com/neha-adnani/sql_music-store-analysis
SQL-based data analysis of a digital music store's sales and customer data.
business-analysis data data-analysis database follow-along-projects pgadmin4 portfolio-project postgres queries sql
Last synced: 18 Jun 2025
https://github.com/plateformeio/docs
The official documentation of the Plateforme framework
api app asgi async data db docs fastapi plateforme pydantic python restx services sqlalchemy
Last synced: 11 Apr 2026
https://github.com/rubyonworld/ldpath
This is a ruby implementation of LDPath, a language for selecting values linked data resources.
Last synced: 12 Nov 2025
https://github.com/bertrand31/one-billion-rows-challenge
🌪️ Pushing Scala to its limits to aggregate a billion rows' worth of data in 2.42 seconds
competitive-programming competitive-programming-contests data data-engineering data-processing performance scala
Last synced: 05 Sep 2025
https://github.com/dug22/jjournal
A Jupyter like notebook software for Java
data data-analysis data-science java jshell jshell-repl notebook swing swing-application
Last synced: 11 Apr 2026
https://github.com/bablukumarjha/startup-funding-revenue-analysis-by-sql-and-pandas
SQL project analyzing startup funding, revenue, and founder data to extract business insights using Python and MySQL.
data data-analysis data-platform data-science dataanalysisusingpython dataanalytics pandas-dataframe pandas-library python sql sql-server sqlalchemy sqldatabase
Last synced: 18 May 2026
https://github.com/martinius96/meteostanica-odosielacie-scripty
Meteostanica - Arduino, ESP8266, ESP32 - odosielanie sketche pre reprezentáciu dát vo webovom rozhraní.
arduino bme280 bmp280 data dht22 ds18b20 esp32 esp8266 espressif html meteo meteostanica mysel nodemcu php stanica teplota tlak vlhkost webstranka
Last synced: 11 Apr 2026
https://github.com/mikeqfu/network-rail-track-fixity-layer
This project develops a data mining tool for analysing and predicting track movements using asset data, environmental factors and track design knowledge to model key parameters and generate fixity values for the GB rail network.
data data-integration data-mining data-science information-management knowledge-discovery point-cloud rail rail-alignment rail-track track-fixity
Last synced: 02 Sep 2025
https://github.com/bdr-pro/graphyml
A powerful, interactive Streamlit application to explore, edit, visualize, and query a graph-based database of YAML nodes — ideal for movie metadata, research articles, or structured knowledge graphs.
Last synced: 23 Jul 2025
https://github.com/amethyst-php/activity
Someone just did something, should we save who did this and when?
activity amethyst amethyst-package api data laravel
Last synced: 17 May 2026
https://github.com/shudhanshusaurabh001/super_market-data-analysis-using-python
This project focuses on analyzing supermarket sales data using Python. The goal is to extract meaningful insights from the dataset, such as sales trends, customer purchasing behavior, and product performance.
analysis csv data insights matplotlib numpy pandas project python seaborn
Last synced: 06 Apr 2026
https://github.com/jameshenderson12/chatbot-utils
Generic data and elements that can be reused or repurposed for chatbot development.
boilerplate chatbot data development elements intents template utterances
Last synced: 04 Mar 2026
https://github.com/oliver021/helppad-net
Versatile .NET Toolkit: A Comprehensive Set of Miscellaneous Helpers, Classes, and Utilities
assert async checks cryptographic-algorithms data date dotnet fluent functional functional-programming hash helpers parallel pipe pipeline pointers review supports tasks
Last synced: 15 Jun 2026
https://github.com/bscript07/softuni-javascript-applications
Javascript for Applications course at SoftUni -Oct 2023
architecture-component authentication client-side-rendering-seo data lit-html-template routing
Last synced: 15 Mar 2025
https://github.com/awpala/udemy-my-courses-data-parser
Download Udemy lists and courses metadata for authenticated student user
Last synced: 07 May 2026
https://github.com/faster-games/dynamic-components
Dynamic Runtime Components for Unity3D
Last synced: 11 Apr 2026
https://github.com/vidushibhadana/eda-on-nyc-taxi-data
About Conducting an Exploratory Data Analysis (EDA) on New York City taxi data and visualizing it through countplots, distribution plots (displot), and histograms using Python and it's libraries.
data data-visualization jupyter-notebook matplotlib numpy pandas python seaborn
Last synced: 11 Apr 2026
https://github.com/mierune/tinybufr
[WIP] A Rust library for decoding BUFR (Binary Universal Form for the Representation of meteorological data) files.
bufr data meteorology rust weather wmo
Last synced: 15 May 2025
https://github.com/yash-rewalia/airbnb_eda_pandas
The goal of the project is to gather information and analyze the detailed information of the different entries in order to provide insights about the host and price of the property in a particular area as per your preference , type of rooms and number of reviews accordingly.
data data-cleaning data-insights data-preprocessing data-visualization matplotlib numpy pandas python seaborn
Last synced: 11 Apr 2026
https://github.com/oniani/miniframe
Minimal data frames with relational algebra
data dataframe-library haskell haskell-library library
Last synced: 04 Mar 2025
https://github.com/xp-forge/marshalling
Marshalling
data marshalling object-mapping xp-framework
Last synced: 02 Jul 2025
https://github.com/j-sephb-lt-n/plotly-dash-dashboard-template
A data dashboard template
dash data data-visualisation data-vizualization dataviz google-cloud google-cloud-platform plotly plotly-dash python responsive-design responsive-web-design
Last synced: 18 Jun 2025
https://github.com/pythoncoderunicorn/tool-discography
Music Band TOOL albums and songs dataset
data data-science metal-music music r songs
Last synced: 26 May 2026
https://github.com/mumtaz4118/nlp-course
Programming Assignments and Lectures for Stanford's CS 224: Natural Language Processing with Deep Learning
course data data-analysis data-analytics data-science data-visualization deep-learning education machine-learning natural-language-processing neural-network transfer-learning
Last synced: 24 Nov 2025
https://github.com/jigyasag18/movie-recommendation-system-project
This repository features a personalized movie recommendation system that offers tailored suggestions to users. It leverages a dataset of 5,000 English-language films and utilizes data processing, feature engineering, and a cosine similarity algorithm to analyze user preferences. The system includes an intuitive user interface for easy navigation.
data datacleaning datapreprocessing machine-learning machine-learning-algorithms python streamlit streamlit-webapp
Last synced: 28 May 2026
https://github.com/ot-code/sql-sabor-y-tradicion
A SQL-driven project that integrates menu and order data to reveal insights on dish performance, customer preferences, and spending trends. It informs pricing strategies, menu adjustments, and targeted promotions, ultimately enhancing the overall customer experience and driving business growth.
analytical-queries data data-aggregation data-analysis database-design join-queries mysql order-analytics relational-databases restaurant-data sql sql-script
Last synced: 08 Apr 2025
https://github.com/sanchittechnogeek/overscripted-analysis
Geolocation and user language extraction analysis from Mozilla Overscripted dataset
analysis data data-analysis mozilla
Last synced: 23 Mar 2025
https://github.com/checco9811/data-engineering-bootcamp-homework
Homework solutions for DataExpert.io data engineering bootcamp
apache-spark data data-engineering sql
Last synced: 14 Mar 2025
https://github.com/petzi53/repairdata
Open Repair Alliance Datasets 2021
data open-data open-datasets r repair repair-cafe repairs
Last synced: 22 Jun 2026
https://github.com/natanast/euroleaguebasketball
An R package providing data on Euroleague Basketball
Last synced: 01 Apr 2025
https://github.com/suchi25sathavara/data-wrangling-with-r
Analyzing Road Accidents in Victoria, Australia
data r reporting rstudio wrangling-data
Last synced: 01 Apr 2025
https://github.com/suchi25sathavara/r-projects
R projects in Real world Scenerios for Data Analysis
data data-analysis datavisualization r
Last synced: 01 Apr 2025
https://github.com/wraith13/systematic-metasyntactic-variables
This is a list for that you can express the existence of different serieses when using metasyntax variables.
Last synced: 14 Jun 2025
https://github.com/h-sutiwas/r2de-2025
This repository is related to the Road To Data Engineer Bootcamp by DataTH. It contains all related coursework, some mini projects and other resources within the field of Data Engineering.
data data-engineering data-visualization docker gcp pipeline spark
Last synced: 30 Apr 2026
https://github.com/trollmii/bunnybase
An efficient data managing system
bunnybase data data-science data-structures database datascience python python3
Last synced: 22 Apr 2025
https://github.com/inist-cnrs/ws-data
Modèles et données pour les web services
Last synced: 03 Sep 2025
https://github.com/sushmashreeps/data-science-with-python
This repository showcases a comprehensive data science project utilizing Python, demonstrating expertise in data analysis, visualization, and machine learning. Built with Python 3.x, the project leverages popular libraries like Pandas, NumPy, Matplotlib, Seaborn, Scikit-learn, and TensorFlow. The project features data preprocessing, feature engine
cnn data dataanalysis datascience keras linear-regression matplotlib python python3 regression rnn visualization
Last synced: 14 Apr 2026
https://github.com/lananolana/test_data_generator
Generate test data with Telegram bot in one click: random users, files, texts and credit cards.
credit-card data data-generation fake-data random telegram-bot test-data test-data-generator test-file-generator testing testing-tools text-generation user-generator
Last synced: 18 Jan 2026
https://github.com/alextanhongpin/node-github-api
:page_with_curl: sample github api queries with nodejs for scraping purposes
Last synced: 06 May 2026
https://github.com/ompreetham/data-structures
binary-search-tree c data data-structures datastructures graph linked-list list stack structures tree
Last synced: 25 Mar 2025
https://github.com/steveanik/kestra
Infinitely scalable, event-driven, language-agnostic orchestration and scheduling platform to manage millions of workflows declaratively in code.
data data-engineering data-integration data-pipeline data-quality elt etl low-code orchestration pipelines scheduler workflow workflow-engine
Last synced: 06 Jan 2026
https://github.com/ccworld1000/cccomposition
CCComposition for code style, Accept code style conversion business(接受code style转换业务)
cccomposition composit construction data structure visual
Last synced: 04 Jan 2026