data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-06-27 00:07:33 UTC
- JSON Representation
https://github.com/sanogotech/open-source-data-stack
modern open source data stack
airbyte airflow data data-science dbt docker postgresql python
Last synced: 11 Apr 2026
https://github.com/oliver021/helppad-net
Versatile .NET Toolkit: A Comprehensive Set of Miscellaneous Helpers, Classes, and Utilities
assert async checks cryptographic-algorithms data date dotnet fluent functional functional-programming hash helpers parallel pipe pipeline pointers review supports tasks
Last synced: 15 Jun 2026
https://github.com/nisanth2004/springboot-kafka-real-world-project-wikimedia
Creating a project about Wikimedia using Kafka involves building a system that leverages Apache Kafka for data streaming and processing related to Wikimedia data.
async broker communication data java kafka message real-time real-time-analytics springboot wikimedia
Last synced: 14 May 2026
https://github.com/dug22/jjournal
A Jupyter like notebook software for Java
data data-analysis data-science java jshell jshell-repl notebook swing swing-application
Last synced: 11 Apr 2026
https://github.com/diegoperea20/datos-secuenciales-con-ia
Realizacion de procesamiento de señales unidimensionales con modelos auto regresivos, convolución 1d, convolución 2d usando el espectrograma y redes recurrentes
ai artificial-intelligence convolutional-neural-networks data ia secuential-data spectrogram uao
Last synced: 06 Feb 2026
https://github.com/apostolissiampanis/weather-app-api
WeatherApp is a Java-based console application that retrieves and processes weather data using the wttr.in web service.
api data hibernate java json lombok objected-orientated-programing oop spring-boot spring-data-jpa sqlite webflux
Last synced: 05 May 2026
https://github.com/srgchrksv/datacamp-projects
Datacamps projects
analytics data data-science dataanalysis education jupyter-notebook learning pandas projects python sql
Last synced: 06 May 2026
https://github.com/sanad343/complete-data-analyst
Data analysis is the process of turning raw data into useful information for decision-making.
data data-visualization datamanipulation eda excel exploratory-data-analysis powerbi python-3 sql tableau
Last synced: 30 Jun 2025
https://github.com/csoren66/financial-budget-analysis
Financial budget for 2021
Last synced: 03 Mar 2025
https://github.com/kuanhungchen/spring-2019-data-structures
📦 Some programming assignments about basic data structures.
Last synced: 25 Feb 2025
https://github.com/abirsaha111/ipl-2022-analysis
The IPL 2022 Analysis project is a data-driven exploration of the Indian Premier League (IPL) 2022 cricket tournament. The analysis focuses on utilizing Python programming and various libraries to analyze and visualize the performance of teams, players, and key metrics in the IPL 2022 season.
data dataana dataanalytics datavi matplotlib python
Last synced: 07 Jun 2026
https://github.com/charon25/weatherdata
17 000 weather measurements collected by a weather station created for a college project.
csv data dataset datasets json measurements strasbourg weather weather-data
Last synced: 16 Jan 2026
https://github.com/jpcadena/palmer-penguins
Palmer Penguins
analytics csv data data-analytics data-science exploratory-data-analysis matplotlib numpy palmer-penguin pandas plotly pylint python seaborn visualization
Last synced: 11 Apr 2026
https://github.com/karo23361/toy-store-kpi-power-bi
PowerBI Portfolio Project
csv data data-visualization powerbi
Last synced: 03 Feb 2026
https://github.com/cracko298/planet-life-save-converter
Convert your Planet-Life Saves To and From Base64 & *.planet files.
base64 base64-decoding base64-encoding data python python-script python3 save-converter save-data save-files
Last synced: 15 Mar 2025
https://github.com/equinor/fmu-sumo
Interaction with Sumo in the FMU context
analytics data fmu python subsurface sumo visualization
Last synced: 01 May 2025
https://github.com/bastianolea/cut_comunas
Versión actualizada de los códigos únicos territoriales (CUT) de las comunas y regiones del país.
Last synced: 24 Jun 2026
https://github.com/eslamdyab21/data-visualization-using-matplotlib-and-seaborn
This is the last project in the nanodegree udacity program. it's about data visualization.
data data-analysis data-visualization matplotlib pandas python seaborn udacity udacity-data-analyst-nanodegree
Last synced: 09 May 2026
https://github.com/avestura/shell-dads
❓ Show a random tip from NIST DADS (https://xlinux.nist.gov/dads) every time you open your terminal
algorithms dads data data-structures ds nist
Last synced: 23 Oct 2025
https://github.com/dhimmel/adeptus
ADEPTUS -- differential gene expression signatures of disease
adeptus data differential-expression disease gene-expression genes rephetio
Last synced: 05 Jan 2026
https://github.com/r-mahesh45/india-news-headlines-analysis
Excited to share my latest project: India News Headlines Analysis (2001–2023). This Power BI report dives deep into 21 years of Indian headlines, uncovering: Trends that defined the nation, Key themes that shaped public discourse, Insights into the evolution of media coverage.
data data-science powerbi visualization
Last synced: 05 Jan 2026
https://github.com/dansalahi/query-builder-experiment
Customized Query Builder for creating Rules and Groups
data data-structures jsonlogic query-builder reactjs typescript validation
Last synced: 11 Apr 2026
https://github.com/roovedot/unet-cnn-for-road-segmentation
(In Progress) Unet architecture with CNNs (Convolutional Neural Networks) aimed at Road Segmentation
cnn cnn-for-visual-recognition cnn-pytorch computer-vision data data-engineering data-science unet unet-image-segmentation unet-pytorch
Last synced: 01 Jul 2025
https://github.com/sakan811/show-leaving-soon-tracker-website
This is a Vue.js application that displays shows that are leaving each platform soon, featuring a countdown timer for each title based on the user's local timezone.
data hbo hbomax netflix shows streaming tv-shows vue vuejs web webapp website
Last synced: 18 Mar 2025
https://github.com/michaelschoenburg/rapidfiretools-computerdatacollector-automation
Automation for RapidFire Tools Computer Data Collector.
automation collector computer data fire powershell powershell-script rapid rapidfire-tools tools
Last synced: 01 Jul 2025
https://github.com/jamiew/void-runners-analysis
basic data analysis for the Void Runners Genesis Fleet spaceships
Last synced: 29 Mar 2025
https://github.com/pranjaldhamane/social-media-sentiment-analysis
This project aims to analyze sentiment in Twitter data to understand attitudes towards specific topics or entities. It seeks to uncover positive and negative sentiment patterns, detect potential cyberbullying or hate speech, and provide insights into Twitter's overall sentiment landscape.
data dataanalysis logistic-regression nlp-machine-learning python sentiment-analysis twitter
Last synced: 18 Apr 2026
https://github.com/bertrand31/one-billion-rows-challenge
🌪️ Pushing Scala to its limits to aggregate a billion rows' worth of data in 2.42 seconds
competitive-programming competitive-programming-contests data data-engineering data-processing performance scala
Last synced: 05 Sep 2025
https://github.com/plnech/never2late
Never 2 Late - a reinterpretation of Everest Pipkin's 'i've never picked a protected flower'
dada dada-science data generative-art glitch-art installation nlp poetry spacy vector-similarity wallpaper
Last synced: 10 Jun 2025
https://github.com/bablukumarjha/startup-funding-revenue-analysis-by-sql-and-pandas
SQL project analyzing startup funding, revenue, and founder data to extract business insights using Python and MySQL.
data data-analysis data-platform data-science dataanalysisusingpython dataanalytics pandas-dataframe pandas-library python sql sql-server sqlalchemy sqldatabase
Last synced: 18 May 2026
https://github.com/amethyst-php/activity
Someone just did something, should we save who did this and when?
activity amethyst amethyst-package api data laravel
Last synced: 17 May 2026
https://github.com/shudhanshusaurabh001/super_market-data-analysis-using-python
This project focuses on analyzing supermarket sales data using Python. The goal is to extract meaningful insights from the dataset, such as sales trends, customer purchasing behavior, and product performance.
analysis csv data insights matplotlib numpy pandas project python seaborn
Last synced: 06 Apr 2026
https://github.com/turner-kendall/turner-kendall
Turner Kendall - dev, opps, sec.
config data github-config go rust security
Last synced: 31 Oct 2025
https://github.com/rahult18/atmo-flow
AtmoFlow is a robust data engineering pipeline built on Google Cloud Platform (GCP) that processes and analyzes weather and air quality data in both batch and streaming modes
airflow data data-modeling data-science data-visualization dataengineering gcp-bigquery gcp-cloud-composer gcp-cloud-functions pyspark
Last synced: 23 Jun 2026
https://github.com/smaug6739/data-bit
This project is a module for converting a structured dataset into a number that can be stored in a database taking up little space.
Last synced: 14 May 2026
https://github.com/jigyasag18/aircraft-data-management
This repository offers a comprehensive simulation of global military air deployments involving 10 countries, aircraft models, mission types, and strategic zones. It analyzes air power distribution, mission intent (offensive, defensive, support), and geopolitical positioning. The project provides structured insights into regional & zone level threat
aircraft-data aircraft-performance data data-analysis data-visualization database database-management dataset datavisualisation mysql powerbi powerbi-report powerbi-visuals sql
Last synced: 04 Feb 2026
https://github.com/ffatahillah7/snowflake-data-governance-warehouses
Welcome to the Powered by Tasty Bytes - Zero to Snowflake Quickstart focused on Data Governance! Within this Quickstart we will learn about Snowflake Roles, Role Based Access Control and deploy both Column and Row Level Security that can scale with your business.
data data-governance snowflake
Last synced: 06 Jan 2026
https://github.com/45harry/potato_disease_classification
Potato Disease Classification - Traning, Rest Api and FrontEnd to Test
cnn-classification data data-science datapreprocessing deep-learning fastapi flaskapi frontend keras restapi tensorflow
Last synced: 12 Apr 2026
https://github.com/pew-pew-team/hydrator
Hydrator kernel component
data deserializer dto hydrator kernel mapper mapping serializer structure
Last synced: 24 Mar 2025
https://github.com/jigyasag18/movie-recommendation-system-project
This repository features a personalized movie recommendation system that offers tailored suggestions to users. It leverages a dataset of 5,000 English-language films and utilizes data processing, feature engineering, and a cosine similarity algorithm to analyze user preferences. The system includes an intuitive user interface for easy navigation.
data datacleaning datapreprocessing machine-learning machine-learning-algorithms python streamlit streamlit-webapp
Last synced: 28 May 2026
https://github.com/eby8zevin/android-intent
Intent & Bundle - Android Studio
android android-development android-studio bundle data intent java xml
Last synced: 03 Sep 2025
https://github.com/aniruddha-biswas/shield-insurance-business-insights
Shield Insurance Business Insights
data data-visualization dataanalysis excel mysql powerbi sql
Last synced: 01 Apr 2025
https://github.com/yashaswitir28/yashaswitir28.github.io
This is my Portfolio Website
data data-analysis-python data-analyst data-cleaning data-science data-visualization excel html-css ms office365 portfolio-website powerbi python sql
Last synced: 29 May 2026
https://github.com/natanast/euroleaguebasketball
An R package providing data on Euroleague Basketball
Last synced: 01 Apr 2025
https://github.com/eudesgccunha/automated-management-panel
Automated management panel using Power BI
data data-analysis data-visualization database excel powerbi
Last synced: 04 Feb 2026
https://github.com/armand-sauzay/datasets
Datasets for machine learning
ai data datasets machine-learning ml
Last synced: 18 Jan 2026
https://github.com/johnelliott/wb-web
Moved —> https://github.com/johnelliott/waybot
arduino browser data iot raspberry-pi web
Last synced: 12 Apr 2026
https://github.com/lananolana/test_data_generator
Generate test data with Telegram bot in one click: random users, files, texts and credit cards.
credit-card data data-generation fake-data random telegram-bot test-data test-data-generator test-file-generator testing testing-tools text-generation user-generator
Last synced: 18 Jan 2026
https://github.com/lohithgsk/dynamic-qr-generator
A Python-based QR generator application was developed using the qrcode and Pillow libraries, dynamically generating QR codes for custom data inputs. Designed for a college grievance management system, the application creates QR codes containing block, floor, room, and machine numbers, allowing easy placement and identification on each floor.
data pillow python qrcode qrcode-generator
Last synced: 16 Mar 2025
https://github.com/zulfachafidz/telco_churn_insight_customer_loss_prediction_with_random_forest_and_decision_tree-algorithms
The main problem in the business world is customer churn, or losing customers, especially in the telecommunications industry, which experiences very tight competition. To overcome this problem, an analysis was carried out to help the company understand how many customers have the potential to switch providers.
data data-science data-visualization dataanalysis dataanalyst dataanalytics datadrivenwithdataprovider decision-tree decision-tree-classifier decision-trees random-forest random-forest-classifier
Last synced: 01 May 2026
https://github.com/thingston/extractor
Collection of PHP classes to extract data from HTML pages.
Last synced: 14 Jan 2026
https://github.com/purarue/scramble-history
parses rubiks cube scramble history/solve time from cstimer.net, cubers.io, twistytimer -- merges them together giving you uniform averages/data/graphs
cstimer cubing data rubiks-cube speedsolving
Last synced: 11 Jun 2025
https://github.com/prajakta1321/streetml-a-cityscape-traffic-volume-prognostication
StreetML leverages ML learning techniques to revolutionize urban traffic prediction through precise volume prognostication, aiming to enhance cityscape mobility through data-driven insights.
catboostregressor data datavisualisation exploratory-data-analysis lightgbm-regressor linearregression machine-learning machine-learning-algorithms predictive-analytics random-forest-regression xgboost-regression
Last synced: 08 Apr 2025
https://github.com/afnanenayet/kaggle-titanic
The classic Kaggle Titanic data science challenge
backprop backpropagation classification classifier data forest kaggle layer learn mlp multi numpy pandas perceptron random science scikit sklearn titanic
Last synced: 12 Apr 2026
https://github.com/powersyang/visualization
data visualization templates 数据可视化模板
Last synced: 24 Mar 2025
https://github.com/robson-python/academic-performance
Project to evaluate students' academic performance.
csv-import data data-analysis data-science jupyter-notebook machine-learning matplotlib pandas python scikit-learn seaborn vscode
Last synced: 12 Apr 2026
https://github.com/keminghe/osu
Unofficial and publicly-available NPM data-package about The Ohio State University.
college data majors ohio-state organizations public students university unofficial
Last synced: 06 Jan 2026
https://github.com/cosmos-loops/cosmos-data
Cosmos.Data is a inline project of COSMOS LOOPS PROGRAMME to provide several SQL-Query, RMDB/ORM and No-SQL components' extensions.
connection-pool data mysql mysqlconnector oracle postgresql sqlite sqlkata sqlserver transaction uow
Last synced: 12 Apr 2026
https://github.com/saikatharryc/motionchart-d3js
A dynamic Motion chart Built with D3 js.
Last synced: 23 Dec 2025
https://github.com/denisecase/620-mod6-web-scraping
Notes on how to get started scraping content from the web
beautifulsoup4 data mining python
Last synced: 11 Apr 2025
https://github.com/plandes/datdesc
Describe and optimize data
data hyperparameter-optimization hyperparameter-tuning latex table
Last synced: 04 Sep 2025
https://github.com/matheusafonseca/deploy-ml-models-with-streamlit-udemy
This repository is dedicated to storing the code developed during the "Machine Learning Model Deployment with Streamlit" course on Udemy. The course covers basic to advanced techniques for deploying machine learning models using Streamlit.
data data-science data-visualization interface joblib layout machine-learning optimization-algorithms python python3 sklearn sklearn-datasets sklearn-library sklearn-pipeline streamlit
Last synced: 19 Apr 2026
https://github.com/zcebeci/odetector
Outlier Detection Using Cluster Analysis
anomaly-detection cluster-analysis clustering clustering-methods data datapreparation datapreprocessing exception-handling fcm fraud-detection fuzzy-clustering novelty-detection outlier-detection outlier-removal outliers partitioning pcm r surprise-exploration
Last synced: 29 Oct 2025
https://github.com/code-str8/time-series-forecasting
Developing a model that effectively forecasts the unit sales of numerous items across various Favorita stores with precision.
data dataanalysis forcasting machine-learning time-series visualizations
Last synced: 31 Mar 2025
https://github.com/muhammed-fazal/student-success-and-early-intervention-analytics-system
To consolidate scattered student performance records into a unified Data Warehouse in SQL Server. Engineer an Interactive Power BI dashboards that visualize academic trends, identifying student performance and implement predictive analytics.
analysis analytics dashboard data data-analysis data-engineering data-science data-visualization database etl etl-pipeline power-bi powerbi python sql sql-server
Last synced: 29 May 2026
https://github.com/rikiitokazu/dataprojects
Data analysis practice using SQL and Python
Last synced: 12 Apr 2026
https://github.com/theanujsinha01/mcdonalds-customer-analysis
This project analyzes customer feedback data to understand what drives people to like or dislike McDonald’s. Using Python and data visualization tools in a Jupyter Notebook, we explore how different factors—such as taste, price, health, and visit frequency—affect customer satisfaction.
case-study data data-visualization dataanalysis
Last synced: 05 Sep 2025
https://github.com/afnanenayet/ds-a
Some interview prep I've been doing. This repo is reimplementations of algorithms and data structures in Python3
algorithms data interview prep python structures
Last synced: 05 Apr 2025
https://github.com/eng-gabrielscardoso/data-science-formation
Data science course walkthrough
data data-science data-visualisation google-colab google-colaboratory google-colaboratory-notebooks python r r-lang
Last synced: 28 Feb 2025
https://github.com/bcongdon/nid-data
National Inventory of Dams Data
data datasette government-data
Last synced: 21 Apr 2026
https://github.com/lexiortiz/advanced-data-analytics
Structured learning notes, code snippets, and key takeaways from the Google Advanced Data Analytics Professional Certificate. Serves as a personal reference for reinforcing concepts and as a resource for others on a similar learning journey.
data data-analysis data-engineering google python-3 sql
Last synced: 29 May 2026
https://github.com/ayush-raj8/godata
Write data to file. Standardizes the format for easy parsing and read by other programs.
Last synced: 18 Jan 2026
https://github.com/codegouvfr/codegouvfr-sources
🧢 Static web frontend for code.gouv.fr
bluehats codegouvfr data frontend
Last synced: 28 Feb 2025
https://github.com/raghavendranhp/youtube_data_harvesting
The "YouTube Data Analyzer" is a versatile tool for businesses and content creators, enabling them to gather, analyze, and harness valuable insights from multiple YouTube channels. With streamlined data collection, storage in MongoDB, migration to SQL, and a user-friendly Streamlit interface, it empowers users to make data-driven decisions
apiintegration data datacollection eda googleapi googleapiclient matplotlib mongodb mysql mysqlconnector numpy oops pandas pymongo python pythonoops sql sqlalchemy streamlit youtube-api
Last synced: 13 Apr 2026
https://github.com/sakshamarora07/blinkit-sales-report-power-bi
This dashboard provides Blinkit with insights to optimize its grocery delivery operations and understand customer preferences. It evaluates sales trends, outlet performance, and item categories to identify key areas for improvement. The interactive visuals allow detailed exploration of sales distribution, customer ratings, and product popularity.
data data-science dataanalytics datavisualization excel powerbi sql
Last synced: 08 Jan 2026
https://github.com/living-with-machines/zoonyper
Code to make it easy to import and process Zooniverse annotations and their metadata in Python/Jupyter Notebooks
crowdsourcing data data-processing data-science python zooniverse
Last synced: 04 Jul 2025
https://github.com/mukhlishga/data-engineering
all about data engineering
airflow beam data data-engineering pyspark python
Last synced: 13 Apr 2026
https://github.com/luminati-io/ZoomInfo-dataset-samples
A sample dataset of over 1000 ZoomInfo companies, extracted using the Bright Data API, ideal for market growth, lead generation, and market analysis.
b2b business companies data data-extraction database dataset datasets web-scraping zoominfo
Last synced: 09 Apr 2025
https://github.com/desoga10/nety-form
In this tutorial, I show you how to send data from a form to the Netlify dashboard. I also show you how to create a form using Materialize.
contact-form css css3 data form forms html html5 materialize materialize-css materializecss-framework netlify
Last synced: 03 Jan 2026
https://github.com/deliprofesor/virtual-reality-in-education-impact-analysis-and-insights
This project examines the impact of Virtual Reality (VR) on education, focusing on its effects on student engagement, learning outcomes, and creativity. It uses data analysis techniques like descriptive statistics, correlation analysis, and clustering to assess VR's effectiveness in enhancing learning.
clustering data data-analysis data-science data-visualization exploratory-data-analysis hypothesis-testing machine-learning python regression-analysis virtual-reality
Last synced: 14 Jun 2025
https://github.com/idhruvs/angular4-smart-table-demo
Angular4 Smart Table Demo Project
angular4 data tables typescript
Last synced: 21 Apr 2026
https://github.com/blackroad-os-inc/blackroad-portal
BlackRoad Portal — unified search routing to 30+ BlackRoad services.
blackroad cloudflare-workers data search
Last synced: 04 Apr 2026