data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-07-01 00:07:35 UTC
- JSON Representation
https://github.com/jszafran/personal-aws-data-lake
Personal, cloud based (AWS), data lake for experimenting with cloud services.
aws cloud data data-engineering dataengineering datalake etl terraform
Last synced: 20 May 2026
https://github.com/apostolissiampanis/weather-app-api
WeatherApp is a Java-based console application that retrieves and processes weather data using the wttr.in web service.
api data hibernate java json lombok objected-orientated-programing oop spring-boot spring-data-jpa sqlite webflux
Last synced: 05 May 2026
https://github.com/byndyusoft/byndyusoft.data.relational.specifications
byndyusoft data relational specifications
Last synced: 12 Sep 2025
https://github.com/deva-246/excel-power-query-data-cleaning-dashboard
dashboard data datacleaning excel pivottable powerquery slicer
Last synced: 22 Mar 2025
https://github.com/ahadly/sql-data-analytics-project
This repository contains a collection of SQL scripts demonstrating various analytical techniques, such as changes over time, cumulative, performance, data segmentation, part-to-whole analysis.
analytics business-analytics business-intelligence data data-analysis data-analyst data-analytics data-engineering data-science data-scientist database datascience query reporting sql sql-queries sql-query sql-server window-functions window-functions-in-sql
Last synced: 18 May 2026
https://github.com/boratechlife/tensorflow-questions-datasets
A Tensorflow questions Datasets to help you practice Machine learning and Train Models
data datapreprocessing datasets machinelearning modeltrain questions tensorflow
Last synced: 23 Mar 2025
https://github.com/ember-nexus/reference-dataset
Ember Nexus API backup containing different standardized scenarios
Last synced: 25 Jan 2026
https://github.com/yugoff/ml-kaggle-regression-with-a-mohs-hardness-dataset
Your Goal: For this Episode of the Series, your task is to use regression to predict the Mohs hardness of a mineral, given its properties
data gradient-boosting kaggle kaggle-competition regression-models
Last synced: 18 May 2026
https://github.com/mkshah605/personal-brand-development
A data-driven approach to a personal brand development project.
branding data data-science growth music personal
Last synced: 12 Sep 2025
https://github.com/kinshukjainn/dclue-v1
Dsainone is a highly optimized Data Structures and Algorithms (DSA) library designed to provide efficient implementations of graph algorithms, trees, hashing, and linked lists while maintaining exceptional memory efficiency. The library is designed to be as fast and optimized as possible
Last synced: 20 May 2026
https://github.com/kammarah/studentdata
I created & deployed a Streamlit app to store, manage & analyze student data. ππ
connection data data-analysis data-visualization deploy deployments libraries python streamlit streamlit-webapp webapp
Last synced: 18 May 2026
https://github.com/piyushkumar2025/analytical-sql-project-exploring-trends-segmentation-kpis
A complete SQL analytics project using a simulated data warehouse. It analyzes sales, customer, and product data with CTEs, joins, window functions, subqueries, and views to deliver insights on trends, segmentation, and KPIs, showing how SQL enables data-driven decisions without BI tools.
advanced-sql analytics business-intelligence data data-science-projects datascience joins kpi mysql query sql window-functions-in-sql
Last synced: 02 Jul 2025
https://github.com/zulfachafidz/titanic_explorer_predicting_survival_with_classification_using_knn_algorithm
Tracking Life Safety with the KNN Predictive Analysis Approach. Leveraging the Titanic Dataset, we apply classification analysis to predict the fate of passengers based on a variety of features.
algorithm algorithms data data-analysis data-mining data-science datamodeling datapreprocessing dataset knn-algorithm knn-classification machine-learning machine-learning-algorithms prediction-model
Last synced: 01 Sep 2025
https://github.com/roovedot/unet-cnn-for-road-segmentation
(In Progress) Unet architecture with CNNs (Convolutional Neural Networks) aimed at Road Segmentation
cnn cnn-for-visual-recognition cnn-pytorch computer-vision data data-engineering data-science unet unet-image-segmentation unet-pytorch
Last synced: 01 Jul 2025
https://github.com/annaanastasy/mushroom-binary-classification-eda-ml
Explored and modeled a competition dataset of mushroom species, focusing on data cleaning, exploratory data analysis, and building machine learning models for accurate classification of edible and poisonous mushrooms.
binary-classification data data-cleaning-and-preprocessing data-science exploratory-data-analysis machine-learning-algorithms xgboost-classifier
Last synced: 29 Mar 2025
https://github.com/encelo/nctracer-data
Data files for the ncTracer project
Last synced: 15 Jan 2026
https://github.com/pratik-codes/zomato_data_eda
Cleaned, analysed messy data and created a predictive model with and accuracy of 93% with tree Regressor algorithm
bengaluru data data-cleaning data-science famous-restaurants restaurants-delivering-online restraunts
Last synced: 27 Mar 2025
https://github.com/shvbsle/image-augmentation
A light weight CLI for augmenting image datasets for deep learning and ML projects
augmentation data data-augmentation data-augmentation-strategies data-augmentor data-augumentation data-science dataset deep-learning image-processing
Last synced: 12 Sep 2025
https://github.com/newrelic-experimental/newrelic-java-apache-sling
Provides Java instrumentation for Apache Sling framework
apache-sling data instrumentation java nrlabs nrlabs-data nrlabs-java-verify observability-data sling
Last synced: 30 May 2026
https://github.com/simranjeet97/covid-19
Covid-19 Data Analysis and Important Topics to be Covered to get the Impact and Solution.
coronavirus coronavirus-analysis coronavirus-dataset coronavirus-prediction coronavirus-tracking covid-19-data-analysis covid19 covid19-data covid19-india dash dash-app dash-plotly data data-analysis data-science data-science-projects data-visualization python3
Last synced: 18 May 2026
https://github.com/gappeah/layoffs-exploratory-data-analysis
This project uses MySQL to perform data cleaning and exploratory data analysis (EDA) on a dataset detailing company layoffs. The primary goal is to process, clean, and explore the data to gain insights into trends and patterns related to layoffs across various sectors.
data dataanalysis eda mysql sql
Last synced: 12 Jul 2025
https://github.com/ngupta23/data_prep_helper
A helper package for preparing and combining data from a variety of sources
data data-science dataprep datapreparation dataprocessing helpers python
Last synced: 03 Apr 2025
https://github.com/kelvintechnical/web-scraper
Tableau Book Price Analysis
data data-analysis data-science tableau tableau-public
Last synced: 25 Jan 2026
https://github.com/cuadros-code/project-7-whitehouse-petitions
create a petitions from white house API
data jsondecoder uiaction uialertcontroller uibarbuttonitem uimenu url
Last synced: 02 Nov 2025
https://github.com/danpoynor/data-pagination-and-filtering-project
Data pagination exercise using 'vanilla' JavaScript. This script consumes a JSON array containing any number of objects and adds buttons to a page that users can click to navigate to different pages of data.
data javascript json navigation pagination vanilla-javascript
Last synced: 20 Apr 2026
https://github.com/kashirin-alex/thither.direct-onamove
an android skeleton-example application for using data from Thither.Direct platform on mobile applications
android-application data data-analysis data-structures data-visualization mobile-development mobility query research-data-management
Last synced: 27 Apr 2026
https://github.com/brayflex/spy-sector-rotation-google-sheet
Creates a dynamic spreadsheet to visualize SPY and it's 11 largest sector ETFs. See market trends and identify potential sector rotation opportunities.
data etf google-sheets index price rotation script sector spreadsheet spy stock-market
Last synced: 29 Jun 2026
https://github.com/xuender/kstats
Golang statistics library package that supports v1.18+.
algorithms analytics data go golang kstats machine-learning math rounding statistics
Last synced: 20 Jul 2025
https://github.com/Axnjr/csv-parser-utils
Homework task for SWE position at Redhat.
csv data dataanalysis datatools pandas python
Last synced: 30 Oct 2025
https://github.com/ashishsingh789/quantium_data-analysis-_virtual-internship
Completed a job simulation focused on Data Analytics and Commercial Insights for the data science team. Developed expertise in data preparation and customer analytics, utilizing transaction datasets to extract valuable insights and deliver data-driven commercial recommendations
data datawrangling matplotlib pandas pandas-dataframe presentation programming python python-library
Last synced: 07 Apr 2026
https://github.com/jigyasag18/data-analysis-using-ms-excel
This project is on analyzing real-time data from Ambuvians Healthcare, a health products startup. It included data cleaning, such as removing duplicates and addressing missing values, followed by analyses to reveal insights into sales trends, customer demographics, and purchasing behaviors. Visualizations in MS-Excel including bar and pie charts.
analysis data data-visualization dataanalysis datacleaning datapreprocessing dataset msexcel visualization
Last synced: 07 Mar 2026
https://github.com/xjwllmsx/profitable-app-profiles
Analyzes Google Play & App Store data to recommend profitable profiles for free, ad-supported mobile apps
data data-analysis data-cleaning jupyter pandas python
Last synced: 18 May 2026
https://github.com/rid17pawar/friendscircle
Friends Circle is a console based application developed in cpp using Graph Data Structure.
cpp data graph graph-algorithms oop
Last synced: 08 Jun 2026
https://github.com/inekipelov/swift-codable-advance
A library of extensions for Swift Codable protocols, simplifying the process of encoding and decoding objects.
codable data dictionary json swift
Last synced: 25 Jan 2026
https://github.com/estherslabbert/data-exploration
Data analysis and data visualizations for different data sets
data data-analysis data-science data-visualization jupyter-notebook titanic-dataset usa-arrests-dataset
Last synced: 06 Apr 2025
https://github.com/caprogs/paris-events-analyzer
A project to analyze events in Paris using open source data provided by the city.
data data-analysis data-platform dbt docker ingestion python streamlit transformation vizualisation
Last synced: 04 May 2026
https://github.com/henryssondaniel/teacup-service-visualization-mysql-java
Connect your Teacup visualization data to a MySQL database
data mysql service teacup visualization
Last synced: 19 May 2026
https://github.com/furkantosun1607/cse201-data-structure
This repository contains implementations of various data structures completed as part of the CSE201 (Data Structures) course. Each week, a different data structure was implemented during lab sessions.
array arraylist bfs-search binarytree data dfs-search java linkedlist queue stack structure tree-structure
Last synced: 26 Jun 2025
https://github.com/sweta-kaundilya/911-calls-capstone-project
For this capstone project we will be analyzing some 911 call data from Kaggle.
data data-analysis data-visualization jupyter-notebook matplotlib numpy pandas python seaborn
Last synced: 28 Apr 2026
https://github.com/yourdataarchitect/french-realestate-data-pipeline
This repository contains a fully automated data pipeline built with Apache Airflow to extract, clean, analyze, and report real estate listings from Seloger. It pushes data to MongoDB, Elasticsearch, and Google Sheets, with real-time Slack alerts for monitoring.
airlfow data datanalysis datapipeline market-intelligence real-estate
Last synced: 31 Dec 2025
https://github.com/fliplet/fliplet-widget-data-source-query
Data Source Query Provider
Last synced: 11 Apr 2025
https://github.com/jigyasag18/financial-risk-analysis-project
The Credit Card Financial Risk Analysis Dashboard is a real-time Power BI tool designed to provide insights into credit card transactions and customer demographics. It features interactive visualizations, efficient data processing, and actionable insights to support decision-making. Utilizing data from SQL database, the dashboard tracks key metrics
data dataanalysis database datacleaning datapreprocessing dataprocessing datavisualization financial-analysis financialriskanalysis mysql powerbi sql statistical-analysis
Last synced: 06 Mar 2026
https://github.com/jhwa426/database
SQL, MSSQL, MongoDB Database
data data-warehouse data-wrangling database datamodeling entity-relationship-diagram normalization sql sqlite3 ssms
Last synced: 06 Apr 2025
https://github.com/akashlogics/street-data-tracking
Detect, Track and Count number of persons walking across the path(s) making use of YOLO. This Python project tracks people moving across predefined street zones
analysis data excel newdataset object-detection opencv python python3 yolo
Last synced: 19 May 2026
https://github.com/buildinamsterdam/contentful-graphql
Contentful GraphQL connection
Last synced: 05 Jan 2026
https://github.com/official-imvoiid/multifetch
A high-performance web scraper for bulk image and GIF extraction from reliable sources β built for AI/ML data pipelines and large-scale media collection
aiml data dataset gifscraper imagescraper python pythontool tools webscraper windows
Last synced: 19 May 2026
https://github.com/jorgermduarte/mongo-replication
cluster data mongo mongodb mongoose replica replica-set replication
Last synced: 03 Mar 2025
https://github.com/stdlib-js/dstructs-stack
Stack.
collection data data-structure data-structures first-out javascript last-in lifo node node-js nodejs stack stdlib structure
Last synced: 14 May 2026
https://github.com/majorcluster/clj-data-adapter
A Clojure library designed to convert data
Last synced: 12 Jul 2025
https://github.com/dsietz/daas-workshop
Workshop for building a Data as a Service platform using the DaaS SDK.
archconf daas daas-pattern data dataprivacy nfjs rust rust-lang
Last synced: 20 May 2026
https://github.com/codehub001/ai-driven-automation-for-data-quality-monitoring-in-cloud-data-warehouses
This project focuses on leveraging AI to automate data quality monitoring in cloud data warehouses. Traditional data validation methods often require manual intervention and fail to scale with increasing data complexity. By integrating machine learning models, this approach enables real-time anomaly detection, automated data cleansing.
csv-export csv-import dashboard data datacleaning lib modeltraining python testing-library visualization
Last synced: 13 May 2025
https://github.com/wolfchamane/amjs-data-types
Data types for your OOP javascript project
cjs data javascript modules nodejs oop types
Last synced: 20 May 2026
https://github.com/amethyst-php/shipment
amethyst amethyst-package api data laravel shipment
Last synced: 20 May 2026
https://github.com/pyfig/s21_data-science-bootcamp
School21 Bootcamp Data Science
data data-science numpy pandas python school21
Last synced: 26 Jun 2025
https://github.com/danielrosehill/ghg-ebitda-correlations
Streamlit data visualisation examining correlation between emissions & profitability
data sustainability sustainability-data
Last synced: 14 Mar 2025
https://github.com/ournet/embed-providers-data
Embed provides data
data embed embed-providers json providers
Last synced: 03 May 2026
https://github.com/amethyst-php/shipment-zone
amethyst amethyst-package api data laravel shipment-zone
Last synced: 20 May 2026
https://github.com/estherslabbert/final-capstone-unsupervised-ml
Exploration of USArrests data using unsupervised machine learning
arrests correction data data-analysis data-clustering data-visualization jupyter-notebook machine-learning pca-analysis standardised-data usa
Last synced: 26 Jun 2025
https://github.com/amethyst-php/geolocation
amethyst amethyst-package api data geolocation laravel
Last synced: 20 May 2026
https://github.com/harrisonwelch/pythondatascience
Repo of code from the linked-in lesson "Python: Data Analysis"
data data-science matplotlib notes numpy python tutorial
Last synced: 12 Apr 2026
https://github.com/arthurcfranklin/acervo-musical
Este projeto consiste na criaΓ§Γ£o de um banco de dados relacional para auxiliar um DJ na organizaΓ§Γ£o e catalogaΓ§Γ£o do seu acervo musical. O objetivo Γ© fornecer um sistema eficiente para armazenar e gerenciar informaΓ§Γ΅es sobre cantores, bandas, mΓΊsicas e suas versΓ΅es remixadas.
data database mysql mysql-database sql
Last synced: 22 Mar 2025
https://github.com/xmen3em/kaggle-competitions
This collection contains various projects and notebooks developed to tackle a range of Kaggle competitions, showcasing different machine learning techniques, data preprocessing methods, and model optimizations.
data data-science data-visualization deep-learning deployment ensemble-learning machine-learning-algorithms python streamlit
Last synced: 09 Apr 2026
https://github.com/jigyasag18/employee-salary-prediction-jigyasa
PayNexus is a machine learning-powered web app that predicts employee salaries based on role, education, and experience. Built using Python, Streamlit, and scikit-learn, it supports both single and batch predictions. The app includes advanced features like resume parsing via NLP and interactive visual analytics. Ideal for job seekers, HR profession
data dataset decision-tree-regressor gradient-boosting-classifier knearest-neighbor-classifier labelencoder lasso-regression linear-regression machine-learning machine-learning-algorithms machinelearning onehot-encoder pipeline random-forest random-forest-classifier ridge-regression standardscaler svr-regression-prediction xgboost xgboost-classifier
Last synced: 15 May 2026
https://github.com/tearth/test-data-generator
The generator of test data for the school project.
Last synced: 05 Jul 2025
https://github.com/yanaksalvo/all-panel-database-sql
TΓΌrkiye Cumhuriyeti Devleti'nin verilerini Γ§alarak insanlara satarak para kazanan veya bu paralarΔ± kara para aklama Εeklinde aklayarak gelir elde eden kiΕilerin database verileri ve bu sitelere giren kiΕilerin IP Adres bilgileri
api data database devlet ihbar panel panel-data paneldata panels sorgu sorgulama sorgupanel sql usom usomgovtr
Last synced: 06 Apr 2025
https://github.com/amethyst-php/token
amethyst amethyst-package api data laravel token
Last synced: 21 May 2026
https://github.com/fridex/real-estate
My machine learning in real estate
data machine-learning real-estate
Last synced: 27 Jun 2025
https://github.com/iliyasalve/cyclistic_case_study
Analysis of the Bike-Sharing System for the following question: "How do annual members and casual riders use Cyclistic bikes differently?"
bike-sharing data data-analysis data-visualisation r
Last synced: 06 Apr 2025
https://github.com/renebentes/2808
Curso 2808 - Fundamentos do Entity Framework
Last synced: 27 Jun 2025
https://github.com/gagolews/clustering-data-v0
Datasets for Clustering [DEPRECATED β A NEW VERSION IS AVAILABLE]
clustering data dataset machine-learning
Last synced: 15 Sep 2025
https://github.com/parmsam/rweekly.data
R package containing data on Rweekly posts
Last synced: 21 May 2026
https://github.com/indhra/cats-ijcnn-data-2004
CATS IJCNN Data 2004 Competition of Artificial Time Series
2004 artificial cats data ijcnn time-series
Last synced: 22 Mar 2025
https://github.com/matheussoranco/how-to-estimate-required-sample-size-for-model-training
Modeling the relationship between training set size and model accuracy.
artificial-intelligence data jupyter-notebook machine-learning python
Last synced: 22 May 2026
https://github.com/mobinx/easymeet-js
EasyMeetjs is a robust and versatile TypeScript library that provides a solid foundation for building WebRTC-based applications. It simplifies the complexities of WebRTC, enabling developers to easily incorporate real-time communication features into their projects.From simple audio video calling to real time peer to peer file transfer , everything
data meeting react realtime screensharing streaming-video webrtc zoom
Last synced: 03 Jan 2026
https://github.com/thesfinox/fit-the-data
Data analysis using Wolfram Mathematica
analysis data data-analysis lab mathematica wolfram wolfram-mathematica
Last synced: 24 Jan 2026
https://github.com/pyrustic/litedao
Intuitive interaction with SQLite database
auto-init dao data database database-access library lightweight pyrustic python sql sqlite
Last synced: 09 May 2026
https://github.com/ahmad-ali-rafique/random-forest-regressor-modeling
Detailed exploration of random forest regressors, including data cleaning, model building, and performance evaluation on various datasets.
data dataanalytics datacleaning evaluation-metrics modeling random-forest random-forest-regression regression regression-analysis
Last synced: 05 Mar 2025
https://github.com/bala-1409/sales-forecasting-datascience-project
Develop a data science project using historical sales data to build a regression model that accurately predicts future sales. Preprocess the dataset, conduct exploratory analysis, select relevant features, and employ regression algorithms for model development. Evaluate model performance, optimize hyperparameters, and provide actionable insights.
data data-analysis data-science data-visualization datacleaning exploratory-data-analysis machine-learning-algorithms modelfitting prediction predictive-analytics predictive-modeling python3 regression-models salesforecast supervised-learning
Last synced: 26 Apr 2026
https://github.com/moons-14/datapot
Incorporate and serve all information.
ai aiogram api data infomation news newspaper rss video
Last synced: 04 Jan 2026
https://github.com/ayresgneto/use-case-gcp-etl
ELT pipeline GCP. Tecnologias utilizadas: Postgresql, GCP Storage, Airflow (local), Pyspark (local), BigQuery
airflow big-data bigquery data data-engineering etl gcp pipeline postgresql programming-oriented-object pyspark python spark
Last synced: 03 Jan 2026
https://github.com/yourdataarchitect/abyat-scaring-
This Scrapy spider for automates the extraction of product data from the Abyat website using Hidden Backend API, supporting both Arabic and English content.
data database scraper scrapy-crawler
Last synced: 23 Apr 2026
https://github.com/raufjatoi/electricity-consumption-prediction
arima-model customize data kinda-dynamic ml
Last synced: 25 Jul 2025
https://github.com/trevorhobenshield/psychopath
Path Utils for ML Data Prep.
audio data data-science deep-learning filesystem images machine-learning text videos
Last synced: 25 Jul 2025
https://github.com/mysociety/sync-ep-to-jkan
Syncs EveryPolitician data to mySociety's data portal.
data everypolitician jkan politicians
Last synced: 27 Jul 2025
https://github.com/i-rzr-i/domaincommonextensions
The purpose of this repository/library is to provide the most relevant and used extension methods in the life cycle of application development that allow us to improve our code, and writing speed, and use more efficiently dev team time during this period for more complex functionality.
api class data datatype extension helper object parser type util
Last synced: 20 Sep 2025
https://github.com/beastbytes/n6l-phone-number-data-php
NationalPhoneNumerInterface implementation using PHP for storage
data itu-t0202 phone-number php yii3
Last synced: 08 Feb 2026