data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-06-29 00:07:49 UTC
- JSON Representation
https://github.com/shubhamsoni98/survey-data-analysis
Surey Data Analysis
analysis dashboards data data-mining data-visualization dataanalysis datacleaning datascience datasets insights pivot-tables pivotanalysis
Last synced: 07 Mar 2026
https://github.com/sakan811/gachascope
Evaluate the cost-effectiveness of various in-app purchase bundles available in gacha games.
data data-analysis data-visualization game honkai honkai-star-rail honkai-starrail hoyoverse javascript nextjs tableau tableau-public typescript wutheringwaves
Last synced: 04 May 2026
https://github.com/austinv11/pypeline
A simple data pipeline builder for Python 3+
data leveldb pypeline python python3 stream-processing
Last synced: 20 Aug 2025
https://github.com/dhi13man/rca_ace
RCA Ace is designed for organizations seeking to enhance their understanding and utilization of insights derived from Root Cause Analyses (RCAs).
analytics data enterprise open-source python python3 rca
Last synced: 10 Sep 2025
https://github.com/karaniwachira/baby_names_analysis
Data Analysis: Baby Names Exploration
data data-analysis quarto quartopub r rstats tidyverse-ggplot2
Last synced: 22 Jun 2025
https://github.com/mradkov/secure-data-exchange
Elliptic Curve Diffie-Hellman secure data exchange via smart contracts on Aeternity blockchain
aeternity data exchange key-exchange smart-contracts sophia
Last synced: 22 Jun 2025
https://github.com/dolanmiu/mclaren-task
A front end assessment task for Mclaren
angular data observable observables rxjs
Last synced: 16 May 2026
https://github.com/uttori/uttori-data-tools
Tools for working with binary data.
Last synced: 17 Feb 2026
https://github.com/aliaksandr-master/unipipeline
simple way to build the declarative and destributed data pipelines with python
Last synced: 11 Jul 2025
https://github.com/thetacom/byteclasses
A Python package to manage and interact with binary data in a simple and structured manner.
binary-data bytes data dataclasses package python python3
Last synced: 11 Jul 2025
https://github.com/jensostertag-archive/charts.js
A JavaScript Plugin to draw Charts to visualize Data and Statistics on Websites
charts data javascript statistics webapplication
Last synced: 22 Jun 2025
https://github.com/fintech-lsi/fintech-credit-risk-prediction
This repository provides a machine learning model for predicting credit risk in the financial sector. The model uses borrower information, such as age, income, employment length, loan amount, and credit history, to assess the likelihood of loan repayment or default.
data fintech machine-learning model prediction risk
Last synced: 12 Oct 2025
https://github.com/the-tech-idea/beep.winform.sample
Application for Managing your Different DataSources . Still in Alpha.please be patient
application data data-science database dataset integeration mysql nosql oracle postgres sqlite sqlserver workflow-engine workflows
Last synced: 08 Jul 2025
https://github.com/thanhleviet/vietnam_antibiotics_bidding
This repo contains data of bidding for multiple drugs and antibiotics reported to Vietnam Ministry of Health in 2015, 2016, 2017.
Last synced: 23 Feb 2026
https://github.com/eby8zevin/android-intent
Intent & Bundle - Android Studio
android android-development android-studio bundle data intent java xml
Last synced: 03 Sep 2025
https://github.com/elimu-ai/analytics
📊 Android application which collects, provides and uploads learning event data
csv data data-science dataset edtech egma egra infrastructural learning-analytics
Last synced: 12 Oct 2025
https://github.com/wireservice/workbench-lookup
A port of `agate-lookup` to Workbench
data journalism lookup workbench
Last synced: 08 Jun 2026
https://github.com/madhuresh2011/daily-sql-from-hackerrank
Welcome to my SQL Series, where I tackle SQL problems from HackerRank on a daily basis.
data dataanalysis database question-answering sql
Last synced: 19 Jan 2026
https://github.com/ot-code/sql-sabor-y-tradicion
A SQL-driven project that integrates menu and order data to reveal insights on dish performance, customer preferences, and spending trends. It informs pricing strategies, menu adjustments, and targeted promotions, ultimately enhancing the overall customer experience and driving business growth.
analytical-queries data data-aggregation data-analysis database-design join-queries mysql order-analytics relational-databases restaurant-data sql sql-script
Last synced: 08 Apr 2025
https://github.com/arthurdanjou/studies
💼 This is the repository containing all my projects done during my studies in Python and R.
ai data data-science data-visualization jupyter jupyter-notebook ml python r
Last synced: 08 Apr 2025
https://github.com/0xnu/nfl-picks
NFL match prediction with scores using historical data (1999-Present).
american-football data nfl prediction
Last synced: 12 Oct 2025
https://github.com/mumtaz4118/nlp-course
Programming Assignments and Lectures for Stanford's CS 224: Natural Language Processing with Deep Learning
course data data-analysis data-analytics data-science data-visualization deep-learning education machine-learning natural-language-processing neural-network transfer-learning
Last synced: 24 Nov 2025
https://github.com/drzax/light-up-brisbane
Where, what and why various public places in Brisbane are lit up.
Last synced: 19 Jan 2026
https://github.com/gvatsal60/ds-on-kaggle
A collection of data science projects, experiments, and insights from Kaggle competitions and datasets
data data-science data-visualization numpy pandas python3
Last synced: 29 Apr 2026
https://github.com/jigyasag18/ai-ml-salaries-and-ai-tools-usage-trends
This repository presents an in-depth Power BI analytics report on the AI job market trends and student AI tool usage from 2020 to 2025. It combines structured datasets (job postings, salaries, surveys) with custom DAX measures to uncover key patterns in salaries, remote work, industry demand, and student engagement. 5 interaractive dashboards made.
analysis data data-analysis data-visualization dataanalysis dataanalytics dataset datavisualization power-bi powerbi powerbi-dashboards powerbi-desktop powerbi-report powerbi-visuals powerbidashboard visualization
Last synced: 16 Feb 2026
https://github.com/adadalshabab/data-engineering-gcp-project
An end-to-end modern data engineering project, including deployment of ETL pipeline on Google Cloud Platform, using BigQuery for data analysis and leveraging Looker to generate an insight dashboard.
bigquery data data-science data-visualization databases dataengineering-a engineering etl-pipeline looker-studio powerbi
Last synced: 19 Jan 2026
https://github.com/sauravsrivastav/githubreposearcher
GitHub Repo Searcher 🔍 is a Streamlit web application designed to help you search for GitHub repositories based on a query and view the results in a tabular format. You can also download the results in CSV or Excel format for further analysis. 📊📈
data data-export excel github-api python repository-searcher streamlit webapp
Last synced: 20 Jan 2026
https://github.com/tyriek-cloud/nyc-dca-etl
Created an ETL pipeline to merge two CSV files (converted to JSON) into a parquet file using Azure Data Factory, The data was extracted from NYC Open Data: https://opendata.cityofnewyork.us/ and I created a Blob Container within an existing storage account.
azure azure-data-factory blob-storage data data-engineering etl-pipeline
Last synced: 21 Jan 2026
https://github.com/jhpoelen/bees
Content-based iDigBio prototype
biodiversity data ecololgical informatics provenance
Last synced: 18 Mar 2026
https://github.com/lemaitre4523/old-tiktok-data-report-explorer
An explorer for tiktok data report
data explorer extract package report simple tdre tiktok tiktok-data-explorer
Last synced: 25 Sep 2025
https://github.com/cpietsch/breitband
developer repo of breitband-berlin
d3js data threejs visualization
Last synced: 02 May 2026
https://github.com/juanpablo70/pgad-assignment01
Breast Cancer Coimbra data set analysis
data data-science dataframe dataset jupyter-notebook matplotlib numpy pandas python
Last synced: 08 May 2026
https://github.com/luminati-io/httpx-web-scraping
Web scraping using HTTPX in Python, covering setup, advanced features, comparisons with Requests, and more.
beautifulsoup data html httpx python web-scraper web-scraping
Last synced: 13 Oct 2025
https://github.com/petzi53/repair
R Datasets of the Open Repair Alliance (ORA).
Last synced: 19 May 2026
https://github.com/fatihilhan42/hollywood-theatrical-market-synopsis-1995-to-2021
In this project, the data of hollywood film production companies from 1995 to 2021 were examined. Significant tables and graphs were created using data visualization algorithms, with the tickets sold divided into categories.
data data-analysis data-science data-visualization
Last synced: 23 Mar 2025
https://github.com/janakajain/Joshua_Project
christianity data proselytizing religion
Last synced: 10 Mar 2025
https://github.com/abhinav330/instagram-influencers-analysis
This Jupyter Notebook focuses on preprocessing and visualizing data from an Instagram profiles dataset. It includes data loading, inspection, visualization, and some data preprocessing steps.
data data-science data-visualization exploratory-data-analysis exploratory-data-visualizations influncer-products instagram scikit-learn sklearn
Last synced: 08 Jun 2026
https://github.com/ragibasif/bobdylan
Bob Dylan
bob-dylan csv data data-science data-visualization lyrics music python
Last synced: 03 Sep 2025
https://github.com/0xHericles/ufcg-geojson
GeoJSON file containing the blocks and buildings of the Federal University of Campina Grande.
data data-visualization geojson map open-source ufcg university
Last synced: 24 Mar 2025
https://github.com/beeracs/llama
Run Llama models in your web browser using JavaScript and WebAssembly. Explore light and dark modes easily. 🌐🐱👤
ai data fine-tuning framework gpt langchain large-language-models llama3 llamaindex llm lora machine-learning nlp peft qlora qwen rlhf vllm
Last synced: 10 May 2026
https://github.com/amethyst-php/courier
amethyst amethyst-package api courier data laravel
Last synced: 17 May 2026
https://github.com/samiksha29-patil/hr-employee-data-analysis-visualization-in-python
This project focuses on analyzing an HR Employee Dataset that contains details about employees such as demographics, job status, salaries, performance reviews, satisfaction levels, and attrition reasons.
csv-files data data-visualization dataanalysis matplotlib numpy pandas python seaborn
Last synced: 30 Apr 2026
https://github.com/45harry/potato_disease_classification
Potato Disease Classification - Traning, Rest Api and FrontEnd to Test
cnn-classification data data-science datapreprocessing deep-learning fastapi flaskapi frontend keras restapi tensorflow
Last synced: 12 Apr 2026
https://github.com/jpcadena/ventas-facturas
Ventas con facturas
data data-analysis data-exploration data-extraction data-science excel feature-engineering matplotlib microsoft numpy pandas powerbi product-sales pylint python receipts sales
Last synced: 12 Apr 2026
https://github.com/rrwen/r-reference
Quick reference to learning R
analysis beginner data guide introduction learn r reference statistics stats syntax
Last synced: 02 Jul 2025
https://github.com/ffatahillah7/snowflake-data-governance-warehouses
Welcome to the Powered by Tasty Bytes - Zero to Snowflake Quickstart focused on Data Governance! Within this Quickstart we will learn about Snowflake Roles, Role Based Access Control and deploy both Column and Row Level Security that can scale with your business.
data data-governance snowflake
Last synced: 06 Jan 2026
https://github.com/abhijeetdasbakshi/ecommerce-insights
A Dockerized end-to-end project that combines unsupervised machine learning for customer segmentation with scalable data pipelines. It uses MongoDB for data ingestion, Scikit-learn for clustering, Airflow for orchestration, and Streamlit for interactive visualization — enabling actionable insights into e-commerce
airflow airflow-dags ci-cd-pipeline clustering dags data data-pipelines docker docker-compose docker-container dockerfile git great-expectations kafka mongodb pca-analysis postgresql pyspark t-sne umap-learn
Last synced: 04 Apr 2026
https://github.com/odiegosilva1/flask-github-style
Página de login usando Jinja no Flask.
data flask jinja2-templates orm python
Last synced: 31 May 2026
https://github.com/dhruvsrikanth/superconductor-regression-kaggle-challenge
Kaggle challenge based on superconductor dataset.
data data-science jupyter-notebook kaggle kaggle-challenge kaggle-competition lasso-regression linear-regression machine-learning python random-forest regression sklearn support-vector-regression
Last synced: 30 Apr 2026
https://github.com/polyee99/kaggle-titanic-data-analytics
Jupiter notebook to predict the outcome of passengers who died or not in the tragical Titanic event.
data eda jupiter-notebook matplotlib numpy pandas python regression-analysis test-train-split visualization
Last synced: 05 Feb 2026
https://github.com/isandyawan/simplelinearregression
A application to analyze data using simple linear regression. This application can make regression model from variable and give advice to user if the model break regression assumsion
data linear r regression rstudio shiny statistic
Last synced: 14 Oct 2025
https://github.com/ailixter/gears-dictionary
The project, which Gears Dictionary
arrays data dictionaries dictionary php struct utilities
Last synced: 19 Jul 2025
https://github.com/jigyasag18/aircraft-data-management
This repository offers a comprehensive simulation of global military air deployments involving 10 countries, aircraft models, mission types, and strategic zones. It analyzes air power distribution, mission intent (offensive, defensive, support), and geopolitical positioning. The project provides structured insights into regional & zone level threat
aircraft-data aircraft-performance data data-analysis data-visualization database database-management dataset datavisualisation mysql powerbi powerbi-report powerbi-visuals sql
Last synced: 04 Feb 2026
https://github.com/arush-codes/lgmvip-data-science-task-1
data data-science iris-classification lgmvip virtual-internship
Last synced: 14 Oct 2025
https://github.com/brandonzylstra/essence
🧘🏼♂️ Relaxed Rails Modeling & Migrations
active-record data database gem hcl modeling rails ruby ruby-on-rails yaml
Last synced: 14 Apr 2026
https://github.com/mominurr/fire-gas-leak-detection-system
A real-time fire prevention system integrating IoT sensors and computer vision to trigger evacuations.
ai computer-vision data datascience machine-learning ml python yolo
Last synced: 27 Jan 2026
https://github.com/rafie-b/data-analytics
Activities of Data Analysis.
apache-spark api aws business-analytics data data-analytics data-science database dataframe jupyter-notebook python scikit-learn sql
Last synced: 14 Apr 2026
https://github.com/rahult18/atmo-flow
AtmoFlow is a robust data engineering pipeline built on Google Cloud Platform (GCP) that processes and analyzes weather and air quality data in both batch and streaming modes
airflow data data-modeling data-science data-visualization dataengineering gcp-bigquery gcp-cloud-composer gcp-cloud-functions pyspark
Last synced: 23 Jun 2026
https://github.com/onekiloparsec/arcsecond-swift
The swift client for interacting with the server-side RESTful resources of arcsecond.io.
arcsecond astro-library astronomy data django swift swift-3
Last synced: 30 Apr 2026
https://github.com/datamine/yelp-date
Does being on a date impact the score on a yelp review? Let's find out!
data ipython ipython-notebook pandas python python-2 yelp yelp-reviews
Last synced: 14 Apr 2026
https://github.com/faster-games/dynamic-components
Dynamic Runtime Components for Unity3D
Last synced: 11 Apr 2026
https://github.com/awpala/udemy-my-courses-data-parser
Download Udemy lists and courses metadata for authenticated student user
Last synced: 07 May 2026
https://github.com/vanshuchaudhary/flightpriceanalysis-
The uploaded file is a Jupyter Notebook titled "Flight Analysis". It likely involves analyzing flight-related data, potentially exploring trends, patterns, or insights using data science techniques. The analysis might include data visualization, statistical analysis, or predictive modeling.
business-analytics data data-analysis data-visualization datainsights datascience matplotlib-pyplot python seaborn seaborn-plots seaborn-python sns statistical-analysis
Last synced: 08 May 2026
https://github.com/science-analyse/clv_model
customer lifetime value prediction
banking banking-applications clv clv-analysis data data-science machine-learning
Last synced: 15 Oct 2025
https://github.com/bscript07/softuni-javascript-applications
Javascript for Applications course at SoftUni -Oct 2023
architecture-component authentication client-side-rendering-seo data lit-html-template routing
Last synced: 15 Mar 2025
https://github.com/mmaithani/kaggle-projects
Collection of all the resources from competition, kernal And data section also all the magic code i have been using to get most of out of a problem
computer-vision data data-science image-processing machine-learning python
Last synced: 30 Apr 2026
https://github.com/flowsynx/plugin-base64
FlowSynx plugin to provides encoding and decoding of Base64 strings, allowing workflows to handle Base64 content transformations efficiently.
base64 base64-decoding base64-encoding data data-platform decoding encoding flowsynx flowsynx-plugins
Last synced: 10 Mar 2026
https://github.com/pbinkley/mfmcollections
Project to distill data about published collections of microfilms from library lists
Last synced: 28 May 2026
https://github.com/shudhanshusaurabh001/super_market-data-analysis-using-python
This project focuses on analyzing supermarket sales data using Python. The goal is to extract meaningful insights from the dataset, such as sales trends, customer purchasing behavior, and product performance.
analysis csv data insights matplotlib numpy pandas project python seaborn
Last synced: 06 Apr 2026
https://github.com/raphcodec/rand-org-generator
Rand-Org-Generator attempts mimic real company structures. The dummy data generated by this project is intended to be used in analytics projects or web projects.
data duckdb factory-boy faker org-chart polars python3
Last synced: 30 Apr 2026
https://github.com/heyimsteve/solnftdatadash
This a React-based web application that provides detailed information about NFT collections on the Solana blockchain. It uses the HelloMoon API to fetch and display data about NFT collections, including statistics, loan summaries, ownership information, and floor prices.
dashboard data hellomoon nft react solana solana-nft
Last synced: 30 Jan 2026
https://github.com/j-sephb-lt-n/personal-projects
A history of my personal projects and professional development
ai api auth cloud data llms personal-development web
Last synced: 24 Jan 2026
https://github.com/poissonconsulting/klexdatr
An R package of data from the Kootenay Lake Exploitation Study
cran data fish kootenay-lake rstats
Last synced: 16 Oct 2025
https://github.com/tyriek-cloud/statistical-work-sample
The purpose of this study is to observe if a sample of people that has siblings is independent of a sample of people that possess an opinion of whether patients with incurable diseases should be allowed to die.
analysis data spss statistics t-test
Last synced: 22 Jan 2026
https://github.com/murshidazher/client-side-data-storage
🚌 A workspace containing client-side data storage implementations
cache cache-storage client-side data indexeddb localstorage sessionstorage storage websql
Last synced: 02 Sep 2025
https://github.com/bdr-pro/streamlint
ltra-cool Streamlit app, where you can interact with widgets, see data in action, and even upload and download files
Last synced: 14 Apr 2026
https://github.com/lugolbis/data-immo
End-to-end ETL pipeline
data data-engineering dbt dremio duckdb etl-pipeline lakehouse rust
Last synced: 08 Jun 2026
https://github.com/plnech/never2late
Never 2 Late - a reinterpretation of Everest Pipkin's 'i've never picked a protected flower'
dada dada-science data generative-art glitch-art installation nlp poetry spacy vector-similarity wallpaper
Last synced: 10 Jun 2025
https://github.com/vanduc1102/parse-stackoverflow-data
Parse stackoverflow data
Last synced: 16 Oct 2025
https://github.com/bertrand31/one-billion-rows-challenge
🌪️ Pushing Scala to its limits to aggregate a billion rows' worth of data in 2.42 seconds
competitive-programming competitive-programming-contests data data-engineering data-processing performance scala
Last synced: 05 Sep 2025
https://github.com/saboye/sales-performance-analysis
A dashboard that presents monthly sales performance by product segment and product category to help clients identifying the segments and categories that have met or exceeded their sales targets, as well as those that have not met their sales targets.
dashboard data data-science eda tableau visualization
Last synced: 27 Jan 2026
https://github.com/mat06mat/matbot
My discord bot code
data discord-bot discord-py py-cord
Last synced: 17 Oct 2025
https://github.com/miguelmedinacastro/trabalho-dados-r
Trabalho final da disciplina Análise Exploratória de Dados
data data-science data-science-projects data-visualization database r rstudio
Last synced: 01 May 2026
https://github.com/ronknight/user-data-dashboard
📈 A data visualization tool for analyzing user data using an Excel-based data source.
dashboard data excel ga4 screenshot
Last synced: 17 Oct 2025
https://github.com/sasanthns/sql_data_warehouse_project
A comprehensive guide to building a modern data warehouse with SQL Server, including ETL processes, data modeling, and analytics.
data data-analysis data-science data-warehouse datacleaning etl etlpipeline sql sqlserver
Last synced: 24 Mar 2025