data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-07-03 00:07:49 UTC
- JSON Representation
https://github.com/mwiatrzyk/modelity
Data parsing and validation library for Python
data library model parsing python tool validation
Last synced: 18 Jan 2026
https://github.com/ppatrzyk/heatmap
Display CSV as a heatmap in terminal
csv data data-visualization terminal
Last synced: 24 Apr 2026
https://github.com/andykee/aurora
A lightweight tool for indexing, cataloging, and browsing data.
catalog data data-catalog data-discovery indexing metadata metadata-extraction search-and-discovery
Last synced: 17 Jan 2026
https://github.com/mehmetkahya0/gallstone_dataset_analysis_project
Safra Taşı Hastalığı (Gallstone-1) Veri Seti Analizi (https://archive.ics.uci.edu/dataset/1150/gallstone-1)
analysis analytics data data-analysis data-science data-visualization database graph matplotlib python
Last synced: 25 Apr 2026
https://github.com/thinkphp/my-react-tictactoeai-app
App React Tic Tac Toe Component based on Artificial Intelligence
ai algoirthms data datastructures games javascript react
Last synced: 25 Apr 2026
https://github.com/rodrigojunqueiradev/curso-python-3-do-basico-ao-avancado
Curso de Python 3 do básico ao avançado - com projetos reais
data data-analysis data-science python python-3 python-library python-script python3
Last synced: 27 Jun 2026
https://github.com/marielachirinosr/hotel-data-analysis
Pandas & Matplotlib Learning Analysis. Repository featuring data analysis projects using Pandas and Matplotlib libraries
data data-analysis matplotlib pandas python
Last synced: 25 Apr 2026
https://github.com/anuraganalog/blog
Data Science Blog
anuraganalog blog data science
Last synced: 26 Apr 2026
https://github.com/jigyasag18/multiple-disease-detection-app
This repository contains the implementation of a Multiple Disease Detection System, which employs advanced machine learning techniques for early detection and prediction of prevalent diseases, including diabetes, heart disease, and Parkinson's disease. The system utilizes a variety of patient health metrics such as demographics and medical history.
data datapreprocessing machine-learning machine-learning-algorithms machinelearningmodel prediction python streamlit streamlit-webapp
Last synced: 07 Jun 2026
https://github.com/quarkgluant/intro_ml_udemy
cours Udemy d'Introduction au Machine Learning
anaconda3 data data-preprocessing data-regression machine-learning python-3 udemy-machine-learning
Last synced: 12 May 2026
https://github.com/fatihemres/africa
Africa app by SwiftUI. Using AVFoundation, MapKit, data, models, animations, stickers.
animations avfoundation data mapkit models swift swift-animations swiftui
Last synced: 27 Apr 2026
https://github.com/demkeys/lazydatatransfer
Lazy method to transfer upto 64kb of data over the network using UDP
data data-trans network python transfer udp
Last synced: 07 Jun 2026
https://github.com/vatshayan/b.tech-project-cancer-predication-system
Cancer Prediction System Project Developed through a Machine learning approach.
btech btechfinalyear cancer collegeproject csv data data-science data-structures datas datasets final-project finalyear india machinelearning project python python-3
Last synced: 07 Jun 2026
https://github.com/santiagoenriquega/custom_database
Python-based database library for database management, indexing, transactions, and constraints, showcasing foundational database concepts.
data data-engineering database database-design python
Last synced: 27 Apr 2026
https://github.com/redgoose-dev/baguni
이미지를 보관하고 탐색하는 웹 프로그램
data explorer file management upload
Last synced: 14 Apr 2026
https://github.com/miniql/notebook-example
An example of MiniQL in a JavaScript Notebook
comma-separated-values csv data data-analysis data-science graphql javascript notebook query query-language
Last synced: 13 May 2026
https://github.com/meicloudie/react-practice-react-router-and-authentication
Learning React Project - @academind-maxschwarzmueller
authentication data javascript practice-project react react-router
Last synced: 13 May 2026
https://github.com/rod-persky/sungrowdatacollector
Data collector for a SunGrow SG8.0RT Inverter
Last synced: 19 Jan 2026
https://github.com/delonnewman/relational
Relational programming for Ruby
csv csv-import data data-analysis database export json relational relational-algebra relational-database relational-model relational-programming reporting reports ruby yaml
Last synced: 28 Apr 2026
https://github.com/priyanshubiswas-tech/e-commerce_data_analysis
Analyzes 9,994 e-commerce transactions to uncover insights on sales trends, customer behavior, profitability, and logistics using EDA and visualization. Identifies top products, customer segments, and shipping efficiencies to optimize marketing, inventory, and operations, making it valuable for retail, finance, and logistics.
data data-analysis data-visualization pandas pandas-dataframe plotly-analytics-projects plotly-express python
Last synced: 28 Apr 2026
https://github.com/quetz-al/quetzal-openapi-client
Autogenerated Python client for the Quetzal API
client data data-science openapi-client openapi3 python quetzal
Last synced: 10 Oct 2025
https://github.com/vim89/flowforge
Let's be honest - most data pipeline frameworks treat types as suggestions. Config files are strings. Schemas are "validated" at runtime. Data quality is an afterthought. So, let's do differently
archetype data data-contracts data-engineering data-pipelines data-quality data-science database dataengineering datapipeline etl etl-framework pipelines scala scalability spark spark-sql spark-streaming
Last synced: 14 Apr 2026
https://github.com/mtalhaofc/nutrition_system
A simple AI-powered web app built using Streamlit that provides personalized weekly meal plans and nutrition recommendations based on user demographics, health goals, and nutritional preferences.
cosine-similarity data data-science food machine-learning model nutrition pandas python streamlit
Last synced: 29 Apr 2026
https://github.com/stdlib-js/array-struct-factory
Return a constructor for creating arrays having a fixed-width composite data type.
array composite data factory javascript node node-js nodejs stdlib struct structure typed typed-array types
Last synced: 29 Apr 2026
https://github.com/steventhompson6460-stack/octoparse-government-listings-scraper
Octoparse workflow for structured government data
data extraction government listings octoparse public-records python scraper scrapy structured web-crawling workflow
Last synced: 31 May 2026
https://github.com/psyteachr/sdg-data
Data relevant to the UN Sustainable Development Goals
Last synced: 09 Oct 2025
https://github.com/diegoperea20/pytorch-vs-tensorflow
Testing the differences of the pytorch and tensorflow libraries in the different prediction and classification applications, each of them gives improvements depending on the problem they are assigned or data set assigned.
classification data images prediction pytorch tensorflow
Last synced: 29 Apr 2026
https://github.com/istinnew/eniac_ab_insight
Dive into a comprehensive analysis aimed at boosting iPhone 13 sales by optimizing the Click-Through Rate (CTR) of the “SHOP NOW” button, compare different button designs and determine the most effective strategy for increasing engagement.
ab-testing data data-analysis data-engineering data-science data-visualization google googlecolab libraries python testing testing-tools visual-studio-code
Last synced: 29 Apr 2026
https://github.com/cburmeister/disc-golf-courses
All the disc golf courses i've played at. Maintained with http://geojson.io/.
Last synced: 21 Jan 2026
https://github.com/prajjwol09/sql_retail_analysis_project
This project demonstrates SQL-based data cleaning, exploration, and business analysis on a retail sales dataset. It involves setting up a database, removing null values, performing EDA, and using SQL queries to extract key insights such as top customers, best-selling categories, and monthly sales trends.
data data-analysis datacleaning dataexploration pgadmin4 sql
Last synced: 15 Feb 2026
https://github.com/axnjr/csv-parser-utils
My own Pandas in Go, Python & Rust, Utility methods for Handling CSV Files in Core Go & Rust with bindings for python.
csv data dataanalysis datatools go golang golang-application pandas python rs rust
Last synced: 29 Apr 2026
https://github.com/lamouchi-bayrem/data-matrix-scanner
A dual-interface tool that leverages AI to **detect and decode QR codes and Data Matrix codes** from images using computer vision
data datamatrix-scanner decoder flask qrcode scanner tkinter-gui webapp
Last synced: 30 Apr 2026
https://github.com/onekiloparsec/arcsecond-swift
The swift client for interacting with the server-side RESTful resources of arcsecond.io.
arcsecond astro-library astronomy data django swift swift-3
Last synced: 30 Apr 2026
https://github.com/mmaithani/kaggle-projects
Collection of all the resources from competition, kernal And data section also all the magic code i have been using to get most of out of a problem
computer-vision data data-science image-processing machine-learning python
Last synced: 30 Apr 2026
https://github.com/lugolbis/data-immo
End-to-end ETL pipeline
data data-engineering dbt dremio duckdb etl-pipeline lakehouse rust
Last synced: 08 Jun 2026
https://github.com/dantetrb/diabetes-readmission-dbt
Predictive analytics on diabetic patient readmissions using dbt, DuckDB and Python – with explainability and clustering.
clustering data dataengineering dbt diabetes duckdb hdbscan healthcare jupyter lime readmission-prediction sql
Last synced: 01 May 2026
https://github.com/cannt39t/wylsacom-analysis-reflinks-datamining
data data-analysis data-mining python3 sql
Last synced: 13 Jun 2026
https://github.com/shauryauppal/mydatatoolkit
A toolkit for data scientists to get work done faster, easier, and in a smarter way.
analytics awesome-list data data-science hacktoberfest
Last synced: 08 Jun 2026
https://github.com/anarya22/e-commerce_analysis
E-Commerce_Analysis is a data analysis project performed on the Superstore_USA dataset. It explores various aspects of e-commerce performance, including sales trends, customer demographics, product categories, and regional performance. The analysis includes data cleaning, visualizations, and insights on factors influencing sales and profitability.
analysis analytics cleaning-data data
Last synced: 09 Oct 2025
https://github.com/anandvai/ai_rag_chatbot_multi_pdf_support
RAG (Retrieval-Augmented Generation) Chatbot built with Streamlit and LangChain, powered by Groq's blazing-fast LLaMA3-8B. It allows you to upload multiple PDFs, ask questions, and get precise, context-aware answers in a conversational format.
ai data data-science data-visualization data-visualizations dataengineering fastapi langchain langgraph python sql streamlit
Last synced: 01 May 2026
https://github.com/word2vect/beijing-pm2.5-data-process
Beijing PM2.5 Data Process for Python Programming 2024 Fall Data Visualization Lab 2
Last synced: 15 Jun 2026
https://github.com/vbshuliar/ktor-http-request-response
This project is part of my Android Development Specialization provided by Meta on Coursera. In this project I practised HTTP requests and responses using Ktor.
android compose data http https json kotlin ktor request response
Last synced: 01 May 2026
https://github.com/sandygcabanes/etl-earthquake-data-from-usgs-google-cloud-composer-airflow
Airflow, Google Cloud Composer, GCS, BigQuery, Python. This automated pipeline pulls daily earthquake data from a trusted public source, stores it securely in the cloud, and organizes it into clean, searchable tables for analysis.
cloud composer dag data engineering etl etl-pipeline google json python
Last synced: 01 May 2026
https://github.com/muhammadadilnaeem/bcg-data-science-job-simulation-on-forage-august-2024
This repository contains all the tasks, code, and documentation completed during the BCG Data Science job simulation on The Forage platform. The simulation focused on analyzing customer churn, building predictive models, and presenting insights for a major utility company.
bcg customer-churn-prediction-with-machine-learning data data-science forage numpy pandas
Last synced: 01 May 2026
https://github.com/gcoronelc/cepsuni-disbd-64505
Taller de Modelamiento de de Base de Datos con Gustavo Coronel
data database databases db2 db2-database modeling oracle oracle-database relational-database relational-database-design relational-databases relationships sql sql-server
Last synced: 02 May 2026
https://github.com/rbreeze/dashboard
My personal health dashboard, with daily stats on food and sleep. Undergone several redesigns since 2015.
css dashboard data data-visualization design front-end google-sheets google-sheets-api health html javascript personal-health-record personal-website running static static-site visualization
Last synced: 02 May 2026
https://github.com/hafs96/prediction_consommation-de-carburant
Dans ce projet, l'objectif est de développer un modèle permettant de prédire si une voiture a une consommation de carburant élevée ou faible en fonction de ses caractéristiques techniques.
analysis data data-visualization machine-learning testing training
Last synced: 09 Jun 2026
https://github.com/vidupriya/aws-glue--data-copy
The function for copying data like CSV, Parquet, avro etc., from a source S3 bucket to a destination S3 bucket using AWS Glue. It includes the necessary setup for the Glue job, logging, reading data from the source bucket, and writing it to the destination bucket
aws awsglue awss3 data data-copying glue glue-job pyspark python3 s3 s3-bucket s3-buckets s3-storage spark
Last synced: 02 May 2026
https://github.com/viniddev/active_finance
Nesse projeto busquei solucionar um problema corriqueiro que é a dificuldade de se manter atualizado sobre as variações do mercado de ações e fundos imobiliários. Usei selenium webdriver para buscar informações e uma API do Telegram para enviar relatórios para o usuário
automation data data-analisis rpa selenium-webdriver telegram-bot
Last synced: 03 May 2026
https://github.com/ayushman0511/data-analytics-project1
This repository contains a collection of SQL scripts demonstrating various analytical techniques, such as changes over time, cumulative, performance, data segmentation, part-to-whole analysis.
analytics busine data data-anal data-enginee data-sci data-scien database datascien query reporting sql sql-query sql-server window-func
Last synced: 17 Jun 2026
https://github.com/tn3w/moviedb-json
A JSON library with 981,530 films.
data database db json movie movie-database movies
Last synced: 03 May 2026
https://github.com/abdellah-laassairi/thyroid-disease-analysis
Thyroid dataset visualization dashboard in R
dashboard data flexdashboard imputation-methods rshiny visualization
Last synced: 18 Jan 2026
https://github.com/fallaciousreasoning/nz-mountains
A list of mountains in NZ, scraped from https://climbnz.org.nz
alpine climbing climbnz data json json-api maps mountaineering scraping
Last synced: 04 May 2026
https://github.com/damisparks/become_data_analyst
Are you new to Data Analysis ? Here you will find simple notebook that will help through your journey. These are personal projects I work on and still working.
data data-analysis data-visualization matplotlib numpy pandas-tutorial
Last synced: 04 May 2026
https://github.com/anand-sony/mttr-dashboard
Streamlit dashboard for MTTR analysis with shift-wise loss insights and machine-level downtime tracking.
analytics business-analytics dashboard data python statistical-analysis
Last synced: 30 May 2026
https://github.com/mbagalman/lattice-doe
Python code to create experimental designs optimized to meet statistical power targets
abtesting data datascience designofexperiments experimentaldesign statistics
Last synced: 19 Jun 2026
https://github.com/sebastianhochreiter/sql-projects
business-intelligence data datascience microsoft microsoft-sql-server sql
Last synced: 22 Feb 2026
https://github.com/munas-git/codm-review-analysis-and-predictions
Sentiment analysis on Call of Duty Mobile Google Play Store user reviews with ML model to classify new reviews.
data flask machine-learning python sentiment-analysis
Last synced: 05 May 2026
https://github.com/amethyst-php/order
amethyst amethyst-package api data laravel order
Last synced: 19 May 2026
https://github.com/programmer-rd-ai/competitive-programming-solutions
A collection of my solutions to various competitive programming problems from platforms like LeetCode. This repository serves as a personal archive of my problem-solving journey, covering a range of algorithms, data structures, and problem-solving techniques.
algorithm algorithms algorithms-and-data-structures data datastructures dsa javascript pandas python structures
Last synced: 01 Mar 2025
https://github.com/moscatellimarco/webscrap-tinydeal
"WebScrap-TinyDeal" is a Scrapy-powered 🕷️ tool for harvesting product information 🏷️ from TinyDeal. It outputs structured CSV data 📁, ready for analysis. Explore the scripts 👨💻 for an interactive scraping adventure or leverage the data for competitive pricing strategies 📈.
css data datascience html pandas python scrapy web webscraper webscraping
Last synced: 14 Apr 2026
https://github.com/mito-ds/mitosheet_helper_config
The mitosheet_helper_config package used by enterprises to configure the mitosheet package.
data data-analytics data-science data-visualization jupyter pandas python
Last synced: 05 May 2026
https://github.com/metapsy-project/data-psychosis-psyctr
Database of psychological interventions for schizophrenia and psychosis compared to control conditions.
Last synced: 16 Mar 2026
https://github.com/shreshthvashisht/instgram-user-analytics
SQL Fundamentals
data data-analysis data-science mysql social-network-analysis
Last synced: 09 Jun 2026
https://github.com/rileynwong/forecasting-coffee-prices
Predict coffee prices in Kenya
data data-analysis data-scraping data-visualization forecasting forecasting-models forecasting-prices jupyter-notebook prophet prophet-model
Last synced: 20 Jun 2026
https://github.com/johndelatto/automate-your-job-search-ai-applies-to-1000-positions
Automate Your Job Search: AI Applies to 1000 Positions Overnight & Get 100+ Interviews! In today’s fast-paced and highly competitive job market, finding and securing your dream job can be both time-consuming and exhausting.
ai data non-profit open-ai open-source
Last synced: 28 Jan 2026
https://github.com/encelo/wetpaper-data
Data files for the WetPaper project
Last synced: 23 Jan 2026
https://github.com/fatihemres/pinch
File reader app with SwiftUI. Using data and models.
Last synced: 17 May 2026
https://github.com/woctezuma/epic-games-js
JavaScript on the Epic Games store.
data datamining egs epic epic-games epic-games-api epic-games-launcher epic-games-store epicgames epicgames-api epicgames-launcher epicgames-store graphql graphql-api javascript webpack
Last synced: 27 Oct 2025
https://github.com/dhanish03/reliance-sales-report-dashboard
This project, Reliance Sales Report Dashboard, showcases a dynamic and interactive Power BI dashboard designed to analyze sales performance. The dashboard provides key insights into various aspects of sales data, including product-wise performance, region-based revenue, and profitability trends.
data datavisualization-project powerbi visualization
Last synced: 23 Jan 2026
https://github.com/leevilaukka/alkometriikka
Tool to search Alko database and see some fun stats about different beverages
data gh-pages svelte typescript xlsx
Last synced: 18 May 2026
https://github.com/OneMoreDavid/python-like-a-boss
This is where I stash my Python study material.
data data-analysis data-engineering data-science data-visualization datascience ipynb ipynb-jupyter-notebook ipynb-notebook numpy pandas python python3
Last synced: 28 Oct 2025
https://github.com/remcostoeten/github-and-vercel-api-showcase-dashboard
Showcase results of possible fetched data from the Github and Vercel API built in all vanilla js.
api-rest da data express-js github-api nodejs vercel-api
Last synced: 07 Mar 2026
https://github.com/robertoostenveld/dccn.dsc_3015055.00_583_v1
The FieldTrip-SimBio Pipeline for EEG Forward Solutions [Data set].
Last synced: 24 Jan 2026
https://github.com/thais81/gamesbox
Another desktop app in JSE/Jswing with hangman game and tic-tac-toe game. This project was made at LDNR school with 4 friends
data database hangman-game jse tictactoe tictactoe-game
Last synced: 28 Jan 2026
https://github.com/andrewl/danelaw
Geopackage containing the boundary of the Danelaw
data geospatial medieval viking
Last synced: 23 Jan 2026
https://github.com/cmdrvl/rvl
rvl reveals the smallest set of numeric changes that explain what actually changed between two datasets — or confidently tells you nothing changed.
cli csv data data-quality data-validation diff finance numerical-analysis open-source ops rust tooling
Last synced: 25 Feb 2026
https://github.com/paezha/bsantiago
A data package with the results of a travel and well-being survey conducted in Santiago in 2016
data equity package r santiago survey travel well-being
Last synced: 18 Mar 2025
https://github.com/nitheshgoutham/sentinel-2-data-processing-for-pichavaram-mangrove-forest-using-cnn
Image Processing using CNN
cnn cnn-classification cnn-keras data deep-learning matplotlib ploty python seaborn-python visualization
Last synced: 29 Jun 2026
https://github.com/udofia2/crudwithdatabase
A simple Nodejs app that connect to a database.
Last synced: 08 Oct 2025
https://github.com/aimin-nur/data-analyst-model-predictive
Sebuah Project data analyst yang bertujuan untuk mengindentifikasi karakteristik customer untuk menerima penawaran campaign marketing.
analyst data mechine-learning visualization
Last synced: 29 Jan 2026
https://github.com/rosacarla/databases
Bases de dados utilizados em atividades práticas do MBA Data Analytics do IGTI.
Last synced: 19 Mar 2026
https://github.com/themost-framework/cache
MOST Web Framework Caching Module
Last synced: 12 Feb 2026