data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-06-30 00:07:50 UTC
- JSON Representation
https://github.com/raphaellaude/usaschooldata
Cleaned and accessible school enrollment data for US schools
data duckdb duckdb-wasm education object-storage oss wasm
Last synced: 12 May 2026
https://github.com/ashu3291/blinkit-app-store-
conducted a comprehensive analysis of Blinkit's sales performance, customer satisfaction and inventory distribution to improve the sales performance.
cleaning-data data dataanalysis-projects powerbi-visuals powerbidashboard sql
Last synced: 05 Jan 2026
https://github.com/writetome51/page-load-access
A TypeScript/Javascript class that loads a batch (array) of data from a larger set too big to be loaded all at once.
batch class data javascript load loader typescript
Last synced: 16 May 2026
https://github.com/fiddlydigital/anonimizer
A lib to replace and rehydrate sensitive data in text
anonimize anonymize data data-security prompt sanitize string string-manipulation text
Last synced: 15 Mar 2025
https://github.com/mnkanout/patients_medication_prediction
The aim of the project is to create a model that can help medical professionals select the proper medication for patients based on their symptoms. The model uses historical data of other patients to predict what could be the most suitable medication based on the patient's symptoms.
data data-analysis data-science data-visualization decision-tree-classifier machine-learning python3
Last synced: 29 Jun 2025
https://github.com/charon25/weatherdata
17 000 weather measurements collected by a weather station created for a college project.
csv data dataset datasets json measurements strasbourg weather weather-data
Last synced: 16 Jan 2026
https://github.com/hit07/fitgpt-hacksc
AI-Powered Fitness Coach; 🥈 Runner up at HackSC's SoCal Tech Week hackathon
data elasticsearch gpt-4o-mini llm pipeline
Last synced: 28 Feb 2025
https://github.com/ppmim/papi4k_old2
PAPI: the PANIC data reduction pipeline
data near-infrared pipeline processing
Last synced: 23 Jun 2025
https://github.com/taquece/goals-per-match
basic script to calculate average football goals per match from .CSV
beginner csv data football nodejs python sports-analytics
Last synced: 09 May 2026
https://github.com/haideratgh/sql-data-analytics-project
This repository contains a collection of SQL scripts demonstrating various analytical techniques, such as changes over time, cumulative, performance, data segmentation, part-to-whole analysis
analytics business-analytics business-intelligence data data-analysis data-analyst data-analytics data-engineering data-science data-scientist database datascience query reporting sql sql-query sql-server window-functions-in-sql
Last synced: 29 Jun 2025
https://github.com/equinor/fmu-sumo
Interaction with Sumo in the FMU context
analytics data fmu python subsurface sumo visualization
Last synced: 01 May 2025
https://github.com/tazeenrashid/orders-analysis-using-python-sql-server-and-tableau
I sourced some Orders data through Kaggle; did EDA using Python and then fetched some insights out of cleaned data using SQL Server (SSMS). Then, I built a Tableau Dashboard for some visual insights. Have a look and share your feedback!
analytics data eda jupyter-notebook python sql tableau
Last synced: 29 Apr 2026
https://github.com/cmda-tt/course-25-26
🎓 tech track · 2025-2026 · curriculum and syllabus 📊
d3 data datavis functional javascript programming research svelte visualization
Last synced: 20 Jan 2026
https://github.com/vuurvos1/functional-programming
HVA functional-programming
data formatting functional nodejs programming
Last synced: 03 Oct 2025
https://github.com/team810/frcs
FRCS is an online international crowd sources data collection software written for the FRC Competitions. It was created by team 810, The Mechanical Bulls.
Last synced: 14 Mar 2025
https://github.com/tupizz/python-data-manipulation
Data manipulation and visualization with Python 2.x
Last synced: 09 May 2026
https://github.com/avestura/shell-dads
❓ Show a random tip from NIST DADS (https://xlinux.nist.gov/dads) every time you open your terminal
algorithms dads data data-structures ds nist
Last synced: 23 Oct 2025
https://github.com/dhimmel/adeptus
ADEPTUS -- differential gene expression signatures of disease
adeptus data differential-expression disease gene-expression genes rephetio
Last synced: 05 Jan 2026
https://github.com/nel-zi/nuga_bank
Developed an automated data exploration and cleaning pipeline for Nuga Bank to streamline data preparation, ensure consistent data quality, and normalize datasets into structured databases for efficient analysis and reporting.
data data-automation data-visualization datacleaning datatransformation etl-automation etl-pipeline
Last synced: 16 May 2025
https://github.com/asma-hachaichi/imdb-movies-rating-prediction
This project collects movies information from IMDb using web scraping, then uses this data to guess movie ratings. It combines the skills of gathering data from the internet to predict how well movies are liked.
beautifulsoup4 data data-science machine-learning movies movies-reviews prediction python scraping
Last synced: 31 Mar 2025
https://github.com/nevoland/unchangeable
🧊 Tools for immutable values.
data datastructure functional immutable persistent pure stateless
Last synced: 24 Jul 2025
https://github.com/miniql/miniql-inline
A MiniQL query resolver for inline data.
Last synced: 27 May 2026
https://github.com/rubyonworld/ruby-cldr
The Unicode Consortium's Common Locale Data Repository (CLDR) contains tons of high-quality locale data such as formatting rules for dates, times, numbers, currencies as well as language, country, calendar-specific names etc.
common data locale repository ruby unicode
Last synced: 12 Nov 2025
https://github.com/carlosrs14/parallel-data-preprocessig-system
A parallel data preprocessing system using threads and synchronization mechanisms (barrier, busy-waiting, condition variables) to clean and prepare data for AI training.
barrier-method c condition-variable data operative-systems parallel-computing posix preprocessing synchronization threads
Last synced: 24 Jul 2025
https://github.com/foufou-exe/finegourmet
Project Business Intelligence Architecture
business-intelligence data docker docker-compose etl mysql powerbi pyspark python workflow
Last synced: 11 Apr 2026
https://github.com/82luli02/sakila_dvd_rental_database_analysis
Analysis of the Sakila DVD Rental database using SQL
data data-analysis data-science data-visualization sql
Last synced: 10 Mar 2026
https://github.com/wittyicon29/kritika-iit-b-2023
Seletcion task for the summer projects of Kritika IIT-B
data data-analysis data-science
Last synced: 15 Mar 2025
https://github.com/koppalexander/flightdelaychallenge
This project focuses on predicting flight delays using historical data from a Tunisian airline. We analyzed patterns in airport operations and flight schedules to build a machine learning model that can forecast potential delays.
data data-science machine-learning machine-learning-algorithms machinelearning prediction predictive-modeling
Last synced: 19 Jun 2026
https://github.com/gman-au/white-knight-neo4j
Neo4j implementation of White Knight data abstraction library
abstractions data datastore dotnet neo4j repository-pattern specification-pattern
Last synced: 20 Jan 2026
https://github.com/bilgehangecici/datatypeconverter
Converting integer and floating numbers to appropriate bit-level representation.
data datatypeconverter java machine-level variables
Last synced: 30 Mar 2025
https://github.com/olekscode/datageneration
Exploring the methods of data generation for different Machine Learning algorithms
data javascript machine-learning
Last synced: 05 Apr 2025
https://github.com/mateuszskoczek/generatorcsv
GeneratorCSV is a students and teachers data converter for Microsoft 365 Admin Center. The project was implemented for Sobolew High School.
admin converter data microsoft365 python school tkinter
Last synced: 26 Aug 2025
https://github.com/0xnu/data-analyst-training
The repository contains training materials for data analysts.
data data-analysis data-analyst
Last synced: 25 Aug 2025
https://github.com/ssiarhei115/shop-customers-segmentation
Shop customers segmentation
data data-analysis data-science data-visualization
Last synced: 24 Aug 2025
https://github.com/anuragagarwal96/hospital-mortality-rate-sql-analysis
In this project, I have taken a hospital dataset from Kaggle, analysed it and predicted the mortality rate of patients who have been admitted in hospitals. I have utilised a combination of SQL, Tableau and Microsoft Excel for this project.
data data-visualization dataanalysis dataanalysisusingsql excel msexcel mssqlserver sql tableau tableau-public
Last synced: 09 Mar 2026
https://github.com/karolkrupa/javascript-orm-mapper
ORM mapping library. Especially for Rest API
api data data-mapper entity es6 javascript mapper model mongo mysql node nuxt orm relational rest typescript vue vuex
Last synced: 10 Apr 2026
https://github.com/parablelab/parable
Work in progress...
data data-management data-platform data-validation database pipelines
Last synced: 28 May 2026
https://github.com/plateformeio/docs
The official documentation of the Plateforme framework
api app asgi async data db docs fastapi plateforme pydantic python restx services sqlalchemy
Last synced: 11 Apr 2026
https://github.com/rubyonworld/ldpath
This is a ruby implementation of LDPath, a language for selecting values linked data resources.
Last synced: 12 Nov 2025
https://github.com/rachelresende/projeto-finan-as
Este repositório é referente a um curso de análise de dados para finanças que realizei em 2025 na Udemy.
analytics data financas finance finance-management
Last synced: 19 Aug 2025
https://github.com/h4fide/politicalcompassbot
This Python project allows you to take a quiz and find out where you fit on the political compass. Give it a try and see where you stand!
bot data greedy-algorithms politics python python3 sql telegram
Last synced: 19 Aug 2025
https://github.com/KarajMiglani-DataScientist/karajmiglaniFAKE-NEWS-DETECTION
FAKE_NEWS_PREDICTION
algorithms data data-science flask machine-learning probability-statistics python statistics structure
Last synced: 19 Aug 2025
https://github.com/wittyicon29/zeotap-ds-assignment
Internship application assignment
Last synced: 19 Aug 2025
https://github.com/progati00/marketing-mix-modeling-mmm-for-marketing-budget-optimization
A Marketing Mix Modeling (MMM) project using Python to analyze channel performance, calculate ROI, and simulate marketing budget changes for better business decisions. Includes a trained Linear Regression model, ROI analytics, and a Flask API for revenue prediction.
api budget-optimization data data-analysis data-science ecommerce eda flask jupyter-notebook linear-regression machine-learning marketing-analytics marketing-mix-modeling python roi-analysis vscode
Last synced: 14 Apr 2026
https://github.com/docuvesta/shiseido_skincare_usa_fr_infographics
Découvrir les indicateurs de performance liés aux avis d'un sérum très réputé de la marque de beauté luxe japonaise Shiseido. Cette comparaison concerne les sites web USA et FR 💯
analysis automatisation data datanalysis graphique infographie pandas plotly python skincare soins
Last synced: 11 Apr 2026
https://github.com/moeabbas6/bq_data_loader
A Python script for executing and logging batch SQL commands in Google BigQuery. Includes tracking of execution times, unique job and statement IDs, and automated logging to a specified BigQuery table.
Last synced: 24 Mar 2025
https://github.com/murshidazher/client-side-data-storage
🚌 A workspace containing client-side data storage implementations
cache cache-storage client-side data indexeddb localstorage sessionstorage storage websql
Last synced: 02 Sep 2025
https://github.com/sirmaxx/log_manager
log manager services for microservices
data fastapi logging microservice mongodb
Last synced: 09 Apr 2026
https://github.com/amethyst-php/owner
amethyst amethyst-package api data laravel owner
Last synced: 28 Apr 2026
https://github.com/pbinkley/mfmcollections
Project to distill data about published collections of microfilms from library lists
Last synced: 28 May 2026
https://github.com/jameshenderson12/chatbot-utils
Generic data and elements that can be reused or repurposed for chatbot development.
boilerplate chatbot data development elements intents template utterances
Last synced: 04 Mar 2026
https://github.com/master-helix/ibm-data-analyst-certification-stock-analysis-project
This is a mini project repository of my IBM Certification involving stock analysis and plotting of Tesla and GameStop
analytics data data-analysis data-visualization ibm matplotlib pandas python web-scraping
Last synced: 09 May 2026
https://github.com/seqeralabs/ffq-api
A minimal wrapper to make ffq searches available via a REST API.
api data fastq fetch-fastq ffq genomics
Last synced: 15 Aug 2025
https://github.com/mtwn105/phonepe-pulse-plus
An API on top of PhonePe Pulse Data APIs
cors data data-science express finance hacktoberfest heroku javascript nodejs phonepe pulse
Last synced: 09 Apr 2026
https://github.com/smaug6739/data-bit
This project is a module for converting a structured dataset into a number that can be stored in a database taking up little space.
Last synced: 14 May 2026
https://github.com/ddofer/ddofer.github.io
Dan's Blog
blog cv data data-science machine-learning
Last synced: 12 Aug 2025
https://github.com/ailixter/gears-dictionary
The project, which Gears Dictionary
arrays data dictionaries dictionary php struct utilities
Last synced: 19 Jul 2025
https://github.com/amethyst-php/cycle
amethyst amethyst-package api cycle data laravel
Last synced: 17 May 2026
https://github.com/ffatahillah7/snowflake-data-governance-warehouses
Welcome to the Powered by Tasty Bytes - Zero to Snowflake Quickstart focused on Data Governance! Within this Quickstart we will learn about Snowflake Roles, Role Based Access Control and deploy both Column and Row Level Security that can scale with your business.
data data-governance snowflake
Last synced: 06 Jan 2026
https://github.com/xp-forge/marshalling
Marshalling
data marshalling object-mapping xp-framework
Last synced: 02 Jul 2025
https://github.com/45harry/potato_disease_classification
Potato Disease Classification - Traning, Rest Api and FrontEnd to Test
cnn-classification data data-science datapreprocessing deep-learning fastapi flaskapi frontend keras restapi tensorflow
Last synced: 12 Apr 2026
https://github.com/andrii04/andreamonforte-bi-assignment
Automated Data Pipeline that ingests daily GA4-formatted CSV files from a private Google Cloud Storage bucket, validates and loads them into BigQuery, and prepares analysis-ready views. The solution is built for deployment as a Cloud Function triggered by Cloud Scheduler and uses Python with the Google Cloud Storage and BigQuery client libraries.
automation bigquery cloud cloudfunctions data data-analysis data-engineering etl etlpipeline gcp google googlecloudplatform pipeline python sql
Last synced: 09 Nov 2025
https://github.com/doppelgunner/baby
A program for storing data just for fun
data doppelgunner java note storing
Last synced: 12 Jun 2026
https://github.com/ashita-ai/ashita-ai.github.io
Ashita AI - The island of misfit data tools
Last synced: 19 Feb 2026
https://github.com/ragibasif/bobdylan
Bob Dylan
bob-dylan csv data data-science data-visualization lyrics music python
Last synced: 03 Sep 2025
https://github.com/xiaomingx/10000-public-apis-and-data
Public APIs are interfaces that allow developers to access various services, features, or data from external systems or platforms.
api-ecosystem api-integration data developer-friendly-apis open-api-access public-api-tools third-party-services
Last synced: 30 Jul 2025
https://github.com/petzi53/repair
R Datasets of the Open Repair Alliance (ORA).
Last synced: 19 May 2026
https://github.com/srindot/fwuav-average-flight-data-collection
This repository is designed for collecting average data for a flapping wing UAV. The script acg_coeff_data_collection.py runs the necessary data collection, and the resulting data is saved into a CSV file called AverageFlightData.csv.
Last synced: 10 Aug 2025
https://github.com/sauravsrivastav/githubreposearcher
GitHub Repo Searcher 🔍 is a Streamlit web application designed to help you search for GitHub repositories based on a query and view the results in a tabular format. You can also download the results in CSV or Excel format for further analysis. 📊📈
data data-export excel github-api python repository-searcher streamlit webapp
Last synced: 20 Jan 2026
https://github.com/mumtaz4118/nlp-course
Programming Assignments and Lectures for Stanford's CS 224: Natural Language Processing with Deep Learning
course data data-analysis data-analytics data-science data-visualization deep-learning education machine-learning natural-language-processing neural-network transfer-learning
Last synced: 24 Nov 2025
https://github.com/shahsuvarli/election-voters-data-analysis-pandas
Educational project analyzing Azerbaijan voter demographics with pandas, focusing on data cleaning, grouping, and visualization.
cleaning data grouping matplotlib numpy pandas python visualization
Last synced: 12 Apr 2026
https://github.com/chubek/pyramid-dashboard
A Dashboard to Show Data Made Using Plotly Dash
dash data docker ml plotly plotly-dash python
Last synced: 19 May 2026
https://github.com/infinitode/pyautoplot
PyAutoPlot is an open-source Python library designed to make dataset analysis much easier by generating helpful detailed plots using matplotlib. It automatically generates appropriate plots based on the dataset you feed it.
analysis automatic csv data dataset dataset-analysis generation matplotlib pandas plots plotting-in-python plotting-library python
Last synced: 16 Mar 2025
https://github.com/beriberikix/senml-zephyr
A codec for encoding and decoding Sensor Measurement Lists (SenML) for Zephyr
codec data iot senml sensor zephyr-rtos
Last synced: 24 Mar 2025
https://github.com/chompfoods/sdk-java
Java SDK for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.
api branded chomp data database food gradle grocery ingredients jar java java-sdk nutrition openapi raw recipe-api recipes sdk
Last synced: 09 Apr 2026
https://github.com/mchenryspagg/wrangle-and-analyze-data
This project which is known as 'wrangle and analyze data' involves the wrangling of WeRateDogs twitter archive data from the period of 2015 to 2017
api data dataanalysis datacollection datawrangling datetime json numpy os pandas pil python requests tweepy-api visualization
Last synced: 09 Apr 2026
https://github.com/sourceduty/text_file_metadata
📄 Extract metadata from .txt files and record the metadata in .txt files.
data datascience metadata metafile practice sourceduty
Last synced: 08 Aug 2025
https://github.com/darkogamerz/dhis2heat
A Comprehensive data management and Health Equity Assessment and Analysis platform that fetches data from DHIS2, optimize, calculate, clean and visualize inequality data.
analytics data data-science dhis2 equality equity health heat inequality r shiny shinydashboard visualization
Last synced: 01 Apr 2025
https://github.com/ludwing-mj/manipulacion_ej
Ejercicio utilizado en la seccion numero ocho del manual para ejemplificar las herramientas proporcionadas por el tydyverse para la manipulacion de datos.
data manipulate-data package r
Last synced: 01 Apr 2025
https://github.com/h-sutiwas/r2de-2025
This repository is related to the Road To Data Engineer Bootcamp by DataTH. It contains all related coursework, some mini projects and other resources within the field of Data Engineering.
data data-engineering data-visualization docker gcp pipeline spark
Last synced: 30 Apr 2026
https://github.com/e22m4u/ts-data-schema
Валидация данных и приведение типов для TypeScript
data schema typescript validation
Last synced: 05 Aug 2025
https://github.com/haimonmon/j3mify
Convert your jejemon word into a formal sentence or word
data jejemon nlp normalization python regex tagalog tokenization
Last synced: 12 Oct 2025
https://github.com/johnelliott/wb-web
Moved —> https://github.com/johnelliott/waybot
arduino browser data iot raspberry-pi web
Last synced: 12 Apr 2026
https://github.com/shubhamsoni98/project_using_knn
This project applies the K-Nearest Neighbors (KNN) algorithm to predict iPhone purchases based on customer data. Using features like age, salary, and previous purchase behavior, the KNN model classifies customers into buyers and non-buyers.
anaconda analytics data data-science eda knn knn-classification machine-learning-algorithms predict project python scikit-learn tableau
Last synced: 03 Jan 2026
https://github.com/alextanhongpin/node-github-api
:page_with_curl: sample github api queries with nodejs for scraping purposes
Last synced: 06 May 2026
https://github.com/baranasoftware/curricular-api
The design and implementation of a REST API for student and course data for a Higher Ed institution.
aws data data-pipeline go golang lambda rest rest-api sqlite3 system-design terraform
Last synced: 09 May 2026
https://github.com/zulfachafidz/telco_churn_insight_customer_loss_prediction_with_random_forest_and_decision_tree-algorithms
The main problem in the business world is customer churn, or losing customers, especially in the telecommunications industry, which experiences very tight competition. To overcome this problem, an analysis was carried out to help the company understand how many customers have the potential to switch providers.
data data-science data-visualization dataanalysis dataanalyst dataanalytics datadrivenwithdataprovider decision-tree decision-tree-classifier decision-trees random-forest random-forest-classifier
Last synced: 01 May 2026