data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-07-03 00:07:49 UTC
- JSON Representation
https://github.com/chompfoods/sdk-jaxrs-cxf
JAXRS-CXF SDK for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.
apache-cxf api branded chomp cxf data database food grocery ingredients java jax-rs nutrition raw recipe-api recipes sdk
Last synced: 30 Apr 2026
https://github.com/lugolbis/data-immo
End-to-end ETL pipeline
data data-engineering dbt dremio duckdb etl-pipeline lakehouse rust
Last synced: 08 Jun 2026
https://github.com/marielachirinosr/nyc-taxi-trip-exploration-2019-2020
Explores passenger behavior & impact of COVID-19 on NYC taxi industry (Q1 2019-2020).
bigquery data data-analysis data-visualization python sql tableau
Last synced: 15 Jun 2026
https://github.com/lut-ful/ibm-capstone-project-stack-overflow-job-survey
IBM Data Analyst professionale certificate program final project.
cognos data data-analytics looker power-bi python sql statics
Last synced: 01 May 2026
https://github.com/shauryauppal/mydatatoolkit
A toolkit for data scientists to get work done faster, easier, and in a smarter way.
analytics awesome-list data data-science hacktoberfest
Last synced: 08 Jun 2026
https://github.com/skygenesisenterprise/aether-meet
Aether Meet is a lightweight, open-source client built for privacy, speed, and seamless integration within the Aether Office ecosystem
applications data docker javascript meeting nextjs notes typescript voip
Last synced: 01 May 2026
https://github.com/bertrand31/one-billion-rows-challenge
🌪️ Pushing Scala to its limits to aggregate a billion rows' worth of data in 2.42 seconds
competitive-programming competitive-programming-contests data data-engineering data-processing performance scala
Last synced: 05 Sep 2025
https://github.com/gabrielf7/relogiohd
:watch: Relógio com Horário e Data
clock css data horario html javascript relogio relogio-hd relogio-javascript watch
Last synced: 01 May 2026
https://github.com/lafkpages/minecraft-crafting-info
Scrapes https://www.minecraftcrafting.info for crafting recipes.
Last synced: 17 Jun 2026
https://github.com/sorairolake/japanese-era-dataset
日本の元号のデータセット / Dataset of the Japanese era
data dataset date japanese-calendar japanese-era json toml wareki yaml
Last synced: 01 May 2026
https://github.com/eshitakundu/disease-outbreak-predictor
Disease Outbreak Predictor: A Streamlit-based web application for predicting diabetes, heart disease, and Parkinson's disease using machine learning models.
data data-science disease-prediction healthcare-application jupyter-notebook machinelearning ml notebook prediction python streamlit streamlit-webapp
Last synced: 01 May 2026
https://github.com/nolanbconaway/rollercoaster-tycoon-data
Every roller coaster I have built in RCT2 for iPad
Last synced: 24 Mar 2025
https://github.com/0xhericles/spamdetector
:email: A Simple Python Spam Detector with Scikit-Learn
data ham machine-learning python sklearn spam
Last synced: 02 May 2026
https://github.com/gcoronelc/ucv_gdi-1_202302-a2
Taller de Gestión de Datos e Información I con Gustavo Coronel.
data data-science database databases machine-learning machinelearning oracle sql sql-server
Last synced: 02 May 2026
https://github.com/mubashirsidiki/certifications_work
his repository contains my work, projects, and solutions from various professional certification programs.
analysis coursera data data-science google ibm john-hopkins machine-learning michigan udemy
Last synced: 01 Jul 2025
https://github.com/mubashirsidiki/olympics-data-enigeering
Worked with Azure Data Factory, Databricks, Data Lake Storage, and Synapse Analytics to build an ETL pipeline for processing and analyzing Olympic Games data from Kaggle.
analytics azure big-data data dataengineering devops pipeline
Last synced: 02 May 2026
https://github.com/vidupriya/aws-glue--data-copy
The function for copying data like CSV, Parquet, avro etc., from a source S3 bucket to a destination S3 bucket using AWS Glue. It includes the necessary setup for the Glue job, logging, reading data from the source bucket, and writing it to the destination bucket
aws awsglue awss3 data data-copying glue glue-job pyspark python3 s3 s3-bucket s3-buckets s3-storage spark
Last synced: 02 May 2026
https://github.com/viniddev/active_finance
Nesse projeto busquei solucionar um problema corriqueiro que é a dificuldade de se manter atualizado sobre as variações do mercado de ações e fundos imobiliários. Usei selenium webdriver para buscar informações e uma API do Telegram para enviar relatórios para o usuário
automation data data-analisis rpa selenium-webdriver telegram-bot
Last synced: 03 May 2026
https://github.com/antoineaugusti/youtubers-tips
Collecting data about tips given to Youtubers
data economy youtube youtubers
Last synced: 03 May 2026
https://github.com/stdlib-js/ndarray-vector-int8
Create a signed 8-bit integer vector (i.e., a one-dimensional ndarray).
constructor ctor data int8 javascript ndarray node node-js nodejs stdlib structure types vec vector
Last synced: 24 Apr 2026
https://github.com/asacxyz/flutter_aplicando_persistencia_de_dados
Para acompanhamento do curso Flutter: aplicando persistência de dados
dart data data-storage flutter persistence persistent-storage sqflite sql sqlite
Last synced: 03 May 2026
https://github.com/jpcadena/palmer-penguins
Palmer Penguins
analytics csv data data-analytics data-science exploratory-data-analysis matplotlib numpy palmer-penguin pandas plotly pylint python seaborn visualization
Last synced: 11 Apr 2026
https://github.com/srking501/uk-groceries-images
Repository Containing UK Groceries Images
data groceries grocery images links playwright playwright-python webscraping-data webscrapper
Last synced: 04 May 2026
https://github.com/wittyicon29/kritika-iit-b-2023
Seletcion task for the summer projects of Kritika IIT-B
data data-analysis data-science
Last synced: 15 Mar 2025
https://github.com/dimitryzub/russo-ukraine-war-prediction-losses
Highlights rusian losses with predictions based on historic data from Ministry Defence of Ukraine 🐱👤
data dataanalysis dataanalytics matplotlib pandas prophet python
Last synced: 04 May 2026
https://github.com/dineshram0212/youtube-analysis
This YouTube Analysis Package provides tools for analyzing YouTube video data, including metrics on views, likes, comments, and engagement trends. Ideal for gaining insights into video performance and audience interaction patterns.
data data-visualization pandas python webscraping youtube-api-v3
Last synced: 19 Jun 2026
https://github.com/rylan12/apscores
A quick way to visualize how the AP score distributions have changed from year to year.
advanced-placement analysis ap-exam data scores
Last synced: 19 Jun 2026
https://github.com/artcc/coredatademo
Demo for CoreDataGenericModule implementation
core coredata coredata-model data encrypted encrypted-data encryption persist
Last synced: 19 Jun 2026
https://github.com/digital-media/cv_data
Datasets used for courses/tutorials at the Digital Media Department
computer-vision data image-processing images
Last synced: 14 Oct 2025
https://github.com/soenneker/soenneker.data.email.disposables
Simply adds a list of compiled disposable/temporary email domains, updated daily (if available)
csharp data disposable disposables domain dotnet email mailinator
Last synced: 29 May 2026
https://github.com/mominurr/fire-gas-leak-detection-system
A real-time fire prevention system integrating IoT sensors and computer vision to trigger evacuations.
ai computer-vision data datascience machine-learning ml python yolo
Last synced: 27 Jan 2026
https://github.com/amethyst-php/notification
amethyst amethyst-package api data laravel notification
Last synced: 15 May 2026
https://github.com/instagram-automations/scrape-data-from-instagram
scrape data from instagram and automation toolkit
api automation bot data doker instagram nodejs playwright procy scrape selenium toolkit
Last synced: 14 Oct 2025
https://github.com/mr-chang95/udacity-starbucks-challenge
Data Science Project for Udacity's Data Scientist Program. Using Python in Jupyter Notebook.
data data-science data-visualization numpy pandas sklearn
Last synced: 14 Apr 2026
https://github.com/contawo/travel-journal
This is a travel journal application for storing all the places that you have visited. I was learning by doing react when creating this project. I learnt a lot with it and upgraded my reactjs skills.
data learning-by-doing props reactjs
Last synced: 05 May 2026
https://github.com/myavuzokumus/simplemodelcomparison
This application allows users to upload datasets, handle missing data, and compare different imputation strategies.
algorithm data data-science machine-learning preprocessing streamlit
Last synced: 21 Jan 2026
https://github.com/chowington/bg-counter-tools
A set of tools that can pull data from Biogents BG-Counter smart mosquito traps and convert them into a Darwin Core compliant format.
bg-counter biogents darwin-core data internet-of-things mosquito-prevalence population-dynamics
Last synced: 10 Oct 2025
https://github.com/fatihilhan42/nba-players-data-1950-to-2021
In this project, the data of the NBA players between the years 1950-2021 were examined. After the NBA players' season, height, performance, averages of points, teams and positions they played were obtained through csv files, important tables and graphs were created using data cleaning and data visualization algorithms.
data data-analysis data-engineering data-science data-visualization
Last synced: 16 Oct 2025
https://github.com/loaiwalid07/automation_data_overviwe
This is Streamlit app that gives an overview for a dataset you upload
automation data data-analysis data-exploration data-science data-transformation data-visualization
Last synced: 19 May 2026
https://github.com/sillyash/untappd-viz
A data visualisation page using public datasets and HTML/CSS/JS with D3.js.
beer beer-statistics data data-analysis data-visualization kaggle kaggle-dataset public-dataset school-project
Last synced: 18 May 2026
https://github.com/ronknight/user-data-dashboard
📈 A data visualization tool for analyzing user data using an Excel-based data source.
dashboard data excel ga4 screenshot
Last synced: 17 Oct 2025
https://github.com/enoch208/eventmaster
A user-friendly application that helps you easily record and play back your keyboard and mouse actions. With its modern design using `tkinter` and `ttkthemes`, it provides a smooth and easy-to-use interface. The app combines reliable technical features to give you a great experience.
automation data key keylogging-python replay spy tools
Last synced: 01 Jun 2026
https://github.com/kaijagahm/2023-10-20-stlzoo
Data Carpentry workshop, hosted at the St. Louis Zoo. Beta testing the new ecology data lesson.
data data-science ecology r rstudio
Last synced: 05 Feb 2026
https://github.com/psgebeline/harvard-data-science
My work for the nine courses in Harvard's data science program, each with notes/assignments. Work in progress.
data linear-regression machine-learning modeling probability-theory r visualization wrangling
Last synced: 19 Oct 2025
https://github.com/svetlanam/kbl-to-csv-s3
Keboola extractor, that converts excel to CSV based on input mapping criteria and upload to S3 bucket
data data-cleaning data-transformation etl keboola s3-bucket
Last synced: 20 Jun 2026
https://github.com/coderixc/rforai
Learn R Programming Language for Statistics & Data Science
artificial-neural-networks data data-science deep-neural-networks machine-learning probability quant-analyst r science
Last synced: 09 Oct 2025
https://github.com/erencelik/binance-public-data-node
Nodejs downloader and unzipper script for Binance Public Data
binance data downloader nodejs public script
Last synced: 15 May 2026
https://github.com/djdhairya/whatsapp-chat-analysis
WhatsApp chat analysis is a multidimensional process that delves into the content, structure, and dynamics of conversations within the platform. It provides valuable insights for personal reflection, organizational decision-making, and improving communication strategies.
data data-science dataanalytics datapreprocessing machine-learning ml
Last synced: 08 Oct 2025
https://github.com/rahul1582/bank-loan-classification
Classifying whether a person is taking personal loan or not using all the Classification Algorithms.
algorithm analysis classi data
Last synced: 08 Oct 2025
https://github.com/cemc-oper/nmc-typhoon-db-client
A CLI client for NMC Typhoon Database.
Last synced: 01 Jun 2026
https://github.com/danieljdufour/fast-b64
Quickly Convert between B64 and Binary Strings
b64 base64 base64-decoding base64-encoding binary bits compression data
Last synced: 08 Oct 2025
https://github.com/muthupillai1204/diwali_sales_analysis
The Diwali sales analysis reviews past data to identify trends, peak buying times, popular products, and customer demographics. It assesses sales volume, revenue growth, and promotional effectiveness, helping businesses optimize marketing and inventory for future seasons.
data datacleaning eda excel jupyter-notebook matlplotlib numpy pandas python seaborn visualization
Last synced: 05 May 2026
https://github.com/eharshit/end-to-end-vendor-insights
End-to-end analysis of vendor performance for wholesale/retail businesses, featuring data ingestion, cleaning, insights, and interactive Power BI dashboards.
analysis analysis-algorithms analytics dashboard data data-analysis datascience jupyter jupyter-notebook pandas powerbi powerbi-report retail wholesale
Last synced: 07 Oct 2025
https://github.com/mito-ds/mitosheet_helper_config
The mitosheet_helper_config package used by enterprises to configure the mitosheet package.
data data-analytics data-science data-visualization jupyter pandas python
Last synced: 05 May 2026
https://github.com/rysteq/abstract-data-structures
This repository contains two programs written in C about the stack and queue ADT's
abstract-data-structures c data queue stack
Last synced: 06 Oct 2025
https://github.com/tsbarr/belly-button-challenge
Using front-end development tools (javascript, html and css) I built an interactive dashboard to explore the Belly Button Biodiversity dataset, which catalogs the microbes that colonize human navels.
data data-visualization javascript
Last synced: 04 Mar 2026
https://github.com/robertoostenveld/dcn.dsc_62002071_01_114_v1
Simon task M/EEG data [Data set].
Last synced: 23 Jan 2026
https://github.com/flyconnectome/hnf
Documentation for the hierarchical neuron format
annotations data dotprops hdf5 mesh neurons skeleton storage
Last synced: 17 Jan 2026
https://github.com/gabrieldim/complete-analysis-covid-19
Analysis of the Covid 19.
analysis covid-19 covid19 data data-science science virus
Last synced: 23 Jan 2026
https://github.com/pathilink/ebury_case
Technical case study in Analytics Engineering using BigQuery, focusing on dimensional modeling and SQL queries for payment and client analysis.
Last synced: 05 Oct 2025
https://github.com/donmaruko/python-eda-toolkit
CLI-runned EDA with 30 commands utilizing text-related functions, statistical calculations, data visualization, and data manipulation.
data data-analysis data-science data-visualization matplotlib pandas scipy seaborn statistical-analysis statistics wordcloud
Last synced: 06 May 2026
https://github.com/jigyasag18/bird-strikes-in-aviation-project
This project analyzes over a decade of U.S. bird strike data (2000–2011) to evaluate safety risks, damage trends, and cost implications in aviation. Using PostgreSQL for database management and Power BI for dashboard visualization, it uncovers critical insights into when, where, and how wildlife impacts aircraft. Key findings inform strategically.
bird-strike-prevention bird-strike-prevention-in-real-airport data data-analysis data-analysis-project data-visualisation data-visualization data-visualization-project data-visualizations database dataset dax-query postgresql postgresql-database powerbi powerbi-desktop powerbi-report powerbi-visuals sql sql-database
Last synced: 09 May 2026
https://github.com/lukakerr/us-surnames
US Surname data visualisation using R. Displays top 25 US surnames and race/ethnic percentage per name.
Last synced: 05 Oct 2025
https://github.com/abhibisht89/data-visualization
data matplotlib pandas ploty python visualization
Last synced: 06 May 2026
https://github.com/mattjesc/ddo-semiconductor
Data-Driven Optimization of Semiconductor Processes and Forecasting
ai artificial-intelligence data data-science data-visualization deep-learning keras machine-learning manufacturing ml prophet python pytorch semiconductor semiconductor-manufacturing semiconductors tensorflow
Last synced: 23 Feb 2026
https://github.com/parthds02/analyzing-student-success-with-data
Discover key factors influencing student performance through data analysis and visualization. Explore gender, parental education, sports, and ethnicity impacts.
data datascience jupyter-notebook kaggle python pythonlibraries
Last synced: 06 May 2026
https://github.com/zevio/acl
ACL Anthology corpus sample
data dataset scholarly-articles
Last synced: 01 Mar 2026
https://github.com/brianlesko/r_data_science_stat5730
Written by Brian Lesko, the repository contains R Scripts demonstrating data science topics largely originating from study at Ohio State. Contents are written in R studio using the R markdown file. As of 1/21/23 Future projects concerning data science, statistics, and machine learning will be in python in my machine learning Repository
data data-analysis flight-data ggplot2 olympics-data r-markdown tidyverse
Last synced: 23 Jan 2026
https://github.com/harmanveer-2546/reducing-data-entries
Way to delete data entries from csv/excel file using. For excel file, use excel instead of csv in the code.
csv data data-entry delete-data excel numpy pandas python
Last synced: 05 May 2026
https://github.com/miniql/miniql-inline
A MiniQL query resolver for inline data.
Last synced: 27 May 2026
https://github.com/deliprofesor/health-score-prediction-model-the-impact-of-lifestyle-and-demographic-factors
A machine learning project predicting health scores based on lifestyle and demographic factors like age, BMI, diet, and exercise. Techniques include Random Forest, Polynomial Regression, and Linear Regression, with a focus on model performance and actionable health insights.
cross-validation data data-science data-visualization feature-engineering linear-regression machine-learning polynomial-regression random-forest
Last synced: 10 Apr 2025
https://github.com/moscatellimarco/webscrap-tinydeal
"WebScrap-TinyDeal" is a Scrapy-powered 🕷️ tool for harvesting product information 🏷️ from TinyDeal. It outputs structured CSV data 📁, ready for analysis. Explore the scripts 👨💻 for an interactive scraping adventure or leverage the data for competitive pricing strategies 📈.
css data datascience html pandas python scrapy web webscraper webscraping
Last synced: 14 Apr 2026
https://github.com/ztgx/muvera
MUVERA: Making multi-vector retrieval as fast as single-vector search
algorithms data google muvera retrieval rust search structure vector
Last synced: 25 Oct 2025
https://github.com/byndyusoft/byndyusoft.data.relational
Relational abstractions for Byndyusoft.Data.Relational.
byndyusoft data dataaccess db relational-databases
Last synced: 25 Oct 2025
https://github.com/romaintailhurat/dagster-playground
Playing with Dagster 🐙
Last synced: 14 Jun 2025
https://github.com/shef4793/hackerrank-sql-challenges-solutions
The solutions of all SQL challenges on HackerRank executed on either MySQL or MS SQL environment.
data data-engineering hackerrank hackerrank-challenges hackerrank-solutions mssql mssql-server mysql problem-solving solutions sql sql-challenges sql-query
Last synced: 11 Mar 2026
https://github.com/merekat/hb-oil-assets
Eine Analyse der Assetentwicklung im Zusammenhang mit schockartigen Anstiegen des Ölpreises seit des Markteintritts von Brent-Öl in 1986.
analyze asset data datajournalism oil python
Last synced: 16 Mar 2026
https://github.com/nmelgar/healthy_child_dataviz
Data visualization project to analyze what a healthy child is.
analysis data data-analysis data-science data-visualization dataviz research tableau visualization
Last synced: 23 Feb 2026
https://github.com/tomquirk/sunshine-coast-council-rates-data
Rates data for the Sunshine Coast, Australia
australia data property rates real-estate
Last synced: 24 Feb 2026