data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum
- Last updated: 2026-07-03 00:07:49 UTC
https://github.com/julienmalka/shiftgenerator
ShiftGenerator WeSki 2018
data data-science latex python
Last synced: 06 May 2026
https://github.com/domarps/grad-project-reports
Write-ups of a few key semester-long projects I worked on during my Master's
circuit data deeplearning graph-algorithms matlab question-answering
Last synced: 26 Mar 2025
https://github.com/chompfoods/stub-jaxrs-jersey
JAX-RS Jersey server stub for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.
api branded chomp data database food grocery ingredients jax-rs jersey nutrition raw recipe-api recipes server server-stub stub stub-server
Last synced: 02 May 2026
https://github.com/abhibisht89/data-visualization
data matplotlib pandas ploty python visualization
Last synced: 06 May 2026
https://github.com/jigyasag18/credit-card-fraud-detection-using-machine-learning
This repository presents a credit card fraud detection system utilizing a Logistic Regression model trained on a dataset of 284,807 transactions with significant class imbalance. After employing under-sampling for balance, the model achieves a test accuracy of around 93.40%, showcasing the effectiveness of ML in identifying fraudulent transactions.
credit-card-fraud creditcardfrauddetection data dataset logistic-regression logisticregression machine-learning machine-learning-algorithms mlproject mlprojects
Last synced: 02 Sep 2025
https://github.com/gagolews/clustering-data-v0
Datasets for Clustering [DEPRECATED – A NEW VERSION IS AVAILABLE]
clustering data dataset machine-learning
Last synced: 15 Sep 2025
https://github.com/ntnn/dataparse
Parsing, transforming and unmarshalling data.
data data-parser data-parsing data-transformation golang golang-lib
Last synced: 30 Jun 2026
https://github.com/ressuman/csv-writer-project
CSV Writer with TypeScript. This project demonstrates my implementation of a CSV writer using plain TypeScript and JavaScript, without relying on any frameworks.
Last synced: 15 May 2026
https://github.com/amazenmb/web-scraping
Web Scraping Methods using Python
analytics beautifulsoup data lxml pyautogui-automation python scheduling schedulingscraping selenium webdriver webscraping xpath
Last synced: 06 May 2026
https://github.com/reshmaaiman/liver-patient-prediction
Liver Disease Prediction
data data-science data-visualization dataanalysis jupyter-notebook numpy pandas python seaborn
Last synced: 16 Apr 2026
https://github.com/badranalyst/covid-deaths-dashboard-with-tableau
This project showcases an interactive dashboard developed in Tableau to visualize COVID-19 deaths data. It provides insights into trends, geographical distributions, and key metrics related to mortality during the pandemic. The dashboard aims to enhance understanding of the data, supporting public health analysis and decision-making.
covid-19 dashboard data data-analysis data-visualization dataset tableau tableau-dashboards visualization
Last synced: 02 Mar 2026
https://github.com/ekoepplin/dbt-bigquery-core
How to get data into BigQuery (or DuckDB) and set up dbt tests for SODA cloud monitoring
bigquery data data-quality dbt dlt duckdb gcp soda
Last synced: 06 May 2026
https://github.com/j2kun/terrorism-usa-post-9-11
A copy of the terror data published by NewAmerica
data politics terrorism transparency
Last synced: 02 Mar 2026
https://github.com/dms-codes/www.usu.ac.ididdirektori
Faculty and Docent Data Retrieval Script. The faculty_and_docent_data_retrieval.py script is a Python script for retrieving faculty and docent data from a university website using Selenium. It includes functions to extract faculty names and docent profiles, as well as a multithreading approach to fetch data for multiple faculty-docent pairs.
Last synced: 26 May 2026
https://github.com/kenjyco/mongo-helper
Helper funcs and tools for working with MongoDB
aggregation-pipeline data database kenjyco mongo mongodb python
Last synced: 28 Jan 2026
https://github.com/omari-kd/environmental-impact-on-food-production
The goal of this project is to assess the environmental impact of food production at both macro and micro levels and propose data-driven insights to mitigate the negative effects of food production on the environment.
data data-analysis data-science data-visualization environmental-impact-analysis r
Last synced: 30 Mar 2025
https://github.com/hupili/djworkshop-cuc2018
data data-journalism data-visualization
Last synced: 27 Mar 2026
https://github.com/fabsdevx/file-format-converter-handout
Data Engineering project for learning purposes. Credits to itversity
csv csv-import data data-engineering database pandas python
Last synced: 06 May 2026
https://github.com/coderjolly/spotify-api-data-analysis
The project leverages Apache Airflow for automating Spotify API data analysis, focusing on user activity. Extracting, transforming, and loading data efficiently, it provides insights via PowerBI dashboards.
airflow airflow-dags data data-engineering etl etl-pipeline microsoft-sql-server power-bi python scripting sql
Last synced: 27 Mar 2026
https://github.com/dakostu/grabbag.h
A data structure for non-deterministic element selection in C++11
cpluscplus cpp cpp-component cpp-library cpp11 data data-structure data-structures generics non-deterministic random randomization template
Last synced: 19 Oct 2025
https://github.com/nagar2nd/financial-analysis-power-bi
This project analyzes financial and credit card usage data using Power BI and DAX, focusing on customer behavior, credit risk, and financial performance. It includes insights on spending trends, delinquency rates, churn indicators, and satisfaction scores to drive better financial management and customer retention strategies.
analysis data dax dax-functions dax-query excel powerbi
Last synced: 03 Mar 2026
https://github.com/inzhenerka/scooters_data_generator
Generate data of scooter trips for analysis
Last synced: 02 Jun 2026
https://github.com/nikashj/pizza-sales-dashboard-analysis
Pizza sales analysis using Power BI
data data-analysis data-visualization dax-expression excel powerbi
Last synced: 06 Apr 2026
https://github.com/metapsy-project/data-depression-anxiety-transdiagnostic
Database of transdiagnostic treatment of depression and anxiety
Last synced: 01 Apr 2026
https://github.com/dakshdeephere/bank_eda-practice
EDA analysis of Bank.csv dataset
analysis data data-visualization dataanalysis matplotlib numpy pandas python3 seaborn
Last synced: 07 May 2026
https://github.com/ashamethedestroyer/data-structures
Dedication of all Data Structures Creation 🛠
cpp data data-structures implementation implementation-of-data-structures structure structured-data
Last synced: 23 May 2026
https://github.com/ngupta23/data_prep_helper
A helper package for preparing and combining data from a variety of sources
data data-science dataprep datapreparation dataprocessing helpers python
Last synced: 03 Apr 2025
https://github.com/jillmpla/kaggle_notebooks
Kaggle-based data analysis, data science, and data visualization.
data data-science data-visualization kaggle machine-learning
Last synced: 16 Apr 2026
https://github.com/agdturner/ccg-data
A modularised Java library for processing data sets with classes for: data records; collections of data records; and identifiers.
Last synced: 12 Jan 2026
https://github.com/shubhamsoni98/analysis-with-sql
This project focuses on creating and managing a database for a music record company to perform various analyses on bands, albums, and songs. Using SQL, the goal is to create a structured relational database with relevant tables, insert necessary data, and perform queries that provide insights into the relationships between bands, albums, and songs.
analys analysis data data-science database dbms mysql mysqlworkbench project query schema sql
Last synced: 03 Jan 2026
https://github.com/bagustris/dataits
Web for DataITS17: Summer School on Data Science
Last synced: 28 Jun 2025
https://github.com/passly-nl/data
Source code of the data layer.
data passly ticketing typescript
Last synced: 27 May 2026
https://github.com/edjoukou/human_resources
A data analysis project using MySQL Server database
analysis data mysql powerbi sql visualization
Last synced: 25 Sep 2025
https://github.com/frnt-end/ts-context-items-list
⚛️ React TypeScript project: fetch data and display it as a paginated list (10 pages of 10 items each). Clicking an item leads to a details page. Uses axios, Context, and Styled Components.
api axios context context-api data fetch list pagination router router-dom styled-components typescript
Last synced: 19 May 2026
https://github.com/caprogs/paris-events-analyzer
A project to analyze events in Paris using open source data provided by the city.
data data-analysis data-platform dbt docker ingestion python streamlit transformation vizualisation
Last synced: 04 May 2026
https://github.com/badranalyst/data-cleaning-and-exploratory-data-analysis-project
This project uses SQL to clean and analyze a layoffs dataset. Data cleaning tasks include removing duplicates, standardizing values, and handling missing data. Exploratory analysis is performed to identify trends in layoffs across companies, industries, and time periods.
cleaning-data data database dataset mysql mysql-database sql
Last synced: 07 Apr 2025
https://github.com/juanpablo70/pgad-assignment02
Alzheimer data set analysis
data data-science dataframe dataset jupyter-notebook r
Last synced: 18 May 2026
https://github.com/fordinand45/bdp_a_kelompok_3
Big Data Python project organized by Digitalent Kominfo. The project participants are Dhian Prameswari, Fordinand Pasaribu, and Muhdad Alfaris Bachmid.
data data-analytics data-science linear-regression python3
Last synced: 12 Apr 2026
https://github.com/realbxnnie/accountservice
A simple DataStoreService wrapper with session backup and session locking.
Last synced: 29 Jul 2025
https://github.com/raghavendranhp/attrition-alchemy
This project uses machine learning to predict and analyze employee attrition in a company. By developing three predictive models, it identifies key factors influencing turnover, providing actionable insights to mitigate attrition challenges. The analysis focuses on enhancing job satisfaction, work-life balance, and career growth opportunities.
data datawrangling decision-trees eda gradient-boosting logistic-regression macine-learning pandas preprocessing random-forest-classifier skicit-learn svm
Last synced: 18 May 2026
https://github.com/aminnairi/node-decode
Check that your data meet your expectations
check data decode expectations schema
Last synced: 22 Apr 2026
https://github.com/lancewalk87/cls-cloud-sync-ruby-on-rails
Software | SQL database with automated cloud sync for mitigating data loss across distributed servers. Managed by Ruby on Rails.
cloud-computing cloud-storage data database ruby ruby-application ruby-on-rails server sql
Last synced: 24 Jul 2025
https://github.com/rid17pawar/friendscircle
Friends Circle is a console-based application developed in C++ using a graph data structure.
cpp data graph graph-algorithms oop
Last synced: 08 Jun 2026
https://github.com/olekscode/datageneration
Exploring the methods of data generation for different Machine Learning algorithms
data javascript machine-learning
Last synced: 05 Apr 2025
https://github.com/tks18/xl-pq-handler
A Pythonic Power Query (.pq) File Manager for Excel & Power BI Automation
analytics automation data excel power-query powerbi python xlwings
Last synced: 20 Jan 2026
https://github.com/michael-sebero/data-recovery-tools
This tool suite recovers sensitive data.
algiz-linux archive corruption data data-recovery linux recover recovery rust tool tool-suite tools
Last synced: 18 May 2026
https://github.com/carlosrs14/parallel-data-preprocessig-system
A parallel data preprocessing system using threads and synchronization mechanisms (barrier, busy-waiting, condition variables) to clean and prepare data for AI training.
barrier-method c condition-variable data operative-systems parallel-computing posix preprocessing synchronization threads
Last synced: 24 Jul 2025
https://github.com/cannt39t/data-mining-spider-vk
A spider that collects all information about advertising posts in a VK group
data data-mining python3 vk vkontakte
Last synced: 05 Apr 2025
https://github.com/als8446/tripleten-data-science-projects
Projects overview: projects made in the Data Scientist course from TripleTen LatAm
data data-analysis hypothesis-tests machine matplotlib numpy pandas python scipy sklearn
Last synced: 10 Apr 2026
https://github.com/jigyasag18/amazon-power-bi-dashboard
The Amazon Power BI Dashboard Project repository provides an interactive analytics dashboard for visualizing and analyzing sales performance across various product categories within Amazon's ecosystem. Utilizing comprehensive sales data, it empowers stakeholders with actionable insights to enhance decision-making and improve business strategies.
data data-visualization dataanalysis dataanalytics dataset datasets datavisualization-project powerbi powerbi-report powerbi-visuals powerbidashboard
Last synced: 07 Mar 2026
https://github.com/bonnevoyager/quick-storage
Simple key/value storage module with persistency.
browser data fs indexeddb javascript key-value nodejs persistence quick server storage
Last synced: 16 Apr 2026
https://github.com/rudxain/xorsum
Get XOR checksum with this command-line tool
binary checksum cli data digest file files hexadecimal rust-crate xor
Last synced: 08 Mar 2026
https://github.com/priyapuranik/data-analytics-using_python
Analysis of hotel data to find meaningful insights, including booking patterns, seasonal trends, and more.
data pandas python sql visualization
Last synced: 06 Apr 2026
https://github.com/natarizkie2/neurochain-airdrop-bot
🍋 — A smart bot designed to complete data tasks like true/false selections automatically, with multi-account support for extra convenience.
airdrop automated bot data multi-account natarizkie neurochain nodejs web3
Last synced: 10 Jun 2026
https://github.com/justinyahin/wpdf
Create, filter, sort and display users data on your WordPress site.
Last synced: 18 Apr 2026
https://github.com/mendel5/wifi
Information about Wi-Fi (wifi, WLAN, wireless LAN)
bitrate data data-transmission ethernet internet latency speed throughput transfer transmission wi-fi wifi wireless wireless-lan wlan
Last synced: 02 Aug 2025
https://github.com/itrauco/streaming-data-platform
skeleton streaming data platform on gcp...
big-data data data-engineering data-infrastructure data-science engineering google-cloud platform-engineering python streaming-data
Last synced: 13 Jun 2026
https://github.com/jigyasag18/airline-performance-and-passenger-satisfaction-project-using-big-data-analytics
This project analyzes 10 years of U.S. domestic airline data (~3GB) using Hadoop (Cloudera) and Hive for data processing. Power BI dashboards visualize key metrics like delays, on-time rates, air time, and diversions. The solution includes Hive queries, DAX measures, HDFS ingestion scripts, and year-wise insights with recommendations.
big-data big-data-analytics bigdata cloudera cloudera-hadoop cloudera-hadoop-framework data data-analysis data-visualization database hadoop hive power-bi powerbi powerbi-dashboard powerbi-dashboards powerbi-report powerbi-visuals powerbi-visuals-tools powerbidashboard
Last synced: 01 Aug 2025
https://github.com/fatihemres/Africa
Africa app built with SwiftUI, using AVFoundation, MapKit, data, models, animations, and stickers.
animations avfoundation data mapkit models swift swift-animations swiftui
Last synced: 31 Aug 2025
https://github.com/fatihemres/Fruits
Fruit Details app built with SwiftUI, using data, models, and animation, with a practical onboarding example.
animations data models onboarding swift swiftui
Last synced: 31 Aug 2025
https://github.com/hess125/data-visualizations
A repository of data visualization projects
data data-analysis data-science data-visualization powerbi projects sql sqlite tableau
Last synced: 31 Aug 2025
https://github.com/erickpeirson/jhb-data
Data from the forthcoming paper: Quantitative Perspectives on Fifty Years of the Journal of the History of Biology
data geolocation history-of-biology named-entity-recognition topic-modeling
Last synced: 04 Mar 2026
https://github.com/chompfoods/sdk-typescript-angular
Angular TypeScript SDK for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.
angular api branded chomp data database food grocery ingredients nutrition raw recipe-api recipes sdk typescript
Last synced: 09 May 2026
https://github.com/greatwoman23/car_insurance_analysis
The Car Insurance Analysis project aims to provide a comprehensive examination of a car insurance portfolio using advanced data analytics tools. The analysis offers valuable insights into policy demographics, claims patterns, and financial metrics, helping stakeholders make informed decisions.
bigquery data data-science dataanalytics insurance-claims looker-studio tableau
Last synced: 03 Feb 2026
https://github.com/ashakoen/bls-data-extract
This repository contains scripts and a database schema to set up and manage a local SQLite database for storing and querying the Average Price data from the U.S. Bureau of Labor Statistics. It includes tools for downloading the latest data from the BLS website and fetching Consumer Price Index (CPI) data via the BLS API.
Last synced: 01 Apr 2026
https://github.com/thomasjewson/cci-data-science-textbook
This is a short, interactive textbook aimed at introducing data science to non-IT university undergraduates. Funded by Erasmus+.
data data-science learning python textbook
Last synced: 16 Apr 2026
https://github.com/e-kotov/albofr
alboFr: Get French Data on Tiger Mosquito Colonisation
aedes-albopictus data france tiger-mosquito
Last synced: 11 Jun 2026
https://github.com/redatargaoui/dataconverter
Data conversion functionality to integrate into the software used for autism detection research.
apache-poi data dataconversion excel java
Last synced: 06 Sep 2025
https://github.com/team810/frcs
FRCS is online, international, crowdsourced data collection software written for FRC competitions. It was created by Team 810, The Mechanical Bulls.
Last synced: 14 Mar 2025
https://github.com/amethyst-php/account
account amethyst amethyst-package api data laravel
Last synced: 18 May 2026
https://github.com/srgchrksv/stream-crypto
Crypto trades streaming with azure services
azure binance crypto data databricks dataengineering pyspark python streaming websocket
Last synced: 30 Apr 2026
https://github.com/yadavkaushal/datascience-e-commerce-shopping-details
This project analyzes customer purchase data, including details such as location, company, credit card usage, browser info, job roles, and purchase price. It explores patterns in payment methods, spending behavior, and online transactions. Using Pandas, Matplotlib, and Seaborn, we clean, analyze, and visualize key trends to derive actionable insights.
data datacleaning dataframe datapreprocessing dataset libraries matplotlib numpy pandas plots visulaization
Last synced: 06 May 2026
https://github.com/bryanhe24/data_analysis_app
A full-stack web application that allows users to upload CSV datasets, analyze the data with statistical summaries and visualizations, and interact with an AI-powered assistant for querying the dataset.
ai data data-analysis data-visualization fullstack-development javascript math python reactjs
Last synced: 07 May 2026
https://github.com/dms-codes/scrape_tripsantai
Trip Santai Tour Data Scraper. This Python script is a web scraper designed to extract and collect information about tours from the Trip Santai website. It utilizes the requests library to fetch web pages, BeautifulSoup for parsing HTML, and writes the collected data to a CSV file.
beautifulsoup4 data python requests scraper webscraper
Last synced: 21 May 2026
https://github.com/cmda-tt/course-25-26
🎓 tech track · 2025-2026 · curriculum and syllabus 📊
d3 data datavis functional javascript programming research svelte visualization
Last synced: 20 Jan 2026
https://github.com/luminati-io/google-search-api
Two methods to collect real Google SERP data—a free scraper for basic use and the enterprise-grade Bright Data API for high-volume demands.
data google-scraper html python serp-api web-scraping
Last synced: 25 Jun 2025
https://github.com/fastpix/android-data-bitmovin
FastPix Video Data SDK to monitor and analyze video playback metrics within Bitmovin for Android
analytics android-sdk bitmovin data fastpix metrics player sdk video
Last synced: 16 Apr 2026
https://github.com/talitalobo/statistics-with-python
Repo about statistical concepts and (not always) their Python implementation.
data data-science machine-learning statistics
Last synced: 11 Jan 2026
https://github.com/mekramy/ircity
Iran province, county, and city data in JSON format.
Last synced: 05 Apr 2025
https://github.com/jigyasag18/power-bi-dashboard-project
The Ecommerce Sales Analysis Dashboard project utilizes Power BI to provide detailed insights into ecommerce sales data, enabling stakeholders to track key performance metrics and uncover trends. This interactive dashboard allows users to explore the data in real time, offering features such as drill-down capabilities and customizable filters.
dashboard data data-visualization datacleaning datanalysis datanalytics datapreprocessing powerbi visulaization
Last synced: 04 Mar 2026
https://github.com/ksimicevic/discord-message-analyzer
Analyzing Discord messages in a Jupyter notebook
analysis data discord messages
Last synced: 16 Apr 2026
https://github.com/byndyusoft/byndyusoft.data.relational.specifications
byndyusoft data relational specifications
Last synced: 12 Sep 2025
https://github.com/haideratgh/sql-data-analytics-project
This repository contains a collection of SQL scripts demonstrating various analytical techniques, such as change-over-time, cumulative, performance, data segmentation, and part-to-whole analysis.
analytics business-analytics business-intelligence data data-analysis data-analyst data-analytics data-engineering data-science data-scientist database datascience query reporting sql sql-query sql-server window-functions-in-sql
Last synced: 29 Jun 2025