data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-07-02 00:07:45 UTC
- JSON Representation
https://github.com/thibautre/dataipsum
Configurable data generator (with crumbles inside)
algorithm data random-generation
Last synced: 21 Jul 2025
https://github.com/ubc-library-rc/tableau-intro
Intro to data visualization with Tableau
Last synced: 01 Jul 2026
https://github.com/karajmiglani-datascientist/karajmiglanifake-news-detection
FAKE_NEWS_PREDICTION
algorithms data data-science flask machine-learning probability-statistics python statistics structure
Last synced: 22 May 2026
https://github.com/eryks1999/data-collection-project_python
This project allowed me to practice classes, populating json files as well as extracting data.
Last synced: 16 Apr 2026
https://github.com/ubc-library-rc/intro-web-scraping
Research Commons Introduction to Web Scraping Workshop.
data digital-scholarship workshop
Last synced: 01 Jul 2026
https://github.com/jensostertag-archive/charts.js
A JavaScript Plugin to draw Charts to visualize Data and Statistics on Websites
charts data javascript statistics webapplication
Last synced: 22 Jun 2025
https://github.com/Axnjr/csv-parser-utils
Homework task for SWE position at Redhat.
csv data dataanalysis datatools pandas python
Last synced: 30 Oct 2025
https://github.com/e22m4u/ts-projection
Модуль для работы с проекцией данных для TypeScript
Last synced: 12 Apr 2025
https://github.com/rickstaa/ai-compute-visualizer
A StreamLit-based web application to visualize GPU inventory and AI capabilities on the Livepeer network.
Last synced: 28 Jun 2025
https://github.com/juniorreisx/movelo-logstica
Movelo is a lightweight logistics simulator built with TypeScript that provides mock order and delivery data for developing and testing UIs, dashboards, and backend features without external APIs.
data hooks lucide-react react tailwindcss typescript
Last synced: 12 Apr 2025
https://github.com/ubc-library-rc/tableau-dashboard
Designing dashboards with Tableau
Last synced: 01 Jul 2026
https://github.com/jigyasag18/ibm-power-bi-dashboard-project
IBM Power BI Dashboard Project is a data-driven analysis of employees using IBM's comprehensive dataset, providing insights into key factors contributing to employee turnover and enabling organizations to strategize effectively towards improved employee retention and satisfaction.
data data-visualization dataanalysis dataanalytics dataset datavisualisation datavisualization-project powerbi powerbi-dashboards powerbi-report powerbi-visuals powerbidashboard
Last synced: 07 Mar 2026
https://github.com/matheussoranco/how-to-estimate-required-sample-size-for-model-training
Modeling the relationship between training set size and model accuracy.
artificial-intelligence data jupyter-notebook machine-learning python
Last synced: 22 May 2026
https://github.com/skygenesisenterprise/aether-calendar
Aether Calendar is a lightweight, open-source client built for privacy, speed, and seamless integration within the Aether Office ecosystem
applications calendar capacitorjs data javascript linux macos nextjs typescript windows
Last synced: 12 Apr 2026
https://github.com/madihanazir/ds-using-c
Basic insights into Data Structures (inspired by Abdul Bari course but in C language)
data self-learning structures-in-c
Last synced: 17 Mar 2025
https://github.com/lu-sketch/chocolate-imports-dataset
Chocolate Imports for South Africa
Last synced: 18 May 2026
https://github.com/athari22/statistics-from-stock-data
Statistics from Stock Data
cvs data data-science dataanalysis datacleaning dataframe jupyter pandas pandas-python python statistics stock table
Last synced: 16 Feb 2026
https://github.com/dan149/uselesscontentcreator
Useless Content Creator (UCC) is a fake content generator, text, html and pdf files.
content customizable data easy-to-use fake-data fake-data-generator faker-generator generator lightweight open-source opensource python python3
Last synced: 03 Apr 2025
https://github.com/brunosalerno/osm_data
Ruby objects for dealing with OSM data, and generating XML files
Last synced: 21 Apr 2026
https://github.com/byndyusoft/byndyusoft.data.relational.specifications
byndyusoft data relational specifications
Last synced: 12 Sep 2025
https://github.com/ashishsingh789/quantium_data-analysis-_virtual-internship
Completed a job simulation focused on Data Analytics and Commercial Insights for the data science team. Developed expertise in data preparation and customer analytics, utilizing transaction datasets to extract valuable insights and deliver data-driven commercial recommendations
data datawrangling matplotlib pandas pandas-dataframe presentation programming python python-library
Last synced: 07 Apr 2026
https://github.com/garcane/layoffs-exploratory-data-analysis
This project uses MySQL to perform data cleaning and exploratory data analysis (EDA) on a dataset detailing company layoffs. The primary goal is to process, clean, and explore the data to gain insights into trends and patterns related to layoffs across various sectors.
data dataanalysis eda mysql sql
Last synced: 29 Oct 2025
https://github.com/webdevcave/collections-php
A PHP library for managing collections of data with support for nested keys.
array collection data helper library nested-keys package php utility utility-classes
Last synced: 28 Jun 2026
https://github.com/iota-pico/data
IOTA Pico Framework Data Structures and Helpers
data iota iota-pico-framework javascript typescript
Last synced: 18 May 2026
https://github.com/raufjatoi/electricity-consumption-prediction
arima-model customize data kinda-dynamic ml
Last synced: 25 Jul 2025
https://github.com/jigyasag18/data-analysis-using-ms-excel
This project is on analyzing real-time data from Ambuvians Healthcare, a health products startup. It included data cleaning, such as removing duplicates and addressing missing values, followed by analyses to reveal insights into sales trends, customer demographics, and purchasing behaviors. Visualizations in MS-Excel including bar and pie charts.
analysis data data-visualization dataanalysis datacleaning datapreprocessing dataset msexcel visualization
Last synced: 07 Mar 2026
https://github.com/jigyasag18/amazon-power-bi-dashboard
The Amazon Power BI Dashboard Project repository provides an interactive analytics dashboard for visualizing and analyzing sales performance across various product categories within Amazon's ecosystem. Utilizing comprehensive sales data, it empowers stakeholders with actionable insights to enhance decision-making and improve business strategies.
data data-visualization dataanalysis dataanalytics dataset datasets datavisualization-project powerbi powerbi-report powerbi-visuals powerbidashboard
Last synced: 07 Mar 2026
https://github.com/yusuf4030/the-data-analyst-toolkit
📊 Explore essential data analysis tools organized by role and task, empowering users from students to professionals with quick access to valuable resources.
budget budget-management business-intelligence charts cookbook cureated-list data data-analysis-python data-visualization internet-of-everything internet-of-transport large-language-models nse open-source python selenium stock-market traffic-analysis
Last synced: 18 May 2026
https://github.com/ubc-library-rc/r-microdata
Workshop showcasing available microdata at the UBC library
Last synced: 01 Jul 2026
https://github.com/alexdonh/adonis-cache
Another cache provider for AdonisJs. Supports Object, File, Db and Redis cache. With cache dependencies!
adonis-framework adonisjs cache data dependency redis storing
Last synced: 15 May 2026
https://github.com/pooja-manjunatha/nyc_parking_violations_dbt
This project uses dbt to transform NYC parking violations data through a layered architecture: Bronze: Raw ingested data Silver: Cleaned and enriched data Gold: Aggregated tables for analytics Using DuckDB as the warehouse backend, it ensures data quality with tests and documentation. The project enables reliable analysis of parking violations
data data-analysis data-engineering dbt duckdb python sql
Last synced: 14 May 2026
https://github.com/mightymetrika/mmirestriktor
Informative Hypothesis Testing Web Applications
data hypothesis infomative power r restriktor statistics testing
Last synced: 17 Mar 2025
https://github.com/cannt39t/data-mining-spider-vk
Паук который собирают всю информацию о рекламных постах в группе VK
data data-mining python3 vk vkontakte
Last synced: 05 Apr 2025
https://github.com/pbinkley/tweets-online-classes-covid19
A twarc harvest of tweets related to online classes during the COVID-19 outbreak, starting 2020-03-02
Last synced: 06 Mar 2026
https://github.com/luminovrym/crawler-tools-js
Crawler Tools Js adalah sebuah aplikasi yang digunakan untuk scrapping data pada sebuah web
crawler crawler-js data js web-scraping
Last synced: 08 Sep 2025
https://github.com/os-climate/rmi-utility-transition-hub-ingestion-pipeline
Data ingest for RMI's Utility Transition Hub data (as of March 7, 2022)
data emissions-co2 energy-data os-climate
Last synced: 12 Apr 2025
https://github.com/michael-sebero/data-recovery-tools
This tool suite recovers sensitive data.
algiz-linux archive corruption data data-recovery linux recover recovery rust tool tool-suite tools
Last synced: 18 May 2026
https://github.com/vdutts7/speedtests
crawl data isp latency ookla speedtests
Last synced: 01 Jul 2026
https://github.com/antononcube/raku-data-typesystem
Data type system for different data structures.
data data-structures rakulang type-system
Last synced: 09 Jul 2025
https://github.com/siongui/xemaauj9k5qn34x88m4h
No source code. Only serve JSON files of Pāli words
Last synced: 15 May 2026
https://github.com/ahadly/sql-data-analytics-project
This repository contains a collection of SQL scripts demonstrating various analytical techniques, such as changes over time, cumulative, performance, data segmentation, part-to-whole analysis.
analytics business-analytics business-intelligence data data-analysis data-analyst data-analytics data-engineering data-science data-scientist database datascience query reporting sql sql-queries sql-query sql-server window-functions window-functions-in-sql
Last synced: 18 May 2026
https://github.com/gui-sitton/games
Identify patterns that determine whether a game is successful or not. This will allow you to identify potential big winners and plan advertising campaigns.
data data-analysis data-analysis-python data-science data-visualization python
Last synced: 18 May 2026
https://github.com/amethyst-php/issue
amethyst amethyst-package api data issue laravel task ticket
Last synced: 18 May 2026
https://github.com/devprnvk/pycryptochain
A implementation of a blockchain-based cryptocurrency in Python. This project aims to provide a fundamental understanding of blockchain technology and cryptocurrency by building a basic version from scratch. Features include blockchain creation, transaction handling, mining rewards, simulation.
blockchain crypto data decryption encryption hashing processing py python salting storage
Last synced: 09 Mar 2026
https://github.com/xjwllmsx/profitable-app-profiles
Analyzes Google Play & App Store data to recommend profitable profiles for free, ad-supported mobile apps
data data-analysis data-cleaning jupyter pandas python
Last synced: 18 May 2026
https://github.com/rellyson/data-engineering-tools
This repository holds examples and documentation about the most used tools in the data engineering ecosystem.
apache-airflow apache-spark data data-engineering jupyter-notebook python tools
Last synced: 17 Jan 2026
https://github.com/bastianolea/servel_elecciones
Resultados electorales desde Servel (2024)
chile comunas data elecciones genero
Last synced: 08 Jul 2025
https://github.com/rid17pawar/friendscircle
Friends Circle is a console based application developed in cpp using Graph Data Structure.
cpp data graph graph-algorithms oop
Last synced: 08 Jun 2026
https://github.com/nodamu/apache-beam-studies
Personal Apache Beam studies repository
apachebeam batch-processing data dataeng dataengineering datapipeline stream-processing
Last synced: 04 Nov 2025
https://github.com/raghavendranhp/attrition-alchemy
This project uses machine learning to predict and analyze employee attrition in Company.By developing three predictive models,it identifies key factors influencing turnover,providing actionable insights to mitigate attrition challenges.The analysis focuses on enhancing job satisfaction,work-life balance and career growth opportunities.
data datawrangling decision-trees eda gradient-boosting logistic-regression macine-learning pandas preprocessing random-forest-classifier skicit-learn svm
Last synced: 18 May 2026
https://github.com/yugoff/ml-kaggle-regression-with-a-mohs-hardness-dataset
Your Goal: For this Episode of the Series, your task is to use regression to predict the Mohs hardness of a mineral, given its properties
data gradient-boosting kaggle kaggle-competition regression-models
Last synced: 18 May 2026
https://github.com/mkshah605/personal-brand-development
A data-driven approach to a personal brand development project.
branding data data-science growth music personal
Last synced: 12 Sep 2025
https://github.com/kammarah/studentdata
I created & deployed a Streamlit app to store, manage & analyze student data. 📊🎓
connection data data-analysis data-visualization deploy deployments libraries python streamlit streamlit-webapp webapp
Last synced: 18 May 2026
https://github.com/ubc-library-rc/r-markdown
Introduction to using R Markdown
Last synced: 01 Jul 2026
https://github.com/analyticslover/salifort-motors-turnover-project
The Salifort Motors H.R. Project serves as the capstone for the Google Advanced Analytics Program on Coursera. This project presents a business scenario and a problem on the scnario context, employee turnover. In this project, essential techniques as EDA and Data Modeling are used to analyze and predict the employee turnover rates in the company.
data data-analysis datamodeling eda machine-learning pandas python sklearn
Last synced: 10 Apr 2026
https://github.com/the-universal-linux-society/sysreport
Bash script to give you a full system report. Just by running the script it offers insight into CPU data, disk space, temperature readings, network configuration, MAC addresses, firewall status, and system logs for error analysis.
analysis bash bash-script bash-scripting data report reporting system
Last synced: 15 May 2026
https://github.com/valyaevgeorgiy/r_basic
Работа с основами среды R и тем самым изучения нового языка программирования, связанного непосредственно с анализом данных и построением графиков и диаграмм.
coding data data-analysis r rstudio
Last synced: 12 Dec 2025
https://github.com/meltymooncakes/blockdata
Minecraft Block data
api data json minecraft minecraft-data
Last synced: 13 Apr 2025
https://github.com/fastpix/flutter-core-data-sdk
A comprehensive Flutter SDK for video player analytics and event tracking, designed to provide detailed insights into video playback behavior and user engagement metrics.
Last synced: 15 May 2026
https://github.com/amethyst-php/delivery-point
amethyst amethyst-package api data delivery-point laravel
Last synced: 18 May 2026
https://github.com/pedrozamecki/datatube
Site Open Source para análise de dados de canais do YouTube.
data estatistica statistical-analysis statistics youtube
Last synced: 18 May 2026
https://github.com/skygenesisenterprise/api-service
The Official Sky Genesis Enterprise API Service Ecosystem
api-service client cryptography data dns docker javascript nextjs service stalwart typescript websocket
Last synced: 31 Dec 2025
https://github.com/inekipelov/swift-codable-advance
A library of extensions for Swift Codable protocols, simplifying the process of encoding and decoding objects.
codable data dictionary json swift
Last synced: 25 Jan 2026
https://github.com/fordinand45/bdp_a_kelompok_3
Project Big Data Python yang diadakan oleh Digitalent Kominfo. Berikut adalah yang ikut serta pada project, yaitu : Dhian Prameswari, Fordinand Pasaribu, dan Muhdad Alfaris Bachmid
data data-analytics data-science linear-regression python3
Last synced: 12 Apr 2026
https://github.com/dscamilo/gestion-clientes-springboot
Proyecto de gestión de clientes aplicando Java y Springboot, haciendo uso de Lombok, uso de interface, inyección de dependencias, uso de anotaciones Service, Data, RestController . Consumo de API haciendo uso de Postman.
data interface java lombok-maven restcontroller spring-boot
Last synced: 15 May 2026
https://github.com/annaanastasy/mushroom-binary-classification-eda-ml
Explored and modeled a competition dataset of mushroom species, focusing on data cleaning, exploratory data analysis, and building machine learning models for accurate classification of edible and poisonous mushrooms.
binary-classification data data-cleaning-and-preprocessing data-science exploratory-data-analysis machine-learning-algorithms xgboost-classifier
Last synced: 29 Mar 2025
https://github.com/juanpablo70/pgad-assignment02
Alzheimer data set analysis
data data-science dataframe dataset jupyter-notebook r
Last synced: 18 May 2026
https://github.com/ubc-library-rc/basics_of_data_viz
Basics of Data Visualization
Last synced: 01 Jul 2026
https://github.com/ubc-library-rc/intro-data-analysis-python
Introduction to Python for Data Analysis
Last synced: 01 Jul 2026
https://github.com/santoshshinde2012/medallion-architecture-databrics
Medallion Architecture: Principles and Practical Exploration
data data-plat data-science databricks databricks-notebooks medallion-architecture
Last synced: 26 Jul 2025
https://github.com/estherslabbert/data-exploration
Data analysis and data visualizations for different data sets
data data-analysis data-science data-visualization jupyter-notebook titanic-dataset usa-arrests-dataset
Last synced: 06 Apr 2025
https://github.com/estherslabbert/regression-models
Different regression explorations for different datasets
data data-science diabetes-dataset hourly-wage-dataset insurance-dataset iris-dataset jupyter-notebook linear-regression logistic-regression multiple-linear-regression regression-analysis regression-models
Last synced: 06 Apr 2025
https://github.com/caprogs/paris-events-analyzer
A project to analyze events in Paris using open source data provided by the city.
data data-analysis data-platform dbt docker ingestion python streamlit transformation vizualisation
Last synced: 04 May 2026
https://github.com/gsmithun4/expressjs-field-validator
Plugin for validating JSON request, middleware for expressjs
data express-js expressjs json-request middleware nodejs request rest-api validation
Last synced: 06 Mar 2026
https://github.com/ramonrsv/f1_data
Provides consolidated access to various sources of Formula 1 information and data, including event schedules, session results, timing and telemetry data, as well as historical information about drivers, constructors, circuits, etc.
Last synced: 07 Apr 2026
https://github.com/giosil/export-as
A convenience library for exporting data in different formats.
data data-export export exporter java
Last synced: 26 Jul 2025
https://github.com/alexis-gss/games-data
Games Data is a library of informations about all games, realised under NuxtJs
css3 data games nuxtjs tailwindcss typescript vuejs
Last synced: 13 Mar 2025
https://github.com/thetacom/byteclasses
A Python package to manage and interact with binary data in a simple and structured manner.
binary-data bytes data dataclasses package python python3
Last synced: 11 Jul 2025
https://github.com/shrutakeerti/eye-gaze-detection
This repo contains everything that I have done at IIT Jodhpur Summer Internship May 15 - July 15
ai aiml data eda eeg eeg-signals eye jodhpur mlflow
Last synced: 17 Mar 2025
https://github.com/The-Tech-Idea/Beep.winform.Sample
Application for Managing your Different DataSources . Still in Alpha.please be patient
application data data-science database dataset integeration mysql nosql oracle postgres sqlite sqlserver workflow-engine workflows
Last synced: 04 Nov 2025
https://github.com/mksingh431/sql-complete-notes
SQL, or Structured Query Language, is a robust and specialized programming language designed for efficient management and manipulation of relational databases. With SQL, you can seamlessly interact with databases like MySQL, PostgreSQL, Microsoft SQL Server, Oracle,.
Last synced: 21 Apr 2026
https://github.com/clagiordano/weblibs-data-export
Library for generic data export to various formats
clagiordano data export weblibs xlsx
Last synced: 01 Jul 2026
https://github.com/ramonmeza/mysteamstats
Visualize your stats from your favorite games on Steam!
data statistics steam steam-api videogame visualization
Last synced: 17 Mar 2025
https://github.com/karensaraimoralesmontiel/8-week-sql-challenge
Case Studies Solutions for the 8-Week-SQL-Challenge.
Last synced: 02 Jan 2026
https://github.com/amethyst-php/target
amethyst amethyst-package api data laravel target
Last synced: 22 May 2026
https://github.com/saisurajmatta/data-warehousing-and-advanced-data-analytics
Data Analytics Project: Analyzed Promotions and Provided Tangible Insights to Sales Director
data data-analysis data-architecture data-flow-analysis data-modeling data-pipeline data-segmentation data-visualization data-warehousing docker etl etl-pipeline mssql sql tableau
Last synced: 17 May 2026
https://github.com/encelo/nctracer-data
Data files for the ncTracer project
Last synced: 15 Jan 2026
https://github.com/donbarbos/pypi-typing
csv data dump pypi pypi-packages python python-statistics python-typing script types typing typing-statistics typing-status
Last synced: 13 Sep 2025