data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum
- Last updated: 2026-07-02 00:07:45 UTC
https://github.com/eloyhere/semantic-java
Semantic-Java is a modern Java stream processing framework with zero dependencies, available as a Maven dependency. It blends the fluency of Java Streams, the laziness of JavaScript generators, and index-based control inspired by database indexing, and is aimed at time-series, event streams, and high-performance data pipelines.
data functional functional-programming java pipeline stream
Last synced: 07 Apr 2026
https://github.com/theduardomaciel/cc-pe
Content, R scripts, and datasets used during the Probability and Statistics course.
Last synced: 27 Mar 2025
https://github.com/vijaykumar1303/sales-data-analysis-and-dashboard-development
Analyzes sales data to uncover insights into sales performance, trends, and patterns, and develops an interactive dashboard that provides a comprehensive view of sales metrics and KPIs.
data dataanalysis datacleaning datavisualisation dax-query powerbi powerquery sql sqldataanalysis
Last synced: 11 Feb 2026
https://gitlab.com/sean-c/pdf_rules
Turn PDFs into CSVs by defining rules
Data Cleaning automation data data parsing
Last synced: 14 Apr 2025
https://github.com/campiohe/geomask
A very simple lib for creating geometric masks from spatial data using regular grids.
Last synced: 30 Dec 2025
https://github.com/gui-sitton/prepaid
In this project I work as an analyst for the telecommunications company Megaline. The company offers its customers two prepaid plans, Surf and Ultimate. The sales department wants to know which plan brings in more revenue in order to adjust the advertising budget.
data data-analysis data-analysis-python data-science data-visualization python
Last synced: 22 May 2026
https://github.com/samaalharbi2/virtual-work-experience---data-analysis-at-stc
Virtual Work Experience in Data Analysis at STC
analysis data data-visualization misk stc
Last synced: 20 Jun 2025
https://github.com/chocolateboy/corrigenda
Corrections, addenda, and deltas for data that's wrong on the Internet
addenda api corrections corrigenda data json json-data
Last synced: 27 Mar 2025
https://github.com/mapaor/horaris-rodalies
Website that uses the Rodalies de Catalunya API to display train timetables in a more fun way
adif api ave barcelona bordils catalunya dades data distancia generalitat girona horaris md r11 regional renfe rodalies sants tren viajes
Last synced: 16 May 2026
https://github.com/danicaalana/breast-cancer-random-forest
This project was developed as part of Digital Skill Fair (DSF) 35.0 - Data Science by Dibimbing. It uses the Wisconsin Breast Cancer Diagnostic Dataset from scikit-learn, a classic and very easy binary classification dataset.
breast-cancer-classification breast-cancer-wisconsin data eda machine-learning-algorithms python random-forest-classifier
Last synced: 16 May 2026
https://github.com/domarps/grad-project-reports
Write-ups of a few key semester-long projects I worked on during my Master's
circuit data deeplearning graph-algorithms matlab question-answering
Last synced: 26 Mar 2025
https://github.com/chrisrobertsjr/chrisrobertsjr
Welcome to my Github Profile!
data data-analysis java r sql statistics
Last synced: 03 May 2026
https://github.com/jorgeatgu/dataset-elecciones-28a
Datasets generated from El País's general elections dataset
28a data elecciones2019 elections spain
Last synced: 16 May 2026
https://github.com/nabilaagha/chest-x-ray-medical-diagnosis-using-deep-learning
This project uses deep learning to classify chest X-ray images for disease detection. It involves data preprocessing, pre-trained CNN models, and the ChestX-ray8 dataset to enhance medical diagnostics with AI.
computer-vision data data-processing deep-learning juypter-notebook medical-image-processing x-ray-images
Last synced: 15 Dec 2025
https://github.com/erictleung/2018-new-coder-survey
Code to wrangle data from the 2018 New Coder Survey by freeCodeCamp
data data-cleaning dataset freecodecamp new-coders-survey programmers
Last synced: 03 Apr 2025
https://github.com/jor-/measurements
Python functions to handle, statistically analyze and plot measurement data.
Last synced: 17 Mar 2025
https://github.com/eslamdyab21/apara-data-gui
Custom application for Apara's data wrangling scripts. Technologies used are Qt Designer and PyQt5 for the GUI, and Pandas and NumPy for the data work.
csv data data-analysis data-wrangling gui pandas pyqt5-desktop-application qt5-gui
Last synced: 17 May 2026
https://github.com/woctezuma/humble-choice-leak
Retrieve leaks for Humble Choice.
data datamining humble-bundle humble-bundle-games humble-bundle-leak humble-choice humble-choice-leak humblebundle humblebundle-leak leak leaks steam steam-games
Last synced: 27 Mar 2025
https://github.com/greatwoman23/sentiment-analysis-on-amazon-products-review
Sentiment analysis on Amazon product reviews
analysis dashboard-application data data-science datascientistproject machine-learning publication python remotejob
Last synced: 17 May 2026
https://github.com/ericmaddox/nyc-crime-analytics
Analyzes and visualizes crime data from the NYC Police Department using interactive maps and heatmaps, leveraging the NYC Open Data API.
crime-analysis crimedata data datavisualization esri folium heatmap nycopendata python python3 rtcc
Last synced: 24 Jun 2025
https://github.com/cemoktra/data_series
time series handling
data lazy-evaluation time-series
Last synced: 29 Oct 2025
https://github.com/huspacy/huspacy-resources
Resources for building and evaluating huspacy
Last synced: 21 Mar 2025
https://github.com/amethyst-php/office
amethyst amethyst-package api data laravel office
Last synced: 17 May 2026
https://github.com/octoenergy/tentaclio-gdrive
A python project containing all the dependencies for the gdrive tentaclio schema
Last synced: 24 Jun 2025
https://github.com/octoenergy/tentaclio-databricks
Module to give tentaclio support to databricks
Last synced: 24 Jun 2025
https://github.com/octoenergy/tentaclio-s3
A python project containing all the dependencies for s3 tentaclio schema.
Last synced: 24 Jun 2025
https://github.com/octoenergy/tentaclio-athena
A python project containing all the dependencies for awsathena+rest tentaclio schema.
Last synced: 24 Jun 2025
https://github.com/octoenergy/tentaclio-postgres
A python project containing all the dependencies for postgresql tentaclio schema.
Last synced: 24 Jun 2025
https://github.com/octoenergy/tentaclio-gs
A python project containing all the dependencies for gs tentaclio schema.
Last synced: 24 Jun 2025
https://github.com/ahabdel/amazon-web-scraper
Amazon web scraper that scrapes pricing adjustments and provides updates on a day-to-day basis
Last synced: 29 Oct 2025
https://github.com/takamoso/umami
Cross browser compatibility data.
browser compat compatibility data dataset json
Last synced: 27 Mar 2025
https://github.com/kaizadp/bbwm_moisture
HOBO data for soil moisture - Bear Brook Watershed in Maine
Last synced: 17 May 2026
https://github.com/hyfi06/unam-careers
A utility package for retrieving career information from UNAM.
Last synced: 16 May 2026
https://github.com/stdlib-js/ndarray-base-assert-is-data-type-string
Test if an input value is a supported built-in ndarray data type string.
array assert base check data dtype is javascript multidimensional ndarray node node-js nodejs stdlib test types util utilities utility utils
Last synced: 16 May 2026
https://github.com/analyticslover/sales-python-dashboard
Japan Sales Dashboard 2023
dashboards data data-analysis jupyter-notebook python3 sales streamlit
Last synced: 09 Apr 2026
https://github.com/wilcotomassen/lorem-datum-core
Java based data generator for data simulation
data dataset generator java lorem-ipsum simulated-data
Last synced: 11 Jan 2026
https://github.com/samridhisainii/airbnb-data-analysis
Data analysis of airbnb dataset
analysis data data-visualization eda models
Last synced: 16 May 2026
https://github.com/ahmad-ali-rafique/logistic-regression-modeling
An in-depth exploration of logistic regression models, including data cleaning, model building, and performance evaluation on various datasets.
accuracy confusion-matrix data dataanalytics logistic-regression logistic-regression-classifier machine-learning-algorithms mlmodels model modelling regression-models
Last synced: 11 Sep 2025
https://github.com/lukaszkn/data-software-engineering-interview-questions
Data and Software engineering interview questions
data engineering interview-questions python
Last synced: 20 Jul 2025
https://github.com/adadalshabab/machine-predictive-maintenance-classification
This repository hosts a machine predictive maintenance classification project, aimed at predicting the maintenance needs of industrial machinery before it fails. By leveraging machine learning algorithms, this project seeks to enhance operational efficiency and reduce downtime by identifying potential maintenance requirements proactively.
data data-science datanalysis datanalytics machine-learning machine-learning-algorithms matplotlib-pyplot pandas
Last synced: 17 May 2026
https://github.com/antoninpvr/battery-logger
Simple scripts to record data from my laptop battery
Last synced: 17 May 2026
https://github.com/basinghse/covid19simulator
Real Time Assessment and Simulation of COVID-19 - showing current numbers of cases, deaths and treated patients globally.
coronavirus covid-19 data real-time simulation visualisation visualisation-data-ingester
Last synced: 05 Apr 2025
https://github.com/amethyst-php/setting
Give the user the ability to configure his own settings
amethyst amethyst-package api data laravel setting
Last synced: 19 May 2026
https://github.com/amethyst-php/employee
amethyst amethyst-package api data employee laravel
Last synced: 14 May 2026
https://github.com/mightymetrika/scdtb
Single Case Design Toolbox
data math r science statistics
Last synced: 04 Jan 2026
https://github.com/amethyst-php/price
Define prices and attach them to any model
amethyst amethyst-package api data laravel price
Last synced: 17 May 2026
https://github.com/ramtinsoltani/safe-cli
A simple Command-line Interface which encrypts and decrypts UTF-8 files using AES-256.
aes-256 cli data data-hook decryption encryption generator handlebars hooks markup partial partial-decryption password safe swap temp temporary tool
Last synced: 16 Apr 2026
https://github.com/amethyst-php/data-view
amethyst amethyst-package api data data-view laravel
Last synced: 19 May 2026
https://github.com/amethyst-php/source
The source of information. It can be used to record the origin of any piece of information (news, books, etc.).
amethyst amethyst-package api data laravel source
Last synced: 27 Apr 2026
https://github.com/pawlo77/messenger-analyser
Repo for Data Visualization project, part of IAD study program at Faculty of Mathematics and Information Science, Warsaw University of Technology
Last synced: 17 May 2026
https://github.com/newrelic-experimental/newrelic-java-aws-kinesis
Provides instrumentation of the Amazon Kinesis Client and Producer
amazon aws client data instrumentation java kinesis nrlabs nrlabs-data nrlabs-odp observability-data producer
Last synced: 15 May 2026
https://github.com/theleopard65/isa-imitation
This repository contains a simple C++ implementation of a von Neumann architecture simulator. The program mimics the behavior of a basic computer architecture that uses a single memory space for both instructions and data. Users can load programs, execute them, and view the current state of the memory and registers.
32-bit 64-bit ac architecture c-plus-plus data executable explained implementation ir isa mar mdr memory pc registers simulation von-neumann x64 x86
Last synced: 18 Mar 2025
https://github.com/shivamsharma32/ipl-2022-analysis
The IPL 2022 Analysis project is a data-driven exploration of the Indian Premier League (IPL) 2022 cricket tournament. The analysis focuses on utilizing Python programming and various libraries to analyze and visualize the performance of teams, players, and key metrics in the IPL 2022 season.
data dataana dataanalytics datavi matplotlib python
Last synced: 17 May 2026
https://github.com/nathanieliskandar26/data-analysis-project
This project demonstrates my ability to clean and analyze data using Python and SQL so far. The dataset used for this analysis focuses on general customer information. Through this project, I aimed to uncover meaningful insights and trends by cleaning the data and performing structured queries.
analysis data data-cleaning jupyter-notebook mysql mysql-database python
Last synced: 19 Apr 2026
https://github.com/apparaomulpuri/readline
Explains the usage of the readLine function in Swift.
data fromkeyboard keyboard reading readline swift
Last synced: 29 Mar 2025
https://github.com/vin20777/drone-data-layer
Drone Project Data Layer
csharp data drone layer software-design
Last synced: 18 May 2026
https://github.com/srindot/average_flightdata_collection_fwuaav
This repository is designed for collecting average data for a flapping wing UAV. The script acg_coeff_data_collection.py runs the necessary data collection, and the resulting data is saved into a CSV file called AverageFlightData.csv.
Last synced: 18 Aug 2025
https://github.com/pedelriomarron/spanish-api-covid19
COVID-19 data for Spain (by Datadista) as a service
api covid-19 covid-19-spain data now spain zeit
Last synced: 12 Mar 2025
https://github.com/solrikk/vargen
VarGen (Variation Generator) is a user-friendly desktop application designed to simplify the creation of product variations from CSV files.
csv-files csv-format csv-parser data data-engineering excel excelparser python
Last synced: 29 Mar 2025
https://github.com/ezmiller/boe-election-data
CSV files containing parsed NYC Bureau of Elections data for 2009 and 2013
Last synced: 18 Oct 2025
https://github.com/zolabar/zolabar.github.io
Selection of zolabar Python projects
conformal-mapping data data-science optimization python regression sympy webapp
Last synced: 16 Aug 2025
https://github.com/fintech-lsi/fintech-credit-risk-prediction
This repository provides a machine learning model for predicting credit risk in the financial sector. The model uses borrower information, such as age, income, employment length, loan amount, and credit history, to assess the likelihood of loan repayment or default.
data fintech machine-learning model prediction risk
Last synced: 12 Oct 2025
https://github.com/ebrizzzz/data-visualization-project-using-tableau
A data visualization project for the Visual Data Analysis course (Spring Term 2025) at the University of Skövde. This project explores the factors influencing national happiness scores across different global regions from 2005 to 2022.
analytics data data-analysis data-science data-visualization python regression tableau
Last synced: 16 Jun 2025
https://github.com/metapsy-project/data-depression-psiloctr
Database of psilocybin-assisted therapies for adults with depression versus control conditions.
Last synced: 01 Mar 2026
https://github.com/a-poor/taro
A package for repeatable rectangular data transformations in Python.
data data-science data-transformation pipeline pypi-package python
Last synced: 13 Oct 2025
https://github.com/amethyst-php/nutrition
amethyst amethyst-package api data laravel nutrition
Last synced: 19 May 2026
https://github.com/yash22222/olympic-games-analytics-using-apache-spark
The "Olympic Games Analytics Using Apache Spark Databricks" project explores data from the Olympic Games (1896-2016) to identify trends and insights. Using Apache Spark for big data processing and Databricks for visualization, the project analyzes key factors like top-performing countries and athlete attributes, showcasing real-world analytics.
apache apache-kafka apache-spark big-data-analytics csv data data-analytics data-visualization databricks excel mysql olympics regions
Last synced: 03 May 2026
https://github.com/hallmx/mx_utils
Utility scripts for software development in data science
colaboratory data development nbdev python science scripts software utlities
Last synced: 19 May 2026
https://github.com/gsmith257-cyber/BIT3434CVE
BIT 3434 project on data mining CVEs and exploits
cve data data-mining exploits research-project
Last synced: 10 Mar 2025
https://github.com/simranjeet97/covid-19
COVID-19 data analysis, covering the key topics needed to understand the impact and possible solutions.
coronavirus coronavirus-analysis coronavirus-dataset coronavirus-prediction coronavirus-tracking covid-19-data-analysis covid19 covid19-data covid19-india dash dash-app dash-plotly data data-analysis data-science data-science-projects data-visualization python3
Last synced: 18 May 2026
https://github.com/shvbsle/image-augmentation
A lightweight CLI for augmenting image datasets for deep learning and ML projects
augmentation data data-augmentation data-augmentation-strategies data-augmentor data-augumentation data-science dataset deep-learning image-processing
Last synced: 12 Sep 2025
https://github.com/amethyst-php/file
amethyst amethyst-package api data file laravel
Last synced: 18 May 2026
https://github.com/amethyst-php/email-subscription
Subscribe your email to our mailing list; we promise no spam will be delivered.
amethyst amethyst-package api data email-subscription laravel
Last synced: 17 Mar 2025
https://github.com/pratik-codes/zomato_data_eda
Cleaned and analysed messy data, and created a predictive model with an accuracy of 93% using a tree regressor algorithm
bengaluru data data-cleaning data-science famous-restaurants restaurants-delivering-online restraunts
Last synced: 27 Mar 2025
https://github.com/hidayathamir/telegram-group-data
Data for 1,865,827 messages in a Telegram group: text, identity, datetime.
bahasa-indonesia data python3 scrape telegram telethon
Last synced: 17 May 2026
https://github.com/amethyst-php/file-generator
amethyst amethyst-package api data file file-generator generator laravel template
Last synced: 22 May 2026
https://github.com/amethyst-php/manga
amethyst amethyst-package api data laravel manga
Last synced: 17 May 2026
https://github.com/encelo/nctracer-data
Data files for the ncTracer project
Last synced: 15 Jan 2026
https://github.com/amethyst-php/legal-entity
amethyst amethyst-package api data laravel legal-entity
Last synced: 17 May 2026
https://github.com/paulveillard/cybersecurity-analytics
An ongoing collection of awesome software, libraries, learning tutorials, documents and books, technical resources and cool stuff about Analytics Engineering in Cybersecurity.
analytics bigdata bigquery cybernetics cybersecurity data data-engineering data-science encryption encryption-decryption seo seo-friendly seo-optimization
Last synced: 28 Mar 2025
https://github.com/ramonmeza/mysteamstats
Visualize your stats from your favorite games on Steam!
data statistics steam steam-api videogame visualization
Last synced: 17 Mar 2025
https://github.com/alexis-gss/games-data
Games Data is a library of information about all games, built with NuxtJs
css3 data games nuxtjs tailwindcss typescript vuejs
Last synced: 13 Mar 2025
https://github.com/naufalbasara/superstores-pipeline
Data Pipeline on Dummy E-commerce with Apache Airflow
airflow data data-engineering data-pipeline data-warehouse postgresql
Last synced: 16 May 2026
https://github.com/ubc-library-rc/intro-data-analysis-python
Introduction to Python for Data Analysis
Last synced: 01 Jul 2026
https://github.com/annaanastasy/mushroom-binary-classification-eda-ml
Explored and modeled a competition dataset of mushroom species, focusing on data cleaning, exploratory data analysis, and building machine learning models for accurate classification of edible and poisonous mushrooms.
binary-classification data data-cleaning-and-preprocessing data-science exploratory-data-analysis machine-learning-algorithms xgboost-classifier
Last synced: 29 Mar 2025
https://github.com/wellingtonmwadali/alx-low_level_programming
ALX sprint one C programming
c data datastructures linked-list loops pointers-and-arrays string structures
Last synced: 04 Apr 2025