data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum
- Last updated: 2026-06-23 00:07:41 UTC
https://github.com/lorinczakos/sql-projects
A collection of SQL scripts that I wrote, and that were approved, during the GoIT Romania Data Analyst course.
bigquery cte data data-analysis dbeaver marketing-analytics postgresql project-repository sql vscode
Last synced: 16 May 2026
https://github.com/the-tech-idea/beep.winform.sample
Application for managing your different data sources. Still in alpha; please be patient.
application data data-science database dataset integeration mysql nosql oracle postgres sqlite sqlserver workflow-engine workflows
Last synced: 08 Jul 2025
https://github.com/brunosalerno/osm_data
Ruby objects for dealing with OSM data, and generating XML files
Last synced: 21 Apr 2026
https://github.com/vin20777/drone-data-layer
Drone Project Data Layer
csharp data drone layer software-design
Last synced: 18 May 2026
https://github.com/garcane/layoffs-exploratory-data-analysis
This project uses MySQL to perform data cleaning and exploratory data analysis (EDA) on a dataset detailing company layoffs. The primary goal is to process, clean, and explore the data to gain insights into trends and patterns related to layoffs across various sectors.
data dataanalysis eda mysql sql
Last synced: 29 Oct 2025
https://github.com/citizenlabsgr/data.world
Work with data sets prior to uploading to data.world
Last synced: 26 Mar 2025
https://github.com/apparaomulpuri/readline
Explains the usage of the readLine function in Swift.
data fromkeyboard keyboard reading readline swift
Last synced: 29 Mar 2025
https://github.com/webdevcave/collections-php
A PHP library for managing collections of data with support for nested keys.
array collection data helper library nested-keys package php utility utility-classes
Last synced: 23 Feb 2025
https://github.com/nathanieliskandar26/data-analysis-project
This project demonstrates my ability to clean and analyze data using Python and SQL so far. The dataset used for this analysis focuses on general customer information. Through this project, I aimed to uncover meaningful insights and trends by cleaning the data and performing structured queries.
analysis data data-cleaning jupyter-notebook mysql mysql-database python
Last synced: 19 Apr 2026
https://github.com/priyanshubiswas-tech/farmlab-report-and-case-study-iot
This project was developed through live interviews and case studies with farmers in the year 2023 to address key agricultural challenges. The device provides real-time farm insights for better decision-making. Future plans include a digital portal, increased range, more sensors, and improved design. Open to collaboration!
arduino-ide c case case-study data data-analysis iot iot-device serialization
Last synced: 15 Jul 2025
https://github.com/pbinkley/tweets-online-classes-covid19
A twarc harvest of tweets related to online classes during the COVID-19 outbreak, starting 2020-03-02
Last synced: 06 Mar 2026
https://github.com/luminovrym/crawler-tools-js
Crawler Tools Js is an application used for scraping data from a website.
crawler crawler-js data js web-scraping
Last synced: 08 Sep 2025
https://github.com/aliasgarsogiawala/dashboards
Power BI dashboards; each folder contains a pbix file and a PDF file explaining the dashboard.
analysis dashboards data data-visualization powerbi
Last synced: 12 Feb 2026
https://github.com/md-emranhossen/leetcode-practice
This repository stores my solutions to LeetCode problems, organized by problem number and title.
cpp data datastructures-algorithms leetcode-solutions
Last synced: 26 Jun 2025
https://github.com/sharmadhiraj/plot-pi
Graphical Representation of PI
data data-visualization html javascript js mathematics plot
Last synced: 28 Mar 2025
https://github.com/nonsignificantp/enfermedades-inmunoprevenibles
Analysis of the effect of vaccines and the incidence of vaccine-preventable disease cases in the City of Buenos Aires between 1995 and 2016.
a analysis argentina buenosaires data hepatitis science vaccination
Last synced: 18 Jun 2026
https://github.com/siongui/xemaauj9k5qn34x88m4h
No source code. Only serves JSON files of Pāli words.
Last synced: 15 May 2026
https://github.com/kwame-mintah/ml-data-copy-to-aws-s3
Automatically copy new data to an AWS S3 bucket for Machine Learning.
Last synced: 14 May 2026
https://github.com/devprnvk/pycryptochain
An implementation of a blockchain-based cryptocurrency in Python. This project aims to provide a fundamental understanding of blockchain technology and cryptocurrency by building a basic version from scratch. Features include blockchain creation, transaction handling, mining rewards, and simulation.
blockchain crypto data decryption encryption hashing processing py python salting storage
Last synced: 09 Mar 2026
https://github.com/peter7775/mysql-graph-visualizer
SQL database conversion and visualisation as a graph (in development)
analytics analyzer conversion converter data database go golang graph graphql mysql neo4j neo4j-graph refactoring sql visualization
Last synced: 14 Mar 2025
https://github.com/rellyson/data-engineering-tools
This repository holds examples and documentation about the most used tools in the data engineering ecosystem.
apache-airflow apache-spark data data-engineering jupyter-notebook python tools
Last synced: 17 Jan 2026
https://github.com/ayushman0511/data-warehouse-project1
A comprehensive guide to building a data warehouse with SQL Server, including ETL processes, data modeling, and analytics.
data data-ana data-anal data-cleaning data-enginee data-lakehou datalake datasci dataware datawarehouse datawarehousi etl etl-job etl-pipeline medallion sql sql-quer sql-query sql-server sqlserver
Last synced: 26 Jun 2025
https://github.com/majorcluster/clj-data-adapter
A Clojure library designed to convert data
Last synced: 12 Jul 2025
https://github.com/bastianolea/servel_elecciones
Election results from Servel (2024)
chile comunas data elecciones genero
Last synced: 08 Jul 2025
https://github.com/dsietz/daas-workshop
Workshop for building a Data as a Service platform using the DaaS SDK.
archconf daas daas-pattern data dataprivacy nfjs rust rust-lang
Last synced: 20 May 2026
https://github.com/deliprofesor/cardiac-data-analysis-exploring-cholesterol-and-heart-rate
This project analyzes a heart disease dataset to explore the relationship between cholesterol, heart rate, and chest pain type. It includes normality tests, outlier detection, correlation analysis, MANOVA, post-hoc tests, and VIF analysis, with visualizations using histograms, heatmaps, and boxplots.
correlation-analysis data data-cleaning data-visualization machine-learning manova post-hoc-analysis python tukey-hsd vif
Last synced: 17 May 2026
https://github.com/vatshayan/ip-address-data-analysis-
Extraction of hundreds of IP addresses and use of machine learning algorithms to detect threats
data data-analysis data-science data-visualization dataset ip ipconfig ipv4-address jupyter-notebook machine-learning machine-learning-algorithms supervised-machine-learning unsupervised-learning
Last synced: 15 Jul 2025
https://github.com/bcodmo/workshop_bios_oceanographic_data
Repository holding a lesson on Data Management Basics. See webpage for rendered view: https://bcodmo.github.io/workshop_bios_oceanographic_data/
bco-dmo data datamanagement fair workshop
Last synced: 08 Apr 2026
https://github.com/newrelic-experimental/newrelic-java-aws-kinesis
Provides instrumentation of the Amazon Kinesis Client and Producer
amazon aws client data instrumentation java kinesis nrlabs nrlabs-data nrlabs-odp observability-data producer
Last synced: 15 May 2026
https://github.com/jigyasag18/orders-sales-analysis-report-using-power-bi
This repository analyzes and visualizes office supply sales data to improve profitability. It examines sales performance by various factors, using charts to provide insights and actionable recommendations for sales optimization, market research, and product mix.
data dataanalysis dataanalytics dataset powerbi powerbi-dashboards powerbi-report powerbi-reports powerbi-visuals powerbidashboard
Last synced: 18 Feb 2026
https://github.com/pawlo77/messenger-analyser
Repo for Data Visualization project, part of IAD study program at Faculty of Mathematics and Information Science, Warsaw University of Technology
Last synced: 17 May 2026
https://github.com/codehub001/ai-driven-automation-for-data-quality-monitoring-in-cloud-data-warehouses
This project focuses on leveraging AI to automate data quality monitoring in cloud data warehouses. Traditional data validation methods often require manual intervention and fail to scale with increasing data complexity. By integrating machine learning models, this approach enables real-time anomaly detection, automated data cleansing.
csv-export csv-import dashboard data datacleaning lib modeltraining python testing-library visualization
Last synced: 13 May 2025
https://github.com/dolanmiu/mclaren-task
A front-end assessment task for McLaren
angular data observable observables rxjs
Last synced: 16 May 2026
https://github.com/ubeydgur/car-price-prediction
Predicting the price of a used car
ai artificial-intelligence data data-science data-visualization machine-learning machine-learning-algorithms
Last synced: 08 Jun 2026
https://github.com/wolfchamane/amjs-data-types
Data types for your OOP javascript project
cjs data javascript modules nodejs oop types
Last synced: 20 May 2026
https://github.com/andreabozzo/andreabozzo
My personal Repo!
analytics data data-engineering data-visualization database datamodelling developer-profile github-pages github-profile go interactive-animation open-data portfolio python readme-profile rust
Last synced: 17 May 2026
https://github.com/circlexo/circlexo
Open-source project to seamlessly integrate and manage your business workflow, connecting Jira, GitHub, Discord, Stripe, RevenueCat, and OpenAI all in one intuitive platform.
bussiness-intelligence data discord-bot forge github google jira kpis ploi revenuecat stripe vapor
Last synced: 20 May 2026
https://github.com/shimul-zahan/all-practices-tukitaki
This is a repository for all my practice tasks and for learning new things, because the environment is already set up and there is no need to set up a new project or environment.
data data-science datapreprocessing deep-learning machine-learning neural-network practice python visualization
Last synced: 12 Jan 2026
https://github.com/furkankarakuz/turkey_earthquake
This project focuses on analyzing and visualizing earthquake data specific to Turkey. It aims to provide insightful visualizations on topics such as earthquake frequency, location, and magnitude using data obtained from Boğaziçi University Kandilli Observatory and Earthquake Research Institute.
api data data-visualization earthquake python python3 request streamlit turkey turkey-earthquake
Last synced: 20 May 2026
https://github.com/the-universal-linux-society/sysreport
Bash script to give you a full system report. Just by running the script it offers insight into CPU data, disk space, temperature readings, network configuration, MAC addresses, firewall status, and system logs for error analysis.
analysis bash bash-script bash-scripting data report reporting system
Last synced: 15 May 2026
https://github.com/heshamalsaqqaf2/python-projects
Beginner Level Python Projects
Last synced: 22 Jul 2025
https://github.com/raufjatoi/electricity-consumption-prediction
arima-model customize data kinda-dynamic ml
Last synced: 25 Jul 2025
https://github.com/fastpix/flutter-core-data-sdk
A comprehensive Flutter SDK for video player analytics and event tracking, designed to provide detailed insights into video playback behavior and user engagement metrics.
Last synced: 15 May 2026
https://github.com/clagiordano/marketplaces-data-export
Library that shares a common interface and provides adapters for online marketplace services
adapter amazon api clagiordano data ebay ebay-api export marketplaces mws mws-api rest soap
Last synced: 22 Mar 2025
https://github.com/amethyst-php/taxonomy
amethyst amethyst-package api data laravel taxonomy
Last synced: 18 Jan 2026
https://github.com/joseluisq/input-verifier
Some useful functions to check common data input.
Last synced: 19 Jul 2025
https://github.com/amethyst-php/shipment
amethyst amethyst-package api data laravel shipment
Last synced: 20 May 2026
https://github.com/jodus-melodus/queue
Simple Queue
data datastructures linear queue queues
Last synced: 10 Sep 2025
https://github.com/jose-mwangi/my-portfolio
my-portfolio
analytics aws data data-science excel seo-optimization vba-excel webscraping
Last synced: 28 Jul 2025
https://github.com/thesfinox/fit-the-data
Data analysis using Wolfram Mathematica
analysis data data-analysis lab mathematica wolfram wolfram-mathematica
Last synced: 24 Jan 2026
https://github.com/patrikcze/meshtatic_data
Meshtastic Data Transfer - Trying something stupid, like transferring files over a LoRa network.
data meshtastic meshtastic-python
Last synced: 03 Feb 2026
https://github.com/dscamilo/gestion-clientes-springboot
Customer management project built with Java and Spring Boot, using Lombok, interfaces, dependency injection, and the Service, Data, and RestController annotations. The API is consumed using Postman.
data interface java lombok-maven restcontroller spring-boot
Last synced: 15 May 2026
https://github.com/saisurajmatta/data-warehousing-and-advanced-data-analytics
Data Analytics Project: Analyzed Promotions and Provided Tangible Insights to Sales Director
data data-analysis data-architecture data-flow-analysis data-modeling data-pipeline data-segmentation data-visualization data-warehousing docker etl etl-pipeline mssql sql tableau
Last synced: 17 May 2026
https://github.com/jigyasag18/fake-news-prediction-project
The Fake News Prediction App Repository offers a machine learning project that focuses on identifying the authenticity of news articles as fake or real. It uses a dataset of 20,000 articles and employs methods such as TF-IDF vectorization and the Porter stemming algorithm, achieving around 97% classification accuracy with a logistic regression model.
data datapreprocessing logistic-regression machine-learning machine-learning-algorithms numpy pandas prediction stemming vectorization
Last synced: 08 Jun 2026
https://github.com/ramtinsoltani/safe-cli
A simple Command-line Interface which encrypts and decrypts UTF-8 files using AES-256.
aes-256 cli data data-hook decryption encryption generator handlebars hooks markup partial partial-decryption password safe swap temp temporary tool
Last synced: 16 Apr 2026
https://github.com/mightymetrika/scdtb
Single Case Design Toolbox
data math r science statistics
Last synced: 04 Jan 2026
https://github.com/krescruz/pegaso-data
Utilities for analyzing data from the Pegaso invoice certification provider
Last synced: 29 Apr 2026
https://github.com/germanpaul12/flights-data-sky-scraper-api
Sky Scraper - Python app for searching flight information using the Sky Scrapper API.
data flights flights-api scraping
Last synced: 15 Jul 2025
https://github.com/pyfig/s21_data-science-bootcamp
School21 Bootcamp Data Science
data data-science numpy pandas python school21
Last synced: 26 Jun 2025
https://github.com/null-none/py-fear-and-greed
Fear & Greed Index
data fear-and-greed python trading
Last synced: 16 Jul 2025
https://github.com/shrutakeerti/eye-gaze-detection
This repo contains everything that I have done at IIT Jodhpur Summer Internship May 15 - July 15
ai aiml data eda eeg eeg-signals eye jodhpur mlflow
Last synced: 17 Mar 2025
https://github.com/mksingh431/sql-complete-notes
SQL, or Structured Query Language, is a robust and specialized programming language designed for efficient management and manipulation of relational databases. With SQL, you can seamlessly interact with databases like MySQL, PostgreSQL, Microsoft SQL Server, Oracle,.
Last synced: 21 Apr 2026
https://github.com/meta-llama/synthetic-data-kit
Tool for generating high quality Synthetic datasets
data generation llm python synthetic
Last synced: 08 May 2025
https://github.com/vaxdata22/foresight-pharmaceutical
This is a Data Analysis case study done on the Foresight Pharmaceutical Company dataset.
actionable-insights business-analytics business-intelligence data data-analytics data-cleaning data-mining data-visualization data-wrangling exploratory-data-analysis spreadsheets sql sql-server sql-server-management-studio statistical-analysis t-sql transact-sql
Last synced: 05 Mar 2025
https://github.com/danielrosehill/ghg-ebitda-correlations
Streamlit data visualisation examining correlation between emissions & profitability
data sustainability sustainability-data
Last synced: 14 Mar 2025
https://github.com/athari22/statistics-from-stock-data
Statistics from Stock Data
cvs data data-science dataanalysis datacleaning dataframe jupyter pandas pandas-python python statistics stock table
Last synced: 16 Feb 2026
https://github.com/pyrustic/litedao
Intuitive interaction with SQLite database
auto-init dao data database database-access library lightweight pyrustic python sql sqlite
Last synced: 09 May 2026
https://github.com/dhi13man/rca_ace
RCA Ace is designed for organizations seeking to enhance their understanding and utilization of insights derived from Root Cause Analyses (RCAs).
analytics data enterprise open-source python python3 rca
Last synced: 10 Sep 2025
https://github.com/ethenkem/pygraphsurvey
A Python-based web app that provides graphical analysis of data collected from surveys. The system has its own built-in form filling: an admin can set questions and send a link for the forms to be filled in, and the system then provides analysis of the collected data. Form features include selection options, range values, file inputs, etc.
Last synced: 12 Jan 2026
https://github.com/colour-science/colour-streamlit-tm-30-18
Generates the "ANSI/IES TM-30-18 Colour Rendition Report" using Colour and Streamlit
color color-science color-space color-spaces colorspace colorspaces colour colour-science colour-space colour-spaces colourspace colourspaces data dataset datasets streamlit
Last synced: 23 Feb 2025
https://github.com/stdlib-js/dstructs-circular-buffer
Circular buffer.
buffer circular collection cyclic data data-structure data-structures fifo first-in-first-out javascript node node-js nodejs queue ring stdlib structure
Last synced: 20 May 2026
https://github.com/ournet/embed-providers-data
Embed providers data
data embed embed-providers json providers
Last synced: 03 May 2026
https://github.com/gui-sitton/carsells
In this project I am an analyst on the Crankshaft List. Hundreds of free vehicle advertisements are published on the site every day. I need to study the data collected over the last few years and determine which factors influence the price of a vehicle.
data data-analysis data-analysis-python data-science data-visualization python
Last synced: 20 May 2026
https://github.com/amethyst-php/contact
amethyst amethyst-package api contact data laravel
Last synced: 20 May 2026
https://github.com/avijeetpandey/quizzez
Implementation of the quizzez application using Kotlin
Last synced: 20 May 2026
https://github.com/amethyst-php/shipment-zone
amethyst amethyst-package api data laravel shipment-zone
Last synced: 20 May 2026
https://github.com/pooja-manjunatha/nyc_parking_violations_dbt
This project uses dbt to transform NYC parking violations data through a layered architecture: Bronze (raw ingested data), Silver (cleaned and enriched data), and Gold (aggregated tables for analytics). Using DuckDB as the warehouse backend, it ensures data quality with tests and documentation. The project enables reliable analysis of parking violations
data data-analysis data-engineering dbt duckdb python sql
Last synced: 14 May 2026
https://github.com/ahmad-ali-rafique/random-forest-classifier-modeling
Detailed exploration of random forest classifiers, including data cleaning, model building, and performance evaluation on various datasets.
classification classification-models data dataanalytics datamodel dataset model-checking models random-forest random-forest-classifier
Last synced: 01 Jun 2026
https://github.com/basis-company/data-player.js
In-memory data layer for fast access to plain normalized data
collection data model traversal
Last synced: 25 Feb 2025
https://github.com/szc126/metadata-nnd-vocalo-twitter
Collects tweets announcing newly posted Vocaloid-related videos - "new VOCALOID/UTAU videos" tweet collection
data nico-nico-douga niconico vocaloid
Last synced: 20 May 2026
https://github.com/estherslabbert/final-capstone-unsupervised-ml
Exploration of USArrests data using unsupervised machine learning
arrests correction data data-analysis data-clustering data-visualization jupyter-notebook machine-learning pca-analysis standardised-data usa
Last synced: 26 Jun 2025
https://github.com/raruto/cockpit-sample-data
Sample data installer addon for Cockpit CMS
Last synced: 17 Mar 2025
https://github.com/lukaszkn/data-software-engineering-interview-questions
Data and Software engineering interview questions
data engineering interview-questions python
Last synced: 20 Jul 2025
https://github.com/disruptek/bloom
bloom filters
bloom data filter hash membership nim probability set structure
Last synced: 04 Apr 2025
https://github.com/stdlib-js/array-base-any-has-property
Test whether at least one element in a provided array has a specified property, either own or inherited.
any array assert data generic has javascript node node-js nodejs prop property stdlib structure test types validate
Last synced: 20 May 2026
https://github.com/tkxwaweru/python_data_manipulation
Manipulating the MASSIVE dataset using Python
data dataanalysis excel python
Last synced: 11 Jan 2026
https://github.com/pcpp94/elexon_pipeline_gb_demand
Guidelines and code snippets for extracting and processing Elexon gross demand data on Databricks. Provides half-hourly GB demand at sectoral (Domestic, Non-domestic), GSP-area granularity, settlement demand, and embedded generation. Supports non-commodity cost calculations for CfD, RO, and FiT.
data electricity elexon gb octopusenergy power powerdata pypsa uk
Last synced: 12 Jul 2025
https://github.com/ahmad-ali-rafique/logistic-regression-modeling
An in-depth exploration of logistic regression models, including data cleaning, model building, and performance evaluation on various datasets.
accuracy confusion-matrix data dataanalytics logistic-regression logistic-regression-classifier machine-learning-algorithms mlmodels model modelling regression-models
Last synced: 11 Sep 2025
https://github.com/phtrempe/l2a
This is a small project which aims to show an example of applied machine learning in Python 3 with the Keras library and its TensorFlow backend to train a neural network model for it to learn to add two integers.
applied data data-science deep-learning keras machine-learning neural-network tensorboard tensorflow
Last synced: 05 May 2026
https://github.com/ahmad-ali-rafique/random-forest-regressor-modeling
Detailed exploration of random forest regressors, including data cleaning, model building, and performance evaluation on various datasets.
data dataanalytics datacleaning evaluation-metrics modeling random-forest random-forest-regression regression regression-analysis
Last synced: 05 Mar 2025
https://github.com/maximkrouk/storage
Lightweight framework for storing data (beta)
cache data keychain memmory storage swift swift5-1 userdefaults
Last synced: 30 Oct 2025
https://github.com/wilcotomassen/lorem-datum-core
Java based data generator for data simulation
data dataset generator java lorem-ipsum simulated-data
Last synced: 11 Jan 2026
https://github.com/ahmad-ali-rafique/electricity-consumption-analysis-household-dataset
This repository contains analysis and predictive modeling of household electricity consumption using Python. It includes data cleaning, exploratory data analysis (EDA), time series forecasting (ARIMA, SARIMA, LSTM), and model evaluation to optimize energy usage.
arima-forecasting artificial-intelligence artificial-neural-networks data data-science dataanalytics datacleaning evaluation-metrics exploratory-data-analysis long-short-term-memory lstmmodel modeling time-series timeseries-forecasting
Last synced: 23 Jun 2025
https://github.com/kunalkumar2001/coffee_sales_project_using_excel_power-bi_and_sql
Coffee Shop Sales Dashboard built using Power BI for visualization and SQL for data extraction and transformation. The project dives deep into sales performance, providing actionable insights for data-driven decisions.
analytics data dataanalytics mssql powerbi sql
Last synced: 26 Jun 2025