data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-06-27 00:07:33 UTC
- JSON Representation
https://github.com/panodata/tikray
A compact data transformation engine.
data data-transformation data-transformation-pipeline data-transformer jmes jmespath jq jqlang json json-pointer json-transform json-transformation json-translate json-translator transformation transon
Last synced: 04 Oct 2025
https://github.com/mierune/tinybufr
[WIP] A Rust library for decoding BUFR (Binary Universal Form for the Representation of meteorological data) files.
bufr data meteorology rust weather wmo
Last synced: 15 May 2025
https://github.com/aaisha-nexus/sql_company_insights
A beginner-friendly SQL project for managing employee records, departments, and sales transactions. Includes table creation, optimized queries, stored procedures, and window functions to extract business insights.
business-analytics data data-analysis dataanalysis-projects dataanalytics database-schema mssql-database query relational-databases sql sql-query ssms
Last synced: 12 Aug 2025
https://github.com/passly-nl/data
Source code of the data layer.
data passly ticketing typescript
Last synced: 27 May 2026
https://github.com/agdturner/ccg-data
A modularised Java library for processing data sets with classes for: data records; collections of data records; and identifiers.
Last synced: 12 Jan 2026
https://github.com/pchaparro/search-engine
Full stack search-engine created from youtube videos obtained using "web-scraping"
data opensearch python python3 react scraper scraping scraping-websites search search-engine semantic-search sentence-transformers typescript website
Last synced: 17 Apr 2026
https://github.com/samhollings/nhs_data_cleansing
A repo of reusable functions for cleansing data
cleansing data data-cleaning data-cleansing preprocessing pyspark python python3
Last synced: 05 Oct 2025
https://github.com/mnazlukhanyan/da-projects
Портфолио с работами по аналитике данных, показывающие мои навыки, умения и опыт
data data-vizualisation hypothesis-tests matplotlib pandas plotly postgresql product-metrics python scipy seaborn sql visualization
Last synced: 11 Apr 2026
https://github.com/amethyst-php/cycle
amethyst amethyst-package api cycle data laravel
Last synced: 17 May 2026
https://github.com/pathilink/ebury_case
Technical case study in Analytics Engineering using BigQuery, focusing on dimensional modeling and SQL queries for payment and client analysis.
Last synced: 05 Oct 2025
https://github.com/shreeparab1890/indian-elections-2019-analysis-eda
This ipython notebook is the Exploratory data analysis (EDA) of the Indian Lok Sabha Elections 2019.
data data-analysis data-science data-visualization eda exploratory-data-analysis matplotlib numpy pandas plotly python python3 visualization
Last synced: 28 Apr 2026
https://github.com/h-sutiwas/r2de-2025
This repository is related to the Road To Data Engineer Bootcamp by DataTH. It contains all related coursework, some mini projects and other resources within the field of Data Engineering.
data data-engineering data-visualization docker gcp pipeline spark
Last synced: 30 Apr 2026
https://github.com/sebastianhochreiter/sql-projects
business-intelligence data datascience microsoft microsoft-sql-server sql
Last synced: 22 Feb 2026
https://github.com/oroszgy/hunlp-resources
Scripts and resources for making spaCy understand Hungarian.
corpus-linguistics data hungarian hungarian-language hunlp magyarlanc model natural-language-processing nlp resources script spacy wikipedia
Last synced: 18 May 2026
https://github.com/sagarkhese40/python-assginment
python assignment
assignment data data-science data-visualization python seaborn-plots
Last synced: 28 Apr 2026
https://github.com/andrii04/andreamonforte-bi-assignment
Automated Data Pipeline that ingests daily GA4-formatted CSV files from a private Google Cloud Storage bucket, validates and loads them into BigQuery, and prepares analysis-ready views. The solution is built for deployment as a Cloud Function triggered by Cloud Scheduler and uses Python with the Google Cloud Storage and BigQuery client libraries.
automation bigquery cloud cloudfunctions data data-analysis data-engineering etl etlpipeline gcp google googlecloudplatform pipeline python sql
Last synced: 09 Nov 2025
https://github.com/armand-sauzay/datasets
Datasets for machine learning
ai data datasets machine-learning ml
Last synced: 18 Jan 2026
https://github.com/tsbarr/belly-button-challenge
Using front-end development tools (javascript, html and css) I built an interactive dashboard to explore the Belly Button Biodiversity dataset, which catalogs the microbes that colonize human navels.
data data-visualization javascript
Last synced: 04 Mar 2026
https://github.com/0xhericles/ufcg-geojson
GeoJSON file containing the blocks and buildings of the Federal University of Campina Grande.
data data-visualization geojson map open-source ufcg university
Last synced: 09 Feb 2026
https://github.com/vim89/flowforge
Let's be honest - most data pipeline frameworks treat types as suggestions. Config files are strings. Schemas are "validated" at runtime. Data quality is an afterthought. So, let's do differently
archetype data data-contracts data-engineering data-pipelines data-quality data-science database dataengineering datapipeline etl etl-framework pipelines scala scalability spark spark-sql spark-streaming
Last synced: 14 Apr 2026
https://github.com/ashita-ai/ashita-ai.github.io
Ashita AI - The island of misfit data tools
Last synced: 19 Feb 2026
https://github.com/amethyst-php/legal-entity
amethyst amethyst-package api data laravel legal-entity
Last synced: 17 May 2026
https://github.com/ramonmeza/mysteamstats
Visualize your stats from your favorite games on Steam!
data statistics steam steam-api videogame visualization
Last synced: 17 Mar 2025
https://github.com/kunalkumar2001/coffee_sales_project_using_excel_power-bi_and_sql
Coffee Shop Sales Dashboard built using Power BI for visualization and SQL for data extraction and transformation. The project dives deep into sales performance, providing actionable insights for data-driven decisions.
analytics data dataanalytics mssql powerbi sql
Last synced: 26 Jun 2025
https://github.com/amliyanage/data-structures
arrays binary-tree data data-structures graph hashtable linked-list stack
Last synced: 06 Apr 2025
https://github.com/alexis-gss/games-data
Games Data is a library of informations about all games, realised under NuxtJs
css3 data games nuxtjs tailwindcss typescript vuejs
Last synced: 13 Mar 2025
https://github.com/badr-moufad/dashboard-agriedge-data
Prepare data for dashboard. This is part of my research internship.
acquisition dashboard data data-morocco data-science data-visualisation weather weather-dashboard weather-data
Last synced: 04 Apr 2025
https://github.com/preranarao03/madhav_e-commerce_dashboard
This repository features the Madhav_E-Commerce_Dashboard built with Power BI. It provides interactive visualizations for analyzing e-commerce sales performance, product categories, customer segments, and geographic data, aiding in data-driven business decisions.
Last synced: 30 Jan 2026
https://github.com/jszafran/personal-aws-data-lake
Personal, cloud based (AWS), data lake for experimenting with cloud services.
aws cloud data data-engineering dataengineering datalake etl terraform
Last synced: 20 May 2026
https://github.com/amethyst-php/geolocation
amethyst amethyst-package api data geolocation laravel
Last synced: 20 May 2026
https://github.com/redinfinitypro/scientificsharp
Rating: (5/10) The code is a Windows Forms application for a basic scientific calculator, allowing users to perform mathematical operations like addition, subtraction, multiplication, division, trigonometrics, and logarithms.
componentmodel cryptography data drawing forms generic linq system tasks text
Last synced: 06 Apr 2025
https://github.com/antononcube/raku-data-typesystem
Data type system for different data structures.
data data-structures rakulang type-system
Last synced: 09 Jul 2025
https://github.com/mightymetrika/mmirestriktor
Informative Hypothesis Testing Web Applications
data hypothesis infomative power r restriktor statistics testing
Last synced: 17 Mar 2025
https://github.com/alexdonh/adonis-cache
Another cache provider for AdonisJs. Supports Object, File, Db and Redis cache. With cache dependencies!
adonis-framework adonisjs cache data dependency redis storing
Last synced: 15 May 2026
https://github.com/anzerr/storage.ts
Util to store data used in a service
data nodejs storage typescript util
Last synced: 20 May 2026
https://github.com/deva-246/excel-power-query-data-cleaning-dashboard
dashboard data datacleaning excel pivottable powerquery slicer
Last synced: 22 Mar 2025
https://github.com/samharrison7/datamapper
Making mapping between datasets as simple as possible.
data data-mapper data-mapping data-science data-structures
Last synced: 17 Mar 2025
https://github.com/kylepw/multistack
Example of multiple stacks in one array.
algorithms array data data-structures python stack
Last synced: 17 Mar 2025
https://github.com/harrisonwelch/pythondatascience
Repo of code from the linked-in lesson "Python: Data Analysis"
data data-science matplotlib notes numpy python tutorial
Last synced: 12 Apr 2026
https://github.com/farovictor/mongodbloader
This project is intended to be used as a data loader to support ELT pipelines or any kind of process that requires a heavy data load into a MongoDb database.
Last synced: 15 May 2026
https://github.com/stdlib-js/array-base-index-of-same-value
Return the index of the first element which equals a provided search element according to the same value algorithm.
array data find generic index javascript locate node node-js nodejs same scan search stdlib structure types
Last synced: 15 May 2026
https://github.com/mbiushelix/soilresp
Geofag 1 feltarbeid fra Vg2
data data-visualization geology global-warming norwegian-language soil-quality-testing soil-respiration
Last synced: 23 Jul 2025
https://github.com/dina-hosny/sequence-trigger-pair-for-all-schema-tables-plsql
A PLSQL script that creates Sequence Trigger Pair for all Schema's Tables
data oracle plsql sequence sequencetrigger sql toad trigger
Last synced: 06 Mar 2026
https://github.com/arthurcfranklin/acervo-musical
Este projeto consiste na criação de um banco de dados relacional para auxiliar um DJ na organização e catalogação do seu acervo musical. O objetivo é fornecer um sistema eficiente para armazenar e gerenciar informações sobre cantores, bandas, músicas e suas versões remixadas.
data database mysql mysql-database sql
Last synced: 22 Mar 2025
https://github.com/kinshukjainn/dclue-v1
Dsainone is a highly optimized Data Structures and Algorithms (DSA) library designed to provide efficient implementations of graph algorithms, trees, hashing, and linked lists while maintaining exceptional memory efficiency. The library is designed to be as fast and optimized as possible
Last synced: 20 May 2026
https://github.com/erkylima/algorithms
Python project to refresh knowledge on algorithms and data structures. Interactive examples of Bubble, Merge, Quick Sort, along with Lists, Stacks, Queues, and Trees. Challenges included. Recycle your expertise! 🚀 #Python #Algorithms #DataStructures
algorithms algorithms-and-data-structures data data-structures
Last synced: 19 Jan 2026
https://github.com/tpetzoldt/datasets
teaching data sets
data data-analysis-in-r teaching-materials
Last synced: 16 Feb 2026
https://github.com/piyushkumar2025/analytical-sql-project-exploring-trends-segmentation-kpis
A complete SQL analytics project using a simulated data warehouse. It analyzes sales, customer, and product data with CTEs, joins, window functions, subqueries, and views to deliver insights on trends, segmentation, and KPIs, showing how SQL enables data-driven decisions without BI tools.
advanced-sql analytics business-intelligence data data-science-projects datascience joins kpi mysql query sql window-functions-in-sql
Last synced: 02 Jul 2025
https://github.com/lord3008/instances-of-data-analysis
This repository of mine shows my work on data analysis of various projects that I made. I feel data analysis is the very key to investigate a solution. Further more it enlightens the direction towards model building.
Last synced: 03 Mar 2025
https://github.com/xmen3em/kaggle-competitions
This collection contains various projects and notebooks developed to tackle a range of Kaggle competitions, showcasing different machine learning techniques, data preprocessing methods, and model optimizations.
data data-science data-visualization deep-learning deployment ensemble-learning machine-learning-algorithms python streamlit
Last synced: 09 Apr 2026
https://github.com/francois-lenne/portofolio_flenne_streamlit
portofolio francois lenne using streamlit
data portofolio python slack-api streamlit
Last synced: 15 May 2026
https://github.com/newrelic-experimental/newrelic-java-apache-sling
Provides Java instrumentation for Apache Sling framework
apache-sling data instrumentation java nrlabs nrlabs-data nrlabs-java-verify observability-data sling
Last synced: 30 May 2026
https://github.com/madihanazir/ds-using-c
Basic insights into Data Structures (inspired by Abdul Bari course but in C language)
data self-learning structures-in-c
Last synced: 17 Mar 2025
https://github.com/dan149/uselesscontentcreator
Useless Content Creator (UCC) is a fake content generator, text, html and pdf files.
content customizable data easy-to-use fake-data fake-data-generator faker-generator generator lightweight open-source opensource python python3
Last synced: 03 Apr 2025
https://github.com/brunosalerno/osm_data
Ruby objects for dealing with OSM data, and generating XML files
Last synced: 21 Apr 2026
https://github.com/garcane/layoffs-exploratory-data-analysis
This project uses MySQL to perform data cleaning and exploratory data analysis (EDA) on a dataset detailing company layoffs. The primary goal is to process, clean, and explore the data to gain insights into trends and patterns related to layoffs across various sectors.
data dataanalysis eda mysql sql
Last synced: 29 Oct 2025
https://github.com/clagiordano/weblibs-data-export
Library for generic data export to various formats
clagiordano data export weblibs xlsx
Last synced: 22 Mar 2025
https://github.com/webdevcave/collections-php
A PHP library for managing collections of data with support for nested keys.
array collection data helper library nested-keys package php utility utility-classes
Last synced: 23 Feb 2025
https://github.com/jigyasag18/employee-salary-prediction-jigyasa
PayNexus is a machine learning-powered web app that predicts employee salaries based on role, education, and experience. Built using Python, Streamlit, and scikit-learn, it supports both single and batch predictions. The app includes advanced features like resume parsing via NLP and interactive visual analytics. Ideal for job seekers, HR profession
data dataset decision-tree-regressor gradient-boosting-classifier knearest-neighbor-classifier labelencoder lasso-regression linear-regression machine-learning machine-learning-algorithms machinelearning onehot-encoder pipeline random-forest random-forest-classifier ridge-regression standardscaler svr-regression-prediction xgboost xgboost-classifier
Last synced: 15 May 2026
https://github.com/kelvintechnical/web-scraper
Tableau Book Price Analysis
data data-analysis data-science tableau tableau-public
Last synced: 25 Jan 2026
https://github.com/tearth/test-data-generator
The generator of test data for the school project.
Last synced: 05 Jul 2025
https://github.com/cuadros-code/project-7-whitehouse-petitions
create a petitions from white house API
data jsondecoder uiaction uialertcontroller uibarbuttonitem uimenu url
Last synced: 02 Nov 2025
https://github.com/errea/vet_clinic_database
For this project you need special preparation. As the goal of this project is to solve some performance issue, first we need to introduce those issues. In order to do that, you will populate your database with a significant number of data.
data data-analysis data-structures data-visualization database
Last synced: 21 May 2026
https://github.com/danpoynor/data-pagination-and-filtering-project
Data pagination exercise using 'vanilla' JavaScript. This script consumes a JSON array containing any number of objects and adds buttons to a page that users can click to navigate to different pages of data.
data javascript json navigation pagination vanilla-javascript
Last synced: 20 Apr 2026
https://github.com/pbinkley/tweets-online-classes-covid19
A twarc harvest of tweets related to online classes during the COVID-19 outbreak, starting 2020-03-02
Last synced: 06 Mar 2026
https://github.com/kashirin-alex/thither.direct-onamove
an android skeleton-example application for using data from Thither.Direct platform on mobile applications
android-application data data-analysis data-structures data-visualization mobile-development mobility query research-data-management
Last synced: 27 Apr 2026
https://github.com/luminovrym/crawler-tools-js
Crawler Tools Js adalah sebuah aplikasi yang digunakan untuk scrapping data pada sebuah web
crawler crawler-js data js web-scraping
Last synced: 08 Sep 2025
https://github.com/i-rzr-i/domaincommonextensions
The purpose of this repository/library is to provide the most relevant and used extension methods in the life cycle of application development that allow us to improve our code, and writing speed, and use more efficiently dev team time during this period for more complex functionality.
api class data datatype extension helper object parser type util
Last synced: 20 Sep 2025
https://github.com/soenkekluth/micromitter
minimal and performant event emitter / dispatcher
data dispatch dispatcher emit emitter event eventdriven handler on send trigger
Last synced: 02 Nov 2025
https://github.com/yanaksalvo/all-panel-database-sql
Türkiye Cumhuriyeti Devleti'nin verilerini çalarak insanlara satarak para kazanan veya bu paraları kara para aklama şeklinde aklayarak gelir elde eden kişilerin database verileri ve bu sitelere giren kişilerin IP Adres bilgileri
api data database devlet ihbar panel panel-data paneldata panels sorgu sorgulama sorgupanel sql usom usomgovtr
Last synced: 06 Apr 2025
https://github.com/viglino/forets-de-cassini
couche SIG l’ensemble des contours des forêts représentées sur la carte de Cassini (hal-01267936)
Last synced: 18 Feb 2026
https://github.com/siongui/xemaauj9k5qn34x88m4h
No source code. Only serve JSON files of Pāli words
Last synced: 15 May 2026
https://github.com/devprnvk/pycryptochain
A implementation of a blockchain-based cryptocurrency in Python. This project aims to provide a fundamental understanding of blockchain technology and cryptocurrency by building a basic version from scratch. Features include blockchain creation, transaction handling, mining rewards, simulation.
blockchain crypto data decryption encryption hashing processing py python salting storage
Last synced: 09 Mar 2026
https://github.com/amethyst-php/product
An item that is made to be sold or bought
amethyst amethyst-package api data laravel product
Last synced: 21 May 2026
https://github.com/gui-sitton/bank-loans
In this project I will prepare a report for a bank's loan division. I find out whether a customer's marital status and number of children have an impact on loan default, as well as other factors
data data-analysis data-analysis-python data-science data-visualization python
Last synced: 21 May 2026
https://github.com/amethyst-php/token
amethyst amethyst-package api data laravel token
Last synced: 21 May 2026
https://github.com/nyxblabs/mimikra
🔄 Sleek data morphing tool from one file to another
data file filesystem morphing node nodejs sleek tool
Last synced: 21 May 2026
https://github.com/rellyson/data-engineering-tools
This repository holds examples and documentation about the most used tools in the data engineering ecosystem.
apache-airflow apache-spark data data-engineering jupyter-notebook python tools
Last synced: 17 Jan 2026
https://github.com/bastianolea/servel_elecciones
Resultados electorales desde Servel (2024)
chile comunas data elecciones genero
Last synced: 08 Jul 2025
https://github.com/arekflo2002/analiza_danych-rstudio-_dyskryminacja_kobiet
Wykorzystując rstudio oraz zestawy dane ze strony https://www.gapminder.org/data/ badam tematykę dyskrminacjii kobiet na poszczególnych kontynentach i wyciągam odpowiednie wnioski
data data-preparation-and-analysis data-visualization rstudio statistics
Last synced: 14 Apr 2025
https://github.com/the-universal-linux-society/sysreport
Bash script to give you a full system report. Just by running the script it offers insight into CPU data, disk space, temperature readings, network configuration, MAC addresses, firewall status, and system logs for error analysis.
analysis bash bash-script bash-scripting data report reporting system
Last synced: 15 May 2026
https://github.com/jun-labs/algorithm
📝 자료구조, 알고리즘 학습 저장소.
algorithm data data-structures leetcode problem-solving programmers ps structure
Last synced: 14 Mar 2025
https://github.com/fastpix/flutter-core-data-sdk
A comprehensive Flutter SDK for video player analytics and event tracking, designed to provide detailed insights into video playback behavior and user engagement metrics.
Last synced: 15 May 2026
https://github.com/dscamilo/gestion-clientes-springboot
Proyecto de gestión de clientes aplicando Java y Springboot, haciendo uso de Lombok, uso de interface, inyección de dependencias, uso de anotaciones Service, Data, RestController . Consumo de API haciendo uso de Postman.
data interface java lombok-maven restcontroller spring-boot
Last synced: 15 May 2026
https://github.com/shrutakeerti/eye-gaze-detection
This repo contains everything that I have done at IIT Jodhpur Summer Internship May 15 - July 15
ai aiml data eda eeg eeg-signals eye jodhpur mlflow
Last synced: 17 Mar 2025
https://github.com/mksingh431/sql-complete-notes
SQL, or Structured Query Language, is a robust and specialized programming language designed for efficient management and manipulation of relational databases. With SQL, you can seamlessly interact with databases like MySQL, PostgreSQL, Microsoft SQL Server, Oracle,.
Last synced: 21 Apr 2026