data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum
- Last updated: 2026-07-02 00:07:45 UTC
https://github.com/chaitanyac22/hr_policy_query_resolution_with_retrieval_augmented_generation_rag
This repository contains an HR Policy Query Resolution system using Retrieval-Augmented Generation (RAG). It leverages a 4-bit quantized Mistral-7B-Instruct-v0.2 LLM and JP Morgan Chase’s publicly available Code of Conduct documents to generate accurate, contextually relevant responses for HR policy queries.
artificial-intelligence data hr large-language-models llm mistral-7b nlp pipeline prompt-engineering quantization rag retrieval-augmented-generation
Last synced: 12 Feb 2026
https://github.com/farovictor/mongodbextractor
This project is a data extractor for ELT pipelines or any process that requires a heavy data dump from MongoDB databases.
Last synced: 14 Jan 2026
https://github.com/tayeva/eia-client-python
EIA Open Data API Client - Python
data open-source python python-3 python3
Last synced: 14 Oct 2025
https://github.com/colour-science/colour-hdri-examples-datasets
Colour - HDRI - Examples Datasets
color color-science color-space color-spaces colorspace colorspaces colour colour-science colour-space colour-spaces colourspace colourspaces data dataset datasets hdr hdri raw tone-mapping tonemapping
Last synced: 19 Mar 2026
https://github.com/colour-science/colour-demosaicing-tests-datasets
Colour - Demosaicing - Tests Datasets
color color-science color-space color-spaces colorspace colorspaces colour colour-science colour-space colour-spaces colourspace colourspaces data dataset datasets de-mosaicing debayering demosaicing demosaicking raw
Last synced: 19 Mar 2026
https://github.com/skylinenando/javascript
autocomplete browser data disable events javascript language loop
Last synced: 14 Feb 2026
https://github.com/priyanka7411/customer-segmentation-churn-dashboard
📊 Streamlit + Plotly dashboard for customer segmentation, RFM analysis, and churn prediction using machine learning.
churn data machine-learning pandas prediction python rfm rfm-analysis streamlit visualization
Last synced: 14 Apr 2026
https://github.com/ium101/files-and-folders-lister-z
Files and Folders Lister Z is a utility for listing the contents of directories on your computer. It provides both a command-line and a graphical user interface (GUI) for easy use.
application application-code brasil brazil cmd command data database databases exe filemanagement filesystem linux lowcode macos python sh tool utility windows
Last synced: 09 Oct 2025
https://github.com/mednour2019/devolap
OLAP Cube Dispatcher Tool
analysis-services csharp data excel excel-export kpi mdx metroframework mvvm-architecture sql wpf
Last synced: 27 Jan 2026
https://github.com/yeisonmontoya1815/machine-learning_prediction_can_inflation
We aim to predict trends in the Canadian market basket using sentiment analysis techniques. Sentiment analysis involves analyzing text data to determine whether the sentiment expressed is positive, negative, or neutral.
algorithms-and-data-structures data data-analysis data-science data-visualization feature-engineering machine-learning matplotlib-pyplot numerical-analysis numpy pandas pipelines python sklearn structured-data super unsupervised-learning
Last synced: 05 Feb 2026
https://github.com/ngambip/diabetes_factors_2024
Exploring BMI Categories and Health Factors.
dashboards data datacleaning dax-languague powerbi sql sqlstudio tsql visualization
Last synced: 03 Mar 2026
https://github.com/antoineaugusti/purchasing-power
Archive daily data about purchasing power parity: how much goods should cost in various countries
archive data purchasing-power-parity
Last synced: 28 Oct 2025
https://github.com/leechristophermurray/parquetframe
Unlocking the power of Parquets
data data-analysis dataframe entity-framework etl graph interactive python rust workflow worklow zanzibar
Last synced: 28 May 2026
https://github.com/mostafanabieh/image-classification-with-data-augmentation
Project for data augmentation with TensorFlow v2
data deep-learning image-classification machine-learning tensorflow tensorflow2
Last synced: 07 May 2026
https://github.com/helixspiral/ndbc
Golang wrapper for the National Data Buoy Center (NDBC)
data data-science golang government-data ndbc ndbc-buoy-data noaa noaa-api noaa-buoys noaa-data noaa-weather wrapper wrapper-api wrapper-library
Last synced: 14 Jun 2025
https://github.com/stanford-oval/medxchange
Medical Data Exchange (MedXchange) platform
data ethereum exchange medical medxchange
Last synced: 16 May 2026
https://github.com/mohammadkarbalaee/introduction-to-data-science-sbu
Reports and full documentation of the introduction to data science course held at SBU
data data-science python shahid-beheshti-university
Last synced: 27 Mar 2025
https://github.com/royruddle/vizdataquality
Python package for visualizing data quality
data data-science data-visualization jupyter-notebook missing-data python
Last synced: 05 May 2025
https://github.com/stdlib-js/array-int32
Int32Array.
array data int int32 int32array integer javascript long node node-js nodejs signed stdlib structure typed typed-array types
Last synced: 27 May 2026
https://github.com/rrighart/rrighart.github.io
A webpage about data science, programming, statistics and related topics
analyses data data-mining programming statistics
Last synced: 20 Jan 2026
https://github.com/aaronmeder/social-history
A quick look into your history on social media. Drop in the archives you've downloaded from Facebook and Instagram and see some stats about your time on the networks.
archives data facebook instagram statistics stats
Last synced: 27 Mar 2025
https://github.com/thekartikeyamishra/data_cleaning_project
Welcome to the Data Cleaning and Visualization project! This repository demonstrates how to clean messy data and create insightful visualizations using Python with Pandas and Matplotlib.
data dataanalysis matplotlib matplotlib-pyplot pandas python
Last synced: 02 May 2026
https://github.com/godeltech/godeltech.data
.NET library to access data storage with Unit of Work, Repository and Entity classes
data entity repository unitofwork
Last synced: 30 Apr 2025
https://github.com/hariprashad-ravikumar/ai-datascience-lab
AI‑DataScience‑Lab is a web app for uploading CSV datasets, cleaning with Pandas, and running quick exploratory analyses and regression models using scikit‑learn. Its modular design supports future AI extensions, like deep learning with TensorFlow or insight generation via the OpenAI API.
ai api azure cloudcomputing data data-analysis data-science data-visualization mathplotlib numpy openai pandas python scikit-learn
Last synced: 02 Aug 2025
https://github.com/wamphlett/input-collection
A smarter and stricter way to capture and validate request data
Last synced: 27 May 2026
https://github.com/courtois-neuromod/anat
Anatomical sub-dataset of Courtois-Neuromod project.
Last synced: 17 Jan 2026
https://github.com/bgmp/tesis-german-deuster
Statistical data for third-party work on a thesis
Last synced: 28 Mar 2025
https://github.com/lmantw/binarion
A simple binary format for storing JavaScript objects.
binary data decoding encoding format javascript
Last synced: 02 Sep 2025
https://github.com/ubc-library-rc/data-manipulation-dplyr
Workshop about data manipulation using the dplyr R package
Last synced: 01 Jul 2026
https://github.com/samboycoding/hungergames-data
data hunger-games javascript json
Last synced: 15 May 2026
https://github.com/richardschoen/ibmixmlservicestd
IBM i XMLSERVICE C# and VB.Net Data Access Service Wrapper for .Net 4.6.1 and above and .Net Core 2.0 and above
as400 cl cobol command data database db2 ddm drda ibm ibmi os400 pase program qcmdexc qcmdexec queue rpg service xmlservice
Last synced: 18 Apr 2025
https://github.com/doughtnerd/pod
Read and write Excel data with Java
data excel extract poi-library
Last synced: 08 Apr 2025
https://github.com/erwan-simon/aws-data-platform-framework
A unified framework to industrialize data ingestion, transformation and pipeline execution on AWS using Terraform, from infrastructure provisioning to runtime execution, designed as a reusable and standalone data platform.
aws data data-framework datalake docker iceberg python spark step-functions terraform terraform-module
Last synced: 23 May 2026
https://github.com/stdlib-js/datasets-suthaharan-single-hop-sensor-network
Labeled wireless sensor network data set collected from a simple single-hop wireless sensor network deployment using TelosB motes.
data dataset datasets javascript labeled machine-learning ml mote motes network node node-js nodejs outlier outliers sample sensor statistics stats stdlib
Last synced: 03 Mar 2025
https://github.com/lukanedimovic/table_editor
A simple table data editor, with easily scalable functions and operations & a nice GUI
data data-science formula java parser parsing preprocessing swing tokenizer
Last synced: 04 Apr 2025
https://github.com/thejeshgn/thejeshgn
data data-visualization datameet india opendata public-interest
Last synced: 15 Jan 2026
https://github.com/unaygney/js-challenges-data-structures-and-algorithms
Repo of the challenges I'm trying to solve to understand data structures and algorithms.
algorithms-and-data-structures data javascript structure
Last synced: 29 Oct 2025
https://github.com/parimala24-ds/datascientistmlinterviewprep24
DATA SCIENTIST ML INTERVIEW PREP 24
data decisiontree interviewquestions linear-regression logistic machine-learning matplotlib numpy pandas python seaborn sklearn
Last synced: 12 Apr 2025
https://github.com/vatshayan/final-year-project-image-recognition
Machine learning project to recognize faces in an image
btech computerscience data facial final image imageclassification learning machine project recognition science students year
Last synced: 29 May 2026
https://github.com/cheminfo/cheminfo-types
chemistry data hacktoberfest schema typescript
Last synced: 03 Apr 2026
https://github.com/nikoshet/exploratory-data-analysis-using-r
Exploratory Data Analysis using R Course Project for M.Sc. 'Data Science and Machine Learning' in NTUA
data data-analysis data-science eda exploratory-data-analysis ggplot2 r
Last synced: 14 May 2026
https://github.com/azawawi/perl6-msgpack
Perl 6 Interface to libmsgpack
data messagepack msgpack perl6 wrapper
Last synced: 12 Jun 2025
https://github.com/ryanmorr/typed
Statically typed properties for object literals
data javascript object properties statically-typed
Last synced: 12 Jun 2026
https://github.com/wpp-public/akqa-nz-tagmanager-connector
A simple javascript library to send events to a tag manager container
Last synced: 05 Apr 2025
https://github.com/eosdis-nasa/earthdata-pub-dashboard
Front-end Dashboard for Earthdata Pub
data earthdata edpub publication
Last synced: 15 Jan 2026
https://github.com/stdlib-js/array-ones-like
Create an array filled with ones and having the same length and data type as a provided array.
array data float32array float64array int16array int32array javascript matrix ndarray node node-js nodejs stdlib structure typed typed-array types uint32array vector
Last synced: 05 Jan 2026
https://github.com/inspect-js/is-data-view
Is this value a JS DataView? This module works cross-realm/iframe, does not depend on instanceof or mutable properties, and works despite ES6 Symbol.toStringTag.
data dataview ecmascript javascript typedarray typedarrays view
Last synced: 05 Apr 2025
https://github.com/keosariel/nairagazer-clustered-news
Provides clustered news data, specifically Nigerian news. This repo contains Nigerian news and its coverage. Data is from Nairagazer.
ai data data-science news nigeria nigerian-data python
Last synced: 30 Aug 2025
https://github.com/jimut123/scrapers
All Scrapers that I'll build
bs4 data python3 real-time-visualisations scrapers scrapy wget
Last synced: 16 Jan 2026
https://github.com/leeper/mcode
Functions to merge and recode across multiple variables
data data-transformation r recode recoding
Last synced: 16 May 2025
https://github.com/ahmedkhalf/arabic-keyword-scraper
Stop wasting your time! Obtain Arabic definitions without having to look them up.
arabic data definitions scraper sentences wordsearch
Last synced: 12 Mar 2025
https://github.com/hmeleiro/alquilermad
Housing rent map in Comunidad de Madrid / Mapa del alquiler en la Comunidad de Madrid
data data-science data-visualization datascience housing-location-visualization rent renting
Last synced: 13 Sep 2025
https://github.com/zarr-developers/cookiecutter-zarr-store
Cookiecutter for Zarr store implementations
chunked data n-dimensional zarr
Last synced: 16 Jun 2025
https://github.com/ymougenel/referencecollector
Helps you gather, store, and share reference links
ansible data docker keycloak kotlin spring-boot thymeleaf
Last synced: 14 Apr 2026
https://github.com/marlenezw/speech-to-text
Turn any video or audio recording into a written transcript using Python
data data-science python speech speech-recognition speech-synthesis speech-to-text
Last synced: 27 Apr 2026
https://github.com/dhruvldrp9/simpledht
A Python-based Distributed Hash Table (DHT) implementation enabling cross-network key-value storage, automatic node discovery, and data replication with a simple CLI and library interface.
cross-network-node-communation data data-replication data-synchronization dht dht-python distributed-hash-table key-value-storage nat netowork node-discovery peer-to-peer peer-to-peer-network python sha-256 simple udp udp-socket-communication
Last synced: 28 Feb 2026
https://github.com/serhatkacmaz/cpp-datastructuresandalgortihms
Contains code related to data structures
algorithms cplusplus data data-structures
Last synced: 10 Jul 2025
https://github.com/johntocci/nullaxe
Nullaxe is a powerful and user-friendly Python library designed for cleaning and preprocessing data. It works seamlessly with both pandas and polars DataFrames, making it a versatile tool for data scientists and developers.
data data-analysis data-science datacleaning pandas polars python
Last synced: 06 Apr 2026
https://github.com/datafold/vhol-demo
Get hands-on examples of dbt + Datafold CI/CD workflows
data data-engineering datafold dbt diff
Last synced: 28 Dec 2025
https://github.com/ciscorn/tinybufr
A Rust library for decoding BUFR meteorological observation data format
bufr data meteorology rust weather wmo
Last synced: 11 Jan 2026
https://github.com/utrechtuniversity/dataprivacysurvey
Code for analysing data from the Data Privacy Survey (2022)
data gdpr open-science privacy rdm research research-data-management survey utrecht-university
Last synced: 16 Jun 2025
https://github.com/bastgau/snow-revoke-privileges
Script designed to simplify the management of permissions in your Snowflake databases.
data database dba dev-container python snowflake
Last synced: 20 Apr 2025
https://github.com/memair/apps
App Store for Memair
apps appstore data data-science quantified-self
Last synced: 06 Apr 2026
https://github.com/victoorv/breast_cancer
Mammographic images classification.
breast-cancer breast-cancer-classification classification cnn cnn-classification convnext convnext-tiny convolutional-neural-networks data data-science data-visualization feature-tuning image image-classification mammogram-images mammographic-images neural-network resnet-50 resnet50 transfer-learning
Last synced: 27 Jan 2026
https://github.com/andrei-vataselu/data-science-snippets
🧰 Essential EDA and Data Cleaning Helpers for Any DataFrame. This collection of functions is designed to accelerate exploratory data analysis (EDA), quickly surface data quality issues, and offer high-level insights into the structure and content of your dataset.
artificial-intelligence data data-science eda feature-engineering hyperparamater-tunning library loading model-evaluation modeling preprocessing python snippets text-processing time-series visualization
Last synced: 10 Mar 2026
https://github.com/radekbednarik/data_generator
Random data generator using Python. Generate data files with random strings, floats, ints, and dates via the console or TOML files.
csv data generator python python3 random test-data-generator
Last synced: 13 Dec 2025
https://github.com/antvis/create-antv-demo
A simple CV-dashboard framework for practicing how to use AntV.
antv cv dashboard data resume resume-template resume-website visualization
Last synced: 09 Apr 2025
https://github.com/jackallabs/canine-oracle
The Oracle Daemon for the Jackal Blockchain
blockchain cosmos data feed jackal oracle stream
Last synced: 06 Feb 2026
https://github.com/praveenpuglia/css-support
The source of truth for CSS browser support info
api browser compatibility css data properties selectors support
Last synced: 31 Mar 2025
https://github.com/cosmos-loops/cosmos-efcore
Cosmos.EntityFrameworkCore is a part of Cosmos.Data, an inline project of COSMOS LOOPS PROGRAMME. This repository provides a package of Microsoft.EntityFrameworkCore to improve development efficiency.
cosmos-loops data efcore entityframeworkcore
Last synced: 14 Aug 2025
https://github.com/real-veersandhu/scifaa-covid-19-project
📈 COVID-19 Data Science Project (2021 Internship @ SCI-FAA)
covid-19 data data-science data-visualization python
Last synced: 14 May 2026
https://github.com/shysolocup/aepl
A Node.js multi-layered class creation package with built-in parenting systems that let you get info from classes above, plus better function and property makers for development that is easier to read and understand, and modding support. Inspired by Roblox's Studio API.
aepl backend classes data framework game-development game-framework javascript js js-class js-framework lightweight nodejs package
Last synced: 28 Oct 2025
https://github.com/satyam4229/college-predictor-system
The college predictor system is a Python-based application that utilizes a machine learning model to predict colleges and their corresponding degree programs and branches based on a student's JEE (Joint Entrance Examination) score.
data data-science jupyter-notebook kaggle prediction python
Last synced: 06 Apr 2026
https://github.com/mahmoud-saeed-mahmoud/loading_state_handler
The StateHandlerWidget manages different UI states—loading, error, empty, and normal—allowing you to customize the displayed widgets for each state.
dart data error flutter flutter-package flutter-widget loading state
Last synced: 10 Mar 2026
https://github.com/kom-senapati/ghw-data-hacks
🌍 Global Hack Week data projects, 📊 focused on exploration, manipulation, and analysis...
Last synced: 12 Mar 2025
https://github.com/danlsn/causality
A Personal Data Platform and the culmination of years of curiosity and learning in the Data Engineering space.
data data-engineering datawarehousing personal-data quantified-self
Last synced: 06 Mar 2026
https://github.com/felixklauke/atomizer
Playing around with Butter Knife, Android bindings, and RxJava.
binding butterknife data java react rx rxjava
Last synced: 15 May 2026
https://github.com/vikyw89/usesyncv
A simplistic React global store with pregenerated CRUD and built-in async fetch
data fetch mobx reactjs reactquery redux state state-management store swr zustand
Last synced: 06 Jan 2026
https://github.com/itzshoaib/hashtegrity
A library for generating hashes, validating data integrity, monitoring file/directory integrity, and checking off-chain data integrity
crypto-hash data data-integrity hacktoberfest hash integrity
Last synced: 07 Mar 2026
https://github.com/heikomuller/histore
Library for maintaining snapshots of evolving tabular data sets
Last synced: 10 Apr 2025
https://github.com/stdlib-js/array-uint16
Uint16Array.
array data int integer javascript node node-js nodejs short stdlib structure typed typed-array types uint uint16 uint16array unsigned
Last synced: 22 Apr 2025