An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with data-harmonization

A curated list of projects in awesome lists tagged with data-harmonization .

https://github.com/scai-bio/datastew

Python library for intelligent data stewardship using Large Language Model (LLM) embeddings

data-harmonization data-stewardship large-language-models

Last synced: 30 Apr 2025

https://github.com/scai-bio/index

Intelligent data steward toolbox using Large Language Model embeddings for automated Data-Harmonization

data-harmonization data-stewardship embeddings large-language-models semantic-mapping

Last synced: 30 Mar 2025

https://github.com/scai-bio/kitsune

Kitsune is a next-generation data steward and harmonization tool.

data-harmonization data-stewardship embeddings large-language-models semantic-mapping

Last synced: 30 Apr 2025

https://github.com/harmonydata/harmony_examples

Example Jupyter notebook and R scripts using Harmony in real research problems

data data-harmonisation data-harmonization harmonisation psychology python r research

Last synced: 20 Dec 2024

https://github.com/dfornika/amrhike

Proof-of-concept for storing and querying harmonized AMR Genomic Analysis Results in datahike

antimicrobial-resistance clojure data-harmonization datahike triplestore

Last synced: 18 Mar 2025

https://github.com/harmonydata/harmonydata.github.io

Blog for NLP data harmonisation project Harmony, open source solution using Python for psychologists

data-harmonisation data-harmonization deep-learning github-pages harmony harmonydata hugo ssg static-site static-site-generator

Last synced: 18 Feb 2025

https://github.com/jcaperella29/clinical-text-mining_r_script

A lightweight R script for text mining and harmonizing medical phenotype data. Cleans, standardizes, and maps diagnoses to ICD-10 codes, with clinical annotations for enhanced data usability.

biomedical-data clinical-informatics data-cleaning data-harmonization database-integration icd-10 machine-learning medical-data nlp-machine-learning one-hot-encoding phenotype r text-mining

Last synced: 02 Mar 2025