Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/gforge/medbench-dataprep

Preparation of data for the MedBench project
https://github.com/gforge/medbench-dataprep

Last synced: 17 days ago
JSON representation

Preparation of data for the MedBench project

Awesome Lists containing this project

README

        

# MedBench-DataPrep

## Overview

Welcome to the MedBench-DataPrep repository. This repository contains R functions necessary for generating data for the MedBench research study. The study aims to create a benchmark dataset of fictional Electronic Health Records (EHRs) to evaluate and improve the performance of Large Language Models (LLMs) in medical documentation tasks.

## Study Objectives

- **Dataset Creation**: Develop a comprehensive dataset of fictional EHRs, covering a wide range of medical scenarios.
- **Benchmarking**: Establish benchmarks to assess LLM performance in handling medical jargon, demographic diversity, and factual inconsistencies.
- **Evaluation**: Implement methods to quantitatively and qualitatively evaluate the generated medical documentation.

## Methods

- **EHR Structure**: Each case includes core medical notes and related lab and medication data.
- **Dataset Composition**: The dataset will reflect standard medical scenarios and include variations to test LLM capabilities under different conditions.
- **Prompt Methodology**: Explore advanced prompting techniques to enhance LLM performance in generating medical documentation.

## Contributing

We welcome contributions from the community. Please refer to our contribution guidelines for more information.

## License

This project is licensed under the MIT License. See the LICENSE file for details.