https://github.com/jxtse/GEC-Metrics-DSGram

Metrics for Grammatical Error Correction models that is closer to human feedback, proposed a novel dynamic weighting evaluation method 一种新颖的语法纠错模型评价无参考指标，采用大语言模型生成动态权重的评价方法
https://github.com/jxtse/GEC-Metrics-DSGram

Last synced: about 1 year ago
JSON representation

Host: GitHub
URL: https://github.com/jxtse/GEC-Metrics-DSGram
Owner: jxtse
Created: 2024-05-28T09:17:10.000Z (about 2 years ago)
Default Branch: main
Last Pushed: 2024-12-22T06:48:36.000Z (over 1 year ago)
Last Synced: 2024-12-22T07:28:18.493Z (over 1 year ago)
Language: Python
Homepage:
Size: 2.22 MB
Stars: 2
Watchers: 1
Forks: 0
Open Issues: 1
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

StarryDivineSky - jxtse/GEC-Metrics-DSGram - Metrics-DSGram项目旨在为语法纠错（GEC）模型提供更贴近人工反馈的评估指标。该项目提出了一种新颖的动态权重评估方法，利用大型语言模型（LLM）生成动态权重，以更准确地反映不同类型语法错误的严重程度。这种无参考指标无需人工标注的参考答案即可进行评估，降低了评估成本。项目核心在于利用LLM的强大语言理解能力，赋予不同错误类型不同的权重，从而使评估结果更符合人类的直觉。通过动态调整权重，DSGram能够更有效地识别和惩罚严重的语法错误，提高GEC模型评估的准确性和可靠性。该项目为GEC领域的研究人员和开发者提供了一种更有效的模型评估工具，有助于推动GEC技术的进步。 (A01_文本生成_文本对话 / 大语言对话模型及数据)

README

# DSGram: Dynamic Weighting Sub-Metrics for Grammatical Error Correction

## Introduction

**DSGram** is a novel evaluation framework designed to enhance the performance evaluation of Grammatical Error Correction (GEC) models, especially in the era of large language models (LLMs). Traditional reference-based evaluation metrics often fall short due to the inherent discrepancies between model-generated corrections and provided gold references. DSGram addresses this issue by introducing a dynamic weighting mechanism that integrates Semantic Coherence, Edit Level, and Fluency.

This repository contains the code and data associated with the paper: **"DSGram: Dynamic Weighting Sub-Metrics for Grammatical Error Correction in the Era of Large Language Models"** by Jinxiang Xie, Yilin Li, Xunjian Yin, and Xiaojun Wan.

![图片](https://github.com/jxtse/GEC_Metrics_LLM/blob/main/images/FlowChart.png)

## Repository Structure

- `Dataset/`: Contains the datasets used for evaluation, including human-annotated and LLM-simulated sentences.
- `results/`: Directory to store the evaluation results.
- `DSGram/`: Source code for implementing the DSGram evaluation framework.

![图片](https://github.com/jxtse/GEC_Metrics_LLM/blob/main/images/Example.png)

## Citation

If you use DSGram in your research, please cite our paper:
```
@misc{xie2024dsgramdynamicweightingsubmetrics,
title={DSGram: Dynamic Weighting Sub-Metrics for Grammatical Error Correction in the Era of Large Language Models},
author={Jinxiang Xie and Yilin Li and Xunjian Yin and Xiaojun Wan},
year={2024},
eprint={2412.12832},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2412.12832},
}
```

# Tech Used
![Python](https://img.shields.io/badge/python-3670A0?style=for-the-badge&logo=python&logoColor=ffdd54) ![Azure](https://img.shields.io/badge/azure-%230072C6.svg?style=for-the-badge&logo=azure-devops&logoColor=white)

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/jxtse/GEC-Metrics-DSGram

Awesome Lists containing this project

README