Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/clap-purdue/gricean-pragmatics

https://clap-purdue.github.io/gricean-pragmatics/
https://github.com/clap-purdue/gricean-pragmatics

benchmark evaluation large-language-models linguistic-competence llama multilingual pragmatics

Last synced: about 1 month ago
JSON representation

https://clap-purdue.github.io/gricean-pragmatics/

Awesome Lists containing this project

README

        

# gricean-pragmatics

A library of five metrics evaluating large language models' pragmatic competence:

1. **Naturalness**: LLMs will generate surprisal scores as a proxy to text naturalness for each sentence in a minimal pair, which reflect how unexpected a sentence is, given the preceding context. Hypothetically, if LLMs show pragmatic sensitivity, LLMs should assign a lower surprisal score to the intended implied meaning in an appropriate context.
2. **Sensitivity to different Shades of Meaning (SSM)**
3. **Pragmatic Reasoning Chains (PRC)**
4. **Implicature Recovery Rate (IRR)**
5. **Pragmatic Sensitivity Index (PSI)**

Benchmark datasets (work-in-progress), examples, and documentation are also provided.