https://github.com/lgrz/pairwise-ttest

Scripts to perform pairwise t-test on TREC run files
https://github.com/lgrz/pairwise-ttest

bonferroni err evaluation information-retrieval map ndcg rbp statistics trec ttest

Last synced: 7 months ago
JSON representation

Scripts to perform pairwise t-test on TREC run files

Host: GitHub
URL: https://github.com/lgrz/pairwise-ttest
Owner: lgrz
Created: 2018-06-19T00:55:13.000Z (almost 8 years ago)
Default Branch: master
Last Pushed: 2021-07-08T04:52:22.000Z (almost 5 years ago)
Last Synced: 2024-12-29T06:04:45.923Z (over 1 year ago)
Topics: bonferroni, err, evaluation, information-retrieval, map, ndcg, rbp, statistics, trec, ttest
Language: Shell
Size: 8.79 KB
Stars: 8
Watchers: 2
Forks: 2
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

# Pairwise t-test

Scripts to perform pairwise t-test on TREC run files.

### Requirements

* R
* [reshape2][reshape2]
* [gdeval.pl][gdeval] - note this is a fork that adds options `-k ` and
`-j ` to the original [trec-web/trec-web-2013][trecweb]
* [trec\_eval][treceval]
* [rbp\_eval][rbpeval]

[reshape2]: https://cran.r-project.org/web/packages/reshape2/index.html
[gdeval]: https://github.com/lgrz/trec-web-2013
[treceval]: https://trec.nist.gov/trec_eval
[rbpeval]: https://people.eng.unimelb.edu.au/ammoffat/rbp_eval-0.2.tar.gz
[trecweb]: https://github.com/trec-web/trec-web-2013

### Usage

There are two bash scripts to run. First run `pairwise-eval.sh` to evaluate the
TREC run files. Then run `pairwise-ttest.sh` to compute statistical
significance.

The bash scripts assume that `rbp_eval`, `gdeval.pl` and `trec_eval` can be
found in your `PATH` environment.

To compute a pairwise t-test of all run files in the `runs` directory for
NDCG@10 using `foo.qrels` (which contains the relevance judgments), run
the following:

```
./pairwise-eval.sh ndcg 10 foo.qrels runs/*.run
./pairwise-ttest.sh runs/*.run.ndcg10 > result.txt
cat result.txt
```

The `pairwise-eval.sh` script can compute ERR, NDCG, RBP and MAP. `gdeval.pl`
is used for ERR and NDCG, `rbp_eval` for RBP, and `trec_eval` is used for MAP.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/lgrz/pairwise-ttest

Awesome Lists containing this project

README