https://github.com/rupt/bailed-statistics

Generate "upper limit" and "discovery" statistics without memory leaks --- let your system bail the leaks overboard --- for use with HistFitter. Clone from CERN GitLab for posterity..
https://github.com/rupt/bailed-statistics

memes statistics

Last synced: about 1 year ago
JSON representation

Generate "upper limit" and "discovery" statistics without memory leaks --- let your system bail the leaks overboard --- for use with HistFitter. Clone from CERN GitLab for posterity..

Host: GitHub
URL: https://github.com/rupt/bailed-statistics
Owner: Rupt
Created: 2022-08-08T10:43:59.000Z (almost 4 years ago)
Default Branch: master
Last Pushed: 2022-08-08T10:48:32.000Z (almost 4 years ago)
Last Synced: 2023-03-04T15:02:54.781Z (over 3 years ago)
Topics: memes, statistics
Language: Python
Homepage: https://gitlab.cern.ch/rtombs/bailed_statistics
Size: 87.9 KB
Stars: 1
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

# Upper limits with limited memory

Freely allocate memory, and let the operating system clean up.

This is an efficient design pattern for short-lived programs;
to track and free memory oneself is duplication of effort.

RooStats and its HistFitter extensions operate under this design,
so we must ensure short lives to our programs using them.

We execute leaky code to simulate small batches of toys in a
pool of processes which frees resources after each batch.

Care is taken to use unique random seeds so results combine accurately.

## Usage

Set up your normal HistFitter environment;
mine uses `lsetup "views LCG_99python2 x86_64-centos7-gcc8-opt"`.

Point `upper_limit_results.py` to your discovery workspace.

Make upper limit results with `invert` and discovery p-values with `test`.

Make plots and tables with with `output` with results combined by `-load`.

## Examples

### Toys

Simulate toys.

```bash
./upper_limit_results.py invert test dump \
-filename results/disc1/Discovery_DRInt_combined_NormalMeasurement_model.root \
-prefix results/disc1/example1 \
-poi mu_Discovery \
-points 0 30 6 \
-ntoys 3000 \
-nbatch 100 \
-processes 16 \
-seed 1

```

Plot and tabulate.

```bash
./upper_limit_results.py output \
-load results/disc1/merged_dump.pickle \
-prefix results/disc1/example \
-poi mu_Discovery \
-lumi 139 \
-channel DR-Example

```

Optional: combine multiple output files.

```bash
./upper_limit_results.py dump \
-load results/disc1/example*_dump.pickle \
-prefix results/disc1/merged

```

### Asymptotics

```bash
./upper_limit_results.py invert test output \
-filename results/disc1/Discovery_DRInt_combined_NormalMeasurement_model.root \
-prefix results/disc1/example_asym \
-poi mu_Discovery \
-lumi 139 \
-channel DRInt \
-points 0 30 6 \
-calculator asymptotic

```

### Find more in ./examples/

## Help
```
usage: upper_limit_results.py [-h] [-lumi LUMI] [-prefix PREFIX]
[-load [LOAD [LOAD ...]]] [-filename FILENAME]
[-workspace WORKSPACE] [-poi POI]
[-points START STOP COUNT] [-ntoys NTOYS]
[-seed SEED] [-nbatch NBATCH]
[-processes PROCESSES] [-calculator CALCULATOR]
[-statistic STATISTIC] [-channel CHANNEL]
[-cl CL] [-use_cls USE_CLS]
operations [operations ...]

Make plots and tables for discovery fit statistics.

positional arguments:
operations instructions from {invert test dump output}; `invert'
scans for upper limits; `test' evaluates a discovery
p-value; `dump' saves results to *_dump.pickle;
`output' saves the plots and table

optional arguments:
-h, --help show this help message and exit
-lumi LUMI luminosity in inverse femtobarns
-prefix PREFIX output file paths' prefix (default: upper_limit)
-load [LOAD [LOAD ...]]
filenames of pickled results from previous runs to
combine; for `dump' or `output' (default: [])
-filename FILENAME workspace file path
-workspace WORKSPACE workspace name in its file (default: combined)
-poi POI parameter of interest name (default: mu_SIG)
-points START STOP COUNT
inclusive linear spacing of poi points; for `invert'
(default: [0.0, 40.0, 20])
-ntoys NTOYS number of toys to simulate (default: 3000)
-seed SEED random seed in [0, 2**16); make yours unique; if None,
we use a mix of time and process id (default: None)
-nbatch NBATCH size of batches which execute leaky code; reduce to
cut memory usage (default: 100)
-processes PROCESSES maximum number of processes for generating toys; also
capped by your cpu count (default: 1)
-calculator CALCULATOR
calculator type in {frequentist, hybrid, asymptotic,
asimov}; see bailed_roostats.CalculatorType;
frequentist is standard with toys; asymptotic is
standard without toys (default: frequentist)
-statistic STATISTIC test statistic type from bailed_roostats.TestStatistic
(default: profile_likelihood_one_sided)
-channel CHANNEL channel name for the `output' tex table (default: DR-
WHO)
-cl CL level for 'upper limits', in (0, 1) (default: 0.95)
-use_cls USE_CLS use CLs for limits; else use CLs+b (default: True)

```

## Absolution

Hypotheses compare through the relative likelihoods they assign to data.

A p-value is a cumulative distribution function evaluated at data; CLs is a
ratio of p-values.

Please present results of this software accurately and clearly.
Examples of false or unclear presentations of a p-value or CLs include:

- as a probability that an hypothesis is true or false,
- as a probability of compatibility with an hypothesis,
- as a probability that data occurred at random, by chance, or by
statistical fluctuation,
- as a likelihood or importance of data,
- as necessary for an optimal or rational decision rule.

Please also respect that the association of a p-value or CLs with the words
'test', 'limit', 'confidence', 'significance', 'exclusion', 'evidence',
'observation' or 'discovery' is nominal, and may not reflect the words' meanings
in English.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/rupt/bailed-statistics

Awesome Lists containing this project

README