An open API service indexing awesome lists of open source software.

https://github.com/udialter/equivalence-testing-multiple-regression

I constructed a simulation study to evaluate the statistical performance of two equivalence-based tests and compared it to the common, but inappropriate, method of concluding no effect by failing to reject the null hypothesis of the traditional test. I further propose two R functions to supply researchers with open-access and easy-to-use tools that they can flexibly adopt in their own research.
https://github.com/udialter/equivalence-testing-multiple-regression

equivalence-testing multiple-regression null-hypothesis psychology quantitative-methods quantitative-psychology

Last synced: 3 months ago
JSON representation

I constructed a simulation study to evaluate the statistical performance of two equivalence-based tests and compared it to the common, but inappropriate, method of concluding no effect by failing to reject the null hypothesis of the traditional test. I further propose two R functions to supply researchers with open-access and easy-to-use tools that they can flexibly adopt in their own research.

Awesome Lists containing this project

README

          

# Equivalence Testing for Multiple Regression

Equivalence testing can be applied to evaluate whether an observed
effect from an individual predictor in a multiple regression model is
small enough to be considered statistically and practically negligible
([Alter & Counsell, 2021](https://psyarxiv.com/ugc9e/)). For more
information, please refer to the [OSF
page](https://osf.io/w96xe/ "Equivalence Testing for Multiple Regression OSF Page")
and/or a freely available [preprint on
PsyArXiv](https://psyarxiv.com/ugc9e/ "PsyArXiv preprint").

The following functions offer appropriate equivalence-based alternatives
for concluding negligible effect between a predictor and outcome in
multiple regression

These R functions are designed to accommodate multiple research contexts
effortlessly, with or without access to the full dataset. The two
functions, `reg.equiv.fd()` and `reg.equiv()`, provide similar output
but differ on the type of input information required by the user.

Specifically, the first function, `reg.equiv.fd()`, requires the full
dataset and model in R (`lm` object), whereas the second does not. The
`reg.equiv()` is intended for researchers who do not have access to the
complete dataset but still wish to evaluate a certain predictor’s lack
of association with the outcome variable in multiple regression, for
example, using information typically presented in a results section or
table reported in a published article.

## `reg.equiv.fd()`: full dataset required

### *Required* input:

- `datfra=` a data frame (e.g., mtcars)
- `model=` the model, an lm object (e.g., `mod1`, where
`mod1<- mpg~hp+cyl`)
- `delta=` the smallest effect size of interest (SESOI), minimally
meaningful effect size (MMES), or upper bound of the equivalence
interval (đť›…) (e.g., .15)
- `predictor=` the name of the predictor to be tested (e.g., `"cyl"`)

### Default settings:

- `test=` test type is set automatically to Two One-Sided Test (TOST;
Schuirmann, 1987), the other option is the Anderson-Hauck (AH;
Anderson & Hauck, 1983)
- `std=` the delta (or, SESOI) is the set as as standardized by
default. Indicate `std=FALSE` to assume unstandardized units
- `alpha=` the nominal Type I error rate is set to .05 by default. To
change, simply indicate the alpha level. E.g., `alpha=.10`

#### `reg.equiv.fd()` example:
![alttext](https://github.com/udialter/equivalence-testing-multiple-regression/blob/master/Figure%204%20reg.equiv.fd%20.png)

## `reg.equiv()`: full dataset _not_ required

### *Required* input:

- `b=` the estimated effect size associated with the predictor of
interest, this could be either standardized or unstandardized (e.g.,
.02)
- `se=` the standard error associated with the effect size of the
predictor of interest (if the effect size is standardized, make sure
the `se` value is tied to the standardized and not the raw effect)
- `p=` the number of total predictors in the regression model
(excluding intercept)
- `n=` sample size
- `delta=` the smallest effect size of interest (SESOI), minimally
meaningful effect size (MMES), or upper bound of the equivalence
interval (đť›…) (e.g., .15)
- `predictor=` the name of the predictor to be tested (e.g., `"cyl"`)

### Default settings:

- `test=` test type is set automatically to Two One-Sided Test (TOST;
Schuirmann, 1987), the other option is the Anderson-Hauck (AH;
Anderson & Hauck, 1983)
- `std=` the delta (or, SESOI) and the indicated effect size are set
as as standardized by default. Indicate `std=FALSE` to assume
unstandardized units
- `alpha=` the nominal Type I error rate is set to .05 by default. To
change, simply indicate the alpha level. E.g., `alpha=.10`

#### `reg.equiv()` example:
![alttext](https://github.com/udialter/equivalence-testing-multiple-regression/blob/master/Figure%205%20reg.equiv.png)

### Recommendations for Test Result Interpretation

Equivalence testing is a method designed within the null-hypothesis
significance testing (NHST) framework. NHST has been heavily criticized
for its overreliance on the dichotomous results of *p* values with
little, or no consideration of the effect’s magnitude or its
implications in practice (e.g., Cumming, 2012; Fidler & Loftus, 2009;
Harlow, 1997; Kirk, 2003; Lee, 2016 2014). Researchers must be mindful
of the limitations of NHST, and disentangle the practical and
statistical aspects of the test results.

To minimize the limitations of *p* values, it is more informative to
**interpret the observed effect’s magnitude and precision** beyond the
conclusion of “negligible effects” or “insufficient evidence for
negligible effects.” **Observed effects should be construed in relation
to the equivalence bounds, the extent of their uncertainty, and their
practical implications (or lack thereof)**. For this reason, **the two R
functions offered here also include a graphical representation of the
observed effect and its associated uncertainty in relation to the
equivalence interval. The resulting plot aids in illustrating how close
or far and wide or narrow the observed effect and its margin of error
are from the equivalence bounds; inferring about the proportion and
position of the confidence band in relation to the equivalence interval
can help interpret the results over and above *p* values**.

## For an in-depth literature review, disucssion, recommendations, and detailed tutorials, please refer to [Alter & Counsell (2021)](https://psyarxiv.com/ugc9e/).

#### References

1. Alter, U., & Counsell, A. (2021, June 17). _Equivalence Testing for
Multiple Regression_.
2. Anderson, S., & Hauck, W. W. (1983). A new procedure for testing
equivalence in comparative bioavailability and other clinical
trials. _Statistics and Communications-Theory and Methods, 12_
,2663-2692.
3. Cumming, G. (2012). _Understanding The New Statistics: Effect Sizes,
Confidence Intervals, and Meta-Analysis_. New York, NY: Taylor &
Francis Group, LLC
4. Cumming, G. (2014). The New Statistics: Why and How. _Psychological
Science, 25_(1), 7–29.
5. Fidler, F., & Loftus, G. (2009). Why figures with error bars should
replace p values: Some conceptual arguments and empirical
demonstrations. _Zeitschrift FĂĽr Psychologie/Journal of Psychology,
217_(1), 27-37.
6. Harlow, L.L. (1997). _Significance Testing in Introduction and
Overview_. In L.L. Harlow, S.A. Muliak & J.H. Steiger (Eds.). What If
There Were No Significance Tests? (pp.1-17). Mahwah, NJ, USA:
Lawrence Erlbaum.
7. Hauck, W. W., & Anderson, S. (1984). A new statistical procedure for
testing equivalence in two-group comparative bioavailability trials.
_Journal of Pharmacokinetics and Biopharmaceutics, 12_(1), 83-91.

8. Kirk, R. E. (2003). _The importance of effect magnitude_. In S. F.
Davis (Ed.), Handbook of research methods in experimental psychology
(pp. 83–105). Malden, MA: Blackwell.
9. Lee, D. K. (2016). Alternatives to P value: Confidence interval and
effect size. _Korean Journal of Anesthesiology, 69_(6), 555-562.

10. Schuirmann, D. J. (1987). A comparison of the two one-sided tests
procedure and the power approach for assessing the equivalence of
average bioavailability. _Journal of Pharmacokinetics and
Biopharmaceutics, 15_, 657-680.
11. Seli, P., Risko, E. F., Purdon, C., & Smilek, D. (2017). Intrusive thoughts: Linking spontaneous mind wandering and OCD symptomatology. _Psychological Research, 81_(2), 392-398. https://doi.org/10.1007/s00426-016-0756-3![image]