#statstab #492 A tiny review on e-values and e-processes

Thoughts: How to be a bayesian while wearing a frequentist hat.

#evalues #eprocess #samplesize #evidence #power #sequential #error #type1

https://www.math.uwaterloo.ca/~wang/files/e-review.pdf

#statstab #376 Safe Flexible Hypothesis Tests for Practical Scenarios

Thoughts: e-values & safe tests "conserve the type I error guarantee (false positive rate) regardless of the sample size"

#SafeAnytimeValidInference #SAVI #evalues #evidence
#safetests
https://cran.r-project.org/web/packages/safestats/vignettes/safestats-vignette.html

Safe Flexible Hypothesis Tests for Practical Scenarios

Just gave my talk at the @KingstonUni Psychology Research Conference! Sharing insights on e-values and Safe Anytime Valid Inferences (SAVI)β€”powerful tools for making statistical inferences that remain valid no matter when you stop collecting data.

#conference #evalues #savi

#statstab #308 A Tutorial on Safe Anytime-Valid Inference [t-test]

Thoughts: Safe Anytime Valid Inferences (SAVI) are the future for frequentist stats. Since #253 I'm more convinced.

#SafeAnytimeValidInference #SAVI #evalues #likelihood #evidence #ttest

https://osf.io/mdbqe/

Supplementary material: A Tutorial on Safe Anytime-Valid Inference: Practical Maximally Flexible Sampling Designs for Experiments Based on e-Values

Hosted on the Open Science Framework

OSF

#statstab #253 Anytime-valid inference in N-of-1 trials

Thoughts: Are frequentists ready to talk about evidence? Safe Anytime Valid Inferences (SAVI) seem like the future.

#SafeAnytimeValidInference #SAVI #evalues #likelihood #evidence #nof1

https://arxiv.org/abs/2309.07353

Anytime-valid inference in N-of-1 trials

App-based N-of-1 trials offer a scalable experimental design for assessing the effects of health interventions at an individual level. Their practical success depends on the strong motivation of participants, which, in turn, translates into high adherence and reduced loss to follow-up. One way to maintain participant engagement is by sharing their interim results. Continuously testing hypotheses during a trial, known as "peeking", can also lead to shorter, lower-risk trials by detecting strong effects early. Nevertheless, traditionally, results are only presented upon the trial's conclusion. In this work, we introduce a potential outcomes framework that permits interim peeking of the results and enables statistically valid inferences to be drawn at any point during N-of-1 trials. Our work builds on the growing literature on valid confidence sequences, which enables anytime-valid inference with uniform type-1 error guarantees over time. We propose several causal estimands for treatment effects applicable in an N-of-1 trial and demonstrate, through empirical evaluation, that the proposed approach results in valid confidence sequences over time. We anticipate that incorporating anytime-valid inference into clinical trials can significantly enhance trial participation and empower participants.

arXiv.org

{first of 2025 πŸ₯³}
#statstab #251 E-values

Thoughts: These seem like the impossible; they are generalizations of likelihood ratios (i.e., used for composite hypotheses), and can provide evidence for/against a hypothesis.

#evalues #evidence #likelihood

https://sas.uwaterloo.ca/~wang/files/e-review.pdf