Beyond the p Value: Reform Spreads Across the World and Across Disciplines

...as evidenced by this article from Brazil, which I'm delighted to see:

The article's header

I salute Karen Grimmer, JECP co-editor, for publishing it,

https://thenewstatistics.com/itns/2026/03/04/beyond-the-p-value-reform-spreads-across-the-world-and-across-disciplines/

#NHST #OpenScience #Replication #StatisticalReform

#statstab #489 On the performance of the Neyman Allocation with small pilots

Thoughts: If you know your treatment condition will have larger variance you can optimise your sample size.

#nhst #samplesize #neynan #heterogeneity #welch #variance #pilot #se

https://www.sciencedirect.com/science/article/pii/S0304407624001398

#statstab #487 More than meets the ITT: A guide for anticipating and investigating nonsignificant results in survey experiments

Thoughts: I see a lot of papers that make at least one of the 7 errors for "no effect".

#survey #nhst #nulleffects #nonsignificant #pvalue #power

https://doi.org/10.1017/XPS.2024.1

More than meets the ITT: A guide for anticipating and investigating nonsignificant results in survey experiments | Journal of Experimental Political Science | Cambridge Core

More than meets the ITT: A guide for anticipating and investigating nonsignificant results in survey experiments - Volume 12 Issue 1

Cambridge Core

#statstab #478 Equivalence Tests {marginaleffects}

Thoughts: Often you want to test "no difference" in more complex models than many packages or software permit.
With a few lines of code you can do that for most models.

#Equivalence #noeffect #rstats #TOST #EQ #NHST #hypothesistesting

https://marginaleffects.com/chapters/predictions.html#sec-predictions_visualization

5  Predictions โ€“ Model to Meaning

#statstab #476 Experimental : causal

Thoughts: Randomized experiments are the gold standard for inference for a reason. But they are hard to design.

#design #r #statistics #methods #experiment #tutorial #pedagogy #education #hypothesis #nhst #causal #ancova

https://book.declaredesign.org/library/experimental-causal.html

18  Experimental : causal โ€“ Research Design in the Social Sciences

#statstab #467 Replication, statistical consistency, and publication bias

Thoughts: Is the replication of a a finding the "gold standard" for scientific discovery? Maybe not.

#replication #metascience #metapsychology #statistics #bias #QRPs #nhst

https://doi.org/10.1016/j.jmp.2013.02.003

Thankyou Jiangang! A Ten-Year Journey to Significance Roulette

Jiangang Xia is an enterprising professor at the University of Nebraska who, among many other things, teaches into China. He alerted me some years ago to the difficulty his students in China had accessing my videos because YouTube was blocked for them. So I mou

https://thenewstatistics.com/itns/2026/01/16/thankyou-jiangang-a-ten-year-journey-to-significance-roulette/

#ITNS #NHST #OpenScience #Teaching #TheNewStatistics

#statstab #463 One-and-a-half sided test

Thoughts: Alberson has a more different take on the issues from #461 and #462: human behaviour.

(excerpt from the book)

#pvalue #onesided #NHST #directional #hypothesis #testing #logic

https://www.routledge.com/Statistics-As-Principled-Argument/Abelson/p/book/9780805805284

#statstab #462 The paradox of one-sided vs. two-sided tests of significance

Thoughts: A solution to Royall's paradox from #461. The "null" is not one thing.

#pvalue #Royall #paradox #onesided #nhst #null #hypothesis #logic

https://www.onesided.org/articles/the-paradox-of-one-sided-v-two-sided-tests-of-significance.php

The paradox of one-sided vs. two-sided tests of significance | OneSided.org

A lot of people find as paradoxical the claim that a one-sided test of significance at a given p-value offers the same type I error guarantees as a two-sided test that produced the same p-value. Here I solve the paradox in its informal version and a formal version put forth by Royall.

www.onesided.org

#statstab #458 There is still only one test

Thoughts: Downey make a good point that all/any testing ensemble does essentially the same thing.

#randomization #testing #nhst #pvalue

https://allendowney.blogspot.com/2016/06/there-is-still-only-one-test.html

There is still only one test

In 2011 I wrote an article called "There is Only One Test" , where I explained that all hypothesis tests are based on the same framework, wh...