ICYMI this paper really is a must read: https://doi.org/10.1111/ele.14033
Comparing and choosing best models based on AIC etc, then interpret coefficients causally (what's the effect of X on Y?) is flawed, yet so common
We must draw causal assumptions first (DAG)
"Model selection is not a valid method for inferring causal relationships. It's appropriate for predictive inference (which model best predicts Y?), which is fundamentally distinct from causal inference (what is the effect of X on Y?)"
1/