I wrote an #rstats blog post to remind myself that just because a log transform improves an ordinary least squares regression in conventional terms (eg makes it more linear and residuals less heteroskedastic), doesn't mean it is more fit for purpose. http://freerangestats.info/blog/2023/07/30/log-transforms
Log transforms, geometric means and estimating population totals

A model that is 'improved' (in terms of making standard assumptions more plausible) by using a logarithm transform of the response will not necessarily be improved for estimating population totals.

free range statistics
@peter_ellis would doing log transformed analysis plus bias correction (i.e., compute mean of predicted lognormal distribution, exp(mu + sigma^2/2)) give you the best of both worlds?
@bbolker maybe I will do a sequel with that and iterative weighted least squares.
@peter_ellis @bbolker (Best of both worlds not guaranteed! AFAIK the standard bias correction is only correct if the underlying population is roughly lognormal; I think this is discussed in Permutt’s paper “Do covariates change the estimand”)
@bbolker @peter_ellis Yes, if the variable is roughly log-normal, as mentioned below. Interestingly, a small bias remains in this case, but the variance of the estimate is improved (σ at 35 vs 36).
@peter_ellis Nice! This is something where it depends on the task at hand - in your example of population totals the geometric mean is bad, if you wanted something closer to a typical individual in a skewed population the geometric mean can be better than the arithmetic.
Thanks @peter_ellis That's a great read. Fascinating how "better model" leads to a worse estimate.
@peter_ellis Super clear and interesting, as always 🙌