Asking for #rl opinions.
Is a value function a model in the RL sense? Why? Why not?
Feels like the difference between model-based and value-based methods is getting more and more arbitrary.
@jhamrick Yes! I was implicitly referring to value-equivalent/value-aware models.
Since they are not constrained to be similar to the actual transition model, I sometimes wonder if it is more natural to think of them simply as inducing particular inductive biases (maybe more precisely, learning architectures) for value-based RL, and not really as part of model-based methods.
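
To make the point concrete, here is a minimal sketch of a "model" that looks nothing like the real dynamics yet is indistinguishable from them as far as one value function is concerned. Everything in it is an assumption for illustration: a made-up 3-state, single-action MDP, and the names `P_true`, `P_model`, and `bellman_backup` are hypothetical, not taken from any paper or library.

```python
import numpy as np

gamma = 0.9                       # discount factor (assumed)
v = np.array([0.0, 1.0, 2.0])     # some fixed value function over 3 states (assumed)
r = np.array([1.0, 0.0, 0.5])     # per-state rewards (assumed)

# "Actual" transition matrix of the toy MDP: rows are current states.
P_true = np.array([
    [0.50, 0.0, 0.50],
    [0.00, 1.0, 0.00],
    [0.25, 0.5, 0.25],
])

# A learned "model" with very different transition probabilities.
P_model = np.array([
    [0.0, 1.0, 0.0],
    [0.5, 0.0, 0.5],
    [0.0, 1.0, 0.0],
])

def bellman_backup(P, r, v, gamma):
    """One-step Bellman backup: r + gamma * E[v(s')] under transitions P."""
    return r + gamma * P @ v

backup_true = bellman_backup(P_true, r, v, gamma)
backup_model = bellman_backup(P_model, r, v, gamma)

# The transition matrices differ, but the backups for this v coincide.
print("models identical: ", np.allclose(P_true, P_model))
print("backups identical:", np.allclose(backup_true, backup_model))
```

In this toy case the model only has to match the expected next-state value for this particular `v`, so it behaves more like a constraint on the value update than like a description of the environment.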