'On the Sample Complexity and Metastability of Heavy-tailed Policy Search in Continuous Control', by Amrit Singh Bedi, Anjaly Parayil, Junyu Zhang, Mengdi Wang, Alec Koppel.

http://jmlr.org/papers/v25/21-1343.html

#reinforcement #optimality #exploration

On the Sample Complexity and Metastability of Heavy-tailed Policy Search in Continuous Control