Policy Gradients in Complex Plan

A Natural Gradients Algorithm for Complex-Valued Reinforcement Learning under Humble Systems Theory

Humble Systems Theorey