Policy Mirror Descent with Lookahead

Publication
To appear in NeurIPS 2024

Related