Anas Barakat
Policy Mirror Descent with Lookahead
Kimon Protopapas, Anas Barakat
March 2024
Type: Conference paper
Publication: NeurIPS 2024
Reinforcement Learning
Related
A Prospect-Theoretic Policy Gradient Framework for Behaviorally Nuanced Reinforcement Learning
On the Global Optimality of Policy Gradient Methods in General Utility Reinforcement Learning
Reinforcement Learning with General Utilities: Simpler Variance Reduction and Large State-Action Space
Stochastic Policy Gradient Methods: Improved Sample Complexity for Fisher-non-degenerate Policies
Analysis of a Target-Based Actor-Critic Algorithm with Linear Function Approximation