Anas Barakat
Reinforcement Learning
A Prospect-Theoretic Policy Gradient Framework for Behaviorally Nuanced Reinforcement Learning
On the Global Optimality of Policy Gradient Methods in General Utility Reinforcement Learning
Policy Mirror Descent with Lookahead
Reinforcement Learning with General Utilities: Simpler Variance Reduction and Large State-Action Space
Stochastic Policy Gradient Methods: Improved Sample Complexity for Fisher-non-degenerate Policies
Analysis of a Target-Based Actor-Critic Algorithm with Linear Function Approximation