Anas Barakat
Stochastic Policy Gradient Methods: Improved Sample Complexity for Fisher-non-degenerate Policies
Ilyas Fatkhullin, Anas Barakat, Anastasia Kireeva, Niao He
February 2023
Type: Conference paper
Publication: ICML 2023
Reinforcement Learning
Related
Reinforcement Learning with General Utilities: Simpler Variance Reduction and Large State-Action Space
Why Pass@k Optimization Can Degrade Pass@1: Prompt Interference in LLM Post-training
Policy Gradients for Cumulative Prospect Theory in Reinforcement Learning
On the Global Optimality of Policy Gradient Methods in General Utility Reinforcement Learning
Policy Mirror Descent with Lookahead