Anas Barakat
Analysis of a Target-Based Actor-Critic Algorithm with Linear Function Approximation
Anas Barakat, Pascal Bianchi, Julien Lehmann
January 2022
Type: Conference paper
Publication: AISTATS 2022
Reinforcement Learning
Related
Why Pass@k Optimization Can Degrade Pass@1: Prompt Interference in LLM Post-training
Policy Gradients for Cumulative Prospect Theory in Reinforcement Learning
On the Global Optimality of Policy Gradient Methods in General Utility Reinforcement Learning
Policy Mirror Descent with Lookahead
Reinforcement Learning with General Utilities: Simpler Variance Reduction and Large State-Action Space