Anas Barakat
Anas Barakat
Home
Research
Talks
Teaching
CV
Contact
Light
Dark
Automatic
Policy Mirror Descent with Lookahead
Kimon Protopapas
,
Anas Barakat
February 2024
Arxiv
Type
Conference paper
Publication
Under review
Related
Independent Learning in Constrained Markov Potential Games
Reinforcement Learning with General Utilities: Scaling to Large State Action Spaces via Occupancy Measure Approximation
Learning Zero-Sum Linear Quadratic Games with Improved Sample Complexity
Reinforcement Learning with General Utilities: Simpler Variance Reduction and Large State-Action Space
Stochastic Policy Gradient Methods: Improved Sample Complexity for Fisher-non-degenerate Policies
Cite
×