All-Action Policy Gradients

Speaker(s)
Michal Nauman
Affiliation
Uniwersytet Warszawski
Date
Jan. 19, 2023, 12:15 p.m.
Room
room 5050
Seminar
Seminar "Machine Learning"

In this talk, we will discuss policy gradients with many action samples. We will investigate decompositions of policy gradient variance and measure the variance reduction that stems from increasing the number of state and action samples used in estimation. Finally, we will compare various strategies for simulating additional samples using neural networks.
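
As a rough illustration of the variance reduction mentioned in the abstract, the sketch below estimates a score-function policy gradient in a single-state (bandit) problem using one or many action samples per state. It is not the speaker's method: the softmax policy, the fixed Q-values, and the helper names (`pg_estimate`, `exact_gradient`, `total_variance`) are all assumptions made for this example, and the true Q-values stand in for the learned neural network models that the talk compares.

```python
import numpy as np


def softmax(logits):
    # Numerically stable softmax over a 1-D array of logits.
    z = np.exp(logits - logits.max())
    return z / z.sum()


def pg_estimate(logits, q_values, n_actions, rng):
    # Score-function gradient estimate averaged over n_actions sampled actions.
    # For a softmax policy, grad_logits log pi(a) = one_hot(a) - probs.
    probs = softmax(logits)
    actions = rng.choice(len(probs), size=n_actions, p=probs)
    scores = np.eye(len(probs))[actions] - probs
    return (scores * q_values[actions, None]).mean(axis=0)


def exact_gradient(logits, q_values):
    # Closed-form expectation over all actions: pi(a) * (Q(a) - E_pi[Q]).
    probs = softmax(logits)
    return probs * (q_values - probs @ q_values)


def total_variance(logits, q_values, n_actions, n_trials=2000, seed=0):
    # Sum of per-coordinate variances of the estimator across repeated trials.
    rng = np.random.default_rng(seed)
    estimates = np.stack(
        [pg_estimate(logits, q_values, n_actions, rng) for _ in range(n_trials)]
    )
    return estimates.var(axis=0).sum()


if __name__ == "__main__":
    logits = np.array([0.5, -0.2, 0.1])   # hypothetical policy parameters
    q_values = np.array([1.0, 0.0, 2.0])  # hypothetical action values
    for n in (1, 4, 16):
        print(n, total_variance(logits, q_values, n))
```

Because the action samples are drawn independently, the estimator's variance in this toy setting should shrink roughly in proportion to the number of action samples, while its mean stays at the exact gradient.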