Son's Notation

라벨이 Basis function인 게시물 표시전체 보기

[강화학습] 11. Policy Gradient Methods

Sutton and Barto

[강화학습] 11. Policy Gradient Methods

손쓰 12월 07, 2020

Policy Gradient Methods 지금까지 true value function들을 update, estimate하는 방법에 대해서 많…

자세한 내용 보기

[강화학습] 8. On-Policy Prediction with Approximation

Tile coding

[강화학습] 8. On-Policy Prediction with Approximation

손쓰 12월 06, 2020

On-Policy Prediction with Approximation 지금까지 한 모든 방법론은 state/action function을 t…

자세한 내용 보기