Skip to content

Commit d0a25a5

Browse files
committed
bugfix: rewards -> deltas
1 parent db8c0e5 commit d0a25a5

1 file changed

Lines changed: 1 addition & 1 deletion

File tree

src/ppo/main.clj

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -767,7 +767,7 @@
767767
0.0
768768
(reverse (map vector deltas dones truncates)))))))
769769

770-
;; For example when all rewards are 1.0 and if using an discount factor of 0.5, the advantages approach 2.0 assymptotically when going backwards in time.
770+
;; For example when all deltas are 1.0 and if using an discount factor of 0.5, the advantages approach 2.0 assymptotically when going backwards in time.
771771
(advantages {:dones [false false false] :truncates [false false false]}
772772
[1.0 1.0 1.0]
773773
0.5

0 commit comments

Comments
 (0)