Behavior learning based on a policy gradient method: Separation of environmental dynamics and state values in policies

Seiji Ishihara, Harukazu Igarashi

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Citation (Scopus)

Fingerprint

Dive into the research topics of 'Behavior learning based on a policy gradient method: Separation of environmental dynamics and state values in policies'. Together they form a unique fingerprint.

Mathematics

Engineering & Materials Science