Unified curiosity-Driven learning with smoothed intrinsic reward estimation

Fuxian Huang; Weichao Li; Jiabao Cui; Yongjian Fu; Xi Li

doi:https://doi.org/10.1016/j.patcog.2021.108352

doi.org/10.1016/j.patcog.2021.108352

Unified curiosity-Driven learning with smoothed intrinsic reward estimation

,

,

..., Xi Li

33

Pattern Recognition8.00

Volume: 123, Pages: 108352 - 108352

Published: Mar 1, 2022

Abstract

In reinforcement learning (RL), the intrinsic reward estimation is necessary for policy learning when the extrinsic reward is sparse or absent. To this end, Unified Curiosity-driven Learning with Smoothed intrinsic reward Estimation (UCLSE) is proposed to address the sparse extrinsic reward problem from the perspective of completeness of intrinsic reward estimation. We further propose state distribution-aware weighting method and policy-aware...

Paper Fields

Paper Details

Title

Unified curiosity-Driven learning with smoothed intrinsic reward estimation

DOI

doi.org/10.1016/j.patcog.2021.108352

Published Date

Mar 1, 2022

Journal

Pattern Recognition

Volume

123

Pages

108352 - 108352

Citation AnalysisPro

You’ll need to upgrade your plan to Pro

Looking to understand the true influence of a researcher’s work across journals & affiliations?

Scinapse’s Top 10 Citation Journals & Affiliations graph reveals the quality and authenticity of citations received by a paper.
Discover whether citations have been inflated due to self-citations, or if citations include institutional bias.

Learn more

Notes

History