Unified curiosity-Driven learning with smoothed intrinsic reward estimation

Volume: 123, Pages: 108352 - 108352
Published: Mar 1, 2022
Abstract
In reinforcement learning (RL), the intrinsic reward estimation is necessary for policy learning when the extrinsic reward is sparse or absent. To this end, Unified Curiosity-driven Learning with Smoothed intrinsic reward Estimation (UCLSE) is proposed to address the sparse extrinsic reward problem from the perspective of completeness of intrinsic reward estimation. We further propose state distribution-aware weighting method and policy-aware...
Paper Details
Title
Unified curiosity-Driven learning with smoothed intrinsic reward estimation
Published Date
Mar 1, 2022
Volume
123
Pages
108352 - 108352
Citation AnalysisPro
  • Scinapse’s Top 10 Citation Journals & Affiliations graph reveals the quality and authenticity of citations received by a paper.
  • Discover whether citations have been inflated due to self-citations, or if citations include institutional bias.