Search Results - Springer

Page %P

Close Plain text

Sort By Newest First Oldest First

Article

Decentralized multi-task reinforcement learning policy gradient method with momentum over networks

To find the optimal policy quickly for reinforcement learning problems, policy gradient (PG) method is very effective, it parameters the policy and updates policy parameter directly. Besides, momentum methods ...

Shi Junru, Wang Qiong, Liu Muhua, Ji Zhihang, Zheng Ruijuan… in Applied Intelligence (2023)