Search Page | SpringerLink

Reward Function Design Method for Long Episode Pursuit Tasks Under Polar Coordinate in Multi-Agent Reinforcement Learning

Multi-agent reinforcement learning has recently been applied to solve pursuit problems. However, it suffers from a large number of time steps per...

Yubo Dong, Tao Cui, ... Peng Dong in Journal of Shanghai Jiaotong University (Science)

Article 08 April 2024

Capturing Reward Functions for Autonomous Driving: Smooth Feedbacks, Random Explorations and Explanation-Based Learning

End-to-end reinforcement learning methods recently has yielded successful models [9, 12, 14, 16] in simulation environments such as CARLA [7]. These...

M. C. Güney, Yakup Genç in Smart Applications with Advanced Machine Learning and Human-Centred Problem Design

Conference paper 2023

A reward-based performability modelling of a fault-tolerant safety–critical system

Nowadays, various computer system carries out critical functions. The failure of these systems leads to unacceptable loss. Such systems are called...

Shakeel Ahamad, Ratneshwer Gupta in International Journal of System Assurance Engineering and Management

Article 03 August 2023

Self-Adaptive LSAC-PID Approach Based on Lyapunov Reward Sha** for Mobile Robots

In order to solve the control problem of multiple-input multiple-output (MIMO) systems in complex and variable control environments, a model-free...

**nyi Yu, Siyu Xu, ... Linlin Ou in Journal of Shanghai Jiaotong University (Science)

Article 08 August 2023

Dynamic Weight-based Multi-Objective Reward Architecture for Adaptive Traffic Signal Control System

An Adaptive Traffic Signal Control (ATSC) system uses real-time traffic information to control traffic lights and makes the public transport system...

Abu Rafe Md Jamil, Naushin Nower in International Journal of Intelligent Transportation Systems Research

Article 29 April 2022

Predicting the Reward System of Knowledge Sharing in the Industrialized Engineering Sector Based on Regulatory Mechanisms

Limitations of knowledge sharing within companies are a major cause of unsatisfactory productivity within industrialized engineering production...

Fredrick Ahenkora Boamah, Jianhua Zhang, ... Maryam Tariq in Iranian Journal of Science and Technology, Transactions of Civil Engineering

Article 06 April 2023

A novel tri-stage with reward-switching mechanism for constrained multiobjective optimization problems

The effective exploitation of infeasible solutions plays a crucial role in addressing constrained multiobjective optimization problems (CMOPs)....

Jiqing Qu, Xuefeng Li, Hui **ao in Complex & Intelligent Systems

Article Open access 30 March 2024

Navigation of Mobile Robots Based on Deep Reinforcement Learning: Reward Function Optimization and Knowledge Transfer

This paper presents an end-to-end online learning navigation method based on deep reinforcement learning (DRL) for mobile robots, whose objective is...

Weijie Li, Ming Yue, ... Ye ** in International Journal of Control, Automation and Systems

Article 30 January 2023

Faster Robotic Arm Movement Planning via Guided Attenuation Reward Sha**

The expensive learning cost has become a serious problem in robotic arm movement planning using reinforcement learning method. A significant amount...

**aodong Han, Dapeng Tao in Advances in Guidance, Navigation and Control

Conference paper 2023

Achieving Goals Using Reward Sha** and Curriculum Learning

Real-time control for robotics is a popular research area in the reinforcement learning community. Through the use of techniques such as reward...

Mihai Anca, Jonathan D. Thomas, ... Matthew Studley in Proceedings of the Future Technologies Conference (FTC) 2023, Volume 1

Conference paper 2023

Integrated energy optimization scheduling with active/passive demand response and reward and punishment ladder carbon trading

With the increase in energy demand and carbon emission requirements, energy management and CO ₂emission reduction on the user side are significant...

Tao Zhang, ** Wang, ... Wenli Liu in Electrical Engineering

Article 24 May 2023

Accelerated Reward Policy (ARP) for Robotics Deep Reinforcement Learning

Reward policy is a crucial part for Deep Reinforcement Learning (DRL) applications in Robotics. The challenges for autonomous systems with...

Harry Li, Chee Vang, ... Shuwen Zheng in Advances in Information and Communication

Conference paper 2022

Optimizing Reward Function Weights and Enhancing Control Mechanisms for Bipedal Robots Using LSTM and Attention Mechanisms

This paper introduces an optimized control approach for bipedal robots, merging Bayesian optimization for reward function weights and a novel neural...

Lingzhi Cui, Tianqi Deng, ... Wenhao He in Proceedings of International Conference on Image, Vision and Intelligent Systems 2023 (ICIVIS 2023)

Conference paper 2024

Penalty-Reward Analysis with Uninorms: A Study of Customer (Dis)Satisfaction

In customer (dis)satisfaction research, analytic methods are needed to capture the complex relationship between overall (dis)satisfaction with a...

Koen Vanhoof, Pieter Pauwels, ... Geert Wets in Intelligent Data Mining

Chapter

A Second-Order Adaptive Network Model for Collective Emotional Response During Reward-Based Gaming

In this paper, a second-order adaptive self-modelling network model is introduced to model collective emotional response during frequent repetitive...

Harry Thavaganeshan, Joy Wu, Jan Treur in Proceedings of Seventh International Congress on Information and Communication Technology

Conference paper 2023

Two-stage reward allocation with decay for multi-agent coordinated behavior for sequential cooperative task by using deep reinforcement learning

We propose a two-stage reward allocation method with decay using an extension of replay memory to adapt this rewarding method for deep reinforcement...

Yuki Miyashita, Toshiharu Sugawara in Autonomous Intelligent Systems

Article Open access 27 May 2022

Evaluation of Safe Reinforcement Learning with CoMirror Algorithm in a Non-Markovian Reward Problem

In reinforcement learning, an agent in an environment improves the skill depending on a reward, which is the feedback from an environment. For...

Megumi Miyashita, Shiro Yano, Toshiyuki Kondo in Intelligent Autonomous Systems 17

Conference paper 2023

Multi-Agent Reward-Iteration Fuzzy Q-Learning

Fuzzy Q-learning extends Q-learning to continuous state space and has been applied to a wide range of applications such as robot control. But in a...

Lixiong Leng, **gchen Li, ... Haobin Shi in International Journal of Fuzzy Systems

Article 13 April 2021

Path Planning for Parafoil Airdrop System Based on TD3 Algorithm: Reward Sha** with Potential Field

The flight time of the parafoil airdrop system is limited by the release altitude, and how to accurately guide the parafoil landing within the...

Feilong Tao, Hao Sun, ... Zhongxin Liu in Proceedings of 2023 Chinese Intelligent Systems Conference

Conference paper 2023

Spatial Consciousness Model of Intrinsic Reward in Partially Observable Environments

In reinforcement learning navigation, agent exploration based on intrinsic rewards has uncertainties, including observation, action, and neural...

Zhenghongyuan Ni, Ye **, ... Wei Zhao in Journal of Intelligent & Robotic Systems

Article 06 December 2022

Search

Filters

Search Results

Search

Navigation