Search Page | SpringerLink

An improved scheduling with advantage actor-critic for Storm workloads

Various resources as the essential elements of data centers, and their utilization is vital to resource managers. In terms of the persistence, the...

Gaoqiang Dong, Jia Wang, ... Tingting Su in Cluster Computing

Article 29 June 2024

A World Model for Actor–Critic in Reinforcement Learning

Model-based reinforcement learning is a hybrid approach that combines planning with a world model and model-free policy learning, a major...

A. I. Panov, L. A. Ugadiarov in Pattern Recognition and Image Analysis

Article 26 September 2023

A double Actor-Critic learning system embedding improved Monte Carlo tree search

As the bias between the estimated value and the true value, overestimation is a basic problem in reinforcement learning, which leads to a lower total...

Hongjun Zhu, Yong Xie, Suijun Zheng in Neural Computing and Applications

Article 23 February 2024

Large-scale UAV swarm confrontation based on hierarchical attention actor-critic algorithm

In large-scale unmanned aerial vehicle (UAV) swarm confrontation scenarios, the design of decision-making and coordination strategies becomes...

Xiaohong Nian, Mengmeng Li, ... Hongyun Xiong in Applied Intelligence

Article 23 February 2024

Sampling-efficient path planning and improved actor-critic-based obstacle avoidance for autonomous robots

Autonomous robots have garnered extensive utilization in diverse fields. Among the critical concerns for autonomous systems, path planning holds...

Yefeng Yang, Tao Huang, ... Chih-yung Wen in Science China Information Sciences

Article 26 April 2024

On the sample complexity of actor-critic method for reinforcement learning with function approximation

Reinforcement learning, mathematically described by Markov Decision Problems, may be approached either through dynamic programming or policy search....

Harshat Kumar, Alec Koppel, Alejandro Ribeiro in Machine Learning

Article 16 February 2023

Image captioning with residual swin transformer and Actor-Critic

Image captioning is one essential work in the multi-modal area, which employs computer vision and natural language processing technology together to...

Zhibo Zhou, Yang Yang, ... Feiran Huang in Neural Computing and Applications

Article 05 October 2022

UAV-enabled fair offloading for MEC networks: a DRL approach based on actor-critic parallel architecture

Data processing is a key challenge for computationally limited Ground Users (GUs) in various applications. Unmanned Aerial Vehicles (UAVs) equipped...

Wei Li, Si Li, ... Yi Zhou in Applied Intelligence

Article 29 February 2024

Robustness Assessment of Asynchronous Advantage Actor-Critic Based on Dynamic Skewness and Sparseness Computation: A Parallel Computing View

Reinforcement learning as autonomous learning is greatly driving artificial intelligence (AI) development to practical applications. Having...

Tong Chen, Ji-Qiang Liu, ... Gang Li in Journal of Computer Science and Technology

Article 30 September 2021

Improving actor-critic structure by relatively optimal historical information for discrete system

Recently, actor-critic structure based neural networks are widely used in many reinforcement learning tasks. It consists of two main parts: (i) an...

Xinyu Zhang, Weidong Li, ... Xiao-Yuan Jing in Neural Computing and Applications

Article 26 February 2022

Actor-critic reinforcement learning leads decision-making in energy systems optimization—steam injection optimization

Steam injection is a popular technique to enhance oil recovery in mature oil fields. However, the conventional approach of using a constant steam...

Ramez Abdalla, Wolfgang Hollstein, ... Philip Jaeger in Neural Computing and Applications

Article Open access 27 April 2023

A heuristic multi-objective task scheduling framework for container-based clouds via actor-critic reinforcement learning

Container-based cloud technology has changed the delivery mode of traditional applications and brought a breakthrough development to the field of...

Lilu Zhu, Feng Wu, ... Xinmei Tian in Neural Computing and Applications

Article 17 March 2023

Optimal fractional-order PID controller based on fractional-order actor-critic algorithm

In this paper, an online optimization approach of a fractional-order PID controller based on a fractional-order actor-critic algorithm (FOPID-FOAC)...

Raafat Shalaby, Mohammad El-Hossainy, ... Tarek A. Mahmoud in Neural Computing and Applications

Article Open access 24 August 2022

Integrating short-term stochastic production planning updating with mining fleet management in industrial mining complexes: an actor-critic reinforcement learning approach

Short-term production planning in industrial mining complexes involves defining daily, weekly or monthly decisions that aim to achieve production...

Joao Pedro de Carvalho, Roussos Dimitrakopoulos in Applied Intelligence

Article Open access 06 July 2023

An Advantage Actor-Critic Deep Reinforcement Learning Method for Power Management in HPC Systems

A primary concern when deploying a High-Performance Computing (HPC) system is its high energy consumption. Typical HPC systems consist of hundreds to...

Fitra Rahmani Khasyah, Kadek Gemilang Santiyuda, ... Hiroyuki Takizawa in Parallel and Distributed Computing, Applications and Technologies

Conference paper 2023

Actor-critic multi-objective reinforcement learning for non-linear utility functions

We propose a novel multi-objective reinforcement learning algorithm that successfully learns the optimal policy even for non-linear utility...

Mathieu Reymond, Conor F. Hayes, ... Ann Nowé in Autonomous Agents and Multi-Agent Systems

Article 28 April 2023

Improved gradient boosting hybrid spectrum sharing and actor critic channel allocation in 6G CR-IOT

The fast advancement of wireless communication technology and the growth in the reputation of Internet of Things (IoT) applications have led to the...

Mayank Kothari, Suresh Kurumbanshi in International Journal of Information Technology

Article 23 June 2024

A novel semi-supervised generative adversarial network based on the actor-critic algorithm for compound fault recognition

Vibration signals can be used to extract effective fault features for fault diagnosis. However, traditional supervised learning requires considerable...

Zisheng Wang, Jianping Xuan, Tielin Shi in Neural Computing and Applications

Article 09 March 2022

SAC-FACT: Soft Actor-Critic Reinforcement Learning for Counterfactual Explanations

Explainable AI (XAI) techniques are essential for improving the interpretability of machine learning models, which are generally regarded as black...

Fatima Ezzeddine, Omran Ayoub, ... Silvia Giordano in Explainable Artificial Intelligence

Conference paper 2023

Multi-source Domain Adaptation Based on Data Selector with Soft Actor-Critic

Multi-source domain adaptation (MDA) aims to transfer the knowledge learned from multiple-sources domains to the target domain. Although the source...

Qiquan Cui, Xuanyu Jin, ... Wanzeng Kong in Human Brain and Artificial Intelligence

Conference paper 2023
