Search Page | SpringerLink

Proximal policy optimization for formation navigation and obstacle avoidance

In this paper, a formation control problem of second-order holonomic agents is considered, where agents navigate around obstacles using proximal...

Priyam Sadhukhan, Rastko R. Selmic in International Journal of Intelligent Robotics and Applications

Article 16 June 2022

A policy primer and roadmap on AI worker surveillance and productivity scoring tools

Algorithmic worker surveillance and productivity scoring tools powered by artificial intelligence (AI) are becoming prevalent and ubiquitous...

Merve Hickok, Nestor Maslej in AI and Ethics

Article 20 March 2023

Parameter-Free Reduction of the Estimation Bias in Deep Reinforcement Learning for Deterministic Policy Gradients

Approximation of the value functions in value-based deep reinforcement learning induces overestimation bias, resulting in suboptimal policies. We...

Baturay Saglam, Furkan Burak Mutlu, ... Suleyman Serdar Kozat in Neural Processing Letters

Article Open access 02 March 2024

“Learning as a Strategy” for Better EU Policy Understanding and Implementation in the Digital Era

The European Commission established the Better Regulation agenda of the European Union (EU) in 2015, but problems in EU implementation persist. A...

Paul Hearn, Leticia Elias, Eleonora Ganescu in Digital Society

Article Open access 20 April 2023

Collaborative framework for UAVs-assisted mobile edge computing: a proximity policy optimization approach

Recently, unmanned aerial vehicles (UAVs) have been widely used in mobile edge computing (MEC) scenarios due to their flexibility, rapid deployment,...

Ruizhong Du, Bowen Cao, Yan Gao in The Journal of Supercomputing

Article 27 December 2023

FeMIP: detector-free feature matching for multimodal images with policy gradient

Feature matching for multimodal images is an important task in image processing. However, most methods perform image feature detection, description,...

Yide Di, Yun Liao, ... Mingyu Lu in Applied Intelligence

Article 17 July 2023

Hybrid cryptosystem based healthcare data sharing with access control policy in cloud environment

Healthcare cloud computing environments are expanding quickly, and security and confidentiality of patient records are top priorities. Academics and...

S. Vinothkumar, J. Amutharaj in International Journal of Information Technology

Article 10 May 2024

Optimal policy trees

We propose an approach for learning optimal tree-based prescription policies directly from data, combining methods for counterfactual estimation from...

Maxime Amram, Jack Dunn, Ying Daisy Zhuo in Machine Learning

Article 09 March 2022

APPCorp: a corpus for Android privacy policy document structure analysis

With the increasing popularity of mobile devices and the wide adoption of mobile Apps, an increasing concern of privacy issues is raised. Privacy...

Shuang Liu, Fan Zhang, ... Meishan Zhang in Frontiers of Computer Science

Article 12 September 2022

Efficient policy evaluation by matrix sketching

In reinforcement learning, policy evaluation aims to predict long-term values of a state under a certain policy. Since high-dimensional...

Cheng Chen, Weinan Zhang, Yong Yu in Frontiers of Computer Science

Article 08 January 2022

Net versus relative impacts in public policy automation: a conjoint analysis of attitudes of Black Americans

The use of algorithms and automated systems, especially those leveraging artificial intelligence (AI), has been exploding in the public sector, but...

Ryan Kennedy, Amanda Austin, ... Peter Salib in AI & SOCIETY

Article Open access 13 July 2024

Policy Representation Opponent Shaping via Contrastive Learning

To acquire results with higher social welfare in social dilemmas, agents need to maintain cooperation. Independent agents manage to navigate social...

Yuming Chen, Yuanheng Zhu in Neural Information Processing

Conference paper 2024

Energy-Based Policy Constraint for Offline Reinforcement Learning

Offline RL suffers from the distribution shift problem. One way to address this issue is to constrain the divergence between the target policy and...

Zhiyong Peng, Changlin Han, ... Zongtan Zhou in Artificial Intelligence

Conference paper 2024

Basic flight maneuver generation of fixed-wing plane based on proximal policy optimization

Autonomous agile flight control has been a challenging problem due to complex highly nonlinear dynamics, and generating feasible basic flight...

Lun Li, Xuebo Zhang, ... Runhua Wang in Neural Computing and Applications

Article 31 January 2023

Resilience & Vulnerability: Concepts and Policy Contexts

Climate change is an unparalleled global challenge, with profound implications for the environment, societies, and economies. As the Earth’s climate...

Syed Shahid Mazhar, Farhina Sardar Khan, ... Ambrina Sardar Khan in Geospatial Technology to Support Communities and Policy

Chapter 2024

Authorization and Policy Enforcement

The previous chapters covered the mechanics of authorizing an API call and authenticating a user. This chapter will discuss authorization vs. the...

Yvonne Wilson, Abhishek Hingnikar in Solving Identity Management in Modern Applications

Chapter 2023

Automated cloud resources provisioning with the use of the proximal policy optimization

Many modern applications, both scientific and commercial, are deployed to cloud environments and often employ multiple types of resources. That...

Włodzimierz Funika, Paweł Koperek, Jacek Kitowski in The Journal of Supercomputing

Article Open access 10 November 2022

A distributed and energy-efficient KNN for EEG classification with dynamic money-saving policy in heterogeneous clusters

Due to energy consumption’s increasing importance in recent years, energy-time efficiency is a highly relevant objective to address in...

Juan José Escobar, Francisco Rodríguez, ... Miguel Damas in Computing

Article Open access 27 June 2023

UAVs rounding up inspired by communication multi-agent depth deterministic policy gradient

UAVs rounding up is a game between UAV swarm and targets. The main challenge lies in achieving efficient collaboration between UAVs and the setting...

Longting Jiang, Ruixuan Wei, Dong Wang in Applied Intelligence

Article 06 September 2022

Resource Allocation Using Deep Deterministic Policy Gradient-Based Federated Learning for Multi-Access Edge Computing

The study focuses on utilizing the computational resources present in vehicles to enhance the performance of multi-access edge computing (MEC)...

Zheyu Zhou, Qi Wang, ... Ziyuan Li in Journal of Grid Computing

Article 27 June 2024
