We are improving our search experience. To check which content you have full access to, or for advanced search, go back to the old search.

Search

Please fill in this field.
Filters applied:

Search Results

Showing 81-100 of 10,000 results
  1. Proximal policy optimization for formation navigation and obstacle avoidance

    In this paper, a formation control problem of second-order holonomic agents is considered, where agents navigate around obstacles using proximal...

    Article 16 June 2022
  2. A policy primer and roadmap on AI worker surveillance and productivity scoring tools

    Algorithmic worker surveillance and productivity scoring tools powered by artificial intelligence (AI) are becoming prevalent and ubiquitous...

    Merve Hickok, Nestor Maslej in AI and Ethics
    Article 20 March 2023
  3. Parameter-Free Reduction of the Estimation Bias in Deep Reinforcement Learning for Deterministic Policy Gradients

    Approximation of the value functions in value-based deep reinforcement learning induces overestimation bias, resulting in suboptimal policies. We...

    Baturay Saglam, Furkan Burak Mutlu, ... Suleyman Serdar Kozat in Neural Processing Letters
    Article Open access 02 March 2024
  4. “Learning as a Strategy” for Better EU Policy Understanding and Implementation in the Digital Era

    The European Commission established the Better Regulation agenda of the European Union (EU) in 2015, but problems in EU implementation persist. A...

    Paul Hearn, Leticia Elias, Eleonora Ganescu in Digital Society
    Article Open access 20 April 2023
  5. Collaborative framework for UAVs-assisted mobile edge computing: a proximity policy optimization approach

    Recently, unmanned aerial vehicles (UAVs) have been widely used in mobile edge computing (MEC) scenarios due to their flexibility, rapid deployment,...

    Ruizhong Du, Bowen Cao, Yan Gao in The Journal of Supercomputing
    Article 27 December 2023
  6. FeMIP: detector-free feature matching for multimodal images with policy gradient

    Feature matching for multimodal images is an important task in image processing. However, most methods perform image feature detection, description,...

    Yide Di, Yun Liao, ... Mingyu Lu in Applied Intelligence
    Article 17 July 2023
  7. Hybrid cryptosystem based healthcare data sharing with access control policy in cloud environment

    Healthcare cloud computing environments are expanding quickly, and security and confidentiality of patient records are top priorities. Academics and...

    S. Vinothkumar, J. Amutharaj in International Journal of Information Technology
    Article 10 May 2024
  8. Optimal policy trees

    We propose an approach for learning optimal tree-based prescription policies directly from data, combining methods for counterfactual estimation from...

    Maxime Amram, Jack Dunn, Ying Daisy Zhuo in Machine Learning
    Article 09 March 2022
  9. APPCorp: a corpus for Android privacy policy document structure analysis

    With the increasing popularity of mobile devices and the wide adoption of mobile Apps, an increasing concern of privacy issues is raised. Privacy...

    Shuang Liu, Fan Zhang, ... Meishan Zhang in Frontiers of Computer Science
    Article 12 September 2022
  10. Efficient policy evaluation by matrix sketching

    In the reinforcement learning, policy evaluation aims to predict long-term values of a state under a certain policy. Since high-dimensional...

    Cheng Chen, Weinan Zhang, Yong Yu in Frontiers of Computer Science
    Article 08 January 2022
  11. Net versus relative impacts in public policy automation: a conjoint analysis of attitudes of Black Americans

    The use of algorithms and automated systems, especially those leveraging artificial intelligence (AI), has been exploding in the public sector, but...

    Ryan Kennedy, Amanda Austin, ... Peter Salib in AI & SOCIETY
    Article Open access 13 July 2024
  12. Policy Representation Opponent Sha** via Contrastive Learning

    To acquire results with higher social welfare in social dilemmas, agents need to maintain cooperation. Independent agents manage to navigate social...
    Yuming Chen, Yuanheng Zhu in Neural Information Processing
    Conference paper 2024
  13. Energy-Based Policy Constraint for Offline Reinforcement Learning

    Offline RL suffers from the distribution shift problem. One way to address this issue is to constrain the divergence between the target policy and...
    Zhiyong Peng, Changlin Han, ... Zongtan Zhou in Artificial Intelligence
    Conference paper 2024
  14. Basic flight maneuver generation of fixed-wing plane based on proximal policy optimization

    Autonomous agile flight control has been a challenging problem due to complex highly nonlinear dynamics, and generating feasible basic flight...

    Lun Li, Xuebo Zhang, ... Runhua Wang in Neural Computing and Applications
    Article 31 January 2023
  15. Resilience & Vulnerability: Concepts and Policy Contexts

    Climate change is an unparalleled global challenge, with profound implications for the environment, societies, and economies. As the Earth’s climate...
    Syed Shahid Mazhar, Farhina Sardar Khan, ... Ambrina Sardar Khan in Geospatial Technology to Support Communities and Policy
    Chapter 2024
  16. Authorization and Policy Enforcement

    The previous chapters covered the mechanics of authorizing an API call and authenticating a user. This chapter will discuss authorization vs. the...
    Yvonne Wilson, Abhishek Hingnikar in Solving Identity Management in Modern Applications
    Chapter 2023
  17. Automated cloud resources provisioning with the use of the proximal policy optimization

    Many modern applications, both scientific and commercial, are deployed to cloud environments and often employ multiple types of resources. That...

    Włodzimierz Funika, Paweł Koperek, Jacek Kitowski in The Journal of Supercomputing
    Article Open access 10 November 2022
  18. A distributed and energy-efficient KNN for EEG classification with dynamic money-saving policy in heterogeneous clusters

    Due to energy consumption’s increasing importance in recent years, energy-time efficiency is a highly relevant objective to address in...

    Juan José Escobar, Francisco Rodríguez, ... Miguel Damas in Computing
    Article Open access 27 June 2023
  19. UAVs rounding up inspired by communication multi-agent depth deterministic policy gradient

    UAVs rounding up is a game between UAV swarm and targets. The main challenge lies in achieving efficient collaboration between UAVs and the setting...

    Longting Jiang, Ruixuan Wei, Dong Wang in Applied Intelligence
    Article 06 September 2022
  20. Resource Allocation Using Deep Deterministic Policy Gradient-Based Federated Learning for Multi-Access Edge Computing

    The study focuses on utilizing the computational resources present in vehicles to enhance the performance of multi-access edge computing (MEC)...

    Zheyu Zhou, Qi Wang, ... Ziyuan Li in Journal of Grid Computing
    Article 27 June 2024
Did you find what you were looking for? Share feedback.