We are improving our search experience. To check which content you have full access to, or for advanced search, go back to the old search.

Search

Please fill in this field.
Filters applied:

Search Results

Showing 1-20 of 10,000 results
  1. Target-oriented policy diffusion analysis: a case study of China’s information technology policy

    Current quantitative research on policy diffusion tends to focus on the citation relationship between policies, while ignoring the nature of policy...

    Chao Yang, Cui Huang in Scientometrics
    Article 05 February 2024
  2. Cautious policy programming: exploiting KL regularization for monotonic policy improvement in reinforcement learning

    In this paper, we propose cautious policy programming (CPP), a novel value-based reinforcement learning (RL) algorithm that exploits the idea of...

    Lingwei Zhu, Takamitsu Matsubara in Machine Learning
    Article Open access 24 August 2023
  3. Off-policy and on-policy reinforcement learning with the Tsetlin machine

    The Tsetlin Machine is a recent supervised learning algorithm that has obtained competitive accuracy- and resource usage results across several...

    Saeed Rahimi Gorji, Ole-Christoffer Granmo in Applied Intelligence
    Article Open access 03 February 2023
  4. Policy citations of scientometric articles: an altmetric study

    Policy citations are considered as one of the important indicators of the societal impact of research. Scientometrics is a field that, among other...

    Hashem Atapour, Robabeh Maddahi, Rasoul Zavaraqi in Scientometrics
    Article 27 June 2024
  5. Policy-based optimization: single-step policy gradient method seen as an evolution strategy

    This research reports on the recent development of black-box optimization methods based on single-step deep reinforcement learning and their...

    J. Viquerat, R. Duvigneau, ... E. Hachem in Neural Computing and Applications
    Article 14 September 2022
  6. TFPsocialmedia: a public dataset for studying Turkish foreign policy

    Objectives

    This data note introduces the TFPsocialmedia dataset, designed to aid social media researchers investigating Turkish Foreign Policy (TFP)....

    Hakan Mehmetcik, Murat Can Ganiz, ... Emre Tortumlu in Discover Data
    Article Open access 02 April 2024
  7. IOB: integrating optimization transfer and behavior transfer for multi-policy reuse

    Humans have the ability to reuse previously learned policies to solve new tasks quickly, and reinforcement learning (RL) agents can do the same by...

    Siyuan Li, Hao Li, ... Chongjie Zhang in Autonomous Agents and Multi-Agent Systems
    Article 09 December 2023
  8. Size matters: contextual factors in local policy translations of National School Digitalisation Policy

    National policies on school digitalisation take shape in their local contexts. Consequently, to understand the outcome of national policy, the local...

    Article Open access 19 May 2022
  9. Algorithmic governance and AI: balancing innovation and oversight in Indonesian policy analyst

    The objective of this study is to examine the effects of generative artificial intelligence (AI) tools, with a specific focus on ChatGPT, on the...

    Bevaola Kusumasari, Bernardo Nugroho Yahya in AI & SOCIETY
    Article 01 July 2024
  10. Intelligent analysis of android application privacy policy and permission consistency

    With the continuous development of mobile devices, mobile applications bring a lot of convenience to people’s lives. The abuse of mobile device...

    Tengfei Tu, Hua Zhang, ... Qiaoyan Wen in Artificial Intelligence Review
    Article Open access 13 June 2024
  11. Online Pareto optimal control of mean-field stochastic multi-player systems using policy iteration

    In this study, the Pareto optimal strategy problem was investigated for multi-player mean-field stochastic systems governed by Itô differential...

    **ushan Jiang, Yanshuang Wang, ... Ling Shi in Science China Information Sciences
    Article 27 March 2024
  12. Model gradient: unified model and policy learning in model-based reinforcement learning

    Model-based reinforcement learning is a promising direction to improve the sample efficiency of reinforcement learning with learning a model of the...

    Chengxing Jia, Fuxiang Zhang, ... Yang Yu in Frontiers of Computer Science
    Article 27 December 2023
  13. A certified access control policy language: TEpla

    Access control is an information security process which guards protected resources against unauthorized access, as specified by restrictions in...

    Article 28 August 2023
  14. Policy semantic networks associated with ICT utilization in Africa

    Information and communications technology (ICT) research finds that greater utilization of ICTs leads to economic growth. This effect has led...

    James A Danowski, Aaron Van Klyton, ... Said Rutabayiro-Ngoga in Social Network Analysis and Mining
    Article 19 April 2023
  15. Analyzing sentiments towards E-Levy policy implementation in Ghana using twitter data

    A newly proposed or implemented government policy often encounters challenges. Ghanaian citizens have always look down negatively upon their...

    Peter Appiahene, Stephen Afrifa, ... Mukesh Prasad in International Journal of Information Technology
    Article Open access 16 March 2024
  16. Policy regularization for legible behavior

    In this paper we propose a method to augment a Reinforcement Learning agent with legibility. This method is inspired by the literature in Explainable...

    Michele Persiani, Thomas Hellström in Neural Computing and Applications
    Article Open access 26 October 2022
  17. Difference rewards policy gradients

    Policy gradient methods have become one of the most popular classes of algorithms for multi-agent reinforcement learning. A key challenge, however,...

    Jacopo Castellini, Sam Devlin, ... Rahul Savani in Neural Computing and Applications
    Article Open access 11 November 2022
  18. Empirical analysis of the impact of China’s carbon emissions trading policy using provincial-level data

    Investigating the impact of carbon emissions trading policy and elucidating the underlying mechanisms are crucial for enhancing policy effectiveness...

    **aoguo Jiang, Weiwei Xu, Lixia Du in Energy Informatics
    Article Open access 22 May 2024
  19. Towards Jum** Skill Learning by Target-guided Policy Optimization for Quadruped Robots

    Endowing quadruped robots with the skill to forward jump is conducive to making it overcome barriers and pass through complex terrains. In this...

    Chi Zhang, Wei Zou, ... Shuomo Zhang in Machine Intelligence Research
    Article 22 February 2024
  20. Off-policy evaluation for tabular reinforcement learning with synthetic trajectories

    This paper addresses the problem of offline evaluation in tabular reinforcement learning (RL). We propose a novel method that leverages synthetic...

    Weiwei Wang, Yuqiang Li, **anyi Wu in Statistics and Computing
    Article 17 November 2023
Did you find what you were looking for? Share feedback.