Synthesising Reinforcement Learning Policies Through Set-Valued Inductive Rule Learning

نویسندگان

چکیده

Today’s advanced Reinforcement Learning algorithms produce black-box policies, that are often difficult to interpret and trust for a person. We introduce policy distilling algorithm, building on the CN2 rule mining distills into rule-based decision system. At core of our approach is fact an RL process does not just learn policy, mapping from states actions, but also produces extra meta-information, such as action values indicating quality alternative actions. This meta-information can indicate whether more than one near-optimal certain state. extend make it able leverage knowledge about equally-good actions distill fewer rules, increasing its interpretability by Then, ensure rules explain valid, non-degenerate we refinement algorithm fine-tunes obtain good performance when executed in environment. demonstrate applicability Mario AI benchmark, complex task requires modern reinforcement learning including neural networks. The explanations capture learned only few allow person understand what agent learned. Source code: https://gitlab.ai.vub.ac.be/yocoppen/svcn2.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Rule Combination in Inductive Learning

This paper describes the work on methods for combining rules obtained by machine learning systems. Three methods for obtaining the classification of examples with those rules are compared. The advantages and disadvantages of each method are discussed and the results obtained on three real world domains are commented. The methods compared are: selection of the best rule; PROSPECTOR-like probabil...

متن کامل

Advances in Inductive Rule Learning

A extension to the k-nearest neighbor classifier is described in which automatically induced rules are used as binary features, which are active in an instance when the left-hand side of the corresponding rule matches with the instance. The ripper rule induction algorithm is employed to produce the rules. The similarity between a memory instance and a new instance is based on the rules the two ...

متن کامل

Imitative Policies for Reinforcement Learning

We discuss a reinforcement learning framework where learners observe experts interacting with the environment. Our approach is to construct from these observations exploratory policies which favor selection of actions the expert has taken. This imitation strategy can be applied at any stage of learning, and requires neither that information regarding reinforcement be conveyed from the expert to...

متن کامل

Learning Relational Options for Inductive Transfer in Relational Reinforcement Learning

In reinforcement learning problems, a learning agent has the task of learning a good or optimal strategy from interaction with his environment. At the start of the learning task, the agent usually has very little information. Therefore, when faced with complex problems that have a large state space, learning a good strategy might be infeasible or too slow to work in practice. One way to overcom...

متن کامل

Inductive Rule Learning on the Knowledge

We present an approach to learning sets of recursive rules based on analytical inductive programming. We propose that our approach can be used within cognitive architectures to model regularity detection and generalization over experience. Induced rule sets represent the knowledge underlying systematic behavior in complex situations. Such rule sets can explain systematicity and productivity of ...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Lecture Notes in Computer Science

سال: 2021

ISSN: ['1611-3349', '0302-9743']

DOI: https://doi.org/10.1007/978-3-030-73959-1_15