Demonstration of the EMPATHIC Framework for Task Learning from Implicit Human Feedback
نویسندگان
چکیده
Reactions such as gestures, facial expressions, and vocalizations are an abundant, naturally occurring channel of information that humans provide during interactions. An agent could leverage understanding implicit human feedback to improve its task performance at no cost the human. This approach contrasts with common teaching methods based on demonstrations, critiques, or other guidance need be attentively intentionally provided. In this work, we demonstrate a novel data-driven framework for learning from feedback, EMPATHIC. two-stage method consists (1) mapping relevant statistics reward, optimality, advantage; (2) using learn task. We instantiate first stage three second-stage evaluations learned mapping. To do so, collect dataset reactions while participants observe execute sub-optimal policy prescribed training train deep neural network data ability infer relative reward ranking events in prerecorded reactions; live (3) transfer domain which it evaluates robot manipulation trajectories. video, focus demonstrating online capability our instantiation
منابع مشابه
A Generic Coordinate Descent Framework for Learning from Implicit Feedback
In recent years, interest in recommender research has shifted from explicit feedback towards implicit feedback data. A diversity of complex models has been proposed for a wide variety of applications. Despite this, learning from implicit feedback is still computationally challenging. So far, most work relies on stochastic gradient descent (SGD) solvers which are easy to derive, but in practice ...
متن کاملRobot Task Learning from Human Demonstration
Today, most robots used in the industry are preprogrammed and require a welldefined and controlled environment. Reprogramming such robots is often a costly process requiring an expert. By enabling robots to learn tasks from human demonstration, robot installation and task reprogramming are simplified. In a longer time perspective, the vision is that robots will move out of factories into our ho...
متن کاملEvaluating the Robustness of Learning from Implicit Feedback
This paper evaluates the robustness of learning from implicit feedback in web search. In particular, we create a model of user behavior by drawing upon user studies in laboratory and real-world settings. The model is used to understand the effect of user behavior on the performance of a learning algorithm for ranked retrieval. We explore a wide range of possible user behaviors and find that lea...
متن کاملNonintentional task set activation: evidence from implicit task sequence learning.
Studies have shown that task sets could be configured endogenously (i.e., on the basis of memory) according to an explicit sequence or exogenously according to a task cue. In two experiments, we examined whether an implicitly learned sequence could facilitate task set configuration without participants' intention. These experiments led to opposite conclusions regarding this question, but their ...
متن کاملIntention Learning From Human Demonstration
Equipped with better sensing and learning capabilities, robots nowadays are meant to perform versatile tasks. To remove the load of detailed analysis and programming from the engineer, a concept has been proposed that the robot may learn how to execute the task from human demonstration by itself. Following the idea, in this paper, we propose an approach for the robot to learn the intention of t...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence
سال: 2021
ISSN: ['2159-5399', '2374-3468']
DOI: https://doi.org/10.1609/aaai.v35i18.17998