نتایج جستجو برای: apprenticeship

تعداد نتایج: 1472  

Journal: :ANU Historical Journal II 2019

Journal: :Classical Philology 1920

Journal: :Classical Philology 1920

Journal: :Academic Medicine 2017

Journal: :Estudios Irlandeses 2012

Journal: :Proceedings of the ... AAAI Conference on Artificial Intelligence 2022

In Apprenticeship Learning (AL), we are given a Markov Decision Process (MDP) without access to the cost function. Instead, observe trajectories sampled by an expert that acts according some policy. The goal is find policy matches expert's performance on predefined set of functions. We introduce online variant AL (Online Learning; OAL), where agent expected perform comparably while interacting ...

2008
Umar Syed Robert E. Schapire

1 The MWAL Algorithm For reference, the MWAL algorithm from the main paper is repeated below.

Journal: :Relations industrielles 1950

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید