Deep reinforcement learning with a particle dynamics environment applied to emergency evacuation of a room with obstacles
Abstract
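Efficient emergency evacuation is crucial for survival. A very successful model for simulating emergency evacuation is the social-force model. At the heart of the model is the self-driven force that is applied to an agent and is directed towards the exit. However, it is not clear if the application of this force results in optimal evacuation, especially in complex environments with obstacles. In this paper, we develop a deep reinforcement learning algorithm in association with the social-force model to train agents to find the fastest evacuation path. During training, we penalize every step of an agent in the room and give zero reward at the exit. We adopt the Dyna-Q learning approach, which incorporates both the model-free Q-learning algorithm and the model-based reinforcement learning method, to update a deep neural network used to approximate the action value functions. We first show that in the case of a room without obstacles the resulting self-driven force points directly towards the exit, as in the social-force model. To quantitatively validate our method, we compare the total time elapsed when agents escape a room with one door employing the deep reinforcement learning approach with the result obtained using the social-force model. The median exit time intervals calculated using the two methods are not significantly different. We confirm that the proposed method obtains trajectories that minimize travel time by comparing them with trajectories generated by geodesics-based adaptive pedestrian dynamics. Then, we investigate the evacuation of a room with one obstacle and one exit. Our method produces results similar to those of the social-force model when the obstacle is convex. In the case of concave obstacles, which sometimes can act as traps for agents governed purely by the social-force model and prohibit complete room evacuation, our approach is clearly advantageous, since it derives a policy that results in object avoidance and complete room evacuation without additional assumptions. We also study the evacuation of a room with multiple exits. Agents are able to evacuate efficiently from the nearest exit through a shared network trained for a single agent. Finally, we test the robustness of the Dyna-Q learning approach in a complex environment with multiple exits and obstacles. Overall, our model, based on the Dyna-Q reinforcement learning approach, can efficiently handle the modeling of emergency evacuation in complex environments with multiple room exits and obstacles, where it is difficult to obtain an intuitive rule for fast evacuation.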
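The training scheme named in the abstract (a penalty on every step, zero reward at the exit, and Dyna-Q updates that mix real and model-replayed experience) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not the paper's implementation: the room is a small grid instead of a particle-dynamics environment, the action values are stored in a table instead of a deep neural network, and the names `dyna_q` and `greedy_steps` as well as all hyperparameter values are hypothetical.

```python
import random

# Four grid moves; a toy stand-in for the continuous self-driven force direction.
ACTIONS = [(1, 0), (-1, 0), (0, 1), (0, -1)]


def dyna_q(width=5, height=5, exit_cell=(4, 2), episodes=300,
           planning_steps=10, alpha=0.5, gamma=0.95, epsilon=0.1, seed=0):
    """Tabular Dyna-Q on a toy grid room (illustrative assumption only)."""
    rng = random.Random(seed)
    states = [(x, y) for x in range(width) for y in range(height)]
    q = {(s, a): 0.0 for s in states for a in range(len(ACTIONS))}
    model = {}  # learned deterministic model: (state, action) -> (reward, next_state)

    def step(s, a):
        # Walls clamp the move; every step costs -1 except the one reaching the exit.
        dx, dy = ACTIONS[a]
        s2 = (min(max(s[0] + dx, 0), width - 1),
              min(max(s[1] + dy, 0), height - 1))
        return s2, (0.0 if s2 == exit_cell else -1.0)

    def q_update(s, a, r, s2):
        # One-step Q-learning target; the exit is terminal, so its value is 0.
        best_next = 0.0 if s2 == exit_cell else max(
            q[(s2, b)] for b in range(len(ACTIONS)))
        q[(s, a)] += alpha * (r + gamma * best_next - q[(s, a)])

    for _ in range(episodes):
        s = rng.choice([c for c in states if c != exit_cell])
        for _ in range(20 * width * height):  # safety cap on episode length
            if s == exit_cell:
                break
            if rng.random() < epsilon:
                a = rng.randrange(len(ACTIONS))
            else:
                a = max(range(len(ACTIONS)), key=lambda b: q[(s, b)])
            s2, r = step(s, a)
            q_update(s, a, r, s2)      # model-free update from real experience
            model[(s, a)] = (r, s2)    # record the transition in the model
            for _ in range(planning_steps):
                # Model-based planning: replay transitions sampled from the model.
                ps, pa = rng.choice(list(model))
                pr, ps2 = model[(ps, pa)]
                q_update(ps, pa, pr, ps2)
            s = s2
    return q


def greedy_steps(q, start, exit_cell=(4, 2), width=5, height=5, max_steps=100):
    """Number of steps the greedy policy needs to reach the exit from start."""
    s, n = start, 0
    while s != exit_cell and n < max_steps:
        a = max(range(len(ACTIONS)), key=lambda b: q[(s, b)])
        dx, dy = ACTIONS[a]
        s = (min(max(s[0] + dx, 0), width - 1),
             min(max(s[1] + dy, 0), height - 1))
        n += 1
    return n
```

With this reward scheme every action value is non-positive and the value of a state reflects how many steps remain before the exit, so maximizing return is the same as minimizing evacuation time. Calling `greedy_steps(dyna_q(), (0, 2))` reports how many moves the learned greedy policy takes from the wall opposite the exit.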
Journal
Journal title: Physica D: Nonlinear Phenomena
Year: 2021
ISSN: 1872-8022, 0167-2789
DOI: https://doi.org/10.1016/j.physa.2021.125845