Multimodal Dialog System for Kyoto Sightseeing Guide

نویسندگان

  • Hideki Kashioka
  • Teruhisa Misu
  • Etsuo Mizukami
  • Yoshinori Shiga
  • Kentaro Kayama
  • Chiori Hori
  • Hisashi Kawai
چکیده

We proposed a dialog system on Kyoto tourist information assistance in a client-server fashion. Our proposed system is called the “proactive dialog system” and aims to present acceptable information in an acceptable time. We developed two prototype systems. The first one is designed for mobile use. It was implemented in iPhone and its application is opened to the public in AppStore. The second one is designed for multi-modal information integration on large display panel. It can detect non-verbal information, such as changes in gaze and facial direction as well as head gestures of the user during dialog, and recommend suitable information. These two prototype client systems are basically connecting to the server module. This server module uses a weighted finite-state transducer (WFST) in which user concept and system action tags are input and output of the transducer. We implemented a dialog scenario to present sightseeing information on the system. In our proposed dialog system, we designed our system’s behavior like human behavior. One of the most enduring problems in spoken dialogue systems research is realizing a natural dialogue in a human-human form. One-direction researchers have been utilizing spontaneous nonverbal and paralinguistic information. So that we collect human to human dialog corpus, and semi-automatically design a scenario which handles dialog in response to user’ input so as to accomplish a task efficiently. Especially we focus on users’ verbal feedback and non-verbal feedback in the form of nods. This paper presents our proposed system’s outline and its function. After that in this paper, we display the results of an evaluation of image processing techniques for estimating facial direction from a camera for a multi-modal spoken dialog system on a large display panel. Experiments that consist of 100 sessions with 80 subjects were conducted to evaluate the system’s efficiency. The system grows particularly clear when dialog contains recommendations.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Annotating communicative function and semantic content in dialogue act for construction of consulting dialogue systems

Our goal in this study is to train a dialogue manager that can handle consulting dialogues through spontaneous interactions from a tagged dialogue corpus. We have collected 130 hours of consulting dialogues in sightseeing guidance domain. This paper provides our taxonomy of dialogue act (DA) annotation that can describe two aspects of utterances. One is a communicative function (speech act), an...

متن کامل

Dialog management using weighted finite-state transducers

We are aiming to construct an expandable and adaptable dialog system which handles multiple tasks and senses users’ intention via multiple modalities. A flexible platform to integrate different dialog strategies and modalities is indispensable for this purpose. In this paper, we propose an efficient approach to manage a dialog system using a weighted finitestate transducer (WFST) in which users...

متن کامل

Multimodal Interaction with a Virtual Guide

We demonstrate the Virtual Guide, an embodied conversational agent that gives directions in a 3D environment. We briefly describe multimodal dialogue management, language and gesture generation, and a special feature of the Virtual Guide: the ability to align her linguistic style to the user’s level of politeness.

متن کامل

Wizard of Oz Method for Learning Dialog Agents

This paper describes a framework to construct interface agents with example dialogs based on the tasks by the machine learning technology. The Wizard of Oz method is used to collect example dialogs, and a finite state machine-based model is used for the dialog model. We implemented a Web-based system which includes these functions, and empirically examined the system which treats with a guide t...

متن کامل

Integrating Pointing Gestures into a Spanish-spoken Dialog System for Conversational Service Robots

In this paper we present our work on the integration of human pointing gestures into a spoken dialog system in Spanish for conversational service robots. The dialog system is composed by a dialog manager, an interpreter that guides the spoken dialog and robot actions, in terms of user intentions and relevant environment stimuli associated to the current conversational situation. We demonstrate ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011