Model-Based, Multimodal Interaction in Document Browsing

نویسندگان

  • Parisa Eslambolchilar
  • Roderick Murray-Smith
چکیده

In this paper we introduce a dynamic system approach to the design of multimodal interactive systems. We use an example where we support human behavior in browsing a document, by adapting the dynamics of navigation and the visual feedback (using a focus-in-context (F+C) method) to support the current inferred task. We also demonstrate non-speech audio feedback, based on a language model. We argue that to design interaction we need models of key aspects of the process, here for example, we need models for the dynamic system, language model and sonification. We show how the user’s intention is coupled to the visualization technique via the dynamic model, and how the focus-incontext method couples details in context to audio samples via the language identification system. We present probabilistic audio feedback as an example of a multimodal approach to sensing different languages in a multilingual text. This general approach is well suited to mobile and wearable applications, and shared displays.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Multimodal Feedback for Tilt Controlled Speed Dependent Automatic Zooming

Speed Dependent Automatic Zooming proposed by Igarashi and Hinckley is a powerful tool for document navigation on mobile devices. We show that browsing and targeting can be facilitated by using a model-based sonification approach to generate audio feedback about document structure, in a tilt-controlled SDAZ interface. We implemented this system for a text browser on a Pocket PC instrumented wit...

متن کامل

Using Static Documents as Structured and Thematic Interfaces to Multimedia Meeting Archives

Static documents play a central role in multimodal applications such as meeting recording and browsing. They provide a variety of structures, in particular thematic, for segmenting meetings, structures that are often hard to extract from audio and video. In this article, we present four steps for creating a strong link between static documents and multimedia meeting archives. First, a document-...

متن کامل

Documents statiques et multimodalité. L'alignement temporel pour structurer des archives multimédias de réunions

Printable documents play a central role in multimodal applications such as meeting recording and browsing. They provide a variety of structures, in particular thematic, for segmenting meetings, structures that are often hard to extract from audio and video. In this article, we present four steps for bridging a temporal link between documents and multimedia meeting archives. First, a document-ce...

متن کامل

Adapted Multimodal End-User Interfaces for XML-Content

Personalization of user interfaces for browsing content is a key concept to ensure content accessibility. This personalization is especially needed for people with disabilities (e.g,. visually impaired) and/or for highly mobile individuals (driving, off-screen environments) and/or for people with limited devices (PDAs, mobile phones, etc.). In this direction, we introduce mechanisms, based on a...

متن کامل

Multimodal Corpus Using Multimodal Dictionary in Lohorung

Lohorung is one of the minority Tibeto-Burman languages of Kirati group spoken in the Eastern part of Nepal. It lacks a written tradition. Crowd sourcing can be a good way to document a language in terms of resources, especially in the context of a linguistically rich country like Nepal. We have built a multimodal dictionary, browsing and authoring tool with which Lohorung community members, ev...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006