The FASil speech and multimodal corpora
نویسندگان
چکیده
In the context of the FASiL project, we have studied natural language interactions in a unimodal (speech only) and multimodal (speech and graphics) interface to a personal information management database. We collected multilingual corpora to investigate these interactions in Portuguese, English and Swedish. The corpora are used to train language models, to update acoustic models, to study semantic concepts, multimodal interactions, and dialogue management strategies. The corpora are annotated in a uniform way, with timings, transcriptions, and semantics. We report on the structure and design of the corpora which are now available via ELRA.
منابع مشابه
A Portuguese spoken and multi
This paper presents an overview of the spoken and multimodal dialog Portuguese corpora collected in the context of the FASiL (Flexible and Adaptive Spoken Language and Multi-Modal Interfaces) project. The project developed a Virtual Personal Assistant application in the Personal Information Management domain, exploiting the state-of-theart of speech and multi-modal technology. The FASiL corpora...
متن کاملEvaluating Factors Impacting the Accuracy of Forced Alignments in a Multimodal Corpus
People, when processing human-to-human communication, utilize everything they can in order to understand that communication, including speech and information such as the time and location of an interlocutor’s gesture and gaze. Speech and gesture are known to exhibit a synchronous relationship in human communication; however, the precise nature of that relationship requires further investigation...
متن کاملThe SmartWeb Corpora: Multimodal Access to the Web in Natural Environments
As a result from the German SmartWeb project three speech corpora, one of them multimodal, have been published by the Bavarian Archive for Speech Signals (BAS). They contain speech and video signals from human–machine interactions in real indoor and outdoor environments. The scenarios for these corpora are a typicial handheld PDA interaction (SHC), an interaction on a running motorcycle (SMC) a...
متن کاملIntegration of Speech and Deictic Gesture in a Multimodal Grammar
In this paper we present a constraint-based analysis of the form-meaning mapping of deictic gesture and its synchronous speech signal. Based on an empirical study of multimodal corpora, we capture generalisations about well-formed multimodal utterances that support the preferred interpretations in the final context-of-use. More precisely, we articulate a multimodal grammar whose construction ru...
متن کاملWinPitch Corpus, a Text to Speech Alignment Tool for Multimodal Corpora
WinPitch Corpus is an innovative software program for computer-aided alignment of large corpora. It provides a method for easy and precise selection of alignment units, ranging from syllable to whole sentences in a hierarchical storing system of aligned data. The method is based on the ability to link visually and select with a mouse click a text segment with the perception of the corresponding...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2005