A system for left-to-right intonation specification from text

نویسنده

  • Alex I. C. Monaghan
چکیده

This paper describes some computational strategies employed in the implementation of the model outlined in papers by Ladd and Ladd & Monaghan in this volume. There are two main categories of problern involved in extracting intonation from text: the first derives from the (currently) very limited nature of higher-level information deducible from text; the secend consists of prpblems of interpretation which would exist even if perfect high-level analyses of text were available. Both these categories can be resolved with reasonable success in our Left-to-Right process model by using various levels of representation and employing computational techniques such as default spefification and recursion. Factcrs affecting intonation at a high level include semantic, syntactic and pragmatic considerations, most of which are not explicit in text. Our model uses a small number of abstract PITCH ACCENT types in conjunction with limited syntactic and pragmatic information and a number of default clauses to generate a wide range of intonation contours. 1 LEVELS OF REPRESENTATION Any adequate model requires explicit representation of all relevant levels, and considerable thought was given to determining precisely which levels were relevant to intonation. The desire to avoid speakerspecific representations as much as possible, and the adoption of intonational tunes (see Ladd, this volume), led to the choice of three descriptive levels: abstract phonological, abstract phonetic and concrete phonetic. 1.1 Abstract Phonological Representation. This level defines a fairly abstract intonational contour by specifying accent location and degree (major or minor). Also represented at this stage are register steps and phrasal boundaries (see Ladd, ibid.). The current system specifies default locations for all these elements automatically, and other items (such as non-default boundaries or extra register steps) can be entered by hand. 1.2 Abstract Phonetic Representation. The phonological representation is mapped onto this level using the tune chosen for the utterance: each accent degree phoneme is specified as the appropriate accent type from the tune, to give a number of abstract targets generally three for major and two for minor accents. Boundaries are treated in a similar manner, but register steps are not interpreted at this level. *Centre for Speech Technology Research, University of Edinburgh.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A contribution to the synthesis of Italian intonation

A first approximation of a model for the automatic synthesis of Italian intonation is proposed. In line with Pierrehumbert's theory of intonational description [1] fundamental frequency contours of Italian are modelled as sequences of abstract tonal elements aligned with the text. Different levels of prosodic phrasing (accent units, intermediate and intonational phrases) are taken into account ...

متن کامل

استخراج پیکره‌ موازی از اسناد قابل‌مقایسه برای بهبود کیفیت ترجمه در سیستم‌های ترجمه ماشینی

Data used for training statistical machine translation method are usually prepared from three resources: parallel, non-parallel and comparable text corpora. Parallel corpora are an ideal resource for translation but due to lack of these kinds of texts, non-parallel and comparable corpora are used either for parallel text extraction. Most of existing methods for exploiting comparable corpora loo...

متن کامل

A case study of tone and intonation in two Tibetic language varieties

This paper presents a case study looking at the interaction between lexical tone and post-lexical intonation in two very similar Tibetic language varieties spoken in Nepal: Lamjung Yolmo and Kagate. In these two varieties, we find preliminary evidence that in both monosyllabic and disyllabic words, lexical tone is only specified at the left edge of the word, while the right edge of the word is ...

متن کامل

Structural Data-Driven Prosody Model for TTS Synthesis

This paper introduces a new data-driven prosody model for the text-to-speech system ARTIC. The model is intended to be almost language-independent and to generate naturally sounding intonation with a link to semantics. It is based on text parametrisation using a new prosodic grammar and on automatic speech corpora analysis methods. Its performance is evaluated by results of presented listening ...

متن کامل

Modeling the Circle of Willis Using Electrical Analogy Method under both Normal and Pathological Circumstances

Background and objective: The circle of Willis (COW) supports adequate blood supply to the brain. The cardiovascular system, in the current study, is modeled using an equivalent electronic system focusing on the COW.Method: In our previous study we used 42 compartments to model whole car- diovascular system. In the current study, nevertheless, we extended our model by using 63 compartments to m...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1987