Describing intonation with a parametric model

Author

  • Gregor Möhler
Abstract

This study presents a data-based approach to intonation modeling. The model incorporates knowledge from intonation theories, such as the expected types of F0 movements and syllable anchoring. This knowledge is integrated into the model through an appropriate approximation function for F0 parametrization. The resulting F0 parameters are predicted from a set of features using neural nets. The quality of the generated contours is assessed by means of numerical measures and perception tests. The results show that the basic hypotheses about intonation description and modeling are in principle correct and have the potential to be applied successfully to speech synthesis. We argue for a clear interface with a linguistic description (using pitch-accent and boundary labels as input) and with discourse structure (using pitch-range normalized F0 parameters), even though current text-to-speech systems usually cannot yet predict most of the appropriate information.
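As a rough illustration of what "pitch-range normalized F0 parameters" could mean in practice, the sketch below maps an F0 value in Hz to a relative position within a pitch range and back. The abstract does not state how the normalization is done; the linear scaling in Hz, the function names, and the example floor and ceiling values are all assumptions made for illustration only.

```python
# Minimal sketch of pitch-range normalization (an assumption, not the
# paper's stated method): an F0 value is expressed as a relative position
# between a hypothetical floor and ceiling of the current pitch range.

def normalize_to_pitch_range(f0_hz, floor_hz, ceiling_hz):
    """Map an F0 value in Hz to a relative position (0 = floor, 1 = ceiling)."""
    if ceiling_hz <= floor_hz:
        raise ValueError("ceiling_hz must be greater than floor_hz")
    return (f0_hz - floor_hz) / (ceiling_hz - floor_hz)


def denormalize_from_pitch_range(relative, floor_hz, ceiling_hz):
    """Map a relative position back to Hz within a given pitch range."""
    if ceiling_hz <= floor_hz:
        raise ValueError("ceiling_hz must be greater than floor_hz")
    return floor_hz + relative * (ceiling_hz - floor_hz)


# Hypothetical values: a 180 Hz accent peak in a 100-200 Hz range,
# re-realized in a different 150-350 Hz range.
relative_peak = normalize_to_pitch_range(180.0, 100.0, 200.0)
rescaled_peak = denormalize_from_pitch_range(relative_peak, 150.0, 350.0)
```

Keeping the parameter relative to the range is what would let a separate component, such as a discourse model, choose the range independently of the accent shape.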


Similar resources

Describing the development of intonational categories using a target-oriented parametric approach

In this paper we analyze the relation between adults’ intonational categories as described in the ToBI framework and children’s intonation contours, using a parametric approach and cluster evaluation methods. In the field of prosody, an increasing number of studies on the development of intonation apply the intonational categories of adult speech described as a sequence of high (H) and low (L) ...


Comparing two different principles of parametric F0 modeling

A number of data-based approaches to intonation modeling represent F0 movements using continuous parameters. This is contradictory to most intonation theories, which suggest that intonation can be modeled with a set of distinct phonological entities that are phonetically realized as F0 movements. This principle has rarely been incorporated into data-based intonation modeling. In this study we c...


The Copasul Intonation Model

A new data-driven and linguistically interpretable intonation model for the automatic analysis and synthesis of fundamental frequency contours is introduced: the CoPaSul model, which provides a contour-based (Co), parametric (Pa), and superpositional (Sul) intonation representation. Its application in F0 analysis and generation is described as well as its linguistic anchoring with respect to se...


Personality prediction based on intonation stylization

This study’s aim is to predict speaker personality from intonation patterns in spoken dialogs. Intonation patterns were extracted by a parametric superpositional stylization approach that allows for pattern description on a parametric as well as on a categorical level. Based on features derived from these representations we trained support vector machines and fitted generalized linear regressio...


Totally data-driven intonation prediction model using a novel F0 contour parametric representation

This paper proposes a novel parametric representation of Mandarin intonation based on orthogonal polynomial approximation. The polynomial is a simplified representation of the Parallel Encoding and Target Approximation (PENTA) intonation model, which includes a target component and an approximation component. We also propose predicting the polynomial parameters from linguistic and phonetic attributes...




Publication date: 1998