Latent Semantic Analysis for Dialogue Act Classification

نویسندگان

  • Riccardo Serafin
  • Barbara Di Eugenio
  • Michael Glass
چکیده

This paper presents our experiments in applying Latent Semantic Analysis (LSA) to dialogue act classification. We employ both LSA proper and LSA augmented in two ways. We report results on DIAG, our own corpus of tutoring dialogues, and on the CallHome Spanish corpus. Our work has the theoretical goal of assessing whether LSA, an approach based only on raw text, can be improved by using additional features of the text.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

FLSA: Extending Latent Semantic Analysis with Features for Dialogue Act Classification

We discuss Feature Latent Semantic Analysis (FLSA), an extension to Latent Semantic Analysis (LSA). LSA is a statistical method that is ordinarily trained on words only; FLSA adds to LSA the richness of the many other linguistic features that a corpus may be labeled with. We applied FLSA to dialogue act classification with excellent results. We report results on three corpora: CallHome Spanish,...

متن کامل

Dialogue Act Classification, Higher Order Dialogue Structure, and Instance-Based Learning

The main goal of this paper is to explore the predictive power of dialogue context on Dialogue Act classification, both as concerns the linear context provided by previous dialogue acts, and the hierarchical context specified by conversational games. As our learning approach, we extend Latent Semantic Analysis (LSA) as Feature LSA (FLSA), and combine FLSA with the k-Nearest Neighbor algorithm. ...

متن کامل

Semantic Features for Dialogue Act Recognition

Dialogue act recognition commonly relies on lexical, syntactic, prosodic and/or dialogue history based features. However, few approaches exploit semantic information. The main goal of this paper is thus to propose semantic features and integrate them into a dialogue act recognition task to improve the recognition score. Three different feature computation approaches are proposed, evaluated and ...

متن کامل

Speech act identification using an ontology-based partial pattern tree

This paper presents an ontology-based partial pattern tree to identify the speech act in a spoken dialogue system. This study first extracts the key words/concepts in an application domain using latent semantic analysis (LSA). A partial pattern tree is used to deal with the ill-formed sentence problem in a spoken dialogue system. Concept expansion based on domain ontology is adopted to improve ...

متن کامل

GnuTutor: An Open Source Intelligent Tutoring System Based on AutoTutor

This paper presents GnuTutor, an open source intelligent tutoring system (ITS) inspired by the AutoTutor ITS. The goal of GnuTutor is to create a freely available, open source ITS platform that can be used by schools and researchers alike. To achieve this goal, significant departures from AutoTutor’s current design were made so that GnuTutor would use a smaller, non-proprietary code base but ha...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003