Generating and Evaluating Summaries for Partial Email Threads: Conversational Bayesian Surprise and Silver Standards
Abstract
We define and motivate the problem of summarizing partial email threads. This problem introduces the challenge of generating reference summaries for partial threads when human annotation is only available for the threads as a whole, particularly when the human-selected sentences are not uniformly distributed within the threads. We propose an oracular algorithm for generating these reference summaries with arbitrary length, and we are making the resulting dataset publicly available. In addition, we apply a recent unsupervised method based on Bayesian Surprise that incorporates background knowledge into partial thread summarization, extend it with conversational features, and modify the mechanism by which it handles redundancy. Experiments with our method indicate improved performance over the baseline for shorter partial threads, and our results suggest that the potential benefits of background knowledge to partial thread summarization should be further investigated with larger datasets.
Similar resources
Using Question-Answer Pairs in Extractive Summarization of Email Conversations
While sentence extraction as an approach to summarization has been shown to work in documents of certain genres, because of the conversational nature of email communication, sentence extraction may not result in a coherent summary. In this paper, we present our work on augmenting extractive summaries of threads of email conversations with automatically detected question-answer pairs. We compare...
Detection of Imperative and Declarative Question-Answer Pairs in Email Conversations
Question-answer pairs extracted from email threads can help construct summaries of the thread, as well as inform semantic-based assistance with email. Previous work dedicated to email threads extracts only questions in interrogative form. We extend the scope of question and answer detection and pairing to encompass also questions in imperative and declarative forms, and to operate at sentence-l...
A Method of Computing Measure for Evaluating Conversational Coherency in E-mail Communication
In this paper, we define a measure for evaluating conversational coherency and propose a method of computing it. We define a deliberation stream in email communication and construct a deliberation structure into which the deliberation streams are introduced. We then define discontinuity between utterances as the measure for evaluating conversational coherency by using the deliberation structure, con ...
A Study of Topic and Topic Change in Conversational Threads (Thesis, Naval Postgraduate School, Monterey, California)
This thesis applies Latent Dirichlet Allocation (LDA) to the problem of topic and topic change in conversational threads using e-mail. We demonstrate that LDA can be used to successfully classify raw e-mail messages with threads to which they belong, and compare the results with those for processed threads, where quoted and reply text have been removed. Raw thread classification performs better...
Generating Supplementary Travel Guides from Social Media
In this paper we study how to summarize travel-related information in forum threads to generate supplementary travel guides. Such summaries presumably can provide additional and more up-to-date information to tourists. Existing multi-document summarization methods have limitations for this task because (1) they do not generate structured summaries but travel guides usually follow a certain temp...
Publication date: 2017