Conversation level syntax similarity metric.

نویسندگان

  • Reihane Boghrati
  • Joe Hoover
  • Kate M Johnson
  • Justin Garten
  • Morteza Dehghani
چکیده

The syntax and semantics of human language can illuminate many individual psychological differences and important dimensions of social interaction. Accordingly, psychological and psycholinguistic research has begun incorporating sophisticated representations of semantic content to better understand the connection between word choice and psychological processes. In this work we introduce ConversAtion level Syntax SImilarity Metric (CASSIM), a novel method for calculating conversation-level syntax similarity. CASSIM estimates the syntax similarity between conversations by automatically generating syntactical representations of the sentences in conversation, estimating the structural differences between them, and calculating an optimized estimate of the conversation-level syntax similarity. After introducing and explaining this method, we report results from two method validation experiments (Study 1) and conduct a series of analyses with CASSIM to investigate syntax accommodation in social media discourse (Study 2). We run the same experiments using two well-known existing syntactic metrics, LSM and Coh-Metrix, and compare their results to CASSIM. Overall, our results indicate that CASSIM is able to reliably measure syntax similarity and to provide robust evidence of syntax accommodation within social media discourse.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Generalized Representation of Syntactic Structures

Analysis of language provides important insights into the underlying psychological properties of individuals and groups. While the majority of language analysis work in psychology has focused on semantics, psychological information is encoded not just in what people say, but how they say it. In the current work, we propose Conversation Level Syntax Similarity Metric-Group Representations (CASSI...

متن کامل

Empirical Analysis of Descriptor Spaces and Metrics for Image Classification

The objective of this work is to devise an approach for measuring similarity of low-level content features in multi-descriptor spaces. Since no single descriptor is able to represent all the properties and patterns encapsulated in natural images, the combination of several descriptors appears to be a sensible strategy to increase their discrimination power and classification properties. We are ...

متن کامل

An Automatic Machine Translation Evaluation Metric Based on Dependency Parsing Model

Most of the syntax-based metrics obtain the similarity by comparing the substructures extracted from the trees of hypothesis and reference. These substructures are defined by human and can’t express all the information in the trees because of the limited length of substructures. In addition, the overlapped parts between these sub-structures are computed repeatedly. To avoid these problems, we p...

متن کامل

Semantic Cosine Similarity

Cosine similarity is a widely implemented metric in information retrieval and related studies. This metric models a text as a vector of terms and the similarity between two texts is derived from cosine value between two texts' term vectors. Cosine similarity however still can't handle the semantic meaning of the text perfectly. This paper proposes an enhancement of cosine similarity measurement...

متن کامل

Learning to Detect Conversation Focus of Threaded Discussions

In this paper we present a novel featureenriched approach that learns to detect the conversation focus of threaded discussions by combining NLP analysis and IR techniques. Using the graph-based algorithm HITS, we integrate different features such as lexical similarity, poster trustworthiness, and speech act analysis of human conversations with featureoriented link generation functions. It is th...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Behavior research methods

دوره   شماره 

صفحات  -

تاریخ انتشار 2017