FRanCo - A Ground Truth Corpus for Fact Ranking Evaluation
نویسندگان
چکیده
The vast amount of information on the Web poses a challenge when trying to identify the most important facts. Many fact ranking algorithms have emerged, however, thus far there is a lack of a general domain, objective gold standard that would serve as an evaluation benchmark for comparing such systems. We present FRanCo, a ground truth for fact ranking acquired using crowdsourcing. The corpus is built on a representative DBpedia sample of 541 entities and made freely available. We have published both the aggregated and the raw data collected, including identified nonsense statements that contribute to improving data
منابع مشابه
A Tangled Web: The Faint Signals of Deception in Text - Boulder Lies and Truth Corpus (BLT-C)
We present an approach to creating corpora for use in detecting deception in text, including a discussion of the challenges peculiar to this task. Our approach is based on soliciting several types of reviews from writers and was implemented using Amazon Mechanical Turk. We describe the multi-dimensional corpus of reviews built using this approach, available free of charge from LDC as the Boulde...
متن کاملRanking-Based Emotion Recognition for Experimental Music
Emotion recognition is an open problem in Affective Computing the field. Music emotion recognition (MER) has challenges including variability of musical content across genres, the cultural background of listeners, reliability of ground truth data, and the modeling human hearing in computational domains. In this study, we focus on experimental music emotion recognition. First, we present a music...
متن کاملExpert Search Evaluation by Supporting Documents
An expert search system assists users with their “expertise need” by suggesting people with relevant expertise to their query. Most systems work by ranking documents in response to the query, then ranking the candidates using information from this initial document ranking and known associations between documents and candidates. In this paper, we aim to determine whether we can approximate an ev...
متن کاملGlobal Entity Ranking Across Multiple Languages
We present work on building a global long-tailed ranking of entities across multiple languages using Wikipedia and Freebase knowledge bases. We identify multiple features and build a model to rank entities using a ground-truth dataset of more than 10 thousand labels. The final system ranks 27 million entities with 75% precision and 48% F1 score. We provide performance evaluation and empirical e...
متن کاملGround Truth, Reference Truth & “Omniscient Truth” -- Parallel Phrases in Parallel Texts for MT Evaluation
Recently introduced automated methods of evaluating machine translation (MT) systems require the construction of parallel corpora of source language (SL) texts with human reference translations in the target language (TL). We present a novel method of exploiting and augmenting these resources for task-based MT evaluation, assessing how accurately people can extract Who, When, and Where elements...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2015