Search results for: task evaluation

Number of results: 1091650

1998
Hongyan Jing Regina Barzilay

Two methods are used for evaluation of summarization systems: an evaluation of generated summaries against an "ideal" summary and evaluation of how well summaries help a person perform in a task such as information retrieval. We carried out two large experiments to study the two evaluation methods. Our results show that different parameters of an experiment can dramatically affect how well a...

2014
Bogdan Ionescu Adrian Popescu Mihai Lupu Alexandru-Lucian Gînsca Henning Müller

This paper provides an overview of the Retrieving Diverse Social Images task that is organized as part of the MediaEval 2014 Benchmarking Initiative for Multimedia Evaluation. The task addresses the problem of result diversification in the context of social photo retrieval. We present the task challenges, the proposed data set and ground truth, the required participant runs and the evaluation m...

2010
Enrique Amigó Javier Artiles Julio Gonzalo Damiano Spina Bing Liu Adolfo Corujo

This paper summarizes the definition, resources, evaluation methodology and metrics, participation and comparative results for the second task of the WEPS-3 evaluation campaign. The so-called Online Reputation Management task consists of filtering Twitter posts containing a given company name depending on whether the post is actually related to the company or not. Five research groups submitte...

2016
Xiaojun Wan Jianmin Zhang Jin-ge Yao Tianming Wang

Live webcast scripts are valuable resources for describing the process of sports games. This shared task aims to automatically generate sports news articles from live webcast scripts. The task can be considered a special case of single document summarization. In this overview paper, we will introduce the task, the evaluation dataset, the participating teams and the evaluation results. The datas...

2013
Bogdan Ionescu María Menéndez Henning Müller Adrian Popescu

This paper provides an overview of the Retrieving Diverse Social Images task that is organized as part of the MediaEval 2013 Benchmarking Initiative for Multimedia Evaluation. The task addresses the problem of result diversification in the context of social photo retrieval. We present the task challenges, the proposed data set and ground truth, the required participant runs and the evaluation m...

2013
Hwee Tou Ng Siew Mei Wu Yuanbin Wu Christian Hadiwinoto Joel R. Tetreault

The CoNLL-2013 shared task was devoted to grammatical error correction. In this paper, we give the task definition, present the data sets, and describe the evaluation metric and scorer used in the shared task. We also give an overview of the various approaches adopted by the participating teams, and present the evaluation results.

2013
Siew Mei Wu

The CoNLL-2013 shared task was devoted to grammatical error correction. In this paper, we give the task definition, present the data sets, and describe the evaluation metric and scorer used in the shared task. We also give an overview of the various approaches adopted by the participating teams, and present the evaluation results.

Abbas Zare-ee

This article reports on the findings of a study that investigated the impact of manipulating task performance conditions on listening task performance by learners of English as a foreign language (EFL). The study was designed to explore the effects of changing complexity dimensions on listening task performance and to achieve two aims: to see how listening comprehension task performance was aff...

2009
Preben Hansen Jussi Karlgren

This technical report collects three years of experimentation in interactive cross-language information retrieval by SICS in the annual Cross-language Evaluation Forum (CLEF) evaluation campaigns 2003, 2004, and 2005. We varied simulated task context and measured user performance in a document assessment task to find that choice of language and task context indeed have effects on the amount of eff...

2017
Claire-Hélène Demarty Mats Sjöberg Bogdan Ionescu Thanh-Toan Do Michael Gygli Ngoc Q. K. Duong

In this paper, the Predicting Media Interestingness task, which is running for the second year as part of the MediaEval 2017 Benchmarking Initiative for Multimedia Evaluation, is presented. For the task, participants are expected to create systems that automatically select images and video segments that are considered to be the most interesting for a common viewer. All task characteristics are d...
