evaluation metrics

نتایج جستجو برای: evaluation metrics

تعداد نتایج: 878773 فیلتر نتایج به سال:

Findings of the 2012 Workshop on Statistical Machine Translation

2012

Chris Callison-Burch Philipp Koehn Christof Monz Matt Post Radu Soricut Lucia Specia

This paper presents the results of the WMT12 shared tasks, which included a translation task, a task for machine translation evaluation metrics, and a task for run-time estimation of machine translation quality. We conducted a large-scale manual evaluation of 103 machine translation systems submitted by 34 teams. We used the ranking of these systems to measure how strongly automatic metrics cor...

متن کامل

Criteria Based Evaluation Pattern for Software Design Model

2014

Rashmi Rekha Sahu Abhishek Ray Durga Prasad Mohapatra

Software design evaluation plays an important role in software development process, to generate software product with high levels of productivity and efficiency. To achieve high level of qualitative software product, implementation of pattern can create a domain specific framework to provide consistency throughout a software solution. This paper proposed an evaluation pattern called Criteria Ba...

متن کامل

Document-Level Machine Translation Evaluation Metrics Enhanced with Simplified Lexical Chain

2015

Zhengxian Gong Guodong Zhou

Document-level Machine Translation (MT) has been drawing more and more attention due to its potential of resolving sentencelevel ambiguities and inconsistencies with the benefit of wide-range context. However, the lack of simple yet effective evaluation metrics largely impedes the development of such document-level MT systems. This paper proposes to improve traditional MT evaluation metrics by ...

متن کامل

Mining the Correlation between Human and Automatic Evaluation at Sentence Level

2010

Yanli Sun

Automatic evaluation metrics are fast and cost-effective measurements of the quality of a Machine Translation (MT) system. However, as humans are the end-user of MT output, human judgement is the benchmark to assess the usefulness of automatic evaluation metrics. While most studies report the correlation between human evaluation and automatic evaluation at corpus level, our study examines their...

متن کامل

Continuous, Dynamic and Comprehensive Article-Level Evaluation of Scientific Literature

Journal: :CoRR 2015

Xianwen Wang Zhichao Fang Yang Yang

It is time to make changes to the current research evaluation system, which is built on the journal selection. In this study, we propose the idea of continuous, dynamic and comprehensive article-level-evaluation based on article-level-metrics. Different kinds of metrics are integrated into a comprehensive indicator, which could quantify both the academic and societal impact of the article. At d...

متن کامل

Normalized Compression Distance Based Measures for MetricsMATR 2010

2010

Marcus Dobrinkat Tero Tapiovaara Jaakko Väyrynen Kimmo Kettunen

We present the MT-NCD and MT-mNCD machine translation evaluation metrics as submission to the machine translation evaluation shared task (MetricsMATR 2010). The metrics are based on normalized compression distance (NCD), a general information theoretic measure of string similarity, and evaluated against human judgments from the WMT08 shared task. The experiments show that 1) our metric improves...

متن کامل

Evaluation of Machine Translation Metrics for Czech as the Target Language

Journal: :Prague Bull. Math. Linguistics 2009

Kamil Kos Ondrej Bojar

In the present work we study semi-automatic evaluation techniques of machine translation (MT) systems. These techniques are based on a comparison of theMT system’s output to human translations of the same text. Various metrics were proposed in the recent years, ranging from metrics using only a unigram comparison to metrics that try to take advantage of additional syntactic or semantic informat...

متن کامل

The Prague Bulletin of Mathematical Linguistics NUMBER ? ? ? SEPTEMBER 2008 1 – 11 Evaluation of Machine Translation Metrics for Czech as the Target Language

2008

Kamil Kos Ondřej Bojar

In the present work we study semi-automatic evaluation techniques of machine translation (MT) systems. These techniques are based on a comparison of the MT system’s output to human translations of the same text. Various metrics were proposed in the recent years, ranging from metrics using only a unigram comparison to metrics that try to take advantage of additional syntactic or semantic informa...

متن کامل

Machine translation evaluation inside QARLA

2005

Jesús Giménez Enrique Amigó Chiori Hori

In this work we present the fundamentals of the IQMT framework for MT evaluation. IQMT offers a common workbench on which existing evaluation metrics can be utilized. We suggest the IQ measure and test it on the Chinese-toEnglish data from the IWSLT 2004 Evaluation Campaign. We show how the correlation with human assessments at the system level improves substantially for most individual metrics...

متن کامل

Evaluating Automatic Topic Segmentation as a Segment Retrieval Task

2017

Abdessalam Bouchekif Delphine Charlet Géraldine Damnati Nathalie Camelin Yannick Estève

Several evaluation metrics have been proposed for topic segmentation. Most of them rely on the paradigm that segmentation is mainly a task that detects boundaries, and thus are oriented on boundary detection evaluation. Nevertheless, this paradigm is not appropriate to get homogeneous chapters, which is one of the major applications of topic segmentation. For instance on Broadcast News, topic s...

متن کامل

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید