YSDA Participation in the WMT'16 Quality Estimation Shared Task

نویسندگان

  • Anna Kozlova
  • Mariya Shmatova
  • Anton Frolov
چکیده

This paper describes Yandex School of Data Analysis (YSDA) submission for WMT2016 Shared Task on Quality Estimation (QE) / Task 1: Sentence-level prediction of post-editing effort. We solve the problem of quality estimation by using a machine learning approach, where we try to learn a regressor from feature space to HTER score. By enriching the baseline features with the syntactical features and additional translation system based features, we achieve Pearson correlation of 0.525 on the test set.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

FBK-UEdin Participation to the WMT13 Quality Estimation Shared Task

In this paper we present the approach and system setup of the joint participation of Fondazione Bruno Kessler and University of Edinburgh in the WMT 2013 Quality Estimation shared-task. Our submissions were focused on tasks whose aim was predicting sentence-level Human-mediated Translation Edit Rate and sentence-level post-editing time (Task 1.1 and 1.3, respectively). We designed features that...

متن کامل

LIMSI Submission for WMT'14 QE Task

This paper describes LIMSI participation to the WMT’14 Shared Task on Quality Estimation; we took part to the wordlevel quality estimation task for English to Spanish translations. Our system relies on a random forest classifier, an ensemble method that has been shown to be very competitive for this kind of task, when only a few dense and continuous features are used. Notably, only 16 features ...

متن کامل

Black Box Features for the WMT 2012 Quality Estimation Shared Task

In this paper we introduce a number of new features for quality estimation in machine translation that were developed for the WMT 2012 quality estimation shared task. We find that very simple features such as indicators of certain characters are able to outperform complex features that aim to model the connection between two languages.

متن کامل

An Approach Using Style Classification Features for Quality Estimation

In this paper we describe our participation to the WMT13 Shared Task on Quality Estimation. The main originality of our approach is to include features originally designed to classify text according to some author’s style. This implies the use of reference categories, which are meant to represent the quality of the MT output.

متن کامل

Unbabel's Participation in the WMT16 Word-Level Translation Quality Estimation Shared Task

This paper presents the contribution of the Unbabel team to the WMT 2016 Shared Task on Word-Level Translation Quality Estimation. We describe our two submitted systems: (i) UNBABELLINEAR, a feature-rich sequential linear model with syntactic features, and (ii) UNBABEL-ENSEMBLE, a stacked combination of the linear system with three different deep neural networks, mixing feedforward, convolution...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016