Machine translation: practical issues

نویسنده

  • J. Slocum
چکیده

Many of the problems with (and consequently much of the resistance to) machine translation (MT) has sprung from ignorance: on the part of early developers, who failed to appreciate the magnitude of the problems they faced, and hence made outrageous claims; on the part of opponents, who frequently fail to understand what translation is about, and how it is (properly) performed; on the part of zealots for theoretical formalisms, who seldom if ever try to produce working systems and consequently lack appreciation of the magnitude of the practical problems to be faced; on the part of modern developers, too many of whom know little about translators' needs and other human engineering concerns; and on the part of translators, who often fail to realize the potential for improvement in the quality of their working lives when computers become able to take over much of the 'grunt work' in translation. In this chapter we confront these and other problems. We characterize areas where knowledge has been lacking, and discuss the consequences of such ignorance; provide background knowledge enabling the reader to acquire a more balanced perspective from which to judge the developments in the field; and finally present the author's opinions, based upon his experience as an MT system developer, regarding some of the most important issues in the construction of a practical system. This will entail, in places, an exposition of certain aspects of the underlying technology; however, complete understanding of these technical matters is not a prerequisite for appreciation of the major points.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A new model for persian multi-part words edition based on statistical machine translation

Multi-part words in English language are hyphenated and hyphen is used to separate different parts. Persian language consists of multi-part words as well. Based on Persian morphology, half-space character is needed to separate parts of multi-part words where in many cases people incorrectly use space character instead of half-space character. This common incorrectly use of space leads to some s...

متن کامل

Using Machine Translation for the Localization of Electronic Support Content: Evaluating End-User Satisfaction

This paper discusses how to measure the impact of online content localized by machine translation in meeting the business need of commercial users, i.e., reducing the volume of telephone calls to the Call Center (call deflection). We address various design, conceptual and practical issues encountered in proving the value of machine translation and conclude that the approach that will give the b...

متن کامل

Do we need phrases? Challenging the conventional wisdom in Statistical Machine Translation

We begin by exploring theoretical and practical issues with phrasal SMT, several of which are addressed by syntax-based SMT. Next, to address problems not handled by syntax, we propose the concept of a Minimal Translation Unit (MTU) and develop MTU sequence models. Finally we incorporate these models into a syntax-based SMT system and demonstrate that it improves on the state of the art transla...

متن کامل

How to Overcome the Domain Barriers in Pattern-Based Machine Translation System

One of difficult issues in pattern-based machine translation system is maybe to find how to overcome the domain difference in adapting a system from one domain to other domain. This paper describes how we have resolved such barriers among domains as default target word of any domain, domain-specific patterns, and domain adaptation of engine modules in pattern-based machine translation system, e...

متن کامل

Challenges in Using an Example-Based MT System for a Transnational Digital Government Project

We describe ongoing efforts towards and challenges in using an Example-Based Machine Translation (EBMT) system in the context of a multi-national, multi-university and multi-agency transnational digital government project. The project is aimed at applying information technology to the problem of collecting and sharing information securely in a multilingual context. We report on a number of issu...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011