VideoDubber: Machine Translation with Speech-Aware Length Control for Video Dubbing

نویسندگان

چکیده

Video dubbing aims to translate the original speech in a film or television program into target language, which can be achieved with cascaded system consisting of recognition, machine translation and synthesis. To ensure translated well aligned corresponding video, length/duration should as close possible that speech, requires strict length control. Previous works usually control number words characters generated by model similar source sentence, without considering isochronicity duration words/characters different languages varies. In this paper, we propose VideoDubber, tailored for task video dubbing, directly considers each token translation, match speech. Specifically, sentence guiding prediction word information, including itself how much is left remaining words. We design experiments on four language directions (German -> English, Spanish Chinese English), results show VideoDubber achieves better ability than baseline methods. make up lack real-world datasets, also construct test set collected from films provide comprehensive evaluations task.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Name-aware Machine Translation

We propose a Name-aware Machine Translation (MT) approach which can tightly integrate name processing into MT model, by jointly annotating parallel corpora, extracting name-aware translation grammar and rules, adding name phrase table and name translation driven decoding. Additionally, we also propose a new MT metric to appropriately evaluate the translation quality of informative words, by ass...

متن کامل

Search-Aware Tuning for Machine Translation

Parameter tuning is an important problem in statistical machine translation, but surprisingly, most existing methods such as MERT, MIRA and PRO are agnostic about search, while search errors could severely degrade translation quality. We propose a searchaware framework to promote promising partial translations, preventing them from being pruned. To do so we develop two metrics to evaluate parti...

متن کامل

Algorithms for Syntax-Aware Statistical Machine Translation

All of the non-trivial algorithms that are necessary for building and applying a rudimentary syntax-aware statistical machine translation system are generalized parsers. This paper extends the “translation by parsing” architecture by adding two components that are invariably used by state-of-the-art statistical machine translation systems. First, the paper shows how a generic syntax-aware trans...

متن کامل

Context-Aware Smoothing for Neural Machine Translation

In Neural Machine Translation (NMT), each word is represented as a lowdimension, real-value vector for encoding its syntax and semantic information. This means that even if the word is in a different sentence context, it is represented as the fixed vector to learn source representation. Moreover, a large number of Out-OfVocabulary (OOV) words, which have different syntax and semantic informatio...

متن کامل

Motivating Personality-aware Machine Translation

Language use is known to be influenced by personality traits as well as by sociodemographic characteristics such as age or mother tongue. As a result, it is possible to automatically identify these traits of the author from her texts. It has recently been shown that knowledge of such dimensions can improve performance in NLP tasks such as topic and sentiment modeling. We posit that machine tran...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence

سال: 2023

ISSN: ['2159-5399', '2374-3468']

DOI: https://doi.org/10.1609/aaai.v37i11.26613