VideoDubber: Machine Translation with Speech-Aware Length Control for Video Dubbing
نویسندگان
چکیده
Video dubbing aims to translate the original speech in a film or television program into target language, which can be achieved with cascaded system consisting of recognition, machine translation and synthesis. To ensure translated well aligned corresponding video, length/duration should as close possible that speech, requires strict length control. Previous works usually control number words characters generated by model similar source sentence, without considering isochronicity duration words/characters different languages varies. In this paper, we propose VideoDubber, tailored for task video dubbing, directly considers each token translation, match speech. Specifically, sentence guiding prediction word information, including itself how much is left remaining words. We design experiments on four language directions (German -> English, Spanish Chinese English), results show VideoDubber achieves better ability than baseline methods. make up lack real-world datasets, also construct test set collected from films provide comprehensive evaluations task.
منابع مشابه
Name-aware Machine Translation
We propose a Name-aware Machine Translation (MT) approach which can tightly integrate name processing into MT model, by jointly annotating parallel corpora, extracting name-aware translation grammar and rules, adding name phrase table and name translation driven decoding. Additionally, we also propose a new MT metric to appropriately evaluate the translation quality of informative words, by ass...
متن کاملSearch-Aware Tuning for Machine Translation
Parameter tuning is an important problem in statistical machine translation, but surprisingly, most existing methods such as MERT, MIRA and PRO are agnostic about search, while search errors could severely degrade translation quality. We propose a searchaware framework to promote promising partial translations, preventing them from being pruned. To do so we develop two metrics to evaluate parti...
متن کاملAlgorithms for Syntax-Aware Statistical Machine Translation
All of the non-trivial algorithms that are necessary for building and applying a rudimentary syntax-aware statistical machine translation system are generalized parsers. This paper extends the “translation by parsing” architecture by adding two components that are invariably used by state-of-the-art statistical machine translation systems. First, the paper shows how a generic syntax-aware trans...
متن کاملContext-Aware Smoothing for Neural Machine Translation
In Neural Machine Translation (NMT), each word is represented as a lowdimension, real-value vector for encoding its syntax and semantic information. This means that even if the word is in a different sentence context, it is represented as the fixed vector to learn source representation. Moreover, a large number of Out-OfVocabulary (OOV) words, which have different syntax and semantic informatio...
متن کاملMotivating Personality-aware Machine Translation
Language use is known to be influenced by personality traits as well as by sociodemographic characteristics such as age or mother tongue. As a result, it is possible to automatically identify these traits of the author from her texts. It has recently been shown that knowledge of such dimensions can improve performance in NLP tasks such as topic and sentiment modeling. We posit that machine tran...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence
سال: 2023
ISSN: ['2159-5399', '2374-3468']
DOI: https://doi.org/10.1609/aaai.v37i11.26613