نتایج جستجو برای: types of subtitles

تعداد نتایج: 21170771  

2010
Ondrej Bojar Adam Liska Zdenek Zabokrtský

CzEng 0.9 is the third release of a large parallel corpus of Czech and English. For the current release, CzEng was extended by significant amount of texts from various types of sources, including parallel web pages, electronically available books and subtitles. This paper describes and evaluates filtering techniques employed in the process in order to avoid misaligned or otherwise damaged paral...

2016
Pelin Dogan Markus H. Gross Jean Charles Bazin

In this paper we consider videos (e.g. Hollywood movies) and their accompanying natural language descriptions in the form of narrative sentences (e.g. movie scripts without timestamps). We propose a method for temporally aligning the video frames with the sentences using both visual and textual information, which provides automatic timestamps for each narrative sentence. We compute the similari...

2013
Ernesto López Luis Chiruzzo Dina Wonsever

Pronominal and verbal voseo is a wellestablished variant in spoken language, and also very common in some written contexts web sites, literary works, screenplays or subtitles in Rio de la Plata Spanish. An implementation of Río de la Plata Spanish (including voseo) was made in the open source collaborative system Apertium, whose design is suited for the development of new translation pairs. Thi...

2016
Jörg Tiedemann

OpenSubtitles.org provides a large collection of user contributed subtitles in various languages for movies and TV programs. Subtitle translations are valuable resources for cross-lingual studies and machine translation research. A less explored feature of the collection is the inclusion of alternative translations, which can be very useful for training paraphrase systems or collecting multi-re...

2008
Hugo Meinedo Márcio Viveiros João Paulo da Silva Neto

Broadcast news play an important role in our lives providing access to news, information and entertainment. The existence of subtitles is an important medium for inclusion of people with special needs and also an advantage on noisy and populated environments. In this work we will describe and evaluate a system for subtitling live broadcast news for RTP (Rádio Televisão de Portugal) the Portugue...

2014
David Ameixa Luísa Coheur Pedro Fialho Paulo Quaresma

Even when the role of a conversational agent is well known users persist in confronting them with Out-of-Domain input. This often results in inappropriate feedback, leaving the user unsatisfied. In this paper we explore the automatic creation/enrichment of conversational agents’ knowledge bases by taking advantage of natural language interactions present in the Web, such as movies subtitles. Th...

Journal: :CoRR 2017
Konstantinos Bougiatiotis

In this paper we examine the existence of correlation between movie similarity and low level features from respective movie content. In particular, we demonstrate the extraction of multi-modal representation models of movies based on subtitles, audio and metadata mining. We emphasize our research in topic modeling of movies based on their subtitles. In order to demonstrate the proposed content ...

2005
STELIOS PIPERIDIS IASON DEMIROS PROKOPIS PROKOPIDIS

The expansion of digital television and the increasing demand to manipulate audiovisual content underlie the need for tools and systems that will automate the multilingual subtitle generation process. In this setting the MUSA project aims at providing a system which combines speech recognition, advanced text analysis, and machine translation to help generate multilingual subtitles. In its curre...

2011
Patrick Wambacq Kris Demuynck

This paper describes the use of speech alignment to aid in the process of subtitling Dutch TV programs. The recognizer aligns the audio stream with an existing transcript. The goal is therefore not to transcribe but to generate the correct timing of every word. The system performs subtasks such as audio segmentation, transcript preprocessing, alignment and subtitle compression. The result is no...

2015
Gideon Mendels Erica Cooper Victor Soto Julia Hirschberg Mark J. F. Gales Kate Knill Anton Ragni Haipeng Wang

We describe the use of text data scraped from the web to augment language models for Automatic Speech Recognition and Keyword Search for Low Resource Languages. We scrape text from multiple genres including blogs, online news, translated TED talks, and subtitles. Using linearly interpolated language models, we find that blogs and movie subtitles are more relevant for language modeling of conver...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید