نتایج جستجو برای: punctuation
تعداد نتایج: 1336 فیلتر نتایج به سال:
Our visual Voice-Mail-to-Text (VMTT) transcription system takes a conventional voice mail and converts it to formatted text following standard punctuation, capitalization and presentation conventions. The text can then be used in a plethora of applications, from emails, to databases, text messages etc., which in turn allow searching, classification, data extraction, statistical analyses and oth...
Modern statistical dependency parsers assign lexical heads to punctuations as well as words. Punctuation parsing errors lead to low parsing accuracy on words. In this work, we propose an alternative approach to addressing punctuation in dependency parsing. Rather than assigning lexical heads to punctuations, we treat punctuations as properties of their neighbouring words, used as features to gu...
This paper shows that in the context of statistical weblog classification for splog filtering based on n-grams of tokens in the URL, further segmenting the URLs beyond the standard punctuation is helpful. Many splog URLs contain phrases in which the words are glued together in order to avoid splog filtering techniques based on punctuation segmentation and unigrams. A technique which segments lo...
This paper focuses on the task of inserting punctuation symbols into transcribed conversational speech texts, without relying on prosodic cues. We investigate limitations associated with previous methods, and propose a novel approach based on dynamic conditional random fields. Different from previous work, our proposed approach is designed to jointly perform both sentence boundary and sentence ...
In this paper we describe a hybrid approach to Chinese-toEnglish spoken language translation system used for the IWSLT 2006 evaluation campaign. In this system, the phrasebased statistical machine translation (SMT) engine is combined with the template-based machine translation (TBMT) engine and a simple way is proposed to select the best translation from the results generated by the two transla...
This paper presents a linguistic revision process of a speech corpus of Portuguese broadcast news focusing on metadata annotation for rich transcription, and reports on the impact of the new data on the performance for several modules. The main focus of the revision process consisted on annotating and revising structural metadata events, such as disfluencies and punctuation marks. The resultant...
the main priority in literary criticism, is defining a structure and abstaining from imposing personal views. in the framework of discourse analysis theory (style, intonation and punctuation marks are considered) and on the basis of contrastive linguistics, the author has tried to criticize the translation of a story “french lessons” written by valentine rasputin and translated by mrs. maryam s...
Automatic speech transcripts can be made more readable and useful for further processing by enriching them with punctuation marks and other meta-linguistic information. We study in this work how to improve automatic recovery of one of the most difficult punctuation marks, commas, in French and in Czech. We show that commas detection performances are largely improved in both languages by integra...
نمودار تعداد نتایج جستجو در هر سال
با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید