نتایج جستجو برای: text length

تعداد نتایج: 467834  

2008
Tomás Flouri Costas S. Iliopoulos Mohammad Sohel Rahman Ladislav Vagner Michal Vorácek

In this paper, we present the Truncated Generalized Suffix Automaton (TGSA) and present an efficient on-line algorithm for its construction. TGSA is a novel type of finite automaton suitable for indexing DNA and RNA sequences, where the text is degenerate i.e. contains sets of characters. TGSA indexes the so called k-factors, the factors of the degenerate text with length not exceeding a given ...

2001
Olivier Boëffard

The best voices in text-to-speech synthesis are currently obtained via acoustic units concatenation-based systems. In such systems, the choice of units whose concatenations will produce an acoustic message is a crucial stage. Moreover, it can be observed that current TTS systems use acoustic units which most often correspond to variable-length phonetic descriptions. In this article, an original...

2005
Björn Hoffmeister Thomas Zeugmann

When dealing with knowledge federation over text documents one has to gure out whether or not documents are related by context. A new approach is proposed to solve this problem. This leads to the design of a new search engine for literature research and related problems. The idea is that one has already some documents of interest. These documents are taken as input. Then all documents known to ...

2010
Alberto Barrón-Cedeño Chiara Basile Mirko Degli Esposti Paolo Rosso

The automatic detection of shared content in written documents –which includes text reuse and its unacknowledged commitment, plagiarism– has become an important problem in Information Retrieval. This task requires exhaustive comparison of texts in order to determine how similar they are. However, such comparison is impossible in those cases where the amount of documents is too high. Therefore, ...

2008
Ryoichi Kato

A new almost linear time algorithm for indexed full-text search is presented. The algorithm uses a modified factor oracle as an index database. A factor oracle is an acyclic automaton made from an input text, and it recognizes all of substrings in the text. It can be built in time proportional to the text length, and it is more space economical and easy to implement comparing similar machinerie...

1999
Takuya Kida Masayuki Takeda Ayumi Shinohara Setsuo Arikawa

This paper considers the Shift-And approach to the problem of pattern matching in LZW compressed text, and gives a new algorithm that solves it. The algorithm is indeed fast when a pattern length is at most 32, or the word length. After an O(m + |Σ|) time and O(|Σ|) space preprocessing of a pattern, it scans an LZW compressed text in O(n + r) time and reports all occurrences of the pattern, whe...

2016
Meenu Bhagat

Word Length( i.e. number of characters )plays an important role in non-word error distribution of typed text .It plays an important role in Natural Language Interfaces, spellchecker, OCR and language related technology development etc .Though considerable work has been done in the area for English and related languages, the Indian Language scenario is still far behind. This paper focuses on the...

2014
Annie Louis Ani Nenkova

Length constraints impose implicit requirements on the type of content that can be included in a text. Here we propose the first model to computationally assess if a text deviates from these requirements. Specifically, our model predicts the appropriate length for texts based on content types present in a snippet of constant length. We consider a range of features to approximate content type, i...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید