Mining Maximal Frequent Contiguous Sequences in Biological Data Sequences
نویسندگان
چکیده
منابع مشابه
Frequent Contiguous Pattern Mining Algorithms for Biological Data Sequences
Transaction sequences in market-basket analysis have large set of alphabets with small length, whereas bio-sequences have small set of alphabets of long length with gap. There is the difference in pattern finding algorithms of these two sequences. The chances of repeatedly occurring small patterns are high in bio-sequences than in the transaction sequences. These repeatedly occurring small patt...
متن کاملFinding All Maximal Frequent Sequences in Text
In this paper we present a novel algorithm for discovering maximal frequent sequences in a set of documents, i.e., such sequences of words that are frequent in the document collection and, moreover, that are not contained in any other longer frequent sequence. A sequence is considered to be frequent if it appears in at least documents, when is the frequency threshold given. Our approach combine...
متن کاملIncremental Mining of Frequent Sequences in Environmental Sensor Data
The mining of sequential patterns in environment sensor data is a challenging task. Most of sequential mining techniques requires periodically complete data. Furthermore, this kind of data can be incomplete, present noises and be sparse in time. Consequently, there is a lack of methods that can mine sequential patterns in sensor data. In this paper, we proposed IncMSTS, an incremental algorithm...
متن کاملDatascope: Mining Biological Sequences
MUCH OF THE WORK IN THE data-mining community defines mining data as a collection of techniques for extracting knowledge out of large databases. This definition is a bit ambiguous because the “knowledge” extracted from databases varies dramatically across systems. In the spirit of intelligent systems, I suggest we define data mining roughly as a collection of techniques that produce representat...
متن کاملAn Efficient and Incremental System to Mine Contiguous Frequent Sequences
Mining frequent patterns is an important component of many prediction systems. One common usage in web applications is the mining of users’ access behavior for the purpose of predicting and hence pre-fetching the web pages that the user is likely to visit. Frequent sequence mining approaches in the literature are often based on the use of an Apriori-like candidate generation strategy, which typ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: International Journal of Contents
سال: 2007
ISSN: 1738-6764
DOI: 10.5392/ijoc.2007.3.2.018