Fast Computation of Subpath Kernel for Trees
Abstract
The kernel method is a popular approach to analyzing structured data such as sequences, trees, and graphs; however, unordered trees have not been investigated extensively. Kimura et al. (2011) proposed a kernel function for unordered trees on the basis of their subpaths, which are vertical substructures of trees that carry their hierarchical information. Their kernel performs well in practice in terms of both accuracy and speed; however, linear-time computation is not theoretically guaranteed, unlike the other unordered tree kernel, proposed by Vishwanathan and Smola (2003). In this paper, we propose a kernel computation algorithm that is guaranteed to run in linear time and is also fast in practice, and we present an efficient prediction algorithm whose running time depends only on the size of the input tree. Experimental results show that the proposed algorithms are quite efficient in practice.
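To make the notion of a subpath concrete, the following is a minimal sketch of a naive subpath kernel, not the linear-time algorithm proposed in the paper. It assumes that a tree is a nested `(label, children)` tuple, that a subpath is any contiguous downward sequence of node labels, and that the kernel sums the products of subpath counts in the two trees. The function names and the optional `decay` weight are hypothetical choices made for illustration.

```python
from collections import Counter

# Assumption of this sketch: a tree node is a tuple (label, [child nodes]).

def subpath_counts(tree):
    """Count every contiguous downward path of labels (subpath) in the tree."""
    counts = Counter()

    def visit(node, open_paths):
        label, children = node
        # Extend each path that ends at the parent, and start a new path here.
        paths = [p + (label,) for p in open_paths] + [(label,)]
        counts.update(paths)
        for child in children:
            visit(child, paths)

    visit(tree, [])
    return counts


def subpath_kernel(t1, t2, decay=1.0):
    """Naive kernel: sum over shared subpaths of decay**length * count1 * count2.

    This enumerates all subpaths explicitly, so its cost grows with the
    number of (node, ancestor) pairs; it is not a linear-time method.
    """
    c1, c2 = subpath_counts(t1), subpath_counts(t2)
    return sum(decay ** len(p) * n * c2[p] for p, n in c1.items())


if __name__ == "__main__":
    t1 = ("a", [("b", []), ("c", [])])
    t2 = ("a", [("b", [])])
    print(subpath_kernel(t1, t2))
```

Because child order never enters the counts, the sketch treats trees as unordered, which matches the setting described in the abstract.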
Similar resources
Efficient Sentence Retrieval Based on Syntactic Structure
This paper proposes an efficient method of sentence retrieval based on syntactic structure. Collins proposed Tree Kernel to calculate structural similarity. However, structural retrieval based on Tree Kernel is not practicable because the size of the index table by Tree Kernel becomes impractical. We propose more efficient algorithms approximating Tree Kernel: Tree Overlapping and Subpath Set. T...
A Subpath Kernel for Learning Hierarchical Image Representations
Tree kernels have demonstrated their ability to deal with hierarchical data, as the intrinsic tree structure often plays a discriminative role. While such kernels have been successfully applied to various domains such as natural language processing and bioinformatics, they mostly concentrate on ordered trees whose nodes are described by symbolic data. Meanwhile, hierarchical representations ...
A Parallel N-Body Data Mining Framework
The N-body or multi-tree approach for accelerating data mining methods has spurred some of the fastest known solutions for a significant class of fundamental methods. We present a standard mathematical model and an associated programming model that allow these problems to be scaled further via parallelization, without significant extra programmer effort. With the framework, we derive a strategy f...
Approximate Kernels for Trees
Convolution kernels for trees provide effective means for learning with tree-structured data, such as parse trees of natural language sentences. Unfortunately, the computation time of tree kernels is quadratic in the size of the trees, as all pairs of nodes need to be compared: large trees render convolution kernels inapplicable. In this paper, we propose a simple but efficient approximation tech...
Effects of Deficit and Cutoff Irrigation During Different Phenological Stages of Fruit Growth on Production in Mature Almond Trees cv. ‘Mamaei’
Regulated deficit irrigation (RDI) is commonly used during different phenological stages of fruit growth and development in almond trees to reduce the amount of irrigation water applied without or with only very small reductions in yield. Therefore, to study the effects of deficit and cutoff irrigation during different phenological stages of fruit growth and development in almond cv. “Mamaei” p...
Journal title:
- CoRR
Volume abs/1206.4642, Issue
Pages -
Publication date 2012