Latent Constraints on Unsupervised Text-Graph Alignment with Information Asymmetry
نویسندگان
چکیده
Unsupervised text-graph alignment (UTGA) is a fundamental task that bidirectionally generates texts and graphs without parallel data. Most available models of UTGA suffer from information asymmetry, common phenomenon include additional invisible to each other. On the one hand, these fail supplement asymmetric effectively due lack ground truths. other it challenging indicate with explicit indicators because cannot be decoupled data directly. To address challenge posed by we propose assumption encoded in unobservable latent variables only affects one-way generation processes. These corresponding should obey prior distributions recovered approximately original Therefore, first taxonomy variable classifies into transferrable (TV) non-transferable (NTV) further distinguish NTV as dependent (DV) independent (IV). Next, three VAE-based regularizations on TV, DV, IV constrain their well-designed introduce enhance preservation shared contents. Finally, impose proposed constraints cycle-consistent learning framework, back-translation (BT), named ConstrainedBT. Experimental results tasks demonstrate effectiveness ConstrainedBT information-asymmetric challenge.
منابع مشابه
Unsupervised Text Syllabification with Information from Audio
The objectives of this internship were: studies on supervised and unsupervised text syllabification techniques, with a focus on unsupervised methods, implementation and evaluation of several unsupervised audio syllabification algorithms, implementation and evaluation of an unsupervised text syllabification algorithms, improve the performance of text syllabification with features from acoustics....
متن کاملLearning Frames from Text with an Unsupervised Latent Variable Model
We develop a probabilistic latent-variable model to discover semantic frames—types of events or relations and their participants—from corpora. Our key contribution is a model in which (1) frames are latent categories that explain the linking of verb-subject-object triples in a given document context; and (2) cross-cutting semantic word classes are learned, shared across frames. We also introduc...
متن کاملUnsupervised Alignment of Actions in Video with Text Descriptions
Advances in video technology and data storage have made large scale video data collections of complex activities readily accessible. An increasingly popular approach for automatically inferring the details of a video is to associate the spatiotemporal segments in a video with its natural language descriptions. Most algorithms for connecting natural language with video rely on pre-aligned superv...
متن کاملLatent Semantic Matching: Application to Cross-language Text Categorization without Alignment Information
Unsupervised object matching (UOM) is a promising approach to cross-language natural language processing such as bilingual lexicon acquisition, parallel corpus construction, and cross-language text categorization, because it does not require labor-intensive linguistic resources. However, UOM only finds one-to-one correspondences from data sets with the same number of instances in source and tar...
متن کاملEnhancing Multilingual Latent Semantic Analysis with Term Alignment Information
Latent Semantic Analysis (LSA) is based on the Singular Value Decomposition (SVD) of a term-by-document matrix for identifying relationships among terms and documents from cooccurrence patterns. Among the multiple ways of computing the SVD of a rectangular matrix X, one approach is to compute the eigenvalue decomposition (EVD) of a square 2 × 2 composite matrix consisting of four blocks with X ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence
سال: 2023
ISSN: ['2159-5399', '2374-3468']
DOI: https://doi.org/10.1609/aaai.v37i11.26600