Automatically Temporal Labeled Data Generation Using Positional Lexicon Expansion for Focus Time Estimation of News Articles
نویسندگان
چکیده
Many facts change over time, which is a fundamental aspect of our physical environment. In the case pandemic articles, user not interested in creation date document, but and cause last pandemic. Fake news can be better combated by having document with temporal focus. Currently, neither sequence events nor focus considered when obtaining documents. Despite limited number aspects available datasets, it difficult to test evaluate conclusions model. The goal this work develop article retrieval model based on co-training advance research semi-supervised learning. A mapping dataset performed using 1) evolving time 2) method coincidence contexts for learning low-dimensional continuous vectors neural contrast embedding models generating time-based query sequential articles facilitate understanding vectors. diverse used effectiveness proposed method. With lexicon expansion, result developed achieve 89%. than previous baselines traditional machine improvements 12.65% 4.7%, respectively.
منابع مشابه
Time-travel Translator: Automatically Contextualizing News Articles
Fully understanding an older news article requires context knowledge from the time of article creation. Finding information about such context is a tedious and time-consuming task, which distracts the reader. Simple contextualization via Wikification is not sufficient here. The retrieved context information has to be time-aware, concise (not full Wiki pages) and focused on the coherence of the ...
متن کاملAutomatically Linking News Articles to Blog Entries
People often write in their blogs about news articles or events in news articles. In this case, however, the details of the news articles or events are often poorly described in such blog entries. Therefore, the readers of blogs need to find the original articles, which contain more details of the news articles, when they want to know about them. In this paper, we propose a method for linking n...
متن کاملAutomatically Labeled Data Generation for Large Scale Event Extraction
Modern models of event extraction for tasks like ACE are based on supervised learning of events from small hand-labeled data. However, hand-labeled training data is expensive to produce, in low coverage of event types, and limited in size, which makes supervised methods hard to extract large scale of events for knowledge base population. To solve the data labeling problem, we propose to automat...
متن کاملEvent Template Generation for News Articles
In this paper we focus on event extraction from Tamil news article. This system utilizes a scoring scheme for extracting and grouping event-specific sentences. Using this scoring scheme eventspecific clustering is performed for multiple documents. Events are extracted from each document using a scoring scheme based on feature score and condition score. Similarly event specific sentences are clu...
متن کاملSemantic Data Mining of Financial News Articles
Subgroup discovery aims at constructing symbolic rules that describe statistically interesting subsets of instances with a chosen property of interest. Semantic subgroup discovery extends standard subgroup discovery approaches by exploiting ontological concepts in rule construction. Compared to previously developed semantic data mining systems SDM-SEGS and SDM-Aleph, this paper presents a gener...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: ACM Transactions on Asian and Low-Resource Language Information Processing
سال: 2022
ISSN: ['2375-4699', '2375-4702']
DOI: https://doi.org/10.1145/3568164