Automatically Temporal Labeled Data Generation Using Positional Lexicon Expansion for Focus Time Estimation of News Articles

نویسندگان

چکیده

Many facts change over time, which is a fundamental aspect of our physical environment. In the case pandemic articles, user not interested in creation date document, but and cause last pandemic. Fake news can be better combated by having document with temporal focus. Currently, neither sequence events nor focus considered when obtaining documents. Despite limited number aspects available datasets, it difficult to test evaluate conclusions model. The goal this work develop article retrieval model based on co-training advance research semi-supervised learning. A mapping dataset performed using 1) evolving time 2) method coincidence contexts for learning low-dimensional continuous vectors neural contrast embedding models generating time-based query sequential articles facilitate understanding vectors. diverse used effectiveness proposed method. With lexicon expansion, result developed achieve 89%. than previous baselines traditional machine improvements 12.65% 4.7%, respectively.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Time-travel Translator: Automatically Contextualizing News Articles

Fully understanding an older news article requires context knowledge from the time of article creation. Finding information about such context is a tedious and time-consuming task, which distracts the reader. Simple contextualization via Wikification is not sufficient here. The retrieved context information has to be time-aware, concise (not full Wiki pages) and focused on the coherence of the ...

متن کامل

Automatically Linking News Articles to Blog Entries

People often write in their blogs about news articles or events in news articles. In this case, however, the details of the news articles or events are often poorly described in such blog entries. Therefore, the readers of blogs need to find the original articles, which contain more details of the news articles, when they want to know about them. In this paper, we propose a method for linking n...

متن کامل

Automatically Labeled Data Generation for Large Scale Event Extraction

Modern models of event extraction for tasks like ACE are based on supervised learning of events from small hand-labeled data. However, hand-labeled training data is expensive to produce, in low coverage of event types, and limited in size, which makes supervised methods hard to extract large scale of events for knowledge base population. To solve the data labeling problem, we propose to automat...

متن کامل

Event Template Generation for News Articles

In this paper we focus on event extraction from Tamil news article. This system utilizes a scoring scheme for extracting and grouping event-specific sentences. Using this scoring scheme eventspecific clustering is performed for multiple documents. Events are extracted from each document using a scoring scheme based on feature score and condition score. Similarly event specific sentences are clu...

متن کامل

Semantic Data Mining of Financial News Articles

Subgroup discovery aims at constructing symbolic rules that describe statistically interesting subsets of instances with a chosen property of interest. Semantic subgroup discovery extends standard subgroup discovery approaches by exploiting ontological concepts in rule construction. Compared to previously developed semantic data mining systems SDM-SEGS and SDM-Aleph, this paper presents a gener...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: ACM Transactions on Asian and Low-Resource Language Information Processing

سال: 2022

ISSN: ['2375-4699', '2375-4702']

DOI: https://doi.org/10.1145/3568164