Partially Supervised Named Entity Recognition via the Expected Entity Ratio Loss
نویسندگان
چکیده
Abstract We study learning named entity recognizers in the presence of missing annotations. approach this setting as tagging with latent variables and propose a novel loss, Expected Entity Ratio, to learn models systematically tags. show that our is both theoretically sound empirically useful. Experimentally, we find it meets or exceeds performance strong state-of-the-art baselines across variety languages, annotation scenarios, amounts labeled data. In particular, significantly outperforms previous methods from Mayhew et al. (2019) Li (2021) by +12.7 +2.3 F1 score challenging only 1,000 biased annotations, averaged 7 datasets. also that, when combined approach, sparse scheme exhaustive for modest budgets.1
منابع مشابه
Supervised Named Entity Recognition for Clinical Data
Clinical Named Entity Recognition is a part of Task 1b, organised by CLEF eHealth organisation in 2015. The aim is to automatically identify clinically relevant entities in medical text in French. A supervised learning approach has been used for training the tagger. For the purpose of training, Conditional Random Fields(CRF) has been used. An extensive set of features was used for training. Pre...
متن کاملNamed Entity Recognition in Persian Text using Deep Learning
Named entities recognition is a fundamental task in the field of natural language processing. It is also known as a subset of information extraction. The process of recognizing named entities aims at finding proper nouns in the text and classifying them into predetermined classes such as names of people, organizations, and places. In this paper, we propose a named entity recognizer which benefi...
متن کاملNamed Entity Chunking Techniques in Supervised Learning for Japanese Named Entity Recognition
This 1)aper focuses on the issue of named entity chunking in Japanese named entity recognition. We apply the SUl)ervised decision list lean> ing method to Japanese named entity recognition. We also investigate and in(:ori)orate several named-entity noun phrase chunking tech.niques and experimentally evaluate and con> t)are their l)erfornlanee, ill addition, we t)rot)ose a method for incorporati...
متن کاملWeakly supervised named entity classification
In this paper, we describe a new method for the problem of named entity classification for specialized or technical domains, using distant supervision. Our approach relies on a simple observation: in some specialized domains, named entities are almost unambiguous. Thus, given a seed list of names of entities, it is cheap and easy to obtain positive examples from unlabeled texts using a simple s...
متن کاملCross-Lingual Named Entity Recognition via Wikification
Named Entity Recognition (NER) models for language L are typically trained using annotated data in that language. We study cross-lingual NER, where a model for NER in L is trained on another, source, language (or multiple source languages). We introduce a language independent method for NER, building on cross-lingual wikification, a technique that grounds words and phrases in nonEnglish text in...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Transactions of the Association for Computational Linguistics
سال: 2021
ISSN: ['2307-387X']
DOI: https://doi.org/10.1162/tacl_a_00429