Methods and techniques to automatic entity linking in Russian

نویسندگان

چکیده

Nowadays, there is a growing interest in solving NLP tasks using external knowledge storage, for example, information retrieval, question-answering systems, dialogue etc. Thus it important to establish relations between entities the processed text and base. This article devoted entity linking, where Wikidata used as an We consider scientific terms Russian entities. Traditional linking system has three stages: recognition, candidates (from base) generation, candidate ranking. Our takes raw with defined input. To generate we use string match input from Wikidata. The ranking stage most complicated one because requires semantic information. Several experiments different models were conducted, including approach based on cosine similarity, classical machine learning algorithms, neural networks. Also, extended RUSERRC dataset, adding manually annotated data model training. results showed that similarity leads better compared others doesn’t require data. dataset are open-sourced available other researchers.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automatic Generation of Benchmarks for Entity Recognition and Linking

The velocity dimension of Big Data plays an increasingly important role in processing unstructured data. Heretofore, no large-scale benchmarks were available to evaluate the performance of named entity recognition and entity linking solutions. This unavailability was due to the creation of gold standards for named entity recognition and entity linking being a time-intensive, costly and error-pr...

متن کامل

UBC Entity Discovery and Linking & Diagnostic Entity Linking

This paper describe the runs submitted by the UBC team at TAC-KBP 2014 for both English Entity Discovery and Linking (EDL) and Diagnostic Entity Linking (DEL) tasks. Our main interest was to compare the performance between two totally different name entity recognizer systems and to combine them with three different name entity disambiguation systems that were developed for the TACKBP 2013 EL ta...

متن کامل

Trust, but Verify! Better Entity Linking through Automatic Verification

We introduce automatic verification as a post-processing step for entity linking (EL). The proposed method trusts EL system results collectively, by assuming entity mentions are mostly linked correctly, in order to create a semantic profile of the given text using geospatial and temporal information, as well as fine-grained entity types. This profile is then used to automatically verify each li...

متن کامل

ELES: Combining Entity Linking and Entity Summarization

The automatic annotation of textual content with entities from a knowledge base is a well established field. Applications, such as DBpedia Spotlight and GATE enable to identify and disambiguate entities of text at high levels of accuracy. The output of such systems can be used in many different ways. One way is to show knowledge panels which provide a fact-based summary of an entity and provide...

متن کامل

specialized methods to teach spelling: comparing three methods

چکیده: بررسی ادبیات مربوطه در کشور در زمینه یادگیری زبان انگلیسی نشان می‎دهد که علی‎رغم اهمیت املا در فرآیند یادگیری به طور عام و یادگیری زبان انگلیسی به طور خاص، این مولفه از جایگاهی متناسب با اهمیت آن برخوردار نیست و عمدتاً نادیده گرفته شده است. تحقیقات گسترده‎ای در خارج از کشور در مورد ماهیت این مولفه صورت گرفته است، در حالی که به جرأت می‎توان گفت در داخل کشور گامی در مورد درک ماهیت آن و فرآی...

15 صفحه اول

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Trudy Instituta sistemnogo programmirovaniâ

سال: 2022

ISSN: ['2079-8156', '2220-6426']

DOI: https://doi.org/10.15514/ispras-2022-34(4)-13