Methods and techniques to automatic entity linking in Russian
نویسندگان
چکیده
Nowadays, there is a growing interest in solving NLP tasks using external knowledge storage, for example, information retrieval, question-answering systems, dialogue etc. Thus it important to establish relations between entities the processed text and base. This article devoted entity linking, where Wikidata used as an We consider scientific terms Russian entities. Traditional linking system has three stages: recognition, candidates (from base) generation, candidate ranking. Our takes raw with defined input. To generate we use string match input from Wikidata. The ranking stage most complicated one because requires semantic information. Several experiments different models were conducted, including approach based on cosine similarity, classical machine learning algorithms, neural networks. Also, extended RUSERRC dataset, adding manually annotated data model training. results showed that similarity leads better compared others doesn’t require data. dataset are open-sourced available other researchers.
منابع مشابه
Automatic Generation of Benchmarks for Entity Recognition and Linking
The velocity dimension of Big Data plays an increasingly important role in processing unstructured data. Heretofore, no large-scale benchmarks were available to evaluate the performance of named entity recognition and entity linking solutions. This unavailability was due to the creation of gold standards for named entity recognition and entity linking being a time-intensive, costly and error-pr...
متن کاملUBC Entity Discovery and Linking & Diagnostic Entity Linking
This paper describe the runs submitted by the UBC team at TAC-KBP 2014 for both English Entity Discovery and Linking (EDL) and Diagnostic Entity Linking (DEL) tasks. Our main interest was to compare the performance between two totally different name entity recognizer systems and to combine them with three different name entity disambiguation systems that were developed for the TACKBP 2013 EL ta...
متن کاملTrust, but Verify! Better Entity Linking through Automatic Verification
We introduce automatic verification as a post-processing step for entity linking (EL). The proposed method trusts EL system results collectively, by assuming entity mentions are mostly linked correctly, in order to create a semantic profile of the given text using geospatial and temporal information, as well as fine-grained entity types. This profile is then used to automatically verify each li...
متن کاملELES: Combining Entity Linking and Entity Summarization
The automatic annotation of textual content with entities from a knowledge base is a well established field. Applications, such as DBpedia Spotlight and GATE enable to identify and disambiguate entities of text at high levels of accuracy. The output of such systems can be used in many different ways. One way is to show knowledge panels which provide a fact-based summary of an entity and provide...
متن کاملspecialized methods to teach spelling: comparing three methods
چکیده: بررسی ادبیات مربوطه در کشور در زمینه یادگیری زبان انگلیسی نشان میدهد که علیرغم اهمیت املا در فرآیند یادگیری به طور عام و یادگیری زبان انگلیسی به طور خاص، این مولفه از جایگاهی متناسب با اهمیت آن برخوردار نیست و عمدتاً نادیده گرفته شده است. تحقیقات گستردهای در خارج از کشور در مورد ماهیت این مولفه صورت گرفته است، در حالی که به جرأت میتوان گفت در داخل کشور گامی در مورد درک ماهیت آن و فرآی...
15 صفحه اولذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Trudy Instituta sistemnogo programmirovaniâ
سال: 2022
ISSN: ['2079-8156', '2220-6426']
DOI: https://doi.org/10.15514/ispras-2022-34(4)-13