Digging for Names in the Mountains: Combined Person Name Recognition and Reference Resolution for German Alpine Texts
نویسندگان
چکیده
In this paper we introduce a module that combines person name recognition and reference resolution for German. Our data consisted of a corpus of Alpine texts. This text type poses special challenges because of a multitude of toponyms, some of which interfere with person names. Our reference resolution algorithm outputs person entities based on their last names and first names along with their associated features (jobs, addresses, academic titles). DOI: https://doi.org/10.1007/978-3-319-08958-4_16 Posted at the Zurich Open Repository and Archive, University of Zurich ZORA URL: https://doi.org/10.5167/uzh-50451 Accepted Version Originally published at: Ebling, S; Sennrich, R; Klaper, D; Volk, Martin (2011). Digging for names in the mountains: Combined person name recognition and reference resolution for German alpine texts. In: 5th Language Technology Conference, Poznan, Poland, 25 November 2011 27 November 2011. DOI: https://doi.org/10.1007/978-3-319-08958-4_16 Digging for Names in the Mountains: Combined Person Name Recognition and Reference Resolution for German Alpine Texts Sarah Ebling, Rico Sennrich, David Klaper, Martin Volk Institute of Computational Linguistics, University of Zurich Binzmühlestrasse 14, 8050 Zurich, Switzerland {ebling,sennrich,volk}@ifi.uzh.ch, [email protected]
منابع مشابه
Challenges in Building a Multilingual Alpine Heritage Corpus
This paper describes our efforts to build a multilingual heritage corpus of alpine texts. Currently we digitize the yearbooks of the Swiss Alpine Club which contain articles in French, German, Italian and Romansch. Articles comprise mountaineering reports from all corners of the earth, but also scientific topics such as topography, geology or glacierology as well as occasional poetry and lyrics...
متن کاملتشخیص اسامی اشخاص با استفاده از تزریق کلمههای نامزد اسم در میدانهای تصادفی شرطی برای زبان عربی
Named Entity Recognition and Extraction are very important tasks for discovering proper names including persons, locations, date, and time, inside electronic textual resources. Accurate named entity recognition system is an essential utility to resolve fundamental problems in question answering systems, summary extraction, information retrieval and extraction, machine translation, video interpr...
متن کاملسیستم شناسایی و طبقه بندی اسامی در متون فارسی
Name entity recognition (NER) is a system that can identify one or more kinds of names in a text and classify them into specified categories. These categories can be name of people, organizations, companies, places (country, city, street, etc.), time related to names (date and time), financial values, percentages, etc. Although during the past decade a lot of researches has been done on NER in ...
متن کاملClassifying Named Entities in an Alpine Heritage Corpus
In the project “Text+Berg" we digitize and archive the heritage of alpine literature from various European countries. In a first step our group digitizes all yearbooks of the Swiss Alpine Club from 1864 until today. The books comprise articles in German, French and Italian, a total of around 100.000 pages. This paper describes the corpus and the project phases towards its digitalization. We the...
متن کاملA new nomenclature for fungi
Important changes brought about by the Melbourne International Code of Nomenclature for Algae,FungiandPlantsare briefly reviewed concerning a clarification of the spelling and typification of sanctioned fungal names, the recognition of electronic publication for the validity of nomenclatural novelties, permission to use English diagnoses or descriptions for their valid publication, and the requ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2011