NLP for Answer Extraction in Technical Domains

نویسندگان

  • Diego Mollá
  • Rolf Schwitter
  • Fabio Rinaldi
  • James Dowdall
  • Michael Hess
چکیده

In this paper we argue that questionanswering (QA) over technical domains is distinctly different from TREC-based QA or Web-based QA and it cannot benefit from data-intensive approaches. Technical questions arise in situations where concrete problems require specific answers and explanations. Finding a justification of the answer in the context of the document is essential if we have to solve a real-world problem. We show that NLP techniques can be used successfully in technical domains for high-precision access to information stored in documents. We present ExtrAns, an answer extraction system over technical domains, its architecture, its use of logical forms for answer extractions and how terminology extraction becomes an important part of the system.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A New Method for Improving Computational Cost of Open Information Extraction Systems Using Log-Linear Model

Information extraction (IE) is a process of automatically providing a structured representation from an unstructured or semi-structured text. It is a long-standing challenge in natural language processing (NLP) which has been intensified by the increased volume of information and heterogeneity, and non-structured form of it. One of the core information extraction tasks is relation extraction wh...

متن کامل

Answer Extraction Towards Better Evaluations Of NLP Systems

We argue that reading comprehension tests are not particularly suited for the evaluation of NLP systems. Reading comprehension tests are specifically designed to evaluate human reading skills, and these require vast amounts of world knowledge and common-sense reasoning capabilities. Experience has shown that this kind of full-fledged question answering (QA) over texts from a wide range of domai...

متن کامل

Terminology as Knowledge in Answer Extraction

It is well known that one of the greatest hurdles in automatically processing technical documentation is the large amount of specific terminology that characterizes these domains. Terminology poses two major challenges to the developers of NLP applications: how to identify domain specific terms in the documents and how to efficiently process them. In this paper we will present methodologies tha...

متن کامل

Explotación computacional del metalenguaje en corpus especializados para la generación de lexicones no convencionales

This paper presents the application of automatic analysis (of statistical and symbolic nature) for the detection and processing of metalanguage in highly technical texts from various domains. The selective metalinguistic information extraction performed by the MOP system allows compilation of non-conventional lexicons to aid domain-restricted NLP.

متن کامل

Question answering from structured knowledge sources

We present an implemented approach for domain-restricted question answering from structured knowledge sources, based on robust semantic analysis in a hybrid NLP system architecture. We perform question interpretation and answer extraction in an architecture that builds on a lexical-conceptual structure for question interpretation, which is interfaced with domain-specific concepts and properties...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003