Linguistic Issues in Language Technology – LiLT
LiLT Volume 2, Issue 3, December 2009. Copyright © 2009, CSLI Publications.
Abstract
To make sense of an utterance, people identify in its linear linguistic expression the concepts and the connections between them. A concept normally has a lexical realization; connections between concepts often do not, but they are perceived even without the benefit of lexical cues. Making these connections – called semantic relations in the field of natural language processing – relies on the form and structure of linguistic expressions, and the concepts these expressions evoke. This implies two levels: the level of the text, the linguistic expression with its form and (grammatical) structure, and the level of the concepts which the speaker wants to convey. An overview of the literature shows that semantic relations are, for pragmatic reasons, a means to an end – extract information, explain the links between the head of a phrase and its arguments, and so on – and that is why they are analyzed from the perspective of what they link. At the text level, the process of semantic relation analysis is informed by syntactic elements – noun phrases, verbs and their arguments, clauses and so on – thus differentiating semantic relations based on the complexity of the syntactic constructions in which their arguments appear. At the conceptual level, the same semantic relation is assigned to pairs of concepts, regardless of their surface expression. The process can be said to disregard the implications of having syntactic constructions of varying complexity correspond to the concepts linked. In this article, we propose to put semantic relations first: analyze them, determine what constraints they place on the concepts they connect, and how those concepts can be lexicalized. Lexicalization takes place via expressions of increasing syntactic complexity: phrases, clauses and multi-clause sentences.
Next, we show how the linguistic phenomena involved in producing different lexicalizations explain – in a systematic manner – how semantic relations can have instances in syntactic constructions of varying complexity. We focus on binary semantic relations between concepts/textual elements within sentences. This kind of analysis leads to a better understanding of the relations themselves and to a systematic account of phenomena related to their occurrence in texts. It reveals some of the assumptions people make and the linguistic gaps they fill when they recognize relations in text. From the computational point of view of text processing, such a solid basis for the analysis of semantic relations adds consistency. Evidence for a particular relation can come from all its instances in a text, regardless of the syntactic form of the concepts it connects. Knowledge of the expected concepts and their syntactic realization may signal the presence of covert or implied information, which we can then work to retrieve. Assigning a semantic relation should be a conscious choice, made with an understanding of the implications such a tag has for both the implicitly and the explicitly expressed elements of a concept.
Related resources
Linguistic Issues in Language Technology LiLT
In this paper, we overview the ways in which computational methods can serve the goals of analysis and theory development in linguistics, and encourage the reader to become involved in the emerging cyberinfrastructure for linguistics. We survey examples from diverse subfields of how computational methods are already being used, describe the current state of the art in cyberinfrastructure for li...
Linguistic Issues in Language Technology – LiLT
Lakoff (1974) argues that affective demonstratives in English are markers of solidarity, with exclamative overtones deriving from their close association with evaluative predication. Focusing on this, we seek to inform these claims using quantitative corpus evidence. Our experiments suggest that affectivity is not limited to specific uses of this, but rather that it arises in a wide range of li...
Linguistic Issues in Language Technology – LiLT
Morphology is a key component for many Language Technology applications. However, morphological relations, especially those relying on the derivation and compounding processes, are often addressed in a superficial manner. In this article, we focus on assessing the relevance of deep and motivated morphological knowledge in Natural Language Processing applications. We first describe an annotation...
Publication date: 2009