Invited Talk: The Case for Universal Dependencies
نویسنده
چکیده
Universal Dependencies is a recent initiative to develop a linguistically informed, cross-linguistically consistent dependency grammar analysis and treebanks for many languages, with the goal of enabling multilingual natural language processing applications of parsing and natural language understanding. I outline the needs behind the initiative and how some of the design principles follow from these requirements. I suggest that the design of Universal Dependencies tries to optimize a quite subtle trade-off between a number of goals: an analysis which is reasonably satisfactory on linguistic grounds, an analysis that is reasonably comprehensible to non-linguist users, an analysis which can be automatically applied with good accuracy, and an analysis which supports language understanding tasks, such as relation extraction. I suggest that this is best achieved by a simple, fairly spartan lexicalist approach, which focuses on capturing a level of analysis of (syntactic) grammatical relations, something that can be found similarly defined in many theories of syntax. We take hope from the fact that already many people, coming from quite different syntactic traditions, have felt that Universal Dependencies is near enough to right that they can join the effort and contribute. However, the current proposal is certainly not perfect, and I will also touch on some of the thorny issues and how the current standard might yet be improved.
منابع مشابه
An annotation scheme for Persian based on Autonomous Phrases Theory and Universal Dependencies
A treebank is a corpus with linguistic annotations above the level of the parts of speech. During the first half of the present decade, three treebanks have been developed for Persian either originally or subsequently based on dependency grammar: Persian Treebank (PerTreeBank), Persian Syntactic Dependency Treebank, and Uppsala Persian Dependency Treebank (UPDT). The syntactic analysis of a sen...
متن کاملWhy We Must Talk About Institutional Corruption to Understand Wrongdoing in the Health Sector; Comment on “We Need to Talk About Corruption in Health Systems”
While various forms of corruption are common in many health systems around the world, defining wrongdoing in terms of legality and the use of public office for private gain obstructs our understanding of its nature and intractability. To address this, I suggest, we must not only break the silence about the extent of wrongdoing in the health sector, but also talk differe...
متن کاملطرح همگانی یادگیری برای دانشآموزان با نیازهای ویژه
Background: Universal design for learning (UDL) has become a popular instructional approach in special education with the growing awareness of the necessities to providing access to the general curriculum for individuals with special needs. The aim of UDL is to reduce all potential barriers to learning and enhance learning opportunities for students with special needs. Universal design for lear...
متن کاملSchema Mappings and Data Examples: Deriving Syntax from Semantics (Invited Talk)
Schema mappings are high-level specifications that describe the relationship between two database schemas. Schema mappings are considered to be the essential building blocks in such critical data interoperability tasks as data exchange and data integration. For this reason, they have been the focus of extensive research investigations over the past several years. Since in real-life applications...
متن کاملJet Production in Deep Inelastic Scattering at HERA ∗
Two-jet cross sections in deep inelastic scattering at HERA are calculated in next-to-leading order. The importance of higher order corrections and recombination scheme dependencies is studied for various jetalgorithms. Some implications for the determination of αs(μ 2 R), the determination of the gluon density and the associated forward jet production in the low x regime at HERA are briefly di...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2015