نتایج جستجو برای: text linguistic

تعداد نتایج: 208900  

2014
Dani Yogatama Noah A. Smith

We introduce three linguistically motivated structured regularizers based on parse trees, topics, and hierarchical word clusters for text categorization. These regularizers impose linguistic bias in feature weights, enabling us to incorporate prior knowledge into conventional bagof-words models. We show that our structured regularizers consistently improve classification accuracies compared to ...

1998
Hajime Mochizuki Takeo Honda Manabu Okumura

In general, a certain range of sentences in a text, is widely assumed to form a coherent unit which is called a discourse segment. Identifying the segment boundaries is a first step to recognize the structure of a text. In this paper, we describe a method for identifying segment boundaries of a Japanese text with the aid of multiple surface linguistic cues, though our experiments might be small...

2005
Kevin R. Gee Diane J. Cook

Inspired by the goal to more accurately classify text, we describe an effort to map tokens and their characteristic linguistic elements into a graph and use that expressive representation to classify text phrases. We outperform the bag-of-words approach by exploiting word order and the semantic and syntactic characteristics within the phases. In this study, we map tagged corpora into a placehol...

1984
Yutaka Kusanagi

This paper proposes a system of mapping classes of syntactic structures as instruments for automatic text understanding. The system illustrated in Japanese consists of a set of verb classes and information on mapping them together with noun phrases, tense and aspect. The system. having information on direction of possible inferences between the verb classes with information on tense and aspect,...

2012
Miriam Buendía-Castro Beatriz Sánchez-Cárdenas

This paper outlines a proposal for encoding and describing verb phrase constructions in the knowledge base on the environment EcoLexicon, with the objective of helping translators in specialized text production. In order to be able to propose our own template, the characteristics and limitations of the most representative terminographic resources that include phraseological information were ana...

2006
Olivier Blanc Matthieu Constant

We present Outilex, a generalist linguistic platform for text processing. The platform includes several modules implementing the main operations for text processing and is designed to use large-coverage Language Resources. These resources (dictionaries, grammars, annotated texts) are formatted into XML, in accordance with current standards. Evaluations on efficiency are given.

1987
Laurence Danlos

The last 30 years have witnessed the emergence and growth of the discipline Computational Linguistics. This field, which is also known as linguistic data processing, language data processing, natural language processing, and information linguistics, reflects the interests of many different disciplines ranging from philology, linguistics, computer and information science, psychology, and psychol...

2008
Erin Fitzgerald Frederick Jelinek

The output of a speech recognition system is not always ideal for subsequent downstream processing, in part because speakers themselves often make mistakes. A system would accomplish speech reconstruction of its spontaneous speech input if its output were to represent, in flawless, fluent, and content-preserving English, the message that the speaker intended to convey. These cleaner speech tran...

Journal: :Decision Support Systems 2008
Kristof Coussement Dirk Van den Poel

Customer complaint management is becoming a critical key success factor in today's business environment. This study introduces a methodology to improve complaint-handling strategies through an automatic email-classification system that distinguishes complaints from non-complaints. As such, complaint handling becomes less time-consuming and more successful. The classification system combines tra...

2013
Laura K. Varner G. Tanner Jackson Erica L. Snow Danielle S. McNamara

This study investigates methods to automatically assess the features of content texts within an intelligent tutoring system (ITS). Coh-Metrix was used to calculate linguistic indices for texts (n = 66) within the reading strategy ITS, iSTART. Coh-Metrix indices for the system texts were compared to students’ (n = 126) self-explanation scores to examine the degree to which linguistic indices pre...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید