text linguistic

Search results for: text linguistic

Number of results: 208900

Linguistic Structured Sparsity in Text Categorization

2014

Dani Yogatama Noah A. Smith

We introduce three linguistically motivated structured regularizers based on parse trees, topics, and hierarchical word clusters for text categorization. These regularizers impose linguistic bias in feature weights, enabling us to incorporate prior knowledge into conventional bag-of-words models. We show that our structured regularizers consistently improve classification accuracies compared to ...


Text Segmentation with Multiple Surface Linguistic Cues

1998

Hajime Mochizuki Takeo Honda Manabu Okumura

In general, a certain range of sentences in a text is widely assumed to form a coherent unit called a discourse segment. Identifying the segment boundaries is a first step toward recognizing the structure of a text. In this paper, we describe a method for identifying segment boundaries of a Japanese text with the aid of multiple surface linguistic cues, though our experiments might be small...


Text Classification Using Graph-Encoded Linguistic Elements

2005

Kevin R. Gee Diane J. Cook

Inspired by the goal of classifying text more accurately, we describe an effort to map tokens and their characteristic linguistic elements into a graph and use that expressive representation to classify text phrases. We outperform the bag-of-words approach by exploiting word order and the semantic and syntactic characteristics within the phrases. In this study, we map tagged corpora into a placehol...


Some Linguistic Aspects For Automatic Text Understanding

1984

Yutaka Kusanagi

This paper proposes a system of mapping classes of syntactic structures as instruments for automatic text understanding. The system illustrated in Japanese consists of a set of verb classes and information on mapping them together with noun phrases, tense and aspect. The system, having information on direction of possible inferences between the verb classes with information on tense and aspect,...


Linguistic knowledge for specialized text production

2012

Miriam Buendía-Castro Beatriz Sánchez-Cárdenas

This paper outlines a proposal for encoding and describing verb phrase constructions in the knowledge base on the environment EcoLexicon, with the objective of helping translators in specialized text production. In order to be able to propose our own template, the characteristics and limitations of the most representative terminographic resources that include phraseological information were ana...


Outilex, a Linguistic Platform for Text Processing

2006

Olivier Blanc Matthieu Constant

We present Outilex, a generalist linguistic platform for text processing. The platform includes several modules implementing the main operations for text processing and is designed to use large-coverage Language Resources. These resources (dictionaries, grammars, annotated texts) are formatted into XML, in accordance with current standards. Evaluations on efficiency are given.


The Linguistic Basis of Text Generation

1987

Laurence Danlos

The last 30 years have witnessed the emergence and growth of the discipline Computational Linguistics. This field, which is also known as linguistic data processing, language data processing, natural language processing, and information linguistics, reflects the interests of many different disciplines ranging from philology, linguistics, computer and information science, psychology, and psychol...


Linguistic Resources for Reconstructing Spontaneous Speech Text

2008

Erin Fitzgerald Frederick Jelinek

The output of a speech recognition system is not always ideal for subsequent downstream processing, in part because speakers themselves often make mistakes. A system would accomplish speech reconstruction of its spontaneous speech input if its output were to represent, in flawless, fluent, and content-preserving English, the message that the speaker intended to convey. These cleaner speech tran...


Improving customer complaint management by automatic email classification using linguistic style features as predictors

Journal: Decision Support Systems, 2008

Kristof Coussement Dirk Van den Poel

Customer complaint management is becoming a critical key success factor in today's business environment. This study introduces a methodology to improve complaint-handling strategies through an automatic email-classification system that distinguishes complaints from non-complaints. As such, complaint handling becomes less time-consuming and more successful. The classification system combines tra...


Linguistic Content Analysis as a Tool for Improving Adaptive Instruction

2013

Laura K. Varner G. Tanner Jackson Erica L. Snow Danielle S. McNamara

This study investigates methods to automatically assess the features of content texts within an intelligent tutoring system (ITS). Coh-Metrix was used to calculate linguistic indices for texts (n = 66) within the reading strategy ITS, iSTART. Coh-Metrix indices for the system texts were compared to students’ (n = 126) self-explanation scores to examine the degree to which linguistic indices pre...
