corpus analysis

نتایج جستجو برای: corpus analysis

تعداد نتایج: 2874375 فیلتر نتایج به سال:

a hybrid accurate alignment method for large persian-english corpus construction based on statistical analysis and lexicon/persian word net

Journal: :international journal of information science and management 0

mohammad bagher dastgheib ph.d. candidate department of computer science and engineering, shiraz university, shiraz, iran seyed mostafa fakhrahmad department of computer science and engineering, shiraz university, shiraz, iran mansour zolghadri jahromi department of computer science and engineering, shiraz university, shiraz, iran

a bilingual corpus is considered as a very important knowledge source and an inevitable requirement for many natural language processing (nlp) applications in which two languages are involved. for some languages such as persian, lack of such resources is much more significant. several applications, including statistical and example-based machine translation needs bilingual corpora, in which lar...

متن کامل

a comparative study of hedging in the introduction and discussion sections of english articles of agriculture written by iranian and non-iranian writers

Journal: :مطالعات زبان و ترجمه 0

حسن سودمند افشار روژین قصلانی بهروز کلانتری

this study set out to investigate the similarities and differences in frequency of incidence and type of hedging devices used in research articles written by iranian and non-iranian writers. for the purposes of the study, a corpus including 40 agriculture articles in english (20 written by iranian and 20 by non-iranian writers) were selected. collection and classification of the hedging devices...

متن کامل

STUDENTS’ TRANSLATIONS OF ADJECTIVAL COMPOUNDS: A CORPUS ANALYSIS

Journal: :РАДОВИ ФИЛОЗОФСКОГ ФАКУЛТЕТА - ФИЛОЛОШКЕ НАУКЕ 2020

متن کامل

Mining Characteristic Patterns for Comparative Music Corpus Analysis

Journal: :Applied Sciences 2020

متن کامل

FB-NEWS15: A Topic-Annotated Facebook Corpus for Emotion Detection and Sentiment Analysis

2016

Lucia C. Passaro Alessandro Bondielli Alessandro Lenci

English. In this paper we present the FBNEWS15 corpus, a new Italian resource for sentiment analysis and emotion detection. The corpus has been built by crawling the Facebook pages of the most important newspapers in Italy and it has been organized into topics using LDA. In this work we provide a preliminary analysis of the corpus, including the most debated news in 2015. Italiano. In questo la...

متن کامل

ارایه یک پیکره‌ پرسش و پاسخ مذهبی در زبان فارسی

ژورنال: پردازش علائم و داده ها 2018

برشبان, یاسمن, میرروشندل, سیدابوالقاسم, یوسفی نسب, حامد,

Question answering system is a field in natural language processing and information retrieval noticed by researchers in these decades. Due to a growing interest in this field of research, the need to have appropriate data sources is perceived. Most researches about developing question answering corpus area have been done in English so far, but in other languages as Persian, the lack of these co...

متن کامل

IUS-LEX-CORPUS: CORPUS MYSTICUM

Journal: :Trans/Form/Ação 2014

متن کامل

A Corpus-based Approach to Taizhou’s Image in English News Media

Journal: :International Journal of Linguistics 2023

The study of national and regional image is a significant topic in language, culture, communication research. However, there has been limited research on the Taizhou City as portrayed foreign media. This employs corpus-based approach to examine Taizhou's English Using online corpus, News Web, researchers created virtual corpus (Taizhou Corpus) containing 98 relevant reports from news analyzed m...

متن کامل

Deriving an Ambiguous Word's Part-of-Speech Distribution from Unannotated Text

2007

Reinhard Rapp

A distributional method for part-of-speech induction is presented which, in contrast to most previous work, determines the part-of-speech distribution of syntactically ambiguous words without explicitly tagging the underlying text corpus. This is achieved by assuming that the word pair consisting of the left and right neighbor of a particular token is characteristic of the part of speech at thi...

متن کامل

SuperCAT: The (New and Improved) Corpus Analysis Toolkit

Journal: :LREC ... International Conference on Language Resources & Evaluation : [proceedings]. International Conference on Language Resources and Evaluation 2016

K. Bretonnel Cohen William A. Baumgartner Irina P. Temnikova

This paper reports SuperCAT, a corpus analysis toolkit. It is a radical extension of SubCAT, the Sublanguage Corpus Analysis Toolkit, from sublanguage analysis to corpus analysis in general. The idea behind SuperCAT is that representative corpora have no tendency towards closure-that is, they tend towards infinity. In contrast, non-representative corpora have a tendency towards closure-roughly,...

متن کامل

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید