Search results for: corpus analysis
Number of results: 2874375
A bilingual corpus is considered a very important knowledge source and an inevitable requirement for many natural language processing (NLP) applications in which two languages are involved. For some languages, such as Persian, the lack of such resources is much more significant. Several applications, including statistical and example-based machine translation, need bilingual corpora, in which lar...
This study set out to investigate the similarities and differences in the frequency and type of hedging devices used in research articles written by Iranian and non-Iranian writers. For the purposes of the study, a corpus of 40 agriculture articles in English (20 written by Iranian and 20 by non-Iranian writers) was selected. Collection and classification of the hedging devices...
English. In this paper we present the FBNEWS15 corpus, a new Italian resource for sentiment analysis and emotion detection. The corpus has been built by crawling the Facebook pages of the most important newspapers in Italy and it has been organized into topics using LDA. In this work we provide a preliminary analysis of the corpus, including the most debated news in 2015. Italiano. In questo la...
Question answering is a field of natural language processing and information retrieval that has attracted researchers' attention in recent decades. Due to growing interest in this field of research, the need for appropriate data sources has become apparent. Most research on developing question answering corpora has been done in English so far, but in other languages such as Persian, the lack of these co...
The study of national and regional image is a significant topic in language, culture, and communication research. However, there has been limited research on Taizhou City as portrayed in foreign media. This study employs a corpus-based approach to examine Taizhou's image in English-language media. Using an online corpus, News Web, the researchers created a virtual corpus (Taizhou Corpus) containing 98 relevant reports from news analyzed m...
A distributional method for part-of-speech induction is presented which, in contrast to most previous work, determines the part-of-speech distribution of syntactically ambiguous words without explicitly tagging the underlying text corpus. This is achieved by assuming that the word pair consisting of the left and right neighbor of a particular token is characteristic of the part of speech at thi...
This paper reports SuperCAT, a corpus analysis toolkit. It is a radical extension of SubCAT, the Sublanguage Corpus Analysis Toolkit, from sublanguage analysis to corpus analysis in general. The idea behind SuperCAT is that representative corpora have no tendency towards closure; that is, they tend towards infinity. In contrast, non-representative corpora have a tendency towards closure; roughly,...