NaDiR: Naive Distributional Response Generation
نویسندگان
چکیده
This paper describes NaDiR (Naive DIstributional Response generation), a corpus-based system that, from a set of word stimuli as an input, generates a response word relying on association strength and distributional similarity. NaDiR participated in the CogALex 2014 shared task on multiword associations (restricted systems track), operationalizing the task as a ranking problem: candidate words from a large vocabulary are ranked by their average association or similarity to a given set of stimuli. We also report on a number of experiments conducted on the shared task data, comparing first-order models (based on co-occurrence and statistical association) to second-order models (based on distributional similarity).
منابع مشابه
Naive (Bayes) at Forty: The Independence Assumption in Information Retrieval
The naive Bayes classiier, currently experiencing a renaissance in machine learning, has long been a core technique in information retrieval. We review some of the variations of naive Bayes models used for text retrieval and classiication, focusing on the distributional assumptions made about word occurrences in documents.
متن کاملHow to make words with vectors: Phrase generation in distributional semantics
We introduce the problem of generation in distributional semantics: Given a distributional vector representing some meaning, how can we generate the phrase that best expresses that meaning? We motivate this novel challenge on theoretical and practical grounds and propose a simple data-driven approach to the estimation of generation functions. We test this in a monolingual scenario (paraphrase g...
متن کاملMaturation of Lymphocyte Immunophenotypes and Memory T Helper Cell Differentiation During Development in Mice
The goal of this study was to systematically investigate the ontogeny of lymphoid populations throughout postnatal development. In CD-1 mice, peak lymphocyte numbers occurred in blood on postnatal day 10 (d10) including those for natural killers (NK1.1), B cells (CD19), T helper (CD3CD4), naïve T helper (CD4CD62LposCD44low), memory T helper (CD4CD62LnegCD44high), and T cytotoxic (CD3CD8) cells....
متن کاملCortisol and epinephrine control opposing circadian rhythms in T cell subsets.
Pronounced circadian rhythms in numbers of circulating T cells reflect a systemic control of adaptive immunity whose mechanisms are obscure. Here, we show that circadian variations in T cell subpopulations in human blood are differentially regulated via release of cortisol and catecholamines. Within the CD4(+) and CD8(+) T cell subsets, naive cells show pronounced circadian rhythms with a dayti...
متن کاملBag-of-Embeddings for Text Classification
Words are central to text classification. It has been shown that simple Naive Bayes models with word and bigram features can give highly competitive accuracies when compared to more sophisticated models with part-of-speech, syntax and semantic features. Embeddings offer distributional features about words. We study a conceptually simple classification model by exploiting multiprototype word emb...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2014