inter rater reliability

Search results for: inter rater reliability

Number of results: 255783

Human Similarity Judgments: Implications for the Design of Formal Evaluations

2007

M. Cameron Jones J. Stephen Downie Andreas F. Ehmann

This paper presents findings from a series of analyses of human similarity judgments from the Symbolic Melodic Similarity and Audio Music Similarity tasks of the Music Information Retrieval Evaluation Exchange (MIREX) 2006. The categorical judgment data generated by the evaluators are analyzed with regard to judgment stability, inter-grader reliability, and patterns of disagreement, both within...


A Political News Corpus in Chinese for Opinion Analysis

2008

Benjamin Ka-Yin T'sou Bin Lu

In this paper, we present an annotated corpus of political election news in Chinese for opinion analysis, and discuss some issues in the manual annotation process. The annotation scheme is described with examples, and inter-annotator agreement is explored for different levels of annotation: expression, sentence and document.


Creating a Test Corpus of Clinical Notes Manually Tagged for Part-of-Speech Information

2004

Serguei V. S. Pakhomov Anni Coden Christopher G. Chute

This paper presents a project whose main goal is to construct a corpus of clinical text manually annotated for part-of-speech information. We describe and discuss the process of training three domain experts to perform linguistic annotation. We list some of the challenges as well as encouraging results pertaining to inter-rater agreement and consistency of annotation. We also present preliminar...


Relating PhonePass overall scores to the Council of Europe Framework level descriptors

2002

John de Jong Jared Bernstein

This study is a preliminary report on an experiment relating PhonePass SET-10 scores to the scale of level descriptors in the Council of Europe Framework. This scale describes the content and level of second language proficiency from a functional communicative perspective. Speech samples from 121 non-native speakers of English were: (1) scored in SET-10, the automatic test of spoken English, an...


Erlangen-CLP: A Large Annotated Corpus of Speech from Children with Cleft Lip and Palate

2014

Tobias Bocklet Andreas K. Maier Korbinian Riedhammer Ulrich Eysholdt Elmar Nöth

In this paper we describe Erlangen-CLP, a large speech database of children with Cleft Lip and Palate. More than 800 German children with CLP (most of them between 4 and 18 years old) and 380 age-matched control speakers spoke the semi-standardized PLAKSS test, which consists of words with all German phonemes in different positions. So far, 250 CLP speakers have been manually transcribed, 120 of these ...


[A systematic social observation tool: methods and results of inter-rater reliability].

Journal: Cadernos de saude publica, 2013

Eulilian Dias de Freitas Vitor Passos Camargos César Coelho Xavier Waleska Teixeira Caiaffa Fernando Augusto Proietti

Systematic social observation has been used as a health research methodology for collecting information from the neighborhood physical and social environment. The objectives of this article were to describe the operationalization of direct observation of the physical and social environment in urban areas and to evaluate the instrument's reliability. The systematic social observation instrument ...


THE ASSESSMENT OF INTERRATER AGREEMENT FOR MULTIPLE ATTRIBUTE RESPONSES

2008

Lawrence L. Kupper Kerry B. Hafner


Assessing performance outcomes of new graduates utilizing simulation in a military transition program.

Journal: Journal for nurses in professional development, 2013

Robie V Hughes Sherrill J Smith Clair M Sheffield Grady Wier

This multi-site, quasi-experimental study examined the performance outcomes of nurses (n = 152) in a military nurse transition program. A modified-performance instrument was used to assess participants in two high-fidelity simulation scenarios. Although results indicated a significant increase in scores posttraining, only moderate interrater reliability results were found for the new instrument...


Annotating Opinions in the World Press

2003

Theresa Wilson Janyce Wiebe

In this paper we present a detailed scheme for annotating expressions of opinions, beliefs, emotions, sentiment and speculation (private states) in the news and other discourse. We explore inter-annotator agreement for individual private state expressions, and show that these low-level annotations are useful for producing higher-level subjective sentence annotations.


Perceptual evaluation of voice quality: review, tutorial, and a framework for future research.

Journal: Journal of speech and hearing research, 1993

J Kreiman B R Gerratt G B Kempster A Erman G S Berke

The reliability of listeners' ratings of voice quality is a central issue in voice research because of the clinical primacy of such ratings and because they are the standard against which other measures are evaluated. However, an extensive literature review indicates that both intrarater and interrater reliability fluctuate greatly from study to study. Further, our own data indicate that rating...

