Automating Multi-Level Annotations of Orthographic Properties of German Words and Children’s Spelling Errors

Author

  • Ronja Laarmann-Quante
Abstract

This paper presents the automatic annotation of orthographic properties of German words and of spelling errors in texts written by German primary school children, following a new multi-layered annotation scheme [1]. The scheme is closely linked to the principles of the German writing system and is intended to support new research questions about the relationship between the spelling errors of competent and less competent spellers and the regularities of the German graphematic system. One novelty of the automatic annotation is that it takes the intended, correctly spelled word as input and applies a set of rules to generate a list of error candidates containing systematic spelling errors. A further novelty is the annotation of additional word- and error-related properties, such as whether a spelling error changes the word's pronunciation and whether a spelling can be derived from a related word form. This enables more detailed analyses of the errors and also allows us to develop an application for learners that generates automatic advice on the correct spelling. A first evaluation shows that the automatic annotation of the presented categories and features can come close to human annotations.
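
The idea of generating error candidates from the intended word can be illustrated with a minimal sketch. The four rules below, their names, and the function names are hypothetical simplifications chosen for illustration; they are not the categories or the implementation of the annotation scheme [1].

```python
import re
from typing import Callable, Dict, List

# NOTE: all rules below are illustrative assumptions, not the scheme's actual rules.

def drop_doubled_consonant(word: str) -> List[str]:
    """Write a doubled consonant only once, e.g. 'kommen' -> 'komen'."""
    return [word[:m.start()] + m.group(1) + word[m.end():]
            for m in re.finditer(r"([bdfgklmnprst])\1", word)]

def simplify_ie(word: str) -> List[str]:
    """Write long <ie> as plain <i>, e.g. 'Liebe' -> 'Libe'."""
    return [word[:m.start()] + "i" + word[m.end():]
            for m in re.finditer("ie", word)]

def write_final_devoicing(word: str) -> List[str]:
    """Spell a word-final voiced stop as it is pronounced, e.g. 'Hund' -> 'Hunt'."""
    mapping = {"b": "p", "d": "t", "g": "k"}
    if word and word[-1] in mapping:
        return [word[:-1] + mapping[word[-1]]]
    return []

def drop_lengthening_h(word: str) -> List[str]:
    """Omit a vowel-lengthening <h> before l, m, n, r, e.g. 'fahren' -> 'faren'."""
    return [word[:m.start()] + word[m.end():]
            for m in re.finditer(r"(?<=[aeiouäöü])h(?=[lmnr])", word)]

RULES: Dict[str, Callable[[str], List[str]]] = {
    "doubled_consonant_omitted": drop_doubled_consonant,
    "ie_written_as_i": simplify_ie,
    "final_devoicing_written": write_final_devoicing,
    "lengthening_h_omitted": drop_lengthening_h,
}

def generate_candidates(target: str) -> Dict[str, List[str]]:
    """Map each misspelling candidate of `target` to the rule names that produce it."""
    candidates: Dict[str, List[str]] = {}
    for name, rule in RULES.items():
        for candidate in rule(target):
            candidates.setdefault(candidate, []).append(name)
    return candidates

def annotate(target: str, observed: str) -> List[str]:
    """Return the rule names explaining `observed`, or an empty list if none match."""
    return generate_candidates(target).get(observed, [])
```

In this sketch, an observed spelling is labelled by looking it up among the candidates generated from the target word, so a spelling that no rule produces simply receives no label.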


Similar resources

Annotating Spelling Errors in German Texts Produced by Primary School Children

We present a new multi-layered annotation scheme for orthographic errors in freely written German texts produced by primary school children. The scheme is closely linked to the German graphematic system and defines categories for both general structural word properties and error-related properties. Furthermore, it features multiple layers of information which can be used to evaluate an error. Th...


Design and implementation of Persian spelling detection and correction system based on Semantic

The Persian language has special features (grapheme, homophone, and multi-shape clinging characters) on electronic devices. Furthermore, the design and implementation of NLP tools for Persian are more challenging than for other languages (e.g. English or German). Spelling tools are widely used for editing user texts such as emails and text in editors. Also developing Persian tools will provide Persian progr...


Children’s written and oral spelling

For adults, written spelling is generally superior to oral spelling. To determine whether the same holds true for children in kindergarten through second grade, we compared children’s ability to spell real words (Experiment 1) and nonsense words (Experiment 2) orally and in writing. Building on the work of Tangel and Blachman (1992, 1995) and others, we developed a reliable system to assess the...


Learning to spell in Hebrew: Phonological and morphological factors

This paper investigates children’s developing knowledge of the Hebrew spelling system in view of the claim that language-specific typology affects the rate and the pattern of development of orthographic spelling. Hebrew is a morphologically synthetic language with a phonologically “deep” orthography, on the one hand, and a cons...


Children's Oral Reading Corpus (CHOREC): Description and Assessment of Annotator Agreement

Within the scope of the SPACE project, the CHildren’s Oral REading Corpus (CHOREC) is developed. This database contains recorded, transcribed and annotated read speech (42 GB or 130 hours) of 400 Dutch-speaking elementary school children with or without reading difficulties. Analyses of inter- and intra-annotator agreement are carried out in order to investigate the consistency with which reading...




Publication date: 2016