A Multi-Tier NL-Knowledge Clustering for Classifying Students' Essays

نویسندگان

  • Umarani Pappuswamy
  • Dumisizwe Bhembe
  • Pamela W. Jordan
  • Kurt VanLehn
چکیده

In this paper, we describe a multi-tier Natural Language (NL) clustering approach to text classification for classifying students’ essays for tutoring applications. The main task of the classifier is to map the students’ essay statements into target concepts, namely physics principles and misconceptions. A simple `Bag-Of-Words (BOW)’ classifier using a naïve-Bayes algorithm was unsatisfactory for our purposes as it frequently misclassified due to the semantic relatedness of the NL descriptions of the target concepts. We describe how we used the NL descriptions to define clusters of concepts that reduce the dimensionality of the data when classifying students’ essays. The clustering generated multi-tier tagging schemata (cluster, sub-cluster and class) which led to better classification of the student’s essay.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Supervised Clustering Method for Text Classification

This paper describes a supervised three-tier clustering method for classifying students’ essays of qualitative physics in the Why2-Atlas tutoring system. Our main purpose of categorizing text in our tutoring system is to map the students’ essay statements into principles and misconceptions of physics. A simple `bag-of-words’ representation using a naïve-bayes algorithm to categorize text was un...

متن کامل

A Multi-Objective Approach to Fuzzy Clustering using ITLBO Algorithm

Data clustering is one of the most important areas of research in data mining and knowledge discovery. Recent research in this area has shown that the best clustering results can be achieved using multi-objective methods. In other words, assuming more than one criterion as objective functions for clustering data can measurably increase the quality of clustering. In this study, a model with two ...

متن کامل

The Comparison of Typed and Handwritten Essays of Iranian EFL Students in terms of Length, Spelling, and Grammar

This study attempted to compare typed and handwritten essays of Iranian EFL students in terms of length, spelling, and grammar. To administer the study, the researchers utilized Alice Touch Typing Tutor software to select 15 upper intermediate students with higher ability to write two essays: one typed and the other handwritten. The students were both males and females between the ages of 22 to...

متن کامل

Using Linguistic Features to Predict Readability of Short Essays for Senior High School Students in Taiwan

We investigated the problem of classifying short essays used in comprehension tests for senior high school students in Taiwan. The tests were for first and second year students, so the answers included only four categories, each for one semester of the first two years. A random-guess approach would achieve only 25% in accuracy for our problem. We analyzed three publicly available scores for rea...

متن کامل

Construction of Evaluative Meanings by Kurdish-Speaking Learners of English: A Comparison of High- and Low-Graded Argumentative Essays

Academic writing ability is an important goal that learners of English as a Second Language (ESL) or English as a Foreign Language (EFL) try to attain. While ESL students’ academic writings have been widely explored, owing to few studies investigating appraisal resources in EFL students’ argumentative writing, the gap still exists about EFL students’ academic writing. This study aimed to see ho...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005