Identifying Indonesian-core vocabulary for teaching English to Indonesian preschool children: a corpus-based research
Abstract
This corpus-based research builds a corpus of Indonesian children’s storybooks and extracts its frequent content words in order to identify Indonesian-core vocabulary for teaching English to Indonesian preschool children. The data were gathered from 131 Indonesian children’s storybooks, yielding a corpus of 134,320 words. The corpus was processed with the frequency menu in MonoConc Pro, a corpus analysis program. The frequent nouns, verbs, adjectives, and adverbs were selected, and each was then lemmatized. The results showed that the children were already exposed to both ordinary and imaginative concepts, antonym pairs among adjectives, time references, and compound nouns. The narrative discourse clearly influenced the kinds of verbs the children were exposed to.
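The frequency-and-lemmatization step described in the abstract can be illustrated with a minimal Python sketch. This is not the MonoConc Pro procedure the study used: the function name `lemma_frequencies`, the toy lemma table, the stopword set (a crude stand-in for selecting content words), and the sample sentence are all assumptions made for illustration.

```python
import re
from collections import Counter

# Hypothetical toy lemma table (surface form -> lemma). A real study would
# rely on a proper Indonesian morphological resource, not this dictionary.
TOY_LEMMAS = {"berlari": "lari", "berjalan": "jalan"}


def lemma_frequencies(text, lemmas, stopwords=frozenset()):
    """Count how often each lemma occurs in text, skipping stopwords."""
    # Lowercase the text and keep runs of letters only (no digits/underscores).
    tokens = re.findall(r"[^\W\d_]+", text.lower())
    counts = Counter()
    for token in tokens:
        if token in stopwords:
            continue  # assumed stand-in for filtering out function words
        # Map the token to its lemma; unknown tokens count as themselves.
        counts[lemmas.get(token, token)] += 1
    return counts


# Illustrative sample text, not taken from the study's corpus.
sample = "Kucing berlari. Anjing berlari dan kucing berjalan."
freq = lemma_frequencies(sample, TOY_LEMMAS, stopwords={"dan"})
print(freq.most_common(3))
```

In this sketch lemmatization happens during counting, whereas the abstract describes selecting the frequent words first and lemmatizing them afterwards; either order gives per-lemma totals, but the selection thresholds would differ.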
Similar resources
Naturalization in Translation: A Case Study on the Translation of English-Indonesian Medical Terms
Naturalization is a translation procedure that is predominantly utilized in the translation of English medical terms into Indonesian. This study focuses on identifying types of naturalization involving the adjustment of spelling and pronunciation and investigating whether naturalization has been appropriately applied based on the rules in the Indonesian general guidance of term formation. The d...
Cross Lingual Modelling Experiments for Indonesian
The extension of Large Vocabulary Continuous Speech Recognition (LVCSR) to resource poor languages such as Indonesian is hindered by the lack of transcribed acoustic data and appropriate pronunciation lexicons. Research has generally been directed toward establishing robust cross-lingual acoustic models, with the assumption that phonetic lexicons are readily available. This is not the case for ...
GMM-Based Identification of Indonesian Speech
This paper reports the performance of identification of Indonesian speech within a ten-language corpus: English, German, Hungarian, Indonesian, Italian, Korean, Mandarin, Polish, Portuguese, and Swedish. The tasks are performed by implementing a Gaussian Mixture Model (GMM) on Mel-Frequency Cepstral Coefficients (MFCCs). Two types of experiments have been undertaken: pair-wise and ten-language...
Towards an Indonesian-English SMT System: A Case Study of an Under-Studied and Under-Resourced Language, Indonesian
This paper describes work on preparing an Indonesian-English Statistical Machine Translation (SMT) system. It includes the creation of an Indonesian morphological analyzer, MorphInd, and the compilation of an Indonesian-English parallel corpus, IDENTIC. We build an SMT system using the state-of-the-art phrase-based SMT system, MOSES. We show several scenarios where the morphological tool is used t...
Publication date: 2012