Chess as a Testbed for Language Model State Tracking

نویسندگان

چکیده

Transformer language models have made tremendous strides in natural understanding tasks. However, the complexity of makes it challenging to ascertain how accurately these are tracking world state underlying text. Motivated by this issue, we consider task modeling for game chess. Unlike language, chess notations describe a simple, constrained, and deterministic domain. Moreover, observe that appropriate choice notation allows directly probing state, without requiring any additional probing-related machinery. We find that: (a) With enough training data, transformer can learn track pieces predict legal moves with high accuracy when trained solely on move sequences. (b) For small sets providing access board information during yield significant improvements. (c) The success is dependent entire history i.e. “full attention”. Approximating full attention results performance drop. propose testbed as benchmark future work development analysis models.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

critical period effects in foreign language learning:the influence of maturational state on the acquisition of reading,writing, and grammar in english as a foreign language

since the 1960s the age effects on learning both first and second language have been explored by many linguists and applied linguists (e.g lennerberg, 1967; schachter, 1996; long, 1990) and the existence of critical period for language acquisition was found to be a common ground of all these studies. in spite of some common findings, some issues about the impacts of age on acquiring a second or...

15 صفحه اول

A tri state mechanism for oxygen release in fish hemoglobin: Using Barbus sharpeyi as a model

Hemoglobin is a porphyrin containing protein with an a2b2 tetrameric structure and like other porphyrin compounds shows spectral behavior of species specific characteristics. Researchers tend to relate bands in the hemoglobin spectra to certain structural and/or functional features. Given the fact that hemoglobin is the main oxygen carrier in animals functioning through the Oxy«Deoxy equilibriu...

متن کامل

A TESTBED FOR THE MSX ATTITUDE AND TRACKING PROCESSORS A Testbed for the MSX Attitude and Tracking Processors

he Midcourse Space Experiment (MSX) spacecraft employs infrared, ultraviolet, and visible light sensors to collect images and spectrographic signatures on a variety of targets, especially missiles, other satellites, and auroral phenomena. These instruments are fixed in the satellite body and face in a common direction. Their fields of view vary, with some of them being quite small (1 3 3°). Thu...

متن کامل

A MODEL FOR EVOLUTIONARY DYNAMICS OF WORDS IN A LANGUAGE

Human language, over its evolutionary history, has emerged as one of the fundamental defining characteristic of the modern man. However, this milestone evolutionary process through natural selection has not left any ’linguistic fossils’ that may enable us to trace back the actual course of development of language and its establishment in human societies. Lacking analytical tools to fathom the cr...

متن کامل

Chinese Chess State Recognition

In this paper, we present an algorithm that can correctly recognize the state of a Chinese chess game by processing a photo of the chessboard. Some major steps of the algorithm include chessboard rectification using Hough transformation and homographic transformation, chess piece detection using circular Hough transformation and chess piece recognition using SIFT (Scale-Invariant Feature Transf...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence

سال: 2022

ISSN: ['2159-5399', '2374-3468']

DOI: https://doi.org/10.1609/aaai.v36i10.21390