Recognizing Handwritten Source Code

نویسندگان

  • Qiyu Zhi
  • Ronald A. Metoyer
چکیده

Supporting programming on touchscreen devices requires effective text input and editing methods. Unfortunately, the virtual keyboard can be inefficient and uses valuable screen space on already small devices. Recent advances in stylus input make handwriting a potentially viable text input solution for programming on touchscreen devices. The primary barrier, however, is that handwriting recognition systems are built to take advantage of the rules of natural language, not those of a programming language. In this paper, we explore this particular problem of handwriting recognition for source code. We collect and make publicly available a dataset of handwritten Python code samples from 15 participants and we characterize the typical recognition errors for this handwritten Python source code when using a state-of-the-art handwriting recognition tool. We present an approach to improve the recognition accuracy by augmenting a handwriting recognizer with the programming language grammar rules. Our experiment on the collected dataset shows an 8.6% word error rate and a 3.6% character error rate which outperforms standard handwriting recognition systems and compares favorably to typing source code on virtual keyboards.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A blackboard-based approach to handwritten ZIP code recognition

A methodology for recognizing ZIP codes @ostal codes) in handwritten addresses is presented. The method uses many diverse pattern recognition and image processing algorithms. Given a high-resolution image of a hand-written address block, the solution invokes routines capable of hypothesizing the location of the ZIP Code, segmenting and recognizing ZIP Code digits, locating and recognizing City ...

متن کامل

Off-line Arabic Handwritten Recognition Using a Novel Hybrid HMM-DNN Model

In order to facilitate the entry of data into the computer and its digitalization, automatic recognition of printed texts and manuscripts is one of the considerable aid to many applications. Research on automatic document recognition started decades ago with the recognition of isolated digits and letters, and today, due to advancements in machine learning methods, efforts are being made to iden...

متن کامل

Using Audio Based Disambiguation for Improving Handwritten Mathematical Content Recognition in Classroom Videos

We consider the problem of recognizing handwritten mathematical content in classroom videos that capture the content written on the whiteboard and the content spoken by the instructor. While the problem of recognizing handwritten textual content from videos has been studied before, recognition of handwritten mathematical content and the use of audio content from classroom videos to assist in re...

متن کامل

Understanding Handwritten Text in a Structured Environment: Determining ZIP Codes from Addresses

Understanding a block of handwritten text means mapping it into a semantic representation, We describe an approach to reading a I>lock of handwritten text when there arc certain loose constraints placed on the spatial layout and syntax 01 the tnt. Early recognition of primitives guides the location of syntactic components. A system to read handwritten postal addresses is described as an instanc...

متن کامل

Handwritten digit recognition using biologically inspired features

Image recognition problems are usually difficult to solve using raw pixel data. To improve the recognition it is often needed some form of feature extraction to represent the data in a feature space. We use the output of a biologically inspired model for visual recognition as a feature space. The output of the model is a binary code which is used to train a linear classifier for recognizing han...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017