Smart Library: Identifying Books in a Library using Richly Supervised Deep Scene Text Reading

نویسندگان

  • Xiao Yang
  • Dafang He
  • Wenyi Huang
  • Zihan Zhou
  • Alexander Ororbia
  • Dan Kifer
  • C. Lee Giles
چکیده

Physical library collections are valuable and long standing resources for knowledge and learning. However, managing books in a large bookshelf and finding books on it often leads to tedious manual work, especially for large book collections where books might be missing or misplaced. Recently, deep neural models, such as Convolutional Neural Networks (CNN) and Recurrent Neural Networks (RNN) have achieved great success for scene text detection and recognition. Motivated by these recent successes, we aim to investigate their viability in facilitating book management, a task that introduces further challenges including large amounts of cluttered scene text, distortion, and varied lighting conditions. In this paper, we present a library inventory building and retrieval system based on scene text reading methods. We specifically design our scene text recognition model using rich supervision to accelerate training and achieve state-of-the-art performance on several benchmark datasets. Our proposed system has the potential to greatly reduce the amount of human labor required in managing book inventories as well as the space needed to store book information.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Smartphones and Our Students:A Case of Undergraduate Students in an EFL Context

Immoderate smart phone usage usually makes the students addicted to it and spends less time reading lecture notes and textbooks. This study aims to determine university students' usage of smart phones and perceived rejection of paper books in an EFL context. The study collected data through a 20-item structured questionnaire consisting of the general characteristics, the number and hours of gen...

متن کامل

Assigning Library of Congress Classification Codes to Books Based Only on their Titles

Many publishers follow the Library of Congress Classification (LCC) scheme to indicate a classification code on the first pages of their books. This is useful for many libraries worldwide because it makes possible to search and retrieve books by content type, and this scheme has become a de facto standard. However, not every book has been pre-classified by the publisher; in particular, in many ...

متن کامل

Are E-Books Making Us Stupid?: Why Electronic Collections Mean Trouble for Libraries and Their Patrons

In 2008, Nicholas Carr published a provocative article titled “Is Google making us stupid?” in which he ponders the effect of the internet and electronic sources generally on the brain. This paper discusses one source specifically, e-books, and explores whether libraries are acting wisely by moving from print to electronic book collections. The topic is considered from the vantage point of the ...

متن کامل

Semi-supervised Learning Approach for Automatic Emotional Expression Extraction from eBook Text

We have developed an approach for the automatic extraction of emotion expression from text data of ebooks, such as novels and short stories. The embedding of the extraction results as metadata allows a text-to-speech system to enable the expressive reading of these texts along with the selection of a dictionary of voices associated with emotions. As a text prefilter for the automatic extraction...

متن کامل

Blending Evidence and Users for TEL: An Overture

TERENCE is an adaptive learning system for reasoning about stories with children having deep text comprehension problems. It develops reading interventions in the form of smart games for stimulating the text comprehension of such children. In order to ensure the pedagogical effectiveness and the usability of the smart games, and of the system in general, TERENCE was designed combining the user ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/1611.07385  شماره 

صفحات  -

تاریخ انتشار 2016