Kent Academic Repository

A Synthesised Word Approach to Word Retrieval in Handwritten Documents

Liang, Yiqing, Fairhurst, Michael, Guest, Richard (2012) A Synthesised Word Approach to Word Retrieval in Handwritten Documents. Pattern Recognition, 45 (12). pp. 4225-4236. ISSN 0031-3203. (doi:10.1016/j.patcog.2012.05.024) (The full text of this publication is not currently available from this repository. You may be able to access a copy if URLs are provided) (KAR id:31422)

Official URL:
http://dx.doi.org/10.1016/j.patcog.2012.05.024

Abstract

Recent technological advances have enhanced the computer-based indexing and searching of digitised printed books. The performance now achievable in this domain, however, does not at present extend to handwritten texts, which inherently contain more significant letter-based variation within their content. Furthermore, most studies that address the handwritten text retrieval problem require a large training dataset, which very often influences the context and search lexicon. In this paper, a novel method is described to overcome the training data problem using a character-based modelling approach (termed grapheme spectrum) and a word modelling technique (termed synthesised word), enabling the retrieval of keywords that have not explicitly been seen in the training set. When tested on an illustrative historical manuscript, the proposed word retrieval technique shows a clear performance advantage over existing methods.

Item Type: Article
DOI/Identification number: 10.1016/j.patcog.2012.05.024
Uncontrolled keywords: Handwriting analysis; Digital archives; Handwritten word retrieval; Word spotting; Information retrieval; Handwriting recognition; Historical manuscript analysis
Subjects: T Technology > TA Engineering (General). Civil engineering (General) > TA1637 Image processing
T Technology > TK Electrical engineering. Electronics. Nuclear engineering > TK7800 Electronics > TK7880 Applications of electronics > TK7882.P3 Pattern recognition systems
Divisions: Divisions > Division of Computing, Engineering and Mathematical Sciences > School of Engineering and Digital Arts
Depositing User: J. Harries
Date Deposited: 09 Oct 2012 10:48 UTC
Last Modified: 16 Nov 2021 10:09 UTC
Resource URI: https://kar.kent.ac.uk/id/eprint/31422 (The current URI for this page, for reference purposes)

University of Kent Author Information

Fairhurst, Michael.


Guest, Richard.

Creator's ORCID: https://orcid.org/0000-0001-7535-7336
