The architecture of TrueViz: a groundTRUth/metadata editing and VIsualiZing ToolKit

Authors:

Abstract

Tools for visualizing and creating groundtruth and metadata are crucial for document image analysis research. In this paper we describe TrueViz (TRUEVIZ User's Manual, August 2000; Proceedings of the SPIE Conference on Document Recognition and Retrieval, San Jose, CA, 2001, pp. 1–12), a tool for visualizing and editing groundtruth/metadata. We first describe the groundtruthing task and the requirements for any interactive groundtruthing tool. Next, we describe the system design of TrueViz and discuss how a user can create groundtruth with it. TrueViz is implemented in the Java programming language and runs on various platforms, including Windows and Unix. It reads and stores groundtruth/metadata in XML format and reads the corresponding page image from a file in TIFF format. Multilingual text editing, display, and search modules based on the Unicode representation of text are also provided. The software is being made available free of charge to researchers.
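To make the XML-based storage concrete, the following is a minimal sketch of how zone-level groundtruth might be read in Java with the standard JAXP DOM parser. The abstract does not give the TrueViz schema, so the element and attribute names used here (`Page`, `Zone`, `image`, `x`, `y`, `width`, `height`) and the class name `GroundtruthSketch` are hypothetical and chosen only for illustration; this is not the TrueViz file format or API.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

public class GroundtruthSketch {

    /** One annotated zone: a bounding box in image pixels plus its Unicode text. */
    static final class Zone {
        final int x, y, width, height;
        final String text;

        Zone(int x, int y, int width, int height, String text) {
            this.x = x;
            this.y = y;
            this.width = width;
            this.height = height;
            this.text = text;
        }
    }

    // Hypothetical groundtruth document (NOT the real TrueViz schema).
    // The second zone holds Korean text, written with Unicode escapes.
    static final String SAMPLE_XML =
        "<Page image=\"page001.tif\">"
        + "<Zone x=\"10\" y=\"20\" width=\"300\" height=\"40\">Title line</Zone>"
        + "<Zone x=\"10\" y=\"80\" width=\"300\" height=\"200\">\uD55C\uAE00</Zone>"
        + "</Page>";

    /** Parses every Zone element in the XML string into a Zone object. */
    static List<Zone> parseZones(String xml) throws Exception {
        DocumentBuilder builder =
            DocumentBuilderFactory.newInstance().newDocumentBuilder();
        Document doc = builder.parse(new InputSource(new StringReader(xml)));
        NodeList nodes = doc.getElementsByTagName("Zone");
        List<Zone> zones = new ArrayList<>();
        for (int i = 0; i < nodes.getLength(); i++) {
            Element e = (Element) nodes.item(i);
            zones.add(new Zone(
                Integer.parseInt(e.getAttribute("x")),
                Integer.parseInt(e.getAttribute("y")),
                Integer.parseInt(e.getAttribute("width")),
                Integer.parseInt(e.getAttribute("height")),
                e.getTextContent()));
        }
        return zones;
    }

    public static void main(String[] args) throws Exception {
        // List each zone's bounding box and the length of its text.
        for (Zone z : parseZones(SAMPLE_XML)) {
            System.out.println(z.x + "," + z.y + " " + z.width + "x" + z.height
                + " chars=" + z.text.length());
        }
    }
}
```

Because Java strings are sequences of UTF-16 code units, the same parsing code handles Latin and non-Latin zone text without special cases, which is one plausible reason a Unicode-based design suits multilingual groundtruth.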

Keywords: Annotation, Groundtruth, Visualization, Multilingual, Multiplatform, Java, XML, OCR

Review history: Received 5 December 2001, Accepted 20 March 2002, Available online 14 November 2002.

Official paper URL: https://doi.org/10.1016/S0031-3203(02)00101-2