Data- and Model-Driven Multiresolution Processing

作者:

Highlights:

摘要

We introduce a technique formultiresolution processingwhich elegantly fits in our framework for visual recognition, described in earlier papers. The input is processedsimultaneouslyat a coarse resolution throughout the image and at finer resolution within a small window (fovea). We introduce an approach for controlling the movement of the high-resolution window which allows for both data- and model-driven selection of fixation points. Three fixation modes have been implemented, one based on large unexplained areas in the data, one on conflicts in the object-model database, and one on a 2D “space filling” algorithm. We argue that this kind of multiresolution processing is not only useful in limiting the computational time, as has been widely recognized, but also can be a deciding factor in making the entire vision problem a tractable and stable one. To demonstrate the approach, we introduce a class of3D surface texturesas a feature for recognition in our system. Surface texture recognition typically requires higher-resolution processing than that required for the extraction of the underlying surface. As examples, surface texture is used to discriminate between a ping-pong ball and a golf ball, and “curve texture” is used to recognize different types of gears. Other experimental results also are included to show the advantages and the implications of our approach.

论文关键词:

论文评审过程:Received 20 May 1993, Accepted 5 December 1994, Available online 22 April 2002.

论文官网地址:https://doi.org/10.1006/cviu.1996.0003