On denoising and compression of DNA microarray images

作者:

Highlights:

摘要

We address the problems of noise and huge data sizes in microarray images. First, we propose a mixture model for describing the statistical and structural properties of microarray images. Then, based on the microarray image model, we present methods for denoising and for compressing microarray images. The denoising method is based on a variant of the translation-invariant wavelet transform. The compression method introduces the notion of approximate contexts (rather than traditional exact contexts) in modeling the symbol probabilities in a microarray image. This inexact context modeling approach is important in dealing with the noisy nature of microarray images. Using the proposed denoising and compression methods, we describe a near-lossless compression scheme suitable for microarray images. Results on both denoising and compression are included, which show the performance of the proposed methods. Further experiments using the results of the proposed near-lossless compression scheme in gene clustering using cell-cycle microarray data for S. cerevisiae showed a general improvement in the clustering performance, when compared with using the original data. This provides an indirect validation of the effectiveness of the proposed denoising method.

论文关键词:DNA microarrays,Microarray image compression,Microarray image denoising,Microarray gene clustering,Image context modeling

论文评审过程:Received 6 July 2005, Revised 31 January 2006, Accepted 15 February 2006, Available online 17 April 2006.

论文官网地址:https://doi.org/10.1016/j.patcog.2006.02.019