Hybrid method for the analysis of time series gene expression data

作者:

Highlights:

摘要

Time series analysis plays an increasingly important role in the study of gene expression data. Some problems, such as a large amount of noise and a small number of replicates, are computational challenges in time series expression data analysis. This paper proposes a hybrid method for analyzing time series gene expression data (HMTS). In the HMTS method, we employ a combination of K-means clustering, regression analysis and piecewise polynomial curve fitting. The K-means clustering procedure is used to divide noisy time series into different clusters, and regression analysis is used to delete outliers according to different clusters. All time series data are divided into multiple segmentations, and polynomial curve fitting is used to fit all segmentation data. The HMTS method can obtain good estimates, especially when there is noise in the data.

论文关键词:Time series analysis,Gene expression,Regression analysis,Function approximation,K-means clustering

论文评审过程:Received 2 September 2011, Revised 23 March 2012, Accepted 1 April 2012, Available online 12 April 2012.

论文官网地址:https://doi.org/10.1016/j.knosys.2012.04.003