Kohonen map-wise regression applied to interval data

作者:

Highlights:

摘要

Kohonen maps, also known as self-organizing maps, is a powerful clustering method which groups data using multiple nodes that converge to clusters. Therefore, it is not necessary to know a priori the exact number of clusters in the data. This paper proposes a clusterwise regression method that combines self-organizing maps and a parametrized linear regression approach for interval data in the framework of Symbolic Data Analysis. The linear regression approach adapts itself to use the best set of points inside intervals to build the regression model, generalizing all approaches that select points of interest, such as center, range, minimum and maximum, to provide linear regression for interval data. Here, self-organizing map nodes are responsible for detecting local linear regression structures and the parametrized linear regression builds local regression models, one for each node. Finally, outputs are given as weighted averages of the local model predictions. Since our model fits the linear models using the map nodes, it avoids weak performances that might arise when clusters are not easily separable. As another contribution, we provide a way to extract predictions from an existing clusterwise regression method. Experiments with synthetic and real data sets show the usefulness of the proposed method.

论文关键词:Self-organizing map,Interval data,Clusterwise linear regression,Symbolic data analysis

论文评审过程:Received 18 August 2020, Revised 27 February 2021, Accepted 26 April 2021, Available online 27 April 2021, Version of Record 30 April 2021.

论文官网地址:https://doi.org/10.1016/j.knosys.2021.107091