Data reduction for Boolean matrix factorization algorithms based on formal concept analysis

作者:

Highlights:

摘要

Data size reduction is an important step in many data mining techniques. We present a novel approach based on formal concept analysis to data reduction tailored for Boolean matrix factorization methods. A general aim of these methods is to find factors that exactly or approximately explain data. The presented approach is able to significantly reduce the size of data by choosing a representative set of rows, and preserve (with a little loss) factors behind the data, i.e. it only slightly affects a quality of the factors produced by Boolean matrix factorization algorithms.

论文关键词:Boolean matrix factorization,Formal concept analysis,Data reduction

论文评审过程:Received 30 November 2017, Revised 19 May 2018, Accepted 22 May 2018, Available online 27 May 2018, Version of Record 6 July 2018.

论文官网地址:https://doi.org/10.1016/j.knosys.2018.05.035