Finding maximal homogeneous clique sets

Authors: Pierre-Nicolas Mougel, Christophe Rigotti, Marc Plantevit, Olivier Gandrillon

Abstract

Many datasets can be encoded as graphs with sets of labels associated with the vertices. We consider this kind of graph and propose to look for patterns called maximal homogeneous clique sets, where such a pattern is a subgraph structured in several large cliques whose vertices all share enough labels. We present an algorithm based on graph enumeration to compute all patterns satisfying user-defined constraints on the number of separated cliques, on the size of these cliques, and on the number of labels shared by all the vertices. Our approach is tested on real datasets based on a social network of scientific collaborations and on a biological network of protein–protein interactions. The experiments show that the patterns are useful for exhibiting subgraphs organized in several core modules of interactions. Performance is reported on real data and also on synthetic data, showing that the approach can be applied to different kinds of large datasets.
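To make the three user-defined constraints concrete, the following is a minimal Python sketch that checks whether a candidate collection of vertex sets meets them. It is an illustration only, not the enumeration algorithm from the paper: the function names, the parameter names `min_cliques`, `min_size`, and `min_labels`, and the dictionary-based graph encoding are all assumptions, and the sketch does not test maximality or how the cliques are separated from one another.

```python
from itertools import combinations


def is_clique(adj, vertices):
    """Return True if every pair of distinct vertices is adjacent."""
    return all(v in adj[u] for u, v in combinations(vertices, 2))


def shared_labels(labels, vertices):
    """Return the labels common to every vertex in `vertices`."""
    label_sets = [labels[v] for v in vertices]
    return set.intersection(*label_sets) if label_sets else set()


def satisfies_constraints(adj, labels, cliques,
                          min_cliques, min_size, min_labels):
    """Check the three thresholds on a candidate collection of cliques.

    Hypothetical helper: `adj` maps a vertex to its set of neighbours,
    `labels` maps a vertex to its set of labels, and `cliques` is a list
    of vertex sets. Maximality and separation are NOT checked here.
    """
    all_vertices = set().union(*cliques) if cliques else set()
    return (
        len(cliques) >= min_cliques
        and all(len(c) >= min_size and is_clique(adj, c) for c in cliques)
        and len(shared_labels(labels, all_vertices)) >= min_labels
    )


# Toy graph: two triangles joined by the single edge c-d.
adj = {
    "a": {"b", "c"}, "b": {"a", "c"}, "c": {"a", "b", "d"},
    "d": {"c", "e", "f"}, "e": {"d", "f"}, "f": {"d", "e"},
}
# Every vertex carries labels x and y; some carry an extra label.
labels = {
    "a": {"x", "y", "z"}, "b": {"x", "y"}, "c": {"x", "y"},
    "d": {"x", "y", "w"}, "e": {"x", "y"}, "f": {"x", "y"},
}
candidate = [{"a", "b", "c"}, {"d", "e", "f"}]

ok = satisfies_constraints(adj, labels, candidate,
                           min_cliques=2, min_size=3, min_labels=2)
too_strict = satisfies_constraints(adj, labels, candidate,
                                   min_cliques=2, min_size=3, min_labels=3)
```

In this toy example the candidate passes when two shared labels are required and fails when three are required, since only `x` and `y` are common to all six vertices.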

Keywords: Graph mining, Interaction network, Attributed graph, Clique set, Homogeneous cliques, Scientific collaborations, Protein interactions, Gene expression

Paper URL: https://doi.org/10.1007/s10115-013-0625-y