Min-Wise Independent Permutations

作者:

Highlights:

摘要

We define and study the notion of min-wise independent families of permutations. We say that F⊆Sn (the symmetric group) is min-wise independent if for any set X⊆[n] and any x∈X, when π is chosen at random in F we havePr(min{π(X)}=π(x))=1|X| . In other words we require that all the elements of any fixed set X have an equal chance to become the minimum element of the image of X under π. Our research was motivated by the fact that such a family (under some relaxations) is essential to the algorithm used in practice by the AltaVista web index software to detect and filter near-duplicate documents. However, in the course of our investigation we have discovered interesting and challenging theoretical questions related to this concept—we present the solutions to some of them and we list the rest as open problems.

论文关键词:

论文评审过程:Received 22 June 1998, Available online 25 May 2002.

论文官网地址:https://doi.org/10.1006/jcss.1999.1690