Analysis of collisions when hashing by division

作者:

Highlights:

摘要

In this paper simple formulas are derived for the calculation of overflow records when the division method is used for hashing keys in the following distributions: 1.(1) keys in a file are randomly distributed;2.(2) some keys in a file form clusters of various lengths and others are randomly distributed. Using these formulas it is shown that the division method indeed produces less collisions then the theoretically perfect randomization method as indicated in some previous experimental results. In addition the paper also shows that the number of collisions depends mainly on the ratio of number of keys in a file to the number of slots available to store these keys. Some numerical examples are given.

论文关键词:

论文评审过程:Received 15 September 1973, Available online 17 June 2003.

论文官网地址:https://doi.org/10.1016/0306-4379(75)90023-X