DRaWS: A dual random-walk based sampling method to efficiently estimate distributions of degree and clique size over social networks

作者:

Highlights:

• This paper proposes propose a dual random-walk based sampling process DRaWS, the first of its kind, with high accuracy and low cost to estimate the degree and clique structures simultaneously.

• The sampling method proposed in this paper drastically reduces its sampling costs while significantly reducing sampling errors.

• The paper proposes different re-weighted estimators by theoretical analysis on the dual random-walk based process to accurately estimate the degree and clique structures in large graphs with samples obtained by DRaWS.

• Extensive experiments are carried out to present the accuracy and efficiency of the newly proposed method.

摘要

•This paper proposes propose a dual random-walk based sampling process DRaWS, the first of its kind, with high accuracy and low cost to estimate the degree and clique structures simultaneously.•The sampling method proposed in this paper drastically reduces its sampling costs while significantly reducing sampling errors.•The paper proposes different re-weighted estimators by theoretical analysis on the dual random-walk based process to accurately estimate the degree and clique structures in large graphs with samples obtained by DRaWS.•Extensive experiments are carried out to present the accuracy and efficiency of the newly proposed method.

论文关键词:Random-walk based graph sampling,Estimator,Distributions of degree and clique structures,Social networks

论文评审过程:Received 18 November 2019, Revised 5 April 2020, Accepted 6 April 2020, Available online 13 April 2020, Version of Record 24 April 2020.

论文官网地址:https://doi.org/10.1016/j.knosys.2020.105891