On the study of nearest neighbor algorithms for prevalence estimation in binary problems

作者:

Highlights:

摘要

This paper presents a new approach for solving binary quantification problems based on nearest neighbor (NN) algorithms. Our main objective is to study the behavior of these methods in the context of prevalence estimation. We seek for NN-based quantifiers able to provide competitive performance while balancing simplicity and effectiveness. We propose two simple weighting strategies, PWK and PWKα, which stand out among state-of-the-art quantifiers. These proposed methods are the only ones that offer statistical differences with respect to less robust algorithms, like CC or AC. The second contribution of the paper is to introduce a new experiment methodology for quantification.

论文关键词:Quantification,Prevalence estimation,Nearest neighbor,Methodology

论文评审过程:Received 17 January 2012, Revised 27 July 2012, Accepted 31 July 2012, Available online 9 August 2012.

论文官网地址:https://doi.org/10.1016/j.patcog.2012.07.022