Classification on defective items using unidentified samples
作者:
Highlights:
•
摘要
A set of unlabelled items is used to establish a decision rule to classify defective items. The lifetime of an item has an exponential distribution. It is known that the Bayes decision rule, which classifies good and defective items, gives a minimum probability of misclassification. The Bayes decision rule needs to know the prior probability (defective percentage) and two mean lifetimes. In the set of unidentified samples, the defective percentage and two mean lifetimes are unknown. Hence, before we can use the Bayes decision rule, we have to estimate the three unknown parameters. In this study, a set of unlabelled samples is used to estimate the three unknown parameters. The Bayes decision rule with these estimated parameters is an empirical Bayes (EB) decision rule. A stochastic approximation procedure using the set of unidentified samples is established to estimate the three unknown parameters. When the size of unlabelled items increases, the estimates computed by the procedure converge to the real parameters and hence gradually adapt our EB decision rule to be a better classifier until it becomes the Bayes decision rule. The results of a Monte Carlo simulation study are presented to demonstrate the convergence of the correct classification rates made by the EB decision rule to the highest correct classification rates given by the Bayes decision rule.
论文关键词:Bayes decision rule,Classification,Empirical Bayes,Exponential distribution,Quality control,Stochastic approximation,Unsupervised learning
论文评审过程:Received 18 September 2002, Revised 1 December 2003, Accepted 19 May 2004, Available online 23 August 2004.
论文官网地址:https://doi.org/10.1016/j.patcog.2004.05.008