Removing confounding factors via constraint-based clustering: An application to finding homogeneous groups of multiple sclerosis patients

作者:

Highlights:

摘要

ObjectivesConfounding factors in unsupervised data can lead to undesirable clustering results. For example in medical datasets, age is often a confounding factor in tests designed to judge the severity of a patient's disease through measures of mobility, eyesight and hearing. In such cases, removing age from each instance will not remove its effect from the data as other features will be correlated with age. Motivated by the need to find homogeneous groups of multiple sclerosis (MS) patients, we apply our approach to remove physician subjectivity from patient data.

论文关键词:Constraint-based clustering,Confounding factor,Mining medical data,Physician subjectivity,Multiple sclerosis

论文评审过程:Available online 11 July 2015, Version of Record 19 October 2015.

论文官网地址:https://doi.org/10.1016/j.artmed.2015.06.004