A cross-domain recommender system through information transfer for medical diagnosis
作者:
Highlights:
• A new cross-domain recommender system is developed to support medical diagnoses with insufficient medical records.
• A new dissimilarity measurement is constructed to measure the dissimilarities between diagnoses with interval numbers.
• A space alignment method is designed to handle the mismatch of the different symptom spaces in two domains.
• The proposed method alleviates data sparsity in medical diagnosis and provides personalized recommendation for physicians.
摘要
The electronic diagnostic records of patients, primarily collected by hospitals, comprise valuable data for the development of recommender systems to support physicians in predicting the risks associated with various diseases. For some diseases, the diagnostic record data are not sufficient to train a prediction model to generate recommendations; this is referred to as the data sparsity problem. Cross-domain recommender systems offer a solution to this problem by transferring knowledge from a similar domain (source domain) with sufficient data for modeling to facilitate prediction in the current domain (target domain). However, building a cross-domain recommender system for medical diagnosis presents two challenges: (1) uncertain representations, such as the symptoms characterized by interval numbers, are often used in medical records, and (2) given two different diseases, the feature spaces of the two diagnostic domains are often disparate because the diseases are only likely to share a few symptoms. This study addresses these challenges by proposing a cross-domain recommender system, named information transfer for medical diagnosis (ITMD), to provide physicians with personalized recommendations for disease risks. In ITMD, a novel dissimilarity measurement was performed for diagnosis, represented as interval numbers. The space alignment technique eliminated the feature space divergence caused by different symptoms between two diseases, and the development of collective matrix factorization enabled knowledge transfer between the source and target domains. Experiments and a case study using real-world data demonstrated that ITMD outperforms four baselines and improves the accuracy of recommendations for disease risks in patients to support physicians in determining a final medical diagnosis.
论文关键词:Recommender systems,Cross-domain,Collaborative filtering,Medical diagnosis
论文评审过程:Received 12 April 2020, Revised 25 December 2020, Accepted 27 December 2020, Available online 31 December 2020, Version of Record 21 February 2021.
论文官网地址:https://doi.org/10.1016/j.dss.2020.113489