Discovering Classification from Data of Multiple Sources

Authors: Charles X. Ling, Qiang Yang

Abstract

In many large e-commerce organizations, multiple data sources are often used to describe the same customers, so it is important to consolidate data from these sources for intelligent business decision making. In this paper, we propose a novel method that predicts the classification of data from multiple sources without class labels in any source. We test our method on artificial and real-world datasets and show that it can classify the data accurately. From the machine learning perspective, our method removes the fundamental assumption in supervised learning that class labels are provided, and bridges the gap between supervised and unsupervised learning.

Keywords: new solutions for multiple data source mining, learning from multiple sources of data, learning classifications from unlabeled data of multiple sources

Paper URL: https://doi.org/10.1007/s10618-005-0013-7