Simple decision forests for multi-relational classification
作者:
Highlights:
•
摘要
An important task in multi-relational data mining is link-based classification which takes advantage of attributes of links and linked entities, to predict the class label. The relational Naive Bayes classifier exploits independence assumptions to achieve scalability. We introduce a weaker independence assumption to the effect that information from different data tables is independent given the class label. The independence assumption entails a closed-form formula for combining probabilistic predictions based on decision trees learned on different database tables. Logistic regression learns different weights for information from different tables and prunes irrelevant tables. In experiments, learning was very fast with competitive accuracy.
论文关键词:Link-based classification,Multi-relational Naive Bayes classifiers,Multi-relational decision trees,Logistic regression
论文评审过程:Received 30 November 2011, Revised 20 September 2012, Accepted 25 November 2012, Available online 3 December 2012.
论文官网地址:https://doi.org/10.1016/j.dss.2012.11.017