Extraction and optimization of classification rules for temporal sequences: Application to hospital data

作者:

Highlights:

摘要

This study focuses on the problem of supervised classification on heterogeneous temporal data featuring a mixture of attribute types (numeric, binary, symbolic, temporal). We present a model for classification rules designed to use both non-temporal attributes and sequences of temporal events as predicates. We also propose an efficient local search-based metaheuristic algorithm to mine such rules in large scale, real-life data sets extracted from a hospital’s information system. The proposed algorithm, MOSC (Multi-Objective Sequence Classifier), is compared to standard classifiers and previous works on these real data sets and exhibits noticeably better classification performance. While designed with medical applications in mind, the proposed approach is generic and can be used for problems from other application domains.

论文关键词:Data mining,Classification,Temporal data,Optimization

论文评审过程:Received 29 September 2016, Revised 30 January 2017, Accepted 1 February 2017, Available online 1 February 2017, Version of Record 27 February 2017.

论文官网地址:https://doi.org/10.1016/j.knosys.2017.02.001