User modeling: Through statistical analysis and subspace learning
作者:
Highlights:
•
摘要
One of the challenges which must be faced in the field of the information processing is the need to cope with huge amounts of data. There exist many different environments in which large quantities of information are produced. For example, in a command-line interface, a computer user types thousands of commands which can hide information about the behavior of her/his. However, processing this kind of streaming data on-line is a hard problem.This paper addresses the problem of the classification of streaming data from a dimensionality reduction perspective. We propose to learn a lower dimensionality input model which best represents the data and improves the prediction performance versus standard techniques. The proposed method uses maximum dependence criteria as distance measurement and finds the transformation which best represents the command-line user. We also make a comparison between the dimensionality reduction approach and using the full dataset. The results obtained give some deeper understanding in advantages and drawbacks of using both perspectives in this user classifying environment.
论文关键词:User modeling,Behavior recognition,Dimensionality reduction,High dimensional data,Pattern recognition
论文评审过程:Available online 12 November 2011.
论文官网地址:https://doi.org/10.1016/j.eswa.2011.11.015