An investigation of data and text mining methods for real world deception detection

作者：

Highlights：

•

摘要

Uncovering lies (or deception) is of critical importance to many including law enforcement and security personnel. Though these people may try to use many different tactics to discover deception, previous research tells us that this cannot be accomplished successfully without aid. This manuscript reports on the promising results of a research study where data and text mining methods along with a sample of real-world data from a high-stakes situation is used to detect deception. At the end, the information fusion based classification models produced better than 74% classification accuracy on the holdout sample using a 10-fold cross validation methodology. Nonetheless, artificial neural networks and decision trees produced accuracy rates of 73.46% and 71.60% respectively. However, due to the high stakes associated with these types of decisions, the extra effort of combining the models to achieve higher accuracy is well warranted.

论文关键词：Deception detection,Data mining,Text mining,Information fusion,Classification,Credibility assessment

论文评审过程：Available online 26 January 2011.

论文官网地址：https://doi.org/10.1016/j.eswa.2011.01.032