Signature-Based Methods for Data Streams

作者:Corinna Cortes, Daryl Pregibon

摘要

We have been developing signature-based methods in the telecommunications industry for the past 5 years. In this paper, we describe our work as it evolved due to improvements in technology and our aggressive attitude toward scale. We discuss the types of features that our signatures contain, nuances of how these are updated through time, our treatment of outliers, and the trade-off between time-driven and event-driven processing. We provide a number of examples, all drawn from the application of signatures to toll fraud detection.

论文关键词:transactional data streams, signatures, large scale data mining

论文评审过程:

论文官网地址:https://doi.org/10.1023/A:1011464915332