Multi-stream join answering for mining significant cross-stream correlations

作者:Robert Gwadera

摘要

Sliding-window multi-stream join (SWMJ) is a fundamental operation for correlating information from different streams. We provide a solution to the problem of assessing significance of the SWMJ result by focusing on the relative frequency of windows satisfying a given equijoin predicate as the most important parameter of the SWMJ result. In particular, we derive a formula for computing the expected relative frequency of windows satisfying a given equijoin predicate that can be evaluated in quadratic time in the window size given a proposed probabilistic model of the multi-stream. In experiments conducted on a daily rainfall data set we demonstrate the remarkable accuracy of our method, which confirms our theoretical analysis.

论文关键词:probabilistic data streams, stream summarization, stream sketch, window aggregate queries

论文评审过程:

论文官网地址:https://doi.org/10.1007/s11704-012-2862-8