Multi-stream join answering for mining significant cross-stream correlations
作者:Robert Gwadera
摘要
Sliding-window multi-stream join (SWMJ) is a fundamental operation for correlating information from different streams. We provide a solution to the problem of assessing significance of the SWMJ result by focusing on the relative frequency of windows satisfying a given equijoin predicate as the most important parameter of the SWMJ result. In particular, we derive a formula for computing the expected relative frequency of windows satisfying a given equijoin predicate that can be evaluated in quadratic time in the window size given a proposed probabilistic model of the multi-stream. In experiments conducted on a daily rainfall data set we demonstrate the remarkable accuracy of our method, which confirms our theoretical analysis.
论文关键词:probabilistic data streams, stream summarization, stream sketch, window aggregate queries
论文评审过程:
论文官网地址:https://doi.org/10.1007/s11704-012-2862-8