Data Set A is a Pattern Matching Problem
Authors: Jens Kohlmorgen, Klaus-Robert Müller
Abstract
Several data sets have been proposed for benchmarking time series prediction methods. A popular one is Data Set A from the Santa Fe Competition, which has been analyzed in many papers. In this note, it is shown that predicting the continuation of Data Set A is nothing more than a pattern matching problem. Looking at studies of this data set, it is remarkable that most of the very good forecasts of Data Set A used upsampled training data. We explain why upsampling is crucial for this data set. Finally, it is demonstrated that simple pattern matching performs as well as sophisticated prediction methods on Data Set A.
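The abstract does not spell out the matching procedure, so the following is only a minimal sketch of what "simple pattern matching" on an upsampled series could look like. The function names, the use of linear interpolation for upsampling, and the squared-error distance are all assumptions made for illustration, not the authors' method.

```python
import math


def upsample_linear(series, factor):
    """Insert `factor - 1` linearly interpolated points between neighbours.

    Assumption: linear interpolation stands in for whatever upsampling
    scheme a given study actually used.
    """
    out = []
    for a, b in zip(series, series[1:]):
        for k in range(factor):
            out.append(a + (b - a) * k / factor)
    out.append(series[-1])
    return out


def predict_by_pattern_match(series, window, horizon):
    """Forecast by copying what followed the best-matching past window.

    The last `window` values form the query. Every earlier window that
    still has `horizon` known values after it is a candidate, and the
    candidate with the smallest squared error wins (an assumed metric).
    """
    query = series[-window:]
    best_start, best_dist = None, math.inf
    for start in range(len(series) - window - horizon + 1):
        candidate = series[start:start + window]
        dist = sum((c - q) ** 2 for c, q in zip(candidate, query))
        if dist < best_dist:
            best_start, best_dist = start, dist
    if best_start is None:
        raise ValueError("series too short for this window and horizon")
    follow = best_start + window
    return series[follow:follow + horizon]
```

In such a sketch, upsampling before matching lets a candidate window line up with the query at a finer time offset than the original sampling grid allows. The forecast would then be downsampled back to the original rate by keeping every `factor`-th value.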
Keywords: benchmarking, pattern matching, Santa Fe Competition, time series prediction
Official paper URL: https://doi.org/10.1023/A:1009684621686