Machine learning the harness track: Crowdsourcing and varying race history

作者:

Highlights:

摘要

Racing prediction schemes have been with mankind a long time. From following crowd wisdom and betting on favorites to mathematical methods like the Dr. Z System, we introduce a different class of prediction system, the S&C Racing system that derives from machine learning. We demonstrate the S&C Racing system using Support Vector Regression (SVR) to predict finishes and analyzed it on fifteen months of harness racing data from Northfield Park, Ohio. We found that within the domain of harness racing, our system outperforms crowds and Dr. Z Bettors in returns per dollar wagered on seven of the most frequently used wagers: Win $1.08 return, Place $2.30, Show $2.55, Exacta $19.24, Quiniela $18.93, Trifecta $3.56 and Trifecta Box $21.05. Furthermore, we also analyzed a range of race histories and found that a four race history maximized system accuracy and payout. The implications of this work suggest that an informational inequality exists within the harness racing market that was exploited by S&C Racing. While interesting, the implications of machine learning in this domain show promise.

论文关键词:Business intelligence,Data mining,Support Vector Regression,Harness racing,S&C Racing system,Crowdsourcing,Dr. Z System

论文评审过程:Received 29 September 2011, Revised 9 November 2012, Accepted 4 December 2012, Available online 28 December 2012.

论文官网地址:https://doi.org/10.1016/j.dss.2012.12.013