A comprehensive survey of mostly textual document segmentation algorithms since 2008
作者:
Highlights:
• Extensive review of the state of the art with a well defined scope.
• Analysis of the algorithms from a scientific and an industrial point of view.
• Well defined document and algorithm typologies.
• Discussion on the trends of the field and the evaluation of the algorithms.
摘要
Highlights•Extensive review of the state of the art with a well defined scope.•Analysis of the algorithms from a scientific and an industrial point of view.•Well defined document and algorithm typologies.•Discussion on the trends of the field and the evaluation of the algorithms.
论文关键词:Document,Segmentation,Survey,Evaluation,Trends,Typology
论文评审过程:Received 3 June 2016, Revised 17 October 2016, Accepted 19 October 2016, Available online 23 October 2016, Version of Record 28 October 2016.
论文官网地址:https://doi.org/10.1016/j.patcog.2016.10.023