A comprehensive survey of mostly textual document segmentation algorithms since 2008

作者:

Highlights:

• Extensive review of the state of the art with a well defined scope.

• Analysis of the algorithms from a scientific and an industrial point of view.

• Well defined document and algorithm typologies.

• Discussion on the trends of the field and the evaluation of the algorithms.

摘要

Highlights•Extensive review of the state of the art with a well defined scope.•Analysis of the algorithms from a scientific and an industrial point of view.•Well defined document and algorithm typologies.•Discussion on the trends of the field and the evaluation of the algorithms.

论文关键词:Document,Segmentation,Survey,Evaluation,Trends,Typology

论文评审过程:Received 3 June 2016, Revised 17 October 2016, Accepted 19 October 2016, Available online 23 October 2016, Version of Record 28 October 2016.

论文官网地址:https://doi.org/10.1016/j.patcog.2016.10.023