A quadrilateral scene text detector with two-stage network architecture
作者:
Highlights:
• We propose a novel quadrilateral regression algorithm for generating quadrilateral proposals and text detections.
• We introduce a dual-branch structure of a quadrilateral detection head and an additional rotated rectangle detection head to train the detection head at the second stage.
• We design a novel weighted RoI pooling module with learned weight masks to pool the feature.
• We propose an accelerated NMS algorithm for quadrilateral text proposals and detections.
• We integrate the above four novel modules into the two-stage architecture of Faster R-CNN, and achieve state-of-the-art results on multiple public datasets.
摘要
•We propose a novel quadrilateral regression algorithm for generating quadrilateral proposals and text detections.•We introduce a dual-branch structure of a quadrilateral detection head and an additional rotated rectangle detection head to train the detection head at the second stage.•We design a novel weighted RoI pooling module with learned weight masks to pool the feature.•We propose an accelerated NMS algorithm for quadrilateral text proposals and detections.•We integrate the above four novel modules into the two-stage architecture of Faster R-CNN, and achieve state-of-the-art results on multiple public datasets.
论文关键词:Scene text detection,Deep learning,Quadrilateral regression
论文评审过程:Received 31 March 2019, Revised 31 December 2019, Accepted 23 January 2020, Available online 24 January 2020, Version of Record 7 February 2020.
论文官网地址:https://doi.org/10.1016/j.patcog.2020.107230