ReLaText: Exploiting visual relationships for arbitrary-shaped scene text detection with graph convolutional networks
作者:
Highlights:
• We propose a new arbitrary-shaped text detection approach by formulating text detection as a visual relationship detection problem and demonstrate the effectiveness of this new formulation by starting from using a “link” relationship to address the challenging text-line grouping problem.
• We adopt a GCN-based visual relationship detection framework to effectively leverage context information to improve link prediction accuracy so that even text instances with large inter-character or very small inter-line spacing can be robustly detected by our text detector.
• Significantly improved text detection accuracy demonstrates the effectiveness of visual relationship prediction based text-line grouping and the effectiveness of GCN-based link relationship prediction.
• Our new text detector has achieved state-of-the-art performance on five public text detection benchmarks, namely RCTW-17, MSRA-TD500, Total-Text, CTW1500 and DAST1500.
摘要
•We propose a new arbitrary-shaped text detection approach by formulating text detection as a visual relationship detection problem and demonstrate the effectiveness of this new formulation by starting from using a “link” relationship to address the challenging text-line grouping problem.•We adopt a GCN-based visual relationship detection framework to effectively leverage context information to improve link prediction accuracy so that even text instances with large inter-character or very small inter-line spacing can be robustly detected by our text detector.•Significantly improved text detection accuracy demonstrates the effectiveness of visual relationship prediction based text-line grouping and the effectiveness of GCN-based link relationship prediction.•Our new text detector has achieved state-of-the-art performance on five public text detection benchmarks, namely RCTW-17, MSRA-TD500, Total-Text, CTW1500 and DAST1500.
论文关键词:Arbitrary-Shaped text detection,Graph convolutional network,Link prediction,Visual relationship detection
论文评审过程:Received 15 March 2020, Revised 26 June 2020, Accepted 24 September 2020, Available online 25 September 2020, Version of Record 5 October 2020.
论文官网地址:https://doi.org/10.1016/j.patcog.2020.107684