Candidate document retrieval for cross-lingual plagiarism detection using two-level proximity information
作者:
Highlights:
• Proposing a candidate retrieval model for cross-lingual plagiarism detection
• The method relies on using two levels of proximity information
• Proposing a topic-based text segmentation method
• Comparing the method with other cross-lingual plagiarism detection approaches
• Showing improvements using text segmentation and positional language models
摘要
•Proposing a candidate retrieval model for cross-lingual plagiarism detection•The method relies on using two levels of proximity information•Proposing a topic-based text segmentation method•Comparing the method with other cross-lingual plagiarism detection approaches•Showing improvements using text segmentation and positional language models
论文关键词:Candidate document retrieval,Cross-language plagiarism detection,Text segmentation,Proximity-based retrieval
论文评审过程:Received 21 February 2015, Revised 11 April 2016, Accepted 18 April 2016, Available online 29 April 2016, Version of Record 28 September 2016.
论文官网地址:https://doi.org/10.1016/j.ipm.2016.04.006