Morphological compression of Arabic text
Abstract
Morphological compression of Arabic text is a technique that replaces some words in the original text with their roots and morphological patterns. The study develops a new method for reducing Arabic words to their roots and patterns, together with a compression algorithm that encodes reducible words in a three-byte format. The technique was implemented and tested on different texts. The results indicate a reduction ratio of 20% to 30% attributable to the morphological property of the language alone; a 40% reduction is attainable when morphological compression is combined with the elimination of spaces from the original text.
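The idea of encoding a reducible word as a root plus a pattern can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration, not the paper's actual format: the toy `ROOTS` and `PATTERNS` tables, the Latin transliteration, the brute-force lookup, and the byte layout (a two-byte root index followed by a one-byte pattern index). How coded words are distinguished from literal text in the output stream is not covered.

```python
import struct

# Hypothetical toy tables in Latin transliteration; the paper's real
# root and pattern inventories are not given in the abstract.
ROOTS = ["ktb", "drs", "elm"]
# Digits 1-3 mark the slots filled by the root's consonants.
PATTERNS = ["1a2a3a", "1aa2i3", "ma12uu3"]


def apply_pattern(root, pattern):
    """Fill the numbered slots of a pattern with the root's consonants."""
    return "".join(root[int(ch) - 1] if ch.isdigit() else ch for ch in pattern)


def encode_word(word):
    """Return a 3-byte code if the word is root + pattern, else None.

    Assumed layout: 2-byte big-endian root index, 1-byte pattern index.
    """
    for root_id, root in enumerate(ROOTS):
        for pattern_id, pattern in enumerate(PATTERNS):
            if apply_pattern(root, pattern) == word:
                return struct.pack(">HB", root_id, pattern_id)
    return None  # irreducible word: would be stored literally


def decode_word(code):
    """Rebuild the word from its 3-byte code."""
    root_id, pattern_id = struct.unpack(">HB", code)
    return apply_pattern(ROOTS[root_id], PATTERNS[pattern_id])


code = encode_word("maktuub")
print(len(code), decode_word(code))
```

In this toy layout a seven-letter transliterated word shrinks to three bytes, while a word that matches no root and pattern pair is left uncoded.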
Review history: Received 15 March 1989, Accepted 15 March 1989, Available online 19 July 2002.
Paper URL: https://doi.org/10.1016/0306-4573(90)90033-X