Classification of legal texts by computer

Authors:

Highlights:

Abstract

A suite of computer programs has been developed for representing the full text of lengthy documents in vector form and classifying them by a clustering method. The programs have been applied to the full text of the Conventions and Agreements of the Council of Europe, which comprise some 280,000 words in the English version and a similar number in the French. Results of the clustering experiments are presented as dendrograms (tree diagrams), using first the treaty and then the article as the clustering unit. The conclusion is that vector techniques based on the full text provide an effective method of classifying legal documents.
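
The abstract does not say which term weighting, similarity measure, or clustering algorithm the programs use. As a rough illustration of the general idea only, the sketch below assumes raw term-frequency vectors, cosine similarity, and average-linkage agglomerative clustering; the function names and the three toy texts are hypothetical and are not taken from the paper or from the Council of Europe corpus.

```python
import math
import re
from collections import Counter


def term_vector(text):
    """Map a text to a sparse term-frequency vector (word -> count)."""
    return Counter(re.findall(r"[a-z]+", text.lower()))


def cosine(u, v):
    """Cosine similarity between two sparse term vectors."""
    dot = sum(count * v[term] for term, count in u.items() if term in v)
    norm_u = math.sqrt(sum(c * c for c in u.values()))
    norm_v = math.sqrt(sum(c * c for c in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def cluster(docs):
    """Agglomerative clustering with average linkage (an assumed choice).

    Returns the merges in order as (members_a, members_b, similarity).
    Read from first to last, the list describes a dendrogram from the
    leaves up to the root.
    """
    vectors = {name: term_vector(text) for name, text in docs.items()}
    clusters = [(name,) for name in sorted(docs)]
    merges = []
    while len(clusters) > 1:
        best = None
        # Find the pair of clusters with the highest mean pairwise similarity.
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                sims = [
                    cosine(vectors[a], vectors[b])
                    for a in clusters[i]
                    for b in clusters[j]
                ]
                avg = sum(sims) / len(sims)
                if best is None or avg > best[0]:
                    best = (avg, i, j)
        avg, i, j = best
        merges.append((clusters[i], clusters[j], avg))
        merged = clusters[i] + clusters[j]
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)]
        clusters.append(merged)
    return merges


# Invented toy texts standing in for treaties (or articles).
DOCS = {
    "extradition": "extradition of offenders between contracting parties",
    "mutual_aid": "mutual assistance between contracting parties in criminal matters",
    "patents": "classification of patents for invention",
}

MERGES = cluster(DOCS)
for left, right, similarity in MERGES:
    print(left, "+", right, round(similarity, 3))
```

The clustering unit is simply whatever the keys of `DOCS` denote, so the same routine could be run once with whole treaties and once with individual articles. A corpus of the size described would call for a more efficient algorithm than this exhaustive pairwise search.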

Keywords:

Publication history: Received 21 October 1975; available online 13 July 2002.

Official paper URL: https://doi.org/10.1016/0306-4573(76)90043-1