Knowledge extraction from the learning of sequences in a long short term memory (LSTM) architecture

作者:

Highlights:

摘要

Transparency and trust in machine learning algorithms have been deemed to be fundamental and yet, from a practical point of view, they remain difficult to implement. Particularly, explainability and interpretability are certainly among the most difficult capabilities to be addressed and imply to be able to understand a decision in terms of simple cues and rules. In this article, we address this specific problem in the context of sequence learning by recurrent neuronal models (and more specifically Long Short Term Memory model). We introduce a general method to extract knowledge from the latent space based on the clustering of the internal states. From these hidden states, we explain how to build and validate an automaton that corresponds to the underlying (unknown) grammar, and allows to predict if a given sequence is valid or not. Finally, we show that it is possible for such complex recurrent model, to extract the knowledge that is implicitly encoded in the sequences and we report a high rate of recognition of the sequences extracted from the original grammar. This method is illustrated on artificial grammars (Reber grammar variants) as well as on a real use-case in the electrical domain, whose underlying grammar is unknown.

论文关键词:Sequence learning,Knowledge extraction,Long Short Term Memory,Latent space,Implicit learning,Recurrent Neural Networks

论文评审过程:Received 9 November 2020, Revised 24 October 2021, Accepted 26 October 2021, Available online 29 October 2021, Version of Record 9 November 2021.

论文官网地址:https://doi.org/10.1016/j.knosys.2021.107657