BERT contextual embeddings for taxonomic classification of bacterial DNA sequences
作者:
Highlights:
• Interactions and positions of biological components are significant.
• BERT Contextual Embeddings capture salient sequence interactions.
• Convolutional Neural Networks efficiently classify biological sequences.
• Data augmentation methods boost classification performance
摘要
•Interactions and positions of biological components are significant.•BERT Contextual Embeddings capture salient sequence interactions.•Convolutional Neural Networks efficiently classify biological sequences.•Data augmentation methods boost classification performance
论文关键词:DNA,Taxonomic classification,BERT,Contextual embedding,Deep Learning,Convolutional Neural Network
论文评审过程:Received 13 June 2021, Revised 7 May 2022, Accepted 22 June 2022, Available online 30 June 2022, Version of Record 16 July 2022.
论文官网地址:https://doi.org/10.1016/j.eswa.2022.117972