DARE: Deceiving Audio–Visual speech Recognition model

作者:

Highlights:

• We initiate a targeted attack on AVSR model and detection network simultaneously.

• Our attack shows promising results on the publicly available well known LRW dataset.

• We successfully circumvent popular defences while maintaining imperceptibility.

摘要

•We initiate a targeted attack on AVSR model and detection network simultaneously.•Our attack shows promising results on the publicly available well known LRW dataset.•We successfully circumvent popular defences while maintaining imperceptibility.

论文关键词:Audio–Visual Speech Recognition,Adversarial attacks,Cross-modality,Detection network

论文评审过程:Received 13 April 2021, Revised 19 August 2021, Accepted 15 September 2021, Available online 20 September 2021, Version of Record 27 September 2021.

论文官网地址:https://doi.org/10.1016/j.knosys.2021.107503