DARE: Deceiving Audio–Visual speech Recognition model

Authors:

Highlights:

• We launch a targeted attack on an AVSR model and a detection network simultaneously.

• Our attack shows promising results on the publicly available, well-known LRW dataset.

• We successfully circumvent popular defences while maintaining imperceptibility.

Keywords: Audio–Visual Speech Recognition, Adversarial attacks, Cross-modality, Detection network

Review history: Received 13 April 2021, Revised 19 August 2021, Accepted 15 September 2021, Available online 20 September 2021, Version of Record 27 September 2021.

Official paper URL: https://doi.org/10.1016/j.knosys.2021.107503
