Institute of Electrical and Electronics Engineers (IEEE)

, том 34 , страницы 1722-1740

Transferable Multimodal Attack on Vision-Language Pre-training Models

Тип публикации: Proceedings Article

Дата публикации: 2024-05-19

Institute of Electrical and Electronics Engineers (IEEE)

DOI: 10.1109/sp54263.2024.00102

Скопировать DOI

Краткое описание

Vision-Language Pre-training (VLP) models have achieved remarkable success in practice, while easily being misled by adversarial attack. Though harmful, adversarial attacks are valuable in revealing the blind-spots of VLP models and promoting their robustness. However, existing adversarial attacking studies pay insufficient attention to the key roles of different modality-correlated features, leading to unsatisfactory transferable attacking performance. To tackle this issue, we propose the Transferable MultiModal (TMM) attack framework, which tailors both the modality consistency and modality discrepancy features. To promote transferability, we propose the attention-directed feature perturbation to disturb the modality-consistency features in critical attention regions. In light of the commonly employed cross-attention can represent the consistent features among diverse models, it is more possible to mislead the similar model perception for activating stronger transferability. For improving attacking ability, we proposed the orthogonal-guided feature heterogenization to guide the adversarial perturbation to contain more modality-discrepancy features in the encoded embeddings. Since VLP models rely more on aligned features among different modalities during decision-making, increasing the modality-discrepant could confuse the learned representation for better attacking ability. Extensive experiments under diverse settings demonstrate that the proposed TMM outperforms the comparisons by large margins, i.e., 20.47% improvements in transferable attacking ability on average. Moreover, we highlight that our TMM also shows outstanding attacking performance on large models, such as MiniGPT-4, Otter, etc.

Для доступа к списку цитирований публикации необходимо авторизоваться.

Войти с ORCID

Топ-30

Журналы

	1 2 3 4
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)	IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 4, 17.39% IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 4 публикации, 17.39%
IEEE Transactions on Information Forensics and Security	IEEE Transactions on Information Forensics and Security, 4, 17.39% IEEE Transactions on Information Forensics and Security 4 публикации, 17.39%
IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)	IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), 1, 4.35% IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) 1 публикация, 4.35%
Visual Intelligence	Visual Intelligence, 1, 4.35% Visual Intelligence 1 публикация, 4.35%
Information Sciences	Information Sciences, 1, 4.35% Information Sciences 1 публикация, 4.35%
Sensors	Sensors, 1, 4.35% Sensors 1 публикация, 4.35%
IEEE Transactions on Pattern Analysis and Machine Intelligence	IEEE Transactions on Pattern Analysis and Machine Intelligence, 1, 4.35% IEEE Transactions on Pattern Analysis and Machine Intelligence 1 публикация, 4.35%
International Journal of Machine Learning and Cybernetics	International Journal of Machine Learning and Cybernetics, 1, 4.35% International Journal of Machine Learning and Cybernetics 1 публикация, 4.35%
Complex & Intelligent Systems	Complex & Intelligent Systems, 1, 4.35% Complex & Intelligent Systems 1 публикация, 4.35%
ACM Computing Surveys	ACM Computing Surveys, 1, 4.35% ACM Computing Surveys 1 публикация, 4.35%
Lecture Notes in Networks and Systems	Lecture Notes in Networks and Systems, 1, 4.35% Lecture Notes in Networks and Systems 1 публикация, 4.35%
IEEE Transactions on Consumer Electronics	IEEE Transactions on Consumer Electronics, 1, 4.35% IEEE Transactions on Consumer Electronics 1 публикация, 4.35%
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing	IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 1, 4.35% IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing 1 публикация, 4.35%
IEEE Transactions on Neural Networks and Learning Systems	IEEE Transactions on Neural Networks and Learning Systems, 1, 4.35% IEEE Transactions on Neural Networks and Learning Systems 1 публикация, 4.35%
IEEE Transactions on Dependable and Secure Computing	IEEE Transactions on Dependable and Secure Computing, 1, 4.35% IEEE Transactions on Dependable and Secure Computing 1 публикация, 4.35%
IEEE Communications Surveys and Tutorials	IEEE Communications Surveys and Tutorials, 1, 4.35% IEEE Communications Surveys and Tutorials 1 публикация, 4.35%
Neurocomputing	Neurocomputing, 1, 4.35% Neurocomputing 1 публикация, 4.35%
	1 2 3 4

Издатели

	2 4 6 8 10 12 14 16
Institute of Electrical and Electronics Engineers (IEEE)	Institute of Electrical and Electronics Engineers (IEEE), 15, 65.22% Institute of Electrical and Electronics Engineers (IEEE) 15 публикаций, 65.22%
Springer Nature	Springer Nature, 4, 17.39% Springer Nature 4 публикации, 17.39%
Elsevier	Elsevier, 2, 8.7% Elsevier 2 публикации, 8.7%
MDPI	MDPI, 1, 4.35% MDPI 1 публикация, 4.35%
Association for Computing Machinery (ACM)	Association for Computing Machinery (ACM), 1, 4.35% Association for Computing Machinery (ACM) 1 публикация, 4.35%
	2 4 6 8 10 12 14 16

Мы не учитываем публикации, у которых нет DOI.
Статистика публикаций обновляется еженедельно.

Вы ученый?

Создайте профиль, чтобы получать персональные рекомендации коллег, конференций и новых статей.

Войти с ORCID

Метрики

Ошибка в публикации?

Издатель

Institute of Electrical and Electronics Engineers (IEEE)