Тип публикации: Proceedings Article
Дата публикации: 2024-05-19
Краткое описание
Vision-Language Pre-training (VLP) models have achieved remarkable success in practice, while easily being misled by adversarial attack. Though harmful, adversarial attacks are valuable in revealing the blind-spots of VLP models and promoting their robustness. However, existing adversarial attacking studies pay insufficient attention to the key roles of different modality-correlated features, leading to unsatisfactory transferable attacking performance. To tackle this issue, we propose the Transferable MultiModal (TMM) attack framework, which tailors both the modality consistency and modality discrepancy features. To promote transferability, we propose the attention-directed feature perturbation to disturb the modality-consistency features in critical attention regions. In light of the commonly employed cross-attention can represent the consistent features among diverse models, it is more possible to mislead the similar model perception for activating stronger transferability. For improving attacking ability, we proposed the orthogonal-guided feature heterogenization to guide the adversarial perturbation to contain more modality-discrepancy features in the encoded embeddings. Since VLP models rely more on aligned features among different modalities during decision-making, increasing the modality-discrepant could confuse the learned representation for better attacking ability. Extensive experiments under diverse settings demonstrate that the proposed TMM outperforms the comparisons by large margins, i.e., 20.47% improvements in transferable attacking ability on average. Moreover, we highlight that our TMM also shows outstanding attacking performance on large models, such as MiniGPT-4, Otter, etc.
Найдено
Ничего не найдено, попробуйте изменить настройки фильтра.
Для доступа к списку цитирований публикации необходимо авторизоваться.
Топ-30
Журналы
|
1
2
3
4
|
|
|
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
4 публикации, 17.39%
|
|
|
IEEE Transactions on Information Forensics and Security
4 публикации, 17.39%
|
|
|
IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)
1 публикация, 4.35%
|
|
|
Visual Intelligence
1 публикация, 4.35%
|
|
|
Information Sciences
1 публикация, 4.35%
|
|
|
Sensors
1 публикация, 4.35%
|
|
|
IEEE Transactions on Pattern Analysis and Machine Intelligence
1 публикация, 4.35%
|
|
|
International Journal of Machine Learning and Cybernetics
1 публикация, 4.35%
|
|
|
Complex & Intelligent Systems
1 публикация, 4.35%
|
|
|
ACM Computing Surveys
1 публикация, 4.35%
|
|
|
Lecture Notes in Networks and Systems
1 публикация, 4.35%
|
|
|
IEEE Transactions on Consumer Electronics
1 публикация, 4.35%
|
|
|
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
1 публикация, 4.35%
|
|
|
IEEE Transactions on Neural Networks and Learning Systems
1 публикация, 4.35%
|
|
|
IEEE Transactions on Dependable and Secure Computing
1 публикация, 4.35%
|
|
|
IEEE Communications Surveys and Tutorials
1 публикация, 4.35%
|
|
|
Neurocomputing
1 публикация, 4.35%
|
|
|
1
2
3
4
|
Издатели
|
2
4
6
8
10
12
14
16
|
|
|
Institute of Electrical and Electronics Engineers (IEEE)
15 публикаций, 65.22%
|
|
|
Springer Nature
4 публикации, 17.39%
|
|
|
Elsevier
2 публикации, 8.7%
|
|
|
MDPI
1 публикация, 4.35%
|
|
|
Association for Computing Machinery (ACM)
1 публикация, 4.35%
|
|
|
2
4
6
8
10
12
14
16
|
- Мы не учитываем публикации, у которых нет DOI.
- Статистика публикаций обновляется еженедельно.
Вы ученый?
Создайте профиль, чтобы получать персональные рекомендации коллег, конференций и новых статей.
Метрики
23
Всего цитирований:
23
Цитирований c 2025:
18
(78.26%)
Самый цитирующий журнал
Цитирований в журнале:
4
Ошибка в публикации?