Robustness-Aware Word Embedding Improves Certified Robustness to Adversarial Word Substitutions

Abstract

Natural Language Processing (NLP) models have achieved great success on clean text, but they are known to be vulnerable to adversarial examples, typically crafted via synonym substitutions. In this paper, we address this problem and find that word embeddings play a key role in the certified robustness of NLP models. Based on this finding, we propose the Embedding Interval Bound Constraint (EIBC) triplet loss, which trains robustness-aware word embeddings for better certified robustness.
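To make the idea concrete, here is a minimal sketch of a triplet-style loss that constrains synonym embeddings to lie in a tight interval (L-infinity ball) around an anchor word while pushing non-synonyms outside a margin. This is an illustrative assumption, not the paper's exact EIBC formulation; the function name, margin parameter, and L-infinity choice are hypothetical.

```python
import numpy as np

def eibc_style_triplet_loss(anchor, synonym, negative, margin=1.0):
    """Hypothetical sketch of an interval-bound-style triplet loss.

    Shrinks the L-infinity radius needed to cover a synonym around the
    anchor embedding (tightening the interval bound used in certification),
    while keeping a non-synonym at least `margin` farther away.
    """
    # Smallest per-coordinate interval radius covering the synonym.
    radius = np.max(np.abs(anchor - synonym))
    # L-infinity distance to the negative (non-synonym) word.
    neg_dist = np.max(np.abs(anchor - negative))
    # Penalize a large covering radius, plus a hinge term that fires
    # when the negative is not separated from the synonym interval.
    return radius + max(0.0, margin + radius - neg_dist)

# Example: a close synonym and a distant non-synonym yield a small loss.
anchor = np.array([0.0, 0.0])
synonym = np.array([0.1, 0.2])
negative = np.array([2.0, 2.0])
loss = eibc_style_triplet_loss(anchor, synonym, negative)
```

Minimizing such a loss over synonym sets would tighten the embedding intervals that interval-bound certification propagates, which is the intuition behind training robustness-aware embeddings.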

Publication
In Findings of ACL 2023
Yibin Wang
Intern

My research interests focus on trustworthy artificial intelligence, particularly in the areas of calibration, generalization, and adversarial robustness.