About

This code accompanies the paper "Universal Adversarial Attacks with Natural Triggers for Text Classification" https://arxiv.org/abs/2005.00174, accepted by NAACL 2021.

Dependencies

Pytorch, AllenNLP, Hugging Face Transformers (see requirements.txt).

Perform universal attacks

First, download the pretrained ARAE model here, and unzip into the "./ARAR/oneb_pretrained" folder.

Then, go to sst or snli directory and run python sst_attack.py or python snli_attack.py.
The argument attack_class is used to select the class label to attack, and the argument len_lim specifies the length of attack trigger.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

About

Dependencies

Perform universal attacks

Files

README.md

Latest commit

History

README.md

File metadata and controls

About

Dependencies

Perform universal attacks