Updated README - detailed examples, new paper

mamengyiyi · Jan 24, 2018 · 2b482c4 · 2b482c4
1 parent 8a72661
commit 2b482c4
Showing 1 changed file with 18 additions and 13 deletions.
diff --git a/README.md b/README.md
@@ -1,7 +1,7 @@
 # Crafting Adversarial Example Attacks on Policy Learners 
 
 
-Framework for experimental analysis of adversarial example attacks on policy learning in Deep RL. Attack methodologies are based on our paper "Vulnerability of Deep Reinforcement Learning to Policy Induction Attacks" (Behzadan & Munir, 2017 - https://arxiv.org/abs/1701.04143 ).  
+Framework for experimental analysis of adversarial example attacks on policy learning in Deep RL. Attack methodologies are detailed in our paper "Whatever Does Not Kill Deep Reinforcement Learning, Makes It Stronger" (Behzadan & Munir, 2017 - https://arxiv.org/abs/1712.09344 ).  
 
 This project provides an interface between [@openai/baselines](https://github.com/openai/baselines) and [@tensorflow/cleverhans](https://github.com/tensorflow/cleverhans) to facilitate the crafting and implementation of adversarial example attacks on deep RL algorithms. We would also like to thank [@andrewliao11/NoisyNet-DQN](https://github.com/andrewliao11/NoisyNet-DQN) for inspiring solutions to implementing the [NoisyNet](https://arxiv.org/abs/1706.10295) algorithm for DQN.
 
@@ -22,52 +22,57 @@ pip install -e .
 ```
 
 ### Examples
-Current version includes only DQN attacks.
+Two example scripts are included. 
 
-Test-time, No attack, testing a DQN model of Breakout trained without parameter noise:
+- enjoy-adv.py : sample implementation of test-time FGSM attack on pre-trained DQN Atari agents.
+- train.py: sample implementation of training-time FGSM attack on DQN Atari agents.
+
+Some example executions on the Breakout game environment are: 
+
+- Test-time, No attack, testing a DQN model of Breakout trained without parameter noise:
 
 ```
 $> python3 enjoy-adv.py --env Breakout --model-dir ./data/Breakout/model-173000 --video ./Breakout.mp4
 ```
 
-Test-time, No attack, testing a DQN model of Breakout trained with parameter noise (NoisyNet implementation):
+- Test-time, No attack, testing a DQN model of Breakout trained with parameter noise (NoisyNet implementation):
 ```
 $> python3 enjoy-adv.py --env Breakout --noisy --model-dir ./data/Breakout/model-173000 --video ./Breakout.mp4
 ```
 
-Test-time, Whitebox FGSM attack, testing a DQN model of Breakout trained without parameter noise:
+- Test-time, Whitebox FGSM attack, testing a DQN model of Breakout trained without parameter noise:
 ```
 $> python3 enjoy-adv.py --env Breakout --model-dir ./data/Breakout/model-173000 --attack fgsm --video ./Breakout.mp4
 ```
 
-Test-time, Whitebox FGSM attack, testing a DQN model of Breakout trained with parameter noise (NoisyNet implementation):
+- Test-time, Whitebox FGSM attack, testing a DQN model of Breakout trained with parameter noise (NoisyNet implementation):
 ```
 $> python3 enjoy-adv.py --env Breakout --noisy --model-dir ./data/Breakout/model-173000 --attack fgsm --video ./Breakout.mp4
 ```
 
-Test-time, Blackbox FGSM attack, testing a DQN model of Breakout trained without parameter noise:
+- Test-time, Blackbox FGSM attack, testing a DQN model of Breakout trained without parameter noise:
 ```
 $> python3 enjoy-adv.py --env Breakout --model-dir ./data/Breakout/model-173000 --attack fgsm --blackbox --model-dir2 ./data/Breakout/model-173000-2 --video ./Breakout.mp4
 ```
 
-Test-time, Blackbox FGSM attack, testing a DQN model of Breakout trained with parameter noise (NoisyNet implementation), replica model trained without parameter noise:
+- Test-time, Blackbox FGSM attack, testing a DQN model of Breakout trained with parameter noise (NoisyNet implementation), replica model trained without parameter noise:
 ```
 $> python3 enjoy-adv.py --env Breakout --noisy --model-dir ./data/Breakout/model-173000 --attack fgsm --blackbox --model-dir2 ./data/Breakout/model-173000-2 --video ./Breakout.mp4
 ```
 
-Test-time, Blackbox FGSM attack, testing a DQN model of Breakout trained with parameter noise (NoisyNet implementation), replica model trained with parameter noise:
+- Test-time, Blackbox FGSM attack, testing a DQN model of Breakout trained with parameter noise (NoisyNet implementation), replica model trained with parameter noise:
 ```
 $> python3 enjoy-adv.py --env Breakout --noisy --model-dir ./data/Breakout/model-173000 --attack fgsm --blackbox --model-dir2 ./data/Breakout/model-173000-2 --noisy2 --video ./Breakout.mp4
 ```
 
-Training-time, Whitebox attack, no parameter noise, injecting adversarial exam at every 2 step:
+- Training-time, Whitebox attack, no parameter noise, injecting adversarial example with 20% probability:
 
 ```
-$> python3 train.py --env Breakout --save-dir ./data/Breakout/ --attack fgsm --num-steps 200000000 --attack-freq 2 
+$> python3 train.py --env Breakout --save-dir ./data/Breakout/ --attack fgsm --num-steps 200000000 --attack-prob 0.2 
 ```
 
-Training-time, Whitebox attack, NoisyNet parameter noise, injecting adversarial exam at every 1 step:
+- Training-time, Whitebox attack, NoisyNet parameter noise, injecting adversarial example with 100% probability:
 
 ```
-$> python3 train.py --env Breakout --noisy --save-dir ./data/Breakout/ --attack fgsm --num-steps 200000000 --attack-freq 1
+$> python3 train.py --env Breakout --noisy --save-dir ./data/Breakout/ --attack fgsm --num-steps 200000000 --attack-prob 1.0
 ```