set_normalization_used #191

talrub · 2024-06-28T09:57:09Z

❔ Any questions

Hi,

According to attack.py line 501, it seems that if I perform atk.set_normalization_used(mean=[...], std=[...]) before applying the attack, the inputs will be denormalized (by self.inverse_normalize(inputs)). Can you explain why this is necessary? After all, I am training the model with normalized images, so why wouldn't we use the normalized images for the attack?

For example, for the MNIST dataset I am using:

def get_mnist_statistics():
train_set = torchvision.datasets.MNIST(root='./data', train=True, download=True, transform=transforms.ToTensor())
data = torch.cat([d[0] for d in DataLoader(train_set)])
return data.mean(dim=[0, 2, 3]), data.std(dim=[0, 2, 3])

mean, std = get_mnist_statistics()
transform = transforms.Compose([transforms.ToTensor(), transforms.Normalize(mean, std), transforms.Lambda(lambda x: x.view(784, 1))])
trainset = torchvision.datasets.MNIST(root='./data', train=True, download=True, transform=transform )

rikonaka · 2024-06-28T15:03:30Z

The internal algorithm is built on input data between 0 and 1([0, 1]), so if you used normalization during training, we need to reduce it to the 0 to 1 interval for the internal algorithm to process it. There is actually no ambiguity here.

The first principle of program design is to have a unified input and output.

So the internal algorithm here defaults to a unified input of data in the range of 0 to 1, and there is no other meaning.

talrub · 2024-07-01T13:31:17Z

I understand, but what about my MNIST example? The input data is already between 0 and 1, but I still normalize it, as it is common to do so.

rikonaka · 2024-07-01T13:35:40Z

If you plan to use normalization in your code, just use it. I don't know what's the problem here?

talrub · 2024-07-02T14:34:37Z

I don't understand why, in the case of MNIST, which contains [0,1] values before normalization, we need to use 'set_normalization_used' when we normalize the data. Why can't we provide the attack algorithm with the same data we used for training without using 'set_normalization_used'?

talrub added the enhancement New feature or request label Jun 28, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

set_normalization_used #191

set_normalization_used #191

talrub commented Jun 28, 2024

rikonaka commented Jun 28, 2024

talrub commented Jul 1, 2024

rikonaka commented Jul 1, 2024

talrub commented Jul 2, 2024

set_normalization_used #191

set_normalization_used #191

Comments

talrub commented Jun 28, 2024

❔ Any questions

rikonaka commented Jun 28, 2024

talrub commented Jul 1, 2024

rikonaka commented Jul 1, 2024

talrub commented Jul 2, 2024