
Simplify EMA to use Pytorch's update_parameters #5469

Merged: 4 commits from xiaohu2015:patch-1 merged into pytorch:main on Feb 27, 2022

Conversation

xiaohu2015 (Contributor) commented Feb 24, 2022

This PR addresses #5284.

With the torch nightly version 1.12.0.dev20220225+cu113, the following snippet runs OK and shows that the simplified implementation matches the previous one:

import torch                                                                                                         


class ExponentialMovingAverageV1(torch.optim.swa_utils.AveragedModel):
    """Maintains moving averages of model parameters using an exponential decay.
    ``ema_avg = decay * avg_model_param + (1 - decay) * model_param``
    `torch.optim.swa_utils.AveragedModel <https://pytorch.org/docs/stable/optim.html#custom-averaging-strategies>`_
    is used to compute the EMA.
    """
    def __init__(self, model, decay, device='cpu'):
        ema_avg = (lambda avg_model_param, model_param, num_averaged:
                   decay * avg_model_param + (1 - decay) * model_param)
        super().__init__(model, device, ema_avg) 

    def update_parameters(self, model):
        # Iterate over the full state_dict so that buffers are averaged together with
        # parameters, matching the previous torchvision EMA behaviour.
        for p_swa, p_model in zip(self.module.state_dict().values(), model.state_dict().values()):
            device = p_swa.device 
            p_model_ = p_model.detach().to(device)
            if self.n_averaged == 0:
                p_swa.detach().copy_(p_model_)
            else:
                p_swa.detach().copy_(self.avg_fn(p_swa.detach(), p_model_,
                                     self.n_averaged.to(device)))
        self.n_averaged += 1

class ExponentialMovingAverage(torch.optim.swa_utils.AveragedModel):
    """Maintains moving averages of model parameters using an exponential decay.
    ``ema_avg = decay * avg_model_param + (1 - decay) * model_param``
    `torch.optim.swa_utils.AveragedModel <https://pytorch.org/docs/stable/optim.html#custom-averaging-strategies>`_
    is used to compute the EMA.
    """
    def __init__(self, model, decay, device='cpu'):
        ema_avg = (lambda avg_model_param, model_param, num_averaged:
                   decay * avg_model_param + (1 - decay) * model_param)
        # use_buffers=True makes AveragedModel average buffers as well, so the custom
        # update_parameters override above is no longer needed.
        super().__init__(model, device, ema_avg, use_buffers=True)



class ToyModel(torch.nn.Module):
    """Toy model with one parameter and one buffer, to check that both are averaged."""

    def __init__(self):
        super().__init__()
        self.x = torch.nn.Parameter(torch.zeros(5))
        self.register_buffer('y', torch.zeros(5))

    def forward(self, input):
        self.x += input
        self.y += input
        return self.x, self.y

decay = 0.9
model1 = ToyModel()
ema1 = ExponentialMovingAverageV1(model1, decay)
model2 = ToyModel()
ema2 = ExponentialMovingAverage(model2, decay)

x = torch.ones(5)

# Both implementations should produce identical parameter (x) and buffer (y) averages.
for _ in range(10):
    with torch.no_grad():
        model1(x)
        model2(x)
        ema1.update_parameters(model1)
        ema2.update_parameters(model2)

        assert torch.equal(ema1.module.x, ema2.module.x)
        assert torch.equal(ema1.module.y, ema2.module.y)
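
Below is a minimal usage sketch, assuming a PyTorch build whose AveragedModel supports use_buffers; the Linear model, optimizer, and synthetic data are illustrative assumptions and not part of this PR:

# Minimal usage sketch (illustrative, not part of this PR): how such an EMA wrapper
# is typically driven alongside a regular training loop.
import torch
import torch.nn.functional as F

model = torch.nn.Linear(10, 1)
ema_model = ExponentialMovingAverage(model, decay=0.999)
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)

for _ in range(100):
    inputs = torch.randn(32, 10)
    targets = torch.randn(32, 1)

    optimizer.zero_grad()
    loss = F.mse_loss(model(inputs), targets)
    loss.backward()
    optimizer.step()

    # With use_buffers=True the stock AveragedModel.update_parameters averages
    # buffers along with parameters, so no override is needed here.
    ema_model.update_parameters(model)

# The averaged weights live in ema_model.module; evaluate with ema_model(inputs).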

facebook-github-bot commented Feb 24, 2022

💊 CI failures summary and remediations

As of commit 31b0e04 (more details on the Dr. CI page):


💚 💚 Looks good so far! There are no failures yet. 💚 💚


This comment was automatically generated by Dr. CI.

Please report bugs/suggestions to the (internal) Dr. CI Users group.


@xiaohu2015 xiaohu2015 marked this pull request as draft February 24, 2022 11:29
@xiaohu2015 xiaohu2015 marked this pull request as ready for review February 26, 2022 04:45
datumbox (Contributor) left a comment

@xiaohu2015 Thanks a lot for the contribution and for providing a snippet that verifies that the implementation works fine. I think we can merge this. :)

@datumbox datumbox merged commit f40c8df into pytorch:main Feb 27, 2022
@datumbox datumbox linked an issue Feb 27, 2022 that may be closed by this pull request
@xiaohu2015 xiaohu2015 deleted the patch-1 branch February 27, 2022 14:26
facebook-github-bot pushed a commit that referenced this pull request Mar 3, 2022
Summary: Co-authored-by: Vasilis Vryniotis <[email protected]>

Reviewed By: datumbox

Differential Revision: D34579515

fbshipit-source-id: 6f563a48305dc1c9d99274d40c15416075c9b20f
Linked issue that may be closed by this pull request: Simplify EMA to use Pytorch's update_parameters

3 participants