Using GPU Gaussian blur at DarkPose unbiased decoding & megvii #332

HoBeom · 2020-12-04T06:20:59Z

mmpose/mmpose/core/evaluation/top_down_eval.py

Line 266 in 9c047f6

def _gaussian_blur(heatmaps, kernel=11):

This can be processed on the GPU using a PyTorch module:
https://discuss.pytorch.org/t/gaussian-kernel-layer/37619

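import numpy as np
import scipy.ndimage
import torch
import torch.nn as nn

class GaussianLayer(nn.Module):
    def __init__(self):
        super(GaussianLayer, self).__init__()
        self.seq = nn.Sequential(
            # Reflect-pad by 10 so the 21x21 convolution keeps the input size
            nn.ReflectionPad2d(10),
            # groups=3: each of the 3 channels is filtered independently
            nn.Conv2d(3, 3, 21, stride=1, padding=0, bias=False, groups=3)
        )

        self.weights_init()

    def forward(self, x):
        return self.seq(x)

    def weights_init(self):
        # Blurring a unit impulse yields the 21x21 Gaussian kernel itself
        n = np.zeros((21, 21))
        n[10, 10] = 1
        k = scipy.ndimage.gaussian_filter(n, sigma=3)
        for name, f in self.named_parameters():
            # copy_ broadcasts the (21, 21) kernel to the (3, 1, 21, 21) weight
            f.data.copy_(torch.from_numpy(k))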

Or this one:
https://www.programmersought.com/article/17644345494/

import torch
import torch.nn as nn
import torch.nn.functional as F

class GaussianBlurConv(nn.Module):
    def __init__(self, channels=3):
        super(GaussianBlurConv, self).__init__()
        self.channels = channels
        kernel = [[0.00078633, 0.00655965, 0.01330373, 0.00655965, 0.00078633],
                  [0.00655965, 0.05472157, 0.11098164, 0.05472157, 0.00655965],
                  [0.01330373, 0.11098164, 0.22508352, 0.11098164, 0.01330373],
                  [0.00655965, 0.05472157, 0.11098164, 0.05472157, 0.00655965],
                  [0.00078633, 0.00655965, 0.01330373, 0.00655965, 0.00078633]]
        # Shape (1, 1, 5, 5) -> (channels, 1, 5, 5): one kernel copy per channel
        kernel = torch.FloatTensor(kernel).unsqueeze(0).unsqueeze(0)
        kernel = kernel.repeat(self.channels, 1, 1, 1)
        self.weight = nn.Parameter(data=kernel, requires_grad=False)

    def forward(self, x):
        # x is a single (C, H, W) image; unsqueeze adds the batch dimension
        x = F.conv2d(x.unsqueeze(0), self.weight, padding=2, groups=self.channels)
        return x
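
For heatmaps specifically, the same idea could be applied to a whole (N, K, H, W) batch at once. The following is only a rough sketch, not the mmpose implementation: the function names `gaussian_kernel_1d` and `gaussian_blur_torch` are made up for illustration, the default `sigma=2.0` is an assumption for an 11-pixel kernel, and the step that rescales each heatmap back to its original peak value assumes the CPU `_gaussian_blur` behaves that way.

```python
import torch
import torch.nn.functional as F


def gaussian_kernel_1d(kernel: int, sigma: float) -> torch.Tensor:
    """Return a normalised 1-D Gaussian of odd length `kernel` (hypothetical helper)."""
    assert kernel % 2 == 1, "kernel size must be odd"
    x = torch.arange(kernel, dtype=torch.float32) - (kernel - 1) / 2
    g = torch.exp(-x.pow(2) / (2 * sigma ** 2))
    return g / g.sum()


def gaussian_blur_torch(heatmaps: torch.Tensor, kernel: int = 11,
                        sigma: float = 2.0) -> torch.Tensor:
    """Blur (N, K, H, W) heatmaps on whatever device they already live on."""
    n, k, h, w = heatmaps.shape
    # Match the dtype and device of the input (CPU or GPU)
    g = gaussian_kernel_1d(kernel, sigma).to(heatmaps)
    pad = kernel // 2
    # Treat every heatmap as its own single-channel image
    x = heatmaps.reshape(n * k, 1, h, w)
    origin_max = x.amax(dim=(2, 3), keepdim=True)
    # Separable blur with zero padding: horizontal pass, then vertical pass
    x = F.conv2d(x, g.view(1, 1, 1, kernel), padding=(0, pad))
    x = F.conv2d(x, g.view(1, 1, kernel, 1), padding=(pad, 0))
    # Assumed behaviour: rescale so each heatmap keeps its original peak value
    x = x * origin_max / x.amax(dim=(2, 3), keepdim=True).clamp_min(1e-12)
    return x.reshape(n, k, h, w)


heatmaps = torch.rand(2, 133, 64, 48)  # e.g. 133 whole-body keypoints
if torch.cuda.is_available():
    heatmaps = heatmaps.cuda()
blurred = gaussian_blur_torch(heatmaps)
```

Splitting the 2-D blur into two 1-D passes keeps the work per pixel proportional to the kernel length rather than its square, which may matter more as the number of keypoints grows.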


innerlee · 2020-12-04T06:25:27Z

Yeah, many CPU ops can be moved to the GPU.
Ideally we should profile the code and find all the bottlenecks. If they happen to be CPU ops, then we can re-implement them on the GPU.

For profiling tools, see the last comment in #73.
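
As a rough illustration of that kind of profiling, a minimal `cProfile` sketch might look like the following. `slow_postprocess` and `profile_call` are hypothetical names standing in for a CPU-bound step such as heatmap decoding and a small wrapper around it; they are not part of mmpose.

```python
import cProfile
import io
import pstats


def slow_postprocess(n: int = 200_000) -> int:
    """Stand-in for a CPU-bound post-processing step (hypothetical)."""
    return sum(i * i for i in range(n))


def profile_call(func, *args, top: int = 10) -> str:
    """Run `func` under cProfile and return the top entries by cumulative time."""
    profiler = cProfile.Profile()
    profiler.enable()
    func(*args)
    profiler.disable()
    buf = io.StringIO()
    stats = pstats.Stats(profiler, stream=buf)
    stats.sort_stats("cumulative").print_stats(top)
    return buf.getvalue()


report = profile_call(slow_postprocess)
print(report)
```

Note that `cProfile` only measures time spent on the Python/CPU side. CUDA kernels are launched asynchronously, so timing GPU code this way would likely need a `torch.cuda.synchronize()` around the measured region.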

HoBeom · 2020-12-04T06:38:47Z

Thanks. I'll go over it using cProfile, as in #73.
But I'll need some time, until next week.

HoBeom · 2020-12-21T09:16:13Z

before gaussian blur

after using torch gpu

#378
Sorry for the pull request (misclick 😄)

HoBeom · 2021-02-15T09:54:09Z

It doesn't seem necessary. The performance improvement is very small, but it requires a lot of modifications.
Thank you for the comments. 👍 @innerlee

ykk648 · 2022-08-11T07:54:57Z

@HoBeom Hey, your GPU version of GaussianBlur may not have performed well enough in your test, but for datasets like COCO-WholeBody there are 133 keypoints that need Gaussian blur when recovering coordinates from the heatmaps. I found your code helpful: it let a big model like HRNet48 with DARK run in real time on batches. Thanks a lot!

innerlee added the speed label Dec 4, 2020

innerlee assigned yaochaorui Dec 4, 2020

HoBeom closed this as completed Feb 15, 2021

rollingman1 pushed a commit to rollingman1/mmpose that referenced this issue Nov 5, 2021

chmod +x stat.py (open-mmlab#332)

0126ee1

jin-s13 assigned ly015 Aug 12, 2022

HAOCHENYE pushed a commit to HAOCHENYE/mmpose that referenced this issue Jun 27, 2023

[Fix]: fix load_checkpoint (open-mmlab#332)

12f7d3a
