
Improve the flexibility of standardize_dtype and fix pad in torch backend #828

Merged

Conversation

@james77777778 (Contributor) commented Sep 1, 2023

One of the advantages of Keras Core is that we can mix workflows across different backends. For example, we can train a TensorFlow model using a torch dataloader.

However, operations that call standardize_dtype might fail when the dtype is a torch.Tensor.dtype and the backend is NOT torch.

import os

os.environ["KERAS_BACKEND"] = "tensorflow"

import torch

from keras_core import ops

x = torch.randn(4, 16, 16, 3)
y = ops.convert_to_tensor(x, dtype=x.dtype)  # failed w/o this PR
print(y.dtype)

This PR addresses the issue by implementing a better check for torch.Tensor.dtype.
A unit test for this behavior has been included.
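
Roughly, the idea is to recognize a torch dtype by its string form and keep only the part after the "torch." prefix. A minimal standalone sketch of that check (the helper name is hypothetical, not the PR's actual code):

import torch

def _standardize_torch_dtype(dtype):
    # str(torch.float32) == "torch.float32"; keep only the part after the dot.
    if hasattr(dtype, "__str__") and "torch" in str(dtype):
        return str(dtype).split(".")[-1]
    return dtype

print(_standardize_torch_dtype(torch.float32))  # "float32"
print(_standardize_torch_dtype("int32"))        # unchanged: "int32"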

@fchollet (Member) left a comment
Thanks for the PR

@@ -408,7 +408,7 @@ def standardize_dtype(dtype):
         dtype = "int32"
     if hasattr(dtype, "name"):
         dtype = dtype.name
-    elif config.backend() == "torch":
+    if hasattr(dtype, "__str__") and "torch" in str(dtype):
@fchollet (Member):

You can use elif for better performance (and above too)
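
A toy illustration of the suggestion; the conditions come from the diff above, but the branch bodies here are illustrative assumptions rather than the PR's exact code:

# Two independent `if` statements always evaluate both conditions:
if hasattr(dtype, "name"):
    dtype = dtype.name
if hasattr(dtype, "__str__") and "torch" in str(dtype):
    dtype = str(dtype).split(".")[-1]

# With `elif`, the second check is skipped once the first branch matches:
if hasattr(dtype, "name"):
    dtype = dtype.name
elif hasattr(dtype, "__str__") and "torch" in str(dtype):
    dtype = str(dtype).split(".")[-1]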

@james77777778 (Contributor, Author):

Done.

@james77777778 (Contributor, Author) commented:

Hi @fchollet,
I have updated standardize_dtype to give the best performance as far as I know.

This change caught subtle bugs in:

  • digitize (numpy backend)
  • pad and isclose (torch backend)

It is surprising that x.dtype == "int" is True when the dtype is np.int64. This results in strange behavior in standardize_dtype.

>>> import numpy as np
>>> x = np.array([0.0, 1.0, 3.0, 1.6])
>>> bins = np.array([0.0, 3.0, 4.5, 7.0])
>>> np.digitize(x, bins).dtype
dtype('int64')
>>> np.digitize(x, bins).dtype == "int"
True
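
The comparison is True because NumPy coerces the string "int" to the platform's default integer dtype before comparing. A quick check, assuming a 64-bit platform where that default is int64:

>>> import numpy as np
>>> np.dtype("int")
dtype('int64')
>>> np.dtype("int") == np.int64
True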

In the torch backend:

@@ -676,7 +677,8 @@ def pad(x, pad_width, mode="constant"):
         mode = "replicate"
     if mode != "constant" and x.ndim < 3:
         new_dims = [1] * (3 - x.ndim)
-        x = cast(x, torch.float32) if x.dtype == torch.int else x
+        if x.dtype not in (torch.float32, torch.float64):
@fchollet (Member):

What about float16?

@james77777778 (Contributor, Author):

I have tried float16 with mode="reflect" and it is not supported by torch:

>>> x = torch.randn(3, 10, 10, dtype=torch.float16)
>>> torch.nn.functional.pad(x, [1, 1], mode="reflect")
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
RuntimeError: "reflection_pad1d" not implemented for 'Half'

I believe the solution in this PR is the same as the official one in torchvision:

https://github.com/pytorch/vision/blob/main/torchvision/transforms/_functional_tensor.py#L418-L423
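
In short, the torchvision code casts to a supported float dtype, pads, and then casts back. A minimal sketch of that pattern (not the PR's exact code):

import torch

x = torch.randn(3, 10, 10, dtype=torch.float16)
# reflection padding is not implemented for float16, so cast, pad, and cast back
y = torch.nn.functional.pad(x.to(torch.float32), [1, 1], mode="reflect").to(x.dtype)
print(y.shape, y.dtype)  # torch.Size([3, 10, 12]) torch.float16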

@james77777778 (Contributor, Author):

@fchollet
Should we restore the original dtype after the op? The current code casts to float32 for non-constant padding modes.

@fchollet (Member):

> Should we restore the original dtype after the op?

In fact, yes -- that is the behavior that other backends follow. Does the unit test only check float32?
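
For reference, np.pad itself preserves the input dtype, which is the behavior the other backends follow:

>>> import numpy as np
>>> x = np.ones((2, 3), dtype="int32")
>>> np.pad(x, ((1, 1), (1, 1)), mode="reflect").dtype
dtype('int32')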

@james77777778 (Contributor, Author):

> In fact, yes -- that is the behavior that other backends follow. Does the unit test only check float32?

Actually, the unit test only checked int64.

def test_pad(self):
    x = np.array([[1, 2], [3, 4]])
    self.assertAllClose(
        knp.pad(x, ((1, 1), (1, 1))),
        np.pad(x, ((1, 1), (1, 1))),
    )
    self.assertAllClose(
        knp.pad(x, ((1, 1), (1, 1))),
        np.pad(x, ((1, 1), (1, 1))),
    )
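
A broader test could parameterize over dtypes and modes. A hypothetical sketch (not the PR's actual test), assuming the same knp alias for the keras_core ops and the same test-case base class:

def test_pad_various_dtypes_and_modes(self):
    # hypothetical coverage sketch: several dtypes and padding modes
    for dtype in ("int32", "float16", "float32"):
        for mode in ("constant", "reflect", "symmetric"):
            x = np.arange(24, dtype=dtype).reshape((2, 3, 4))
            pad_width = ((0, 0), (1, 1), (2, 2))
            self.assertAllClose(
                knp.pad(x, pad_width, mode=mode),
                np.pad(x, pad_width, mode=mode),
            )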

Please see the new comment below

@james77777778 (Contributor, Author) commented Sep 2, 2023

@fchollet

I have refactored pad in the torch backend to accommodate the restrictions of torch.nn.functional.pad:
https://pytorch.org/docs/stable/generated/torch.nn.functional.pad.html

> Replicate and reflection padding are implemented for padding the last 3 dimensions of a 4D or 5D input tensor, the last 2 dimensions of a 3D or 4D input tensor, or the last dimension of a 2D or 3D input tensor.

In the example below, reflect padding fails when pad_width lists all 5 dimensions, even though only the last 3 are actually padded. However, it works if we remove the redundant zeros.

>>> x = torch.ones((2, 3, 4, 5, 6))
>>> torch.nn.functional.pad(x, [2, 3, 1, 1, 1, 1, 0, 0, 0, 0], mode="reflect").shape
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
NotImplementedError: Only 2D, 3D, 4D, 5D padding with non-constant padding are supported for now
>>> torch.nn.functional.pad(x, [2, 3, 1, 1, 1, 1], mode="reflect").shape
torch.Size([2, 3, 6, 7, 11])

I have also updated the unit test to improve the coverage for various shapes, dtypes and modes.
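
Putting the two adjustments together, here is a standalone sketch (not the PR's exact code; the pad_reflect helper name is hypothetical) of trimming leading zero pairs before a non-constant pad and casting unsupported dtypes to float32 and back:

import torch

def pad_reflect(x, pad_width):
    # pad_width is NumPy-style: ((before, after), ...) per dimension.
    # Non-constant modes in torch.nn.functional.pad only accept entries
    # for the trailing dimensions, so drop leading all-zero pairs.
    pad_width = [tuple(p) for p in pad_width]
    while pad_width and pad_width[0] == (0, 0):
        pad_width = pad_width[1:]
    # torch expects a flat list ordered from the last dimension inward.
    flat = []
    for before, after in reversed(pad_width):
        flat.extend([before, after])
    original_dtype = x.dtype
    needs_cast = original_dtype not in (torch.float32, torch.float64)
    if needs_cast:
        x = x.to(torch.float32)  # e.g. float16 and int dtypes are rejected by reflect
    out = torch.nn.functional.pad(x, flat, mode="reflect")
    return out.to(original_dtype) if needs_cast else out

x = torch.ones((2, 3, 4, 5, 6), dtype=torch.int32)
print(pad_reflect(x, ((0, 0), (0, 0), (1, 1), (1, 1), (2, 3))).shape)
# torch.Size([2, 3, 6, 7, 11]), with the original int32 dtype restored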

@james77777778 changed the title from "Improve the flexibility in dtype check for torch.Tensor.dtype" to "Improve the flexibility of standardize_dtype and fix pad in torch backend" on Sep 2, 2023
@fchollet (Member) left a comment

LGTM, thank you!

@fchollet merged commit de510e9 into keras-team:main Sep 2, 2023
6 checks passed
@james77777778 deleted the improve-flexibility-in-dtype branch September 4, 2023 01:10