
[BUG] SetVals failed on tensors that are created from user pointers #175

Closed · AtomicVar opened this issue May 2, 2022 · 5 comments · Fixed by #176

Comments

AtomicVar (Contributor)

Describe the bug
When SetVals is called on a tensor created from a user pointer, a segmentation fault occurs.

To Reproduce

// Allocate device-only memory and wrap it in a non-owning MatX tensor
float* dev_float;
cudaMalloc(&dev_float, sizeof(float) * 6);

auto t = matx::make_tensor<float, 2, matx::non_owning>(dev_float, {2, 3});
t.SetVals({{1, 2, 3}, {4, 5, 6}});  // crashes: SetVals writes to the memory from the host
t.Print();

VSCode debug error: [screenshot]

Expected behavior
SetVals should work on tensors created from user pointers.

System details (please complete the following information):

  • OS: Ubuntu 20.04
  • CUDA version: CUDA 11.4
  • g++ version: 9.3.0
luitjens (Collaborator) commented May 2, 2022

I'm not sure this is a bug. You can't call SetVals on non-managed memory, since it runs on the host. Can you change cudaMalloc to cudaMallocManaged instead?
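
For reference, a minimal sketch of this workaround, assuming the same make_tensor overload used in the report but backing the tensor with cudaMallocManaged so the host-side SetVals writes are addressable (illustrative only, not part of the library):

#include <matx.h>

int main() {
  float* managed_float;
  cudaMallocManaged(&managed_float, sizeof(float) * 6);  // visible to host and device

  // Same non-owning tensor as in the report, now over managed memory
  auto t = matx::make_tensor<float, 2, matx::non_owning>(managed_float, {2, 3});
  t.SetVals({{1, 2, 3}, {4, 5, 6}});  // host write is valid on managed memory
  t.Print();

  cudaFree(managed_float);
  return 0;
}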

cliffburdick (Collaborator)
There are two options here: we can throw an exception and shut down, or we can detect that it's not managed memory and do a cudaMemcpy. I tend to think the latter is better, since it should be more consistent.
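
As an illustration of the "detect it's not managed memory" branch, a check along these lines could classify the pointer before deciding how to write to it. This is a hypothetical helper (host_writable is an invented name), not MatX's actual implementation:

#include <cuda_runtime.h>

// Hypothetical check: can this pointer be written directly from the host?
bool host_writable(const void* ptr) {
  cudaPointerAttributes attr{};
  if (cudaPointerGetAttributes(&attr, ptr) != cudaSuccess) {
    (void)cudaGetLastError();  // clear the sticky error from the failed query
    return true;               // older CUDA versions report plain host pointers as an error
  }
  // Managed, pinned host, and unregistered host memory are all host-writable;
  // only device-only allocations (the cudaMalloc case above) are not.
  return attr.type != cudaMemoryTypeDevice;
}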

cliffburdick (Collaborator)
@ZJUGuoShuai the issue with passing a device pointer and doing a cudaMemcpy is that you're technically allowed to do something like:

auto a = make_tensor<float>({3,3});
a.SetVals({{0},{1,2}});

In this example we set partial values per row and leave the rest uninitialized. Obviously we can't do a bulk cudaMemcpy in that case. The other option is to launch a cudaMemcpy for every single value. That will be extremely inefficient, but users of SetVals are hopefully not calling it inside performance-critical code anyway, since it's intended for setting up values at the beginning of the application. I'm inclined to say this method is "best" because it gives the expected results, and we can document that it will be very slow.

Either way, this also has to be blocking on stream 0 since we can't make this asynchronous at the moment.
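
For concreteness, the per-value copy option discussed above could look roughly like the following. The set_vals_2d helper is invented for illustration and is not MatX code:

#include <cuda_runtime.h>
#include <initializer_list>
#include <cstddef>

// Hypothetical helper: copy a (possibly ragged) initializer list into device
// memory one element at a time. Rows with fewer values than columns simply
// leave the remaining elements untouched, matching the partial-initialization
// case above, at the cost of one blocking cudaMemcpy per element.
void set_vals_2d(float* dev_ptr, std::size_t row_stride,
                 std::initializer_list<std::initializer_list<float>> vals) {
  std::size_t r = 0;
  for (const auto& row : vals) {
    std::size_t c = 0;
    for (float v : row) {
      cudaMemcpy(dev_ptr + r * row_stride + c, &v, sizeof(float),
                 cudaMemcpyHostToDevice);
      ++c;
    }
    ++r;
  }
}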

luitjens (Collaborator) commented May 2, 2022

I think this is just an invalid usage of the APIs and not a bug. Detecting all cases of invalid memory accesses is not a requirement that any library/SDK should take on. We can clean up documentation to make it clear that wrapping existing memory may be limited if it is not addressable from the host and device.

cliffburdick (Collaborator)
@ZJUGuoShuai we closed this and now print an error, for the reasons above. Please reopen if you have suggestions on how to improve it.
