Large file is is not read correctly when using `UseWasmSharedBuffer` #161

BrzVlad · 2020-09-21T10:56:12Z

Describe the bug
Large file is is not read correctly when using UseWasmSharedBuffer

To Reproduce

Clone/download project https://github.com/layomia/WasmProj
dotnet run -c Release
Open browser to localhost:5001
Click "Browse" and select the Testdata-2018-08-09-15.json file
Wait for data to load (can take a few minutes)
Click "Convert text"

Expected behavior
Deserialization completes successfully

Project type
CSB

Environment

Ubuntu 18.04 / Chromium
5.0.0-preview.8.20407.11

Additional context
The data provided by ReadAllText is incorrect when UseWasmSharedBuffer is enabled. This later leads to a crash in json deserialization. A simple checksum can be used instead to verify the data.

The text was updated successfully, but these errors were encountered:

Tewr · 2020-09-21T18:36:20Z

Thanks for the bug report! Probably some quirk in the unsupported APIs I rely on.

Tewr · 2020-09-21T20:24:44Z

~~SHA-256 is OK for this file under SDK 5.0.0-preview.1.20120.5~~
This is unknown atm

Tewr · 2020-09-22T08:39:19Z

This has sent me down the compatibility/upgrade hole. I'm most likely not going to support 5.0.0-preview.8, that's going to be too much of an effort, there are breaking changes in 5.0.0-rc1, which will be my current target.

BrzVlad · 2020-09-22T15:40:39Z

@Tewr Do you know what exactly is causing this issue ? Is it a regression ? Anything on the runtime side that is not working as expected ?

Tewr · 2020-09-22T16:12:16Z

The buffer byte[] passed InvokeUnmarshalled is corrupted -- or I won't get the actual referenced buffer back from dotnet. I need to do some more testing before I'm sure, it's a bit hard to debug. But my first impression is that its the "wrong" buffer.

If I copy the entire stream to a MemoryStream first in the same manner, there is no problem. So I'm guessing that the slightly more complicated consumer code in StreamReader provokes more GC or something, which in turn provokes the problem.

Interesting parts of faulty code:

BlazorFileReader/src/Blazor.FileReader/script/FileReaderComponent.ts

Line 191 in f0ea46e

    
           public ReadFileParamsPointer = (readFileParamsPointer: Pointer): IReadFileParams => {

BlazorFileReader/src/Blazor.FileReader/script/FileReaderComponent.ts

Line 209 in f0ea46e

const dotNetBufferView = Blazor.platform.toUint8Array(readFileParams.buffer);

BlazorFileReader/src/Blazor.FileReader/ReadFileParams.cs

Line 6 in f0ea46e

[StructLayout(LayoutKind.Explicit)]

not sure if a regression as I havent tested much reading streams as strings, at least not that large.

Tewr · 2020-09-22T20:20:44Z

Ok so the problem is not really corruption. I have managed to reproduce this a bit quicker, by simply running the working version quickly after. Basically, after a while (325 calls, to be more exact) the shared memory mechanism stops working.

The correct buffer instance is sent over, I tagged it with some data to read it on the js side. But for some reason, the data that I write (edit: clarification: on the js side) is no longer being written to the buffer (no longer visible in the c# side). If I do a second call ("retry"), it works. But there is, of course, no way for me to know that the call has failed unless I use the slow, stable marshalled method to compare.

The repro is here: https://github.com/Tewr/BlazorFileReader/tree/JsonStringBugin50rc1
If you launch the Blazor.FileReader.Wasm.Demo project and supply the test file, the interesting output is in the console. It's a lot faster than waiting for the entire file to be encoded into UTF-8, only takes a couple of seconds.

This is the line that no longer "works":

BlazorFileReader/src/Blazor.FileReader/script/FileReaderComponent.ts

Line 218 in 71d5065

dotNetBufferView.set(new Uint8Array(arrayBuffer), readFileParams.bufferOffset);

I implemented this a slightly differently way in a version from last year. I used a normal call and then an explicit callback to allocate a buffer, I think it was heavily inspired by how HttpClient does it. It was a bit slower but I may have to go back to that one, I'm really clueless on why this happens.

BrzVlad · 2020-09-22T22:49:12Z

From what I understand, C# sends over a buffer to JS, but JS doesn't see the real data in the buffer that was sent over. This would suggest that the GC did its own thing with the buffer, either moved or collected it.

I don't have any experience with the JS interop layer. @kg Do you want to take a look at this and see if you spot anything fishy in the library ? Or maybe we are doing something wrong ? cc @lewing

kg · 2020-09-23T00:17:58Z

If the native heap grows all your existing typed arrays will become detached, because growing the native heap creates a new bigger array buffer to replace the old one. Any existing typed array instances (i.e. Uint8Array) will become dead. If you're keeping typed array instances alive across calls (like malloc) that can grow the native heap, your code is broken. That's likely the problem here.

This is unfortunately very hard to debug because the design of the typed arrays spec is wholly deficient in this area.

kg · 2020-09-23T00:33:05Z

I read through the library C# and typescript and didn't see any obvious red flags, but I don't completely understand all of what's going on in there, in part because I have very little experience with JS async programming or blazor. My guess is still that this is breaking due to the native heap growing, which explains why retrying will succeed.

You can see here that when the heap grows, emscripten needs to reallocate all the arrays and views it exposes: https://github.com/emscripten-core/emscripten/blob/9456db2b95771f0d85de4b78cc0beeeb4848dd6e/src/library.js#L628
And the logic for doing that is here: https://github.com/emscripten-core/emscripten/blob/e156f1a53fbf2756a70a7fc7a8d2e7ba251ef684/src/preamble_minimal.js#L118

Unfortunately emscripten is not designed to notify applications that the heap has grown, so there is no trivial way to ensure all of your buffers are current other than not allowing them to persist across calls that could enlarge the native heap. In practice this means it is unsafe to use typed arrays in most circumstances unless you allocate them fresh and then immediately discard them.

Tewr · 2020-09-23T06:20:52Z

Thanks a lot for the explanation, really helpful!
StreamReader uses an exceptionally low buffer size (1024b). This is probably the reason this has not come up before; the low buffer size puts a high pressure on the heap. Another solution that I'll look into might be to force a minimum buffersize. I have found that the best compromise between speed and memory usage lies around 82kb, but I'll use this case to benchmark a minimum value when the problem no longer occurs.

Tewr · 2020-09-25T05:50:43Z

Minimum buffersize, while working better, doesn't work all the time, so it's not a viable solution. Having read this sentence again

keeping typed array instances alive across calls (like malloc) that can grow the native heap, your code is broken.

I interpret this as following
My current code creates the typed array, launches the reader, and then waits for the callback on the reader to return, and write the data to the array. The async "pause" breaks my code.

So for correct usage, I should launch the reader, and then create the typed array when the async call returns.

Or find some other solution but I have to do all my processing while staying sync, or I risk loosing my array if the heap is under pressure.

@kg does this sound right?

kg · 2020-09-25T06:18:59Z

I'm not completely sure what the right solution here is, but if it helps, think of it this way: Any time control transitions from JavaScript to C or C# (wasm), any typed arrays alive at that point are potentially invalid. Once you transition back to JS, you're fine. The async "pause" itself isn't the issue, but almost certainly some C or C# is going to run between the start of the pause and the end unless you find a way to do all of that processing in JS.

The async nature of it isn't itself the problem, if you do this synchronously you will still encounter issues when calling into C/C# because those method calls can grow the heap and invalidate your typed arrays.

Tewr · 2020-09-25T16:25:23Z

Ok, during the async pause control goes back to c# which will execute await taskCompletionSource.Task.

What is behind that line is quite complex and that is probably what will trigger the problem. If I allocate the buffer after the pause, there will be no more c# executing before I'm done writing to the buffer so it should be safe. I'll give it a go!

Tewr · 2020-10-01T15:50:42Z

@erikthysell can you confirm the latest release resolves your problem?

v2.1 if you're using blazor 3.2 or v3.0 for blazor 5rc1

BrzVlad mentioned this issue Sep 21, 2020

System.Text.Json.JsonSerializer.Deserialize crashes when parsing large string using Blazor dotnet/runtime#41604

Closed

Tewr added the bug Something isn't working label Sep 21, 2020

Tewr self-assigned this Sep 21, 2020

Tewr added Compatibility Issue relating to breaking changes in Blazor possible-blazor-issue This possibly a problem in the blazor framework that needs a workaround or a bugfix in blazor itself labels Sep 22, 2020

Tewr removed Compatibility Issue relating to breaking changes in Blazor possible-blazor-issue This possibly a problem in the blazor framework that needs a workaround or a bugfix in blazor itself labels Sep 25, 2020

Tewr mentioned this issue Sep 29, 2020

Buffer reference lost #164

Merged

Tewr closed this as completed in #164 Sep 29, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Large file is is not read correctly when using `UseWasmSharedBuffer` #161

Large file is is not read correctly when using `UseWasmSharedBuffer` #161

BrzVlad commented Sep 21, 2020

Tewr commented Sep 21, 2020

Tewr commented Sep 21, 2020 •

edited

Loading

Tewr commented Sep 22, 2020 •

edited

Loading

BrzVlad commented Sep 22, 2020

Tewr commented Sep 22, 2020 •

edited

Loading

Tewr commented Sep 22, 2020 •

edited

Loading

BrzVlad commented Sep 22, 2020

kg commented Sep 23, 2020

kg commented Sep 23, 2020

Tewr commented Sep 23, 2020 •

edited

Loading

Tewr commented Sep 25, 2020 •

edited

Loading

kg commented Sep 25, 2020

Tewr commented Sep 25, 2020

Tewr commented Oct 1, 2020 •

edited

Loading

Large file is is not read correctly when using UseWasmSharedBuffer #161

Large file is is not read correctly when using UseWasmSharedBuffer #161

Comments

BrzVlad commented Sep 21, 2020

Tewr commented Sep 21, 2020

Tewr commented Sep 21, 2020 • edited Loading

Tewr commented Sep 22, 2020 • edited Loading

BrzVlad commented Sep 22, 2020

Tewr commented Sep 22, 2020 • edited Loading

Tewr commented Sep 22, 2020 • edited Loading

BrzVlad commented Sep 22, 2020

kg commented Sep 23, 2020

kg commented Sep 23, 2020

Tewr commented Sep 23, 2020 • edited Loading

Tewr commented Sep 25, 2020 • edited Loading

kg commented Sep 25, 2020

Tewr commented Sep 25, 2020

Tewr commented Oct 1, 2020 • edited Loading

Large file is is not read correctly when using `UseWasmSharedBuffer` #161

Large file is is not read correctly when using `UseWasmSharedBuffer` #161

Tewr commented Sep 21, 2020 •

edited

Loading

Tewr commented Sep 22, 2020 •

edited

Loading

Tewr commented Sep 22, 2020 •

edited

Loading

Tewr commented Sep 22, 2020 •

edited

Loading

Tewr commented Sep 23, 2020 •

edited

Loading

Tewr commented Sep 25, 2020 •

edited

Loading

Tewr commented Oct 1, 2020 •

edited

Loading