[WIP] Moving the UTF8 Parser/Formatter code into System.Private.CoreLib #20873

tannergooding · 2018-11-08T05:37:33Z

Today, the UTF8 Parser/Formatter code lives in System.Memory. However, this presents a potential problem with sharing bits of code (namely the Decimal/Double/Single formatters/parsers).

I talked with @GrabYourPitchforks a few times about this and suggested we move the code into System.Private.CoreLib. This allows us to readily share the more complicated code and keep the code in sync between the two.

We now have a single NumberBuffer instance that carries a Span<byte>.
- The Span<char> buffer previously used by the UTF16 parser never contained anything other than the digits 0-9 and null anyway.
I tried to make the minimal number of changes, apart from sharing the NumberToDecimal, NumberToDouble, and NumberToSingle code where possible.
- There are several other places (some tracked by comments) where we could share more code between the UTF8/UTF16 parser and formatter.
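
A minimal sketch of why a byte-based digit buffer can serve both encodings, assuming the buffer only ever holds ASCII '0'-'9' plus a null terminator as described above. The helper names (`FillDigits`, `DigitsAsUtf16`) are hypothetical and are not the CoreLib `NumberBuffer` API.

```csharp
using System;

// Hypothetical helper: writes the decimal digits of `value` into `digits`
// as ASCII bytes followed by a null terminator, and returns the digit count.
static int FillDigits(uint value, Span<byte> digits)
{
    Span<byte> scratch = stackalloc byte[10]; // uint.MaxValue has 10 digits
    int count = 0;
    do
    {
        scratch[count++] = (byte)('0' + (value % 10)); // least significant digit first
        value /= 10;
    } while (value != 0);

    // Reverse into the destination so the most significant digit comes first.
    for (int i = 0; i < count; i++)
    {
        digits[i] = scratch[count - 1 - i];
    }
    digits[count] = 0;
    return count;
}

// Hypothetical helper: a UTF-8 consumer can use the bytes directly, while a
// UTF-16 consumer only needs to widen each byte to a char.
static string DigitsAsUtf16(ReadOnlySpan<byte> digits)
{
    Span<char> chars = stackalloc char[digits.Length];
    for (int i = 0; i < digits.Length; i++)
    {
        chars[i] = (char)digits[i];
    }
    return new string(chars);
}

Span<byte> buffer = stackalloc byte[11]; // 10 digits + null terminator
int length = FillDigits(12345, buffer);
Console.WriteLine(DigitsAsUtf16(buffer.Slice(0, length))); // prints 12345
```

Because every stored value is below 0x80, the widening step is lossless, which is the property the shared buffer relies on.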


tannergooding · 2018-11-08T05:38:38Z

CC. @danmosemsft, @GrabYourPitchforks, @jkotas, @stephentoub

I'd appreciate some input on this, and on whether it is a direction we would like to pursue. If not, it might be good if we could come up with an alternative for sharing the code (where possible).

jkotas · 2018-11-08T05:58:32Z

Sounds good to me.


tannergooding · 2018-11-08T16:44:33Z

Hmmm. Utf8Formatter.TryFormat(bool, Span<byte>, out int, StandardFormat) is currently failing in crossgen (in EvalFuncForConstantArgs) when dealing with FalsValueLowercase.

I am going to speculate that this is for the ReverseEndianness call in WriteUInt32BigEndian, but I am still trying to confirm. CC. @GrabYourPitchforks
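
For context, a sketch of the pattern in question, assuming the formatter packs the four ASCII bytes of "fals" into a `uint` constant and writes them with `BinaryPrimitives.WriteUInt32BigEndian`, which byte-swaps via `ReverseEndianness` on little-endian hardware. The `TryFormatFalse` helper is hypothetical, and this is not a copy of the System.Memory source.

```csharp
using System;
using System.Buffers.Binary;
using System.Text;

// Hypothetical helper: writes the ASCII bytes of "false" into `destination`.
static bool TryFormatFalse(Span<byte> destination, out int bytesWritten)
{
    // Assumed shape of the constant: 'f', 'a', 'l', 's' packed most significant byte first.
    const uint FalsValueLowercase = ('f' << 24) + ('a' << 16) + ('l' << 8) + ('s' << 0);

    if (destination.Length < 5)
    {
        bytesWritten = 0;
        return false;
    }

    // Big-endian write keeps the bytes in reading order: f, a, l, s.
    BinaryPrimitives.WriteUInt32BigEndian(destination, FalsValueLowercase);
    destination[4] = (byte)'e';
    bytesWritten = 5;
    return true;
}

Span<byte> output = stackalloc byte[8];
if (TryFormatFalse(output, out int written))
{
    Console.WriteLine(Encoding.ASCII.GetString(output.Slice(0, written))); // prints false
}
```

Since the argument is a compile-time constant, the byte swap is something the JIT can try to fold, which fits the constant-evaluation path named in the assert.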

stephentoub · 2018-11-08T16:58:34Z

@tannergooding, does this help/hurt the throughput of operations like uint.TryParse and uint.TryFormat?

tannergooding · 2018-11-08T17:02:30Z

> does this help/hurt the throughput of operations like uint.TryParse and uint.TryFormat?

Still need to check, but I wouldn't expect it to. We were already only dealing with ASCII characters (once we got it into a NumberBuffer) and most of the parsing code was already casting back and forth from int to char (for the various arithmetic operations).
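
To illustrate the point about ASCII digits, a hedged sketch: the per-digit arithmetic is the same whether the digits arrive as `byte` or `char`, because both are widened to `int` before the subtraction. `AccumulateDigits` is a hypothetical helper, not the CoreLib parsing routine, and it does not handle signs or whitespace.

```csharp
using System;
using System.Text;

// Hypothetical helper: accumulates ASCII digit bytes into a uint,
// rejecting empty input, non-digits, and overflow.
static bool AccumulateDigits(ReadOnlySpan<byte> digits, out uint result)
{
    result = 0;
    if (digits.IsEmpty)
    {
        return false;
    }

    ulong value = 0;
    foreach (byte b in digits)
    {
        int digit = b - '0'; // byte and char both widen to int here
        if (digit < 0 || digit > 9)
        {
            return false;
        }

        value = value * 10 + (uint)digit;
        if (value > uint.MaxValue)
        {
            return false; // would not fit in a uint
        }
    }

    result = (uint)value;
    return true;
}

bool ok = AccumulateDigits(Encoding.ASCII.GetBytes("42"), out uint parsed);
Console.WriteLine(ok ? parsed.ToString() : "invalid");
```

This only shows that the arithmetic is unchanged; it says nothing about measured throughput, which still needs a benchmark.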

stephentoub · 2018-11-08T17:04:20Z

src/System.Private.CoreLib/shared/System/Buffers/StandardFormat.cs

@@ -0,0 +1,174 @@
+// Licensed to the .NET Foundation under one or more agreements.


Purely a logistical comment, but should we instead do this by moving the code in corefx to shared, letting it mirror over, and then updating the rest of coreclr with the necessary changes? I think that would make the history easier to follow.

tannergooding · 2018-11-08

That sounds like a good idea.

tannergooding · 2018-11-08T17:41:20Z

Closing this and breaking it into multiple PRs:

First will be on CoreCLR and will move NumberBuffer to carry Span<byte>.
Second will be on CoreFX, moving the Utf8Parser and Utf8Formatter to shared.
Third will be on CoreCLR, pulling the Utf8Parser and Utf8Formatter into S.P.Corelib.
- This includes minimally fixing them to share the newer floating-point parsing code.
Final will be on CoreFX, removing the Utf8Parser and Utf8Formatter from System.Memory and type-forwarding them to Corelib.
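
The final step relies on type forwarding. A sketch of what the System.Memory side might contain, assuming the types keep their `System.Buffers.Text` names after the move; the actual CoreFX change may produce these entries differently.

```csharp
using System.Runtime.CompilerServices;

// Assumed facade content once the implementation lives in CoreLib:
// callers compiled against System.Memory are redirected to the new location.
[assembly: TypeForwardedTo(typeof(System.Buffers.Text.Utf8Formatter))]
[assembly: TypeForwardedTo(typeof(System.Buffers.Text.Utf8Parser))]
```

Forwarding keeps existing binaries working without recompilation, since the runtime resolves the old assembly-qualified type names to the forwarded types.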

GrabYourPitchforks · 2018-11-08T18:23:26Z

I'm also looking into the JIT assert.

AndyAyersMS · 2018-11-08T18:41:19Z

We generally won't prejit methods that rely on HW intrinsics.

tannergooding added 5 commits November 7, 2018 21:11

Moving the Utf8Formatter and Utf8Parser code into System.Private.Corelib

9e9c257

Changing Number.NumberBuffer to carry a Span<byte> rather than a Span<char>

a76f26b

Renaming NumberBuffer.Sign to NumberBuffer.IsNegative

a9348ba

Modifying the Utf8 Formatter/Parser to use System.Number.NumberBuffer

5ce5139

Fixing TryParseNumber to return -0 for "-0"

7a908f8

Fixing DecimalToNumber to adjust the pointer appropriately

de17e0c

jkotas reviewed Nov 8, 2018

src/System.Private.CoreLib/src/System/ThrowHelper.cs (outdated)

jkotas reviewed Nov 8, 2018

src/System.Private.CoreLib/src/System/ThrowHelper.cs (outdated)

tannergooding added 3 commits November 8, 2018 07:19

Fixing the Utf8Formatter/Utf8Parser to correctly allocate the digits buffer

d816cea

Updating the Utf8Parser to properly have different paths for Single vs Double

ae9adf5

Moving a couple of Utf8Parser/Formatter specific throw helpers

005f26e

stephentoub reviewed Nov 8, 2018


tannergooding closed this Nov 8, 2018

tannergooding mentioned this pull request Nov 8, 2018

Changing Number.NumberBuffer to carry a Span<byte> rather than a Span<char> #20879

Merged

GrabYourPitchforks mentioned this pull request Nov 8, 2018

Enlighten ValueNumStore::EvalOpSpecialized about bswap nodes #20883

Merged

tannergooding mentioned this pull request Nov 10, 2018

Moving the Utf8Formatter and Utf8Parser into S.P.Corelib #20934

Merged
