Unify parsing part of BigInteger with CoreLib #85978

huoyaoyuan · 2023-05-09T15:04:32Z

Part of #28657. This is my third attempt of working on this. I'd like to keep formatting and further cleanup to follow-up PRs to help reviewing.

This PR does not refactor the algorithm part. It just adapts corelib-style patterns for BigInteger. Reviewing commit by commit is recommended.

ghost · 2023-05-09T15:04:50Z

Tagging subscribers to this area: @dotnet/area-system-numerics
See info in area-owners.md if you want to be subscribed.

Issue Details

Part of #28657. This is my third attempt of working on this. I'd like to keep formatting and further cleanup to follow-up PRs to help reviewing.

This PR does not refactor the algorithm part. It just adapts corelib-style patterns for BigInteger. Reviewing commit by commit is recommended.

Author:	huoyaoyuan
Assignees:	-
Labels:	`area-System.Numerics`
Milestone:	-

huoyaoyuan · 2023-05-09T15:07:25Z

Also asking a question about formatting code here: the following pattern is heavily used in formatting code to share between UTF8 and UTF16:

private static void FormatNumber<TChar>(ref ValueListBuilder<TChar> vlb, ref NumberBuffer number, int nMaxDigits, NumberFormatInfo info) where TChar : unmanaged, IUtfChar<TChar>

What's the best approach to share those code out of CoreLib? Using #ifdef?

huoyaoyuan · 2023-05-10T09:32:39Z

It's better to review and merge #84792 first.

danmoseley · 2023-07-05T04:39:05Z

Merge conflicts - and then this is reviewable? Looks like the other one went in.

huoyaoyuan · 2023-07-05T05:32:53Z

There's also potential massive conflict with #85392/#86875, and minor dependency with other numeric PRs. Can someone determine an order to review these?

huoyaoyuan · 2023-07-19T19:13:20Z

@tannergooding wanna to discuss how to deal with this.

#86875 uses IUtf8Char in parsing, brings the same problem from formatting. IUtf8Char can't be used in System.Runtime.Numerics thus the majority of code need to update.

The approach I tried looks like this:

#if !SYSTEM_PRIVATE_CORELIB
using TChar = System.Char;
#pragma warning disable SA1121 // Use built-in type alias
#endif

#if SYSTEM_PRIVATE_CORELIB
        internal static unsafe TChar* UInt32ToDecChars<TChar>(TChar* bufferEnd, uint value, int digits) where TChar : unmanaged, IUtfChar<TChar>
#else
        internal static unsafe char* UInt32ToDecChars(char* bufferEnd, uint value, int digits)
#endif


#if SYSTEM_PRIVATE_CORELIB
        private static ReadOnlySpan<TChar> NegativeSign<TChar>(NumberFormatInfo info)
            where TChar : unmanaged, IUtfChar<TChar>
            => info.NegativeSignTChar<TChar>();
#else
        private static ReadOnlySpan<char> NegativeSign<TChar>(NumberFormatInfo info) => info.NegativeSign;
#endif

The workaround just works, but isn't expandable if we want to support UTF8 for BigInteger. What do you think about the best approach? Should I ask Stephen or someone else?

adamsitnik

Please excuse me for our Team not providing any review for so long. We have been hit by a wonderful talent redeployment quite hard, and this caused the delay.

Big thanks for removing the code duplication!

I've added some comments and I made it clear which can be ignored (or addressed in separate PR).

Overall the PR looks good, it's very thoughtful. However there are merge conflict so I am going to hit "request changes".

@huoyaoyuan thank you for your contribution!

src/libraries/System.Private.CoreLib/src/System.Private.CoreLib.Shared.projitems

adamsitnik · 2023-10-30T12:34:12Z

src/libraries/Common/src/System/Number.Parsing.Common.cs

+                        int exp = 0;
+                        do
+                        {
+                            // Check if we are about to overflow past our limit of 9 digits


This refactoring change introduces a behavior change: https://www.diffchecker.com/ycRc6AAy/

I used git blame to verify that this is most likely desired, as it was introduced in #73643 as a bug fix with no breaking change label. More than a year has passed, so I assume it's safe.

cc @tannergooding

src/libraries/Common/src/System/Number.Parsing.Common.cs

adamsitnik · 2023-10-30T12:59:28Z