[Parser] Templatize lexing of integers #6272

tlively · 2024-02-02T01:27:16Z

Have a single implementation for lexing each of unsigned, signed, and
uninterpreted integers, each generic over the bit width of the integer. This
reduces duplication in the existing code and it will make it much easier to
support lexing more 8- and 16-bit integers.

Have a single implementation for lexing each of unsigned, signed, and uninterpreted integers, each generic over the bit width of the integer. This reduces duplication in the existing code and it will make it much easier to support lexing more 8- and 16-bit integers.

tlively · 2024-02-02T01:27:30Z

Current dependencies on/for this PR:

main
- PR [Parser] Templatize lexing of integers #6272 👈

This stack of pull requests is managed by Graphite.

kripken · 2024-02-02T02:07:37Z

src/parser/input-impl.h

@@ -100,7 +100,7 @@ inline std::optional<uint64_t> ParseInput::takeOffset() {
      if (subLexer == subLexer.end()) {
        return {};
      }
-      if (auto o = subLexer->getU64()) {
+      if (auto o = subLexer->getU<uint64_t>()) {


getU<uint64_t> mentions "unsigned" twice. Can this be get<uint64_t>? (maybe using https://en.cppreference.com/w/cpp/types/is_unsigned)

getU and getI both return unsigned integers, so I think it's good to differentiate them in the method name and not favor one over the other (and all the other getXXX methods) by giving it the shorter get name.

One alternative I considered was to make the template parameter just 32 or 64, etc. but that would require more complex machinery in the implementation and the nicer API didn't seem worth the extra complexity. If you're interested, I can add a commit with that design so we can see the difference, though.

Ah, maybe I've forgotten what those functions mean. I don't see comments on

binaryen/src/parser/lexer.h

Lines 128 to 135 in 845e070

std::optional<uint64_t> getU64() const;

std::optional<int64_t> getS64() const;

std::optional<uint64_t> getI64() const;

std::optional<uint32_t> getU32() const;

std::optional<int32_t> getS32() const;

std::optional<uint32_t> getI32() const;

std::optional<double> getF64() const;

std::optional<float> getF32() const;

Perhaps it's worth adding some? Reading the code, the meaning seems to be "U is unsigned, S is signed, I accepts both as inputs but returns unsigned" - ?

Yes, that's right. uN, sN, and iN come directly from the grammar in the spec, but I can add comments here as well.

Right, sorry, I keep reading this code as normal C++ and I forget it mirrors the spec...

Have a single implementation for lexing each of unsigned, signed, and uninterpreted integers, each generic over the bit width of the integer. This reduces duplication in the existing code and it will make it much easier to support lexing more 8- and 16-bit integers.

tlively requested a review from kripken February 2, 2024 01:27

kripken reviewed Feb 2, 2024

View reviewed changes

kripken approved these changes Feb 5, 2024

View reviewed changes

tlively merged commit ed15efe into main Feb 5, 2024
15 checks passed

tlively deleted the parser-template-ints branch February 5, 2024 18:24

gkdn mentioned this pull request Aug 31, 2024

stringconsts gkdn/binaryen#1

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Parser] Templatize lexing of integers #6272

[Parser] Templatize lexing of integers #6272

tlively commented Feb 2, 2024

tlively commented Feb 2, 2024

kripken Feb 2, 2024

tlively Feb 2, 2024

kripken Feb 5, 2024

tlively Feb 5, 2024

kripken Feb 5, 2024

	std::optional<uint64_t> getU64() const;
	std::optional<int64_t> getS64() const;
	std::optional<uint64_t> getI64() const;
	std::optional<uint32_t> getU32() const;
	std::optional<int32_t> getS32() const;
	std::optional<uint32_t> getI32() const;
	std::optional<double> getF64() const;
	std::optional<float> getF32() const;

[Parser] Templatize lexing of integers #6272

[Parser] Templatize lexing of integers #6272

Conversation

tlively commented Feb 2, 2024

tlively commented Feb 2, 2024

kripken Feb 2, 2024

Choose a reason for hiding this comment

tlively Feb 2, 2024

Choose a reason for hiding this comment

kripken Feb 5, 2024

Choose a reason for hiding this comment

tlively Feb 5, 2024

Choose a reason for hiding this comment

kripken Feb 5, 2024

Choose a reason for hiding this comment