Non-throwing x3::expect #788

saki7 · 2024-08-11T16:13:55Z

This PR is a fresh implementation of the non-throwing x3::expect, which was originally abandoned in 2017.

supersedes and closes #242
cc: @djowel @wanghan02

Summary

Expectations like lit('a') > 'b' will be parsed 90 times faster 🚀 than the current implementation.
Non-throwing mode is only enabled when the macro is explicitly specified by user, so this PR has 100% backward compatibility.

Show benchmark C++ code

Stolen from #242 (comment), modified by saki7

cd test/x3
Save the code in a file
Add run your_file.cpp ; to Jamfile
../../../../b2 toolset=msvc cxxstd=17 cxxflags=/utf-8 release -j24

#define BOOST_SPIRIT_X3_THROW_EXPECTATION_FAILURE 0

#include <boost/spirit/home/x3.hpp>
#include <boost/core/lightweight_test.hpp>

#include <iostream>
#include <iomanip>
#include <chrono>
#include <type_traits>

namespace x3 = boost::spirit::x3;

constexpr int N = 100000;

template <typename Iterator, typename Parser>
bool do_parse(Iterator& first, Iterator const& last, Parser const& parser)
{
#if BOOST_SPIRIT_X3_THROW_EXPECTATION_FAILURE
    try
    {
        return x3::parse(first, last, parser);
    }
    catch (x3::expectation_failure<Iterator> const&)
    {
        // ...
    }
    return false;

#else
    x3::expectation_failure_optional<Iterator> failure;
    return x3::parse(
        first, last,
        x3::with<x3::expectation_failure_tag>(std::ref(failure))[
            parser
        ]);
#endif
}

template <typename Parser>
int benchmark_main(std::string const& input, Parser const& parser)
{
    int n = 0;
    auto const last = input.end();

    for (int i = 0; i < N; ++i)
    {
        auto first = std::next(input.begin(), i);

        if (do_parse(first, last, parser))
        {
            ++n;
        }
    }
    return n;
}


int main(int argc, char* argv[])
{
    // use argc to avoid whole benchmark getting optimized away
    std::string const input(N + argc - 1, 'a');

    auto t0 = std::chrono::high_resolution_clock::now();
    int n1 = benchmark_main(input, x3::lit('a') >> 'b');
    auto t1 = std::chrono::high_resolution_clock::now();
    int n2 = benchmark_main(input, x3::lit('a') > 'b');
    auto t2 = std::chrono::high_resolution_clock::now();

    using duration_type = std::chrono::duration<double, std::milli>;
    auto const dur0 = std::chrono::duration_cast<duration_type>(t1 - t0);
    auto const dur1 = std::chrono::duration_cast<duration_type>(t2 - t1);

    std::cerr << "Test with sequence:    " << std::fixed << std::setprecision(3) << dur0.count() <<"ms\n";
    std::cerr << "Test with expectation: " << std::fixed << std::setprecision(3) << dur1.count() <<"ms\n";
    std::cerr << std::flush;
    BOOST_TEST(n1 == n2);
    BOOST_TEST(false); // for b2 to output messages
    return boost::report_errors();
}

Features

#define BOOST_SPIRIT_X3_THROW_EXPECTATION_FAILURE
- 1: Default value. This is the traditional behavior, throws x3::expectation_failure when x3::expect[p] fails.
- 0: Non-throwing mode. The error will be stored into a boost/std::optional variable and can be retrieved by x3::get<x3::expectation_failure_tag>(context).

Usage

Documented as per doc/x3/tutorial/non_throwing_expectations.qbk.

Throwing mode (aka traditional behavior)

#include <boost/spirit/home/x3.hpp>

// ...

try {
    bool const ok = x3::parse(first, last, parser);
    if (!ok) {
        // error handling
    }

} catch (x3::expectation_failure<Iterator> const& failure) {
   // error handling
}

Non-throwing mode

#define BOOST_SPIRIT_X3_THROW_EXPECTATION_FAILURE 0
#include <boost/spirit/home/x3.hpp>

// ...

x3::expectation_failure_optional<Iterator> failure;
bool const ok = x3::parse(first, last, x3::with<x3::expectation_failure_tag>(failure)[parser]);
if (!ok) {
    if (failure.has_value()) {
        // error handling
    }
}

That's all. I believe that potentially all X3 users can experience a performance boost by just changing few lines of code in their application.

Implementation notes (for X3 developers)

The idea of passing around x3::expectation_failure by optional comes from the original PR back in 2017. Therefore I have added the original author's name as well as mine to the license attribution, great thanks to the original author.

That said, my new implementation is different than the reference, both in terms of semantics and code quality. My implementation is mimicking the internal behavior of throwing version, whereas the original PR was disposing expectation_failure context incorrectly in various locations. On my implementation, all iterators (i.e. e.where()) and source info (i.e. e.which()) should be identical to the values in throwing version.

Quoting from original PR (#242 (comment)):

If expect operator is used and fails in skipper parser, the main parser should not fail (The skipper parser is a totally different parser with a different context. We should use it only to skip. If the skipper parser fails, it should mean no more skipping and should return to the main parser). We use nothrow expect as the default way to call skipper parser.

This should not be the expected behavior.

Usage of x3::expect[p] in skippers is legit X3 code and it should be supported. In such cases the library user must provide a context which can store the x3::expectation_failure value so that X3 can store the error in it. So then I carefully investigated all possible combinations of parser primitives and skippers; I have included my discoveries in unit tests.

Tests

$ cd test/x3
$ ..\..\..\..\b2 toolset=msvc cxxstd=17 cxxflags=/utf-8 -j24

All tests are passing, however, you should expect Internal Compiler Errors as usual. 🤓
It's not the test case's issue so I can do nothing about it. Just do rebuild several times and you should be getting proper results.

My environment is

Windows 11, 10.0.22631
Visual Studio 2022, 17.10.5
Core i9-14900KF
Antivirus turned off
Decent amount of memory
Decent amount of disk space

I'd really appreciate it if you could test this PR on your environment.

I've added ~100 testcases to make sure that the throwing/non-throwing versions both yield same result when x3::expect is involved. For this purpose, I have redesigned the whole test suite test/x3/expect.ipp so that b2 automatically runs all expect-related tests for both versions.

Documentation

Documentation was initially marked as TODO but was later added to this PR.

Bonus

Originally the runtime failures in x3::expect[p] were leading to tremendous amount of error logs on debuggers attached. It was sadly making my output window useless since it literally washes away every useful information including non-X3 errors. This PR solves this issue too.

I have also confirmed that non-throwing mode performs 10% to 70% faster when debugger is attached.

saki7 · 2024-08-11T16:19:36Z

@djowel Could I ask you to review this PR?
FYI, tests are failing since I used C++17 features (which I presume is valid since the README says so). It would be very nice if you could fix CI.

djowel · 2024-08-11T22:25:45Z

@djowel Could I ask you to review this PR? FYI, tests are failing since I used C++17 features (which I presume is valid since the README says so). It would be very nice if you could fix CI.

Will do! Thanks so much for doing this! It seems to be very comprehensive and well thought!

djowel · 2024-08-11T22:40:32Z

While I'm used to writing comments in codes, I'm still experiencing a big language barrier on writing lengthy documentation. Although I'm thinking that I should be responsible for writing the docs, it would be very nice if a maintainer could support me 🙇🙇🙇

I think your English is very good for a non-native speaker. Please note that I too am not a native English speaker, and neither is the current maintainer. I'd say just go ahead, and I'll do the review and copy editing. This PR will not be complete with proper documentation, since this is a major change.

djowel · 2024-08-11T22:42:41Z

include/boost/spirit/home/x3/operator/kleene.hpp

@@ -32,7 +35,7 @@ namespace boost { namespace spirit { namespace x3
            while (detail::parse_into_container(
                this->subject, first, last, context, rcontext, attr))


Shouldn't parse_into_container short circuit the loop on !has_expectation_failure(context); ? Or is it already done in this PR somewhere?

That's a good question; it does short circuit. There are 3 patterns here:

subject parser succeeded (-> returned true)

subject parser failed normally (-> returned false)

subject parser failed with expectation failure (-> returned false)

So the while statement does break when expectation failure is raised. All we need to do in kleene is to distinguish the difference between case 2 and 3, then return false if it was the latter.

djowel · 2024-08-12T05:51:53Z

@djowel Could you do me a favor and bump GCC/Clang version on CI? I lost my old Linux installation since I just built a new PC and I also have other personal reasons that I can't spend much time on building Linux environment from scratch.

Which version?

include/boost/spirit/home/x3/support/expectation.hpp

include/boost/spirit/home/x3/directive/skip.hpp

Kojoley · 2024-08-12T06:27:21Z

include/boost/spirit/home/x3/core/skip_over.hpp

@@ -91,9 +151,9 @@ namespace boost { namespace spirit { namespace x3

    template <typename Iterator, typename Context>
    inline void skip_over(


I guess throwing expectation failure inside a skipper and catching it in a rule does work but I don't think there any use for it. That's probably an omission that it's not catched here. Is it really need to propagate expectation failures outside from skippers?

Consider the case like below:

https://github.com/boostorg/spirit/pull/788/files#diff-098440cf78f4367f618d21a85375e53988bc19c9d193a26876e4fb40d81ff5f1R448-R451

TEST_FAILURE("a..b.z", lit('a') >> 'b', lit('.') > '.', { BOOST_TEST(x.which() == "'.'"sv); BOOST_TEST(x.where() == "z"sv); });

Traditionally in throwing mode, this kind of grammar led to an expectation_failure being thrown. To get the same x.where() and x.which() values in non-throwing mode, we need to propagate the failures.

djowel · 2024-08-12T06:35:18Z

@djowel The latest GCC/Clang versions available in GitHub Actions would be nice.

I don't know what that is. I'll go with Clang 18 and GCC 13. One less the latest.

I presume that the actual solution would not require the most recent compiler, but I just want to do sanity checks against most modern compilers for now. (I mean, even if it happens that we would need to write nasty compiler detection macros, I need an information whether it works on the most decent compiler in the first place.)

BTW I just started writing docs.

Wonderful!

djowel · 2024-08-12T06:42:04Z

If there's an option to pull any version from external package repository, GCC 14 and Clang 18 is the best choice.

Let's try GCC 13 and Clang 18 first.

saki7 · 2024-08-12T08:36:54Z

https://github.com/boostorg/spirit/compare/d63d3a4fbce11e1fb8d8c91a8b874e983000d437..88d9987167799e009427b8ec48154809d8db37dc

CWG2518 problem and some semantic issue has been resolved in the recent diff, and I see the tests are passing, Since the source of error is gone, this PR should compile on older GCC/Clang now.

As far as I understand it, the only remaining to-dos regarding implementation is the ODR issue raised by Kojoley. I'm still not sure what exactly is the right code that I should write.

saki7 · 2024-08-12T14:24:40Z

Added documentation. I have configured my environment to build & preview quickbook locally, there are no syntax errors. (BTW, quickbook setup was so painful)

saki7 · 2024-08-13T11:21:34Z

@Kojoley @djowel I've fixed the ODR violation issue in the latest commit. What I've done is just wrapping throwing/non-throwing implementations in separate namespaces. I think inline namespace is the proper solution as described in Stack Overflow.

I've also added a test case (previously failing) for mixing throwing/non-throwing modes in a single executable.

Documentation is done in yesterday's commit.
I believe all issues has been resolved, waiting for final review.

djowel · 2024-08-15T22:18:02Z

It's looking good as far as I can tell. @Kojoley do you have any further thoughts and review?

djowel · 2024-08-15T22:21:10Z

Thank you for working on the docs too! It's hard to review the docs, out of context though, without generating the html, which I haven't done in a long time already.

djowel · 2024-08-15T22:25:12Z

Another quick thought, while reading your doc: should we [[deprecate...]] the throwing code with a message pointing to the docs?

saki7 · 2024-08-17T08:47:00Z

Thanks for your review!

Another quick thought, while reading your doc: should we [[deprecate...]] the throwing code with a message pointing to the docs?

That's great. I was thinking of how to encourage people to use this feature, and that's going to be a good way to effectively informing the existing users. I think we can handle it in a separate PR.

saki7 · 2024-08-17T13:58:51Z

It's hard to review the docs, out of context though, without generating the html, which I haven't done in a long time already.

Here's the rendered result of the documentation:

djowel · 2024-08-17T22:09:09Z

It's hard to review the docs, out of context though, without generating the html, which I haven't done in a long time already.

Here's the rendered result of the documentation:

[snip]

Wonderful!

djowel · 2024-08-17T22:12:40Z

Merged! Thank you so much for contributing this code! It is well thought and well implemented.

saki7 · 2024-08-18T09:52:12Z

Thanks!

djowel self-assigned this Aug 11, 2024

djowel reviewed Aug 11, 2024

View reviewed changes

saki7 force-pushed the nothrow-expect branch 2 times, most recently from 0286301 to 83ff3ef Compare August 12, 2024 04:17

This comment was marked as resolved.

Sign in to view

Kojoley reviewed Aug 12, 2024

View reviewed changes

saki7 force-pushed the nothrow-expect branch from 83ff3ef to d63d3a4 Compare August 12, 2024 07:08

This comment was marked as resolved.

Sign in to view

Implement non-throwing version of x3::expect

88d9987

saki7 force-pushed the nothrow-expect branch from d63d3a4 to 88d9987 Compare August 12, 2024 08:19

Add documentation for non-throwing expectations

cf643c1

Fix ODR violation, add tests

d9b53fe

djowel merged commit 52c8d66 into boostorg:develop Aug 17, 2024
12 checks passed

saki7 deleted the nothrow-expect branch August 18, 2024 09:52

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Non-throwing x3::expect #788

Non-throwing x3::expect #788

saki7 commented Aug 11, 2024 •

edited

Loading

saki7 commented Aug 11, 2024

djowel commented Aug 11, 2024

djowel commented Aug 11, 2024

djowel Aug 11, 2024

saki7 Aug 12, 2024 •

edited

Loading

This comment was marked as resolved.

djowel commented Aug 12, 2024

This comment was marked as resolved.

This comment was marked as resolved.

Kojoley Aug 12, 2024

saki7 Aug 12, 2024 •

edited

Loading

djowel commented Aug 12, 2024

djowel commented Aug 12, 2024

This comment was marked as resolved.

saki7 commented Aug 12, 2024

saki7 commented Aug 12, 2024

saki7 commented Aug 13, 2024

djowel commented Aug 15, 2024

djowel commented Aug 15, 2024

djowel commented Aug 15, 2024

saki7 commented Aug 17, 2024

saki7 commented Aug 17, 2024

djowel commented Aug 17, 2024

djowel commented Aug 17, 2024

saki7 commented Aug 18, 2024

		@@ -32,7 +35,7 @@ namespace boost { namespace spirit { namespace x3
		while (detail::parse_into_container(
		this->subject, first, last, context, rcontext, attr))

		@@ -91,9 +151,9 @@ namespace boost { namespace spirit { namespace x3

		template <typename Iterator, typename Context>
		inline void skip_over(

Non-throwing x3::expect #788

Non-throwing x3::expect #788

Conversation

saki7 commented Aug 11, 2024 • edited Loading

Summary

Features

Usage

Throwing mode (aka traditional behavior)

Non-throwing mode

Implementation notes (for X3 developers)

Tests

Documentation

Bonus

saki7 commented Aug 11, 2024

djowel commented Aug 11, 2024

djowel commented Aug 11, 2024

djowel Aug 11, 2024

Choose a reason for hiding this comment

saki7 Aug 12, 2024 • edited Loading

Choose a reason for hiding this comment

This comment was marked as resolved.

djowel commented Aug 12, 2024

This comment was marked as resolved.

This comment was marked as resolved.

Kojoley Aug 12, 2024

Choose a reason for hiding this comment

saki7 Aug 12, 2024 • edited Loading

Choose a reason for hiding this comment

djowel commented Aug 12, 2024

djowel commented Aug 12, 2024

This comment was marked as resolved.

saki7 commented Aug 12, 2024

saki7 commented Aug 12, 2024

saki7 commented Aug 13, 2024

djowel commented Aug 15, 2024

djowel commented Aug 15, 2024

djowel commented Aug 15, 2024

saki7 commented Aug 17, 2024

saki7 commented Aug 17, 2024

djowel commented Aug 17, 2024

djowel commented Aug 17, 2024

saki7 commented Aug 18, 2024

saki7 commented Aug 11, 2024 •

edited

Loading

saki7 Aug 12, 2024 •

edited

Loading

saki7 Aug 12, 2024 •

edited

Loading