This is an experimental library that has evolved to P1371, being proposed for C++23.
MPark.Patterns is an experimental pattern matching library for C++17.
It determines whether a given value matches a pattern and, if it does, binds the desired portions of the value to a handler.
Pattern matching has been introduced to many programming languages outside of the functional world, and this library draws inspiration from languages such as Haskell, OCaml, Rust, Scala, and Swift.
#include <cstdio>
#include <mpark/patterns.hpp>
void fizzbuzz() {
using namespace mpark::patterns;
for (int i = 1; i <= 100; ++i) {
match(i % 3, i % 5)(
pattern(0, 0) = [] { std::printf("fizzbuzz\n"); },
pattern(0, _) = [] { std::printf("fizz\n"); },
pattern(_, 0) = [] { std::printf("buzz\n"); },
pattern(_, _) = [i] { std::printf("%d\n", i); });
}
}
int main() {
fizzbuzz();
}
using namespace mpark::patterns;
IDENTIFIERS(<identifier>...); // optional
match(<expr>...)(
pattern(<pattern>...) = [](<binding>...) { /* ... */ },
pattern(<pattern>...) = [](<binding>...) { /* ... */ },
// ...
);
An expression pattern matches if the value compares equal to the given expression.
Given an expression e
and a value x
, e == x
must be a valid expression.
<expr>
int factorial(int n) {
using namespace mpark::patterns;
return match(n)(pattern(0) = [] { return 1; },
// ^ expression
pattern(_) = [n] { return n * factorial(n - 1); });
}
An arg pattern matches any value, and binds the value by passing it to the corresponding handler. This pattern can be repeated in a single pattern, and they match independent of each other.
None.
arg
arg(<pattern>)
int factorial(int n) {
using namespace mpark::patterns;
return match(n)(pattern(0) = [] { return 1; },
pattern(arg) = [](auto n) { return n * factorial(n - 1); });
// ^^^ arg
}
using P = std::pair<int, int>;
P p = {101, 202};
using namespace mpark::patterns;
match(p)(pattern(arg(ds(101, _))) = [](P &p) { std::cout << "pair!\n"; },
// ^^^ pass the pair!
pattern(_) = [] { std::cout << "not a pair\n"; });
// prints: "pair!"
A wildcard pattern matches any value and discards the value by not passing it to the corresponding handler. This pattern can be repeated within a single pattern, and they match independent of each other.
None.
_
(underscore)
int factorial(int n) {
using namespace mpark::patterns;
return match(n)(pattern(0) = [] { return 1; },
pattern(_) = [n] { return n * factorial(n - 1); });
// ^ wildcard
}
An identifier pattern is similar to arg
and _
in that it matches any value.
Two major differences are:
- Ability to operate on names rather than just
arg
or_
. - When an identifier is repeated within a single pattern, the values bound to the repeated identifiers must compare equal for the pattern to match.
There are two types of identifier patterns: binding and discarding.
If no identifiers are repeated, none.
Otherwise, let x
be a repeated identifer and v, vs...
be the group of values
matched by the repeated instances of x
. Then (... && (v == vs))
must be a
valid expression. The same requirement applies to any other repeated identifiers
in a single pattern.
IDENTIFIERS(<identifier>...);
<identifier>
<identifier>(<pattern>)
If <identifier>
has a leading _
(underscore), it is a discarding
identifier. Otherwise, it is a binding identifier.
void is_same(int lhs, int rhs) {
using namespace mpark::patterns;
IDENTIFIERS(x, y);
match(lhs, rhs)(
pattern(x, x) = [](auto) { std::cout << "same\n"; },
// ^ ^ binding identifier (repeated)
pattern(x, y) = [](auto, auto) { std::cout << "diff\n"; }
// ^ ^ binding identifier
);
}
is_same(101, 101); // prints: "same"
is_same(101, 202); // prints: "diff"
void is_same(int lhs, rhs) {
using namespace mpark::patterns;
IDENTIFIERS(_x, _y);
match(42, 42)(
pattern(_x, _x) = [] { std::cout << "same\n"; },
// ^ ^ discarding identifier (repeated)
pattern(_x, _y) = [] { std::cout << "diff\n"; }
// ^ ^ discarding identifier
);
}
is_same(101, 101); // prints: "same"
is_same(101, 202); // prints: "diff"
std::tuple<int, int> t = {101, 202};
std::optional<int> o = 101;
using namespace mpark::patterns;
IDENTIFIERS(x, y);
match(t, o)(
pattern(ds(x, x), some(x)) = [](auto x) {
std::cout << "They're all " << x << "!\n";
},
pattern(ds(x, y), some(x)) = [](auto x, auto y) {
std::cout << "I recognize x-y-x!\n";
},
pattern(_) = [] { std::cout << "I don't know.\n"; });
// prints: "I recognize x-y-x!"
A destructure pattern matches values that hold multiple values.
Given a value x
of type T
, T
must satisfy one of the following conditions:
std::is_array_v<T> == true
, orstd::is_class_v<T> == true
andstd::tuple_size<T>
is a complete type, orstd::is_aggregate_v<T> == true
If (2), the following conditions must also be satisfied.
std::tuple_size<T>::value
must be a well-formed integer constant expression, andx.get<I>()
orget<I>(x)
must be a valid expression for allI
in[0, std::tuple_size<T>::value)
For aggregate types (3), the current implementation only supports types that contain upto 16 non-static data members.
NOTE: These requirements are very similar to the requirements for C++17 Structured Binding.
ds(<pattern>...)
auto t = std::make_tuple(101, "hello", 1.1);
// C++17 Structured Binding:
const auto & [x, y, z] = t;
// ...
// MPark.Patterns:
using namespace mpark::patterns;
IDENTIFIERS(x, y, z);
match(t)(
pattern(ds(x, y, z)) = [](auto &&x, auto &&y, auto &&z) { /* ... */ });
// ^^^^^^^^^^^ destructure
NOTE: The top-level is wrapped by a tuple
, allowing us to write:
void fizzbuzz() {
for (int i = 1; i <= 100; ++i) {
using namespace mpark::patterns;
match(i % 3, i % 5)(
pattern(0, 0) = [] { std::cout << "fizzbuzz\n"; },
pattern(0, _) = [] { std::cout << "fizz\n"; },
pattern(_, 0) = [] { std::cout << "buzz\n"; },
pattern(_, _) = [i] { std::cout << i << '\n'; });
}
}
An optional pattern matches values that can be dereferenced,
and tested as a bool
.
Given a value x
of type T
, T
must satisfy all of the following conditions:
*x
is a valid expression, andx
is contextually convertible tobool
some(<pattern>)
none
int *p = nullptr;
using namespace mpark::patterns;
match(p)(pattern(some(_)) = [] { std::cout << "some\n"; },
pattern(none) = [] { std::cout << "none\n"; });
// prints: "none"
std::optional<int> o = 42;
using namespace mpark::patterns;
match(o)(
pattern(some(arg)) = [](auto x) { std::cout << "some(" << x << ")\n"; },
pattern(none) = [] { std::cout << "none\n"; });
// prints: "some(42)"
An as pattern matches values that is capable of holding a value of multiple types.
Given a value x
of type T
, and a pattern as<U>(<pattern>)
,
T
must satisfy one of the following conditions:
std::is_polymorphic_v<T> == true
, orstd::is_class_v<T> == true
andstd::variant_size<T>
is a complete type, orx.any_cast<U>()
orany_cast<U>(&x)
is a valid expression
- If (1),
dynamic_cast<U'>(&x)
must be a valid expression, whereU'
is pointer toU
with the same cv-qualifiers asdecltype(x)
- If (2),
x.get_if<U>()
orget_if<U>(&x)
must be a valid expression
as<U>(<pattern>)
struct Shape { virtual ~Shape() = default; };
struct Circle : Shape {};
struct Square : Shape {};
std::unique_ptr<Shape> shape = std::make_unique<Square>();
using namespace mpark::patterns;
match(shape)(pattern(some(as<Circle>(_))) = [] { std::cout << "Circle\n"; },
pattern(some(as<Square>(_))) = [] { std::cout << "Square\n"; });
// prints: "Square"
using str = std::string;
std::variant<int, str> v = 42;
using namespace mpark::patterns;
match(v)(pattern(as<int>(_)) = [] { std::cout << "int\n"; },
pattern(as<str>(_)) = [] { std::cout << "str\n"; });
// prints: "int"
using str = std::string;
std::any a = str("hello");
using namespace mpark::patterns;
match(a)(pattern(as<int>(_)) = [] { std::cout << "int\n"; },
pattern(as<str>(_)) = [] { std::cout << "str\n"; });
// prints: "str"
A visit pattern matches values that is capable of holding a value of closed set of types, and can be visited.
Given a value x
, The following code must be valid.
using std::visit;
visit([](auto &&) {}, x);`
vis(<pattern>)
using str = std::string;
std::variant<int, str> v = "hello world!";
struct Visitor {
void operator()(int n) const { std::cout << "int: " << n << '\n'; }
void operator()(const str &s) const { std::cout << "str: " << s << '\n'; }
};
using namespace mpark::patterns;
match(v)(pattern(vis(arg)) = Visitor{});
// prints: "str: hello world!".
An alternation pattern matches values that match any of the given patterns.
None.
anyof(<pattern>...)
std::string s = "large";
using namespace mpark::patterns;
match(s)(
pattern(anyof("big", "large", "huge")) = [] { std::cout << "big!\n"; },
pattern(anyof("little", "small", "tiny")) = [] { std::cout << "small.\n"; },
pattern(_) = [] { std::cout << "unknown size.\n"; });
A variadic pattern matches 0 or more values that match a given pattern.
- A variadic pattern can only appear within a destructure pattern.
- A variadic pattern can only appear once.
variadic(<pattern>)
auto x = std::make_tuple(101, "hello", 1.1);
using namespace mpark::patterns;
match(x)(
pattern(ds(variadic(arg))) = [](const auto &... xs) {
int dummy[] = { (std::cout << xs << ' ', 0)... };
(void)dummy;
});
// prints: "101 hello 1.1 "
This could also be used to implement C++17 std::apply
:
template <typename F, typename Tuple>
decltype(auto) apply(F &&f, Tuple &&t) {
using namespace mpark::patterns;
return match(std::forward<T>(t))(
pattern(ds(variadic(arg))) = std::forward<F>(f));
}
and even C++17 std::visit
:
template <typename F, typename... Vs>
decltype(auto) visit(F &&f, Vs &&... vs) {
using namespace mpark::patterns;
return match(std::forward<Vs>(vs)...)(
pattern(variadic(vis(arg))) = std::forward<F>(f));
}
We can even get a little fancier:
int x = 42;
auto y = std::make_tuple(101, "hello", 1.1);
using namespace mpark::patterns;
match(x, y)(
pattern(arg, ds(variadic(arg))) = [](const auto &... xs) {
int dummy[] = { (std::cout << xs << ' ', 0)... };
(void)dummy;
});
// prints: "42 101 hello 1.1 "
While pattern matching is very useful to match values against patterns, it mainly focuses on equality comparisons. Pattern guards are used to test whether the bound values satisfy some predicate.
A WHEN
clause appears inside the handler as the only expression.
A WHEN
clause differs from an if
statement in that if the conditional
expression evaluates to false
, the WHEN
clause falls through to the
next case, whereas the if
statement would stay in the handler.
The WHEN
clause may only appear once inside the handler as the only
expression, and the conditional expression must be contextually
convertible to bool
.
WHEN(<condition>) { /* ... */ };
using namespace mpark::patterns;
IDENTIFIERS(x, y);
match(101, 202)(
pattern(x, x) = [](int) { std::cout << "EQ\n"; },
pattern(x, y) = [](int x, int y) { WHEN(x > y) { std::cout << "GT\n"; }; },
pattern(x, y) = [](int x, int y) { WHEN(x < y) { std::cout << "LT\n"; }; });
// prints: "LT"
This construct is simply syntactic sugar for the following situation:
std::optional<int> o = 42;
using namespace mpark::patterns;
match(o)(
pattern(some(arg)) = [](auto x) { ... },
pattern(_) = [] {});
For this situation, if_let
provides a cleaner interface:
std::optional<int> o = 42;
using namespace mpark::patterns;
if_let (pattern(some(arg)) = o) = [](auto x) {
// ...
};
Suppose we want to match each element in a range-based for
loop with pattern.
We could imagine being able to write something like:
std::vector<std::pair<int, int>> pairs = {{0, 1}, {1, 1}, {0, 0}, {0, 2}};
for ((0, x) : pairs) {
std::cout << x << ' ';
}
// prints: "1 0 2 "
With if_let
we can say something close-ish:
std::vector<std::pair<int, int>> pairs = {{0, 1}, {1, 1}, {0, 0}, {0, 2}};
for (const auto &p : pairs) {
using namespace mpark::patterns;
if_let (pattern(ds(0, arg)) = p) = [](auto x) {
std::cout << x << ' ';
};
}
// prints: "1 0 2 "
for_let
allows the above example to be written more concisely:
std::vector<std::pair<int, int>> pairs = {{0, 1}, {1, 1}, {0, 0}, {0, 2}};
using namespace mpark::patterns;
for_let (pattern(ds(0, arg)) = pairs) = [](auto x) {
std::cout << x << ' ';
};
// prints: "1 0 2 "
for_let
also provides the break
and continue
statements with
return Break;
and return Continue;
respectively.
std::vector<std::pair<int, int>> pairs = {{0, 1}, {1, 1}, {0, 0}, {0, 2}};
using namespace mpark::patterns;
for_let (pattern(ds(0, arg)) = pairs) = [](auto x) {
if (x == 1) return Break;
std::cout << x << ' ';
return Continue;
};
// prints: "0 "
std::vector<std::pair<int, int>> pairs = {{0, 1}, {1, 1}, {0, 0}, {0, 2}};
using namespace mpark::patterns;
for_let (pattern(ds(0, arg)) = pairs) = [](auto x) {
if (x == 0) return Continue;
std::cout << x << ' ';
return Continue;
};
// prints: "1 2 "
This library requires a standard conformant C++17 compiler. The following compilers are continously tested:
Compiler | Operating System | Version String |
---|---|---|
GCC 7.4.0 | Ubuntu 16.04.6 LTS | g++-7 (Ubuntu 7.4.0-1ubuntu1 |
GCC 8.1.0 | Ubuntu 16.04.6 LTS | g++-8 (Ubuntu 8.1.0-5ubuntu1~16.04) 8.1.0 |
Clang 5.0.0 | Ubuntu 16.04.6 LTS | clang version 5.0.0-3~16.04.1 (tags/RELEASE_500/final) |
Clang 6.0.0 | Ubuntu 16.04.6 LTS | clang version 6.0.0-1ubuntu2~16.04.1 (tags/RELEASE_600/final) |
Clang 7.1.0 | Ubuntu 16.04.6 LTS | clang version 7.1.0-svn353565-1 |
-
MPARK_PATTERNS_INCLUDE_TESTS
:BOOL
(default:OFF
)Build tests.
Refer to test/README.md.
Distributed under the Boost Software License, Version 1.0.