Improve validators graph #212

Stranger6667 · 2021-05-03T21:27:19Z

The core idea behind this library is to store all keywords as a tree similar to the original schema JSON object. However, the implementation is not as efficient as I'd like it to be. Primarily, it has the following disadvantages:

Each node's siblings are stored in a separate vector, which leads to a very fragmented graph that negatively affects cache locality;
It doesn't allow cycles. The primary use case is the $ref keyword, where some compilation logic happens during validation. The results are cached, but the cost of the RwLock is still visible in flame graphs. Also, when logically the same validator is called recursively, some nodes are re-created even though, in theory, they could be reused.

This issue is the starting point to rethink the implementation and maybe to implement something better.

Here are some thoughts I have in mind:

It might be possible to use an arena to allocate all boxed validators, then navigating could be done with indexes (maybe);
An interesting article on using indexes in graphs;
It might be nice to have some tool to display the validators graph.
More ideas
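To make the arena-and-indexes idea concrete, here is a minimal sketch, not the library's actual code. It assumes a hypothetical `Arena` that owns every node in one `Vec` and a hypothetical `NodeId` index type; the small `Value` enum stands in for `serde_json::Value` so the example has no dependencies. Because children are referenced by index rather than by ownership, a recursive `$ref` back to the root is just another index, so cycles need no lock and no re-created nodes.

```rust
/// Tiny JSON-like value; a stand-in for `serde_json::Value` (assumption).
enum Value {
    Null,
    Integer(i64),
    Object(Vec<(String, Value)>),
}

/// Index of a node inside the arena (hypothetical name).
#[derive(Clone, Copy)]
struct NodeId(usize);

/// Hypothetical validator nodes. Children are indexes, not owned boxes.
enum Node {
    IsInteger,
    IsNull,
    /// Valid if any referenced node accepts the instance.
    AnyOf(Vec<NodeId>),
    /// Requires an object; each listed property, when present, must satisfy
    /// the referenced node. (Simplified: real `properties` ignores non-objects.)
    Object(Vec<(String, NodeId)>),
}

struct Arena {
    nodes: Vec<Node>,
}

impl Arena {
    /// Stores a node and returns its index.
    fn push(&mut self, node: Node) -> NodeId {
        self.nodes.push(node);
        NodeId(self.nodes.len() - 1)
    }

    /// Validates `instance` starting from the node at `id`.
    fn is_valid(&self, id: NodeId, instance: &Value) -> bool {
        match &self.nodes[id.0] {
            Node::IsInteger => matches!(instance, Value::Integer(_)),
            Node::IsNull => matches!(instance, Value::Null),
            Node::AnyOf(ids) => ids.iter().any(|&child| self.is_valid(child, instance)),
            Node::Object(props) => match instance {
                Value::Object(fields) => props.iter().all(|(name, child)| {
                    match fields.iter().find(|(key, _)| key == name) {
                        Some((_, value)) => self.is_valid(*child, value),
                        None => true, // absent properties are not checked
                    }
                }),
                _ => false,
            },
        }
    }
}

/// Builds a linked-list schema whose `next` property refers back to the root,
/// roughly `{"properties": {"value": integer, "next": anyOf [null, {"$ref": "#"}]}}`.
fn linked_list_schema() -> (Arena, NodeId) {
    let mut arena = Arena { nodes: Vec::new() };
    // Reserve the root slot first so that children can point back at it.
    let root = arena.push(Node::Object(Vec::new()));
    let integer = arena.push(Node::IsInteger);
    let null = arena.push(Node::IsNull);
    let next = arena.push(Node::AnyOf(vec![null, root])); // the cycle
    arena.nodes[root.0] = Node::Object(vec![
        ("value".to_string(), integer),
        ("next".to_string(), next),
    ]);
    (arena, root)
}
```

Calling `arena.is_valid(root, &instance)` on the pair returned by `linked_list_schema()` walks nested lists of any depth while the arena holds only four nodes. Reserving the root slot and patching it afterwards is one simple way to create a back-reference; a real compiler would more likely resolve references in a separate pass.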

Later I'll draw the current implementation, so all the overhead is more visible.


Stranger6667 · 2021-12-29T14:10:45Z

At the moment, I think that the following layout will be a good step in this direction:

Store validators in a vector
Store metadata (relative_path / absolute_path) in a separate vector
Pass &[BoxedValidator] to each validator in is_valid / validate / apply so each validator can call dependent validators
Use indexes to access dependent validators/metadata (as a single one or a range)

This should improve cache locality, since all validators will live in the same vector. It should also remove the cost of following references between validation nodes (it becomes an array access by index), and is_valid won't pay extra for loading the larger SchemaNode, since it will work with just a boxed validator.
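The layout described above could look roughly like the following sketch. It is an illustration under assumptions, not the actual implementation: the `Validate` trait shape, the `Compiled` struct, the concrete validators, and the `Value` enum (a stand-in for `serde_json::Value`) are all hypothetical. Only `BoxedValidator`, `is_valid`, and the idea of a separate `relative_path` vector come from the comment itself.

```rust
use std::ops::Range;

/// Tiny JSON-like value; a stand-in for `serde_json::Value` (assumption).
enum Value {
    Integer(i64),
    Text(String),
    Array(Vec<Value>),
}

/// Hypothetical trait: every validator receives the shared slice so it can
/// call its dependent validators by index.
trait Validate {
    fn is_valid(&self, validators: &[BoxedValidator], instance: &Value) -> bool;
}

type BoxedValidator = Box<dyn Validate>;

struct IsInteger;
impl Validate for IsInteger {
    fn is_valid(&self, _: &[BoxedValidator], instance: &Value) -> bool {
        matches!(instance, Value::Integer(_))
    }
}

struct Minimum(i64);
impl Validate for Minimum {
    fn is_valid(&self, _: &[BoxedValidator], instance: &Value) -> bool {
        match instance {
            Value::Integer(n) => *n >= self.0,
            _ => true, // `minimum` only constrains numbers
        }
    }
}

/// Dependent validators stored as a contiguous range of the shared vector.
struct AllOf {
    children: Range<usize>,
}
impl Validate for AllOf {
    fn is_valid(&self, validators: &[BoxedValidator], instance: &Value) -> bool {
        validators[self.children.clone()]
            .iter()
            .all(|v| v.is_valid(validators, instance))
    }
}

/// A single dependent validator, applied to every array item.
struct Items {
    child: usize,
}
impl Validate for Items {
    fn is_valid(&self, validators: &[BoxedValidator], instance: &Value) -> bool {
        match instance {
            Value::Array(items) => items
                .iter()
                .all(|item| validators[self.child].is_valid(validators, item)),
            _ => true, // `items` only constrains arrays
        }
    }
}

/// Hypothetical compiled schema: validators and metadata in parallel vectors.
struct Compiled {
    validators: Vec<BoxedValidator>,
    /// Same indexes as `validators`; never touched on the `is_valid` path.
    relative_paths: Vec<String>,
    root: usize,
}

impl Compiled {
    fn is_valid(&self, instance: &Value) -> bool {
        self.validators[self.root].is_valid(&self.validators, instance)
    }
}

/// Hand-compiled `{"items": {"allOf": [{"type": "integer"}, {"minimum": 0}]}}`.
fn compile_example() -> Compiled {
    let validators: Vec<BoxedValidator> = vec![
        Box::new(IsInteger),                // 0
        Box::new(Minimum(0)),               // 1
        Box::new(AllOf { children: 0..2 }), // 2
        Box::new(Items { child: 2 }),       // 3 (root)
    ];
    let relative_paths = vec![
        "/items/allOf/0/type".to_string(),
        "/items/allOf/1/minimum".to_string(),
        "/items/allOf".to_string(),
        "/items".to_string(),
    ];
    Compiled { validators, relative_paths, root: 3 }
}
```

In this sketch, children are laid out before their parent so that `allOf` can refer to them with one range. Whether that ordering is practical for every keyword is an open question the comment does not settle.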

As a next step, it should then be easier to evaluate $ref during the compilation phase.

Stranger6667 added Priority: Low Type: Enhancement Topic: Performance Difficulty: Hard labels May 3, 2021

tamasfe mentioned this issue Jul 12, 2021

WIP: Validator Layout and Resolver #247

Closed

Stranger6667 mentioned this issue Aug 15, 2021

Output formats #274

Merged

Stranger6667 mentioned this issue May 3, 2022

simd support? #368

Closed

Stranger6667 closed this as completed Oct 20, 2024
