Query: Add caching of generated RelationalCommand based on nullability of parameters #15892

smitpatel · 2019-06-01T00:51:21Z

No description provided.

roji · 2019-07-19T22:32:14Z

Here's a requirement from the PostgreSQL side...

Thanks to PostgreSQL's fully support for arrays, we can translate the following:

var ids = new[] { 1, 2, 3 };
var customers = ctx.Customers.Where(c => ids.Contains(c.Id)).ToList();

Into the following:

SELECT * FROM customers WHERE id = ANY (@ids);

This obviates doing expansion of parameters into constants at runtime. The one place where this fails is if there's a null somewhere inside ids; to preserve null semantics, we'd need to translate to:

SELECT * FROM customers WHERE id = ANY (@ids) OR id IS null;

In other words, I'd ideally be able to cache to commands based on whether the array parameter's value contains a null anywhere.

smitpatel · 2019-07-19T23:22:15Z

I don't think it is relevant here in that context.
parameter nullability is @ids being null. Sniffing if @ids array contains a null value is no different than caching based on parameter values. And later has significantly low value.

smitpatel · 2019-07-19T23:25:05Z

Also modifying parameter values during Sql generation is not possible (not it should be made possible). So while there is possibility of adding additional term (just like how Inexpression expands) but @ids will still contain null values, unless you expand it out.

roji · 2019-07-20T08:51:57Z

Sniffing if @ids array contains a null value is no different than caching based on parameter values

Depending on how you look at it, it's another form of that caching. Looking at the nullability of a parameter isn't exactly the same as looking whether an array parameter contains null.

And later has significantly low value.

Why do you say that? In the non-PostgreSQL case, expanding a parameterized array to constant can yield potentially endless SQL based on the array's contents. As an optimization, instead of expanding the array to a constant we could expand to a set of parameterized scalar parameters, e.g. x IN (@p1, @p2) for two parameters, but that still leaves a different query for each number of parameters in the array.

PostgreSQL allows you to sidestep all that by having only two SQLs - one for when the array contains null, and one for when it does not. Note that different queries can have a pretty important perf impact (plan caching at the server, possibility to prepare statements...). That is why I'm looking into this.

Also modifying parameter values during Sql generation is not possible

I wasn't asking for that anywhere, only to add the additional nullability check in SQL.

divega · 2019-07-26T02:51:41Z

In the non-PostgreSQL case, expanding a parameterized array to constant can yield potentially endless SQL based on the array's contents. As an optimization, instead of expanding the array to a constant we could expand to a set of parameterized scalar parameters, e.g. x IN (@p1, @p2) for two parameters, but that still leaves a different query for each number of parameters in the array.

We have discussed doing this (see parameter rewrite in #13617 (comment)) and even repeating the value of the last element of the array to fill-in to fixed array sizes.

I see that, what @roji is describing, using a TVPs (which incidentally would be very similar to what @roji is describing), and a string we would need to build to pass to STRING_SPLIT() as possible cases of specialized parameter binding in which constants in the source expression tree end up not mapping 1:1 with constants in the generated SQL.

In all those cases there is some processing required to get from the input values to the actual parameters for the query, so I don't think we should reject the idea of sniffing into the individual parameter values to check if any of them is null, either to generate different SQL or to producing an extra bool parameter to short-circuit the null check, e.g.:

SELECT * FROM customers WHERE id = ANY (@ids) OR (@any_id_is_null AND id IS null);

Of course, the simpler and more performant the solution, the better.

FWIW, in the original example this extra sniffing shouldn't be needed because the array is of type int [] and not int? []

From what I remember, we have seen other cases in which matching values other than null when we sniff parameters could lead to simpler SQL. I am not convinced the value of this is low, but it hasn't been high enough so far.

smitpatel · 2019-07-26T04:16:02Z

This issue does not track caching based on parameter value sniffing. While it may look tempting but wrt implementation cost, it may not cross value-cost bar.

divega · 2019-07-26T04:59:27Z

I agree we should split this part of the discussion into a separate issue.

roji · 2019-07-26T06:13:51Z

@smitpatel is there already another issue for caching based on parameter sniffing? Did a quick search was surprised not to find it yet.

While it may look tempting but wrt implementation cost, it may not cross value-cost bar.

The point for me is that if I understand correctly, we intend to implement parameter sniffing and caching in any way, to able able to get do tighter null semantics (i.e. eliminate current unneeded null checks). So what I'm asking for can hopefully be a relatively small incremental addition over that (sniff values instead parameter arrays as opposed to only null parameters).

FWIW, in the original example this extra sniffing shouldn't be needed because the array is of type int [] and not int? []

Good point :)

smitpatel · 2019-07-26T14:18:36Z

we intend to implement parameter sniffing and caching in any way, to able able to get do tighter null semantics

For caching we intend to only check if parameter is null or not null. No more than 2 states. We don't care what is the value if it is not null.
If we actually look into non-null value then it won't be cached.

roji · 2019-07-26T15:48:08Z

@smitpatel I posted on the above precisely to discuss this, am assuming design/decisions haven't been locked down yet... It should a function of added complexity/effort no?

smitpatel · 2019-07-26T15:56:53Z

This issue specifically track the second level caching of select expression we had in previous pipeline. Hence my very first comment that it is not relevant in this issue.
We had one working & effective system in past. We can re-implement it. It does not mean that we cannot extend it. But as of now, we do not have design for such caching which sniff parameter values. I see negative value trying to add a functionality for which we don't have design blocking addition of working component from past. Please file a new issue, and present a design on how parameter sniffing based caching is supposed to work.

roji · 2019-09-03T21:18:16Z

Proposal for a more general parameter-sniffing caching mechanism: #17598

Resolves #15892

ajcvickers added this to the Backlog milestone Jun 3, 2019

ajcvickers added area-perf type-enhancement labels Jun 3, 2019

roji mentioned this issue Jul 20, 2019

Handle InExpression as array parameter instead of expanding to constant npgsql/efcore.pg#916

Closed

smitpatel mentioned this issue Aug 27, 2019

Slow compilation when using many .ThenInclude()'s #17455

Closed

divega mentioned this issue Aug 31, 2019

Queries really slow due to null checks #17543

Closed

roji mentioned this issue Sep 3, 2019

General QueryContext-based mechanism for caching of generated RelationalCommand #17598

Open

smitpatel added a commit that referenced this issue Sep 25, 2019

Add RelationalCommandCaching based on parameter value nullability

7360972

Resolves #15892

This was referenced Sep 25, 2019

Only avoid caching of RelationalCommand when visitor says so #18034

Open

Add RelationalCommandCaching based on parameter value nullability #18035

Merged

smitpatel removed this from the Backlog milestone Oct 7, 2019

ajcvickers added type-bug and removed type-enhancement labels Oct 10, 2019

ajcvickers assigned smitpatel Oct 10, 2019

ajcvickers added this to the 3.1.0 milestone Oct 10, 2019

ajcvickers added the closed-fixed The issue has been fixed and is/will be included in the release indicated by the issue milestone. label Oct 10, 2019

smitpatel mentioned this issue Oct 16, 2019

Significant Query Slowdown When Using Multiple Joins Due To Changes In 3.0 #18022

Closed

smitpatel added a commit that referenced this issue Oct 21, 2019

Add RelationalCommandCaching based on parameter value nullability

4562f7b

Resolves #15892

smitpatel closed this as completed in #18035 Oct 21, 2019

smitpatel added a commit that referenced this issue Oct 21, 2019

Add RelationalCommandCaching based on parameter value nullability

9d4e349

Resolves #15892

ajcvickers modified the milestones: 3.1.0, 3.1.0-preview2 Oct 24, 2019

ajcvickers modified the milestones: 3.1.0-preview2, 3.1.0 Dec 2, 2019

bachratyg mentioned this issue Apr 13, 2021

Query: parameter nullability sniffing broken for global query filters #24645

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Query: Add caching of generated RelationalCommand based on nullability of parameters #15892

Query: Add caching of generated RelationalCommand based on nullability of parameters #15892

smitpatel commented Jun 1, 2019

roji commented Jul 19, 2019

smitpatel commented Jul 19, 2019

smitpatel commented Jul 19, 2019

roji commented Jul 20, 2019

divega commented Jul 26, 2019 •

edited

Loading

smitpatel commented Jul 26, 2019

divega commented Jul 26, 2019

roji commented Jul 26, 2019

smitpatel commented Jul 26, 2019

roji commented Jul 26, 2019

smitpatel commented Jul 26, 2019 •

edited

Loading

roji commented Sep 3, 2019

Query: Add caching of generated RelationalCommand based on nullability of parameters #15892

Query: Add caching of generated RelationalCommand based on nullability of parameters #15892

Comments

smitpatel commented Jun 1, 2019

roji commented Jul 19, 2019

smitpatel commented Jul 19, 2019

smitpatel commented Jul 19, 2019

roji commented Jul 20, 2019

divega commented Jul 26, 2019 • edited Loading

smitpatel commented Jul 26, 2019

divega commented Jul 26, 2019

roji commented Jul 26, 2019

smitpatel commented Jul 26, 2019

roji commented Jul 26, 2019

smitpatel commented Jul 26, 2019 • edited Loading

roji commented Sep 3, 2019

divega commented Jul 26, 2019 •

edited

Loading

smitpatel commented Jul 26, 2019 •

edited

Loading