Add support for sql variable inside query in snowflake and mysql dialect #265

eyalleshem · 2020-08-16T16:20:50Z

see :
https://docs.snowflake.com/en/sql-reference/session-variables.html
https://dev.mysql.com/doc/refman/8.0/en/user-variables.html

coveralls · 2020-08-16T16:22:43Z

Pull Request Test Coverage Report for Build 210968877

47 of 47 (100.0%) changed or added relevant lines in 5 files are covered.
No unchanged relevant lines lost coverage.
Overall coverage increased (+0.08%) to 92.02%

Totals
Change from base Build 205645794:	0.08%
Covered Lines:	4601
Relevant Lines:	5000

💛 - Coveralls

nickolay · 2020-09-28T02:20:34Z

Sorry for the long delay.

This was previously discussed in #48, where using a custom dialect, in which $ can start an identifier, was deemed a good enough solution.

@alex-dukhno recently noted that a custom dialect would fail all the dialect_of! checks we've started to add, so perhaps it is time to reconsider.

Doing this in the parser results in accepting $ var as a variable, which is weird. I guess you did that to support MySQL's @"quoted identifier" notation, but still it seems like this logic belongs in the tokenizer.

I'd appreciate it if the PR included the logic we're trying to implement, instead of simply a link to the docs. For snowflake it seems that the relevant bits are the following, and we're focused on implementing the first two only?

- all variables must be prefixed with a $ sign (the documentation does not explain what can follow the dollar sign, though)
- can be used in Snowflake anywhere a literal constant is allowed
- Variables can also contain identifier names, such as table names (e.g. SELECT * FROM IDENTIFIER($MY_VARIABLE))

eyalsatori · 2020-10-05T19:02:11Z

About the custom dialect, I can think of two options:

- Maybe it would be better to use the dialect_of macro only in the parser, and if we need specific behaviour in the tokenizer, add another function to the "Dialect" trait.
- Maybe we could add some kind of "follow_dialect" function to the trait, and change the "dialect_of" macro to return true if it is the current dialect or a dialect that follows the current dialect.

About the current PR: do you think it would be better if we take the whole variable as a single token? (For now, with the current dialect_of macro.)

About Snowflake: I don't think I want to treat the third case differently from the others. I think "IDENTIFIER" should be parsed as a function, and its argument should be an expression of a sql-variable kind.
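To make the "follow_dialect" option concrete, here is a minimal sketch. Everything in it is hypothetical: the `Dialect` trait below is a toy stand-in rather than the crate's real trait, and `follow_dialect`, `self_type_id`, `dialect_is` and `MyCustomDialect` are names invented for illustration. It only shows how a check could accept either the dialect itself or a dialect that declares it follows it.

```rust
use std::any::TypeId;

/// Toy stand-in for a dialect trait (an assumption, not the real definition).
trait Dialect: 'static {
    /// Hypothetical hook: the dialect this one follows, if any.
    fn follow_dialect(&self) -> Option<TypeId> {
        None
    }

    /// The concrete type of this dialect.
    fn self_type_id(&self) -> TypeId {
        TypeId::of::<Self>()
    }
}

/// Returns true if `dialect` is `T`, or declares that it follows `T`.
/// This plays the role a changed `dialect_of` check might play.
fn dialect_is<T: Dialect>(dialect: &dyn Dialect) -> bool {
    let wanted = TypeId::of::<T>();
    dialect.self_type_id() == wanted || dialect.follow_dialect() == Some(wanted)
}

struct SnowflakeDialect;
impl Dialect for SnowflakeDialect {}

/// A user-defined dialect that wants Snowflake-specific behaviour.
struct MyCustomDialect;
impl Dialect for MyCustomDialect {
    fn follow_dialect(&self) -> Option<TypeId> {
        Some(TypeId::of::<SnowflakeDialect>())
    }
}
```

With these definitions, `dialect_is::<SnowflakeDialect>(&MyCustomDialect)` is true, while `dialect_is::<MyCustomDialect>(&SnowflakeDialect)` is false, so the relation is one-directional.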

nickolay · 2020-10-06T09:50:26Z

About the general points you raised:

"use the dialect_of macro [... or ...] add another function to the "Dialect" trait." -- I believe we should use dialect_of! by default to handle differences between the dialects we support directly, and consider other solutions when there's a problem with dialect_of.

I agree tokenizer will probably end up not using dialect_of! much, but designing an alternative compatible with all the dialects requires more upfront research (that's why I was OK with merging [snowflake] Support single line comments starting with '#' or '//' #264, which used dialect_of in the tokenizer)
The follow_dialect idea is somewhat off-topic for this issue, unless you brought it up as another workaround for us not supporting bind variables. If you or someone else needs something like follow_dialect specifically, I'd like to discuss this in a separate issue.

About the current PR - Do you think it would be better if we take the whole variable as a single token?

This seems more appropriate, yes, given the $ var (with the space between the dollar sign and the variable name) issue I brought up. But again, I'd like us to start with defining the problem we're trying to solve.

For instance implementing the "can be used in Snowflake anywhere a literal constant is allowed" requirement will require rather invasive changes to the parser. Considering other dialects will require more research.

If you want to implement a subset of the Snowflake dialect that recognizes $vars in expression context only for now, that can be achieved relatively easily. This is how I would do it:

- Define what characters can follow $ in a variable name, and use dialect_of! to conditionally parse it into a Variable token.
- Parse the Variable token in parse_value to a new Value variant.
- Match on the Variable token along with Number and others unconditionally in parse_prefix.
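The first of these steps could look roughly like the sketch below. It is a toy tokenizer, not the crate's real one: `Token`, `tokenize`, `is_variable_part` and the `dialect_has_variables` flag (standing in for a dialect_of! check) are all hypothetical names. The rule that a name consists of ASCII letters, digits and underscores is an assumption, since the Snowflake documentation does not say what may follow the dollar sign.

```rust
/// Toy token type for this sketch (an assumption, not the real token enum).
#[derive(Debug, PartialEq)]
enum Token {
    /// A `$name` variable, stored without the leading `$`.
    Variable(String),
    /// Any other single character, kept as-is.
    Char(char),
}

/// Assumed rule: names are made of ASCII letters, digits and underscores.
fn is_variable_part(c: char) -> bool {
    c.is_ascii_alphanumeric() || c == '_'
}

/// Tokenizes `input`, turning `$name` into one `Token::Variable`.
/// `dialect_has_variables` stands in for a dialect check.
fn tokenize(input: &str, dialect_has_variables: bool) -> Vec<Token> {
    let mut tokens = Vec::new();
    let mut chars = input.chars().peekable();
    while let Some(c) = chars.next() {
        if c == '$' && dialect_has_variables {
            // Consume the name directly after `$`; whitespace ends it.
            let mut name = String::new();
            while let Some(&next) = chars.peek() {
                if !is_variable_part(next) {
                    break;
                }
                name.push(next);
                chars.next();
            }
            if name.is_empty() {
                // A lone `$` (as in `$ var`) is not a variable.
                tokens.push(Token::Char('$'));
            } else {
                tokens.push(Token::Variable(name));
            }
        } else {
            tokens.push(Token::Char(c));
        }
    }
    tokens
}
```

Because the name is consumed inside the tokenizer, `$var` becomes a single Variable token, while `$ var` (with a space) does not.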

alex-dukhno · 2020-11-29T10:47:09Z

Hi @nickolay

While developing a PostgreSQL protocol-compatible database, I have collected some knowledge about $var in Postgres.

$ can be followed only by a number, starting from 1. Query examples, based on the official docs:
- prepare n (int2) as insert into numbers values ($abc); gives you ERROR: syntax error at or near "$"
- after the query prepare n (int2) as insert into numbers values ($2);
  - if you try to run execute n(1); you will see ERROR: wrong number of parameters for prepared statement "n" DETAIL: Expected 2 parameters but got 1.
  - however, if you run execute n(1, 10); then 10 will be inserted into the table
Intuitively, it should follow the $[parameter_index] pattern, where parameter_index is the index into the values passed by a query like execute n(1, 10);
It can be used in insert queries as variable values, e.g. insert into <table_name> values ($1, $2);
It can be used in update queries in SET <column_name>=$1 expressions.
It can be used in where, join and having predicates.
I've checked select $1; it also works.
It doesn't work with SET <param_name> =, e.g. prepare set_stmt as SET extra_float_digits = $1; results in ERROR: syntax error at or near "SET"

I am wondering whether, based on the above info, your suggestion:

If you want to implement a subset of the Snowflake dialect that recognizes $vars in expression context only for now, that can be achieved relatively easily. This is how I would do it:

Define what characters can follow $ in a variable name, and use dialect_of! to conditionally parse it into a Variable token.

Parse the Variable token in parse_value to a new Value variant

Match on the Variable token along with Number and others unconditionally in parse_prefix.

could be applied to PostgreSqlDialect?
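As an illustration of the $[parameter_index] pattern described in this comment, here is a minimal sketch of a recognizer for such placeholders. The function name `parse_placeholder` is hypothetical and the code is not taken from the PR. Rejecting `$0` is an assumption based on the note above that numbering starts from 1.

```rust
/// Parses a positional placeholder such as `$1`.
/// Returns the 1-based parameter index, or `None` if `text` is not one.
fn parse_placeholder(text: &str) -> Option<u32> {
    // The text must start with `$`.
    let digits = text.strip_prefix('$')?;
    // Only ASCII digits may follow; this also rejects a sign such as `$+1`.
    if digits.is_empty() || !digits.chars().all(|c| c.is_ascii_digit()) {
        return None;
    }
    match digits.parse::<u32>() {
        // Assumption: indexes start from 1, so `$0` is rejected.
        // Values too large for u32 are rejected as well.
        Ok(0) | Err(_) => None,
        Ok(index) => Some(index),
    }
}
```

Under these rules `$2` yields index 2, while `$abc` is rejected, matching the syntax error reported above for `values ($abc)`.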

andygrove

LGTM! Thanks @eyalleshem

andygrove · 2021-02-07T15:08:32Z

@eyalleshem Could you rebase please so I can merge this?

alamb · 2021-08-20T18:20:18Z

Hi @eyalleshem -- sorry for the delay in review. I am going to help out now with this repo and we are working to clear the backlog. Is this PR still something you would like to work on to help contribute?

alamb · 2021-09-09T15:47:35Z

I am closing what look like stale PRs in this repo; I apologize in advance if this is a mistake -- please feel free to reopen if you want to keep working on this issue.

Add support for sql variable inside query in snowflake and mysql dialect

51a2627

see : https://docs.snowflake.com/en/sql-reference/session-variables.html https://dev.mysql.com/doc/refman/8.0/en/user-variables.html

eyalleshem force-pushed the snowflake_variable_name branch from a998778 to 51a2627 Compare August 16, 2020 16:22

nickolay mentioned this pull request Sep 29, 2020

added parsing for PostgreSQL operations #267

Merged

alex-dukhno mentioned this pull request Jan 28, 2021

Parameterized queries #291

Open

andygrove approved these changes Feb 7, 2021

alamb closed this Sep 9, 2021
