Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Keep the correct case for the charset for canonical string #12105

Merged

Conversation

dbussink
Copy link
Contributor

The canonical string representation needs to keep the correct case for the charset. Those are never upcased in MySQL so we should not enforce that here either.

Related Issue(s)

Fixes #12104

Checklist

  • "Backport to:" labels have been added if this change should be back-ported
  • Tests were added or are not required
  • Documentation was added or is not required

The canonical string representation needs to keep the correct case for
the charset. Those are never upcased in MySQL so we should not enforce
that here either.

Signed-off-by: Dirkjan Bussink <[email protected]>
@vitess-bot vitess-bot bot added NeedsDescriptionUpdate The description is not clear or comprehensive enough, and needs work NeedsWebsiteDocsUpdate What it says labels Jan 16, 2023
@vitess-bot
Copy link
Contributor

vitess-bot bot commented Jan 16, 2023

Review Checklist

Hello reviewers! 👋 Please follow this checklist when reviewing this Pull Request.

General

  • Ensure that the Pull Request has a descriptive title.
  • If this is a change that users need to know about, please apply the release notes (needs details) label so that merging is blocked unless the summary release notes document is included.
  • If a test is added or modified, there should be a documentation on top of the test to explain what the expected behavior is what the test does.

If a new flag is being introduced:

  • Is it really necessary to add this flag?
  • Flag names should be clear and intuitive (as far as possible)
  • Help text should be descriptive.
  • Flag names should use dashes (-) as word separators rather than underscores (_).

If a workflow is added or modified:

  • Each item in Jobs should be named in order to mark it as required.
  • If the workflow should be required, the maintainer team should be notified.

Bug fixes

  • There should be at least one unit or end-to-end test.
  • The Pull Request description should include a link to an issue that describes the bug.

Non-trivial changes

  • There should be some code comments as to why things are implemented the way they are.

New/Existing features

  • Should be documented, either by modifying the existing documentation or creating new documentation.
  • New features should have a link to a feature request issue or an RFC that documents the use cases, corner cases and test cases.

Backward compatibility

  • Protobuf changes should be wire-compatible.
  • Changes to _vt tables and RPCs need to be backward compatible.
  • vtctl command output order should be stable and awk-able.
  • RPC changes should be compatible with vitess-operator
  • If a flag is removed, then it should also be removed from VTop, if used there.

@dbussink dbussink added Type: Bug Component: Query Serving and removed NeedsDescriptionUpdate The description is not clear or comprehensive enough, and needs work NeedsWebsiteDocsUpdate What it says labels Jan 16, 2023
Copy link
Contributor

@shlomi-noach shlomi-noach left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Should we enforce lower casing the charset though?

@dbussink
Copy link
Contributor Author

Should we enforce lower casing the charset though?

We don't do that anywhere in the parser at the moment at all. We could maybe, but that's a whole different effort that goes beyond just the charset here, but applies to anything that's always lowercase and would need separate logic to be enforced then.

@GuptaManan100
Copy link
Member

GuptaManan100 commented Jan 16, 2023

I don't think we should enforce the lower casing, because mixed case and upper case do work with MySQL -

mysql [localhost:5731] {msandbox} (test) > select convert('abc' using utf8mb4);
+------------------------------+
| convert('abc' using utf8mb4) |
+------------------------------+
| abc                          |
+------------------------------+
1 row in set (0.00 sec)

mysql [localhost:5731] {msandbox} (test) > select convert('abc' using UTF8mb4);
+------------------------------+
| convert('abc' using UTF8mb4) |
+------------------------------+
| abc                          |
+------------------------------+
1 row in set (0.00 sec)

mysql [localhost:5731] {msandbox} (test) > select convert('abc' using UTF8MB4);
+------------------------------+
| convert('abc' using UTF8MB4) |
+------------------------------+
| abc                          |
+------------------------------+
1 row in set (0.00 sec)

@shlomi-noach shlomi-noach merged commit 0711feb into vitessio:main Jan 17, 2023
@shlomi-noach shlomi-noach deleted the dbussink/keep-correct-charset-case branch January 17, 2023 11:00
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Bug Report: wrong case for sqlparser.CanonicalString and CONVERT(... USING...)
3 participants