Style/StringLiterals ignores strings with non-ascii characters #3017

deivid-rodriguez · 2016-04-07T10:41:04Z

echo '"Esp"' | rubocop --stdin --only Style/StringLiterals -

results in

Inspecting 1 file
C

Offenses:

-:1:1: C: Prefer single-quoted strings when you don't need string interpolation or special symbols.
"Esp"
^^^^^

1 file inspected, 1 offense detected

whereas

echo '"España"' | rubocop --stdin --only Style/StringLiterals -

results in

Inspecting 1 file
.

1 file inspected, no offenses detected

$ rubocop -V
0.39.0 (using Parser 2.3.0.7, running on ruby 2.3.0 x86_64-linux)

alexdowad · 2016-05-31T09:26:18Z

Hi! This was by design. Please see lines 231-238 of util.rb:

      # If double quoted string literals are found in Ruby code, and they are
      # not the preferred style, should they be flagged?
      def double_quotes_acceptable?(string)
        # If a string literal contains hard-to-type characters which would
        # not appear on a "normal" keyboard, then double-quotes are acceptable
        double_quotes_required?(string) ||
          string.codepoints.any? { |cp| cp < 32 || cp > 126 }
      end

...I don't remember what the reasoning was behind this code, but if you want to argue for something else, please do so. Anyways, this is not a "bug" in the sense of unintentional behavior.
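
As a rough illustration of the check quoted above, the codepoint test can be tried on its own. This is a minimal sketch, not RuboCop's API: the method name `outside_printable_ascii?` is hypothetical, and only the `codepoints.any?` expression is taken from the quoted snippet.

```ruby
# Hypothetical standalone copy of the codepoint test from the quoted snippet.
# It returns true when any character falls outside printable ASCII (32..126).
def outside_printable_ascii?(string)
  string.codepoints.any? { |cp| cp < 32 || cp > 126 }
end

puts outside_printable_ascii?('Esp')     # false: every codepoint is in 32..126
puts outside_printable_ascii?('España')  # true: "ñ" is U+00F1 (decimal 241)
```

With this test in place, any double-quoted literal containing "ñ" counts as acceptable, which matches the behavior reported at the top of the issue.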

bbatsov · 2016-05-31T09:27:14Z

> ...I don't remember what the reasoning was behind this code, but if you want to argue for something else, please do so. Anyways, this is not a "bug" in the sense of unintentional behavior.

I don't remember this either.

alexdowad · 2016-05-31T09:32:34Z

LOL. I think it was me who wrote it.

deivid-rodriguez · 2016-05-31T09:35:58Z

Yeah, I was git-blaming this and it seems so! :)

alexdowad · 2016-05-31T11:24:46Z

OK, I remember why this is so.

The Ruby parser doesn't differentiate between your string and "Espa\u00f1a". Both of them come out just the same in the AST which we analyze.

So when we find a double-quoted string literal with a \u00f1 in it, we don't flag it, because the programmer may have entered a literal \u00f1 and we don't want to force them to enter the funny little squiggly little "n" thing instead. We might confuse some poor ignorant souls, who don't know how to enter those fancy European characters. (Like... me...)
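
The point about both spellings looking identical after parsing can be checked in plain Ruby. This is only a sketch of the string values involved, not of the parser's AST, and the variable names are illustrative.

```ruby
typed   = "España"        # the character typed directly
escaped = "Espa\u00f1a"   # the same character written as a \u escape
single  = 'Espa\u00f1a'   # single quotes keep the backslash sequence as-is

puts typed == escaped     # true: both literals evaluate to the same string
puts single == escaped    # false: the escape is no longer interpreted
```

The second comparison shows why blindly converting the escaped form to single quotes would change the program's meaning.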

deivid-rodriguez · 2016-05-31T12:36:33Z

@alexdowad I'm not sure I quite get it. What's the technical limitation that prevents fixing this?

alexdowad · 2016-05-31T13:32:45Z

The "technical limitation" is that if we fix the handling of "España", we will also fix the handling of "Espa\u00f1a". And when I say we will "fix" it, I mean that in the sense of "breaking" it.

That is, unless you actually re-parse the source code for the string yourself, and distinguish between the 2 cases. The AST which parser gives us is the same for the 2 cases mentioned above.
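
One way to picture "re-parsing the source yourself" is to look at the literal's raw source text instead of its evaluated value. The following is a minimal sketch under that assumption, not the change that was eventually merged: `double_quotes_needed?` is a hypothetical helper, and it deliberately ignores less common cases such as `#@ivar` interpolation.

```ruby
# Hypothetical helper: decide from the raw source of a double-quoted literal
# (surrounding quotes included) whether the double quotes are doing any work.
def double_quotes_needed?(source)
  body = source[1..-2]        # drop the surrounding quote characters
  body.include?('\\') ||      # a backslash escape such as \u00f1 or \n
    body.include?('#{') ||    # string interpolation
    body.include?("'")        # a literal single quote inside the string
end

puts double_quotes_needed?('"España"')       # false: nothing needs double quotes
puts double_quotes_needed?('"Espa\u00f1a"')  # true: the source contains an escape
```

Because the decision is made on the source text, the two literals that look the same in the AST are told apart here.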

deivid-rodriguez · 2016-06-01T01:23:18Z

> That is, unless you actually re-parse the source code for the string yourself, and distinguish between the 2 cases.

I guess this would be the way to go, then. Maybe I'll give it a try!

mikegee mentioned this issue Apr 8, 2016

Style/StringLiterals doesn't correctly handle strings with non-ASCII characters in them #3024

Closed

deivid-rodriguez mentioned this issue Apr 12, 2016

Style/StringLiterals: link_to '#foo' not detected as violating double_quotes #3037

Closed

deivid-rodriguez mentioned this issue Jun 1, 2016

[Fix #3017][Fix #3056] Style/StringLiterals now works with non-ascii strings #3189

Merged

bbatsov closed this as completed in #3189 Jun 15, 2016

bbatsov added a commit that referenced this issue Jun 15, 2016

Merge pull request #3189 from deivid-rodriguez/string_literals_for_no…

2e6fb07

…n_ascii_strings [Fix #3017][Fix #3056] `Style/StringLiterals` now works with non-ascii strings

johanlunds mentioned this issue Jul 11, 2016

Undesired string literal replacements when UTF8 locale not set in ENV #3311

Closed

rrosenblum mentioned this issue Sep 13, 2017

Run tests against supported jRuby versions #4703

Merged
