Extend ECMA-262 syntax into a superset of JSON #1188

gibson042 · 2018-05-07T21:37:38Z

This implements https://github.com/tc39/proposal-json-superset , currently at stage 3.

- Narrow the StringLiteral restriction in sec-line-terminators

ljharb · 2018-05-07T22:09:57Z

Tests: tc39/test262#1543 / tc39/test262#1544

littledan · 2018-05-10T22:39:05Z

Thanks for your contribution, @gibson042 .

ljharb · 2018-05-21T22:41:30Z

spec.html

@@ -9908,7 +9908,7 @@ <h2>Syntax</h2>
  <!-- es6num="11.3" -->
  <emu-clause id="sec-line-terminators">
    <h1>Line Terminators</h1>
-    <p>Like white space code points, line terminator code points are used to improve source text readability and to separate tokens (indivisible lexical units) from each other. However, unlike white space code points, line terminators have some influence over the behaviour of the syntactic grammar. In general, line terminators may occur between any two tokens, but there are a few places where they are forbidden by the syntactic grammar. Line terminators also affect the process of automatic semicolon insertion (<emu-xref href="#sec-automatic-semicolon-insertion"></emu-xref>). A line terminator cannot occur within any token except a |StringLiteral|, |Template|, or |TemplateSubstitutionTail|. Line terminators may only occur within a |StringLiteral| token as part of a |LineContinuation|.</p>
+    <p>Like white space code points, line terminator code points are used to improve source text readability and to separate tokens (indivisible lexical units) from each other. However, unlike white space code points, line terminators have some influence over the behaviour of the syntactic grammar. In general, line terminators may occur between any two tokens, but there are a few places where they are forbidden by the syntactic grammar. Line terminators also affect the process of automatic semicolon insertion (<emu-xref href="#sec-automatic-semicolon-insertion"></emu-xref>). A line terminator cannot occur within any token except a |StringLiteral|, |Template|, or |TemplateSubstitutionTail|. &lt;LF&gt; and &lt;CR&gt; line terminators cannot occur within a |StringLiteral| token except as part of a |LineContinuation|.</p>


For clarity and future explorers, can you elaborate on why this restriction is narrowed?

It's the fundamental change of proposal-json-superset, just in a part of the spec that I didn't realize included affected text until working on tc39/test262#1565 . As of this PR landing, StringLiteral will be able to include any LineTerminator as part of a LineContinuation (backslash followed by LineTerminatorSequence, used to statically define long strings over multiple lines without affecting their content), and—invalidating the old text—will also be able to include U+2028 LINE SEPARATOR and U+2029 PARAGRAPH SEPARATOR line terminators in any position (as is the case for JSON strings).

Previously, no line terminators could occur in string literals (except as part of a LineContinuation).

With the change, U+2028 and U+2029 can occur in string literals, even outside of LineContinuation.

In other words, this change clarifies that LineContinuations are unaffected by the change, i.e. \ followed by U+2028 or U+2029 in a string literal still evaluates to the empty string. This was already clear from the proposal, but updating this note as part of this patch improves accuracy.

Thanks, sounds good!

littledan · 2018-06-24T15:32:58Z

This proposal reached Stage 4 in the May 2018 TC39 meeting. Is the patch ready to land?

mathiasbynens · 2018-07-18T21:32:14Z

Side note: this commit has just been merged, yet it doesn’t show up in the first few entries on https://github.com/tc39/ecma262/commits/master. Instead, it’s further down the list here:

Note that May 7th is the date the original PR was created.

Is this confusing to anyone else or just me?

AFAICT until now, the commit log always followed merge order. Are we changing that now?

ljharb · 2018-07-18T21:39:14Z

This is actually a bug in github; it sorts by (and has sorted by) commit date, not actual merge order. cc @keithamus; is this something that we could request?

(The reason this happened here is that when I rebased the PR, i used --committer-date-is-author-date, which avoids obscuring the original commit date)

mathiasbynens · 2018-07-18T21:41:53Z

(The reason this happened here is that when I rebased the PR, i used --committer-date-is-author-date, which avoids obscuring the original commit date)

This way, the commit order doesn’t make sense even in git log. Can we not do that please?

ljharb · 2018-07-18T23:37:54Z

My git log locally works in proper merge order (ie, starting at the branch HEAD, and moving backwards one commit parent at a time). I'm not sure why it's working differently for you; perhaps aliases, or an outdated git version?

mathiasbynens · 2018-07-19T05:46:10Z

You’re right, git log does show in proper merge order. My bad — I was confused again by the fact that a) the commit dates don’t match the merge dates for recent commits, as explained before, and b) the commit overview doesn’t link to the PRs for any of the recently-merged commits (another difference from how things have been done until now).

Please don’t use --committer-date-is-author-date, and include the PR links in the commit message when merging, just like we’ve always done for this repository.

ljharb · 2018-07-19T05:59:30Z

I'll definitely include the PR links; that's a convention that wasn't stressed in our editors' meeting (however, if you click on any commit, the header on github shows the PR numbers, branches, and tags that it's contained in - so that link isn't strictly required).

PR tc39#1188 deleted the text "but using the alternative definition of |DoubleStringCharacter| provided below" from step 4 of JSON.parse's algorithm. At that point, the accompanying Note's phrase "as modified by Step 4 above" became obsolete.

PR #1188 deleted the text "but using the alternative definition of |DoubleStringCharacter| provided below" from step 4 of JSON.parse's algorithm. At that point, the accompanying Note's phrase "as modified by Step 4 above" became obsolete.

…2013) PR tc39#1188 deleted the text "but using the alternative definition of |DoubleStringCharacter| provided below" from step 4 of JSON.parse's algorithm. At that point, the accompanying Note's phrase "as modified by Step 4 above" became obsolete.

These two tests were expected to raise a syntax error because they involve a string literal containing U+2028 (LINE SEPARATOR) or U+2029 (PARAGRAPH SEPARATOR), which used to be disallowed. However, those code points have been allowed in string literals since the merge of tc39/ecma262#1188

Normative: Extend ECMA-262 syntax into a superset of JSON

cef8d49

- Narrow the StringLiteral restriction in sec-line-terminators

mathiasbynens approved these changes May 7, 2018

View reviewed changes

ljharb added the has test262 tests label May 7, 2018

ljharb approved these changes May 7, 2018

View reviewed changes

ljharb requested review from bterlson and bmeck May 7, 2018 22:11

bmeck approved these changes May 7, 2018

View reviewed changes

ljharb assigned bterlson May 16, 2018

bterlson approved these changes May 18, 2018

View reviewed changes

ljharb reviewed May 21, 2018

View reviewed changes

littledan removed the pending stage 4 This proposal has not yet achieved stage 4, but may otherwise be ready to merge. label Jun 24, 2018

ljharb assigned ljharb and unassigned bterlson Jul 18, 2018

ljharb force-pushed the proposal-json-superset branch from d64d3c6 to cef8d49 Compare July 18, 2018 19:06

ljharb merged commit cef8d49 into tc39:master Jul 18, 2018

ljharb mentioned this pull request Aug 10, 2020

Clarify which regular expression production rule from the ECMA specification is intended json-schema-org/json-schema-spec#821

Closed

jmdyck mentioned this pull request Jan 30, 2021

Move 2 tests from fail/ to pass/ due to LS and PS tc39/test262-parser-tests#30

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Extend ECMA-262 syntax into a superset of JSON #1188

Extend ECMA-262 syntax into a superset of JSON #1188

gibson042 commented May 7, 2018

ljharb commented May 7, 2018 •

edited

Loading

littledan commented May 10, 2018

ljharb May 21, 2018

gibson042 May 21, 2018

mathiasbynens May 21, 2018

ljharb May 21, 2018

littledan commented Jun 24, 2018

mathiasbynens commented Jul 18, 2018

ljharb commented Jul 18, 2018

mathiasbynens commented Jul 18, 2018

ljharb commented Jul 18, 2018

mathiasbynens commented Jul 19, 2018

ljharb commented Jul 19, 2018

Extend ECMA-262 syntax into a superset of JSON #1188

Extend ECMA-262 syntax into a superset of JSON #1188

Conversation

gibson042 commented May 7, 2018

ljharb commented May 7, 2018 • edited Loading

littledan commented May 10, 2018

ljharb May 21, 2018

Choose a reason for hiding this comment

gibson042 May 21, 2018

Choose a reason for hiding this comment

mathiasbynens May 21, 2018

Choose a reason for hiding this comment

ljharb May 21, 2018

Choose a reason for hiding this comment

littledan commented Jun 24, 2018

mathiasbynens commented Jul 18, 2018

ljharb commented Jul 18, 2018

mathiasbynens commented Jul 18, 2018

ljharb commented Jul 18, 2018

mathiasbynens commented Jul 19, 2018

ljharb commented Jul 19, 2018

ljharb commented May 7, 2018 •

edited

Loading