Basic Syntax #162

jsiek · 2020-09-19T19:06:00Z

proposals/p0162.md

chandlerc

I still want to go through this in more detail, but I captured a few minor notes while we were looking at this during the meeting this week and didn't want them to get lost, so I'm posting them right away.

josh11b

Please take my comments as a starting point for discussion rather than demands. I think I have formed a picture in my head of where I thought the syntax was going and this PR has been great for (a) firming up those ideas, and (b) helping me examine the places where there are weaknesses. I fully expect there are others on the team who have different ideas on specific syntax points, and we have mostly been postponing resolving those differences until we have agreed on more of the rest of the design. I think this is a great forum where we can hash out things in a very concrete way.

proposals/p0162.md

zygoloid · 2020-10-29T00:30:42Z

proposals/p0162.md

+The grammar rule
+
+```Bison
+expression:  "fnty" tuple return_type
+```
+
+is for function types. They are meant to play the role that function pointers
+play in C and that `std::function` plays in C++.


I think it's premature to include this. If the rest of the proposal doesn't depend on it, can we remove function types for now and reintroduce them in a later proposal if needed?

The problem with removing function types is that it significantly reduces the expressiveness of this little language. You would no longer be able to put functions inside of tuples and you would no longer be able to pass functions as arguments or return values of other functions. In other words, functions won't be "first class" values. That will reduce the scope of examples that we'll be able to handle with this subset. I'd prefer that we put function types in right away, and then when we have a better alternative, it can be swapped in.
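To make "first class" concrete, here is a rough C++ analogy using `std::function`, which the proposal text names as the C++ counterpart of function types. This is not Carbon code, and the names `IntToInt`, `ApplyTwice`, `MakeAdder`, and `MakePair` are hypothetical, chosen only for illustration.

```cpp
#include <functional>
#include <tuple>

// Hypothetical alias: a value that holds any callable taking and returning int.
using IntToInt = std::function<int(int)>;

// A function passed as an argument.
inline int ApplyTwice(const IntToInt& f, int x) { return f(f(x)); }

// A function returned as a value.
inline IntToInt MakeAdder(int n) {
  return [n](int x) { return x + n; };
}

// Functions stored inside a tuple.
inline std::tuple<IntToInt, IntToInt> MakePair() {
  return std::tuple<IntToInt, IntToInt>(MakeAdder(1),
                                        [](int x) { return x * 2; });
}
```

Without some notion of a function type, none of the three signatures above could be written, which is the loss of expressiveness described in this comment.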

The problem with including them is that (from my perspective) we often care more about an overload set than a specific function signature. Having machinery about function types sets us down the path of re-introducing that mistake.

Tracking issue for this question: #191

zygoloid · 2020-10-29T00:34:24Z

proposals/p0162.md

+precedence. Proposal 168 differs in that the operator groups are partially
+ordered instead of being totally ordered.


I don't think that necessarily follows. We won't be able to use bison's operator precedence parsing system. But you can implement operator precedence in other ways, for example using the precedence-climbing method (https://en.wikipedia.org/wiki/Operator-precedence_parser#Precedence_climbing_method) -- that's how the C++ specification describes operator precedence, for example -- and I would expect bison to be able to handle that. And I think the precedence-climbing method does support partial precedence orders such as the one suggested by #168. It also has the advantage of removing ambiguities from the formal grammar rather than doing so in a separate set of rules -- for that reason I'd prefer that our formal specification uses that formulation or one like it rather than a precedence table even if we don't use a partial precedence ordering.
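As a rough illustration of the precedence-climbing algorithm referred to above, here is a minimal sketch for a hypothetical mini-language (single-digit operands, `+`, `-`, `*`, no whitespace, all operators left-associative). It uses integer levels, so it models a total order only. Supporting a partial order like the one in #168 would presumably mean replacing the integer comparison with a relation that can also answer "unordered", which is an assumption and is not shown here.

```cpp
#include <cassert>
#include <cctype>
#include <cstddef>
#include <string>

// Hypothetical evaluator for single-digit operands and the operators + - *.
struct Parser {
  std::string input;
  std::size_t pos = 0;

  // Binding strength of an operator, or -1 if `op` is not an operator.
  static int Precedence(char op) {
    switch (op) {
      case '+':
      case '-':
        return 1;
      case '*':
        return 2;
      default:
        return -1;
    }
  }

  static int Apply(char op, int lhs, int rhs) {
    switch (op) {
      case '+': return lhs + rhs;
      case '-': return lhs - rhs;
      default:  return lhs * rhs;
    }
  }

  int ParsePrimary() {
    assert(pos < input.size() &&
           std::isdigit(static_cast<unsigned char>(input[pos])));
    return input[pos++] - '0';
  }

  // Consumes operators whose precedence is at least `min_precedence`.
  int ParseExpression(int min_precedence) {
    int lhs = ParsePrimary();
    while (pos < input.size() && Precedence(input[pos]) >= min_precedence) {
      char op = input[pos++];
      // Left associativity: the right operand may only contain operators
      // that bind strictly tighter than `op`.
      int rhs = ParseExpression(Precedence(op) + 1);
      lhs = Apply(op, lhs, rhs);
    }
    return lhs;
  }
};

inline int Evaluate(const std::string& text) {
  Parser parser{text};
  return parser.ParseExpression(0);
}
```

For example, `Evaluate("1+2*3")` groups the multiplication first, and `Evaluate("8-3-2")` groups to the left.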

zygoloid · 2020-10-29T00:41:45Z

proposals/p0162.md

+struct Expression {
+  ExpressionKind tag;
+  union {
+    struct { string* name; } variable;
+    struct { Expression* aggregate; string* field; } get_field;
+    struct { Expression* aggregate; Expression* offset; } index;
+    struct { string* name; Expression* type; } pattern_variable;
+    int integer;
+    bool boolean;
+    struct { vector<pair<string,Expression*> >* fields; } tuple;
+    struct {
+      Operator operator_;
+      vector<Expression*>* arguments;
+    } primitive_op;
+    struct { Expression* function; Expression* argument; } call;
+    struct { Expression* parameter; Expression* return_type;} function_type;
+  } u;
+};


Is there a reason to prefer a union here over a class hierarchy? I think it would be clearer for expository purposes to use a class hierarchy.

In the executable semantics, I don't think you ever mutate the tag of an expression after creating it, and generally treating AST nodes as immutable seems preferable, especially in the executable semantics.

The use of enums and unions goes well with the use of switch statements to express dispatching on the kind of AST node. I much prefer switch statements to virtual functions for that purpose, because 1) it keeps the code together instead of spreading it out over many classes, 2) it is easier to access variables that are defined in the scope enclosing the switch statement, and 3) there is less extra boilerplate code compared to virtual functions.
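The enum-plus-union style described here can be sketched as follows. This is a trimmed-down, hypothetical variant of the `Expression` struct quoted above (three kinds only, `const char*` instead of `string*`), and `Describe` is an invented example function.

```cpp
#include <string>

// Hypothetical, reduced set of node kinds.
enum class ExpressionKind { Variable, Integer, Boolean };

struct Expression {
  ExpressionKind tag;
  union {
    struct { const char* name; } variable;
    int integer;
    bool boolean;
  } u;
};

// All cases live in one function and can use locals of the enclosing scope.
inline std::string Describe(const Expression& e) {
  switch (e.tag) {
    case ExpressionKind::Variable:
      return std::string("variable ") + e.u.variable.name;
    case ExpressionKind::Integer:
      return "integer " + std::to_string(e.u.integer);
    case ExpressionKind::Boolean:
      return e.u.boolean ? "true" : "false";
  }
  return "unknown";  // Only reached if `tag` holds an invalid value.
}
```

Because the `switch` has no `default` label, compilers such as GCC and Clang can warn when a newly added enumerator is not handled, provided the relevant warning (for example `-Wswitch`) is enabled.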

Why do you think that a class hierarchy is clearer for expository purposes? Is it that you think that the readers will be more used to thinking in terms of object-oriented ways to organize code, or do you have other reasons in mind?

The class hierarchy approach would avoid needing pointers everywhere (presumably done to keep the union members trivial) and would avoid the .u.foos all over the place. The fact that there's a "tag" field isn't part of the semantic model, it's an implementation detail, and one that we could hide with a class hierarchy approach.

With a small helper utility you can express dispatch on the node kind as something like:

visit(expr)
    .Case([&] (VariableExpr &v) { ... handle variable case ... })
    .Case([&] (GetFieldExpr &g) { ... handle get_field case ... })
    .Default([&] { ... handle default case ... });

... which I think would handle your concerns (1), (2), and (3).
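One possible shape for such a helper is sketched below. Only the names `visit`, `Case`, `Default`, `VariableExpr`, and `GetFieldExpr` come from the comment above. Everything else (`Expr`, `IntegerExpr`, `ArgOf`, `Visitor`, `Describe`, and the use of `dynamic_cast`) is an assumption made for illustration, not the helper the commenter had in mind.

```cpp
#include <string>
#include <type_traits>
#include <utility>

// Hypothetical class hierarchy standing in for the AST.
struct Expr { virtual ~Expr() = default; };
struct VariableExpr : Expr {
  explicit VariableExpr(std::string n) : name(std::move(n)) {}
  std::string name;
};
struct GetFieldExpr : Expr {
  explicit GetFieldExpr(std::string f) : field(std::move(f)) {}
  std::string field;
};
struct IntegerExpr : Expr {
  explicit IntegerExpr(int v) : value(v) {}
  int value;
};

// Extracts the node type from a non-mutable lambda taking one reference.
template <typename F>
struct ArgOf : ArgOf<decltype(&F::operator())> {};
template <typename C, typename R, typename A>
struct ArgOf<R (C::*)(A&) const> { using type = A; };

class Visitor {
 public:
  explicit Visitor(Expr& expr) : expr_(expr) {}

  // Runs `handler` if the node has the handler's parameter type and no
  // earlier case matched.
  template <typename F>
  Visitor& Case(F&& handler) {
    using Node = typename ArgOf<std::decay_t<F>>::type;
    if (!handled_) {
      if (auto* node = dynamic_cast<Node*>(&expr_)) {
        handler(*node);
        handled_ = true;
      }
    }
    return *this;
  }

  // Runs `handler` if no case matched.
  template <typename F>
  void Default(F&& handler) {
    if (!handled_) handler();
  }

 private:
  Expr& expr_;
  bool handled_ = false;
};

inline Visitor visit(Expr& expr) { return Visitor(expr); }

// Example use: the cases stay together and can capture enclosing locals.
inline std::string Describe(Expr& expr) {
  std::string result;
  visit(expr)
      .Case([&](VariableExpr& v) { result = "variable " + v.name; })
      .Case([&](GetFieldExpr& g) { result = "field " + g.field; })
      .Default([&] { result = "other"; });
  return result;
}
```

Note that this particular sketch tries the cases in order at run time and does nothing to check that every node kind is covered.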

Ahh, Smalltalk style control flow. Fun! Yes, I can see that working just fine.
Does this approach provide exhaustiveness checking for the cases? That's pretty important
when adding new features to a language.
The approach you're proposing has the great advantage of being more type safe than the
old-school enum-struct-union approach.
The downside is that it adds some complexity to the code, which affects readability and
reasoning about the semantics. For example, one needs to completely understand the order of evaluation
of the code to understand what the executable semantics is saying about the language being defined.
In this case I think the added complexity is relatively small, so that's not a big downside.

Regarding the pointers everywhere, yes, that was to keep the members trivial, which
was forced on me by Bison. I'd love to have a solution for that.

zygoloid · 2020-10-29T00:57:09Z

proposals/p0162.md

+The `pattern` non-terminal is defined in terms of `expression`. It could instead
+be the other way around, defining `expression` in terms of `pattern`.


From a specification perspective, I'd prefer that we define a parameterized grammar. One way to do this: add an optional suffix to each production, and say that where the suffix is omitted, the implication is that the production is the same for all possible suffixes. So:

patexp ::= patexp [ patexp ]
...
patexp_P ::= patexp_P : identifier
pattern ::= patexp_P
expression ::= patexp_E

where

patexp ::= patexp [ patexp ]

is shorthand for

patexp_P ::= patexp_P [ patexp_P ]
patexp_E ::= patexp_E [ patexp_E ]

The ECMA-262 specification does something like this, though their approach is a bit more explicit.
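The shorthand rule can be read as a mechanical expansion: a production written without a suffix stands for one copy per suffix. A small sketch of that expansion follows, assuming productions are written as whitespace-separated tokens. The function names `ApplySuffix` and `Expand` are hypothetical.

```cpp
#include <sstream>
#include <string>
#include <vector>

// Rewrites every token equal to `base` as `base` + "_" + `suffix`.
// Tokens are assumed to be separated by whitespace.
inline std::string ApplySuffix(const std::string& production,
                               const std::string& base,
                               const std::string& suffix) {
  std::istringstream tokens(production);
  std::string token;
  std::string result;
  while (tokens >> token) {
    if (!result.empty()) result += ' ';
    result += (token == base) ? base + "_" + suffix : token;
  }
  return result;
}

// Expands one suffix-less production into one production per suffix.
inline std::vector<std::string> Expand(
    const std::string& production, const std::string& base,
    const std::vector<std::string>& suffixes) {
  std::vector<std::string> expanded;
  for (const auto& suffix : suffixes) {
    expanded.push_back(ApplySuffix(production, base, suffix));
  }
  return expanded;
}
```

Calling `Expand("patexp ::= patexp [ patexp ]", "patexp", {"P", "E"})` yields the two suffixed productions listed in the comment above.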

gribozavr · 2020-11-10T17:48:37Z

proposals/p0162.md

+The grammar rule
+
+```Bison
+expression:  "fnty" tuple return_type
+```


I filed issue #188 "Repeated 'fnty' in curried function types".

jonmeow

I've filed #191 and #192 on behalf of the core team to track some important open questions arising from this proposal. However, please go ahead and commit this proposal as-is. #190 and opening tracking issues should be able to address any remaining open questions.

jonmeow · 2020-11-10T22:15:10Z

proposals/p0162.md

+The grammar rule
+
+```Bison
+expression:  "fnty" tuple return_type
+```
+
+is for function types. They are meant to play the role that function pointers
+play in C and that `std::function` plays in C++.


Tracking issue for this question: #191

jonmeow · 2020-11-30T18:15:55Z

@jsiek This PR would have been fine to commit after being accepted. Can you commit it?

jsiek · 2020-11-30T18:21:57Z

@jonmeow I'm not sure what you mean by "commit" it. Do you mean "merge"? I see a button for that but it says I'm not authorized.

jonmeow · 2020-11-30T19:35:18Z

Yes, merge. Can you resolve the README.md conflict? (merge changes from trunk to your branch, pre-commit run -a, add the generated changes) I'll check with Chandler if he had an intent behind restricting merges to the core team and maintainers.

jsiek · 2020-11-30T21:53:27Z

Ok, looks like I've fixed the conflict in the README. Please go ahead and merge.

jonmeow · 2020-11-30T23:00:25Z

I think you should have access to merge now: can you please try, to verify? (not quite set up how I'd like long-term, but should be good enough for now)

jsiek · 2020-12-01T14:09:00Z

I don't see a button for merge on this github.com page. How do I merge this PR into trunk?

jsiek · 2020-12-01T16:56:32Z

Yeah, the merge button appeared!

jonmeow · 2020-12-01T17:19:14Z

👍

- [Proposal PR](#162)
- [RFC topic](https://forums.carbon-lang.dev/t/rfc-basic-syntax/142)
- [Decision request](https://forums.carbon-lang.dev/t/request-for-decision-basic-syntax-162/165)
- [Decision announcement](https://forums.carbon-lang.dev/t/accepted-basic-syntax-162/170)

Tracking issues filed as part of decision:

- Should there be a function type? #191
- Should types be values? #192

Finalized on 2020-11-24

jonmeow · 2021-01-13T00:28:33Z

I just noticed the executable syntax code hasn't made it to a separate PR. Would it help if I pull that code together for commit?

jsiek · 2021-01-13T17:54:22Z

Hi Jon, that would be great. The code that exactly corresponds to the basic syntax proposal is no longer on github... I removed it as per your request. It's on my hard drive. How would you like me to communicate that to you?

P.S. The executable-semantics branch on my fork of carbon-lang has a bunch of extra stuff, experiments, that were not in the basic syntax proposal.

jonmeow · 2021-01-13T18:44:24Z

I've done a first pass pulling in the code from the previous state of this PR:
#237

I'll work on cleaning that up (still plenty to do, it's not compiling for me at present). If you think it's worth pulling in the rest at the same time, an easy way to share it would be to commit it to a branch on your fork and/or make a PR (could even close the PR once done). I could grab it via GitHub via either of those. An alternative would be that, once I get that cleaned up for commit, maybe it'll be easier for you to contribute more incrementally? Perhaps we can figure that out once this first lump is in?

jsiek · 2021-01-13T18:48:10Z

Ahh, good idea to grab the code from the previous state.

I think it will be easier to go with the incremental approach.
Thanks!

* first draft of proposal for basic syntax
* rename proposal
* formating
* fixing typos
* precendence and associativity
* minor edit
* added abstract syntax
* some revisions based on feedback from meeting today
* optional return type
* oops, not optional for function declarations
* replacing abbreviations with full names
* name changes
* Update proposals/p0162.md Co-authored-by: Dmitri Gribenko <[email protected]>
* removed = from precedence table
* for fun decl, back to optional return type, shorthand for void return
* more fiddling with return types
* change base case of statement_list to empty
* added pattern non-terminal
* added expression style function definitions
* change && to and, || to or
* change and to have equal precedence
* updating text to match grammar, fix typo in grammar regarding pattern
* adding trailing comma thing to tuples
* removed 'alt' keyword, not necessary
* flipped expression and pattern
* added period for named arguments and documented the reason
* change alternative syntax to use tuple instead of expression
* comment about abstract syntax
* code block language annotations
* added alternative designs, some cleanup for pre-commit
* spell checking
* filled out the TOC
* minor edits
* minor edit
* trying to fix pre-commit error
* changes from pre-commit?
* changes based on meeting today
* describe alternatives regarding methods
* more rationale in discussion of alternatives
* addressing comments
* typo fix, added text about next steps
* removed *, changed a ! to not
* Update proposals/p0162.md Co-authored-by: Geoff Romer <[email protected]>
* edits from feedback
* pre-commit
* added second reason for period in field initializer
* added executable semantics, fixed misunderstanding regarding associativity
* update README
* added a paragraph about tuples and tuple types
* Update proposals/p0162.md Co-authored-by: josh11b <[email protected]>
* addressing comments from josh11b
* Update executable-semantics/README.md Co-authored-by: Dmitri Gribenko <[email protected]>
* link to implicit parameters in generics proposal
* added non-terminal for designator as per geoffromer's suggestion
* changed handling of tuples in function call and similar places as per josh11b and zygoloid
* moving code to separate PR

Co-authored-by: Dmitri Gribenko <[email protected]>
Co-authored-by: Geoff Romer <[email protected]>
Co-authored-by: josh11b <[email protected]>

first draft of proposal for basic syntax

b43a9ce

googlebot added the "cla: yes" label (PR meets CLA requirements according to bot) Sep 19, 2020

jsiek added the "proposal" and "WIP" labels Sep 19, 2020

jsiek added 3 commits September 19, 2020 15:08

rename proposal

53b8dd5

formating

f048c11

fixing typos

86cedc3

jsiek changed the title ~~first draft of proposal for basic syntax~~ Basic Syntax Sep 20, 2020

jsiek added 3 commits September 21, 2020 13:47

precendence and associativity

eed621c

minor edit

e26781c

added abstract syntax

c8d833e

josh11b reviewed Sep 23, 2020

chandlerc reviewed Sep 23, 2020

jsiek added 5 commits September 23, 2020 14:49

some revisions based on feedback from meeting today

974d91a

optional return type

2279899

oops, not optional for function declarations

42068f7

replacing abbreviations with full names

c6daf89

name changes

556deab

gribozavr reviewed Sep 24, 2020

jsiek and others added 2 commits September 24, 2020 16:59

Update proposals/p0162.md

0855d05

Co-authored-by: Dmitri Gribenko <[email protected]>

removed = from precedence table

b02522b

josh11b reviewed Sep 24, 2020

jsiek added 7 commits September 25, 2020 09:02

for fun decl, back to optional return type, shorthand for void return

43c4b9f

more fiddling with return types

28bfaf5

change base case of statement_list to empty

3ef030a

added pattern non-terminal

dd4cf52

added expression style function definitions

34b50f2

change && to and, || to or

5f95ac3

change and to have equal precedence

2cc4f67

zygoloid reviewed Oct 29, 2020

jsiek added 3 commits October 29, 2020 09:18

added non-terminal for designator as per geoffromer's suggestion

535d46f

changed handling of tuples in function call and similar places as per josh11b and zygoloid

1036bec

moving code to separate PR

2a81ff4

gribozavr mentioned this pull request Nov 10, 2020

Repeated 'fnty' in curried function types #188

Open

gribozavr reviewed Nov 10, 2020

jonmeow mentioned this pull request Nov 10, 2020

Decision for: Basic Syntax #162 #190

Merged

jonmeow added the "proposal accepted" label (Decision made, proposal accepted) and removed the "comment deadline" label Nov 10, 2020

This was referenced Nov 10, 2020

Should there be a function type? #191

Closed

Should types be values? #192

Closed

jonmeow approved these changes Nov 10, 2020

resolved merge conflict in proposals/README

6020b32

jsiek merged commit e07d945 into carbon-language:trunk Dec 1, 2020

zygoloid mentioned this pull request Apr 16, 2021

Add operator precedence parser. #464

Merged

zygoloid mentioned this pull request Feb 24, 2022

Implement the current set of rules for precedence partial ordering #1096

Merged
