Feature: scope context variables decorators (input, output, internal) #112

denismerigoux · 2021-04-30T16:36:35Z

The problem

Scopes in Catala can have many context variables. But as the number of context variables grows, it is more and more difficult to figure out which of these variables are output, which are input and which ones are intermediate variables that are not relevant from outside the scope. Catala users have already started using comments to annotate scope context variable declarations with this classification.

While it is the essence of Catala that context variables are neither inputs nor outputs by default, since they can be redefined by a calling scope, we could benefit from user annotations to enable helper lints and better code generation in the different backends.

Specification of the decorations

A regular context variable declaration looks like this:

scope Foo:
  context a content bool

This proposal would allow replacing the context keyword with one of the following:

input: this scope variable must be defined by the caller and cannot be defined in the scope
output: this scope variable cannot be defined by the caller and has to be defined in the scope
internal: this scope variable cannot be defined by the caller, has to be defined in the scope, and does not appear in the outputs
(no keyword): this scope variable can be defined by the caller, can be defined in the scope, and appears in the outputs

The classifications internal/input/context on the one hand, and internal/output on the other hand, form two independent classifications for, respectively, the input and output behavior of a scope variable. Combining these two independent choices, with 3 and 2 options respectively, yields 6 possibilities for fully qualifying the input/output behavior of a scope variable:

internal
output
input
input output
context
context output
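
As a minimal sketch of how these six qualifiers arise, the two axes can be enumerated as a Cartesian product. The names `INPUT_KINDS`, `OUTPUT_KINDS` and `qualifier` are hypothetical and are not taken from the Catala compiler; only the keywords themselves come from the proposal.

```python
from itertools import product

# Hypothetical names: the proposal only fixes the surface keywords.
INPUT_KINDS = ["internal", "input", "context"]  # may the caller define the variable?
OUTPUT_KINDS = ["internal", "output"]           # is the variable visible to the caller?


def qualifier(input_kind: str, output_kind: str) -> str:
    """Render one choice per axis as the keyword(s) written in a declaration."""
    words = []
    if input_kind != "internal":
        words.append(input_kind)
    if output_kind != "internal":
        words.append(output_kind)
    # When both axes are "internal", a single `internal` keyword is written.
    return " ".join(words) if words else "internal"


ALL_QUALIFIERS = [qualifier(i, o) for i, o in product(INPUT_KINDS, OUTPUT_KINDS)]
print(ALL_QUALIFIERS)
```

The enumeration order follows the order of the two lists, so it reproduces the list above from `internal` to `context output`.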

This specification informally defines two permissiveness lattices between the kinds of scope variables. Here they are, with the most permissive kind at the top:

 CONTEXT
    | 
    |  
  INPUT                       OUTPUT
    |                            |
    |                            |
 INTERNAL                     INTERNAL

Linting

If we have these four kinds of variables, we can enforce their specification in four different ways.

ERROR LINT: When calling a subscope Foo, we can ensure that all the variables of Foo redefined in the caller are either context or input, but not internal.
ERROR LINT: When calling a subscope Foo, we can ensure that all the variables of Foo used (as outputs) in defining variables of the caller are output, but not internal.
WARNING LINT: Inside a scope, we can ensure all variables defined are either context or internal
WARNING LINT: Inside a scope, we can check that all variables marked as output or internal have at least one definition.
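
The two error lints and the last warning lint could be sketched as follows. This is only an illustration, not the compiler's implementation: the encoding of kinds as strings and the function names are assumptions, and `"context"` stands for both the context keyword and the no-keyword default. The sketch also assumes that a context variable may be read by the caller, since it appears in the outputs.

```python
from typing import Dict, Iterable, List

# Hypothetical encoding: each variable of the callee maps to its keyword.
CALLER_MAY_DEFINE = {"context", "input"}
CALLER_MAY_READ = {"context", "output"}   # assumption: context is readable
NEEDS_DEFINITION = {"output", "internal"}


def lint_subscope_call(kinds: Dict[str, str],
                       redefined: Iterable[str],
                       read: Iterable[str]) -> List[str]:
    """Error lints: check what a caller redefines and reads in a subscope."""
    errors = []
    for name in redefined:
        if kinds[name] not in CALLER_MAY_DEFINE:
            errors.append(f"error: caller redefines {kinds[name]} variable '{name}'")
    for name in read:
        if kinds[name] not in CALLER_MAY_READ:
            errors.append(f"error: caller reads {kinds[name]} variable '{name}'")
    return errors


def lint_missing_definitions(kinds: Dict[str, str],
                             defined: Iterable[str]) -> List[str]:
    """Warning lint: output and internal variables need at least one definition."""
    defined_names = set(defined)
    return [
        f"warning: {kind} variable '{name}' has no definition"
        for name, kind in kinds.items()
        if kind in NEEDS_DEFINITION and name not in defined_names
    ]


# Scope Foo with one internal, one output and one context variable.
foo = {"x": "internal", "y": "output", "z": "context"}
call_errors = lint_subscope_call(foo, redefined=["x", "z"], read=["x", "y"])
definition_warnings = lint_missing_definitions(foo, defined=["x"])
print(call_errors)
print(definition_warnings)
```

Here the caller is flagged twice for touching the internal variable `x`, and `Foo` itself gets a warning because its output `y` is never defined.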

Code generation

The decorations can also help us generate code with simpler signatures than the current compilation scheme, which exposes all context variables in both the input and output structs of the scope. More specifically:

The input struct should contain the (no keyword) and input variables, but not the internal or output variables
The output struct should contain the (no keyword) and output variables, but not the internal or input variables
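
A minimal sketch of this selection rule, assuming a declaration is represented as a mapping from variable name to keyword, with `None` for the (no keyword) case. The function name `struct_fields` and this representation are hypothetical, not the compiler's actual data structures.

```python
from typing import Dict, List, Optional, Tuple


def struct_fields(decls: Dict[str, Optional[str]]) -> Tuple[List[str], List[str]]:
    """Split scope variables into input-struct and output-struct fields.

    `decls` maps each variable name to "input", "output", "internal",
    or None when the declaration carries no keyword.
    """
    input_fields = [name for name, kw in decls.items() if kw in (None, "input")]
    output_fields = [name for name, kw in decls.items() if kw in (None, "output")]
    return input_fields, output_fields


# Hypothetical scope with one variable of each kind.
decls = {"x": "internal", "y": "output", "z": None, "w": "input"}
input_fields, output_fields = struct_fields(decls)
print(input_fields, output_fields)
```

The internal variable `x` appears in neither struct, which is what makes the generated signatures smaller than exposing every variable on both sides.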

Implementation

The implementation of this feature will impact quite a lot of areas of the compiler:

Adding syntax keywords
Extending the surface, desugared and scopelang intermediate representations with the kind for each scope variable
Implementing the lints presented above in the scopelang intermediate representation
Modifying the dcalc and lcalc translations to use the variable kind information according to the specification above
Fixing the OCaml backend


msprotz · 2021-04-30T16:47:34Z

Thanks Denis for summarizing the proposal! Here are a few suggestions to simplify, or at least to have a first simplified design that can be refined incrementally.

Can we (for the time being) leave context out of the discussion since it's the default behavior?
Linting:
- caller-redefined variables cannot be internal or output
- caller-bound variables cannot be internal or input
- points 3. and 4. become optional
Code-gen
- input and internal variables do not appear in the output struct
- output and internal variables do not appear in the input struct

What do you think?

denismerigoux · 2021-04-30T17:16:02Z

These are all good suggestions, I updated the design post above accordingly.

EmileRolley · 2021-06-09T09:20:52Z

I don't really get the differences between the no keyword and the current context keyword.

denismerigoux · 2021-06-09T14:33:10Z

You read correctly, because there is none :) I propose we remove the context keyword for the default case after Jonathan's remark:

Can we (for the time being) leave context out of the discussion since it's the default behavior?

EmileRolley · 2021-06-09T15:00:15Z

For me the keyword context is meaningful. Is there any reason not to simply keep context and add the keywords internal, input and output?

denismerigoux · 2021-06-09T15:20:03Z

Having no keyword for the context case is a nudge for programmers to clarify the role of their scope variables. Compare

declaration scope Foo:
  internal x content integer
  output y content boolean
  context z content date

with

declaration scope Foo:
  internal x content integer
  output y content boolean
  z content date

It is more obvious in the second version that something is missing to qualify z, which we want to encourage the programmer to do since it clarifies the use. Also in the case where the programmer has not yet labeled the scope parameters, it is more convenient to write

declaration scope Foo:
  x content integer
  y content boolean
  z content date

rather than

declaration scope Foo:
  context x content integer
  context y content boolean
  context z content date

All of these observations lead me to refine my proposal. I propose that we allow both (no keyword) and context, both having the same semantics.

EmileRolley · 2021-06-09T18:36:19Z

Okay, I agree with you. Do you think I can handle the implementation?

denismerigoux · 2021-06-10T07:50:28Z

This is definitely more ambitious than the wildcard issue, since you have to go down the entire compilation stack. The general architecture is presented here https://catala-lang.org/ocaml_docs/catala/index.html, and the formalization is here https://hal.inria.fr/hal-03159939. I guess you can take a look at those, and we can schedule a call next week to sync up before you start. Is that good for you?

EmileRolley · 2021-06-11T04:30:56Z

Yes, thanks. I guess I can start to look at it and write down some questions.

denismerigoux · 2022-02-14T11:23:56Z

Implemented in #185 and #189.

denismerigoux added ✨ enhancement New feature or request 🔧 compiler Issue concerns the compiler 💡 language Language design labels Apr 30, 2021

denismerigoux added the 👪 help wanted Extra attention is needed label Apr 30, 2021

msprotz mentioned this issue Sep 17, 2021

Support for easily carrying variables into a sub-scope #140

Open

denismerigoux mentioned this issue Jan 11, 2022

Catala as a proof platform #175

Merged

2 tasks

denismerigoux self-assigned this Jan 21, 2022

denismerigoux added this to the Catala as a Proof platform milestone Jan 21, 2022

denismerigoux mentioned this issue Jan 21, 2022

Inlining variables in verification conditions #183

Open

denismerigoux removed the 👪 help wanted Extra attention is needed label Jan 21, 2022

denismerigoux added a commit that referenced this issue Jan 27, 2022

Added parsing support for #112, missing all later compilation steps

05a0bfc

denismerigoux mentioned this issue Jan 27, 2022

Cleaning up scopelang encoding and adding some default optimizations (beginning of #112) #185

Merged

5 tasks

denismerigoux added a commit that referenced this issue Feb 2, 2022

Merge pull request #185 from CatalaLang/io-qualifiers-112

2081ac0

Cleaning up scopelang encoding and adding some default optimizations (beginning of #112)

denismerigoux mentioned this issue Feb 4, 2022

Implementation of scope variable visibility qualifiers #189

Merged

9 tasks

denismerigoux closed this as completed Feb 14, 2022
