Run info in notebook metadata #106747

jrieken · 2020-09-15T11:18:47Z

We have NotebookCellMetadata#runnable and NotebookDocumentMetadata#runnable and it's unclear what this means given that we have support to execute cells/notebooks with different kernels. Should a kernel update all cells when becoming active? Would it be better if kernels somehow express what cells they can execute?

jrieken · 2021-02-12T14:49:45Z

Let's add this to the Feb discussions backlog. IMO the "runnability" is determined by:

having a kernel or not
the kernel supporting the language of the code cell
the cell being a code cell

So, in my head there is no metadata that can say "runnable". I also believe that this cut helps in decoupling the notebook content provider APIs from the kernel/execution API
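As an illustration of the three conditions above, inferring runnability could look roughly like this. It is a minimal sketch: `Cell`, `Kernel`, `supportedLanguages` and `isRunnable` are hypothetical names made up for the example, not the actual notebook API.

```typescript
// Hypothetical, simplified shapes; these are not the real VS Code API types.
type CellKind = "code" | "markdown";

interface Cell {
  kind: CellKind;
  language: string;
}

interface Kernel {
  label: string;
  supportedLanguages: string[];
}

// A cell is runnable only if all three conditions hold:
// a kernel exists, the cell is a code cell, and the kernel supports its language.
function isRunnable(cell: Cell, kernel: Kernel | undefined): boolean {
  if (kernel === undefined) {
    return false; // no kernel selected
  }
  if (cell.kind !== "code") {
    return false; // only code cells can execute
  }
  return kernel.supportedLanguages.some((lang) => lang === cell.language);
}

const pythonKernel: Kernel = { label: "Python 3", supportedLanguages: ["python"] };
const codeCell: Cell = { kind: "code", language: "python" };
const markdownCell: Cell = { kind: "markdown", language: "markdown" };
```

Nothing here is stored in the notebook document, so the content provider never has to know about kernels.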

roblourens · 2021-02-23T05:14:23Z

I pushed the change to remove the runnable metadata and only infer it, but then reverted it, because we need to think about how this works with untrusted notebooks.

What we really need is to get Jupyter onto real vscode trusted workspaces; I pointed @DonJayamanne towards Steven for that. In the meantime, we need to make sure that they still have a way to accomplish this. It looks like Jupyter still attaches a kernel when the notebook is untrusted.

@DonJayamanne would it be ok to not provide a kernel when the notebook/workspace is untrusted, to cause cells to not be runnable? Or do you need kernels to still be available?

jrieken · 2021-02-23T07:54:26Z

Can't we just control this? We know whether a workspace or a notebook is trusted, and when it isn't we simply don't ask for kernels (or we prevent execution).

DonJayamanne · 2021-02-23T15:47:29Z

would it be ok to not provide a kernel when the notebook/workspace is untrusted

Yes that's ok, we don't need a kernel when a notebook is not trusted.

roblourens · 2021-02-23T18:50:08Z

Yeah @jrieken, I think we can discuss whether this would be enforced by vscode or by an extension, since many extensions probably don't need to care about workspace trust. And I guess we already added some notebook metadata related to trust? We can discuss how trust should work, but it sounds like I can re-merge the change next week.

I'll ping again when I do it @DonJayamanne, but runnable metadata will go away, and you will want to not return a kernel when a notebook is not trusted.

DonJayamanne · 2021-02-23T21:15:43Z

I'll ping again when I do it @DonJayamanne, but runnable metadata will go away, and you will want to not return a kernel when a notebook is not trusted.

Hmm, interesting. Who should own the concept of trust here?
If the content provider says that a notebook is not trusted, then shouldn't it be able to control whether any kernel can run any of its code.

It seems like with the proposed model we're passing responsibility for trust to a kernel.
I.e. if a different kernel exists that can run a notebook, then VS Code will still let the user run that notebook, even when the contents are not trusted.

How do we protect the user here? Personally I think the content provider should protect the user here.

An excellent example is the .NET Interactive extension today; they provide kernels as well.
But Jupyter owns ipynb files, hence if an ipynb is not trusted, Jupyter will not return a kernel (based on the suggestion). However, the .NET extension could still return a kernel, and they could end up running code in an untrusted notebook.
Sure, they too can then check whether the notebook is trusted, etc., but it feels like this should be done higher up, so that kernels don't have to check whether they are allowed to do something or not.

I.e. execution should be blocked, as opposed to making it an opt out.
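To make the "blocked higher up" idea concrete, here is a minimal sketch of a single gate that every execution request passes through, whichever extension contributed the kernel. All names (`NotebookDoc`, `ExecKernel`, `requestExecution`) are hypothetical and only illustrate the idea; this is not an existing VS Code or Jupyter API.

```typescript
// Hypothetical types for illustration; not the real VS Code or Jupyter API.
interface NotebookDoc {
  uri: string;
  trusted: boolean; // assumed to be decided by the content provider or the editor
}

interface ExecKernel {
  id: string;
  execute(code: string): string;
}

type ExecutionResult =
  | { status: "ok"; output: string }
  | { status: "blocked"; reason: string };

// The gate sits in front of every kernel, so a kernel cannot opt out of the check
// and does not need to implement it either.
function requestExecution(doc: NotebookDoc, kernel: ExecKernel, code: string): ExecutionResult {
  if (!doc.trusted) {
    return { status: "blocked", reason: doc.uri + " is not trusted" };
  }
  return { status: "ok", output: kernel.execute(code) };
}

// Two kernels from different (made-up) extensions hit the same gate.
const jupyterKernel: ExecKernel = { id: "jupyter", execute: (code) => "jupyter ran: " + code };
const dotnetKernel: ExecKernel = { id: "dotnet", execute: (code) => "dotnet ran: " + code };
```

With this shape, an untrusted notebook is blocked for both kernels without either kernel checking anything.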

DonJayamanne · 2021-02-23T21:19:03Z

Can't we just control this? We know whether a workspace or a notebook is trusted and in that case we simply don't ask fo

Will workspace trust make trusting notebooks unnecessary? Or do we still need both?
If both, then the extension owns the notion of trust, hence we need to make changes at our end, but what about other kernels running code against untrusted notebooks?

jrieken · 2021-02-24T08:14:59Z

If the content provider says that a notebook is not trusted, then shouldn't it be able to control whether any kernel can run any of its code.

That's a great point and I totally agree that you need to trust the kernel. About the document I am unsure. For me there are two kinds of trust:

trust the kernel executing arbitrary code
trust existing output that "renders" by running code, e.g some html/js output, no kernel attached

So, I am actually unsure what we are trying to achieve. Trust the kernel, trust the output, or have two separate trusts, one for each kind? @DonJayamanne maybe you can help to clarify what we are aiming for or what other notebook UIs do.

roblourens · 2021-02-24T18:09:10Z

I think the only thing in scope here is trusting the content of a workspace. Jupyter's model is only about trusting the notebook document's content, right? That's compatible with what vscode is implementing in workspace trust.

Trusting a kernel/extension is up to the user.

DonJayamanne · 2021-02-24T19:58:23Z

Jupyter's model is only about trusting the notebook document's content, right

I'd say yes, but we also prevent users from running a kernel against an untrusted notebook; basically the notebook is read-only, so they cannot execute the code if they don't trust it.
Sounds like VS Code will still allow users to run code in a document that is not trusted.

maybe you can help to clarify what we are aiming for or what other notebook UIs do

Time to bring in the big guns. Hopefully our PM and manager can provide the requirements clearly (or change how we work).

Pinging @claudiaregio @greazer

roblourens · 2021-02-24T20:49:00Z

Sounds like VS Code will still allow users to run code in a document that is not trusted.

I don't think we are saying that necessarily. I think we're trying to decide who enforces the rule or what the rule is.

And a key difference here is that vscode's model trusts entire workspaces, not individual documents like Jupyter. There's no problem with that, right?

DonJayamanne · 2021-02-24T21:47:39Z

And a key difference here is that vscode's model trusts entire workspaces, not individual documents like Jupyter. There's no problem with that, right?

@claudiaregio @greazer /cc

greazer · 2021-02-25T02:12:51Z

Big guns... haha...

I'm not clear on which question I'm being asked to give an opinion about. What I do know is that cell renderers (i.e. cell outputs) that can execute code should not be rendered on opening a file, folder or workspace until the user explicitly trusts every cell in an opened file. I don't think it matters what kernel is going to be used to run the notebook. If the file is part of a trusted workspace, then the file should automatically be considered trusted.

It's also true that other notebook extensions may not have a need to support the notion of trust as their outputs can never be some script that can run arbitrary code.

Finally, the way Jupyter Classic and Jupyter Lab models work is to tie trust to each cell output. So that means that when a notebook is loaded no output is shown. At this point the user can explicitly state that the notebook is trusted so as to cause each output to be made visible. OR they can run a cell or set of cells. Running a cell implies that a cell is trusted. If the user runs all the cells, then the entire notebook is considered trusted automatically.
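The behaviour described above can be modelled roughly as follows. This is a simplified sketch of the description only; the names are hypothetical and it is not how Jupyter actually stores or verifies trust.

```typescript
// Simplified model of the behaviour described above; all names are hypothetical.
interface OutputCell {
  source: string;
  output: string;
  trusted: boolean;
}

// Every cell starts untrusted when a notebook is loaded, so no output is shown.
function loadNotebook(cells: { source: string; output: string }[]): OutputCell[] {
  return cells.map((c) => ({ source: c.source, output: c.output, trusted: false }));
}

// Only the outputs of trusted cells are visible.
function visibleOutputs(cells: OutputCell[]): string[] {
  return cells.filter((c) => c.trusted).map((c) => c.output);
}

// Running a cell implies that the user trusts it.
function runCell(cell: OutputCell, execute: (source: string) => string): void {
  cell.output = execute(cell.source);
  cell.trusted = true;
}

// Explicitly trusting the notebook makes every output visible.
function trustNotebook(cells: OutputCell[]): void {
  cells.forEach((c) => {
    c.trusted = true;
  });
}

// The notebook counts as trusted once every cell is trusted,
// for example after the user has run all cells.
function isNotebookTrusted(cells: OutputCell[]): boolean {
  return cells.every((c) => c.trusted);
}
```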

So based on all of that, have we considered adding some sort of "can-run-arbitrary-code" type capability setting for renderers? If we supported this model, then it seems like VS Code should own the notion of trust for files. Maybe we already have something like this that I'm not aware of (sorry if that's the case :)
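A sketch of what such a capability could look like, combined with the "trusted workspace implies trusted file" rule mentioned earlier. The `canRunArbitraryCode` flag, `TrustState` and `shouldRender` are hypothetical names for illustration, not an existing renderer API.

```typescript
// Hypothetical renderer description; `canRunArbitraryCode` is an assumed flag,
// not part of any existing renderer contribution.
interface RendererInfo {
  id: string;
  canRunArbitraryCode: boolean;
}

interface TrustState {
  workspaceTrusted: boolean;
  fileTrusted: boolean; // explicit, per-file trust given by the user
}

// A file in a trusted workspace is automatically considered trusted.
function isFileTrusted(trust: TrustState): boolean {
  return trust.workspaceTrusted || trust.fileTrusted;
}

// Renderers that cannot run code always render;
// renderers that can run arbitrary code render only for trusted files.
function shouldRender(renderer: RendererInfo, trust: TrustState): boolean {
  return !renderer.canRunArbitraryCode || isFileTrusted(trust);
}

const plainText: RendererInfo = { id: "text/plain", canRunArbitraryCode: false };
const html: RendererInfo = { id: "text/html", canRunArbitraryCode: true };
```

Under this model the editor, rather than each extension, decides whether an output is shown, and extensions whose outputs can never run script need no trust handling at all.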

Closer to what this issue was opened about, I think a question is whether we could or should duplicate the read-only nature of an untrusted notebook as we supported in our Jupyter extension. I don't really think that it's necessary, but if any extension owner would like to enforce read-only because their source language can be used in a way that obfuscates what code is run, or they're afraid their users can be easily tricked into running cells without realizing what they're doing, then having a way to disable running cells altogether until a notebook is trusted would be needed.

Hopefully this answered some questions?

roblourens · 2021-03-09T19:56:15Z

I will do some more thinking about notebook trust on our side. But for now I intend to merge the original change today, FYI @DonJayamanne

jrieken added the api, notebook, and under-discussion labels Sep 15, 2020

jrieken mentioned this issue Sep 15, 2020

Notebook API evolution #106744

Closed

aeschli assigned rebornix Sep 15, 2020

rebornix assigned roblourens Oct 5, 2020

rebornix added this to the October 2020 milestone Oct 5, 2020

rebornix modified the milestones: October 2020, On Deck Oct 26, 2020

jrieken modified the milestones: On Deck, February 2021 Feb 12, 2021

rebornix removed their assignment Feb 22, 2021

roblourens added a commit that referenced this issue Feb 23, 2021

Begin eliminating cell runnable metadata

65711c6

#106747

roblourens closed this as completed in ce45b0d Feb 23, 2021

roblourens reopened this Feb 23, 2021

roblourens modified the milestones: February 2021, March 2021 Feb 23, 2021

roblourens mentioned this issue Mar 9, 2021

📢 Notebook API announcements #93265

Closed

roblourens closed this as completed in 225a8c2 Mar 9, 2021

roblourens added a commit that referenced this issue Mar 9, 2021

Remove notebook document runnable metadata

bcb8c8a

#106747

roblourens mentioned this issue Mar 9, 2021

Explore notebook trust + workspace trust #118584

Closed

DonJayamanne mentioned this issue Mar 16, 2021

Add context keys for cell execution state (to be used in cell toolbars) #119132

Closed

github-actions bot locked and limited conversation to collaborators Apr 23, 2021
