Plan for external communication / isolation #800

RalfJung · 2019-06-29T11:02:53Z

Currently, the Miri interpreter only allows one kind of communication between the executed program and the outside world: printing to stdout/stderr. In the long run, that will not be enough. There were already requests for access to the system time, and it seems reasonable to ask for file system or network access. Getting proper randomness into the program might also be interesting.

This issue is basically the current status of #653, now that we [soon will] have a deterministically seeded RNG available in Miri all the time -- so much of the discussion there no longer applies.

It probably makes sense to allow external communication per default, just to get more programs running. But isolation is also a useful property, so I propose a -Zmiri-isolate flag to "turn off" external communication.

Assuming that's not very controversial, we have to decide what to do with the two existing systems we have in place that try hard to avoid allowing external communication:

Environment variables. Should we just forward setting/getting env vars to the outside OS per default, and only keep using our current env var emulation layer when -Zmiri-isolate is set? That would resolve Support for accessing host environment variables #670.
getrandom. Should we just ask the OS for "real" randomness per default, and only ask our internal RNG when -Zmiri-isolate is set?

Current status

Add off-by-default "external communication" mode. Tentative name of the flag: ´-Zmiri-enable-communication.
Allow interpreted program to access host env vars if external communication is enabled.
Allow interpreted program to access host randomness if external communication is enabled.
Come up with a less clumsy name for the flag that enables communication. Maybe -Zmiri-disable-isolation? Is that really less clumsy? -Zmiri-no-isolate?
Turn external communication on by default?

The text was updated successfully, but these errors were encountered:

pvdrz · 2019-08-06T15:45:00Z

I started working on the environment variables part of this issue. I added a new -Zmiri-enable-communication flag for this and I'm updating the get/set/unset env shims.

I am going to try to write the raw bytes of each env-var value and see what happens.

RalfJung · 2019-08-06T19:00:20Z

I'm updating the get/set/unset env shims.

I suggest introducing a new module, shims/env.rs, and moving all the env-related logic there.

pvdrz · 2019-08-06T19:03:55Z

Will do, I haven't decided what's the best way of handle allocations for the host environment variables. Currently I am creating temporary allocations for the variables and then deallocating them after writing.

Is it there any way to write some bytes directly into a PlaceTy instead of having to call write_scalar?

oli-obk · 2019-08-06T19:15:26Z

Can't you invoke the existing setenv logic?

pvdrz · 2019-08-06T19:18:17Z

What do you mean?

RalfJung · 2019-08-06T19:40:50Z

There's already code that does all the work of "given a C string, create an allocation for it". You should be able to re-use that. It's in the setenv shim.

The new thing here is that instead of doing this on setenv, it'll have to happen on getenv. So each getenv creates a new allocation that is then leaked. We should probably find a way to collect those, but for a first implementation that would be enough.

pvdrz · 2019-08-06T19:44:44Z

Oh ok yeah, That is what I am doing right now. I am deallocating each temporary allocation after using it.

oli-obk · 2019-08-06T19:57:53Z

Ah, I thought we'd just iterate over the env at startup and clone it into the miri env.

pvdrz · 2019-08-06T20:06:30Z

Well that might be easier and it would work unles someone decides to modify an env var while miri is running or something.

RalfJung · 2019-08-06T20:18:03Z

Ah, I thought we'd just iterate over the env at startup and clone it into the miri env.

Ah. That would fail to reflect later changes from "the outside" -- which is possible in principle. But I would be fine with this strategy as well.

oli-obk · 2019-08-06T20:31:17Z

Uh... I didn't know you could change env vars of a program while it was running

RalfJung · 2019-08-06T20:32:28Z

Hm, maybe you're not supposed to. But on Linux I suppose you could try to write to /proc/$PID/environ...

EDIT: nope, that file is read-only. I guess you are right.

pvdrz · 2019-08-06T20:36:52Z

Apparently you can do it using gdb, but that seems to be an unlikely scenario.

RalfJung · 2019-08-06T20:38:16Z

Agreed. Scanning the host env on initialization is fine. That's great as it makes the "communicate" and "dont communicate" cases much more similar: the only difference is that one starts with an empty env, the other with the real one.

pvdrz · 2019-08-06T20:39:45Z

Ok I will start working on this

Edit: In order to be able to allocate the env vars, we would need access to Memory and TyCtxt. Should I do this inside create_ecx instead of doing it inside Evaluator::new?

@RalfJung

Enable env communication related issue: #800. r? @RalfJung

RalfJung · 2019-08-15T08:51:44Z

Status update: accessing host env vars is now possible with -Zmiri-enable-communication.

The name for that flag is preliminary, as is the fact that the current default for communication is off.

Also, IMO that flag should also make getrandom calls from the program use host randomness. Any objections to that?

oli-obk · 2019-08-15T09:24:52Z

Having a bunch of env vars given at startup feel very different to arbitrary communication at runtime, since everything is still deterministic given the same env vars

RalfJung · 2019-08-15T09:29:16Z

As far as I am concerned, the purpose of this "external communication" flag is to give up on determinism.

If we want to allow more things than "total isolation" while still being deterministic, I am open for that, but that's a gray area. For example, you could argue that allowing file access is also still deterministic if all accessed files are the same.

pvdrz · 2019-08-15T14:37:34Z

I think one could exploit something like the order of variables in #756 to do stuff in a non deterministic way.

Edit: nvm, if we are using the host's randomness this is not even a real concern

oli-obk · 2019-08-15T14:38:24Z

True. We can always add that back later if it seem desirable.

So..

Any objections to that?

Nope, bring on the cosmic rays

RalfJung · 2019-08-16T06:21:44Z

@christianpoveda do you want to try implementing that?

pvdrz · 2019-08-16T16:23:41Z

I'm working on it.

Edit: Now we are using the host's RNG when the communication flag is enabled

@RalfJung

Use host's rng when communication is enabled This uses the host's randomness when the communication enabled flag is used. I am not sure about the error handling. I was thinking about fallbacking to `rand` if `getrandom` fails and also print something so the user knows miri is not using the host's rng because it failed. Let me know what you think. Related issue: #800. r? @RalfJung @oli-obk

@RalfJung

Use host's rng when communication is enabled This uses the host's randomness when the communication enabled flag is used. I am not sure about the error handling. I was thinking about fallbacking to `rand` if `getrandom` fails and also print something so the user knows miri is not using the host's rng because it failed. Let me know what you think. Related issue: #800. r? @RalfJung @oli-obk

RalfJung · 2019-12-31T11:04:07Z

Closing this: we now have working external communication, and when/if we want to enable that by default, that should be a new discussion IMO. For now, off-by-default seems to still work fine.

RalfJung added A-shims Area: This affects the external function shims C-proposal Category: a proposal for something we might want to do, or maybe not; details still being worked out labels Jun 29, 2019

This was referenced Jun 29, 2019

Implement libstd HashMap seeding on OS X #686

Closed

Figure out a story for non-determinism and external communication #653

Closed

Support for accessing host environment variables #670

Closed

pvdrz mentioned this issue Aug 6, 2019

Enable env communication #894

Merged

bors added a commit that referenced this issue Aug 14, 2019

Auto merge of #894 - christianpoveda:env-vars-communication, r=RalfJung

1f504ea

Enable env communication related issue: #800. r? @RalfJung

pvdrz mentioned this issue Aug 19, 2019

Use host's rng when communication is enabled #914

Merged

RalfJung added C-project Category: a larger project is being tracked here, usually with checkmarks for individual steps and removed C-proposal Category: a proposal for something we might want to do, or maybe not; details still being worked out labels Aug 23, 2019

RalfJung mentioned this issue Aug 26, 2019

Can't call foreign function: clock_gettime #641

Closed

RalfJung closed this as completed Dec 31, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Plan for external communication / isolation #800

Plan for external communication / isolation #800

RalfJung commented Jun 29, 2019 •

edited

Loading

pvdrz commented Aug 6, 2019 •

edited

Loading

RalfJung commented Aug 6, 2019

pvdrz commented Aug 6, 2019 •

edited

Loading

oli-obk commented Aug 6, 2019

pvdrz commented Aug 6, 2019

RalfJung commented Aug 6, 2019

pvdrz commented Aug 6, 2019

oli-obk commented Aug 6, 2019

pvdrz commented Aug 6, 2019

RalfJung commented Aug 6, 2019

oli-obk commented Aug 6, 2019

RalfJung commented Aug 6, 2019 •

edited

Loading

pvdrz commented Aug 6, 2019

RalfJung commented Aug 6, 2019

pvdrz commented Aug 6, 2019 •

edited

Loading

RalfJung commented Aug 15, 2019

oli-obk commented Aug 15, 2019

RalfJung commented Aug 15, 2019 •

edited

Loading

pvdrz commented Aug 15, 2019 •

edited

Loading

oli-obk commented Aug 15, 2019

RalfJung commented Aug 16, 2019

pvdrz commented Aug 16, 2019 •

edited

Loading

RalfJung commented Dec 31, 2019

Plan for external communication / isolation #800

Plan for external communication / isolation #800

Comments

RalfJung commented Jun 29, 2019 • edited Loading

Current status

pvdrz commented Aug 6, 2019 • edited Loading

RalfJung commented Aug 6, 2019

pvdrz commented Aug 6, 2019 • edited Loading

oli-obk commented Aug 6, 2019

pvdrz commented Aug 6, 2019

RalfJung commented Aug 6, 2019

pvdrz commented Aug 6, 2019

oli-obk commented Aug 6, 2019

pvdrz commented Aug 6, 2019

RalfJung commented Aug 6, 2019

oli-obk commented Aug 6, 2019

RalfJung commented Aug 6, 2019 • edited Loading

pvdrz commented Aug 6, 2019

RalfJung commented Aug 6, 2019

pvdrz commented Aug 6, 2019 • edited Loading

RalfJung commented Aug 15, 2019

oli-obk commented Aug 15, 2019

RalfJung commented Aug 15, 2019 • edited Loading

pvdrz commented Aug 15, 2019 • edited Loading

oli-obk commented Aug 15, 2019

RalfJung commented Aug 16, 2019

pvdrz commented Aug 16, 2019 • edited Loading

RalfJung commented Dec 31, 2019

RalfJung commented Jun 29, 2019 •

edited

Loading

pvdrz commented Aug 6, 2019 •

edited

Loading

pvdrz commented Aug 6, 2019 •

edited

Loading

RalfJung commented Aug 6, 2019 •

edited

Loading

pvdrz commented Aug 6, 2019 •

edited

Loading

RalfJung commented Aug 15, 2019 •

edited

Loading

pvdrz commented Aug 15, 2019 •

edited

Loading

pvdrz commented Aug 16, 2019 •

edited

Loading