fix: error when actor panics directly #3697

alanshaw · 2020-09-09T17:15:27Z

Attempting to resolve filecoin-project/test-vectors#87, quoting @anorth:

If an actor implementation panics directly (rather than calling Abortf) then the evaluation is undefined. There is no exit code corresponding to this. The result should not go on chain. A panic (which could also come from some actor dependency) may indicate a transient state or error that cannot be replicated by other nodes and thus cannot form part of consensus. E.g. an out-of-memory.

I believe that returning a "fatal" ActorError instead of SysErrSenderInvalid(1) prevents the message from being recorded on chain since it's caught in ApplyMessage here.

whyrusleeping · 2020-09-09T19:43:47Z

@alanshaw @anorth we explicitly are treating nothing as a fatal error (as far as I know). I'd rather have a miner create an invalid block than have the chain halt because everyone set block X as their head and evaluating it causes a fault.

Kubuxu · 2020-09-09T19:45:38Z

Yeah, this should not be a fatal error but one of the Sys* errors that signifies actors doing something really bad.

alanshaw · 2020-09-10T08:43:06Z

Okay, well, just so you know, there are a bunch of places in the runtime where Fatalf is being used (18 in total) which we might want to check up on 😅:

Just to reiterate, the case where this kind of error happens should be extremely rare and this change would cause the message to not make it into a block in the first place. If I understand correctly and we allow this kind of message into a block, then another node applying this message is likely to not reach the same state conclusion anyway.

I'm super new to the code base so I'm happy to take your advice on this. If you're still set on returning a Sys* code then we could use 4 - its unused, and we could call it...SysErrActorMalfunction?

This PR adds the `apply_message_failures` post condition to the schema. There's currently only 1 test that uses this `no-exit-code` but it currently does not error. If you want to see the new field in JSON output then you'll need to checkout the branch from filecoin-project/lotus#3697 and add a replace to go.mod `replace github.com/filecoin-project/lotus => ../lotus`. This also updates to lotus 0.7 and specs-actors 0.9.8. resolves #134

anorth · 2020-10-01T22:54:26Z

I'd rather have a miner create an invalid block than have the chain halt because everyone set block X as their head and evaluating it causes a fault.

Ok, can we not use SysErrInvalidSender at least? Use one of the reserved exit codes, and we can rename it later.

alanshaw requested a review from anorth September 9, 2020 17:15

alanshaw requested review from magik6k and whyrusleeping as code owners September 9, 2020 17:15

alanshaw mentioned this pull request Sep 9, 2020

test: actor abort filecoin-project/test-vectors#118

Merged

6 tasks

alanshaw requested a review from Kubuxu September 9, 2020 17:16

alanshaw mentioned this pull request Sep 9, 2020

ShimCall error handling is full of dragons #3548

Closed

magik6k added the impact/consensus Impact: Consensus label Sep 9, 2020

alanshaw force-pushed the fix/actor-panic-fatal branch from 7519bb7 to 4d9f8c3 Compare September 11, 2020 11:51

alanshaw mentioned this pull request Sep 11, 2020

feat: add apply_message_failures post condition filecoin-project/test-vectors#138

Merged

alanshaw mentioned this pull request Sep 16, 2020

Release 0.9.0 #3871

Closed

21 tasks

Use SysErrReserved1 in the event of an actors panic

7b55625

arajasek force-pushed the fix/actor-panic-fatal branch from 4d9f8c3 to 7b55625 Compare October 6, 2020 09:35

alanshaw requested a review from raulk as a code owner October 6, 2020 09:35

arajasek changed the base branch from master to asr/spec-v1 October 6, 2020 09:35

magik6k approved these changes Oct 6, 2020

View reviewed changes

Base automatically changed from asr/spec-v1 to next October 6, 2020 21:34

arajasek merged commit dfaabb4 into next Oct 6, 2020

arajasek deleted the fix/actor-panic-fatal branch October 6, 2020 21:41

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: error when actor panics directly #3697

fix: error when actor panics directly #3697

alanshaw commented Sep 9, 2020

whyrusleeping commented Sep 9, 2020

Kubuxu commented Sep 9, 2020

alanshaw commented Sep 10, 2020 •

edited

Loading

anorth commented Oct 1, 2020

fix: error when actor panics directly #3697

fix: error when actor panics directly #3697

Conversation

alanshaw commented Sep 9, 2020

whyrusleeping commented Sep 9, 2020

Kubuxu commented Sep 9, 2020

alanshaw commented Sep 10, 2020 • edited Loading

anorth commented Oct 1, 2020

alanshaw commented Sep 10, 2020 •

edited

Loading