Don't use T or CONDITION in HANDLER-CASE or non-local-exiting HANDLER-BIND clauses (also affects blackbird) #108

ivan4th · 2014-12-20T05:49:28Z

I've noticed there's repeating pattern of
(handler-case ... (t () ..)) and (handler-case ... (condition () ...))
in cl-async & blackbird code. Unfortunatelly this may lead to unwanted
non-local exits and some very hard-to-debug problems.

The problem is, not every condition in Common Lisp is necessarily
causing stack unwinding. For instance, WARN is implemented using
conditions, too. HANDLER-BIND can be used to handle conditions
without causing the non-local exit:

CL-USER> (handler-bind ((warning
                          #'(lambda (w)
                              (format t "got warning: ~s~%" w))))
           (progn
             (warn "test")
             (princ "zzz")))
got warning: #<SIMPLE-WARNING "test" {100C7B5883}>
WARNING: test
zzz

WARN here signals a SIMPLE-WARNING condition which still allows
(princ "zzz") form to be evaluated.

HANDLER-CASE is usually implemented internally using HANDLER-BIND or
some internal variant of it. It's important difference is that any
condition handled by HANDLER-CASE always causes a non-local exit
(stack unwinding).

SIGNAL function is the fundamental mechanism which is used to signal
conditions. ERROR function is implemented in terms of SIGNAL, but
invokes the debugger in case the condition wasn't handled with a
non-local exit.

There's a separate superclass for conditions that are designed to
cause non-local exit, SERIOUS-CONDITION. It's subclasses are ERROR and
some other conditions that shouldn't be considered plain errors that
are handled as usual, such as stack overflow errors. For instance, in
SBCL:

CL-USER> (mapcar #'class-name (sb-pcl:class-direct-subclasses (find-class 'serious-condition)))
(SB-DI:DEBUG-CONDITION SB-SYS:INTERACTIVE-INTERRUPT TIMEOUT STORAGE-CONDITION
 ERROR)
CL-USER> (mapcar #'class-name (sb-pcl:class-direct-subclasses (find-class 'storage-condition)))
(SB-KERNEL::HEAP-EXHAUSTED-ERROR SB-KERNEL::ALIEN-STACK-EXHAUSTED
                                 SB-KERNEL::BINDING-STACK-EXHAUSTED
                                 SB-KERNEL::CONTROL-STACK-EXHAUSTED
                                 SB-INT:SIMPLE-STORAGE-CONDITION)

(TIMEOUT here is related to SB-EXT:WITH-TIMEOUT).

Here lies the problem: handling every condition using HANDLER-CASE causes
non-local exit even in unwanted cases. For example:

CL-USER> (handler-case
             (progn
               (warn "test")
               (princ "zzz"))
           (condition ()
             (princ "oops")))
oops

Note that zzz isn't printed, although WARN isn't normally supposed
to alter the normal program flow. Same goes for T:

CL-USER> (handler-case
             (progn
               (warn "test")
               (princ "zzz"))
           (t ()
             (princ "oops")))
oops

On the other hand:

CL-USER> (handler-case
             (progn
               (warn "test")
               (princ "zzz"))
           (error ()
             (princ "oops")))
WARNING: test
zzz

Specifying SERIOUS-CONDITION instead of ERROR may not always be a
right option, as you probably don't want to wrap things like stack
overflows in normal cl-async or blackbird error handling mechanisms.

This problem affects both blackbird and cl-async. As of
cl-async, I'm currently readying some important set of changes (see
#107) that would be nice to have in cl-async so fixing the problem

right now may probably cause some merge conflicts, so probably I
should fix it myself as the part of these changes (if you're willing
to integrate them).

See also (besides CLHS):
http://www.nhplace.com/kent/Papers/Condition-Handling-2001.html
http://www.gigamonkeys.com/book/beyond-exception-handling-conditions-and-restarts.html

The text was updated successfully, but these errors were encountered:

orthecreedence · 2014-12-20T20:51:08Z

Thanks, you are not the first person to bring this up, and it points to a lack of understanding in condition handling on my part (see orthecreedence/blackbird#8). I have read about condition handling and also specifically handler-bind and its differences from handler-case and how to use it more effectively, but my general understanding is still fairly limited. This is something I am more than willing to fix in the libraries so that they are more correct/useful for other people's typical workflows.

My error handling background typically consists of try {} catch () {} so some of the more nuanced aspects of condition handling in CL have been hard for me to grasp (but mainly I have not really done the research yet because I did not know it was a problem in the first place).

I'll review the links you sent over and also would just appreciate any feedback (or pull requests) you have on how to make the condition handling more appropriate.

this can be considered a breaking change, per issue #108 as well as discussions at orthecreedence/blackbird#8. the idea being that cl-async no longer catches all errors and routes them to the event-cb callbacks, but instead now looks for event-info conditions (and all derivatives) and sends *only those* to the event-cb, leaving all other errors to bubble to the top (debugger). note that errors only bubble to the top if :catch-app-errors is nil! if t, *all* errors are still caught, with one important change: if :caught-errors is given (a function of one argument) when starting the event loop, then all caught errors are sent to this function to do what you will with them. in other words, event-cb is no longer the dumping ground for all conditions when catching errors, now there is a separate place. note that :caught-errors is only used when :catch-app-errors is T.

orthecreedence · 2015-01-01T18:48:01Z

Ok, biggish updates with regard to error handling here. cl-async no longer catches errors and passes them to the nearest event-cb. The only conditions passed to event-cb for any operation are actual cl-async conditions (streamish-eof, socket-reset, ...) spawned by libuv.

When using :catch-app-errors t, all non-cl-async errors are trapped and sent to the callback given under :caught-errors:

(as:with-event-loop (:catch-app-errors t :caught-errors (lambda (err) ...))
  ...)

This allows for a production mode of sorts where a rogue error won't crash the event loop entirely, just the "thread" that the error occurred on.

When using :catch-app-errors nil (the default), all non-cl-async conditions are allowed to pass, unfettered, to the top-level (and enter the debugger in the case of errors). This allows the use of restarts as well as interactive debugging of errors, with full backtraces.

Also, I added a continue-event-loop restart to cl-async-util::call-with-callback-restarts that passes the error to :caught-errors if invoked. This lets you achieve :catch-app-errors t behavior on an error-by-error basis.

Also, exporting abort-callback, continue-event-loop, exit-event-loop restarts from cl-async-util (but not cl-async) for easier auto restart invocation.

This was referenced Jan 2, 2015

ssl causes faults on sbcl orthecreedence/carrier#5

Closed

Use handler-bind instead of handler-case orthecreedence/blackbird#8

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Don't use T or CONDITION in HANDLER-CASE or non-local-exiting HANDLER-BIND clauses (also affects blackbird) #108

Don't use T or CONDITION in HANDLER-CASE or non-local-exiting HANDLER-BIND clauses (also affects blackbird) #108

ivan4th commented Dec 20, 2014

orthecreedence commented Dec 20, 2014

orthecreedence commented Jan 1, 2015

Don't use T or CONDITION in HANDLER-CASE or non-local-exiting HANDLER-BIND clauses (also affects blackbird) #108

Don't use T or CONDITION in HANDLER-CASE or non-local-exiting HANDLER-BIND clauses (also affects blackbird) #108

Comments

ivan4th commented Dec 20, 2014

orthecreedence commented Dec 20, 2014

orthecreedence commented Jan 1, 2015