EventingService.afterExecute sometimes executed twice #1455

computerlove · 2022-06-27T09:12:16Z

We have an EventingService that logs response times:

private static final Map<String, Long> requestTimes = new ConcurrentHashMap<>();

@Override
public void beforeExecute(Context context) {
    requestTimes.put(context.getExecutionId(), currentTimeMillis());
}

@Override
public void afterExecute(Context context) {
    Long startTime = requestTimes.remove(context.getExecutionId());
    log.info("response time " + context.getExecutionId() + " " + calculate(startTime) );
}

After upgrading to Quarkus 2.10.0, afterExecute is executed twice for some Contexts.
Our application fires off ~5 GraphQL requests at the same time, and it seems random which one triggers the double execution of afterExecute.
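
For readers who want to see the symptom in isolation, here is a minimal sketch of the same bookkeeping using only the JDK. ResponseTimer and its method names are hypothetical and not part of SmallRye GraphQL; the calculate helper from the snippet above is not shown in this thread, so the sketch computes the difference directly. The point it illustrates is that a second afterExecute for the same execution ID finds no start time, because the first call already removed it.

```java
import java.util.Map;
import java.util.OptionalLong;
import java.util.concurrent.ConcurrentHashMap;

/** Hypothetical helper (not a SmallRye class): tracks start times per execution ID. */
final class ResponseTimer {
    private final Map<String, Long> requestTimes = new ConcurrentHashMap<>();

    /** Intended to be called from beforeExecute: remember when the execution started. */
    void start(String executionId, long nowMillis) {
        requestTimes.put(executionId, nowMillis);
    }

    /**
     * Intended to be called from afterExecute: returns the elapsed time, or empty
     * when no start time is recorded (for example because afterExecute already
     * ran once for this ID and removed the entry).
     */
    OptionalLong finish(String executionId, long nowMillis) {
        Long startTime = requestTimes.remove(executionId);
        return startTime == null
                ? OptionalLong.empty()
                : OptionalLong.of(nowMillis - startTime);
    }
}
```

Returning an empty OptionalLong instead of unboxing a null Long makes the duplicate call visible in the log without throwing.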


phillip-kruger · 2022-06-27T09:30:14Z

Interesting. This is a bug; let me have a look.

phillip-kruger · 2022-06-27T09:31:33Z

Do you perhaps have a reproducer?

computerlove · 2022-06-27T10:16:37Z

Haven't had time yet, will make one soon!

phillip-kruger · 2022-06-28T01:26:10Z

I had a quick look but cannot see anything obvious, so a reproducer would help a lot. Thanks :)

computerlove · 2022-06-28T08:10:59Z

Made a crude reproducer.
The incidents are more frequent when the client aborts the request before it has finished.

Run the server from the root with mvn quarkus:dev, then start the client from its folder the same way.

phillip-kruger · 2022-06-30T05:05:24Z

Just some feedback: your reproducer recreates the issue perfectly. However, I am not sure how to fix this. I am going to discuss this with @cescoffier tomorrow and hopefully get a fix in for this a.s.a.p.

cescoffier · 2022-06-30T05:33:02Z

@phillip-kruger do you know where this callback is invoked?

phillip-kruger · 2022-06-30T05:38:06Z

Here:

smallrye-graphql/server/implementation/src/main/java/io/smallrye/graphql/execution/ExecutionService.java

Line 176 in 94164da

Uni.createFrom().completionStage(() -> graphQL.executeAsync(executionInput))

I tried:

memoize()
Removing Uni and just using the underlying CompletionStage

In both cases this still happens.

I also checked onTermination and onCancellation, but they are also called twice.

cescoffier · 2022-06-30T05:42:15Z

Hum, the Uni emits a single event: item, failure or cancellation (the last one is not handled in your case, BTW). So, are you sure that the writeAsync method is not called twice?
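
To make the "single event" expectation concrete, here is a small sketch using only the JDK's CompletableFuture (not Mutiny and not the SmallRye code). SingleCompletionDemo is a hypothetical name. It shows that a completion stage accepts only one completion, so a callback registered on it runs once even if something tries to complete it a second time.

```java
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.atomic.AtomicInteger;

/** Hypothetical demo class: a stage completes at most once. */
final class SingleCompletionDemo {

    /** Completes one future twice and returns how often the callback ran. */
    static int callbackInvocations() {
        AtomicInteger calls = new AtomicInteger();
        CompletableFuture<String> stage = new CompletableFuture<>();
        stage.thenAccept(result -> calls.incrementAndGet());

        boolean first = stage.complete("first");   // true: the stage completes now
        boolean second = stage.complete("second"); // false: already completed, ignored

        if (!first || second) {
            throw new IllegalStateException("unexpected completion behaviour");
        }
        return calls.get();
    }
}
```

Under this behaviour, seeing the callback twice suggests that there are two subscriptions or two executions rather than one stage emitting twice.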

phillip-kruger · 2022-06-30T05:47:46Z

Let me double check that. Hold on.

phillip-kruger · 2022-06-30T06:16:03Z

No, writeAsync is called only one time.

cescoffier · 2022-06-30T06:49:17Z

So either the completion stage is totally broken and emits twice, or there is something really wrong.

Add a .log() just after the creation of the Uni with the completion stage.

cescoffier · 2022-07-01T05:58:23Z

@phillip-kruger A quick debug session showed me that writeAsync is called many (many) times, sometimes with the same SmallRye Context.

It leads to calling the callback multiple times, as each execution receives a result dispatched using the callback.

phillip-kruger · 2022-07-01T06:57:52Z

Really? My debugging showed writeAsync called once (per executionId), but then the result is sometimes received multiple times. You can also see that the beforeExecute event is always called only once.

cescoffier · 2022-07-01T07:02:04Z

I didn't check the execution Id (only the context reference). I will check once back.

phillip-kruger · 2022-07-20T00:28:59Z

@cescoffier this still happens with 2.10.3.Final (so the context termination is not related). I still have no idea why this is happening. I'll debug some more once I have time.

cescoffier · 2022-07-21T09:58:26Z

What an interesting issue!

The problem is the executionId, which is totally messed up because the code is called from different threads and the execution ID seems to be held in a thread local or something thread-specific.

So, basically, it's even worse than duplicated afterExecute. It may produce a totally broken result for an execution id, as the execution id was not the right one.
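
The failure mode can be reproduced in isolation with plain JDK classes. The sketch below is an illustration under assumptions, not the SmallRye implementation: StaleThreadLocalDemo and CURRENT_ID are hypothetical names, and a single-thread executor stands in for a shared worker thread. One execution leaves its ID in the worker's thread local, and a later callback on the same worker reads that stale ID.

```java
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;

/** Hypothetical demo class: a thread local leaks an ID between executions. */
final class StaleThreadLocalDemo {
    private static final ThreadLocal<String> CURRENT_ID = new ThreadLocal<>();

    /** Returns the ID that the second execution's callback sees on the shared worker. */
    static String idSeenBySecondCallback() {
        ExecutorService worker = Executors.newSingleThreadExecutor();
        try {
            // Execution 1 stores its ID in the worker's thread local and never clears it.
            Runnable firstExecution = () -> CURRENT_ID.set("exec-1");
            worker.submit(firstExecution).get();

            // Execution 2's callback runs on the same worker and reads the thread local.
            Callable<String> secondCallback = CURRENT_ID::get;
            return worker.submit(secondCallback).get();
        } catch (InterruptedException | ExecutionException e) {
            throw new IllegalStateException(e);
        } finally {
            worker.shutdown();
        }
    }
}
```

If two callbacks resolve to the same ID this way, an event such as afterExecute appears twice for one execution and never for the other.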

I "fixed" the issue by just adding this:

private void writeAsync(GraphQL graphQL,
        ExecutionInput executionInput,
        SmallRyeContext smallRyeContext,
        ExecutionResponseWriter writer) {
    String id = smallRyeContext.getExecutionId(); // Store the execution id
    Uni.createFrom().completionStage(() -> graphQL.executeAsync(executionInput))
            .subscribe().with(executionResult -> {
                SmallRyeContextManager.restore(smallRyeContext);
                // To see the issue:
                System.out.println("what's your id? " + smallRyeContext.getExecutionId() + " =?= " + id);

                // To fix the issue
                smallRyeContext.setExecutionId(id);
                // Notify after
                eventEmitter.fireAfterExecute(smallRyeContext);

                ExecutionResponse executionResponse = new ExecutionResponse(executionResult);
                if (!payloadOption.equals(LogPayloadOption.off)) {
                    log.payloadOut(executionResponse.toString());
                }
                writer.write(executionResponse);

            }, failure -> {
                if (failure != null) {
                    writer.fail(failure);
                }
            });
}
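
The workaround above amounts to capturing the ID in a local variable before crossing the asynchronous boundary. As a rough, self-contained sketch of that pattern with JDK classes only (CapturedIdDemo and CURRENT_ID are hypothetical names, and this is not the change that was eventually merged in Quarkus):

```java
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;

/** Hypothetical demo class: captured local variable versus thread-local lookup. */
final class CapturedIdDemo {
    private static final ThreadLocal<String> CURRENT_ID = new ThreadLocal<>();

    /** Returns {ID read from the thread local, ID captured up front} as seen in the callback. */
    static String[] idsSeenInCallback() {
        ExecutorService worker = Executors.newSingleThreadExecutor();
        try {
            CURRENT_ID.set("exec-42");      // set on the calling thread only
            String id = CURRENT_ID.get();   // capture before going asynchronous
            return CompletableFuture
                    .supplyAsync(() -> "result", worker)
                    // thenApplyAsync forces the callback onto the worker thread
                    .thenApplyAsync(result -> new String[] { CURRENT_ID.get(), id }, worker)
                    .join();
        } finally {
            CURRENT_ID.remove();
            worker.shutdown();
        }
    }
}
```

The captured variable travels with the lambda, while the thread-local lookup depends on whichever thread happens to run the callback.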

phillip-kruger · 2022-07-21T11:02:02Z

Wow!! Thanks @cescoffier! I also went down the wrong-ID route but did not figure out what was wrong.

But this probably means that everything on the context is not properly propagated? I'll have a look at that in the morning.

Thanks for your help...

cescoffier · 2022-07-21T11:39:36Z

I'm seeing that the computation is based on thread locals, so definitely broken as your processing involves multiple threads.

phillip-kruger · 2022-07-21T11:40:58Z

Ok, I'll have a look tomorrow, thanks again, this helps a lot

phillip-kruger · 2022-08-08T02:34:33Z

Good news. I fixed this in Quarkus (see above PR). @computerlove let me know once the PR is merged or on the next release if this works for you. It fixed your reproducer. Thanks. Closing here.

computerlove · 2022-08-09T06:53:22Z

Yep, looks good!

phillip-kruger self-assigned this Jun 27, 2022

phillip-kruger mentioned this issue Aug 8, 2022

More GraphQL Context cleanup quarkusio/quarkus#27173

Merged

phillip-kruger closed this as completed Aug 8, 2022

computerlove mentioned this issue Nov 3, 2022

Sporadic «The current thread cannot be blocked» in graphql quarkusio/quarkus#29040

Closed

computerlove mentioned this issue Dec 21, 2023

EventingService.afterDataFetch called with wrong context on multi field query #1991

Closed
