RetryLater AutoDelete queue is causing problems #238

gingertez · 2017-07-05T09:40:59Z

We have a high-throughput system for which we expect some messages to fail and be re-queued and so we are using the AdvancedMessageContext to retry in a couple of seconds.

The creation of the rety_in_2000_ms queue happens automatically when we call context.RetryLater(timespan) and then the queue is automatically deleted when the queued messages have completed.

However, the next time we try to call context.RetryLater(timespan) we get an exception in our stack trace:
The AMQP operation was interrupted: AMQP close-reason, initiated by Peer, code=404, text="NOT_FOUND - no exchange 'rety_in_2000_ms' in vhost '/'", classId=60, methodId=40, cause=

Doing a Google search turns up this which indicates that the issue might be related to the AutoDelete setting on the retry queue: http://rabbitmq.1065348.n5.nabble.com/Send-problem-receiver-closes-exchange-td25702.html

Is my guess correct that it is down to the AutoDelete queue and if so is there any way to override this?

The text was updated successfully, but these errors were encountered:

pardahlman · 2017-07-05T10:24:53Z

Hello, @gingertez - thanks for reporting this 👋 .

All topology related calls (declaring queue/exchange and binding queue/exchange) is handled by the registered ITopologyProvider, which has a default implementation TopologyProvider. Each time a is published, the exchange is declared via a call to DeclareExchangeAsync.

It would be costly to declare the exchange each time, so the client tries to evaluate if the exchange has already been declared, and if so does not declare it. However, looking at that particular code, it looks like it does not take AutoDelete into consideration.

So my guess at what's happening is that it works the first time, then the exchange is removed but is still in the list of known declared exchanges and therefor not declared again, which is why the publish fails.

The retry later feature has been re-implemented in the next version of the client (2.x), which is my main focus now. If you want to, you could try to register a custom implementation of the topology provider (based on the default one) where you check for auto delete and see if it helps. If it does, feel free to create a PR with the fix.

PS. There is a PR for the typo in the retry queue name 😉 .

pardahlman · 2017-07-05T10:30:57Z

Hang on... looks like the the exchange is not added to the list of known exchanges if it uses autodelete: https://github.com/pardahlman/RawRabbit/blob/stable/src/RawRabbit/Common/TopologyProvider.cs#L217

gingertez · 2017-07-05T10:59:28Z

Thanks Pär

I'm not 100% sure how to go about what you suggested in your first reply. Any pointers?

FYI we're currently on version 1.10.3.

Cheers,
Terry.

gingertez · 2017-07-05T14:07:43Z

Based on your second comment - does that mean that amending/declaring my own TopologyProvider won't resolve the issue?

It looks to me like the problem is not that RawRabbit isn't re-creating the queue the second time, the problem is that when RawRabbit tries to create the queue RabbitMQ won't allow it because it's already been deleted.

Does that sound plausible?

If so, what can be done about it?

pardahlman · 2017-07-06T19:49:11Z

Hello again, @gingertez! My first hypothesis was that the problem was due to the client "caching" of known declared topology features and the fact that some of these features can have AutoDelete. However, looking at the code again I realized that this was not the case. So, as for solution proposal you can scrap what I said in my previous comment 😉

I've looked into the problem some more and was occasionally able to reproduce it. I believe that the key is this line of code, where the retry exchange is declared with AutoDelete set to true. This means that the exchange will be removed if no queues are bound to it. As you can see a few lines below the retry queues are un-bound from the exchange once the message is received (in order to not get any more messages routed to that queue that might have been published to a different exchange). However, if the retry later method is called concurrently multiple times, the following sequence may occur:

Thread A declares exchange
Thread B declares same exchange (no problem)
Thread A binds queue to exchange, publishes message and unbounds. This will remove the exchange
Exception thrown, as the exchange has been removed (due to AutoDelete)

You should be able to get around this by implementing a custom IContextEnhancer that declares the exchange with AutoDelete set to false.

pardahlman · 2017-08-12T13:51:35Z

Closing this ticket for now, feel free to re-open if you have any more input on the topic!

gingertez · 2017-08-22T08:44:05Z

Your last message indicated that you found the bug in your code... is there any chance you could actually fix the bug for future consumers?

Cheers,
Terry.

pardahlman · 2017-08-23T05:26:10Z

I'm focusing my development efforts in making 2.0 ready for launch. Up until recently, there has been no planed release of the 1.x branch of RawRabbit. However, #257 is something that I want fixed so it's possible that this issue will be fixed in the same release. I can not commit to dates, sorry! Hope this helps!

pardahlman closed this as completed Aug 12, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

RetryLater AutoDelete queue is causing problems #238

RetryLater AutoDelete queue is causing problems #238

gingertez commented Jul 5, 2017

pardahlman commented Jul 5, 2017

pardahlman commented Jul 5, 2017

gingertez commented Jul 5, 2017

gingertez commented Jul 5, 2017

pardahlman commented Jul 6, 2017

pardahlman commented Aug 12, 2017

gingertez commented Aug 22, 2017

pardahlman commented Aug 23, 2017

RetryLater AutoDelete queue is causing problems #238

RetryLater AutoDelete queue is causing problems #238

Comments

gingertez commented Jul 5, 2017

pardahlman commented Jul 5, 2017

pardahlman commented Jul 5, 2017

gingertez commented Jul 5, 2017

gingertez commented Jul 5, 2017

pardahlman commented Jul 6, 2017

pardahlman commented Aug 12, 2017

gingertez commented Aug 22, 2017

pardahlman commented Aug 23, 2017