sending a coap request to an unreachable server: missing notification? #249

vdaele · 2018-08-25T15:50:27Z

As a test case, I want to send a coap request to an unreachable server.
Precondition: my coap client app is started up correctly and I've been able to toggle the (Ikea) light on and off a few times.
Next, I unplug the network cable from the Ikea hub and try to toggle the light again.
I notice that the packet is retransmitted 4 times (as expected I guess) and afterwards, I see in the logs tid=44815: give up after 4 attempts

However, there seems to be no notification/callback to the coap-client, right?
What would be the best way to get this information ? an extra call to coap_handle_event where the "give up" mesage is logged?

Edit: I just noted that the comment before coap_event_t states

Scalar type to represent different events, e.g. DTLS events or retransmission timeouts

Thanks in advance,

Marc

The text was updated successfully, but these errors were encountered:

vdaele · 2018-08-26T12:32:21Z

And on a similar note: If I send a request to a server using an old/wrong IP address, I see multiple times in the logs retransmit handshake packet and after 7 retries I see removed transaction from within dtls_retransmit.
However, I don't seem to get an error/notification in my client.

I do note a number of lines containing DTLS_ALERT_HANDSHAKE_FAILURE in the code. Would it be sufficient to add a statement like CALL(context, event, &node->peer->session, DTLS_ALERT_LEVEL_FATAL, DTLS_ALERT_HANDSHAKE_FAILURE) right before the remove transaction logline in dtls.c?

mrdeep1 · 2018-08-26T16:32:59Z

Do you have a NACK handler defined by coap_register_nack_handler()? For confirmable messages, I would expect this to be getting called in your first case, otherwise in all cases for your second case.

vdaele · 2018-08-26T17:15:12Z

I don't have this handler defined (and I noticed some other handlers that might me interesting as well!).
I'll try adding one and update this issue accordingly!
Thanks!

vdaele · 2018-08-27T17:57:56Z

I registered a coap_register_nack_handler and it indeed gets called in the first case (I do have confirmable messages)

However, in the second case (when using an invalid IP) I would expect a callback when removed transaction is printed. However, it seems that it gets called after a timeout of about 80 seconds.

The log below is a log using #define DTLS_DEFAULT_MAX_RETRANSMIT 7 in tinydtls/global.h.

Aug 27 19:48:52 DEBG *** new session 0x1830d70
...
Aug 27 19:50:20 DEBG ** DTLS global timeout set to 38073ms
Aug 27 19:50:21 DEBG ** DTLS global timeout set to 37073ms
Aug 27 19:50:22 INFO timeout
Aug 27 19:50:22 DEBG send header: (13 bytes):
00000000 15 FE FD 00 00 00 00 00 00 00 06 00 02
Aug 27 19:50:22 DEBG send unencrypted: (2 bytes):
00000000 02 00
Aug 27 19:50:22 DEBG * 192.168.1.12:46603 <-> 192.168.1.55:5684 DTLS: sent 15 bytes
Aug 27 19:50:22 DEBG removed peer: 192.168.1.55:5684
Aug 27 19:50:22 DEBG *** removed session 0x1830d70
Aug 27 19:50:22 MyLog: coap_nack_handler_static 3***
Aug 27 19:50:22 DEBG *** 192.168.1.12:46603 <-> 192.168.1.55:5684 DTLS: session closed

The log below is a log using #define DTLS_DEFAULT_MAX_RETRANSMIT 3 (instead of 7) in tinydtls/global.h. This log shows that the removed transaction comes after 30s and the timeout and the call to the nack_handler after 90s.
Hence my question: shouldn't the nack_handler be called earlier (when the removed transaction is printed)?

Aug 27 19:44:05 DEBG *** new session 0x744d70
...
Aug 27 19:44:35 DEBG ** removed transaction
Aug 27 19:45:35 INFO timeout
Aug 27 19:45:35 DEBG send header: (13 bytes):
00000000 15 FE FD 00 00 00 00 00 00 00 04 00 02
Aug 27 19:45:35 DEBG send unencrypted: (2 bytes):
00000000 02 00
Aug 27 19:45:35 DEBG * 192.168.1.12:55611 <-> 192.168.1.55:5684 DTLS: sent 15 bytes
Aug 27 19:45:35 DEBG removed peer: 192.168.1.55:5684
Aug 27 19:45:35 DEBG *** removed session 0x744d70
Aug 27 19:45:35 MyLog: coap_nack_handler_static 3***
Aug 27 19:45:35 DEBG *** 192.168.1.12:55611 <-> 192.168.1.55:5684 DTLS: session closed

mrdeep1 · 2018-08-28T10:15:17Z

In your first case, encryption has been set up and then you drop the traffic. In the second case, encryption has not been agreed between client and server - so a NACK is not so appropriate here.

Tinydtls is a separate project - libcoap is just using it for one of the DTLS type options, and so changes in the tinydtls code will not be integrated into libcoap. The only possibility is changes to src/coap_tinydtls.c - the glue between libcoap and tinydtls.

The coap-client is setting a global timeout of 90 seconds - which when expired closes the session and triggers the NACK.

With DTLS_DEFAULT_MAX_RETRANSMIT 7, the 90 second timeout expires before the re-transmit count expires. With DTLS_DEFAULT_MAX_RETRANSMIT 3, the re-transmits timeout before the global 90 second timeout.

PR #183 has many event/nack fixes, and it looks like this may already fix the issue you are highlighting.

vdaele · 2018-08-28T10:57:46Z

Is it correct that this PR #183 is not yet merged to the develop branch (I'm no git expert at all)? If so, when do you expect that this merge will happen? I certainly prefer using your lib to modifying coap_tinydtls.c myself.

mrdeep1 · 2018-08-28T12:13:00Z

Yes we are waiting on PR #183 to get merged in. In terms of when, not sure, but would expect it to be soon. based on https://sourceforge.net/p/libcoap/mailman/message/36346758/ last update.

obgm · 2018-10-04T06:43:41Z

As PR #183 has been merged, can you check if this has solved this issue?

vdaele · 2018-10-07T10:18:08Z

Current status with the latest 4.2.0 prerelease:

when disconnecting the hub and using CON messages, the nack_handler gets called ("give up after 4 attempts") after about 1'20" with reason=COAP_NACK_TOO_MANY_RETRIES.
- I can get a faster notification I guess by reducing COAP_DEFAULT_MAX_RETRANSMIT in coap_session.h to eg 1 or 2?
when disconnecting the hub and using NON messages, no handlers seems to get called, right?
- Is there some way to detect this? Can you somehow detect that the remote peer closed the socket?
when trying to connect to an invalid hub, the nack_handler gets called immediately with reason=COAP_NACK_TLS_FAILED

mrdeep1 · 2018-10-07T17:27:55Z

Instead of changing COAP_DEFAULT_MAX_RETRANSMIT, use coap_session_set_max_retransmit() instead to define the local value. See the coap_recovery(3) man page.
Correct - UDP is unreliable, and the use of NON does not guarantee any response, so libcoap will not generate any failure events. However, the CoAP Ping is designed for this purpose of testing out link connectivity by sending an Empty Confirmable Message - see RFC7252 4.3. Messages Transmitted without Reliability.
This is likely down to a the fact that the local device is unable to transmit the packet to the non existent address (as no arp entry could be created to send the packet to).

vdaele · 2018-10-08T06:10:27Z

Thanks a lot for your input (again!)

Marc

vdaele closed this as completed Oct 8, 2018

mrdeep1 mentioned this issue Oct 12, 2018

Question about ping/pong messages #264

Closed

FelicxFoster mentioned this issue Dec 6, 2020

Callback when server is unreachable #575

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

sending a coap request to an unreachable server: missing notification? #249

sending a coap request to an unreachable server: missing notification? #249

vdaele commented Aug 25, 2018 •

edited

Loading

vdaele commented Aug 26, 2018

mrdeep1 commented Aug 26, 2018

vdaele commented Aug 26, 2018

vdaele commented Aug 27, 2018

mrdeep1 commented Aug 28, 2018

vdaele commented Aug 28, 2018

mrdeep1 commented Aug 28, 2018

obgm commented Oct 4, 2018

vdaele commented Oct 7, 2018

mrdeep1 commented Oct 7, 2018

vdaele commented Oct 8, 2018

sending a coap request to an unreachable server: missing notification? #249

sending a coap request to an unreachable server: missing notification? #249

Comments

vdaele commented Aug 25, 2018 • edited Loading

vdaele commented Aug 26, 2018

mrdeep1 commented Aug 26, 2018

vdaele commented Aug 26, 2018

vdaele commented Aug 27, 2018

mrdeep1 commented Aug 28, 2018

vdaele commented Aug 28, 2018

mrdeep1 commented Aug 28, 2018

obgm commented Oct 4, 2018

vdaele commented Oct 7, 2018

mrdeep1 commented Oct 7, 2018

vdaele commented Oct 8, 2018

vdaele commented Aug 25, 2018 •

edited

Loading