epee net tcp server: properly terminate interrupted TCP connection. fixes #8685 #8731

BotoX · 2023-02-03T21:52:23Z

See issue #8685
Specifically the comment/patch by @j-berman #8685 (comment)

monerod would start using 160-180% CPU after a few days of uptime.
I was consistently running into this issue for multiple months.
With this patch monerod has been running for two weeks already, without high cpu usage.

Related/Same issue #8550

plowsof · 2023-02-04T14:16:39Z

fix commit author (or ask them to PR the fix)

BotoX · 2023-02-04T16:28:34Z

fix commit author (or ask them to PR the fix)

done.

iamamyth · 2023-02-04T17:53:44Z

contrib/epee/include/net/abstract_tcp_server2.inl

@@ -586,8 +586,7 @@ namespace net_utils
      else if (ec.value())


Given the change below, this else if can be folded into the else.

j-berman

Repeating my logic for this patch:

I'm thinking until someone actually writes the code to reuse an "interrupted" connection, it makes sense to simply complete the connection termination sequence (and shut down the TCP connection) after the server shuts the SSL stream down. This way the server isn't leaving a connection around in this interrupted state that can't be "un-interrupted."

Adding some more logic to support this reasoning... once a connection has entered the interrupted state, it will stop reading data received over the socket thanks to this section:

monero/contrib/epee/include/net/abstract_tcp_server2.inl

Lines 412 to 420 in c5d10a4

    
           if (m_state.status == status_t::INTERRUPTED) 
        
             on_interrupted(); 
        
           else if (m_state.status == status_t::TERMINATING) 
        
             on_terminating(); 
        
           else if (!success) 
        
             interrupt(); 
        
           else { 
        
             start_read(); 
        
           }

When the connection is in the interrupted state, it falls into the first if, preventing start_read from getting called in the else. start_read is what calls async_read_some to read the next chunk of data received over the socket. Thus, the interrupted state prevents the server from reading more data received over the socket.

As written, I don't see why the server should keep an interrupted connection around.

vtnerd · 2023-06-07T18:44:34Z

As written, I don't see why the server should keep an interrupted connection around.

What is the purpose of the state? ~~At a glance I don't see where the state/status can become INTERRUPTED.~~

Whoops, funky search before, I see it now. But still, what is the purpose of the state?

j-berman · 2023-06-08T05:13:42Z

From what I recall in discussions with the author, their intent behind the connection's interrupted status was to allow the server to reuse a connection that hasn't been terminated yet. But the author didn't actually implement the code to reuse an interrupted connection (I don't see a way for a connection's status to go from INTERRUPTED to anything but TERMINATING or WASTED). I believe they intended to implement the ability to reuse the underlying TCP connection in a future PR building off this interrupted status.

More behind my thinking on this patch is here: #8685 (comment)

vtnerd · 2023-06-08T13:53:17Z

@j-berman its tempting to slowly remove the interrupted state completely - it seems to complicated for our use cases.

BotoX force-pushed the fix-8685 branch from e45f1d6 to 2fa53a5 Compare February 4, 2023 16:27

iamamyth reviewed Feb 4, 2023

View reviewed changes

properly terminate interrupted TCP connection. fixes monero-project#8685

6c73dc7

BotoX force-pushed the fix-8685 branch from 2fa53a5 to 6c73dc7 Compare February 4, 2023 21:04

j-berman approved these changes Feb 22, 2023

View reviewed changes

selsta mentioned this pull request Apr 16, 2023

Persistent high CPU usage for a 24 hours #8824

Closed

vtnerd approved these changes Jun 8, 2023

View reviewed changes

selsta mentioned this pull request Jun 9, 2023

properly terminate interrupted TCP connection. fixes #8685 [release-v0.18] #8900

Merged

luigi1111 merged commit 97354d8 into monero-project:master Jun 27, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

epee net tcp server: properly terminate interrupted TCP connection. fixes #8685 #8731

epee net tcp server: properly terminate interrupted TCP connection. fixes #8685 #8731

BotoX commented Feb 3, 2023

plowsof commented Feb 4, 2023 •

edited

Loading

BotoX commented Feb 4, 2023

iamamyth Feb 4, 2023

j-berman left a comment

vtnerd commented Jun 7, 2023 •

edited

Loading

j-berman commented Jun 8, 2023

vtnerd commented Jun 8, 2023

	if (m_state.status == status_t::INTERRUPTED)
	on_interrupted();
	else if (m_state.status == status_t::TERMINATING)
	on_terminating();
	else if (!success)
	interrupt();
	else {
	start_read();
	}

epee net tcp server: properly terminate interrupted TCP connection. fixes #8685 #8731

epee net tcp server: properly terminate interrupted TCP connection. fixes #8685 #8731

Conversation

BotoX commented Feb 3, 2023

plowsof commented Feb 4, 2023 • edited Loading

BotoX commented Feb 4, 2023

iamamyth Feb 4, 2023

Choose a reason for hiding this comment

j-berman left a comment

Choose a reason for hiding this comment

vtnerd commented Jun 7, 2023 • edited Loading

j-berman commented Jun 8, 2023

vtnerd commented Jun 8, 2023

plowsof commented Feb 4, 2023 •

edited

Loading

vtnerd commented Jun 7, 2023 •

edited

Loading