[fix][broker] Fix multiple transfer corruption issues when TLS is enabled #22760

lhotari · 2024-05-22T08:32:28Z

UPDATE: This PR has been replaced by #22810

Fixes #22601 #21892 #19460

Motivation

In Pulsar, there are multiple reported issues where the transferred output gets corrupted and fails with exceptions about an invalid reader and writer index. One class of these issues occurs only when TLS is enabled.

I found these Netty issues that provide a lot of context:

- About ByteBuf.internalNioBuffer netty/netty#6184
- ChannelOutboundBuffer can cause data-corruption because of caching ByteBuffers netty/netty#2761
- Data corruption when duplicate/slice buffers and use on different EventLoops netty/netty#1865
  - netty/netty@a74149e
- internalNioBuffer(...) lead to races when using from derived ByteBuf implementations netty/netty#1801
  - AbstractByteBuf.skipBytes(..) accept negative value and so set the readerIndex to a negative value netty/netty#1797
  - netty/netty@6f79291
- ByteBuf.nioBuffer(...) should only expose the sub-region of the buffer netty/netty#1925

This appears to be a long-standing issue in Netty that has only been partially fixed. It remains unfixed in many locations in the Netty code base, so it's not safe to share ByteBuf instances in all cases.

In Pulsar, ByteBuf instances are shared in this case at least via the broker cache (RangeEntryCacheManagerImpl) and the pending reads manager (PendingReadsManager).

The SslHandler related issue was originally reported in Pulsar in 2018 as #2401. The fix at that time was #2464.
That fix used the ByteBuf .copy() method to copy the ByteBuf. The problem with this change is that .copy() itself isn't thread safe and accesses the internalNioBuffer instance directly.

This happens at least when the ByteBuf instance contains a ReadOnlyByteBufferBuf wrapper. This can be seen in the code https://github.com/netty/netty/blob/243de91df2e9a9bf0ad938f54f76063c14ba6e3d/buffer/src/main/java/io/netty/buffer/ReadOnlyByteBufferBuf.java#L412-L433 .
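
The hazard can be illustrated without Netty. The following is a minimal JDK-only sketch, assuming that the linked copy(...) code repositions one shared NIO buffer (clear, position, limit) before reading from it. The class and method names are hypothetical, and this is an analogy rather than Netty's implementation. The first method moves the position and limit of the shared buffer itself, so two threads calling it at the same time can overwrite each other's window. The second method works on a private duplicate(), which shares the bytes but has its own position and limit.

```java
import java.nio.ByteBuffer;

final class SharedBufferCopySketch {

    // Unsafe when called from several threads: repositions the shared buffer itself.
    static byte[] copyViaSharedState(ByteBuffer shared, int index, int length) {
        shared.clear().position(index).limit(index + length);
        byte[] dst = new byte[length];
        shared.get(dst); // reads from whatever position/limit is set at this moment
        return dst;
    }

    // Safer: duplicate() shares the content but has an independent position and limit.
    static byte[] copyViaDuplicate(ByteBuffer shared, int index, int length) {
        ByteBuffer view = shared.duplicate();
        view.clear().position(index).limit(index + length);
        byte[] dst = new byte[length];
        view.get(dst);
        return dst;
    }

    public static void main(String[] args) {
        ByteBuffer shared = ByteBuffer.wrap(new byte[] {10, 11, 12, 13, 14, 15, 16, 17});

        copyViaDuplicate(shared, 2, 3);
        System.out.println("after duplicate copy: position=" + shared.position()
                + " limit=" + shared.limit());

        copyViaSharedState(shared, 2, 3);
        System.out.println("after shared-state copy: position=" + shared.position()
                + " limit=" + shared.limit());
    }
}
```

The duplicate-based variant leaves the shared buffer's position and limit untouched, while the shared-state variant leaves them wherever the last caller put them. The duplicate approach only helps as long as no other caller repositions the shared buffer directly.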

As a result, exceptions such as the following occur:

java.lang.IllegalArgumentException: newPosition > limit: (2094 > 88)
    at java.base/java.nio.Buffer.createPositionException(Buffer.java:341)
    at java.base/java.nio.Buffer.position(Buffer.java:316)
    at java.base/java.nio.ByteBuffer.position(ByteBuffer.java:1516)
    at java.base/java.nio.HeapByteBuffer.get(HeapByteBuffer.java:185)
    at io.netty.buffer.UnpooledHeapByteBuf.setBytes(UnpooledHeapByteBuf.java:268)
    at io.netty.buffer.AbstractByteBuf.writeBytes(AbstractByteBuf.java:1113)
    at io.netty.buffer.ReadOnlyByteBufferBuf.copy(ReadOnlyByteBufferBuf.java:431)
    at io.netty.buffer.DuplicatedByteBuf.copy(DuplicatedByteBuf.java:210)
    at io.netty.buffer.AbstractByteBuf.copy(AbstractByteBuf.java:1194)
    at org.apache.pulsar.common.protocol.ByteBufPair$CopyingEncoder.write(ByteBufPair.java:149)
    at io.netty.channel.AbstractChannelHandlerContext.invokeWrite0(AbstractChannelHandlerContext.java:893)
    at io.netty.channel.AbstractChannelHandlerContext.invokeWrite(AbstractChannelHandlerContext.java:875)
    at io.netty.channel.AbstractChannelHandlerContext.write(AbstractChannelHandlerContext.java:984)
    at io.netty.channel.AbstractChannelHandlerContext.write(AbstractChannelHandlerContext.java:868)
    at org.apache.pulsar.broker.service.PulsarCommandSenderImpl.lambda$sendMessagesToConsumer$1(PulsarCommandSenderImpl.java:277)
    at io.netty.util.concurrent.AbstractEventExecutor.runTask(AbstractEventExecutor.java:173)
    at io.netty.util.concurrent.AbstractEventExecutor.safeExecute(AbstractEventExecutor.java:166)
    at io.netty.util.concurrent.SingleThreadEventExecutor.runAllTasks(SingleThreadEventExecutor.java:470)
    at io.netty.channel.epoll.EpollEventLoop.run(EpollEventLoop.java:413)
    at io.netty.util.concurrent.SingleThreadEventExecutor$4.run(SingleThreadEventExecutor.java:997)
    at io.netty.util.internal.ThreadExecutorMap$2.run(ThreadExecutorMap.java:74)
    at io.netty.util.concurrent.FastThreadLocalRunnable.run(FastThreadLocalRunnable.java:30)
    at java.base/java.lang.Thread.run(Thread.java:840)

java.nio.BufferUnderflowException
    at java.base/java.nio.HeapByteBuffer.get(HeapByteBuffer.java:183)
    at io.netty.buffer.UnpooledHeapByteBuf.setBytes(UnpooledHeapByteBuf.java:268)
    at io.netty.buffer.AbstractByteBuf.writeBytes(AbstractByteBuf.java:1113)
    at io.netty.buffer.ReadOnlyByteBufferBuf.copy(ReadOnlyByteBufferBuf.java:431)
    at io.netty.buffer.DuplicatedByteBuf.copy(DuplicatedByteBuf.java:210)
    at io.netty.buffer.AbstractByteBuf.copy(AbstractByteBuf.java:1194)
    at org.apache.pulsar.common.protocol.ByteBufPair$CopyingEncoder.write(ByteBufPair.java:149)
    at io.netty.channel.AbstractChannelHandlerContext.invokeWrite0(AbstractChannelHandlerContext.java:893)
    at io.netty.channel.AbstractChannelHandlerContext.invokeWrite(AbstractChannelHandlerContext.java:875)
    at io.netty.channel.AbstractChannelHandlerContext.write(AbstractChannelHandlerContext.java:984)
    at io.netty.channel.AbstractChannelHandlerContext.write(AbstractChannelHandlerContext.java:868)
    at org.apache.pulsar.broker.service.PulsarCommandSenderImpl.lambda$sendMessagesToConsumer$1(PulsarCommandSenderImpl.java:277)
    at io.netty.util.concurrent.AbstractEventExecutor.runTask(AbstractEventExecutor.java:173)
    at io.netty.util.concurrent.AbstractEventExecutor.safeExecute(AbstractEventExecutor.java:166)
    at io.netty.util.concurrent.SingleThreadEventExecutor.runAllTasks(SingleThreadEventExecutor.java:470)
    at io.netty.channel.epoll.EpollEventLoop.run(EpollEventLoop.java:413)
    at io.netty.util.concurrent.SingleThreadEventExecutor$4.run(SingleThreadEventExecutor.java:997)
    at io.netty.util.internal.ThreadExecutorMap$2.run(ThreadExecutorMap.java:74)
    at io.netty.util.concurrent.FastThreadLocalRunnable.run(FastThreadLocalRunnable.java:30)
    at java.base/java.lang.Thread.run(Thread.java:840)

It is likely that "Failed to peek sticky key from the message metadata java.lang.IllegalArgumentException: Invalid unknonwn tag type: 4" exceptions are also caused by the same root cause.
Broker-side exceptions of the type "java.lang.IndexOutOfBoundsException: readerIndex: 31215, writerIndex: 21324 (expected: 0 <= readerIndex <= writerIndex <= capacity(65536))" are possibly caused by the same root cause as well.

The root cause of such exceptions could also be different. A shared Netty ByteBuf must at least have an independent view created with duplicate, slice or retainedDuplicate if the readerIndex is mutated.
The ByteBuf instance must also be shared in a thread-safe way. Failing to do that could result in similar symptoms, and this PR doesn't fix that.

Modifications

- Remove the ByteBufPair.CopyingEncoder and make the ByteBufPair.Encoder suitable for both use cases
  - A read-only ByteBuf needs to be passed to SslHandler so that it doesn't get mutated. A deep copy isn't required. This solution also performs better since there will be fewer memory copies.
- Work around an IntelliJ issue where it shows a red mark in PulsarChannelInitializer classes (casting to ChannelHandler fixes the issue).

Documentation

doc
doc-required
doc-not-needed
doc-complete

lhotari · 2024-05-22T12:48:13Z

On the Netty side, PooledByteBuf.getBytes was made thread safe by making a copy of the internalByteBuffer in netty/netty#9120. However, this change is not reflected in the ReadOnlyByteBufferBuf implementation. (This doesn't have an impact on this PR; just sharing the observation.)

eolivelli

LGTM
great work

dao-jun · 2024-05-22T18:31:35Z

pulsar-common/src/main/java/org/apache/pulsar/common/protocol/ByteBufPair.java

+            // If the buffer is already read-only, .asReadOnly() will return the same buffer.
+            // That's why the additional .retainedDuplicate() is needed to ensure that the returned buffer
+            // has independent readIndex and writeIndex.
+            return buf.asReadOnly().retainedDuplicate();


    private static ByteBuf readOnlyRetainedDuplicate(ByteBuf buf) {
        if (buf == null || buf.readableBytes() <= 0) {
            return Unpooled.EMPTY_BUFFER;
        }
        if (buf.isReadOnly()) {
            return buf.retainedDuplicate().asReadOnly();
        }
        return buf.asReadOnly().retain();
    }

Would this be better? I think we don't need to call duplicate() after calling asReadOnly().

I think we don't need to call duplicate() after calling asReadOnly().

It is needed. The comment explains it. Please check the comment for the reason.

I mean that if the input buf is not read-only, asReadOnly will create a new instance, so we just need to call retain() after asReadOnly.
The new instance also has independent reader and writer indexes.

    if (buf == null || buf.readableBytes() <= 0) {
        return Unpooled.EMPTY_BUFFER;
    }

I don't think that nulls should be tolerated when nulls shouldn't be passed as input. For the readableBytes() == 0 optimization, is that needed?

This is just defensive programming. If we are sure that the input buf cannot be null, this line can be removed.

Oh, I checked the source code. It looks a little strange, and I didn't understand it at first.

    private static boolean attemptCopyToCumulation(ByteBuf cumulation, ByteBuf next, int wrapDataSize) {
        final int inReadableBytes = next.readableBytes();
        final int cumulationCapacity = cumulation.capacity();
        if (wrapDataSize - cumulation.readableBytes() >= inReadableBytes &&
                // Avoid using the same buffer if next's data would make cumulation exceed the wrapDataSize.
                // Only copy if there is enough space available and the capacity is large enough, and attempt to
                // resize if the capacity is small.
                (cumulation.isWritable(inReadableBytes) && cumulationCapacity >= wrapDataSize ||
                        cumulationCapacity < wrapDataSize &&
                                ensureWritableSuccess(cumulation.ensureWritable(inReadableBytes, false)))) {
            cumulation.writeBytes(next);
            next.release();
            return true;
        }
        return false;
    }

We should return false immediately if cumulation is not writable

This approach is currently blocked by another Netty bug where SslHandler doesn't support a read only buffer. That seems to be the reason why a deep copy is currently required. Fixing that issue in netty/netty#14071.

Yes, the fix looks good

We should return false immediately if cumulation is not writable

In this case, isWritable returns true for a buffer that wraps a read-only buffer. This is surprising, and it's the reason why read-only buffers aren't supported when there's another wrapper on top. The buffer correctly returns true for isReadOnly.

One possible workaround could be to use Unpooled.unmodifiableBuffer to add the readonly wrapper so that it's always the "top most" wrapper. That would also avoid the need for the extra duplicate() wrapper.

In this case, isWritable returns true for a buffer that wraps a read-only buffer.

It sounds very strange, can you please point me where?

It sounds very strange, can you please point me where?

Yes, it's very surprising behavior. You can try it out with a debugger by running for example TlsProducerConsumerTest and checking what happens in the attemptCopyToCumulation if the .asReadOnly().retainedDuplicate() solution is used.

The io.netty.buffer.AbstractByteBuf#isWritable(int) method gets called: https://github.com/netty/netty/blob/70d6a3f40d7e8fd3f5ced7600ed209c58944f673/buffer/src/main/java/io/netty/buffer/AbstractByteBuf.java#L171-L174
It will evaluate to true since its implementation doesn't check for isReadOnly():

    @Override
    public boolean isWritable(int numBytes) {
        return capacity() - writerIndex >= numBytes;
    }

lhotari · 2024-05-22T19:28:38Z

I'll add background to the PR description to help reviewers understand it quickly.

@dao-jun Please share that as a comment instead. This is my PR and not yours. :) I'll update the description based on the feedback.

dao-jun · 2024-05-22T20:00:29Z

Just adding some additional context to help understand it quickly.

CopyingEncoder is used to handle the SslHandler case, because SslHandler can merge multiple ByteBufs into one (the first input ByteBuf).
For instance, if we write buf1, buf2 and buf3 to the handler, it will write buf2 and buf3 into buf1. If EntryCache is enabled, this will pollute the entries in the EntryCache. So we use buf.copy to create a new instance and copy the bytes to handle this case.

However, buf.copy is not thread safe, and copying from multiple threads may lead to data corruption.

The solution is to pass a ReadOnlyByteBuf to SslHandler to prevent this merging (writing buf2 and buf3 into buf1). Netty does consider that the input ByteBuf may not be writable, but there seem to be some problems with the code:

    private static boolean attemptCopyToCumulation(ByteBuf cumulation, ByteBuf next, int wrapDataSize) {
        final int inReadableBytes = next.readableBytes();
        final int cumulationCapacity = cumulation.capacity();
        if (wrapDataSize - cumulation.readableBytes() >= inReadableBytes &&
                // Avoid using the same buffer if next's data would make cumulation exceed the wrapDataSize.
                // Only copy if there is enough space available and the capacity is large enough, and attempt to
                // resize if the capacity is small.
                (cumulation.isWritable(inReadableBytes) && cumulationCapacity >= wrapDataSize ||
                        cumulationCapacity < wrapDataSize &&
                                ensureWritableSuccess(cumulation.ensureWritable(inReadableBytes, false)))) {
            cumulation.writeBytes(next);
            next.release();
            return true;
        }
        return false;
    }

It should return false immediately if cumulation.isWritable(inReadableBytes) == false or isReadOnly == true.
Waiting for the fix PR netty/netty#14071 to be merged.
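
The scenario summarized in this comment can be sketched with plain java.nio buffers. This is a JDK-only analogy under stated assumptions, not Pulsar or Netty code: the cached buffer stands in for a cache entry with spare capacity, appendInPlace is a hypothetical consumer that merges the next buffer into the first one the way the cumulation step does, and ByteBuffer.asReadOnlyBuffer() stands in for the read-only wrapper. The explicit isReadOnly() guard mirrors the "return false immediately" suggestion above; whether the Netty fix takes that exact form is not claimed here.

```java
import java.nio.ByteBuffer;

final class CachePollutionSketch {

    // Hypothetical consumer: appends 'next' into 'first' in place when it can,
    // similar to a cumulation step that merges later buffers into the first one.
    static boolean appendInPlace(ByteBuffer first, ByteBuffer next) {
        int needed = next.remaining();
        if (first.isReadOnly() || first.capacity() - first.limit() < needed) {
            return false; // cannot merge: the caller has to keep the buffers separate
        }
        int readStart = first.position();
        int writeStart = first.limit();
        first.limit(writeStart + needed); // grows the readable window of 'first'
        first.position(writeStart);
        first.put(next);                  // writes into the spare capacity of 'first'
        first.position(readStart);
        return true;
    }

    public static void main(String[] args) {
        // Stand-in for a cached entry: 4 readable bytes, capacity 8.
        ByteBuffer cached = ByteBuffer.allocate(8);
        cached.put(new byte[] {1, 2, 3, 4});
        cached.flip();

        // A read-only view refuses the merge, so the cached entry keeps its size.
        boolean mergedIntoView =
                appendInPlace(cached.asReadOnlyBuffer(), ByteBuffer.wrap(new byte[] {9, 9}));
        System.out.println("merged into read-only view: " + mergedIntoView
                + ", cached readable bytes: " + cached.remaining());

        // Handing out the cached buffer itself lets the consumer grow it.
        boolean mergedIntoCached = appendInPlace(cached, ByteBuffer.wrap(new byte[] {9, 9}));
        System.out.println("merged into cached buffer: " + mergedIntoCached
                + ", cached readable bytes: " + cached.remaining());
    }
}
```

In this sketch the cached buffer still exposes 4 readable bytes after the read-only attempt and 6 after the in-place merge, which is the kind of pollution a deep copy or a read-only view is meant to prevent.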

dao-jun · 2024-05-22T20:01:10Z

Yes, of course, my bad; I didn't explain myself correctly.

lhotari · 2024-05-22T20:29:49Z

Yes, of course, my bad; I didn't explain myself correctly.

Thanks for the useful summary @dao-jun. I'll revisit the description of this PR later once this comes to a conclusion.

lhotari · 2024-05-22T20:30:58Z

It should return false immediately if cumulation.isWritable(inReadableBytes) == false or isReadOnly == true.
Waiting for the fix PR netty/netty#14071 to be merged.

There might be a workaround (explained in #22760 (comment)). I will test that.

lhotari · 2024-05-22T21:05:04Z

I think I'll need to add a repro test to the Pulsar code base. While testing the recent changes, I can see that the problem occurs even when there's the read only wrapper.

lhotari · 2024-05-23T11:06:02Z

There are also other bugs in this area.

When TLS is enabled between the broker and bookies, the Bookkeeper V3 protocol is used.
In the Bookkeeper client, there's a bug related to reference counts when the V3 protocol is used.
The PR to fix that issue is apache/bookkeeper#4293.

lhotari · 2024-05-27T09:59:59Z

Closing this PR since the transfer corruption issues will be fixed by changes in Netty 4.1.111.Final (netty/netty#14072, netty/netty#14071, netty/netty#14076 and netty/netty#14078) and Bookkeeper 4.16.6 (apache/bookkeeper#4289 and apache/bookkeeper#4293).

lhotari · 2024-05-29T00:01:21Z

Fix in Bookkeeper is apache/bookkeeper#4404

lhotari · 2024-05-30T16:23:23Z

UPDATE: This PR has been replaced by #22810 .

lhotari added the type/bug, ready-to-test, release/2.10.7, release/2.11.5, release/3.1.5, release/3.3.1, release/3.0.6 and release/3.2.4 labels May 22, 2024

lhotari added this to the 3.4.0 milestone May 22, 2024

lhotari requested a review from merlimat May 22, 2024 08:32

lhotari self-assigned this May 22, 2024

lhotari marked this pull request as draft May 22, 2024 08:32

github-actions bot added the doc-not-needed label May 22, 2024

lhotari mentioned this pull request May 22, 2024

[Bug] parseMessageMetadata error when broker entry metadata enable with high loading #22601

Closed

3 tasks

lhotari marked this pull request as ready for review May 22, 2024 09:27

lhotari requested review from Technoboy-, eolivelli, BewareMyPower, nicoloboschi, dao-jun, codelipenghui, hangc0276 and hezhangjian May 22, 2024 09:28

[fix][broker] Fix multiple transfer corruption issues when TLS is enabled

1ffda4c

lhotari force-pushed the lh-fix-tls-corruption-issue branch from de75e14 to 1ffda4c Compare May 22, 2024 09:41

eolivelli approved these changes May 22, 2024

lhotari marked this pull request as ready for review May 22, 2024 17:18

lhotari requested a review from eolivelli May 22, 2024 17:18

Revisit the approach once again: retainedDuplicate() is needed

3767f6f

lhotari marked this pull request as draft May 22, 2024 18:17

dao-jun requested changes May 22, 2024

Add workaround for Netty issue which will be fixed by Netty PR 14071

b9ad8db

lhotari marked this pull request as ready for review May 22, 2024 20:53

lhotari requested a review from dao-jun May 22, 2024 20:54

Improve comment

56cfd35

lhotari marked this pull request as draft May 22, 2024 21:04

lhotari mentioned this pull request May 22, 2024

Calling ReadOnlyByteBufferBuf.getBytes(int, byte[], int, int) concurrently from multiple threads will fail netty/netty#14070

Closed

lhotari added 5 commits May 23, 2024 01:03

Revert "Improve comment"

2b3021e

This reverts commit 56cfd35.

Revert "Add workaround for Netty issue which will be fixed by Netty PR 14071"

cd60221

This reverts commit b9ad8db.

Revert "Revisit the approach once again: retainedDuplicate() is needed"

06814b5

This reverts commit 3767f6f.

Revert "Use alternative approach: Remove CopyingEncoder and make the encoder suitable for both use cases"

305f15c

This reverts commit aa1543f.

Return to original approach, but add read-only wrapper

24b5779

lhotari marked this pull request as ready for review May 22, 2024 22:13

lhotari marked this pull request as draft May 23, 2024 07:31

lhotari closed this May 27, 2024

lhotari mentioned this pull request May 30, 2024

[fix][broker] Fix data corruption issues when TLS is enabled and optimize TLS between Pulsar client and brokers #22810

Closed

4 tasks
