
Reduce memory usage on Azure repository implementation #66489

Merged
fcofdez merged 2 commits into elastic:master on Dec 17, 2020

Conversation

@fcofdez (Contributor) commented on Dec 17, 2020

This commit moves the upload logic into the repository itself
instead of delegating it to the SDK.
Multi-block uploads are done sequentially instead of in parallel,
which bounds the outstanding memory.
Additionally, the number of I/O threads has been reduced to 1
to reduce the memory overhead.

Closes #66385
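For context, here is a minimal sketch of the memory-bounding idea described above. It is not the PR's actual code: `BlockUploader`, `stageBlock`, and `commitBlockList` are hypothetical stand-ins for the Azure block-blob calls, and the block size is assumed fixed. The point is that blocks are read and uploaded one at a time, so at most one block's worth of data is outstanding.

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.Base64;
import java.util.List;
import java.util.UUID;

// Hypothetical abstraction standing in for the Azure block-blob operations.
interface BlockUploader {
    void stageBlock(String base64BlockId, byte[] data, int length) throws IOException;
    void commitBlockList(List<String> base64BlockIds) throws IOException;
}

final class SequentialBlockUpload {
    static void upload(InputStream input, long blobSize, int blockSize, BlockUploader uploader) throws IOException {
        final List<String> blockIds = new ArrayList<>();
        // The only outstanding buffer; reusing it is safe because staging is synchronous.
        final byte[] block = new byte[blockSize];
        long remaining = blobSize;
        while (remaining > 0) {
            final int toRead = (int) Math.min(blockSize, remaining);
            int read = 0;
            while (read < toRead) { // fill the buffer from the stream
                final int n = input.read(block, read, toRead - read);
                if (n == -1) {
                    throw new IOException("unexpected end of stream");
                }
                read += n;
            }
            final String blockId = Base64.getEncoder()
                .encodeToString(UUID.randomUUID().toString().getBytes(StandardCharsets.UTF_8));
            // Upload this block before reading the next one; a parallel upload would
            // instead need one buffer per in-flight block.
            uploader.stageBlock(blockId, block, toRead);
            blockIds.add(blockId);
            remaining -= toRead;
        }
        uploader.commitBlockList(blockIds); // assemble the staged blocks into the blob
    }
}
```

This is exactly the trade-off the description makes: throughput is lower than with parallel block uploads, but memory use is bounded by a single block.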

@elasticmachine added the Team:Distributed (Obsolete) label on Dec 17, 2020
@elasticmachine (Collaborator) commented:

Pinging @elastic/es-distributed (Team:Distributed)

@@ -150,7 +150,6 @@ private static ByteBufAllocator createByteBufAllocator() {
int tinyCacheSize = PooledByteBufAllocator.defaultTinyCacheSize();
int smallCacheSize = PooledByteBufAllocator.defaultSmallCacheSize();
int normalCacheSize = PooledByteBufAllocator.defaultNormalCacheSize();
boolean useCacheForAllThreads = PooledByteBufAllocator.defaultUseCacheForAllThreads();

return new PooledByteBufAllocator(false,
nHeapArena,
fcofdez (Contributor, Author) commented:

I think that we can reduce those too, wdyt @original-brownbear?

@original-brownbear (Member) replied:

Yeah, can't we just use 1, given that we only use one thread now anyway?
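For illustration, a sketch of what that suggestion could amount to, assuming Netty 4.1's `PooledByteBufAllocator` constructor used in the diff above; the exact values here are assumptions, not the PR's code:

```java
import io.netty.buffer.ByteBufAllocator;
import io.netty.buffer.PooledByteBufAllocator;

final class SingleArenaAllocator {
    // Sketch only: arenas exist to reduce contention between threads, so with a
    // single I/O thread one heap arena should be sufficient.
    static ByteBufAllocator createByteBufAllocator() {
        final int nHeapArena = 1;   // assumption: reduced from the Netty default
        final int nDirectArena = 0; // heap buffers only in this sketch
        return new PooledByteBufAllocator(
            false, // preferDirect = false, as in the diff above
            nHeapArena,
            nDirectArena,
            PooledByteBufAllocator.defaultPageSize(),
            PooledByteBufAllocator.defaultMaxOrder(),
            PooledByteBufAllocator.defaultTinyCacheSize(),
            PooledByteBufAllocator.defaultSmallCacheSize(),
            PooledByteBufAllocator.defaultNormalCacheSize(),
            PooledByteBufAllocator.defaultUseCacheForAllThreads());
    }
}
```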

@original-brownbear (Member) left a review:

Thanks Francisco. I pretty much only have one question.
That said, I think the move to a single thread and the reduction of the heap arenas are straightforward in any case. Let's see what we can do about the upload Flux. If that turns out to be trickier, we could pull the arena and thread-count changes into a separate PR so we can re-enable the tests ASAP.


int numOfBytesRead = 0;
int offset = 0;
int len = (int) count;
final byte[] buffer = new byte[len];
Member commented:

This still scares me a little. If we only do 64k at a time here, can't we use the Netty memory allocator (or manage our own set of byte[] and recycle them on doOnComplete), or do we still have no guarantees about flushing at that point?

fcofdez (Contributor, Author) replied:

Sadly, we don't have guarantees about flushing at that point.

fcofdez (Contributor, Author) added:

I explored a different approach where I was passing an allocator there, and recycling at the end of the request, but in that case you end up holding that memory for the entire duration of the request.
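To make the trade-off concrete, here is a minimal sketch (a hypothetical helper, not the PR's code) of the per-chunk copy the snippet above performs: each chunk is read into a fresh byte[] that becomes garbage as soon as it has been written, whereas a pooled buffer recycled only in doOnComplete would stay reserved for the whole request.

```java
import java.io.IOException;
import java.io.InputStream;
import java.util.Arrays;

// Hypothetical helper mirroring the idea in the diff excerpt above.
final class ChunkReader {
    static byte[] readChunk(InputStream stream, long count) throws IOException {
        final int len = (int) count;
        final byte[] buffer = new byte[len]; // short-lived allocation, reclaimable right after the write
        int offset = 0;
        while (offset < len) {
            final int numOfBytesRead = stream.read(buffer, offset, len - offset);
            if (numOfBytesRead == -1) {
                // The stream ended early; return only what was actually read.
                return Arrays.copyOf(buffer, offset);
            }
            offset += numOfBytesRead;
        }
        return buffer;
    }
}
```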

@original-brownbear (Member) left a review:

We discussed this on a different channel and it looks like there is no easy way to get rid of the allocations during uploads (the SDK does the same thing under the hood as well).

=> LGTM :)

@fcofdez merged commit 02ac68e into elastic:master on Dec 17, 2020
fcofdez added a commit to fcofdez/elasticsearch that referenced this pull request Dec 17, 2020
This commit moves the upload logic into the repository itself
instead of delegating it to the SDK.
Multi-block uploads are done sequentially instead of in parallel,
which bounds the outstanding memory.
Additionally, the number of I/O threads and heap arenas has been
reduced to 1 to reduce the memory overhead.

Closes elastic#66385

Backport of elastic#66489
fcofdez added a commit to fcofdez/elasticsearch that referenced this pull request Dec 17, 2020
This commit moves the upload logic into the repository itself
instead of delegating it to the SDK.
Multi-block uploads are done sequentially instead of in parallel,
which bounds the outstanding memory.
Additionally, the number of I/O threads and heap arenas has been
reduced to 1 to reduce the memory overhead.

Closes elastic#66385

Backport of elastic#66489
fcofdez added a commit that referenced this pull request Dec 17, 2020
This commit moves the upload logic into the repository itself
instead of delegating it to the SDK.
Multi-block uploads are done sequentially instead of in parallel,
which bounds the outstanding memory.
Additionally, the number of I/O threads and heap arenas has been
reduced to 1 to reduce the memory overhead.

Closes #66385

Backport of #66489
fcofdez added a commit that referenced this pull request Dec 17, 2020
This commit moves the upload logic into the repository itself
instead of delegating it to the SDK.
Multi-block uploads are done sequentially instead of in parallel,
which bounds the outstanding memory.
Additionally, the number of I/O threads and heap arenas has been
reduced to 1 to reduce the memory overhead.

Closes #66385

Backport of #66489
DaveCTurner added a commit to DaveCTurner/elasticsearch that referenced this pull request Feb 12, 2021
Today we represent block IDs sent to Azure using the URL-safe base-64
encoding. This makes sense: these IDs appear in URLs. It turns out,
however, that Azure rejects this encoding for block IDs and demands
that they be represented using the regular, URL-unsafe base-64 encoding
instead, then further wrapped in %-encoding to deal with the URL-unsafe
characters that inevitably result.

Relates elastic#66489
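As an illustration of the encoding distinction described in that commit message (a JDK-only sketch, not the actual fix):

```java
import java.net.URLEncoder;
import java.nio.charset.StandardCharsets;
import java.util.Base64;

final class BlockIdEncodingDemo {
    public static void main(String[] args) {
        // Example block ID bytes chosen so that the two encodings visibly differ.
        final byte[] rawBlockId = {(byte) 0xfb, (byte) 0xef, (byte) 0xff};

        // URL-safe base-64 uses '-' and '_'; Azure rejects this form for block IDs.
        final String urlSafe = Base64.getUrlEncoder().encodeToString(rawBlockId);        // "--__"

        // Regular base-64 uses '+' and '/', so it must be %-encoded before being
        // placed in the request URL.
        final String regular = Base64.getEncoder().encodeToString(rawBlockId);           // "++//"
        final String percentEncoded = URLEncoder.encode(regular, StandardCharsets.UTF_8); // "%2B%2B%2F%2F"

        System.out.println(urlSafe + " / " + regular + " / " + percentEncoded);
    }
}
```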
DaveCTurner added a commit to DaveCTurner/elasticsearch that referenced this pull request Feb 15, 2021
Today we represent block IDs sent to Azure using the URL-safe base-64
encoding. This makes sense: these IDs appear in URLs. It turns out,
however, that Azure rejects this encoding for block IDs and demands
that they be represented using the regular, URL-unsafe base-64 encoding
instead, then further wrapped in %-encoding to deal with the URL-unsafe
characters that inevitably result.

Relates elastic#66489
Backport of elastic#68957
DaveCTurner added a commit to DaveCTurner/elasticsearch that referenced this pull request Feb 15, 2021
Today we represent block IDs sent to Azure using the URL-safe base-64
encoding. This makes sense: these IDs appear in URLs. It turns out,
however, that Azure rejects this encoding for block IDs and demands
that they be represented using the regular, URL-unsafe base-64 encoding
instead, then further wrapped in %-encoding to deal with the URL-unsafe
characters that inevitably result.

Relates elastic#66489
Backport of elastic#68957
DaveCTurner added a commit that referenced this pull request Feb 15, 2021
Today we represent block IDs sent to Azure using the URL-safe base-64
encoding. This makes sense: these IDs appear in URLs. It turns out,
however, that Azure rejects this encoding for block IDs and demands
that they be represented using the regular, URL-unsafe base-64 encoding
instead, then further wrapped in %-encoding to deal with the URL-unsafe
characters that inevitably result.

Relates #66489
DaveCTurner added a commit that referenced this pull request Feb 15, 2021
Today we represent block IDs sent to Azure using the URL-safe base-64
encoding. This makes sense: these IDs appear in URLs. It turns out,
however, that Azure rejects this encoding for block IDs and demands
that they be represented using the regular, URL-unsafe base-64 encoding
instead, then further wrapped in %-encoding to deal with the URL-unsafe
characters that inevitably result.

Relates #66489
Backport of #68957
DaveCTurner added a commit that referenced this pull request Feb 15, 2021
Today we represent block IDs sent to Azure using the URL-safe base-64
encoding. This makes sense: these IDs appear in URLs. It turns out,
however, that Azure rejects this encoding for block IDs and demands
that they be represented using the regular, URL-unsafe base-64 encoding
instead, then further wrapped in %-encoding to deal with the URL-unsafe
characters that inevitably result.

Relates #66489
Backport of #68957
Labels
:Distributed Coordination/Snapshot/Restore (anything directly related to the `_snapshot/*` APIs)
>enhancement
Team:Distributed (Obsolete) (meta label for distributed team, obsolete; replaced by Distributed Indexing/Coordination)
v7.11.0
v7.12.0
v8.0.0-alpha1
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[CI] AzureBlobStoreRepositoryTests failures
5 participants