Inline TransportReplAct#createReplicatedOperation #41197

DaveCTurner · 2019-04-15T13:10:33Z

TransportReplicationAction.AsyncPrimaryAction#createReplicatedOperation
exists so it can be overridden in tests. This commit re-works these tests to
use a real ReplicationOperation and inlines the now-unnecessary method.

Relates #40706.

`TransportReplicationAction.AsyncPrimaryAction#createReplicatedOperation` exists so it can be overridden in tests. This commit re-works these tests to use a real `ReplicationOperation` and inlines the now-unnecessary method. Relates elastic#40706.

elasticmachine · 2019-04-15T13:10:35Z

Pinging @elastic/es-distributed

DaveCTurner · 2019-04-15T13:11:31Z

.../test/java/org/elasticsearch/action/support/replication/TransportReplicationActionTests.java

@@ -821,57 +809,50 @@ public void testCounterOnPrimary() throws Exception {
        Request request = new Request(shardId);
        PlainActionFuture<TestResponse> listener = new PlainActionFuture<>();
        ReplicationTask task = maybeTask();
-        int i = randomInt(3);
-        final boolean throwExceptionOnCreation = i == 1;


This case doesn't seem to be possible in production, so I removed it.

henningandersen

LGTM.

Thanks @DaveCTurner, I left 3 comments to consider.

henningandersen · 2019-04-15T14:12:54Z

...r/src/main/java/org/elasticsearch/action/support/replication/TransportReplicationAction.java

-                                @Override
-                                public void onFailure(Exception e) {
-                                    handleException(primaryShardReference, e);
+                    final ActionListener<Response> referenceClosingListener = ActionListener.wrap(response -> {


I find the separation into two listeners artificial and a bit confusing. I suggest something like following instead:

final ActionListener<Response> globalCheckpointSyncingListener = ActionListener.wrap(response -> { if (syncGlobalCheckpointAfterOperation) { final IndexShard shard = primaryShardReference.indexShard; try { shard.maybeSyncGlobalCheckpoint("post-operation"); } catch (final Exception e) { // only log non-closed exceptions if (ExceptionsHelper.unwrap( e, AlreadyClosedException.class, IndexShardClosedException.class) == null) { // intentionally swallow, a missed global checkpoint sync should not fail this operation logger.info( new ParameterizedMessage( "{} failed to execute post-operation global checkpoint sync", shard.shardId()), e); } } } primaryShardReference.close(); // release shard operation lock before responding to caller setPhase(replicationTask, "finished"); onCompletionListener.onResponse(response); }, e -> handleException(primaryShardReference, e)); new ReplicationOperation<>(primaryRequest.getRequest(), primaryShardReference, ActionListener.wrap(result -> result.respond(globalCheckpointSyncingListener), globalCheckpointSyncingListener::onFailure), newReplicasProxy(), logger, actionName, primaryRequest.getPrimaryTerm()).execute();

In isolation I agree, but this separation will be important in a followup so I hope it's ok to leave it like it is. The global checkpoint syncing is the responsibility of the primary, whereas the cleanup of the replication task and the primaryShardReference is the responsibility of the reroute/delegation phase.

henningandersen · 2019-04-15T14:35:19Z

.../test/java/org/elasticsearch/action/support/replication/TransportReplicationActionTests.java

            }
-        }.run();
+        }.new AsyncPrimaryAction(primaryRequest, ActionListener.wrap(listener::onResponse, throwable -> {


Could we instead of using ActionListener.wrap just assert that listener.isDone() and do listener.get() like in the test above?

Yes, it seems we can. I pushed 1350c0f.

henningandersen · 2019-04-15T14:43:29Z

.../test/java/org/elasticsearch/action/support/replication/TransportReplicationActionTests.java

            } else {
                throw e;
            }
        }
+
+        if (throwExceptionOnRun || respondWithError) {


nit: I think it is more logical to put this inside the try-catch (after listener.get()) and remove the return above.

Sure, I pushed fb1f7eb.

henningandersen · 2019-04-15T16:17:32Z

...r/src/main/java/org/elasticsearch/action/support/replication/TransportReplicationAction.java

@@ -376,7 +377,8 @@ public void handleException(TransportException exp) {
                                    // intentionally swallow, a missed global checkpoint sync should not fail this operation
                                    logger.info(
                                        new ParameterizedMessage(
-                                            "{} failed to execute post-operation global checkpoint sync", shard.shardId()), e);
+                                            "{} failed to execute post-operation global checkpoint sync",
+                                            primaryShardReference.routingEntry().shardId()), e);


Not sure I follow this change, I cannot figure out how this makes a difference. I think using just shard.shardId() is simpler unless there is a reason for this?

More foreshadowing of changes to come, but I can defer this until later.

This reverts commit a26a986.

`TransportReplicationAction.AsyncPrimaryAction#createReplicatedOperation` exists so it can be overridden in tests. This commit re-works these tests to use a real `ReplicationOperation` and inlines the now-unnecessary method. Relates #40706.

`TransportReplicationAction.AsyncPrimaryAction#createReplicatedOperation` exists so it can be overridden in tests. This commit re-works these tests to use a real `ReplicationOperation` and inlines the now-unnecessary method. Relates elastic#40706.

DaveCTurner added >non-issue :Distributed Indexing/CRUD A catch all label for issues around indexing, updating and getting a doc by id. Not search. v8.0.0 v7.2.0 labels Apr 15, 2019

DaveCTurner requested review from dnhatn and henningandersen April 15, 2019 13:10

DaveCTurner commented Apr 15, 2019

View reviewed changes

henningandersen approved these changes Apr 15, 2019

View reviewed changes

DaveCTurner added 3 commits April 15, 2019 16:36

fail in try block

fb1f7eb

No need to wrap the listener

1350c0f

Less usage of primaryShardReference

a26a986

henningandersen reviewed Apr 15, 2019

View reviewed changes

Revert "Less usage of primaryShardReference"

c5dc9f9

This reverts commit a26a986.

DaveCTurner merged commit 5708796 into elastic:master Apr 16, 2019

DaveCTurner deleted the 2019-04-15-inline-createReplicatedOperation branch April 16, 2019 12:03

jakelandis added v8.0.0-alpha1 and removed v8.0.0 labels Jul 26, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Inline TransportReplAct#createReplicatedOperation #41197

Inline TransportReplAct#createReplicatedOperation #41197

DaveCTurner commented Apr 15, 2019

elasticmachine commented Apr 15, 2019

DaveCTurner Apr 15, 2019

henningandersen left a comment

henningandersen Apr 15, 2019 •

edited

Loading

DaveCTurner Apr 15, 2019

henningandersen Apr 15, 2019

henningandersen Apr 15, 2019

DaveCTurner Apr 15, 2019

henningandersen Apr 15, 2019

DaveCTurner Apr 15, 2019

henningandersen Apr 15, 2019

DaveCTurner Apr 16, 2019

Inline TransportReplAct#createReplicatedOperation #41197

Inline TransportReplAct#createReplicatedOperation #41197

Conversation

DaveCTurner commented Apr 15, 2019

elasticmachine commented Apr 15, 2019

Choose a reason for hiding this comment

henningandersen left a comment

Choose a reason for hiding this comment

henningandersen Apr 15, 2019 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

henningandersen Apr 15, 2019 •

edited

Loading