(6/6) Clean up technical debt in P2PDataStorage and ProtectedStorageEntry objects #3747
Conversation
This will allow us to push the GetData creation inside P2PDataStorage safely.
As part of changing the GetData path, we want to move all creation and processing of GetData messages inside P2PDataStorage. This will allow easier unit testing of the behavior as well as cleaner code in the request handlers that can just focus on nonces, connections, etc.
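As a rough sketch of the shape this is heading toward, the split could look something like the following (assuming the existing bisq.network.p2p message types and Capabilities); the method names match later commits in this series, but the parameter lists are illustrative, not the final signatures:

    // Sketch only: P2PDataStorage owns GetData construction and processing, while
    // the request handlers keep the nonce, timeout, and connection bookkeeping.
    // The parameter shapes below are assumptions for illustration.
    interface GetDataStorageApi {
        // Build a response filtered by the keys the peer already has and by the
        // peer's capabilities; truncation is reported back to the caller.
        GetDataResponse buildGetDataResponse(GetDataRequest request,
                                             int maxEntries,
                                             Capabilities peerCapabilities,
                                             AtomicBoolean outTruncated);

        // Apply a received response to the local stores; adds on this path must
        // update state without broadcasting.
        void processGetDataResponse(GetDataResponse response, NodeAddress sender);
    }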
These are identical test cases to the requestHandler tests, but with far fewer dependencies. The requestHandler tests will eventually be deleted, but they will remain throughout development as an extra safety net.
Add some basic sanity tests prior to the refactor to help catch issues.
This is just a strict move of code to reduce errors.
Changed the log to reference getDataResponse instead of getData. Now that we might truncate the response, it isn't true that this is exactly what the peer asked for.
Move the logging that utilizes connection information into the request handler. Now, buildGetDataResponse just returns whether or not the list is truncated which will make it easier to test.
Remove the dependence on the connection object by having the handler pass in the peer's capabilities. This now allows unit testing of buildGetDataResponse without any connection dependencies.
Write a full set of unit tests for buildGetDataResponse. This provides a safety net with additional refactoring work.
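For a flavor of what these tests look like once the connection dependency is gone, here is a minimal sketch; the buildGetDataResponse signature and the two helper methods are illustrative assumptions, not the production API:

    import java.util.concurrent.atomic.AtomicBoolean;
    import org.junit.Test;
    import static org.junit.Assert.assertTrue;

    public class BuildGetDataResponseSketchTest {
        @Test
        public void truncationIsReportedWhenMoreThanMaxEntriesMatch() {
            // Hypothetical helper that seeds the storage with more entries than maxEntries.
            P2PDataStorage storage = createStorageWithEntries(10);
            AtomicBoolean truncated = new AtomicBoolean(false);

            GetDataResponse response = storage.buildGetDataResponse(
                    emptyGetDataRequest(),   // hypothetical helper: a request excluding no keys
                    5,                       // maxEntries
                    new Capabilities(),      // peer capabilities passed in directly, no Connection needed
                    truncated);

            // A real test would also assert on the response contents; the point here
            // is that no Connection, NetworkNode, or handler mocks are required.
            assertTrue(truncated.get());
        }
    }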
The appendOnlyDataStoreService and map already have unique keys that are based on the hash of the payload. This would catch instances where:
- PersistableNetworkPayload: none. The key is based on ByteArray(payload.getHash()), which is the same as this check.
- ProtectedStorageEntry: cases where multiple PSEs contain payloads that have equivalent hashCode(), but different data.toProtoMessage().toByteArray().
I don't think it is a good idea to keep 2 "unique" methods on payloads. This is likely left over from a time when Payload hashCode() needed to be different than the hash of the payload.
Move the logging function to the common capabilities check so it can run on both ProtectedStoragePayload and PersistableNetworkPayload objects
Move the capability check inside the stream operation. This should improve performance slightly, but more importantly it makes the two filter functions almost identical so they can be combined.
Removes unnecessary calculations converting Set<byte[]> into Set<ByteArray> and allows additional deduplication of stream operations.
Introduce a generic function that can be used to filter Map<ByteArray, PersistableNetworkPayload> or Map<ByteArray, ProtectedStorageEntry>. Used to deduplicate the GetData code paths and ensure the logic is the same between the two payload types.
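A minimal sketch of that shared helper, assuming it sits next to the existing filter code in P2PDataStorage (the method name, the capability-check helper, and the truncation flag are illustrative):

    // Generic over the stored type so PersistableNetworkPayload and
    // ProtectedStorageEntry share one code path. asPayload unwraps the stored
    // object to the payload the capability check runs against.
    static <T> Set<T> filterKnownHashes(
            Map<P2PDataStorage.ByteArray, T> toFilter,
            Function<T, NetworkPayload> asPayload,
            Set<P2PDataStorage.ByteArray> knownHashes,     // keys the requesting peer already has
            Capabilities peerCapabilities,
            int maxEntries,
            AtomicBoolean outTruncated) {

        // Stream once: drop keys the peer already has, drop payloads the peer's
        // capabilities can't handle, and look at most one element past the limit
        // so truncation is only reported when something was actually cut off.
        Set<T> filtered = toFilter.entrySet().stream()
                .filter(entry -> !knownHashes.contains(entry.getKey()))
                .filter(entry -> shouldTransmitPayloadToPeer(     // assumed common capability check
                        peerCapabilities, asPayload.apply(entry.getValue())))
                .limit(maxEntries + 1L)
                .map(Map.Entry::getValue)
                .collect(Collectors.toSet());

        if (filtered.size() > maxEntries) {
            outTruncated.set(true);
            filtered = filtered.stream().limit(maxEntries).collect(Collectors.toSet());
        }
        return filtered;
    }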
Now, the truncation is only triggered if more than MAX_ENTRIES could have been returned.
Add heavy-handed test that exercises the logic to use as a safeguard for refactoring.
Just a code move for now.
Now that we want to unit test the GetData path which has different behavior w.r.t. broadcasts, the tests need a way to verify that state was updated, but not broadcast, during an add. This patch changes all verification functions to take each state update explicitly so the tests can do the proper verification.
Add a full set of unit tests that uncovered some unexpected behavior w.r.t. signalers.
Previously, multiple handlers needed to signal off one global variable. Now that this check is inside the singleton P2PDataStorage, make it non-static and private.
Write a few integration tests that exercise interesting synchronization states, including the lost-remove bug. This fails with the proper validation, but will pass at the end of the new feature development.
- Add more comments
- Use Clock instead of System
- Remove unnecessary AtomicInteger
Name is left over from previous implementation. Change it to be more relevant to the current code and update comments to indicate the current usage.
Checking for null creates hard-to-read code and it is simpler to just create an empty set if we receive a pre-v0.6 GetDataResponse protobuf message that does not have the field set.
The only two users of this constructor are the fromProto path, which already creates an empty Capabilities object if one is not provided, and the internal usage of Capabilities.app, which is initialized to empty. Remove the @Nullable so future readers aren't confused.
…quest The only two users of this constructor are the fromProto path, which now creates an empty Capabilities object similar to GetDataResponse, and the internal usage of Capabilities.app, which is initialized to empty.
Now that all the implementations are unit tested in P2PDataStorage, the old tests can be removed.
Now that the only user is internal, the API can be made private and the tests can be removed. This involved adding a few test cases to processGetDataResponse to ensure the invalid hash size condition was still covered.
Now that more callers have moved internal, the public facing API can be cleaner and simpler. This should lead to a more maintainable API and fewer sharp edges with future development work.
In proto3 the default value is an empty ByteString, and there are no valid uses of passing in null here.
The ProtectedStoragePayload.fromProto code will throw an exception if this is null from the wire so there is no valid use for it to be null.
In proto3 this is initialized to an empty ByteString, so there is no valid use for it to be null.
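For reference, this is standard proto3 generated-Java behavior; the variable and field names below are hypothetical and only show the pattern:

    // proto3 getters never return null:
    //  - a bytes field defaults to ByteString.EMPTY
    //  - a repeated field defaults to an empty list
    // so "missing" is expressed as empty, not null.
    ByteString signature = proto.getSignature();   // 'proto' and the field name are hypothetical
    if (signature.isEmpty()) {
        // handle "not set" explicitly instead of a null check
    }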
It is never changed.
Helps readability when the variable name matches the type.
Before refactoring the function, ensure the tests cover all cases. This fixes a bug where the payload TTL was too low in some instances, causing backDate to do no work when it should have.
1. Remove delete during stream iteration (see the sketch below)
2. Minimize branching with early returns for bad states
3. Use stream filter for readability
4. Implement additional checks that should be done when removing entries
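A minimal sketch of points 1 and 3, assuming a map field and a Clock-based expiry predicate like the ones described in this series (names are illustrative):

    private void removeExpiredEntries() {
        // Collect first, mutate after: deleting from the live map while streaming
        // it is exactly the kind of bug this rework removes.
        Map<ByteArray, ProtectedStorageEntry> expired = map.entrySet().stream()
                .filter(entry -> entry.getValue().isExpired(clock))   // assumed expiry predicate
                .collect(Collectors.toMap(Map.Entry::getKey, Map.Entry::getValue));

        // One place performs the removal plus the extra consistency work
        // (sequence number map, listeners, data stores).
        expired.forEach(this::removeExpiredEntry);   // illustrative cleanup helper
    }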
We already have a garbage collection thread that runs every minute to clean up items. Doing it again during onDisconnect is an unnecessary optimization that adds complexity and caused bugs. For example, the original implementation did not handle the sequence number map correctly and was removing entries during a stream iteration. This also reduces the complexity of testing. There is one code path responsible for reducing ttls and one code path responsible for expiring entries. Much easier to reason about.
ProtectedStorageEntry::backDate() already handles this
p2p/src/main/java/bisq/network/p2p/storage/payload/ProtectedMailboxStorageEntry.java
Ack
@freimair Could you please add how you tested these code changes, so it is easier to compare with other ACKs. Thanks!
    if (closeConnectionReason.isIntended)
        return;

    if (!connection.getPeersNodeAddressOptional().isPresent())
Is there a reason why you prefer using getPeersNodeAddressOptional().isPresent() directly rather than hasPeerNodeAddress()? Usage is mixed in the original code as well, so we might want to settle on one way for the future.
I wish there was a better reason than this one, but I used this pattern because it reduces the number of lint errors from IDEA. If you don't have an isPresent() call prior to a get(), it will highlight the word and raise an alert. You can suppress it, but that is just more code to add.
The other benefit is that mocking is a bit less error-prone because you just need to mock out the Optional and not the wrapper class that has the hasPeerNodeAddress() function.
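For illustration of that mocking point, a Mockito-style sketch (the test class, address value, and surrounding setup are made up):

    import static org.mockito.Mockito.mock;
    import static org.mockito.Mockito.when;

    import java.util.Optional;
    import org.junit.Test;
    import bisq.network.p2p.NodeAddress;
    import bisq.network.p2p.network.Connection;

    public class ConnectionMockSketch {
        @Test
        public void onlyTheOptionalNeedsStubbing() {
            Connection connection = mock(Connection.class);
            NodeAddress peer = new NodeAddress("abc.onion", 8000);   // made-up address

            // One stub answers both "is an address present?" and "what is it?".
            when(connection.getPeersNodeAddressOptional()).thenReturn(Optional.of(peer));

            // Stubbing hasPeerNodeAddress() separately would mean keeping a second
            // stub consistent with this one, which is easy to get wrong.
        }
    }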
ACK
- Tested on local Regtest trading with each peer going offline after each step.
- Tested on local Regtest trader chat with each peer going offline after each message.
s/change/chance/
@ripcurlx This PR is in a strange GitHub state now. It has the "waiting for author" tag, but since it was ACKed and not "Request Changes" I don't have a button to push to move it back to your queue, and I can't remove the label myself. Not sure of the right way to handle this type of workflow, but I wanted to give you a heads up. In any event, I've addressed your comments and it is ready for another look.
It was probably not perfectly handled by myself, as I probably should have asked for "Request Changes" as the review status. In that case I was ok about everything, just had optional remarks. Labels can only be changed by maintainers or triage permission owners. I'll have a look at how to do it automatically with GitHub Actions based on the PR state.
ACK
b6b0026
Motivation
This PR cleans up technical debt in the P2PDataStorage and ProtectedStorageEntry objects; this work should be done before new features are added.
Future patches will change these objects quite a bit and getting them into a state that is easy to reason about will make the reviews easier.
Testing
All existing unit tests continue to pass