HBASE-23205 Correctly update the position of WALs currently being replicated #749

JeongDaeKim · 2019-10-23T12:00:41Z

https://issues.apache.org/jira/browse/HBASE-23205

I fixed a failed test which is not related with this PR. TestReplicationSmallTests.testEmptyWALRecovery,
and a minor bug which is updating replication buffer size wrongly by decreasing total buffer size with the size of bulk loaded files.
I removed the changes above and made a separate jira : https://issues.apache.org/jira/browse/HBASE-23254

Apache-HBase · 2019-10-23T13:27:57Z

💔 -1 overall

Vote	Subsystem	Runtime	Comment
💙	reexec	0m 49s	Docker mode activated.
		_ Prechecks _
💚	dupname	0m 0s	No case conflicting files found.
💚	hbaseanti	0m 0s	Patch does not have any anti-patterns.
💚	@author	0m 0s	The patch does not contain any @author tags.
💚	test4tests	0m 0s	The patch appears to include 4 new or modified test files.
		_ branch-1 Compile Tests _
💚	mvninstall	9m 27s	branch-1 passed
💚	compile	0m 50s	branch-1 passed with JDK v1.8.0_232
💚	compile	0m 52s	branch-1 passed with JDK v1.7.0_242
💚	checkstyle	1m 51s	branch-1 passed
💚	shadedjars	3m 26s	branch has no errors when building our shaded downstream artifacts.
💚	javadoc	0m 42s	branch-1 passed with JDK v1.8.0_232
💚	javadoc	0m 51s	branch-1 passed with JDK v1.7.0_242
💙	spotbugs	3m 15s	Used deprecated FindBugs config; considering switching to SpotBugs.
💚	findbugs	3m 11s	branch-1 passed
		_ Patch Compile Tests _
💚	mvninstall	2m 19s	the patch passed
💚	compile	0m 48s	the patch passed with JDK v1.8.0_232
💚	javac	0m 48s	the patch passed
💚	compile	0m 49s	the patch passed with JDK v1.7.0_242
💚	javac	0m 49s	the patch passed
💔	checkstyle	1m 46s	hbase-server: The patch generated 30 new + 44 unchanged - 10 fixed = 74 total (was 54)
💚	whitespace	0m 1s	The patch has no whitespace issues.
💚	shadedjars	3m 10s	patch has no errors when building our shaded downstream artifacts.
💚	hadoopcheck	5m 30s	Patch does not cause any errors with Hadoop 2.8.5 2.9.2.
💚	javadoc	0m 40s	the patch passed with JDK v1.8.0_232
💚	javadoc	0m 46s	the patch passed with JDK v1.7.0_242
💚	findbugs	3m 30s	the patch passed
		_ Other Tests _
💔	unit	30m 44s	hbase-server in the patch failed.
💚	asflicense	0m 20s	The patch does not generate ASF License warnings.
		76m 33s

Reason	Tests
Failed junit tests	hadoop.hbase.master.TestCatalogJanitor

Subsystem	Report/Notes
Docker	Client=19.03.4 Server=19.03.4 base: https://builds.apache.org/job/HBase-PreCommit-GitHub-PR/job/PR-749/1/artifact/out/Dockerfile
GITHUB PR	#749
Optional Tests	dupname asflicense javac javadoc unit spotbugs findbugs shadedjars hadoopcheck hbaseanti checkstyle compile
uname	Linux aa16fec098c2 4.15.0-60-generic #67-Ubuntu SMP Thu Aug 22 16:55:30 UTC 2019 x86_64 x86_64 x86_64 GNU/Linux
Build tool	maven
Personality	/home/jenkins/jenkins-slave/workspace/HBase-PreCommit-GitHub-PR_PR-749/out/precommit/personality/provided.sh
git revision	branch-1 / `ce65db3`
Default Java	1.7.0_242
Multi-JDK versions	/usr/lib/jvm/zulu-8-amd64:1.8.0_232 /usr/lib/jvm/zulu-7-amd64:1.7.0_242
checkstyle	https://builds.apache.org/job/HBase-PreCommit-GitHub-PR/job/PR-749/1/artifact/out/diff-checkstyle-hbase-server.txt
unit	https://builds.apache.org/job/HBase-PreCommit-GitHub-PR/job/PR-749/1/artifact/out/patch-unit-hbase-server.txt
Test Results	https://builds.apache.org/job/HBase-PreCommit-GitHub-PR/job/PR-749/1/testReport/
Max. process+thread count	740 (vs. ulimit of 10000)
modules	C: hbase-server U: hbase-server
Console output	https://builds.apache.org/job/HBase-PreCommit-GitHub-PR/job/PR-749/1/console
versions	git=1.9.1 maven=3.0.5 findbugs=3.0.1
Powered by	Apache Yetus 0.11.0 https://yetus.apache.org

This message was automatically generated.

JeongDaeKim · 2019-10-24T02:20:48Z

Failed tests are not related to this PR. Tests has broken since #731, and Those will be fixed at #748.

Apache-HBase · 2019-10-24T09:58:44Z

💔 -1 overall

Vote	Subsystem	Runtime	Comment
💙	reexec	0m 41s	Docker mode activated.
		_ Prechecks _
💚	dupname	0m 0s	No case conflicting files found.
💚	hbaseanti	0m 0s	Patch does not have any anti-patterns.
💚	@author	0m 0s	The patch does not contain any @author tags.
💚	test4tests	0m 0s	The patch appears to include 4 new or modified test files.
		_ branch-1 Compile Tests _
💚	mvninstall	8m 25s	branch-1 passed
💚	compile	0m 45s	branch-1 passed with JDK v1.8.0_232
💚	compile	0m 46s	branch-1 passed with JDK v1.7.0_242
💚	checkstyle	1m 48s	branch-1 passed
💚	shadedjars	3m 10s	branch has no errors when building our shaded downstream artifacts.
💚	javadoc	0m 44s	branch-1 passed with JDK v1.8.0_232
💚	javadoc	0m 43s	branch-1 passed with JDK v1.7.0_242
💙	spotbugs	3m 9s	Used deprecated FindBugs config; considering switching to SpotBugs.
💚	findbugs	3m 4s	branch-1 passed
		_ Patch Compile Tests _
💚	mvninstall	2m 5s	the patch passed
💚	compile	0m 41s	the patch passed with JDK v1.8.0_232
💚	javac	0m 41s	the patch passed
💚	compile	0m 47s	the patch passed with JDK v1.7.0_242
💚	javac	0m 47s	the patch passed
💚	checkstyle	1m 44s	hbase-server: The patch generated 0 new + 41 unchanged - 13 fixed = 41 total (was 54)
💚	whitespace	0m 0s	The patch has no whitespace issues.
💚	shadedjars	3m 4s	patch has no errors when building our shaded downstream artifacts.
💚	hadoopcheck	5m 8s	Patch does not cause any errors with Hadoop 2.8.5 2.9.2.
💚	javadoc	0m 30s	the patch passed with JDK v1.8.0_232
💚	javadoc	0m 42s	the patch passed with JDK v1.7.0_242
💚	findbugs	3m 5s	the patch passed
		_ Other Tests _
💔	unit	139m 44s	hbase-server in the patch failed.
💚	asflicense	0m 29s	The patch does not generate ASF License warnings.
		181m 50s

Reason	Tests
Failed junit tests	hadoop.hbase.replication.TestReplicationKillMasterRS
	hadoop.hbase.replication.multiwal.TestReplicationSyncUpToolWithMultipleWAL
	hadoop.hbase.replication.regionserver.TestGlobalReplicationThrottler
	hadoop.hbase.replication.TestReplicationSyncUpTool
	hadoop.hbase.replication.TestReplicationMetricsforUI
	hadoop.hbase.replication.TestPerTableCFReplication
	hadoop.hbase.replication.TestReplicationConfigTracker
	hadoop.hbase.replication.TestVerifyCellsReplicationEndpoint
	hadoop.hbase.replication.TestReplicationSyncUpToolWithBulkLoadedData
	hadoop.hbase.security.visibility.TestVisibilityLabelReplicationWithExpAsString
	hadoop.hbase.replication.multiwal.TestReplicationKillMasterRSCompressedWithMultipleWAL
	hadoop.hbase.regionserver.TestRegionReplicaFailover
	hadoop.hbase.replication.TestReplicationDisableInactivePeer
	hadoop.hbase.replication.TestReplicationStatus
	hadoop.hbase.replication.TestReplicationSmallTests
	hadoop.hbase.replication.TestReplicationKillSlaveRS
	hadoop.hbase.security.visibility.TestVisibilityLabelsReplication
	hadoop.hbase.replication.multiwal.TestReplicationEndpointWithMultipleWAL
	hadoop.hbase.regionserver.TestBulkLoadReplication
	hadoop.hbase.replication.TestReplicationWithTags
	hadoop.hbase.replication.TestReplicationEndpoint
	hadoop.hbase.replication.TestMultiSlaveReplication
	hadoop.hbase.replication.TestReplicationKillMasterRSCompressed
	hadoop.hbase.client.replication.TestReplicationAdminWithClusters

Subsystem	Report/Notes
Docker	Client=19.03.4 Server=19.03.4 base: https://builds.apache.org/job/HBase-PreCommit-GitHub-PR/job/PR-749/2/artifact/out/Dockerfile
GITHUB PR	#749
Optional Tests	dupname asflicense javac javadoc unit spotbugs findbugs shadedjars hadoopcheck hbaseanti checkstyle compile
uname	Linux aa4ca70f49f8 4.15.0-66-generic #75-Ubuntu SMP Tue Oct 1 05:24:09 UTC 2019 x86_64 x86_64 x86_64 GNU/Linux
Build tool	maven
Personality	/home/jenkins/jenkins-slave/workspace/HBase-PreCommit-GitHub-PR_PR-749/out/precommit/personality/provided.sh
git revision	branch-1 / `f0999a1`
Default Java	1.7.0_242
Multi-JDK versions	/usr/lib/jvm/zulu-8-amd64:1.8.0_232 /usr/lib/jvm/zulu-7-amd64:1.7.0_242
unit	https://builds.apache.org/job/HBase-PreCommit-GitHub-PR/job/PR-749/2/artifact/out/patch-unit-hbase-server.txt
Test Results	https://builds.apache.org/job/HBase-PreCommit-GitHub-PR/job/PR-749/2/testReport/
Max. process+thread count	3803 (vs. ulimit of 10000)
modules	C: hbase-server U: hbase-server
Console output	https://builds.apache.org/job/HBase-PreCommit-GitHub-PR/job/PR-749/2/console
versions	git=1.9.1 maven=3.0.5 findbugs=3.0.1
Powered by	Apache Yetus 0.11.0 https://yetus.apache.org

This message was automatically generated.

JeongDaeKim · 2019-10-24T11:24:54Z

I made a typo when i fixed checkstyle warnings 😭 (a8244d2#diff-7d551f2261f4c83aec8a97b7d04427e2R137)

Let me fix it.

Apache-HBase · 2019-10-24T15:48:16Z

💔 -1 overall

Vote	Subsystem	Runtime	Comment
💙	reexec	0m 35s	Docker mode activated.
		_ Prechecks _
💚	dupname	0m 1s	No case conflicting files found.
💚	hbaseanti	0m 0s	Patch does not have any anti-patterns.
💚	@author	0m 0s	The patch does not contain any @author tags.
💚	test4tests	0m 0s	The patch appears to include 4 new or modified test files.
		_ branch-1 Compile Tests _
💚	mvninstall	8m 29s	branch-1 passed
💚	compile	0m 44s	branch-1 passed with JDK v1.8.0_232
💚	compile	0m 49s	branch-1 passed with JDK v1.7.0_242
💚	checkstyle	2m 2s	branch-1 passed
💚	shadedjars	3m 50s	branch has no errors when building our shaded downstream artifacts.
💚	javadoc	0m 44s	branch-1 passed with JDK v1.8.0_232
💚	javadoc	0m 48s	branch-1 passed with JDK v1.7.0_242
💙	spotbugs	3m 56s	Used deprecated FindBugs config; considering switching to SpotBugs.
💚	findbugs	3m 53s	branch-1 passed
		_ Patch Compile Tests _
💚	mvninstall	2m 35s	the patch passed
💚	compile	1m 3s	the patch passed with JDK v1.8.0_232
💚	javac	1m 3s	the patch passed
💚	compile	1m 2s	the patch passed with JDK v1.7.0_242
💚	javac	1m 2s	the patch passed
💚	checkstyle	2m 3s	hbase-server: The patch generated 0 new + 41 unchanged - 13 fixed = 41 total (was 54)
💚	whitespace	0m 1s	The patch has no whitespace issues.
💚	shadedjars	3m 32s	patch has no errors when building our shaded downstream artifacts.
💚	hadoopcheck	6m 2s	Patch does not cause any errors with Hadoop 2.8.5 2.9.2.
💚	javadoc	0m 39s	the patch passed with JDK v1.8.0_232
💚	javadoc	0m 57s	the patch passed with JDK v1.7.0_242
💚	findbugs	3m 56s	the patch passed
		_ Other Tests _
💔	unit	134m 14s	hbase-server in the patch failed.
💚	asflicense	0m 27s	The patch does not generate ASF License warnings.
		182m 37s

Subsystem	Report/Notes
Docker	Client=19.03.4 Server=19.03.4 base: https://builds.apache.org/job/HBase-PreCommit-GitHub-PR/job/PR-749/3/artifact/out/Dockerfile
GITHUB PR	#749
Optional Tests	dupname asflicense javac javadoc unit spotbugs findbugs shadedjars hadoopcheck hbaseanti checkstyle compile
uname	Linux 4e1244156651 4.15.0-66-generic #75-Ubuntu SMP Tue Oct 1 05:24:09 UTC 2019 x86_64 x86_64 x86_64 GNU/Linux
Build tool	maven
Personality	/home/jenkins/jenkins-slave/workspace/HBase-PreCommit-GitHub-PR_PR-749/out/precommit/personality/provided.sh
git revision	branch-1 / `41f6713`
Default Java	1.7.0_242
Multi-JDK versions	/usr/lib/jvm/zulu-8-amd64:1.8.0_232 /usr/lib/jvm/zulu-7-amd64:1.7.0_242
unit	https://builds.apache.org/job/HBase-PreCommit-GitHub-PR/job/PR-749/3/artifact/out/patch-unit-hbase-server.txt
Test Results	https://builds.apache.org/job/HBase-PreCommit-GitHub-PR/job/PR-749/3/testReport/
Max. process+thread count	3832 (vs. ulimit of 10000)
modules	C: hbase-server U: hbase-server
Console output	https://builds.apache.org/job/HBase-PreCommit-GitHub-PR/job/PR-749/3/console
versions	git=1.9.1 maven=3.0.5 findbugs=3.0.1
Powered by	Apache Yetus 0.11.0 https://yetus.apache.org

This message was automatically generated.

…licated

JeongDaeKim · 2019-10-30T02:12:12Z

added a minor fix in test code and rebased.

Apache-HBase · 2019-10-30T05:09:26Z

🎊 +1 overall

Vote	Subsystem	Runtime	Comment
💙	reexec	10m 24s	Docker mode activated.
		_ Prechecks _
💚	dupname	0m 0s	No case conflicting files found.
💚	hbaseanti	0m 0s	Patch does not have any anti-patterns.
💚	@author	0m 0s	The patch does not contain any @author tags.
💚	test4tests	0m 0s	The patch appears to include 4 new or modified test files.
		_ branch-1 Compile Tests _
💚	mvninstall	8m 35s	branch-1 passed
💚	compile	0m 47s	branch-1 passed with JDK v1.8.0_232
💚	compile	0m 50s	branch-1 passed with JDK v1.7.0_242
💚	checkstyle	2m 2s	branch-1 passed
💚	shadedjars	3m 35s	branch has no errors when building our shaded downstream artifacts.
💚	javadoc	0m 45s	branch-1 passed with JDK v1.8.0_232
💚	javadoc	0m 48s	branch-1 passed with JDK v1.7.0_242
💙	spotbugs	3m 36s	Used deprecated FindBugs config; considering switching to SpotBugs.
💚	findbugs	3m 32s	branch-1 passed
		_ Patch Compile Tests _
💚	mvninstall	2m 25s	the patch passed
💚	compile	0m 47s	the patch passed with JDK v1.8.0_232
💚	javac	0m 47s	the patch passed
💚	compile	0m 51s	the patch passed with JDK v1.7.0_242
💚	javac	0m 51s	the patch passed
💚	checkstyle	1m 52s	hbase-server: The patch generated 0 new + 41 unchanged - 13 fixed = 41 total (was 54)
💚	whitespace	0m 0s	The patch has no whitespace issues.
💚	shadedjars	3m 34s	patch has no errors when building our shaded downstream artifacts.
💚	hadoopcheck	5m 52s	Patch does not cause any errors with Hadoop 2.8.5 2.9.2.
💚	javadoc	0m 38s	the patch passed with JDK v1.8.0_232
💚	javadoc	0m 48s	the patch passed with JDK v1.7.0_242
💚	findbugs	3m 45s	the patch passed
		_ Other Tests _
💚	unit	120m 32s	hbase-server in the patch passed.
💚	asflicense	0m 27s	The patch does not generate ASF License warnings.
		176m 55s

Subsystem	Report/Notes
Docker	Client=19.03.4 Server=19.03.4 base: https://builds.apache.org/job/HBase-PreCommit-GitHub-PR/job/PR-749/4/artifact/out/Dockerfile
GITHUB PR	#749
Optional Tests	dupname asflicense javac javadoc unit spotbugs findbugs shadedjars hadoopcheck hbaseanti checkstyle compile
uname	Linux 840d003e793d 4.15.0-66-generic #75-Ubuntu SMP Tue Oct 1 05:24:09 UTC 2019 x86_64 x86_64 x86_64 GNU/Linux
Build tool	maven
Personality	/home/jenkins/jenkins-slave/workspace/HBase-PreCommit-GitHub-PR_PR-749/out/precommit/personality/provided.sh
git revision	branch-1 / `5e414f2`
Default Java	1.7.0_242
Multi-JDK versions	/usr/lib/jvm/zulu-8-amd64:1.8.0_232 /usr/lib/jvm/zulu-7-amd64:1.7.0_242
Test Results	https://builds.apache.org/job/HBase-PreCommit-GitHub-PR/job/PR-749/4/testReport/
Max. process+thread count	3894 (vs. ulimit of 10000)
modules	C: hbase-server U: hbase-server
Console output	https://builds.apache.org/job/HBase-PreCommit-GitHub-PR/job/PR-749/4/console
versions	git=1.9.1 maven=3.0.5 findbugs=3.0.1
Powered by	Apache Yetus 0.11.0 https://yetus.apache.org

This message was automatically generated.

wchevreuil

Thanks for the analysis. Had done a first round of reading through, but the PR seems a bit large to grasp it all in one go, hence some of the question in my comments.

It would be nice to keep changes to a minimal, adding only modifications really needed to fix the problem. For example, there are few variable/method renaming, moving to different class, just for personal/cosmetic preferences, together with additional unrelated fixes, such as the mentioned metric one (if that's not needed here, please open a separate jira to it).

hbase-server/src/test/java/org/apache/hadoop/hbase/replication/TestReplicationSmallTests.java

...src/main/java/org/apache/hadoop/hbase/replication/regionserver/ReplicationSourceManager.java

wchevreuil · 2019-10-30T10:34:45Z

.../java/org/apache/hadoop/hbase/replication/regionserver/ReplicationSourceWALReaderThread.java

                  break;
                }
              }
-            } else {


Where is the log position getting updated now if the current edit is not targeted to replication?

I tried not to update log position every single filtered entry. because I found a lot of updates (setData) happened in zookeeper tx logs, even though all entries were filtered.

log position will be updated in these case.

limits(quota, size, count) reached, or a batch has entries when eof reached.

wal rolled

when reader read all wals in recovery queue.

I think updating log position is required for 1) cleanup old logs, 2) replication queue recovery.
for 1) it would be enough to update log position only when log rolled.
for 2) the log position should be updated to the position of the last replicated entry.

in any case, we don't need to update log position aggressively for filtered entries. entries would be filtered again for recovery case.

So if a filtered edit came, any sub-sequent non-filterable one would need to wait for a log roll? That could take too long for some use cases.

any sub-sequent non-filterable one would need to wait for a log roll? That could take too long for some use cases.

If some use cases means "no mutations come for a long time, but a batch has entries", this case is the one of the 1) case i mentioned a batch has entries when eof reached. reader would reach the eof, and log position would be updated.

In addition, even while testing this issue with heavy writes, I observed the reader frequently reached EOF.

If some use cases means "no mutations come for a long time, but a batch has entries"

What if the whole WAL section read got no entries for replication? In this case, batch would be empty, so ReplicationSourceManager.logPositionAndCleanOldLogs does not ever get called (at least, I guess, until the log is rolled).

I think the answer to my question above is in the resetStream() that gets called at the end of the second while loop, which will update lastReadPosition variable that is now used for reading here.

What if the whole WAL section read got no entries for replication? In this case, batch would be empty, so ReplicationSourceManager.logPositionAndCleanOldLogs does not ever get called (at least, I guess, until the log is rolled).

Yes, In that case, the position will be updated when log rolled. That is my intention. #749 (comment)

I think the answer to my question above is in the resetStream() that gets called at the end of the second while loop, which will update lastReadPosition variable that is now used for reading here.

Oh? Then, I think i didn't understand what your question is. 🤣

I guess this is fine for the replication progress problem. One additional issue, though, is regarding monitoring. IIRC, DumpReplicationQueues relies on replication info available at ZK, so now it may not show an accurate position for the log queue. We may need to expose ReplicationSourceWALReaderThread.lastReadPosition via getter method for monitoring purposes.

.../java/org/apache/hadoop/hbase/replication/regionserver/ReplicationSourceWALReaderThread.java

wchevreuil · 2019-10-30T11:03:28Z

...erver/src/test/java/org/apache/hadoop/hbase/replication/regionserver/TestWALEntryStream.java

@@ -378,37 +380,35 @@ public void testReplicationSourceWALReaderThread() throws Exception {
  }

  @Test
-  public void testReplicationSourceUpdatesLogPositionOnFilteredEntries() throws Exception {
+  public void testReplicationSourceWALReaderThreadRecoveredQueue() throws Exception {


Why this test has been changed from it's original purpose of checking for upating wal position when no edits targeted to replication are read? Also the name does not seem accurate, it does not seem to create a recovered queue scenario.

I removed testReplicationSourceUpdatesLogPositionOnFilteredEntries, because the behavior of the reader is changed. (no updates every filtered entry).

And, i made recovered queue scenario by setting queue info as recovered queue. getQueueInfo("1-1")
https://github.com/apache/hbase/pull/749/files/114aa1b1a7b9919c5429fadcb74079cd08629513#diff-05e8e2a626166f52e5737f8bcdc49e39R401

If it is not well recognized as intended, let me add comments or getRecoveredQueueInfo()?

I changed the method name to getRecoveredQueueInfo()

Actually yeah, this is indeed simulating recovered queue when creating it as recovered. I think this test is fine.

Signed-off-by: Wellington Chevreuil <[email protected]>

JeongDaeKim · 2019-10-31T10:35:15Z

Thanks for the review!

there are few variable/method renaming, moving to different class, just for personal/cosmetic preferences

I see. I'll make my changes smaller to remain only necessary ones.

together with additional unrelated fixes, such as the mentioned metric one (if that's not needed here, please open a separate jira to it).

Let me file a new jira then 👍

Apache-HBase · 2019-10-31T15:20:16Z

🎊 +1 overall

Vote	Subsystem	Runtime	Comment
💙	reexec	0m 35s	Docker mode activated.
		_ Prechecks _
💚	dupname	0m 0s	No case conflicting files found.
💚	hbaseanti	0m 0s	Patch does not have any anti-patterns.
💚	@author	0m 0s	The patch does not contain any @author tags.
💚	test4tests	0m 0s	The patch appears to include 4 new or modified test files.
		_ branch-1 Compile Tests _
💚	mvninstall	8m 26s	branch-1 passed
💚	compile	0m 42s	branch-1 passed with JDK v1.8.0_232
💚	compile	0m 48s	branch-1 passed with JDK v1.7.0_242
💚	checkstyle	1m 47s	branch-1 passed
💚	shadedjars	3m 9s	branch has no errors when building our shaded downstream artifacts.
💚	javadoc	0m 36s	branch-1 passed with JDK v1.8.0_232
💚	javadoc	0m 42s	branch-1 passed with JDK v1.7.0_242
💙	spotbugs	3m 3s	Used deprecated FindBugs config; considering switching to SpotBugs.
💚	findbugs	3m 1s	branch-1 passed
		_ Patch Compile Tests _
💚	mvninstall	2m 5s	the patch passed
💚	compile	0m 41s	the patch passed with JDK v1.8.0_232
💚	javac	0m 41s	the patch passed
💚	compile	0m 46s	the patch passed with JDK v1.7.0_242
💚	javac	0m 46s	the patch passed
💚	checkstyle	1m 41s	hbase-server: The patch generated 0 new + 42 unchanged - 12 fixed = 42 total (was 54)
💚	whitespace	0m 0s	The patch has no whitespace issues.
💚	shadedjars	3m 7s	patch has no errors when building our shaded downstream artifacts.
💚	hadoopcheck	5m 11s	Patch does not cause any errors with Hadoop 2.8.5 2.9.2.
💚	javadoc	0m 30s	the patch passed with JDK v1.8.0_232
💚	javadoc	0m 42s	the patch passed with JDK v1.7.0_242
💚	findbugs	3m 6s	the patch passed
		_ Other Tests _
💚	unit	118m 17s	hbase-server in the patch passed.
💚	asflicense	0m 28s	The patch does not generate ASF License warnings.
		159m 54s

Subsystem	Report/Notes
Docker	Client=19.03.4 Server=19.03.4 base: https://builds.apache.org/job/HBase-PreCommit-GitHub-PR/job/PR-749/5/artifact/out/Dockerfile
GITHUB PR	#749
Optional Tests	dupname asflicense javac javadoc unit spotbugs findbugs shadedjars hadoopcheck hbaseanti checkstyle compile
uname	Linux f4a65f785f70 4.15.0-66-generic #75-Ubuntu SMP Tue Oct 1 05:24:09 UTC 2019 x86_64 x86_64 x86_64 GNU/Linux
Build tool	maven
Personality	/home/jenkins/jenkins-slave/workspace/HBase-PreCommit-GitHub-PR_PR-749/out/precommit/personality/provided.sh
git revision	branch-1 / `4bcc397`
Default Java	1.7.0_242
Multi-JDK versions	/usr/lib/jvm/zulu-8-amd64:1.8.0_232 /usr/lib/jvm/zulu-7-amd64:1.7.0_242
Test Results	https://builds.apache.org/job/HBase-PreCommit-GitHub-PR/job/PR-749/5/testReport/
Max. process+thread count	3765 (vs. ulimit of 10000)
modules	C: hbase-server U: hbase-server
Console output	https://builds.apache.org/job/HBase-PreCommit-GitHub-PR/job/PR-749/5/console
versions	git=1.9.1 maven=3.0.5 findbugs=3.0.1
Powered by	Apache Yetus 0.11.0 https://yetus.apache.org

This message was automatically generated.

…CallableWithReplicas (apache#780) Signed-off-by: Sean Busbey <[email protected]>

…alue every time [Take2] (apache#748) * HBASE-23185 Fix test failure by HBASE-23185 changes * HBASE-23185 Fix high cpu usage because getTable()#put() gets config value every time This reverts commit db2ce23.

Signed-off-by: Andrew Purtell <[email protected]>

…che#789) Signed-off-by: Sean Busbey <[email protected]> Signed-off-by: Guangxu Cheng <[email protected]>

Apache-HBase · 2019-11-05T11:51:03Z

💔 -1 overall

Vote	Subsystem	Runtime	Comment
💙	reexec	9m 52s	Docker mode activated.
		_ Prechecks _
💚	dupname	0m 0s	No case conflicting files found.
💚	hbaseanti	0m 0s	Patch does not have any anti-patterns.
💚	@author	0m 0s	The patch does not contain any @author tags.
💚	test4tests	0m 0s	The patch appears to include 4 new or modified test files.
		_ branch-1 Compile Tests _
💚	mvninstall	8m 33s	branch-1 passed
💚	compile	0m 40s	branch-1 passed with JDK v1.8.0_232
💚	compile	0m 47s	branch-1 passed with JDK v1.7.0_242
💚	checkstyle	1m 47s	branch-1 passed
💚	shadedjars	3m 7s	branch has no errors when building our shaded downstream artifacts.
💚	javadoc	0m 37s	branch-1 passed with JDK v1.8.0_232
💚	javadoc	0m 42s	branch-1 passed with JDK v1.7.0_242
💙	spotbugs	3m 4s	Used deprecated FindBugs config; considering switching to SpotBugs.
💚	findbugs	3m 0s	branch-1 passed
		_ Patch Compile Tests _
💚	mvninstall	2m 5s	the patch passed
💚	compile	0m 42s	the patch passed with JDK v1.8.0_232
💚	javac	0m 42s	the patch passed
💚	compile	0m 47s	the patch passed with JDK v1.7.0_242
💚	javac	0m 47s	the patch passed
💔	checkstyle	1m 44s	hbase-server: The patch generated 1 new + 42 unchanged - 12 fixed = 43 total (was 54)
💚	whitespace	0m 0s	The patch has no whitespace issues.
💚	shadedjars	3m 4s	patch has no errors when building our shaded downstream artifacts.
💚	hadoopcheck	5m 10s	Patch does not cause any errors with Hadoop 2.8.5 2.9.2.
💚	javadoc	0m 31s	the patch passed with JDK v1.8.0_232
💚	javadoc	0m 42s	the patch passed with JDK v1.7.0_242
💚	findbugs	3m 7s	the patch passed
		_ Other Tests _
💚	unit	119m 47s	hbase-server in the patch passed.
💚	asflicense	0m 27s	The patch does not generate ASF License warnings.
		170m 44s

Subsystem	Report/Notes
Docker	Client=19.03.4 Server=19.03.4 base: https://builds.apache.org/job/HBase-PreCommit-GitHub-PR/job/PR-749/6/artifact/out/Dockerfile
GITHUB PR	#749
Optional Tests	dupname asflicense javac javadoc unit spotbugs findbugs shadedjars hadoopcheck hbaseanti checkstyle compile
uname	Linux e4ff635ca663 4.15.0-66-generic #75-Ubuntu SMP Tue Oct 1 05:24:09 UTC 2019 x86_64 x86_64 x86_64 GNU/Linux
Build tool	maven
Personality	/home/jenkins/jenkins-slave/workspace/HBase-PreCommit-GitHub-PR_PR-749/out/precommit/personality/provided.sh
git revision	branch-1 / `3f9ce86`
Default Java	1.7.0_242
Multi-JDK versions	/usr/lib/jvm/zulu-8-amd64:1.8.0_232 /usr/lib/jvm/zulu-7-amd64:1.7.0_242
checkstyle	https://builds.apache.org/job/HBase-PreCommit-GitHub-PR/job/PR-749/6/artifact/out/diff-checkstyle-hbase-server.txt
Test Results	https://builds.apache.org/job/HBase-PreCommit-GitHub-PR/job/PR-749/6/testReport/
Max. process+thread count	3964 (vs. ulimit of 10000)
modules	C: hbase-server U: hbase-server
Console output	https://builds.apache.org/job/HBase-PreCommit-GitHub-PR/job/PR-749/6/console
versions	git=1.9.1 maven=3.0.5 findbugs=3.0.1
Powered by	Apache Yetus 0.11.0 https://yetus.apache.org

This message was automatically generated.

Apache-HBase · 2019-11-05T17:08:12Z

💔 -1 overall

Vote	Subsystem	Runtime	Comment
💙	reexec	0m 35s	Docker mode activated.
		_ Prechecks _
💚	dupname	0m 1s	No case conflicting files found.
💚	hbaseanti	0m 0s	Patch does not have any anti-patterns.
💚	@author	0m 0s	The patch does not contain any @author tags.
💚	test4tests	0m 0s	The patch appears to include 4 new or modified test files.
		_ branch-1 Compile Tests _
💚	mvninstall	8m 28s	branch-1 passed
💚	compile	0m 41s	branch-1 passed with JDK v1.8.0_232
💚	compile	0m 48s	branch-1 passed with JDK v1.7.0_242
💚	checkstyle	1m 47s	branch-1 passed
💚	shadedjars	3m 8s	branch has no errors when building our shaded downstream artifacts.
💚	javadoc	0m 37s	branch-1 passed with JDK v1.8.0_232
💚	javadoc	0m 41s	branch-1 passed with JDK v1.7.0_242
💙	spotbugs	3m 2s	Used deprecated FindBugs config; considering switching to SpotBugs.
💚	findbugs	2m 58s	branch-1 passed
		_ Patch Compile Tests _
💚	mvninstall	2m 5s	the patch passed
💚	compile	0m 40s	the patch passed with JDK v1.8.0_232
💚	javac	0m 40s	the patch passed
💚	compile	0m 48s	the patch passed with JDK v1.7.0_242
💚	javac	0m 48s	the patch passed
💚	checkstyle	1m 44s	hbase-server: The patch generated 0 new + 42 unchanged - 12 fixed = 42 total (was 54)
💚	whitespace	0m 0s	The patch has no whitespace issues.
💚	shadedjars	3m 4s	patch has no errors when building our shaded downstream artifacts.
💚	hadoopcheck	5m 8s	Patch does not cause any errors with Hadoop 2.8.5 2.9.2.
💚	javadoc	0m 30s	the patch passed with JDK v1.8.0_232
💚	javadoc	0m 42s	the patch passed with JDK v1.7.0_242
💚	findbugs	3m 7s	the patch passed
		_ Other Tests _
💔	unit	121m 10s	hbase-server in the patch failed.
💚	asflicense	0m 27s	The patch does not generate ASF License warnings.
		162m 39s

Reason	Tests
Failed junit tests	hadoop.hbase.client.TestReplicasClient

Subsystem	Report/Notes
Docker	Client=19.03.4 Server=19.03.4 base: https://builds.apache.org/job/HBase-PreCommit-GitHub-PR/job/PR-749/7/artifact/out/Dockerfile
GITHUB PR	#749
Optional Tests	dupname asflicense javac javadoc unit spotbugs findbugs shadedjars hadoopcheck hbaseanti checkstyle compile
uname	Linux 65d46fa7b0b2 4.15.0-66-generic #75-Ubuntu SMP Tue Oct 1 05:24:09 UTC 2019 x86_64 x86_64 x86_64 GNU/Linux
Build tool	maven
Personality	/home/jenkins/jenkins-slave/workspace/HBase-PreCommit-GitHub-PR_PR-749/out/precommit/personality/provided.sh
git revision	branch-1 / `3f9ce86`
Default Java	1.7.0_242
Multi-JDK versions	/usr/lib/jvm/zulu-8-amd64:1.8.0_232 /usr/lib/jvm/zulu-7-amd64:1.7.0_242
unit	https://builds.apache.org/job/HBase-PreCommit-GitHub-PR/job/PR-749/7/artifact/out/patch-unit-hbase-server.txt
Test Results	https://builds.apache.org/job/HBase-PreCommit-GitHub-PR/job/PR-749/7/testReport/
Max. process+thread count	3887 (vs. ulimit of 10000)
modules	C: hbase-server U: hbase-server
Console output	https://builds.apache.org/job/HBase-PreCommit-GitHub-PR/job/PR-749/7/console
versions	git=1.9.1 maven=3.0.5 findbugs=3.0.1
Powered by	Apache Yetus 0.11.0 https://yetus.apache.org

This message was automatically generated.

…ould be at INFO Signed-off-by: Jan Hentschel <[email protected]>

JeongDaeKim · 2019-11-06T09:55:08Z

The failed test seems not related to this pr. (It failed after a commit just adding a new line), and I can't reproduce it in my local repo.

I think PR is ready to get review. please have a look. @wchevreuil

JeongDaeKim · 2019-11-27T02:01:22Z

No further comments on this PR? If any lacks of description or something for code reviews, please let me know. If not, I just want this PR to be merged, and backported to 1.4.

@wchevreuil Do you still have something to be changed in this PR?

wchevreuil · 2019-11-27T09:37:52Z

Hi @JeongDaeKim , apologies for the delay. I think the solution is good, but since this is changing considerably how we track log reading position, am just taking a conservative approach. I would like to do a bit of testing. Please give me until end of this week to approve it, or suggest changes.

wchevreuil

Thanks for your patience, @JeongDaeKim ! I think I finally got a better understanding of the chages logic, along with the new and modified tests. I had put on some additional comments within the code, but some additional thoughts:

We might want to expose ReplicationSourceWALReaderThread.lastReadPosition, in order to eventually have an accurate monitoring info. Currently we have _DumpReplicationQueues` which just reads info from ZK. Maybe we should print a warning there that reported log position may not be accurate.
Can we add a 3rd test on TestWalEntryStream that adds few filterable entries, then adds a non filterable one, and checks if this non filterable comes in from the batch?

wchevreuil · 2019-11-29T14:45:49Z

...erver/src/test/java/org/apache/hadoop/hbase/replication/regionserver/TestWALEntryStream.java

@@ -378,37 +380,35 @@ public void testReplicationSourceWALReaderThread() throws Exception {
  }

  @Test
-  public void testReplicationSourceUpdatesLogPositionOnFilteredEntries() throws Exception {
+  public void testReplicationSourceWALReaderThreadRecoveredQueue() throws Exception {


Actually yeah, this is indeed simulating recovered queue when creating it as recovered. I think this test is fine.

wchevreuil · 2019-11-29T15:05:40Z

.../java/org/apache/hadoop/hbase/replication/regionserver/ReplicationSourceWALReaderThread.java

                  break;
                }
              }
-            } else {


I guess this is fine for the replication progress problem. One additional issue, though, is regarding monitoring. IIRC, DumpReplicationQueues relies on replication info available at ZK, so now it may not show an accurate position for the log queue. We may need to expose ReplicationSourceWALReaderThread.lastReadPosition via getter method for monitoring purposes.

wchevreuil · 2019-11-29T15:24:04Z

...erver/src/test/java/org/apache/hadoop/hbase/replication/regionserver/TestWALEntryStream.java

+    // reader won't put any batch, even if EOF reached.
+    ExecutorService executor = Executors.newSingleThreadExecutor();
+    Future<WALEntryBatch> future = executor.submit(new Callable<WALEntryBatch>() {
+      @Override
+      public WALEntryBatch call() throws Exception {
+        return reader.take();
+      }
+    });
+    Thread.sleep(2000);
+    assertFalse(future.isDone());


Just a heads up here: we may simplify this part if we decide to make ReplicationSourceWALReaderThread.lastReadPosition exposed via getter method.

) - switch to nexus-staging-maven-plugin for asf-release - cleaned up some tabs in the root pom (differs from master because there are no release scripts here.) Signed-off-by: stack <[email protected]> (cherry picked from commit 97e0107)

…ction disabled in branch-1 (apache#899) Signed-off-by: Balazs Meszaros <[email protected]> Signed-off-by Anoop Sam John <[email protected]>

…File is a reference file Signed-off-by: Lijin Bin <[email protected]>

… to a capacity rule (apache#894) Signed-off-by Wellington Chevreuil <[email protected]>

We have this nice description in the java doc on ITBLL but it's unformatted and thus illegible. Add some formatting so that it can be read by humans. Signed-off-by: Jan Hentschel <[email protected]> Signed-off-by: Josh Elser <[email protected]>

JeongDaeKim · 2019-12-11T11:36:25Z

We might want to expose ReplicationSourceWALReaderThread.lastReadPosition, in order to eventually have an accurate monitoring info. Currently we have _DumpReplicationQueues` which just reads info from ZK. Maybe we should print a warning there that reported log position may not be accurate

I think "reported log position" from DumpReplicationQueues could not be current read position before this PR (no updates during making a batch). DumpReplicationQueues works well for their purpose.
If we want to see lastReadPosition for monitoring, what about adding some messages to https://github.com/apache/hbase/blob/branch-1.4/hbase-server/src/main/java/org/apache/hadoop/hbase/replication/regionserver/ReplicationSource.java#L474

Can we add a 3rd test on TestWalEntryStream that adds few filterable entries, then adds a non filterable one, and checks if this non filterable comes in from the batch?

I see, added a new test

…pache#896) Differs from original by removing Capacity Unit examples since that feature isn't on this branch. (cherry picked from commit a553b78) (cherry picked from commit 2a1efe0)

wchevreuil

LGTM +1.

Let me try run the pre-commit hook before merging just to make sure.

…licated

…to HBASE-23205

wchevreuil · 2019-12-16T08:48:18Z

No luck with the pre-commit. I tried a rebase, but still the job fails while starting. Ain't sure if it's something specific to the PR commits. @JeongDaeKim , would u mind squash these commits on one of you local branches, then open a new PR for branch-1 with this? Please ping me once you open the new PR.

wchevreuil · 2020-01-02T11:59:15Z

Merged PR 944, so am closing this one. Thanks for the contribution and patience, @JeongDaeKim !

JeongDaeKim force-pushed the HBASE-23205 branch from b3e9a4e to a8244d2 Compare October 24, 2019 06:54

JeongDaeKim and others added 4 commits October 29, 2019 16:15

HBASE-23205 Correctly update the position of WALs currently being rep…

6f1f7bc

…licated

fix checkstyle warnings

affc75a

Fix typo

b6efe7e

(fix) close writer

114aa1b

JeongDaeKim force-pushed the HBASE-23205 branch from df8efb2 to 114aa1b Compare October 30, 2019 02:11

wchevreuil requested changes Oct 30, 2019

View reviewed changes

HBASE-23229 Update branch-1 to 1.6.0-SNAPSHOT (apache#772)

4bcc397

Signed-off-by: Wellington Chevreuil <[email protected]>

Jeongdae Kim added 2 commits October 31, 2019 19:53

(fix) revert test for HBASE-18137

1dbf6f7

Revert unnecessary codes

c0b8f7b

wchevreuil and others added 6 commits October 31, 2019 17:11

HBASE-23238 Additional test and checks for null references on Scanner…

577db5d

…CallableWithReplicas (apache#780) Signed-off-by: Sean Busbey <[email protected]>

HBASE-23185 Fix high cpu usage because getTable()#put() gets config v…

3c7c1b5

…alue every time [Take2] (apache#748) * HBASE-23185 Fix test failure by HBASE-23185 changes * HBASE-23185 Fix high cpu usage because getTable()#put() gets config value every time This reverts commit db2ce23.

HBASE-23219 Re-enable ZKLess tests for branch-1 (Revert HBASE-14622)

2451023

Signed-off-by: Andrew Purtell <[email protected]>

HBASE-23246 Fix error prone warning in TestMetricsUserSourceImpl (apa…

3f9ce86

…che#789) Signed-off-by: Sean Busbey <[email protected]> Signed-off-by: Guangxu Cheng <[email protected]>

(fix) Change newly added method name

75620b0

(fix) add getRecoveredQueueInfo() to make a test more recognizable

c92d79e

(fix) a check style warning

d3ed533

HBASE-23250 Log message about CleanerChore delegate initialization sh…

1360816

…ould be at INFO Signed-off-by: Jan Hentschel <[email protected]>

wchevreuil reviewed Nov 29, 2019

View reviewed changes

busbey and others added 7 commits December 3, 2019 23:03

HBASE-23359 RS going down with NPE when splitting a region with compa…

737eaa6

…ction disabled in branch-1 (apache#899) Signed-off-by: Balazs Meszaros <[email protected]> Signed-off-by Anoop Sam John <[email protected]>

HBASE-22096 /storeFile.jsp shows CorruptHFileException when the store…

ec55c2a

…File is a reference file Signed-off-by: Lijin Bin <[email protected]>

HBASE-23364 HRegionServer sometimes does not shut down.

9b10afd

HBASE-23073 Add an optional costFunction to balance regions according…

f5171b4

… to a capacity rule (apache#894) Signed-off-by Wellington Chevreuil <[email protected]>

HBASE-23552 Format Javadocs on ITBLL

80c3581

We have this nice description in the java doc on ITBLL but it's unformatted and thus illegible. Add some formatting so that it can be read by humans. Signed-off-by: Jan Hentschel <[email protected]> Signed-off-by: Josh Elser <[email protected]>

(fix) Add a new test and expose an api

67ca8db

HBASE-23360 [CLI] Fix help command 'set_quota' for removing limits (a…

84c0a90

…pache#896) Differs from original by removing Capacity Unit examples since that feature isn't on this branch. (cherry picked from commit a553b78) (cherry picked from commit 2a1efe0)

wchevreuil approved these changes Dec 13, 2019

View reviewed changes

JeongDaeKim and others added 12 commits December 13, 2019 15:14

HBASE-23205 Correctly update the position of WALs currently being rep…

871e2ea

…licated

fix checkstyle warnings

2142ded

Fix typo

6a574ff

(fix) close writer

16d56dd

(fix) revert test for HBASE-18137

3e83af8

Revert unnecessary codes

bb5492d

(fix) Change newly added method name

b541c24

(fix) add getRecoveredQueueInfo() to make a test more recognizable

833467c

(fix) a check style warning

5cc0dca

(fix) log a message even in empty batch case

9e08eea

(fix) Add a new test and expose an api

d6297a7

Merge branch 'HBASE-23205' of https://github.com/JeongDaeKim/hbase in…

572c73b

…to HBASE-23205

JeongDaeKim mentioned this pull request Dec 16, 2019

HBASE-23205 Correctly update the position of WALs currently being replicated (2) #944

Merged

wchevreuil closed this Jan 2, 2020

JeongDaeKim deleted the HBASE-23205 branch January 3, 2020 07:29

HBASE-23205 Correctly update the position of WALs currently being replicated #749

HBASE-23205 Correctly update the position of WALs currently being replicated #749

Conversation

JeongDaeKim commented Oct 23, 2019 • edited Loading

Apache-HBase commented Oct 23, 2019

JeongDaeKim commented Oct 24, 2019 • edited Loading

Apache-HBase commented Oct 24, 2019

JeongDaeKim commented Oct 24, 2019

Apache-HBase commented Oct 24, 2019

JeongDaeKim commented Oct 30, 2019

Apache-HBase commented Oct 30, 2019

wchevreuil left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

wchevreuil Nov 19, 2019 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

JeongDaeKim Nov 20, 2019 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

JeongDaeKim commented Oct 31, 2019

Apache-HBase commented Oct 31, 2019

Apache-HBase commented Nov 5, 2019

Apache-HBase commented Nov 5, 2019

JeongDaeKim commented Nov 6, 2019 • edited Loading

JeongDaeKim commented Nov 27, 2019

wchevreuil commented Nov 27, 2019

wchevreuil left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

JeongDaeKim commented Dec 11, 2019

wchevreuil left a comment

Choose a reason for hiding this comment

wchevreuil commented Dec 16, 2019

wchevreuil commented Jan 2, 2020

JeongDaeKim commented Oct 23, 2019 •

edited

Loading

JeongDaeKim commented Oct 24, 2019 •

edited

Loading

wchevreuil Nov 19, 2019 •

edited

Loading

JeongDaeKim Nov 20, 2019 •

edited

Loading

JeongDaeKim commented Nov 6, 2019 •

edited

Loading