
Reduce CPU usage of gradle run #49055

Merged

jaymode merged 2 commits into elastic:master from gradle_run_stop_burning_me on Nov 14, 2019

Conversation

@jaymode (Member) commented Nov 13, 2019

The RunTask is responsible for logging output from nodes to the console
and also stays active since we want the cluster to keep running.
However, the implementation of the logging and waiting resulted in a
spin loop that continually polls for data to have been written to one
of the nodes' output files. On my laptop, this causes an idle
invocation of `gradle run` to consume an entire core.

The JDK provides a method to be notified of changes to files through
the use of a WatchService. While a WatchService based implementation
for logging and waiting works, a delay of up to ten seconds is
encountered when running on macOS. This is due to the lack of a native
WatchService implementation that uses kqueue or FSEvents; the current
WatchService implementation in the JDK uses polling with a default
interval of ten seconds. While the interval can be changed
programmatically, doing so is not an acceptable solution because it
requires access to the com.sun.nio.file.SensitivityWatchEventModifier
enum, which is in an internal package.
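
For reference, a minimal sketch of what the rejected WatchService-based approach looks like. This is illustrative only, not the RunTask code; the directory path and the event handling are placeholders:

```java
import java.io.IOException;
import java.nio.file.FileSystems;
import java.nio.file.Path;
import java.nio.file.StandardWatchEventKinds;
import java.nio.file.WatchKey;
import java.nio.file.WatchService;

public class OutputFileWatcher {

    public static void watchAndLog(Path nodeOutputDir) throws IOException, InterruptedException {
        try (WatchService watchService = FileSystems.getDefault().newWatchService()) {
            // Register the directory containing the node output files for modification events.
            nodeOutputDir.register(watchService, StandardWatchEventKinds.ENTRY_MODIFY);
            while (true) {
                // Blocks until an event is available. On macOS the JDK falls back to a
                // polling implementation with a default interval of ten seconds, which is
                // the latency problem described above.
                WatchKey key = watchService.take();
                key.pollEvents().forEach(event -> {
                    // read newly appended data from the modified file and log it here
                });
                if (key.reset() == false) {
                    break; // the watched directory is no longer accessible
                }
            }
        }
    }
}
```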

The change in this commit instead introduces a check to see if any data
is available to read and log. If no data is available in any of the
node output files, the thread sleeps for 100ms. This is enough time to
prevent consuming large amounts of CPU while still providing output to
the console in a timely fashion.
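
A minimal sketch of the poll-then-sleep pattern described above (not the actual RunTask implementation; how the readers are created and closed is omitted for illustration):

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.util.List;

public class NodeOutputPump {

    // Each reader tails one node's output file.
    static void pumpOutput(List<BufferedReader> nodeOutputReaders) throws InterruptedException {
        while (true) {
            boolean readData = false;
            for (BufferedReader reader : nodeOutputReaders) {
                try {
                    // ready() is non-blocking, so lines are only consumed when data is present.
                    while (reader.ready()) {
                        System.out.println(reader.readLine());
                        readData = true;
                    }
                } catch (IOException e) {
                    throw new UncheckedIOException(e);
                }
            }
            if (readData == false) {
                // No node produced new output; back off instead of spinning on the files.
                Thread.sleep(100L);
            }
        }
    }
}
```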

@jaymode jaymode requested a review from alpar-t November 13, 2019 19:40
@elasticmachine (Collaborator) commented:

Pinging @elastic/es-core-infra (:Core/Infra/Build)

@alpar-t (Contributor) left a comment:


LGTM, thanks for catching it!

@pugnascotia (Contributor) left a comment:


LGTM.

In case we ever want to revisit this, there seem to be a couple of macOS implementations that use JNA and the Carbon APIs to provide a native watch implementation. For example:

https://github.com/gmethvin/directory-watcher
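
As a rough sketch of how that library could be wired in (assuming its builder API roughly as documented in the project README; the path and the event handling here are placeholders):

```java
import io.methvin.watcher.DirectoryChangeEvent;
import io.methvin.watcher.DirectoryWatcher;

import java.nio.file.Path;
import java.nio.file.Paths;

public class DirectoryWatcherExample {

    public static void main(String[] args) throws Exception {
        Path nodeOutputDir = Paths.get("build/testclusters"); // placeholder path
        DirectoryWatcher watcher = DirectoryWatcher.builder()
            .path(nodeOutputDir)
            .listener((DirectoryChangeEvent event) -> {
                if (event.eventType() == DirectoryChangeEvent.EventType.MODIFY) {
                    // read newly appended data from event.path() and log it here
                }
            })
            .build();
        // watchAsync() avoids the JDK's polling WatchService by using the library's
        // native macOS file-watching support.
        watcher.watchAsync();
    }
}
```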

@jasontedor (Member) commented:

The Carbon APIs were never made 64-bit and have been removed since macOS Catalina, when 32-bit support was dropped; we definitely wouldn't want to build a solution on them.

@mark-vieira (Contributor) commented:

I remember we encountered these same limitations when implementing continuous builds in Gradle. In the end we just lived with the fact that the macOS experience has the potential for high latency.

Thanks for catching this, Jay. The only other option I'd see is to refactor this to use JavaExec and stream stdout back directly. That has the potential to be a significant change though, as we'd effectively have to reimplement any logic we want from our launch script in the build.
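
For the record, a hypothetical sketch of that alternative: a plain JavaExec task that forks the node JVM and streams its output straight to the console. The task name, classpath, main class, and wiring are all placeholders, not the actual Elasticsearch launch configuration:

```java
import org.gradle.api.Plugin;
import org.gradle.api.Project;
import org.gradle.api.tasks.JavaExec;

public class RunNodePlugin implements Plugin<Project> {

    @Override
    public void apply(Project project) {
        project.getTasks().register("runNode", JavaExec.class, task -> {
            // JavaExec forks a JVM and streams its output directly, so no file polling
            // is needed; the drawback is reimplementing the launch script logic here.
            task.setClasspath(project.getConfigurations().getByName("runtimeClasspath"));
            task.setMain("org.elasticsearch.bootstrap.Elasticsearch"); // placeholder main class
            task.setStandardOutput(System.out);
            task.setErrorOutput(System.err);
        });
    }
}
```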

@jaymode jaymode merged commit e0df257 into elastic:master Nov 14, 2019
@jaymode jaymode deleted the gradle_run_stop_burning_me branch November 14, 2019 17:18
jaymode added a commit that referenced this pull request Nov 14, 2019
@mark-vieira added the Team:Delivery (Meta label for Delivery team) label on Nov 11, 2020
Labels: :Delivery/Build (Build or test infrastructure), >non-issue, Team:Delivery (Meta label for Delivery team), v7.6.0, v8.0.0-alpha1