Native-image i.e. GraalVM builds of Zookeeper and CLIs #29

solsson · 2020-04-24T04:49:59Z

Faster startup and less resource consumption matters a lot for things we start often; not so much for actual Kafka brokers. This PR replaces #25.

We're actually running native zookeeper on GKE now in a test cluster. JMX isn't working so logs frequently spit out the below stack trace, but the clusters do work. The experiment lives in Yolean/kubernetes-kafka#311, note the scale-6-9 variant. An exploration of the entrypoints that the Kafka distribution's various ./bin/*.sh wrapper scripts generate is inherent in our work here because with native-image a lot of the options to java processes must be set at compile time. IMO the shell script are a layer of hidden assumptions when running in containers, and because they generate the same startup command every time given the same pod spec this exposure of the actual args allows for better understanding and more clear customizations.

More effort must be spent to make logging work. Ideally with the log4j.properties files that JVM images use, but for now it's quite adequate to chose between slf4j-nop and slf4j-simple (none or info-level logging respectively). The zookeeper build uses slf4j-simple.

[QuorumPeer[myid=9](plain=[0:0:0:0:0:0:0:0]:2181)(secure=disabled)] WARN org.apache.zookeeper.server.ZooKeeperServer - Failed to register with JMX
java.lang.NullPointerException
	at org.apache.zookeeper.jmx.MBeanRegistry.register(MBeanRegistry.java:108)
	at org.apache.zookeeper.server.ServerCnxnFactory.registerConnection(ServerCnxnFactory.java:183)
	at org.apache.zookeeper.server.ZooKeeperServer.finishSessionInit(ZooKeeperServer.java:726)
	at org.apache.zookeeper.server.quorum.Learner.revalidate(Learner.java:630)
	at org.apache.zookeeper.server.quorum.Follower.processPacket(Follower.java:165)
	at org.apache.zookeeper.server.quorum.Follower.followLeader(Follower.java:93)
	at org.apache.zookeeper.server.quorum.QuorumPeer.run(QuorumPeer.java:1253)
	at com.oracle.svm.core.thread.JavaThreads.threadStartRoutine(JavaThreads.java:527)
	at com.oracle.svm.core.posix.thread.PosixJavaThreads.pthreadStartRoutine(PosixJavaThreads.java:193)

…ssing at runtime now

and nativeagent manually because for now we'll have to craft the native-image Dockerfiles

missing at runtime now. We'll probably get a lot of these when we run with --allow-incomplete-classpath

…onfig json

in both the entrypoint step and native-image

the slf4j impl with slf4j-simple

This reverts commit b39dd5f3d6dd04bbeb8a74a795f73c98e9c586c8.

type during parsing: io.netty.util.internal.logging.Log4J2Logger. To diagnose the issue you can use the --allow-incomplete-classpath option

solsson · 2020-05-19T11:53:41Z

Current status is that 9c702d5 had to be added because of e2695a3. That flag increases the need for testing with real use cases.

Both admin CLIs and Zookeeper log JMX errors. I assume that no native-image user will want JMX, and the errors seem to have no side effects (other than performance?) but look ugly in logs and for zookeeper actually makes the logs difficult to read. We probably need to start writing @Substitute code to get rid of these errors.

though here it might not make any difference

solsson · 2020-07-12T08:22:15Z

Note that both solsson/kafka:native-cli (including images for individual admin commands) and solsson/kafka:native-zookeeper-server-start use hard coded logging settings, i.e. log4j.properties has no effect. The issue with configured logging is that reflection-config.json probably needs to be tailored to different log4j configurations because they depend on different classes.

The resulting builds from this PR are solsson/kafka:native-cli@sha256:fbf29c59182fb87921c5199783d2d5796856ecbfe34a9c03eca658b3cf50f3c4 and solsson/kafka:native-zookeeper-server-start@sha256:ba3a0632240b8906a3b5bb6441e98ad9d9de73cb716b156ca68f1b435c819e8b.

solsson added 30 commits April 8, 2020 21:49

A way to generate native-image config json

2e5d50e

Start from the generated nativeagent dockerfile

d8bc54b

Hand edited to explore native-image command variations

dfe86a9

Now failing on a java.lang.ref.SoftReference

3345348

native-image passes when Quarkus' kafka-client is in the classpath

d874c70

Output to a writable location

d8a5e9d

distroless with support for unchanged ./bin/*.sh commands

863d300

In practice we'll often do scripting

fb1d666

The reflect config I had in kafka-unwrap, plus the Loader that was mi…

62838d8

…ssing at runtime now

Adds a naming convention to build native with Docker Hub,

6a9adc7

and nativeagent manually because for now we'll have to craft the native-image Dockerfiles

Adds topics list command to verify create, and actually merge configs

d3306a3

Folder rename

319e6c2

The reflect config I had in kafka-unwrap, plus the Loader that was

6add1be

missing at runtime now. We'll probably get a lot of these when we run with --allow-incomplete-classpath

Start from manually maintained native resources json

6885b71

I get an NPE after every kafka-topics run with the original reflect c…

7a7095f

…onfig json

We should version all config resource results

9d8f908

Adds an image that can combine command line tools and scripts

6d05c9c

Fixes kafka-topics support for --zookeeper

b9726c4

Tracking kafka and zk native-agent results, for the fun of it

bf2a7ec

Removes the old native base image, not included in build hooks

98100a2

Use current build's (in the ./hooks/build sequence) graalvm image

3a4f669

Build nativebase always, but don't push; use in subsequent builds

171ed8a

native-cli needs to support scripting

3241d34

Adds utilities and entrypoint to native-cli

e9e0cd5

Avoids useradd because it results in a ~25 MB layer

54cbb4f

WIP another zookeeper

7c1caa2

Zookeeper native-image passes when classpath is minimized,

39897cf

in both the entrypoint step and native-image

WIP zookeeper logging at INFO level

81a5c85

More zookeeper config

ebab223

Declarative FTW, but 3 zk instances made no difference to native conf

cbea571

solsson added 8 commits May 19, 2020 13:07

Using static logging config now

6bc3bbf

A fresh nativeagent run (see README.md)

0e91446

There's compile time dependencies to log4j, but we can still replace

87026d9

the slf4j impl with slf4j-simple

Revert "Using static logging config now"

34326a8

This reverts commit b39dd5f3d6dd04bbeb8a74a795f73c98e9c586c8.

Fails to resolve UnresolvedElementException: Discovered unresolved

e2695a3

type during parsing: io.netty.util.internal.logging.Log4J2Logger. To diagnose the issue you can use the --allow-incomplete-classpath option

For when class loading fails towards the edges of our use cases

9c702d5

Reuse a single dockerfile for all cli applications

44c32bc

Builds pass locally now (:native, see README.md)

213d17a

solsson added 2 commits May 19, 2020 13:56

Fixes a documentation mistake

df6ad4a

Need to use a variable in build ARG to get kafka's .sh names

f43548a

solsson mentioned this pull request May 20, 2020

The nonroot scale-1 variant does not seem to work in Minikube Yolean/kubernetes-kafka#318

Closed

solsson added 4 commits May 20, 2020 19:25

Logging isn't great with CLIs, in particular when used in scripts

9f85435

GraalVM 20.1 has been released

6740685

Apparently there's a new prefix to the download version

8e3e989

Enables the new graalvm feature to pass on termination signals

6d1552f

solsson mentioned this pull request Jul 10, 2020

Try to speed up kafka admin CLI invocations #25

Closed

solsson added 9 commits July 10, 2020 08:08

Quarkus 1.6.0 officially supports graalvm 20.1.0,

d10b342

though here it might not make any difference

Refreshes runtime images

35c4501

Gotcha in hooks/build: ./native is built to :nativebase

4e61120

The official zk image supports JMXDISABLE=true but we must keep sed'ing

45f82b8

Merge branch 'master' into native

4d41551

wip

70e76a3

Use named layers for copy, in case we add more layers

4b23d8a

Static logging for zookeeper, to avoid log4j's reflection

03ca0e5

We'll probably stick with kafka's bundled zookeeper until KIP-500

fa81999

solsson merged commit 84d9453 into master Jul 12, 2020

solsson added a commit to Yolean/kubernetes-kafka that referenced this pull request Jul 12, 2020

Uses the last builds from solsson/dockerfiles#29

cee5278

solsson mentioned this pull request Aug 15, 2020

GraalVM native-image builds based on the Kafka 2.6.0 release #31

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Native-image i.e. GraalVM builds of Zookeeper and CLIs #29

Native-image i.e. GraalVM builds of Zookeeper and CLIs #29

solsson commented Apr 24, 2020

solsson commented May 19, 2020

solsson commented Jul 12, 2020 •

edited

Loading

Native-image i.e. GraalVM builds of Zookeeper and CLIs #29

Native-image i.e. GraalVM builds of Zookeeper and CLIs #29

Conversation

solsson commented Apr 24, 2020

solsson commented May 19, 2020

solsson commented Jul 12, 2020 • edited Loading

solsson commented Jul 12, 2020 •

edited

Loading