Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Kafka 0.11 on debian:stretch #5

Merged
merged 31 commits into from
Jul 26, 2017
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
31 commits
Select commit Hold shift + click to select a range
f5fc72c
Runs the official Kafka distribution on the new Debian stable
solsson Jul 20, 2017
eeee33b
https://kafka.apache.org/downloads says "2.11 is recommended"
solsson Jul 20, 2017
99f4d45
Uses Zulu jre/jdk like cp-docker-images base does
solsson Jul 20, 2017
8bf55a1
Drops a layer
solsson Jul 20, 2017
c4066a7
Starts a build-contract to smoke test and exemplify image use cases
solsson Jul 20, 2017
028aed1
Fixes topic creation, but with startup timing issues because the scri…
solsson Jul 20, 2017
f52bd3b
Adds a consumer
solsson Jul 20, 2017
bdea1ad
For previous commit
solsson Jul 20, 2017
ea54416
Quick attempt to verify that the image works with Connect. Class does…
solsson Jul 20, 2017
16ef01b
Allow automatic topic creation again, to simplify test setup
solsson Jul 20, 2017
73ea978
Each connect setup will have to be its own docker build...
solsson Jul 20, 2017
7b82d0f
Quite useful log aggregation
solsson Jul 20, 2017
845faff
worker.properties can basically be copied among connect builds
solsson Jul 21, 2017
50e0c9a
Demonstrates JMX to HTTP /metrics
solsson Jul 21, 2017
4b183a5
Demonstrates built connect jar, but messages are encoded json in oute…
solsson Jul 21, 2017
87d3172
Splits into separate docker-compose setups per area of interest
solsson Jul 22, 2017
74ad3aa
Now adds less than 6 MB to the official image
solsson Jul 22, 2017
415bb65
Clarifies that we use 3rd party source here
solsson Jul 22, 2017
941e719
Merge branch 'build-contract' into kafka-011-debian
solsson Jul 22, 2017
e4de7e6
Honors the nature of Kafka distributions
solsson Jul 22, 2017
385eff8
Tries to replace the old image behavior with instructions
solsson Jul 22, 2017
e21ae32
Slightly shorter
solsson Jul 22, 2017
46de3f4
Merge pull request #7 from solsson/image-help-entrypoint
solsson Jul 23, 2017
9978188
Adds the nc command as it's useful for liveness probes
solsson Jul 23, 2017
b34eb6c
Adds base image, as Zulu will be upgraded less frequently than Kafka
solsson Jul 25, 2017
15ba474
Builds the kafka image from the base jre image
solsson Jul 25, 2017
17f0f6e
Uses a specific base image build. 58 MB download on top of debian.
solsson Jul 25, 2017
180ff1d
Merge pull request #9 from solsson/jre-base-image
solsson Jul 25, 2017
08ee3b6
Changes scala env names to match assumptions in kafka-run-class.sh
solsson Jul 25, 2017
e7bd905
Reflects kafka-run-class.sh better. We don't have Scala, neither at i…
solsson Jul 25, 2017
0314080
Slightly cleaner installation
solsson Jul 26, 2017
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
61 changes: 56 additions & 5 deletions README.md
Original file line number Diff line number Diff line change
@@ -1,11 +1,62 @@
# dockerfiles
# Kafka docker builds

Automated [Kafka](http://kafka.apache.org/) builds for [solsson/kafka](https://hub.docker.com/r/solsson/kafka/)
and related `kafka-` images under https://hub.docker.com/u/solsson/.

Nowadays we're using separate repositories for dockerization projects.
---

This repo used to contain misc dockerfiles, but they've moved to separate repositories for dockerization projects.
We've kept the repository name to avoid breaking the automated build of solsson/kafka in Docker Hub.

For legacy Dockerfiles from this repo (if you navigated to here from a Docker Hub [solsson](https://hub.docker.com/u/solsson/) image),
see https://github.com/solsson/dockerfiles/tree/misc-dockerfiles.

# Kafka docker builds
---

Our kafka images are tested in production with https://github.com/Yolean/kubernetes-kafka/.

You most likely need to mount your own config files, or for `./bin/kafka-server-start.sh` use overrides like:
```
--override zookeeper.connect=zookeeper:2181
--override log.dirs=/var/lib/kafka/data/topics
--override log.retention.hours=-1
--override broker.id=0
--override advertised.listener=PLAINTEXT://kafka-0:9092
```

## One image to rule them all

Official [Kafka distributions](http://kafka.apache.org/downloads) contain startup scripts and config for various services and clients. Thus `./kafka` produces a multi-purpose image for direct use and specialized docker builds.

We could build specialized images like `kafka-server` but we have two reasons not to:
* Won't be as transparent in Docker Hub because you can't use Automated Build without scripting.
* In reality you'll need to control your own config anyway.

### Example of downstream image: Kafka Connect

See ./connect-jmx

### Example downstream image: Kafka Streams

TODO

## Building

Rudimentary compliance with kubernetes-kafka is tested using a [build-contract](https://github.com/Yolean/build-contract/).

Build and test using: `docker run -v /var/run/docker.sock:/var/run/docker.sock -v $(pwd)/:/source solsson/build-contract test`. However... while timing issues remain you need some manual intervention:

This repository maintains automated [Kafka](http://kafka.apache.org/) builds for https://hub.docker.com/r/solsson/kafka/
and related `kafka-` images under https://hub.docker.com/u/solsson/, used with https://github.com/Yolean/kubernetes-kafka/.
```bash
compose='docker-compose -f build-contracts/docker-compose.yml'
$compose up -d zookeeper kafka-0
$compose logs zookeeper kafka-0
# can we create topics using the image's provided script?
$compose up test-topic-create-1
# can a producer send messages using snappy (has issues before with a class missing in the image)
$compose up test-snappy-compression
$compose up test-consume-all
# demo the log/file aggregation image
docker-compose -f build-contracts/docker-compose.files-aggregation.yml up
# demo the JMX->kafka image
docker-compose -f build-contracts/docker-compose.monitoring.yml up
```
49 changes: 49 additions & 0 deletions build-contracts/docker-compose.files-aggregation.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,49 @@
version: '2.0'
services:

zookeeper:
build: ../kafka
entrypoint: ./bin/zookeeper-server-start.sh
command:
- config/zookeeper.properties

kafka-0:
build: ../kafka
links:
- zookeeper
entrypoint: ./bin/bin/kafka-server-start.sh
command:
- config/server.properties
- --override
- zookeeper.connect=zookeeper:2181
- --override
- broker.id=0
- --override
- advertised.listener=PLAINTEXT://kafka-0:9092

connect-files:
build: ../connect-files
labels:
com.yolean.build-target: ""
links:
- kafka-0

test-connect-files-real-logs:
build: ../connect-files
links:
- kafka-0
volumes:
- /var/log:/logs

test-consume-files:
image: solsson/kafkacat@sha256:1266d140c52cb39bf314b6f22b6d7a01c4c9084781bc779fdfade51214a713a8
labels:
com.yolean.build-contract: ""
command:
- -b
- kafka-0:9092
- -t
- files-000
- -C
- -o
- beginning
77 changes: 77 additions & 0 deletions build-contracts/docker-compose.monitoring.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,77 @@
version: '2.0'
services:

zookeeper:
build: ../kafka
entrypoint: ./bin/zookeeper-server-start.sh
command:
- config/zookeeper.properties

kafka-0:
build: ../kafka
links:
- zookeeper
environment:
- JMX_PORT=5555
expose:
- '5555'
entrypoint: ./bin/bin/kafka-server-start.sh
command:
- config/server.properties
- --override
- zookeeper.connect=zookeeper:2181
- --override
- broker.id=0
- --override
- advertised.listener=PLAINTEXT://kafka-0:9092

prometheus-jmx-exporter:
build: ../prometheus-jmx-exporter
labels:
com.yolean.build-target: ""
links:
- kafka-0
# patch a config before start, as the image is designed for use with local JMX (same k8s pod)
entrypoint: /bin/bash
command:
- -c
- >
sed -i 's|127.0.0.1|kafka-0|' example_configs/kafka-prometheus-monitoring.yml;
cat example_configs/kafka-prometheus-monitoring.yml;
java -jar jmx_prometheus_httpserver.jar
5556 example_configs/kafka-prometheus-monitoring.yml

test-metrics-export:
image: solsson/curl@sha256:8b0927b81d10043e70f3e05e33e36fb9b3b0cbfcbccdb9f04fd53f67a270b874
labels:
com.yolean.build-contract: ""
command:
- --fail-early
- --retry
- '10'
- --retry-delay
- '3'
- --retry-connrefused
- http://prometheus-jmx-exporter:5556/metrics

connect-jmx:
build: ../connect-jmx
labels:
com.yolean.build-target: ""
links:
- kafka-0

# TODO starts too fast, gets % KC_ERROR: Failed to query metadata for topic jmx-test: Local: Broker transport failure
# needs to retry until kafka+topic exists
test-jmx:
image: solsson/kafkacat@sha256:1266d140c52cb39bf314b6f22b6d7a01c4c9084781bc779fdfade51214a713a8
labels:
com.yolean.build-contract: ""
command:
- -b
- kafka-0:9092
- -t
- jmx-test
- -C
- -o
- beginning
69 changes: 69 additions & 0 deletions build-contracts/docker-compose.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,69 @@
version: '2.0'
services:

zookeeper:
build: ../kafka
entrypoint: ./bin/zookeeper-server-start.sh
command:
- config/zookeeper.properties

kafka-0:
build: ../kafka
image: solsson/kafka
labels:
com.yolean.build-target: ""
links:
- zookeeper
entrypoint: ./bin/kafka-server-start.sh
command:
- config/server.properties
- --override
- zookeeper.connect=zookeeper:2181
- --override
- broker.id=0
# unlike Kubernetes StatefulSet, compose gives containers a random hostname (leading to redirects to a hex name)
- --override
- advertised.listener=PLAINTEXT://kafka-0:9092

test-topic-create:
build: ../kafka
labels:
com.yolean.build-contract: ""
links:
- kafka-0
entrypoint: ./bin/kafka-topics.sh
command:
- --zookeeper
- zookeeper:2181
- --create
- --topic
- test-topic-create
- --partitions
- '1'
- --replication-factor
- '1'

test-snappy-compression:
image: solsson/kafkacat@sha256:1266d140c52cb39bf314b6f22b6d7a01c4c9084781bc779fdfade51214a713a8
labels:
com.yolean.build-contract: ""
entrypoint: /bin/sh
command:
- -exc
- sleep 5; echo "Message from $${HOSTNAME} at $$(date)" | kafkacat -z snappy -b kafka-0:9092 -t test1 -P

# TODO starts too fast, gets % KC_ERROR: Failed to query metadata for topic test1: Local: Broker transport failure
# needs to retry until kafka+topic exists
test-consume-all:
image: solsson/kafkacat@sha256:1266d140c52cb39bf314b6f22b6d7a01c4c9084781bc779fdfade51214a713a8
labels:
com.yolean.build-contract: ""
command:
- -b
- kafka-0:9092
- -t
- test1
- -C
- -o
- beginning
- -e
12 changes: 12 additions & 0 deletions connect-files/Dockerfile
Original file line number Diff line number Diff line change
@@ -0,0 +1,12 @@
FROM solsson/kafka:0.11.0.0

COPY worker.properties ./config/
COPY connect-files.sh ./bin/

ENV FILES_LIST_CMD="find /logs/ -name *.log"

# Set up some sample logs
RUN mkdir /logs/; \
echo "Mount /logs and/or change FILES_LIST_CMD (currently '$FILES_LIST_CMD') to read real content instead" > /logs/samplefile1.log;

ENTRYPOINT ["./bin/connect-files.sh"]
22 changes: 22 additions & 0 deletions connect-files/connect-files.sh
Original file line number Diff line number Diff line change
@@ -0,0 +1,22 @@
#!/bin/bash
set -e

FILES=$($FILES_LIST_CMD)

id=0
connectors=""
for FILE in $FILES; do
((++id))
echo "$id: $FILE"
cat <<HERE > ./config/connect-file-source-$id.properties
name=local-file-source-${id}
connector.class=FileStreamSource
tasks.max=1
file=${FILE}
topic=files-000
HERE

connectors="$connectors ./config/connect-file-source-$id.properties"
done

./bin/connect-standalone.sh ./config/worker.properties $connectors
48 changes: 48 additions & 0 deletions connect-files/worker.properties
Original file line number Diff line number Diff line change
@@ -0,0 +1,48 @@
# Licensed to the Apache Software Foundation (ASF) under one or more
# contributor license agreements. See the NOTICE file distributed with
# this work for additional information regarding copyright ownership.
# The ASF licenses this file to You under the Apache License, Version 2.0
# (the "License"); you may not use this file except in compliance with
# the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.

# These are defaults. This file just demonstrates how to override some settings.
bootstrap.servers=kafka-0:9092

# The converters specify the format of data in Kafka and how to translate it into Connect data. Every Connect user will
# need to configure these based on the format they want their data in when loaded from or stored into Kafka
key.converter=org.apache.kafka.connect.json.JsonConverter
value.converter=org.apache.kafka.connect.json.JsonConverter
# Converter-specific settings can be passed in by prefixing the Converter's setting with the converter we want to apply
# it to
key.converter.schemas.enable=true
value.converter.schemas.enable=true

# The internal converter used for offsets and config data is configurable and must be specified, but most users will
# always want to use the built-in default. Offset and config data is never visible outside of Kafka Connect in this format.
internal.key.converter=org.apache.kafka.connect.json.JsonConverter
internal.value.converter=org.apache.kafka.connect.json.JsonConverter
internal.key.converter.schemas.enable=false
internal.value.converter.schemas.enable=false

offset.storage.file.filename=/tmp/connect.offsets
# Flush much faster than normal, which is useful for testing/debugging
offset.flush.interval.ms=10000

# Set to a list of filesystem paths separated by commas (,) to enable class loading isolation for plugins
# (connectors, converters, transformations). The list should consist of top level directories that include
# any combination of:
# a) directories immediately containing jars with plugins and their dependencies
# b) uber-jars with plugins and their dependencies
# c) directories immediately containing the package directory structure of classes of plugins and their dependencies
# Note: symlinks will be followed to discover dependencies or plugins.
# Examples:
# plugin.path=/usr/local/share/java,/usr/local/share/kafka/plugins,/opt/connectors,
#plugin.path=
31 changes: 31 additions & 0 deletions connect-jmx/Dockerfile
Original file line number Diff line number Diff line change
@@ -0,0 +1,31 @@
FROM solsson/kafka:0.11.0.0

ENV srijiths-kafka-connectors-version=dc0a7122650e697d3ae97c970a4785bbed949479

RUN set -ex; \
buildDeps='curl ca-certificates'; \
apt-get update && apt-get install -y $buildDeps --no-install-recommends; \
\
MAVEN_VERSION=3.5.0 PATH=$PATH:$(pwd)/maven/bin; \
mkdir ./maven; \
curl -SLs https://archive.apache.org/dist/maven/maven-3/$MAVEN_VERSION/binaries/apache-maven-$MAVEN_VERSION-bin.tar.gz | tar -xzf - --strip-components=1 -C ./maven; \
mvn --version; \
\
mkdir ./kafka-connectors; \
cd ./kafka-connectors; \
curl -SLs https://github.com/srijiths/kafka-connectors/archive/$srijiths-kafka-connectors-version.tar.gz \
| tar -xzf - --strip-components=1 -C ./; \
mvn clean install; \
cd ..; \
mv ~/.m2/repository/com/sree/kafka/kafka-connect-jmx/0.0.1/kafka-connect-jmx-0.0.1-jar-with-dependencies.jar ./libs/; \
rm -rf ./kafka-connectors; \
rm -rf ./maven ~/.m2; \
\
apt-get purge -y --auto-remove $buildDeps; \
rm -rf /var/lib/apt/lists/*; \
rm /var/log/dpkg.log /var/log/apt/*.log

COPY *.properties ./config/

ENTRYPOINT ["./bin/connect-standalone.sh"]
CMD ["./config/worker.properties", "./config/connect-jmx.properties"]
Loading