Switch to using scala 2.9.2 #1

azymnis · 2012-08-14T19:28:35Z

Compiled and used fine. I had issues with the tests though.

ijuma · 2015-07-08T08:43:47Z

This is no longer relevant (there is a new build system and 2.9.2 is supported). Can you please close this? Unfortunately, we cannot close it ourselves without raising a ticket with Apache Infra.

add kafka-clients and log4j to pom.xml

[LOGBROKER-726] Fix tests & debianization

This may be a reason why we see Jenkins jobs time out at times. I can reproduce it locally.

With current trunk it is possible to run into this:

```sh
"kafka-streams-close-thread" #585 daemon prio=5 os_prio=0 tid=0x00007f66d052d800 nid=0x7e02 waiting for monitor entry [0x00007f66ae2e5000]
   java.lang.Thread.State: BLOCKED (on object monitor)
    at org.apache.kafka.streams.processor.internals.StreamThread.close(StreamThread.java:345)
    - waiting to lock <0x000000077d33c538> (a org.apache.kafka.streams.processor.internals.StreamThread)
    at org.apache.kafka.streams.KafkaStreams$1.run(KafkaStreams.java:474)
    at java.lang.Thread.run(Thread.java:745)

"appId-bd262a91-5155-4a35-bc46-c6432552c2c5-StreamThread-97" #583 prio=5 os_prio=0 tid=0x00007f66d052f000 nid=0x7e01 waiting for monitor entry [0x00007f66ae4e6000]
   java.lang.Thread.State: BLOCKED (on object monitor)
    at org.apache.kafka.streams.KafkaStreams.setState(KafkaStreams.java:219)
    - waiting to lock <0x000000077d335760> (a org.apache.kafka.streams.KafkaStreams)
    at org.apache.kafka.streams.KafkaStreams.access$100(KafkaStreams.java:117)
    at org.apache.kafka.streams.KafkaStreams$StreamStateListener.onChange(KafkaStreams.java:259)
    - locked <0x000000077d42f138> (a org.apache.kafka.streams.KafkaStreams$StreamStateListener)
    at org.apache.kafka.streams.processor.internals.StreamThread.setState(StreamThread.java:168)
    - locked <0x000000077d33c538> (a org.apache.kafka.streams.processor.internals.StreamThread)
    at org.apache.kafka.streams.processor.internals.StreamThread.setStateWhenNotInPendingShutdown(StreamThread.java:176)
    - locked <0x000000077d33c538> (a org.apache.kafka.streams.processor.internals.StreamThread)
    at org.apache.kafka.streams.processor.internals.StreamThread.access$1600(StreamThread.java:70)
    at org.apache.kafka.streams.processor.internals.StreamThread$RebalanceListener.onPartitionsRevoked(StreamThread.java:1321)
    at org.apache.kafka.clients.consumer.internals.ConsumerCoordinator.onJoinPrepare(ConsumerCoordinator.java:406)
    at org.apache.kafka.clients.consumer.internals.AbstractCoordinator.joinGroupIfNeeded(AbstractCoordinator.java:349)
    at org.apache.kafka.clients.consumer.internals.AbstractCoordinator.ensureActiveGroup(AbstractCoordinator.java:310)
    at org.apache.kafka.clients.consumer.internals.ConsumerCoordinator.poll(ConsumerCoordinator.java:296)
    at org.apache.kafka.clients.consumer.KafkaConsumer.pollOnce(KafkaConsumer.java:1037)
    at org.apache.kafka.clients.consumer.KafkaConsumer.poll(KafkaConsumer.java:1002)
    at org.apache.kafka.streams.processor.internals.StreamThread.pollRequests(StreamThread.java:531)
    at org.apache.kafka.streams.processor.internals.StreamThread.runLoop(StreamThread.java:669)
    at org.apache.kafka.streams.processor.internals.StreamThread.run(StreamThread.java:326)
```

In a nutshell: `KafkaStreams` and `StreamThread` are both waiting for each other, because another intermittent `close` (e.g. from a test) comes along that also tries to lock on `KafkaStreams`:

```sh
"main" #1 prio=5 os_prio=0 tid=0x00007f66d000c800 nid=0x78bb in Object.wait() [0x00007f66d7a15000]
   java.lang.Thread.State: WAITING (on object monitor)
    at java.lang.Object.wait(Native Method)
    at java.lang.Thread.join(Thread.java:1249)
    - locked <0x000000077d45a590> (a java.lang.Thread)
    at org.apache.kafka.streams.KafkaStreams.close(KafkaStreams.java:503)
    - locked <0x000000077d335760> (a org.apache.kafka.streams.KafkaStreams)
    at org.apache.kafka.streams.KafkaStreams.close(KafkaStreams.java:447)
    at org.apache.kafka.streams.KafkaStreamsTest.testCannotStartOnceClosed(KafkaStreamsTest.java:115)
```

=> causing a deadlock.

Fixed this by softer locking on the state change, which guarantees atomic changes to the state but does not lock on the whole object (I at least could not find another method besides `setState` that would require more than atomically locked access).

Also qualified the state listeners with their outer class to make the code flow around this more readable (having two interfaces with the same interface and method names and then using them between their two outer classes is crazy hard to read imo :)).

Easy to reproduce yourself by running `org.apache.kafka.streams.KafkaStreamsTest` in a loop for a bit (save yourself some time by running 2-4 in parallel :)). Eventually it will lock on one of the tests (for me this takes less than 1 min with 4 parallel runs).

Author: Armin Braun <[email protected]>
Author: Armin <[email protected]>

Reviewers: Eno Thereska <[email protected]>, Damian Guy <[email protected]>, Ismael Juma <[email protected]>

Closes #2791 from original-brownbear/fix-streams-deadlock
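As a rough illustration of the "softer locking" idea described in this commit message, the sketch below guards only the state field with a dedicated lock object, so a thread that holds the object's own monitor while joining a worker no longer blocks that worker's state change. This is not the actual Kafka patch: `StateLockSketch`, `stateLock`, the `State` values, and the timeout are all hypothetical names chosen for the example, and the worker is started inside `close` only to make the ordering deterministic.

```java
// Hypothetical sketch, not the Kafka Streams implementation.
final class StateLockSketch {
    enum State { CREATED, RUNNING, PENDING_SHUTDOWN, NOT_RUNNING }

    // Dedicated lock: state changes are atomic without taking the object's monitor.
    private final Object stateLock = new Object();
    private State state = State.CREATED;

    /** Atomically changes the state; refuses to leave the terminal NOT_RUNNING state. */
    boolean setState(State newState) {
        synchronized (stateLock) {
            if (state == State.NOT_RUNNING) {
                return false;
            }
            state = newState;
            return true;
        }
    }

    State state() {
        synchronized (stateLock) {
            return state;
        }
    }

    /**
     * Holds the monitor of this object while waiting for the worker, similar to the
     * "main" thread in the trace above. Because setState() only needs stateLock,
     * the worker can still report its state change and terminate.
     * Returns true if the worker finished within the timeout.
     */
    synchronized boolean close(Thread worker, long timeoutMs) {
        setState(State.PENDING_SHUTDOWN);
        worker.start(); // started here only to keep the example deterministic
        try {
            worker.join(timeoutMs);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            return false;
        }
        boolean finished = !worker.isAlive();
        setState(State.NOT_RUNNING);
        return finished;
    }

    public static void main(String[] args) {
        StateLockSketch app = new StateLockSketch();
        // The worker plays the role of a stream thread reporting a state change.
        Thread worker = new Thread(() -> app.setState(State.RUNNING));
        boolean finished = app.close(worker, 5_000);
        System.out.println("worker finished: " + finished + ", final state: " + app.state());
    }
}
```

If `setState` were declared `synchronized` on the instance instead, the worker would wait for the monitor that `close` holds while `close` waits for the worker, which is the shape of the cycle shown in the thread dump.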

removed thread local buffer

Initial commit of ic-kafka-topics tool based on AdminClient

…ay (apache#1) [NOTE] This is a temporary measure to publish artifacts until CI is properly set up to do the job automatically. Users are not expected to run this themselves.

…to get end offsets and create topics (#9780)

The existing `Kafka*BackingStore` classes used by Connect all use `KafkaBasedLog`, which needs to frequently get the end offsets for the internal topic to know whether they are caught up. `KafkaBasedLog` uses its consumer to get the end offsets and to consume the records from the topic.

However, the Connect internal topics are often written very infrequently. This means that when the `KafkaBasedLog` used in the `Kafka*BackingStore` classes is already caught up and its last consumer poll is waiting for new records to appear, the call to the consumer to fetch end offsets will block until the consumer returns after a new record is written (unlikely) or the consumer’s `fetch.max.wait.ms` setting (defaults to 500ms) ends and the consumer returns no more records. In other words, the call to `KafkaBasedLog.readToEnd()` may block for some period of time even though it’s already caught up to the end.

Instead, we want `KafkaBasedLog.readToEnd()` to always return quickly when the log is already caught up. The best way to do this is to have the `KafkaBackingStore` use the admin client (rather than the consumer) to fetch end offsets for the internal topic. The consumer and the admin API both use the same `ListOffset` broker API, so the functionality is ultimately the same but we don't have to block for any ongoing consumer activity.

Each Connect distributed runtime includes three instances of the `Kafka*BackingStore` classes, which means we have three instances of `KafkaBasedLog`. We don't want three instances of the admin client, and should have all three instances of the `KafkaBasedLog` share a single admin client instance. In fact, each `Kafka*BackingStore` instance currently creates, uses and closes an admin client instance when it checks and initializes that store's internal topic. If we change `Kafka*BackingStores` to share one admin client instance, we can change that initialization logic to also reuse the supplied admin client instance.

The final challenge is that `KafkaBasedLog` has been used by projects outside of Apache Kafka. While `KafkaBasedLog` is definitely not in the public API for Connect, we can make these changes in ways that are backward compatible: create new constructors and deprecate the old constructors. Connect can be changed to only use the new constructors, and this will give time for any downstream users to make changes.

These changes are implemented as follows:

1. Add a `KafkaBasedLog` constructor to accept in its parameters a supplier from which it can get an admin instance, and deprecate the old constructor. We need a supplier rather than just passing an instance because `KafkaBasedLog` is instantiated before Connect starts up, so we need to create the admin instance only when needed. At the same time, we'll change the existing init function parameter from a no-arg function to accept an admin instance as an argument, allowing that init function to reuse the shared admin instance used by the `KafkaBasedLog`. Note: if no admin supplier is provided (in the deprecated constructor that is no longer used in AK), the consumer is still used to get latest offsets.
2. Add to the `Kafka*BackingStore` classes a new constructor with the same parameters but with an admin supplier, and deprecate the old constructor. When the classes instantiate their `KafkaBasedLog` instance, they pass the admin supplier and an init function that takes an admin instance.
3. Create a new `SharedTopicAdmin` that lazily creates the `TopicAdmin` (and underlying Admin client) when required, and closes the admin objects when the `SharedTopicAdmin` is closed.
4. Modify the existing `TopicAdmin` (used only in Connect) to encapsulate the logic of fetching end offsets using the admin client, simplifying the logic in `KafkaBasedLog` mentioned in item 1 above. Doing this also makes it easier to test that logic.
5. Change `ConnectDistributed` to create a `SharedTopicAdmin` instance (that is `AutoCloseable`) before creating the `Kafka*BackingStore` instances, passing the `SharedTopicAdmin` (which is an admin supplier) to all three `Kafka*BackingStore` objects, and finally always closing the `SharedTopicAdmin` upon termination. (Shutdown of the worker occurs outside of the `ConnectDistributed` code, so modify `DistributedHerder` to take in its constructor additional `AutoCloseable` objects that should be closed when the herder is closed, and then modify `ConnectDistributed` to pass the `SharedTopicAdmin` as one of those `AutoCloseable` instances.)
6. Change `MirrorMaker` similarly to `ConnectDistributed`.
7. Change existing unit tests to no longer use deprecated constructors.
8. Add unit tests for new functionality.

Author: Randall Hauch <[email protected]>
Reviewer: Konstantine Karantasis <[email protected]>
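The "lazily created, shared, closeable supplier" pattern from the steps above can be sketched in plain Java. This is only an illustration under stated assumptions and not Kafka's `SharedTopicAdmin`: `SharedAdminSketch` and `FakeAdmin` are hypothetical names, and `FakeAdmin` merely stands in for an admin client so the example runs without any Kafka dependency.

```java
import java.util.function.Supplier;

// Hypothetical stand-in for an admin client; counts how many instances were created.
final class FakeAdmin implements AutoCloseable {
    static int instances = 0;
    boolean closed = false;

    FakeAdmin() {
        instances++;
    }

    @Override
    public void close() {
        closed = true;
    }
}

// Hypothetical sketch: creates the resource on first use, hands the same
// instance to every caller, and closes it once when the holder is closed.
final class SharedAdminSketch<T extends AutoCloseable> implements Supplier<T>, AutoCloseable {
    private final Supplier<T> factory;
    private T instance;      // created lazily on the first get()
    private boolean closed;

    SharedAdminSketch(Supplier<T> factory) {
        this.factory = factory;
    }

    @Override
    public synchronized T get() {
        if (closed) {
            throw new IllegalStateException("shared resource is already closed");
        }
        if (instance == null) {
            instance = factory.get();
        }
        return instance;
    }

    @Override
    public synchronized void close() {
        if (closed) {
            return; // closing twice is a no-op
        }
        closed = true;
        if (instance != null) {
            try {
                instance.close();
            } catch (Exception e) {
                throw new RuntimeException("failed to close shared resource", e);
            }
        }
    }

    public static void main(String[] args) {
        // try-with-resources closes the shared holder on exit, as an owner would on termination
        try (SharedAdminSketch<FakeAdmin> shared = new SharedAdminSketch<>(FakeAdmin::new)) {
            // Three consumers of the supplier, analogous to three backing stores.
            FakeAdmin a = shared.get();
            FakeAdmin b = shared.get();
            FakeAdmin c = shared.get();
            System.out.println("same instance: " + (a == b && b == c)
                    + ", instances created: " + FakeAdmin.instances);
        }
    }
}
```

Passing the supplier rather than an instance mirrors the reasoning in item 1: nothing is created until a caller first needs it, and whoever owns the holder decides when the single underlying instance is closed.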

Revert "KAFKA-8964: Rename tag client-id for thread-level metrics and below (apache#7429)"

Clear topicId cache when removing topic partitions

This commit updates the build.gradle file to enable Harness Test Intelligence. It also adds a .ticonfig.yaml file which tells the ti service which files to ignore changes in.

This change introduces some basic clean up and refactoring for forthcoming commits related to the revised fetch code for the consumer threading refactor project. Reviewers: Christo Lolov <[email protected]>, Jun Rao <[email protected]>

… for aborted txns (#17676) (#17733) Reviewers: Jun Rao <[email protected]>

Switch to using scala 2.9.2

2198f48

ymatsuda added a commit to ymatsuda/kafka that referenced this pull request Aug 5, 2015

Merge pull request apache#1 from guozhangwang/master

4ab44fd

add kafka-clients and log4j to pom.xml

stumped2 closed this Feb 2, 2016

vahidhashemian mentioned this pull request Jun 3, 2016

KAFKA-3111: Fix ConsumerPerformance reporting to use time-based instead of message-based intervals #788

Closed

resetius referenced this pull request in resetius/kafka Jun 7, 2016

Merge pull request #1 from baidarov/build

61c658c

[LOGBROKER-726] Fix tests & debianization

vahidhashemian mentioned this pull request Jun 30, 2016

KAFKA-3854: Fix issues with new consumer's subsequent regex (pattern) subscriptions #1572

Closed

radai-rosenblatt mentioned this pull request Sep 29, 2016

KAFKA-4228 - make producer close on sender thread death, make consumer shutdown on failure to rebalance, and make MM die on any of the above. #1930

Closed

mfenniak pushed a commit to mfenniak/kafka that referenced this pull request Dec 4, 2016

test case apache#1 fixup

b35ff76

cmccabe mentioned this pull request Jan 10, 2017

Kafka 4507: The client should send older versions of requests to the broker if necessary #2264

Closed

huxihx mentioned this pull request Jan 25, 2017

kafka-4295: ConsoleConsumer does not delete the temporary group in zookeeper #2054

Closed

cmccabe mentioned this pull request Jul 20, 2017

KAFKA-5565. Add a broker metric specifying the number of consumer gro… #3506

Closed

huxihx mentioned this pull request Aug 21, 2017

KAFKA-5358: Consumer perf tool should count rebalance time. #3188

Closed

egor-ryashin referenced this pull request in egor-ryashin/kafka Mar 22, 2018

Merged in kafka-thread-buffer-fix (pull request #1)

1ac177c

removed thread local buffer

vvcephei mentioned this pull request Apr 9, 2018

KAFKA-6376: streams skip metrics #4812

Closed

3 tasks

koqizhao mentioned this pull request Apr 12, 2018

Bug Hotfix: consumer poll blocked & FetchResponse cast error #4858

Closed

cmccabe mentioned this pull request Jun 5, 2018

KIP-290 Prefixed ACLs #5117

Merged

7 tasks

tedyu mentioned this pull request Jun 26, 2018

KAFKA-7026: Sticky Assignor Partition Assignment Improvement (KIP-341) #5291

Merged

3 tasks

vvcephei mentioned this pull request Jul 12, 2018

MINOR: Remove 1 minute minimum segment interval #5323

Merged

3 tasks

vvcephei mentioned this pull request Aug 7, 2018

KAFKA-7240: -total metrics in Streams are incorrect #5467

Merged

3 tasks

huxihx mentioned this pull request Aug 24, 2018

KAFKA-7211: MM should handle TimeoutException in commitSync #5492

Closed

3 tasks

apovzner mentioned this pull request Sep 9, 2018

KAFKA-7044: Fix Fetcher.fetchOffsetsByTimes and NPE in describe consumer group #5627

Merged

3 tasks

viktorsomogyi mentioned this pull request Sep 12, 2018

KAFKA-1880: Add support for checking binary/source compatibility #5620

Closed

3 tasks

rhauch mentioned this pull request Oct 10, 2018

KAFKA-6891 fixed config constraints in KafkaConnect + refactoring #4995

Closed

krishkoneru pushed a commit to krishkoneru/kafka that referenced this pull request Oct 25, 2018

Merge pull request apache#1 from instaclustr/1.1_ic.1

1c98040

Initial commit of ic-kafka-topics tool based on AdminClient

stanislavkozlovski mentioned this pull request Oct 28, 2018

KAFKA-3932 - Consumer fails to consume in a round robin fashion #5838

Closed

cmccabe mentioned this pull request Feb 1, 2019

KAFKA-7828: Add ExternalCommandWorker to Trogdor #6219

Merged

abbccdda mentioned this pull request Jan 8, 2021

KAFKA-10674: Controller API version bond with forwardable APIs #9600

Merged

3 tasks

spena mentioned this pull request Apr 1, 2021

KAFKA-10847: Fix spurious results on left/outer stream-stream joins #10462

Merged

3 tasks

gardnervickers mentioned this pull request Jun 21, 2021

KAFKA-12964: Collect and rename snapshot files prior to async deletion. #10896

Merged

ableegoldman mentioned this pull request Jul 14, 2021

KAFKA-12984: make AbstractStickyAssignor resilient to invalid input, utilize generation in cooperative, and fix assignment bug #10985

Merged

jeffkbkim pushed a commit to jeffkbkim/kafka that referenced this pull request Oct 22, 2021

Merge pull request apache#1 from vvcephei/upstream_merge_Feb_6

d8a6812

Revert "KAFKA-8964: Rename tag client-id for thread-level metrics and below (apache#7429)"

divijvaidya referenced this pull request in divijvaidya/kafka Apr 6, 2022

Merge pull request #1 from JoelWee/tiered-storage

4c41a65

Clear topicId cache when removing topic partitions

divijvaidya mentioned this pull request Jun 10, 2022

KAFKA-13971: Fix atomicity violations caused by improper usage of ConcurrentHashMap - part2 #12281

Closed

forlack mentioned this pull request Sep 15, 2022

KAFKA-14207; KRaft Operations documentation #12642

Merged

3 tasks

splett2 mentioned this pull request May 11, 2023

KAFKA-14990: Dynamic producer ID expiration should be applied on a broker restart #13707

Merged

3 tasks

jolshan mentioned this pull request Jun 9, 2023

KAFKA-14462; [17/N] Add CoordinatorRuntime #13795

Merged

3 tasks

kirktrue mentioned this pull request Oct 4, 2023

KAFKA-14274 [6, 7]: Introduction of fetch request manager #14406

Merged

kirktrue mentioned this pull request Apr 25, 2024

KAFKA-15974: Enforce that event processing respects user-provided timeout #15640

Merged

3 tasks

kirktrue mentioned this pull request Jun 12, 2024

KAFKA-16637: AsyncKafkaConsumer removes offset fetch responses from cache too aggressively #16241

Closed

3 tasks

cmccabe mentioned this pull request Jun 26, 2024

KAFKA-17011: SupportedFeatures.MinVersion incorrectly blocks v0 (3.8) #16420

Closed

yazgoo mentioned this pull request Jun 28, 2024

[ KAFKA-17049 ] fix Incremental rebalances assign too many tasks for the same connector together #16486

Open

3 tasks

snehashisp mentioned this pull request Nov 10, 2024

KAFKA-18182: KIP-891 Connect Multiversion Support (Base PR with Plugin Loading Isolation Changes) #16984

Merged

3 tasks

kamalcph added a commit that referenced this pull request Nov 10, 2024

KAFKA-17801: RemoteLogManager may compute inaccurate upperBoundOffset…

6ea2ed3

… for aborted txns (#17676) (#17733) Reviewers: Jun Rao <[email protected]>
