Further flink kafka optimizations #408

wagmarcel · 2023-07-26T18:21:18Z

No description provided.

… checkpoint settings to leave it to local job configuration
- Upgraded from Flink 1.16.1 to 1.16.2 testGateway
- RocksDB metrics are inserted into flink-config to allow better debugging and optimization
- Removed local checkpoint configurations. This will be done by the jobs in the future
- Added debug mode to the Gateway: do not delete the deployment directories in /tmp when DEBUG is set to 'true'
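The debug-mode behaviour described in the last item can be pictured roughly as follows. This is only a sketch of the idea, not the Gateway's actual code: the function name `cleanup_deployment_dir` is hypothetical, and the only detail taken from the commit message is that directories are kept when DEBUG is set to 'true'.

```python
import os
import shutil


def cleanup_deployment_dir(path: str) -> bool:
    """Delete a deployment directory unless DEBUG is set to 'true'.

    Returns True if the directory was removed, False if it was kept.
    (Hypothetical helper; the real Gateway may be structured differently.)
    """
    if os.environ.get("DEBUG") == "true":
        # Debug mode: keep the directory so it can be inspected afterwards.
        return False
    shutil.rmtree(path, ignore_errors=True)
    return True
```

With `DEBUG=true` in the environment the call leaves the directory in place; with any other value it is removed.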

…ms, improve UDF handling and Flink job configuration
- A problem with the Strimzi TopicOperator: it also creates topic CRDs from automatically created topics, which can conflict with helmfile objects. To avoid that, flink-deploy must disable the topicOperator, remove all Kafka objects, deploy the new Flink version, and enable the topicOperator afterwards.
- SHACL.ttl now contains the statetime UDF for the OEE time calculation
- Create_udf now uses a regexp to parse the filename, which allows filenames with underscores
- SQLStatementset now contains the RocksDB configuration and disables the upsert Sink Materializer (as it creates high load)
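To illustrate why a regexp helps with underscores in UDF filenames, here is a minimal sketch. The naming scheme `<name>_v<version>.py` and the function `parse_udf_filename` are assumptions made for the example; the commit message does not state the real pattern used by Create_udf.

```python
import re

# Assumed naming scheme: <udf_name>_v<version>.py, e.g. "state_time_v1.py".
# The greedy name group lets the UDF name itself contain underscores.
UDF_FILE_RE = re.compile(r"^(?P<name>.+)_v(?P<version>\d+)\.py$")


def parse_udf_filename(filename: str) -> tuple:
    """Split a UDF filename into (name, version) under the assumed scheme."""
    match = UDF_FILE_RE.match(filename)
    if match is None:
        raise ValueError(f"unexpected UDF filename: {filename}")
    return match.group("name"), int(match.group("version"))
```

A plain `filename.split("_")` would cut a name such as `state_time` into two parts, whereas the anchored pattern only treats the final `_v<digits>` as the version suffix.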

- Configure Kafka in two profiles, "default" and "production"
- Restrict retention time to 8h in production and to 1h in default
- Production allows compression with gzip; default is without compression
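As a rough illustration of the two profiles, the retention and compression values above could map to Kafka topic settings as sketched below. `retention.ms` and `compression.type` are standard Kafka topic configuration keys; the dictionary layout, the function `topic_config`, and the choice of `uncompressed` for the default profile are assumptions, not taken from the PR.

```python
# Retention and gzip values come from the commit message; everything else
# (layout, names, "uncompressed" for the default profile) is assumed.
PROFILES = {
    "default": {"retention_hours": 1, "compression": "uncompressed"},
    "production": {"retention_hours": 8, "compression": "gzip"},
}


def topic_config(profile: str) -> dict:
    """Return Kafka topic settings for the given profile name."""
    settings = PROFILES[profile]
    return {
        # Kafka expects the retention time in milliseconds.
        "retention.ms": settings["retention_hours"] * 60 * 60 * 1000,
        "compression.type": settings["compression"],
    }
```

Under these assumptions, production resolves to a retention of 28,800,000 ms and the default profile to 3,600,000 ms.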

- Problem with Flink: left joins sometimes give early results which are retracted a few milliseconds later. This creates short-lived alerts on Alerta, which sometimes floods the user with unnecessary alerts and causes confusion.
- This is solved by adding Flink late-arrival filtering: alerts which are 200ms late are grouped with the previous window.
- On the other hand, this makes the E2E test case more complicated.
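The effect of the late-arrival filtering can be sketched outside Flink. The snippet below is not the Flink SQL used in this PR; it is a plain-Python illustration under the assumption that an update for the same alert arriving within 200 ms of the start of a group is merged into that group, so that only the group's final state is visible. The event layout and the function `collapse_late_updates` are hypothetical.

```python
from typing import Dict, List, Tuple

Event = Tuple[int, str, str]  # (timestamp_ms, alert_key, state)


def collapse_late_updates(events: List[Event], lateness_ms: int = 200) -> List[Event]:
    """Merge updates that arrive within lateness_ms of a group's first event.

    Only the final state of each group survives, so an early result that is
    retracted a few milliseconds later never shows up as a separate alert.
    """
    result: List[Event] = []
    last_index: Dict[str, int] = {}  # alert_key -> position of its latest group
    for ts, key, state in sorted(events):
        idx = last_index.get(key)
        if idx is not None and ts - result[idx][0] <= lateness_ms:
            # Late update: keep the group's start time, replace its state.
            result[idx] = (result[idx][0], key, state)
        else:
            # Start a new group for this alert key.
            last_index[key] = len(result)
            result.append((ts, key, state))
    return result
```

For example, an "alert" at 1000 ms followed by an "ok" at 1050 ms collapses into a single "ok" entry, while a new "alert" at 2000 ms starts its own group. This delay is also why E2E tests have to wait for the grouping interval before checking results.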

wagmarcel force-pushed the Further-Flink-Kafka-optimizations branch 3 times, most recently from 9a79cf9 to 74d6798 Compare July 29, 2023 10:57

wagmarcel added 4 commits July 29, 2023 12:57

Manage compression and retention by Kafka Strimzi CRDs

7311e62

wagmarcel force-pushed the Further-Flink-Kafka-optimizations branch from 74d6798 to 367152e Compare July 29, 2023 10:57

wagmarcel marked this pull request as ready for review July 30, 2023 11:09

wagmarcel requested a review from oguzcankirmemis July 30, 2023 11:09

oguzcankirmemis approved these changes Jul 31, 2023

oguzcankirmemis merged commit a30e2a9 into IndustryFusion:main Jul 31, 2023
2 checks passed
