Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Further flink kafka optimizations #408

Conversation

wagmarcel
Copy link
Member

No description provided.

@wagmarcel wagmarcel force-pushed the Further-Flink-Kafka-optimizations branch 3 times, most recently from 9a79cf9 to 74d6798 Compare July 29, 2023 10:57
… checkpoint settings to leave it to local job configuration

- Upgraded from Flink 1.16.1 to 1.16.2 testGateway
- Rocksdb metrics are inserted into flink-config to allow better debugging and optimization
- Removed local Checkpoint configurations. This will be done by the jobs in future
- Add debug mode to to Gateway: Do not delete the deployment directories in /tmp when DEBUG is set to 'true'
…ms, improve udf handling and flink job configuration

- A problem with the Strimzi TopicOperator: It creates topic CRDs also from automatically created topics
  which could conflict with helmfile objects. To avoid that the flini-deploy must disable topicOperator
  remove all kafka objects, deploy the new flink-version and enable the topicOperator afterwards.
- SHACL.ttl now contains the statetime udf for the OEE time calculation
- Create_udf is now using regexp to parse filename to allow filenames with underscore
- SQLStatementset now contains the rocksdb configuration and disables the upsert Sink
  Materializer (as it creates high load)
- Configure Kafka in two profiles "default" and production
- Restrict Production retention time to 8h and default to 1h
- Production allows compression with gzip, default is without compression
- Problem with Flink: left joins sometimes give early results wich are retracted few milliseconds later. This creates
  short term alerts on Alerta which creates sometimes a lot of unneccessary alerts flood and confuses the user.
- This is solved by adding Flink later arrival filtering: Alerts which are 200ms late are grouped with the previous window.
- This creates on the other hand a more complicated E2E testing case.
@wagmarcel wagmarcel force-pushed the Further-Flink-Kafka-optimizations branch from 74d6798 to 367152e Compare July 29, 2023 10:57
@wagmarcel wagmarcel marked this pull request as ready for review July 30, 2023 11:09
@oguzcankirmemis oguzcankirmemis merged commit a30e2a9 into IndustryFusion:main Jul 31, 2023
2 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants