Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Consume Kafka Messages from Fetcher and Index Them into OpenSearch #47

Open
olga-casian opened this issue Jun 21, 2024 · 0 comments
Open
Labels
Data-Processor Issues related to the Data-Processor service

Comments

@olga-casian
Copy link
Collaborator

olga-casian commented Jun 21, 2024

Update the Data Processor service to consume messages from Kafka produced by the Fetcher service and index them into OpenSearch. This will enable real-time indexing and searching of XKCD comic data.

Tasks:

  1. Set Up Kafka Consumer in Data Processor:

    • Implement a Kafka consumer in the Data Processor service.
    • Configure the consumer to subscribe to the relevant Kafka topic.
  2. Process Kafka Messages:

    • Parse and process the incoming messages from Kafka.
    • Ensure data integrity and handle any potential errors.
  3. Index Data into OpenSearch:

    • Implement functionality to index the processed data into OpenSearch.
    • Ensure the data is correctly formatted and searchable in OpenSearch.
  4. Testing:

    • Write unit and integration tests to validate Kafka message consumption and OpenSearch indexing.
    • Test edge cases such as message failures and reprocessing.
  5. Update Documentation:

    • Document the Kafka consumer setup and configuration in the Data Processor README.
    • Include instructions on monitoring and troubleshooting the Kafka consumer and OpenSearch indexing process.

Acceptance Criteria:

  • Kafka consumer is implemented and configured in the Data Processor service.
  • Messages from Kafka are processed and indexed into OpenSearch.
  • Tests pass successfully, validating the end-to-end flow from Kafka to OpenSearch.
  • Documentation is updated with Kafka consumer and OpenSearch indexing instructions.
@olga-casian olga-casian added the Data-Processor Issues related to the Data-Processor service label Jun 21, 2024
@yonatankarp yonatankarp changed the title Consume Kafka messages from Fetcher and index them into ES Consume Kafka Messages from Fetcher and Index Them into OpenSearch Jun 22, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Data-Processor Issues related to the Data-Processor service
Projects
None yet
Development

No branches or pull requests

1 participant