Kafka Message Testing: How to Write Integration Tests

This article offers an approach to writing integration tests for Kafka-based applications that focuses on interaction specification, making tests more readable and easier to maintain. The proposed method not only enhances testing efficiency but also contributes to a better understanding of the integration processes within the application. The article builds on three ideas presented in relevant articles: writing tests with a clear separation of Arrange-Act-Assert stages, isolation in Kafka tests, and using tools to enhance test visibility. I recommend reviewing these before delving into the material of this article. Demonstration Scenario Let's take a Telegram bot that forwards requests to the OpenAI API and returns the result to the user as an example. If the request to OpenAI violates the system's security rules, the client will be notified. Additionally, a message will be sent to Kafka for the behavioral control system so that the manager can contact the user, explain that their request was too sensitive even for our bot, and ask them to review their preferences. The interaction contracts with services are described in a simplified manner to emphasize the core logic. Below is a sequence diagram demonstrating the application's architecture. I understand that the design may raise questions from a system architecture perspective, but please approach it with understanding—the main goal here is to demonstrate the approach to writing tests. Message Capture The main testing tool will be the message capture object — RecordCaptor. Its operation is quite similar to the outgoing request capture object — RequestCaptor, which can be read about in the article Ordering Chaos: Arranging HTTP Request Testing in Spring. Message capture will be performed through a standard Kafka consumer. The list of topics must be specified explicitly via a configuration parameter. @KafkaListener(id = "recordCaptor", topics = "#{'${test.record-captor.topics}'.split(',')}", groupId = "test") public void eventCaptorListener(ConsumerRecord record, @Headers Map boundedHeaders) { RecordSnapshot recordSnapshot = mapper.recordToSnapshot(record, boundedHeaders); recordCaptor.capture(recordSnapshot); } The RecordCaptor object accumulates information from captured messages. Using this approach requires adhering to isolation in Kafka tests. Waiting for offset commit confirmation before verifying test results should be done using the KafkaSupport#waitForPartitionOffsetCommit method. Test Example Below is the test code for the described scenario. def "User Message Processing with OpenAI"() { setup: KafkaSupport.waitForPartitionAssignment(applicationContext) // 1 and: // 2 def openaiRequestCaptor = restExpectation.openai.completions(withBadRequest().contentType(APPLICATION_JSON) .body("""{ "error": { "code": "content_policy_violation", "message": "Your request was rejected as a result of our safety system." } }""")) def telegramRequestCaptor = restExpectation.telegram.sendMessage(withSuccess('{}', APPLICATION_JSON)) when: mockMvc.perform(post("/telegram/webhook") // 3 .contentType(APPLICATION_JSON_VALUE) .content("""{ "message": { "from": { "id": 10000000 }, "chat": { "id": 20000000 }, "text": "Hello!" } }""".toString()) .accept(APPLICATION_JSON_VALUE)) .andExpect(status().isOk()) KafkaSupport.waitForPartitionOffsetCommit(applicationContext) // 4 then: openaiRequestCaptor.times == 1 // 5 JSONAssert.assertEquals("""{ "content": "Hello!" }""", openaiRequestCaptor.bodyString, false) and: telegramRequestCaptor.times == 1 JSONAssert.assertEquals("""{ "chatId": "20000000", "text": "Your request was rejected as a result of our safety system." }""", telegramRequestCaptor.bodyString, false) when: // 6 def message = recordCaptor.getRecords("topicC", "20000000").last then: message != null JSONAssert.assertEquals("""{ "webhookMessage": { "message": { "chat": { "id": "20000000" }, "text": "Hello!" } }, "error": { "code": "content_policy_violation", "message": "Your request was rejected as a result of our safety system." } }""", message.value as String, false) } Key steps: Wait for the partition assignment before starting the test scenario. Mock requests to OpenAI and Telegram. Execute the test scenario. Wait for offset confirmation. Verify requests to OpenAI and Telegram. Check the message in Kafka. Using JSONAssert.assertEquals ensures consistency in data representation across Kafka messages, logs, and tests. This simplifies testing by providing flexibility in comparison and accuracy in error diagnosis. The article provides an example with JSON message format; other formats are not covered, but the described approach does not impose format restrictions. How to Find Your Message in RecordCaptor Messages in RecordCaptor are organized by topic name and key. In the provided test, the key used is the Kafka message key. When sending, we explicitly specify it: sendMessage("topicC", chatId, ...); ... private void sendMessage(String topic, String key, Object payload) { Message message = MessageBuilder .withPayload(objectMapper.writeValueAsString(payload)) .setHeader(KafkaHeaders.TOPIC, topic) .setHeader(KafkaHeaders.KEY, key) <-- set key .build(); kafkaTemplate.send(message).get(); } To search by message key within a topic: when: def message = recordCaptor.getRecords("topicC", "20000000").last <-- use key If this option is not suitable, you need to describe your own indexes based on message parameters for constructing the search. An example can be seen in the tests PolicyViolationTestsCustomIndex.groovy. Connecting RecordCaptor The code for connecting RecordCaptor looks as follows: @TestConfiguration(proxyBeanMethods = false) public class RecordCaptorConfiguration { @Bean RecordCaptor recordCaptor() { return new RecordCaptor(); } @Bean RecordCaptorConsumer recordCaptorConsumer(RecordCaptor recordCaptor) { return new RecordCaptorConsumer(recordCaptor, new RecordSnapshotMapper()); } } OffsetSnapshotFrame Experience has shown that working with Kafka-based applications requires tools to facilitate understanding the state of consumers and message consumption status. For this task, you can compare topic offsets and consumer groups in the offset confirmation waiting operation and log discrepancies, as illustrated in the image: The code for OffsetComparisonFrame is available for review. Conclusion Testing messages in Kafka using the proposed approach not only simplifies test writing but also makes it more structured and understandable. Utilizing tools like RecordCaptor, as well as adhering to isolation principles and clear separation of test stages, ensures high accuracy and efficiency. Link to the project repository with test demonstrations — kafka-test-support. Thank you for reading the article, and good luck in your efforts to write effective and clear tests! This article offers an approach to writing integration tests for Kafka-based applications that focuses on interaction specification, making tests more readable and easier to maintain. The proposed method not only enhances testing efficiency but also contributes to a better understanding of the integration processes within the application. The article builds on three ideas presented in relevant articles: writing tests with a clear separation of Arrange-Act-Assert stages , isolation in Kafka tests , and using tools to enhance test visibility . I recommend reviewing these before delving into the material of this article. writing tests with a clear separation of Arrange-Act-Assert stages isolation in Kafka tests using tools to enhance test visibility Demonstration Scenario Let's take a Telegram bot that forwards requests to the OpenAI API and returns the result to the user as an example. If the request to OpenAI violates the system's security rules, the client will be notified. Additionally, a message will be sent to Kafka for the behavioral control system so that the manager can contact the user, explain that their request was too sensitive even for our bot, and ask them to review their preferences. The interaction contracts with services are described in a simplified manner to emphasize the core logic. Below is a sequence diagram demonstrating the application's architecture. I understand that the design may raise questions from a system architecture perspective, but please approach it with understanding—the main goal here is to demonstrate the approach to writing tests. Message Capture The main testing tool will be the message capture object — RecordCaptor . Its operation is quite similar to the outgoing request capture object — RequestCaptor , which can be read about in the article Ordering Chaos: Arranging HTTP Request Testing in Spring . RecordCaptor RequestCaptor Ordering Chaos: Arranging HTTP Request Testing in Spring Message capture will be performed through a standard Kafka consumer. The list of topics must be specified explicitly via a configuration parameter. @KafkaListener(id = "recordCaptor", topics = "#{'${test.record-captor.topics}'.split(',')}", groupId = "test") public void eventCaptorListener(ConsumerRecord record, @Headers Map boundedHeaders) { RecordSnapshot recordSnapshot = mapper.recordToSnapshot(record, boundedHeaders); recordCaptor.capture(recordSnapshot); } @KafkaListener(id = "recordCaptor", topics = "#{'${test.record-captor.topics}'.split(',')}", groupId = "test") public void eventCaptorListener(ConsumerRecord record, @Headers Map boundedHeaders) { RecordSnapshot recordSnapshot = mapper.recordToSnapshot(record, boundedHeaders); recordCaptor.capture(recordSnapshot); } The RecordCaptor object accumulates information from captured messages. RecordCaptor Using this approach requires adhering to isolation in Kafka tests . Waiting for offset commit confirmation before verifying test results should be done using the KafkaSupport#waitForPartitionOffsetCommit method. isolation in Kafka tests KafkaSupport#waitForPartitionOffsetCommit Test Example Below is the test code for the described scenario. def "User Message Processing with OpenAI"() { setup: KafkaSupport.waitForPartitionAssignment(applicationContext) // 1 and: // 2 def openaiRequestCaptor = restExpectation.openai.completions(withBadRequest().contentType(APPLICATION_JSON) .body("""{ "error": { "code": "content_policy_violation", "message": "Your request was rejected as a result of our safety system." } }""")) def telegramRequestCaptor = restExpectation.telegram.sendMessage(withSuccess('{}', APPLICATION_JSON)) when: mockMvc.perform(post("/telegram/webhook") // 3 .contentType(APPLICATION_JSON_VALUE) .content("""{ "message": { "from": { "id": 10000000 }, "chat": { "id": 20000000 }, "text": "Hello!" } }""".toString()) .accept(APPLICATION_JSON_VALUE)) .andExpect(status().isOk()) KafkaSupport.waitForPartitionOffsetCommit(applicationContext) // 4 then: openaiRequestCaptor.times == 1 // 5 JSONAssert.assertEquals("""{ "content": "Hello!" }""", openaiRequestCaptor.bodyString, false) and: telegramRequestCaptor.times == 1 JSONAssert.assertEquals("""{ "chatId": "20000000", "text": "Your request was rejected as a result of our safety system." }""", telegramRequestCaptor.bodyString, false) when: // 6 def message = recordCaptor.getRecords("topicC", "20000000").last then: message != null JSONAssert.assertEquals("""{ "webhookMessage": { "message": { "chat": { "id": "20000000" }, "text": "Hello!" } }, "error": { "code": "content_policy_violation", "message": "Your request was rejected as a result of our safety system." } }""", message.value as String, false) } def "User Message Processing with OpenAI"() { setup: KafkaSupport.waitForPartitionAssignment(applicationContext) // 1 and: // 2 def openaiRequestCaptor = restExpectation.openai.completions(withBadRequest().contentType(APPLICATION_JSON) .body("""{ "error": { "code": "content_policy_violation", "message": "Your request was rejected as a result of our safety system." } }""")) def telegramRequestCaptor = restExpectation.telegram.sendMessage(withSuccess('{}', APPLICATION_JSON)) when: mockMvc.perform(post("/telegram/webhook") // 3 .contentType(APPLICATION_JSON_VALUE) .content("""{ "message": { "from": { "id": 10000000 }, "chat": { "id": 20000000 }, "text": "Hello!" } }""".toString()) .accept(APPLICATION_JSON_VALUE)) .andExpect(status().isOk()) KafkaSupport.waitForPartitionOffsetCommit(applicationContext) // 4 then: openaiRequestCaptor.times == 1 // 5 JSONAssert.assertEquals("""{ "content": "Hello!" }""", openaiRequestCaptor.bodyString, false) and: telegramRequestCaptor.times == 1 JSONAssert.assertEquals("""{ "chatId": "20000000", "text": "Your request was rejected as a result of our safety system." }""", telegramRequestCaptor.bodyString, false) when: // 6 def message = recordCaptor.getRecords("topicC", "20000000").last then: message != null JSONAssert.assertEquals("""{ "webhookMessage": { "message": { "chat": { "id": "20000000" }, "text": "Hello!" } }, "error": { "code": "content_policy_violation", "message": "Your request was rejected as a result of our safety system." } }""", message.value as String, false) } Key steps: Wait for the partition assignment before starting the test scenario. Mock requests to OpenAI and Telegram. Execute the test scenario. Wait for offset confirmation. Verify requests to OpenAI and Telegram. Check the message in Kafka. Wait for the partition assignment before starting the test scenario. Mock requests to OpenAI and Telegram. Execute the test scenario. Wait for offset confirmation. Verify requests to OpenAI and Telegram. Check the message in Kafka. Using JSONAssert.assertEquals ensures consistency in data representation across Kafka messages, logs, and tests. This simplifies testing by providing flexibility in comparison and accuracy in error diagnosis. JSONAssert.assertEquals The article provides an example with JSON message format; other formats are not covered, but the described approach does not impose format restrictions. How to Find Your Message in RecordCaptor RecordCaptor Messages in RecordCaptor are organized by topic name and key. In the provided test, the key used is the Kafka message key. When sending, we explicitly specify it: RecordCaptor sendMessage("topicC", chatId, ...); ... private void sendMessage(String topic, String key, Object payload) { Message message = MessageBuilder .withPayload(objectMapper.writeValueAsString(payload)) .setHeader(KafkaHeaders.TOPIC, topic) .setHeader(KafkaHeaders.KEY, key) <-- set key .build(); kafkaTemplate.send(message).get(); } sendMessage("topicC", chatId, ...); ... private void sendMessage(String topic, String key, Object payload) { Message message = MessageBuilder .withPayload(objectMapper.writeValueAsString(payload)) .setHeader(KafkaHeaders.TOPIC, topic) .setHeader(KafkaHeaders.KEY, key) <-- set key .build(); kafkaTemplate.send(message).get(); } To search by message key within a topic: when: def message = recordCaptor.getRecords("topicC", "20000000").last <-- use key when: def message = recordCaptor.getRecords("topicC", "20000000").last <-- use key If this option is not suitable, you need to describe your own indexes based on message parameters for constructing the search. An example can be seen in the tests PolicyViolationTestsCustomIndex.groovy . PolicyViolationTestsCustomIndex.groovy Connecting RecordCaptor The code for connecting RecordCaptor looks as follows: RecordCaptor @TestConfiguration(proxyBeanMethods = false) public class RecordCaptorConfiguration { @Bean RecordCaptor recordCaptor() { return new RecordCaptor(); } @Bean RecordCaptorConsumer recordCaptorConsumer(RecordCaptor recordCaptor) { return new RecordCaptorConsumer(recordCaptor, new RecordSnapshotMapper()); } } @TestConfiguration(proxyBeanMethods = false) public class RecordCaptorConfiguration { @Bean RecordCaptor recordCaptor() { return new RecordCaptor(); } @Bean RecordCaptorConsumer recordCaptorConsumer(RecordCaptor recordCaptor) { return new RecordCaptorConsumer(recordCaptor, new RecordSnapshotMapper()); } } OffsetSnapshotFrame Experience has shown that working with Kafka-based applications requires tools to facilitate understanding the state of consumers and message consumption status. For this task, you can compare topic offsets and consumer groups in the offset confirmation waiting operation and log discrepancies, as illustrated in the image: The code for OffsetComparisonFrame is available for review. OffsetComparisonFrame Conclusion Testing messages in Kafka using the proposed approach not only simplifies test writing but also makes it more structured and understandable. Utilizing tools like RecordCaptor , as well as adhering to isolation principles and clear separation of test stages, ensures high accuracy and efficiency. RecordCaptor Link to the project repository with test demonstrations — kafka-test-support . kafka-test-support Thank you for reading the article, and good luck in your efforts to write effective and clear tests!