How to split strings into different Kafka topics based on some conditions

Question

I am trying to split strings into different Kafka topics based on conditions. Here is the topology: split the string into words, then match every word against the conditions (here, a set of Good words and a set of Bad words). If at least one word from the Bad words set is found in the string, it will be sent to the Bad-string topic othe…

Accepted Answer

The if statement is always false because the .toString() of a KStream object returns the stream's metadata, which is never empty.

And if you want the full original string routed to one of the two topics, you should not flatMap at all.

That being said, it seems like you want

    var feedbackStreams = input.branch(hasGoodWords(), hasBadWords());
    feedbackStreams[0].to("good-string");
    feedbackStreams[1].to("bad-string");

where the two functions receive the full input message and compare it against the sets, rather than being given individual words.

Although, I think you only need one function: it sends every message that has at least one good word and no bad words to good-string, and all other messages (neither good nor bad words, both good and bad words, or only bad words) go to the bad-string topic, e.g.

        var feedbackStreams = input.branch(this::hasOnlyGoodWords, (k, v) -> true);
        feedbackStreams[0].to("good-string");
        feedbackStreams[1].to("bad-string");
        return input;
    }

    private boolean hasOnlyGoodWords(Object key, String value) {
        // value is already a String here, so clean it directly:
        // drop everything except letters and spaces, then lowercase.
        String cleaned = value.replaceAll("[^a-zA-Z ]", "").toLowerCase();
        // Needs java.util.Arrays, java.util.Set and java.util.stream.Collectors.
        Set<String> uniqueWords = Arrays.stream(cleaned.split("\\s+")).collect(Collectors.toSet());
        // Any bad word disqualifies the message.
        for (String s : BAD_WORDS) {
            if (uniqueWords.contains(s)) return false;
        }
        // Otherwise at least one good word must remain.
        uniqueWords.retainAll(GOOD_WORDS);
        return uniqueWords.size() > 0;
    }
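If it helps to check the routing rule before wiring it into a topology, here is a minimal standalone sketch of the same predicate with Kafka left out entirely. The class name FeedbackRouter, the helper topicFor, and the contents of the two word sets are all made up for illustration; substitute your own sets.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.Set;

// Standalone sketch of the routing rule (no Kafka dependency).
// FeedbackRouter and topicFor are hypothetical names, not part of any library.
final class FeedbackRouter {

    // Hypothetical word lists; replace with the sets from your own code.
    static final Set<String> GOOD_WORDS = Set.of("good", "great", "happy");
    static final Set<String> BAD_WORDS = Set.of("bad", "awful", "sad");

    // True only if the message has at least one good word and no bad words.
    static boolean hasOnlyGoodWords(String value) {
        // Keep letters and spaces only, then lowercase for matching.
        String cleaned = value.replaceAll("[^a-zA-Z ]", "").toLowerCase();
        // A mutable set, because retainAll modifies it below.
        Set<String> words = new HashSet<>(Arrays.asList(cleaned.trim().split("\\s+")));
        for (String bad : BAD_WORDS) {
            if (words.contains(bad)) {
                return false;
            }
        }
        words.retainAll(GOOD_WORDS);
        return !words.isEmpty();
    }

    // Maps a message to the topic name that the branch predicates would pick.
    static String topicFor(String value) {
        return hasOnlyGoodWords(value) ? "good-string" : "bad-string";
    }
}
```

With these example sets, topicFor("This is good") gives "good-string", while a message that mixes good and bad words, or contains neither, ends up in "bad-string", which matches the catch-all (k, v) -> true second branch.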


Answer