I\'m trying to use kafka streams to perform a windowed aggregation and emit the result only after a certain session window is closed. To achieve this I\'m using the suppress
There two option to solve that issue:
use TimeWindowedKStream::aggregate(final Initializer<VR> initializer, final Aggregator<? super K, ? super V, VR> aggregator, final Materialized<K, VR, WindowStore<Bytes, byte[]>> materialized);
use KStream::groupByKey(final Grouped<K, V> grouped)
In you case it will be:
Ad 1:
input
.groupByKey()
.windowedBy(SessionWindows.with(Duration.ofSeconds(30)))
.aggregate(() -> Long.valueOf(0), (key, v1, v2) -> v1 + v2, (key, agg1, agg2) -> agg1 + agg2, Materialized.with(Serdes.String(), Serdes.Long()))
.suppress(Suppressed.untilWindowCloses(Suppressed.BufferConfig.unbounded()))
.toStream()
.map((k, v) -> new KeyValue<>(k.key(), v))
.to("output");
Ad 2:
input
.groupByKey(Grouped.with(Serdes.String(), Serdes.Long())
.windowedBy(SessionWindows.with(Duration.ofSeconds(30)))
.aggregate(() -> Long.valueOf(0), (key, v1, v2) -> v1 + v2, (key, agg1, agg2) -> agg1 + agg2)
.suppress(Suppressed.untilWindowCloses(Suppressed.BufferConfig.unbounded()))
.toStream()
.map((k, v) -> new KeyValue<>(k.key(), v))
.to("output");
To make this work with TopologyTestDriver, you would need to advance the clock time, which it seems has no effect on the Suppress step. A workaround is to allow your test to override the Suppress config with a setting like this:
Suppressed.untilTimeLimit(Duration.ZERO, BufferConfig.unbounded())