exactly-once

Can I rely on a in-memory Java collection in Kafka stream for buffering events by fine tuning punctuate and commit interval?

故事扮演 提交于 2020-11-29 11:12:35
问题 A custom processor which buffers events in a simple java.util.List in process() - this buffer is not a state store. Every 30 seconds WALL_CLOCK_TIME, punctuate() sorts this list and flushes to the sink. Assume only single partition source and sink. EOS processing guarantee is required. I know that at any given time either process() gets executed or punctuate() gets executed. I am concerned about this buffer not being backed by changelog topic. Ideally I believe this should have been a state

Can I rely on a in-memory Java collection in Kafka stream for buffering events by fine tuning punctuate and commit interval?

风格不统一 提交于 2020-11-29 11:11:46
问题 A custom processor which buffers events in a simple java.util.List in process() - this buffer is not a state store. Every 30 seconds WALL_CLOCK_TIME, punctuate() sorts this list and flushes to the sink. Assume only single partition source and sink. EOS processing guarantee is required. I know that at any given time either process() gets executed or punctuate() gets executed. I am concerned about this buffer not being backed by changelog topic. Ideally I believe this should have been a state