Sample messages:
>{"sensor":"temp1", "value":12.0, "timestamp":19200230}
>{"sensor":"temp1", "value":12.0, "timestamp":19200230}
>{"sensor":"temp1", "value":12.0, "timestamp":19200230}
>{"sensor":"temp2", "value":5, "timestamp":19200230}
>{"sensor":"temp2", "value":5, "timestamp":19200230}
I am trying to build a stream aggregation using the keyBy method.
DataStream<Message> messageSumStream = messageStream.keyBy("sensor").timeWindowAll(Time.minutes(5)).sum("value");
I expected:
{"sensor": "temp1", "value": 36.000000, "timestamp":19200230 }
{"sensor": "temp2", "value": 10.000000, "timestamp":19200230 }
But I got:
{"sensor": "temp1", "value": 46.000000, "timestamp":19200230 }
What am I missing here?
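You are calling timeWindowAll, which is defined on DataStream, rather than timeWindow, which is defined on KeyedStream (the type that keyBy returns). timeWindowAll puts every element into a single non-keyed window, so the keyBy is effectively ignored and all five values are summed together: 3 × 12.0 + 2 × 5 = 46.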
Try this:
DataStream<Message> messageSumStream = messageStream.keyBy("sensor").timeWindow(Time.minutes(5)).sum("value");
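To make the difference concrete, here is a minimal plain-Java sketch (not Flink code) that runs both kinds of aggregation over the five sample messages, treating them as the contents of one 5-minute window. The class WindowSumSketch, its nested Message type, and the helpers sumAll, sumPerKey, and sample are hypothetical names made up for this illustration. They only imitate what a non-keyed window sum and a keyed window sum compute, under the assumption that all five messages land in the same window.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;
import java.util.stream.Collectors;

public class WindowSumSketch {

    // Hypothetical stand-in for the Message type used in the question.
    public static final class Message {
        private final String sensor;
        private final double value;
        private final long timestamp;

        public Message(String sensor, double value, long timestamp) {
            this.sensor = sensor;
            this.value = value;
            this.timestamp = timestamp;
        }

        public String getSensor() { return sensor; }
        public double getValue() { return value; }
        public long getTimestamp() { return timestamp; }
    }

    // Imitates a non-keyed window sum (the timeWindowAll case):
    // every element of the window goes into one total, keys are ignored.
    public static double sumAll(List<Message> window) {
        return window.stream().mapToDouble(Message::getValue).sum();
    }

    // Imitates a keyed window sum (the keyBy + timeWindow case):
    // one total per sensor. A TreeMap keeps the keys in sorted order.
    public static Map<String, Double> sumPerKey(List<Message> window) {
        return window.stream().collect(Collectors.groupingBy(
                Message::getSensor,
                TreeMap::new,
                Collectors.summingDouble(Message::getValue)));
    }

    // The five sample messages from the question.
    public static List<Message> sample() {
        return Arrays.asList(
                new Message("temp1", 12.0, 19200230L),
                new Message("temp1", 12.0, 19200230L),
                new Message("temp1", 12.0, 19200230L),
                new Message("temp2", 5, 19200230L),
                new Message("temp2", 5, 19200230L));
    }

    public static void main(String[] args) {
        List<Message> window = sample();
        System.out.println(sumAll(window));    // prints 46.0
        System.out.println(sumPerKey(window)); // prints {temp1=36.0, temp2=10.0}
    }
}
```

The single total of 46 matches the output you got, and the per-sensor totals of 36 and 10 match what you expected.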