现代流式计算的基石:Google DataFlow
0. 引言 今天这篇继续讲流式计算。毫无疑问,Apache Flink 和 Apache Spark (Structured Streaming)现在是实时流计算领域的两个最火热的话题了。那么为什么要介绍 Google Dataflow 呢?Streaming Systems 这本书在分析 Flink 的火热原因的时候总结了下面两点: “There were two main reasons for Flink’s rise to prominence: Its rapid adoption of the Dataflow/Beam programming model, which put it in the position of being the most semantically capable fully open sourc