http://static.googleusercontent.com/media/research.google.com/en//pubs/archive/43864.pdf
Dataflow model: In this section, we will define the formal model for the system and explain why its semantics are general enough to subsume the standard batch, micro-batch, and streaming models, as well as the hybrid streaming and batch semantics of the Lambda Architecture.
Unbounded/bounded vs. streaming/batch: When describing infinite/finite data sets, we prefer the terms unbounded/bounded over streaming/batch, because the latter terms carry with them an implication of the use of a specific type of execution engine. In reality, unbounded datasets have been processed using repeated runs of batch systems since their conception, and well-designed streaming systems are perfectly capable of processing bounded data. From the perspective of the model, the distinction of streaming or batch is largely irrelevant, and we thus reserve those terms exclusively for describing runtime execution engines.
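
To make the bounded/unbounded distinction concrete, here is a minimal Python sketch. It is not taken from the paper and does not use any Dataflow API; `running_sum` and the sample inputs are hypothetical names chosen for illustration. It shows the same processing logic applied to a finite list and to an infinite generator, which suggests that boundedness is a property of the data rather than of the code or engine that processes it.

```python
from itertools import count, islice
from typing import Iterable, Iterator


def running_sum(events: Iterable[int]) -> Iterator[int]:
    """Yield the running total after each element.

    Hypothetical helper: it emits a result per element and never
    needs to see the end of the input, so it works on any iterable.
    """
    total = 0
    for value in events:
        total += value
        yield total


# Bounded data set: a finite list, so the output can be fully materialized.
bounded = [1, 2, 3, 4]
bounded_result = list(running_sum(bounded))

# Unbounded data set: an infinite generator (1, 2, 3, ...).
# The full output never exists, so we only take a prefix of it.
unbounded = count(start=1)
unbounded_prefix = list(islice(running_sum(unbounded), 4))

print(bounded_result)
print(unbounded_prefix)
```

Both calls run the identical function. The only difference is that the unbounded input never finishes, so the caller decides how much of the output to consume.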