← All writing

Tag

Data Engineering

1 post tagged Data Engineering.

Pregel: What the Large-Scale Graph Processing Paper Actually Says

PageRank in MapReduce is O(iterations × full dataset reloads). Pregel fixes this by keeping the graph in memory across iterations and replacing disk I/O with message passing. The 'think like a vertex' model is the insight — BSP is the implementation.