議程摘要:

議程:Scalable Data Processing: Bulk Synchronous Parallel

HadoopMapReduce[1] is a popular open source framework inspired by functional programming 's map and reduce functions, saving developers lots of works by covering many underlying complicated tasks. However, not all tasks fit into MapReduce's scenario, graph related computation task (e.g. social network analysis) is one such example. Google therefore developed their in-house product, Pregel[2], based on Bulk Synchronous Parallel[3] - a bridge model suitable for performing iterative algorithms, performing large scale graph processing.

Outline:
1. What is Bulk Synchronous Parallel?
2. Apache Hama
3. Comparison between HadoopMapReduce and Apache Hama
  [1]. Link1
  [2]. Link2
  [3]. Link3