Next: Matrix Transposition Up: Parallel Algorithms for Image Segmentation Previous: Problem Overview

Block Distributed Memory Model

We use the Block Distributed Memory (BDM) Model ([25], [26]) as a computation model for developing and analyzing our parallel algorithms on distributed memory machines. This model allows the design of algorithms using a single address space and does not assume any particular interconnection topology. The model captures performance by incorporating a cost measure for interprocessor communication induced by remote memory accesses. The cost measure includes parameters reflecting memory latency, communication bandwidth, and spatial locality. This model allows the initial placement of data and prefetching.

The complexity of parallel algorithms will be evaluated in terms of two measures: the computation time , and the communication time . The measure refers to the maximum of the local computations performed on any processor as measured on the standard sequential model. The communication time refers to the total amount of communications time spent by the overall algorithm in accessing remote data. Using the BDM model, an access operation to a remote location takes time, and l prefetch read operations can be executed in time, where is the normalized maximum latency of any message sent in the communications network. No processor can send or receive more than one word at a time.

Two useful data movement patterns, matrix transposition and broadcasting, are discussed next.

Next: Matrix Transposition Up: Parallel Algorithms for Image Segmentation Previous: Problem Overview

David A. Bader
dbader@umiacs.umd.edu