TY - CONF T1 - Model-based OpenMP implementation of a 3D facial pose tracking system T2 - 2006 International Conference on Parallel Processing Workshops, 2006. ICPP 2006 Workshops Y1 - 2006 A1 - Saha,S. A1 - Chung-Ching Shen A1 - Chia-Jui Hsu A1 - Aggarwal,G. A1 - Veeraraghavan,A. A1 - Sussman, Alan A1 - Bhattacharyya, Shuvra S. KW - 3D facial pose tracking system KW - application modeling KW - application program interfaces KW - application scheduling KW - coarse-grain dataflow graphs KW - Concurrent computing KW - data flow graphs KW - Educational institutions KW - face recognition KW - IMAGE PROCESSING KW - image processing applications KW - Inference algorithms KW - Message passing KW - OpenMP platform KW - parallel implementation KW - PARALLEL PROCESSING KW - parallel programming KW - Particle tracking KW - Processor scheduling KW - SHAPE KW - shared memory systems KW - shared-memory systems KW - Solid modeling KW - tracking AB - Most image processing applications are characterized by computation-intensive operations, and high memory and performance requirements. Parallelized implementation on shared-memory systems offer an attractive solution to this class of applications. However, we cannot thoroughly exploit the advantages of such architectures without proper modeling and analysis of the application. In this paper, we describe our implementation of a 3D facial pose tracking system using the OpenMP platform. Our implementation is based on a design methodology that uses coarse-grain dataflow graphs to model and schedule the application. We present our modeling approach, details of the implementation that we derived based on this modeling approach, and associated performance results. The parallelized implementation achieves significant speedup, and meets or exceeds the target frame rate under various configurations JA - 2006 International Conference on Parallel Processing Workshops, 2006. ICPP 2006 Workshops PB - IEEE SN - 0-7695-2637-3 M3 - 10.1109/ICPPW.2006.55 ER - TY - JOUR T1 - IPS-2: the second generation of a parallel program measurement system JF - IEEE Transactions on Parallel and Distributed Systems Y1 - 1990 A1 - Miller, B. P A1 - Clark, M. A1 - Hollingsworth, Jeffrey K A1 - Kierstead, S. A1 - Lim,S. -S A1 - Torzewski, T. KW - 4.3BSD UNIX systems KW - automatic guidance techniques KW - Automatic testing KW - Charlotte distributed operating system KW - CPA KW - DECstation KW - design concepts KW - distributed programs KW - graphical user interface KW - Graphical user interfaces KW - Instruments KW - interactive program analysis KW - IPS-2 KW - measurement KW - message systems KW - network operating systems KW - Operating systems KW - parallel program measurement system KW - parallel programming KW - parallel programs KW - Performance analysis KW - performance analysis techniques KW - performance evaluation KW - performance measurement system KW - Power system modeling KW - program bottlenecks KW - program diagnostics KW - Programming profession KW - semantics KW - Sequent Symmetry multiprocessor KW - shared-memory systems KW - software tools KW - Springs KW - Sun KW - Sun 4 KW - Unix KW - VAX AB - IPS, a performance measurement system for parallel and distributed programs, is currently running on its second implementation. IPS's model of parallel programs uses knowledge about the semantics of a program's structure to provide two important features. First, IPS provides a large amount of performance data about the execution of a parallel program, and this information is organized so that access to it is easy and intuitive. Secondly, IPS provides performance analysis techniques that help to guide the programmer automatically to the location of program bottlenecks. The first implementation of IPS was a testbed for the basic design concepts, providing experience with a hierarchical program and measurement model, interactive program analysis, and automatic guidance techniques. It was built on the Charlotte distributed operating system. The second implementation, IPS-2, extends the basic system with new instrumentation techniques, an interactive and graphical user interface, and new automatic guidance analysis techniques. This implementation runs on 4.3BSD UNIX systems, on the VAX, DECstation, Sun 4, and Sequent Symmetry multiprocessor VL - 1 SN - 1045-9219 CP - 2 M3 - 10.1109/71.80132 ER -