%0 Conference Paper %B Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis %D 2009 %T Efficient band approximation of Gram matrices for large scale kernel methods on GPUs %A Hussein,Mohamed %A Abd-Almageed, Wael %K affinity propagation %K algorithms %K gpu %X Kernel-based methods require O(N2) time and space complexities to compute and store non-sparse Gram matrices, which is prohibitively expensive for large scale problems. We introduce a novel method to approximate a Gram matrix with a band matrix. Our method relies on the locality preserving properties of space filling curves, and the special structure of Gram matrices. Our approach has several important merits. First, it computes only those elements of the Gram matrix that lie within the projected band. Second, it is simple to parallelize. Third, using the special band matrix structure makes it space efficient and GPU-friendly. We developed GPU implementations for the Affinity Propagation (AP) clustering algorithm using both our method and the COO sparse representation. Our band approximation is about 5 times more space efficient and faster to construct than COO. AP gains up to 6x speedup using our method without any degradation in its clustering performance. %B Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis %S SC '09 %I ACM %C Portland, Oregon %P 31:1–31:10 - 31:1–31:10 %8 2009/// %@ 978-1-60558-744-8 %G eng %R 10.1145/1654059.1654091