Efficient band approximation of Gram matrices for large scale kernel methods on GPUs

TitleEfficient band approximation of Gram matrices for large scale kernel methods on GPUs
Publication TypeConference Papers
Year of Publication2009
AuthorsHussein M, Abd-Almageed W
Conference NameProceedings of the Conference on High Performance Computing Networking, Storage and Analysis
Date Published2009///
Conference LocationPortland, Oregon
ISBN Number978-1-60558-744-8
Keywordsaffinity propagation, algorithms, gpu

Kernel-based methods require O(N2) time and space complexities to compute and store non-sparse Gram matrices, which is prohibitively expensive for large scale problems. We introduce a novel method to approximate a Gram matrix with a band matrix. Our method relies on the locality preserving properties of space filling curves, and the special structure of Gram matrices. Our approach has several important merits. First, it computes only those elements of the Gram matrix that lie within the projected band. Second, it is simple to parallelize. Third, using the special band matrix structure makes it space efficient and GPU-friendly. We developed GPU implementations for the Affinity Propagation (AP) clustering algorithm using both our method and the COO sparse representation. Our band approximation is about 5 times more space efficient and faster to construct than COO. AP gains up to 6x speedup using our method without any degradation in its clustering performance.