Abstract: We present performance results for dense linear algebra using recent NVIDIA GPUs. Our matrix-matrix multiply routine (GEMM) runs up to 60% faster than the vendor's implementation and ...
Abstract: This book fills a gap in the existing literature for a serious presentation of numerical linear algebra that is accessible to undergraduates who have taken a basic course in linear algebra.