Showing posts with the label cuda

Matrix Multiplication Cuda Program

For matrix multiplication of m1 and m2 eg m1 x m2 we need to make sure W1 H2 and the size of the result will be H1 x W…

Matrix Multiplication Kernel Cuda

It ensures that extra threads do not do any work. One platform for doing so is NVIDIAs Compute Uni ed Device Architect…

Cuda Matrix Multiplication Example

Opencl Matrix Multiplication Sgemm Tutorial