Kernel compilation proceeds in two steps.

- cuDNN API implementations for convolutions (using the im2col algorithm over Cedric Nugteren's CLBlast), pooling, ReLU, tanh, and sigmoid.
- cuBLAS API implementations for GEMM, GEMV, SCAL, and SAXPY (using Cedric Nugteren's CLBlast).
- Compiler for device-side code, handling templated C++ code and converting it into bog-standard OpenCL 1.2 code.
- Compiler for host-side code, including memory allocation, copy, streams, and kernel launches.

Using OpenCL device: Intel(R) HD Graphics 5500 BroadWell U-Processor GT2

Using Intel, OpenCL platform: Intel Gen OCL Driver
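The im2col approach mentioned for convolutions can be illustrated with a small sketch. This is not the project's actual implementation: it is a pure-Python toy that assumes a single-channel 2-D input, stride 1, no padding, and the cross-correlation convention (no kernel flip) common in deep learning libraries. The function names `im2col` and `conv2d_im2col` are hypothetical. The idea is to unroll each input patch into a row so that the convolution becomes a matrix product, which a real implementation would hand to a GEMM routine such as the one in CLBlast.

```python
def im2col(image, kh, kw):
    """Unroll every kh x kw patch of a 2-D image into one flat row."""
    h, w = len(image), len(image[0])
    rows = []
    for i in range(h - kh + 1):          # top-left row of the patch
        for j in range(w - kw + 1):      # top-left column of the patch
            rows.append([image[i + di][j + dj]
                         for di in range(kh)
                         for dj in range(kw)])
    return rows


def conv2d_im2col(image, kernel):
    """Stride-1, unpadded 2-D cross-correlation via im2col."""
    kh, kw = len(kernel), len(kernel[0])
    out_w = len(image[0]) - kw + 1
    flat_kernel = [v for row in kernel for v in row]
    patches = im2col(image, kh, kw)
    # Matrix-vector product: one dot product per patch. With several
    # output channels this becomes a full matrix-matrix product (GEMM).
    flat_out = [sum(a * b for a, b in zip(patch, flat_kernel))
                for patch in patches]
    # Fold the flat result back into a 2-D output map.
    return [flat_out[r:r + out_w] for r in range(0, len(flat_out), out_w)]


image = [[1, 2, 3],
         [4, 5, 6],
         [7, 8, 9]]
kernel = [[1, 0],
          [0, 1]]
result = conv2d_im2col(image, kernel)
```

The trade-off in this design is memory for speed: the unrolled patch matrix duplicates overlapping input values, but the heavy lifting then reduces to a single well-optimized matrix multiplication.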