Programming Tensor Cores in CUDA 9 | NVIDIA Technical Blog

A defining feature of the new Volta GPU Architecture is its Tensor Cores, which give the Tesla V100 accelerator a peak throughput 12 times the 32-bit floating point throughput of the previous…