2011

 
 

CUDA is a great platform for high performance data parallel computing, but its architecture and implementation leaves something to be desired from a systems perspective.  The situation is improving in CUDA 4.0, but there’s still a way to go.


(image of Fermi die, courtesy of Nvidia)