In the past decade, the high-performance compute capabilities of heterogeneous GPGPU platforms have led to the popularity of data-parallel programming languages such as CUDA and OpenCL. Developing solutions using these languages involves a steep learning curve due to the complexity of the underlying devices and their impact on performance. This has led to the emergence of several High Performance Computing frameworks which provide high-leve...