Performance DataΒΆ
The table below show improvements obtained from optimizing a set of image processing kernels for the C66x DSP device using the techniques described in this chapter.
| Kernel | Generic OpenCL C (cycles/pixel) | Optimized OpenCL C (cycles/pixel) | Improvement (times faster) |
|---|---|---|---|
| Convolution | 12 | 5 | 2.40 |
| Histogram | 56 | 1.75 | 32.00 |
| X_Gradient | 12.4 | 1.25 | 9.92 |
| Edge Relaxation | 530 | 48 | 11.04 |