2015-12-10 13:38:17 -0600 | commented question | MatchTemplate OCL and CPU Timing After diving a little deeper, I am not convinced the opencl normalized functions are implemented for a 32bit float. |
2015-12-09 22:35:33 -0600 | asked a question | MatchTemplate OCL and CPU Timing Hi, I was looking for a performance boost using the OCL implementation of matchtemplate(), but the Mat and UMat implementation both take the same time. When diving a little deeper, It is the normalization that takes the majority of the time, anyone have anymore information on how to implement the fastest normalized template matching? Results: CPU OCL |