OpenCV ocl library performance problem and squares sample performance question

While studying OpenCV and OpenCL (through the OpenCV ocl library), I ran into a performance problem with the ocl library and the squares sample code.

I expected GPU mode (ocl, OpenCL) to be faster than CPU mode, but my test results say otherwise.
My test compares the run time of the findSquares function in the squares sample with the run time of the ocl library functions that findSquares uses.
With hog_ocl, however, GPU mode is faster than CPU mode.
I don't know why this happens.

Result time of the "findSquares" function in the squares sample:

Square Search 821.838ms 3.77946ms

Result times from the ocl sample performance_ocl for the ocl functions used in the squares sample:

pyrDown 4.12325ms 5.75384ms
pyrUp 5.70101ms 9.98734ms
Canny 81.9024ms 92.0042ms
threshold 1.01938ms 2.55077ms
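
A minimal sketch of how per-function timings like the ones above can be collected in C++, using only the standard library. The helper name `time_ms` and its parameters `warmup` and `iterations` are hypothetical and are not part of OpenCV or of the performance_ocl sample. It assumes the call being measured (for example a CPU function or its ocl counterpart) is wrapped in a callable.

```cpp
#include <chrono>
#include <functional>

// Hypothetical helper (not an OpenCV API): calls `work` `warmup` times without
// timing, then returns the mean wall-clock time per call, in milliseconds,
// over `iterations` timed calls.
double time_ms(const std::function<void()>& work, int warmup = 1, int iterations = 10) {
    // Untimed calls, so that one-time setup work is not counted.
    for (int i = 0; i < warmup; ++i) {
        work();
    }

    const auto start = std::chrono::steady_clock::now();
    for (int i = 0; i < iterations; ++i) {
        work();
    }
    const auto stop = std::chrono::steady_clock::now();

    // Convert the elapsed time to milliseconds as a double and average it.
    const std::chrono::duration<double, std::milli> elapsed = stop - start;
    return elapsed.count() / iterations;
}
```

It could be called as `time_ms([&]{ /* function under test */ })`, once for the CPU version and once for the ocl version. For the GPU path, whether the transfer of the image to and from the device is placed inside or outside the callable changes what the number means, so both measurements should state which was done.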

My test environment:
- CPU : AMD E-350 Processor
- GPU : ATI Radeon HD 6310
- Memory : 4GB
- OS : Ubuntu 12.04 LTS
- OpenCL Usable : Yes
- OpenCV Version : 2.4.4

