After profiling my x3399 board(arm cores) using DS-5 streamline I realized that the most heavy function in my system is cvGetHuMoments, after deep search in my code I realized that this function is related to cvtcolor conversion from yuv2 to bgr? is it possible? I can`t find in my profiler where exactly I use the cvGetHuMoments function but when I disable the cvtcolor conversion it disappeared in the profiler. If the problem is with cvtcolor, how would you recommend to optimize it? I thought about using opencl or neon optimization(arm compute library) but could not really understand how to implement it in my code, I would be happy to get a tutorial on how to implement it(replace the cvtcolor in my code). Thanks