Which method for object detection at 25 fps, full HD?
So I promised to prototype a model for object detection trained on my own labeled videos; in real time on full HD video @25 fps. I have spent quite some time learning Mask R-CNN. Now the model is running I realized that this library is too slow.
I have googled OpenCV, browsed through LearnOpenCV, searched these forums, peaked at the tutorials at opencv.org etc. I understand that using the DNN module with C++ will let me train my own model and do object detection at some frame rate.
Which OpenCV based method would you choose for training an object detection model to work @25 fps, full HD?
Based on this blog post, it looks like that specific method has an inference time of ~300 milliseconds? Does that extend to video inference using ~8 seconds for 25 frames?
Sorry if I'm conceived as being lazy for asking instead of trying. I just would like a friendly heads-up if my goal is not feasible.