What algorithm would be optimal in this situation?

asked 2015-05-21 17:27:14 -0600

archer71
1 ●1 ●2

updated 2015-05-22 08:37:52 -0600

Please suggest algorithms or tutorials for achieving the following steps:

Extract features from a (one) high-quality image on the web.
Transform into a .xml or .dat file.
Port the file to ARM, iOS, or Android.
Obtain video frames.
Apply image recognition, feature extraction, etc. to the detected object.
Get the coordinates of the object in every frame scanned.

Out of scope for OpenCV, but maybe someone can help:

Render a video on top of the coordinates.


answered 2015-05-22 06:49:07 -0600

thdrksdfthmn
2170 ●5 ●18 ●46

You can start like this:

Detect features (possibly on the GPU), then extract descriptors. The choice of detector and descriptor depends on your application, that is, on what you are trying to do. Here are some explanations of features and descriptors.
Use FileStorage to save to .xml (and to load from it).
XML is portable, so using it in different environments should not be a problem.
To read video frames, you can use VideoCapture.
To detect the object, you can take inspiration from this example. For this you also need to match the descriptors.
You could also use tracking, so that you search a small region instead of the whole of every frame.
I have no example for playing one video inside another, but you can use two VideoCapture objects and put the frame from one into the detected area of the frame from the other. To deform the inner frame, you can use warpAffine (or whatever other geometric transformation you need).

Then you can save the new video or play it directly. Hope this helps. You can ask again once you have started on something and can say what doesn't work.


Comments

From what I understood from reading the documentation, I should use the same algorithm for feature extraction, for creating the descriptors, and for detecting the object. So my first question is: which algorithm is best for this use case? I will only be targeting the iPhone 5S and newer phones, so I expect very good FPS, but I would like to use the same algorithm throughout because I understood that this gives the best results. What do you recommend: SURF, SIFT, FAST, ORB, or some deep learning?

archer71 ( 2015-05-22 07:59:43 -0600 )

I have tried some of these, and it seems that the more information a descriptor carries, the slower it is. So I would suggest you start with SIFT and SURF, and if the FPS is not good enough, try ORB. FAST has no descriptor extractor. Also, if you are using C++, you can use the GPU, which will be much faster!

thdrksdfthmn ( 2015-05-22 09:22:56 -0600 )
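
One concrete reason ORB tends to be cheaper to match than SIFT or SURF: its descriptors are short bit strings compared by Hamming distance (XOR plus a bit count), whereas SIFT and SURF descriptors are float vectors compared by Euclidean distance. The pure-Python sketch below only illustrates that idea together with a nearest-neighbour ratio test. It is not how OpenCV implements matching, and the function names and the 0.75 ratio are assumptions chosen for the example.

```python
def hamming(a: bytes, b: bytes) -> int:
    """Number of differing bits between two equal-length binary descriptors."""
    return sum(bin(x ^ y).count("1") for x, y in zip(a, b))


def match(query, train, ratio=0.75):
    """Brute-force matching with a ratio test.

    Returns a list of (query_index, train_index, distance) tuples for the
    query descriptors whose best match is clearly better than the second best.
    """
    matches = []
    for qi, q in enumerate(query):
        ranked = sorted((hamming(q, t), ti) for ti, t in enumerate(train))
        if len(ranked) < 2:
            continue  # the ratio test needs a runner-up to compare against
        (best, best_index), (second, _) = ranked[0], ranked[1]
        if best < ratio * second:
            matches.append((qi, best_index, best))
    return matches
```

In OpenCV itself the equivalent would be a BFMatcher created with NORM_HAMMING for ORB descriptors, or with NORM_L2 for SIFT and SURF.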
