
How does the stereo SGBM algorithm really work inside?

asked 2019-07-11 12:15:04 -0600

kobeking

updated 2019-12-09 07:48:28 -0600

ojesus

Hello everyone,

I have been playing with stereo BM and stereo SGBM for a little while now. Although I understand in detail how the BM algorithm works, I am still struggling to understand SGBM. I have read the relevant literature: "Stereo Processing by Semi-Global Matching and Mutual Information" by Heiko Hirschmüller, "Learning OpenCV 3" by Adrian Kaehler and Gary Bradski, and "Depth Discontinuities by Pixel-to-Pixel Stereo" by Stan Birchfield and Carlo Tomasi. I have also looked at many theses on the subject, yet I haven't been able to understand how the SGBM algorithm (which is a combination of BM with a variation of SGM) really works inside.

What I mainly don't understand is the following:

1) What is the Birchfield-Tomasi metric used in the algorithm? Every paper refers to a "Birchfield-Tomasi metric", but none explains what it is, and reading the Birchfield-Tomasi paper didn't help me understand it.

2) How is a window used in the algorithm (what operation is performed with that window)?

3) Why is there a preFilterCap in this algorithm if the filter type can only be Sobel (in contrast with the BM algorithm, where there is a choice between Sobel and another prefilter where preFilterCap is used)?

4) Finally, what are the different directions that can be used (3, 5 or 8)? Aren't we supposed to compute the matching cost only along the epipolar lines?

This is really frustrating for me: I have spent the last two full days trying to figure this out without success, and I really need to be able to explain it in my thesis.

Any help would be more than welcome!


Comments

Thanks to michaelr, I have been able to understand the first of my questions. Is anyone able to help with the other questions?

kobeking (2019-09-07 07:04:25 -0600)

Good morning, I have had the same questions in mind. Did you get any consistent responses, please? I need them for my graduation project. Thank you so much.

sana605 (2020-09-15 08:56:34 -0600)

@sana605 Asking a new question would be more helpful; no one is notified of your comment here.

sturkmen (2020-09-15 09:06:44 -0600)

1 answer


answered 2019-07-15 12:03:10 -0600

michaelr

The metric is the pixel dissimilarity measure. It is described in section 2.1.2 ("Pixel Dissimilarity") of the ICCV '98 paper (see here: https://cecas.clemson.edu/~stb/public... )
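
To make the measure concrete, here is a minimal sketch of the Birchfield-Tomasi dissimilarity on a single scanline, written from the definition in the paper. The function names `bt_half` and `bt_dissimilarity` are made up for this illustration, the border handling (repeating the edge pixel) is an assumption, and OpenCV's internal implementation may differ in detail (for example, it works on prefiltered images).

```python
def bt_half(ref, other, x, y):
    """One-sided term: how far ref[x] lies outside the intensity range
    spanned by other[y] and its two half-pixel interpolated neighbours."""
    centre = other[y]
    # Assumption for this sketch: repeat the edge pixel at the borders.
    left = other[y - 1] if y > 0 else centre
    right = other[y + 1] if y < len(other) - 1 else centre
    minus = 0.5 * (centre + left)    # linearly interpolated value at y - 1/2
    plus = 0.5 * (centre + right)    # linearly interpolated value at y + 1/2
    lo = min(minus, plus, centre)
    hi = max(minus, plus, centre)
    return max(0.0, ref[x] - hi, lo - ref[x])


def bt_dissimilarity(left_row, right_row, x, d):
    """Symmetric dissimilarity between left pixel x and right pixel x - d."""
    y = x - d
    return min(bt_half(left_row, right_row, x, y),
               bt_half(right_row, left_row, y, x))


# A ramp sampled with a half-pixel offset: the absolute difference is 5,
# but the measure is insensitive to this sampling shift.
print(bt_dissimilarity([0, 10, 20, 30], [5, 15, 25, 35], x=1, d=0))
```

The point of the measure is visible in the usage line: a plain absolute difference would penalise two images of the same intensity ramp that merely happen to be sampled half a pixel apart, whereas this measure compares each pixel against the interpolated range around its candidate match.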


Comments

Thank you for your reply! It is very helpful.

Do you have any idea how to explain the "directions" that can be used (3, 5 or 8)? I couldn't understand this part.

kobeking (2019-08-09 10:58:40 -0600)
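
On the "directions" question, a sketch may help, assuming the path recurrence from Hirschmüller's SGM paper. The per-pixel matching cost is still computed along the epipolar line (same row, different disparities). The directions are the 1D paths through the image (left to right, top to bottom, diagonals, and so on) along which those costs are smoothed; the aggregated costs of all paths are then summed per pixel before the best disparity is picked. The function name `aggregate_path` and the toy values of `p1` and `p2` are made up for this illustration, and this is not OpenCV's actual code.

```python
def aggregate_path(cost, p1, p2):
    """Aggregate matching costs along one 1D path.

    cost[i][d] is the matching cost of the i-th pixel on the path at
    disparity d. p1 penalises a disparity change of 1 between neighbours,
    p2 (larger) penalises any bigger jump.
    """
    n_disp = len(cost[0])
    agg = [list(cost[0])]            # first pixel on the path: no predecessor
    for i in range(1, len(cost)):
        prev = agg[-1]
        prev_min = min(prev)
        row = []
        for d in range(n_disp):
            candidates = [prev[d], prev_min + p2]
            if d > 0:
                candidates.append(prev[d - 1] + p1)
            if d < n_disp - 1:
                candidates.append(prev[d + 1] + p1)
            # Subtracting prev_min keeps the values from growing along the path.
            row.append(cost[i][d] + min(candidates) - prev_min)
        agg.append(row)
    return agg


# Three pixels on a path, three candidate disparities each.
toy_cost = [[0, 5, 5], [5, 0, 5], [5, 5, 0]]
print(aggregate_path(toy_cost, p1=1, p2=3))
```

Running the same routine over the pixels of a row from left to right gives one direction, over a column from top to bottom gives another, and so on. Using 3, 5 or 8 directions is then a trade-off between speed and memory on one side and how closely the result approximates a full 2D smoothness constraint on the other.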
