How can I count boxes using only this camera angle?

I need to count boxes in a camera image. The boxes are stacked in multiple layers, and each layer contains several boxes, as in the image below:

image description

Which algorithm should I apply to this problem? Canny edge detection? Segmentation? I tried both, but the accuracy was low.
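
To make the segmentation idea concrete: once a binary mask of the box faces exists, counting reduces to connected-component labeling. Below is a minimal pure-Python sketch of that counting step only. The function name `count_components`, the hand-made `toy_mask`, and the choice of 4-connectivity are assumptions for illustration; this is not my real data or my exact code.

```python
from collections import deque


def count_components(mask):
    """Count 4-connected regions of 1s in a 2D list of 0/1 values."""
    rows = len(mask)
    cols = len(mask[0]) if rows else 0
    seen = [[False] * cols for _ in range(rows)]
    count = 0
    for r in range(rows):
        for c in range(cols):
            if mask[r][c] != 1 or seen[r][c]:
                continue
            # Found an unvisited foreground pixel: this starts a new region.
            count += 1
            seen[r][c] = True
            queue = deque([(r, c)])
            while queue:
                y, x = queue.popleft()
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if (0 <= ny < rows and 0 <= nx < cols
                            and mask[ny][nx] == 1 and not seen[ny][nx]):
                        seen[ny][nx] = True
                        queue.append((ny, nx))
    return count


# Hypothetical toy mask: 1 = box-face pixel, 0 = gap or edge between boxes.
toy_mask = [
    [1, 1, 0, 1, 1, 0, 1],
    [1, 1, 0, 1, 1, 0, 1],
    [0, 0, 0, 0, 0, 0, 0],
    [1, 1, 1, 0, 1, 1, 1],
]
print(count_components(toy_mask))  # 5
```

The count is only as good as the mask: if the edge between two adjacent boxes is not detected, their regions merge into one and the total comes out too low, which may be part of why my accuracy is poor.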

Help me!