Can we find out total number of speakers and their duration by looking at/analysing spectrogram.! [image description] (https://drive.google.com/drive/folders/0B4rwzcsr5hevdEJlam9scTRodTg)
By just looking at the image, I can see some pattern, but I am looking for right solution in terms of opencv code(python)