Any future plan to add an audio input function (similar to blobFromImage ) to the DNN module?

dnn
audio

asked 2018-11-12 12:24:12 -0600

kkudryavtsev
1 ●1

updated 2018-11-12 12:42:53 -0600

berak
32993 ●7 ●81 ●312

So that would allow to run some TensorFlow models (like DeepSpeech project) for sound recognition?

edit retag flag offensive close merge delete

Comments

i can't speak for the devs here, but it sounds highly unlikely to happen.

opencv is still a computer-vision library, and the tensorflow audio api is very complex, containing means to load files, calculate MEL coefficients, time-stretching, and various other processing.

related question

berak ( 2018-11-13 01:40:36 -0600 )edit

Understood, thank you! I just thought that OpenCV::DNN is one of the best libraries in terms of speed and simplicity of use. It also supports already all types of layers needed for DeepSpeech, even though they have quite complex overall algorithm. So that the input function would kind of stimulate using the library outside of the vision field.

kkudryavtsev ( 2018-11-13 10:46:55 -0600 )edit

add a comment

Any future plan to add an audio input function (similar to blobFromImage ) to the DNN module?

Comments

Links

Question Tools

Stats

Related questions

Any future plan to add an audio input function (similar to blobFromImage ) to the DNN module? edit

Comments

Links

Question Tools

Stats

Related questions

Any future plan to add an audio input function (similar to blobFromImage ) to the DNN module?