Ask Your Question

Revision history [back]

Yes it is possible to do what you want. Your general pipeline would consist of two parts:

  1. Vision: There are so many tutorials out there on how to detect specific faces using OpenCV. I will leave this to you to do the research.
  2. Alexa: This is where things get a little tricky. I have never used their SDK before so my response is merely based on a 10 minute research I conducted.

According to their documentation, they do have a Raspberry Pi SDK.

If I were you doing this, I would just set my own custom Alexa Wake Word which when triggered, would snap a photo and run the OpenCV stuff. For your case, the face detector model was already built on a known set of people so it simply returns back a label i.e. their name. So once the vision stuff is completed, you can create your own custom response and send it back as text input for Alexa to convert to speech.

OpenCV.js is simply a javascript library of OpenCV. Which version to use depends on which language you are mostly familiar with. Along with this, you also need to keep in mind of the types of SDKs offered in Alexa.

I suggest you read their tutorial first.

Finally, I would recommend doing the vision and Alexa components separately first then merging them together to create your final app.

Cheers mate :)

Yes it is possible to do what you want. Your general pipeline would consist of two parts:

  1. Vision: There are so many tutorials out there on how to detect specific faces using OpenCV. I will leave this to you to do the research.
  2. Alexa: This is where were things get a little tricky. I have never used their SDK before so my response is merely based on a 10 minute research I conducted.

According to their documentation, they do have a Raspberry Pi SDK.

If I were you doing this, I would just set my own custom Alexa Wake Word which when triggered, would snap a photo and run the OpenCV stuff. For your case, the face detector model was already built on a known set of people so it simply returns back a label i.e. their name. So once the vision stuff is completed, you can create your own custom response and send it back as text input for Alexa to convert to speech.

OpenCV.js is simply a javascript library of OpenCV. Which version to use depends on which language you are mostly familiar with. Along with this, you also need to keep in mind of the types of SDKs offered in Alexa.

I suggest you read their tutorial first.

Finally, I would recommend doing the vision and Alexa components separately first then merging them together to create your final app.

Cheers mate :)