Published November 18, 2019 © GPL3+

Voice Controlled AI Camera

Build a voice controlled AI camera that notifies with what it sees via email.

AdvancedFull instructions provided5 days3,592

Things used in this project

Hardware components

Raspberry Pi 3 Model B

NVIDIA Jetson Nano Developer Kit

Raspberry Pi Camera Module

Software apps and online services

TensorFlow

OpenCV

Story

1. Introduction

This project will present the architectural design behind a voice controlled AI camera. The main idea is to control a camera by a voice interface to take a photo and have it send you an email with the photo description of what it observes. The project encompasses multiple components starting from hardware modules like Matrix Voice, Jetson Nano and RPIU2 to software frameworks like SNIPS AI and of course custom Python AI scripts that comprise the neural backend engine.

So in a nutshell we will leverage the Snips AI platform and Matrix Voice for the voice interface and Keras/Tenorflow DNN running on a Jetson Nano SBC for the image inference AI.

The demo code attached is setup so that if a person is detected an email is sent to the user with the annotated image from the AI engine. The class can be changed however.

2. Background

The camera will leverage optimized deep convolutional neural networks. I'm using the RPI camera version 1.1. The RPI does not have the capability to run large CNN.

When deciding which CNN architecture to use I narrowed it down to 3 options:

Inception V3 - Weights are around 92Mb, the network has 1000 classes
Resnet50 - weights are a bit smaller, 90 mb
Squeezenet - weights are ~ 5Mb and it provides AlexNet level accuracy

Even though Squeezenet has impressive characteristics whiche means you can run the whole thing on a RPI I opted to use InceptionV3 since practical tests gave better performance and the idea was to build a distributed system in the first place.

Since RPI3 performance is a bit anemic and in order to be able to scale the design to multiple cameras the program will be setup so that it transmits camera image data via TCP from RPI3 to a Jetson Nano. This design decision was made since due to the TegraX1 GPU, Jetson Nano is capable of running multiple neural networks in parallel. So in recipe format what we are going to build here is the following:

1. Raspberry Pi 3 captures camera photo
2. RPI3 resizes it to InceptionV3 resolution
3. RPI3 sends photo to Jetson Nano.
4. Jetson Nano runs inceptionV3 received photo.
5. Jetson Nano prints image description
5. Description is sent via email to user if class of interest matches

But first, will have to put all the HW together.

3. Hardware setup

The Matrix Voice will be used as an audio input and output device. A Raspberry Pi V3 was used for interfacing with Matrix Voice.The same RPI3 was also used to connect RPI V1.1 camera.

The RPI V1.1 camera uses an OV5640 5MP pixel camera that can take 5MP resolution photos. This is more than enough.

Another option which was tested but not fished in time is to interface the OV7670 camera with the Matrix voice ESP32 micro as the frame capture device.

As mentioned the Jetson Nano SBC will be used to run the backend AI engine.

The system block diagram is as follows:

5. Setting up the Jetson Nano.

I used the following image as an OS for Jetson Nano.

https://developer.nvidia.com/jetson-nano-sd-card-image-r3221

The Jetson Nano is not compatible with RPI V1.1 of the camera. Nvidia uses a Sony camera which uses the same MIPI-CSI2 interface as RPI V2 camera. Incidentally, the community has released a custom kernel that implements the custom drivers for RPI V1.1 camera however the update procedure for the Nano is not as simple as burning a new image.

So anyway we have to install the following packages on Jetson nano.

pip install tensorflow = 1.14
pip install keras = 2.2.1
pip install imutils
pip install pillow
pip install opencv-python
pip install opencv-contrib-python

5. Setting up the RPI software

On the RPI3 you need to install Keras, Scipy, OpenCV and Tensorflow.

The installation of the OpenCv V4.0 is a bit involved since there is no python wheel package so it has to be installed via a custom script.

Then install MatrixIO :

curl https://apt.matrix.one/doc/apt-key.gpg | sudo apt-key add -
echo "deb https://apt.matrix.one/raspbian $(lsb_release -sc) main" | sudo tee /etc/apt/sources.list.d/matrixlabs.list
sudo apt-get update
sudo apt-get upgrade
sudo reboot
sudo apt install matrixio-kernel-modules
sudo reboot

The next step is to install the SNIPS AI framework:

On a Host laptop install Node JS and then install SAM:

sudo npm install -g snips-sam

Then install SNIPS and connect to RPI3 and init the service:

sam connect raspberrypi.local
sam init

Then, configure SNIPS and the configuration file by issuing:

sudo nano /etc/snips.toml
mike = "MATRIXIO-SOUND: - (hw:2,0)"

#also add
portaudio_playback = "default"

finally check that the sound configuration file has the correct Matrix Voice Codec:

nano /etc/asound.conf

Finally install Node Js on RPI3:

sudo apt-get update
sudo apt-get upgrade
curl -sL https://deb.nodesource.com/setup_10.x | sudo -E bash -
sudo apt-get install -y nodejs

The complete client software runs in its own python environment.

6. Testing the network camera

The first version of the software simply records video from the client side and sends it via TCP to the Jetson Nano which performs classification in it.

Basically this part of the Python script implements live video frame classification on the Jetson Nano using RPI camera as a satellite device. On the image below the camera is pointed to a monitor.

You can see the frames and the classification AI engine running on the Jetson nano on the right side.

The next component to test was the email notifications.

7. Testing email notifications

To test the email notification I wrote a Python script found below that takes an image from the current directory and sends it via email.

The idea behind this is that once the image is received via TCP it is saved on the Jetson Nano and then if the image belongs to a specific class, let's say person, the script will attach the frame as an attachment and send an email to the user.

This was tested with a Microsoft Outlook account and as you can see below it successfully works.

This script was then merged with the server side software.

8. Testing the AI engine

On the RPI install Tensorflow V1.14. This is needed just for test:

$ sudo apt-get install libhdf5-serial-dev hdf5-tools libhdf5-dev zlib1g-dev zip libjpeg8-dev
$ sudo apt-get install python3-pip
$ sudo pip3 install -U pip
$ sudo pip3 install -U numpy grpcio absl-py py-cpuinfo psutil portpicker six mock requests gast h5py astor termcolor protobuf keras-applications keras-preprocessing wrapt google-pasta
$ sudo pip3 install --pre --extra-index-url https://developer.download.nvidia.com/compute/redist/jp/v42 tensorflow-gpu==1.14.0+nv19.9

The AI engine runs on the Jetson Nano.

1 / 2

9. Matrix Voice setup

After plugin in the Matrix Voice on the RPI3, the first step was to install the packages:

curl https://apt.matrix.one/doc/apt-key.gpg | sudo apt-key add -
echo "deb https://apt.matrix.one/raspbian $(lsb_release -sc) main" | sudo tee /etc/apt/sources.list.d/matrixlabs.list
sudo apt-get update
sudo apt-get upgrade
sudo reboot
sudo apt install matrixio-kernel-modules
sudo reboot

Next go to Snips AI and open an account. Then, install NodeJs on the development host.

Install Node Js on the RPI also by issuing the following:

sudo apt-get update
sudo apt-get upgrade
curl -sL https://deb.nodesource.com/setup_10.x | sudo -E bash -
sudo apt-get install -y nodejs

Once Node is installed, install SAM.

sudo npm install -g snips-sam

Then connect with the RPI3 on the local netwok and install SNIPS on it.

sam connect raspberrypi.local
sam init

The next step is to setup the microphone and audio. I connected a pair of headphones to the audio jack of the Matrix Voice.

sam setup audio

Finally test the audio and microphone by issuing, say something and then press a key.

sam test microphone / speaker

1 / 2

I encountered a lot of issues with the SNIPS configuration where it would break the day after even though everything was working correctly the day before.

One solution was to remove all the SNIPS packages and re-install them again:

sudo apt-get purge snips-platform-voice
sudo apt-get purge snips-platform-demo
sudo apt-get purge snips-tts
sudo apt-get purge snips-maker-tts
sudo apt-get purge snips-watch
sudo apt-get purge snips-template
sudo apt-get purge snips-skill-server
sudo rm -rf /usr/share/snips
sudo rm -rf /var/lib/snips
sudo userdel _snips
sudo userdel _snips-skills

Then install the snips Camera assistant. this basically recognizes Hey snips Hotword and the intent camera on and camera off.

sam install assistant -i proj_de2aOplEwD

SNIPS ASR running on the RPI performs the word to text transcribing. When the hotword and intent are detected it gets then routed though a MQTT call to the on_connect() function to execute the respective tasks.

When issuing a voice command, I see that it gets detected, however there are problems with the Snips playback and intent detection.

I also observed that while the intent was accurately transcribed a couple of times it stops working at a later date.

The mosquitto server was then not running so I had to search around for a fix.

Looking on the forum I was able to get it back running by issuing the following:

sudo chown mosquitto /var/log/mosquitto
sudo chown mosquitto:mosquitto /var/log/mosquitto/mosquitto.log


sudo systemctl restart mosquitto

To test the SNIPS voice interaction I embedded the camera control under the onMessage MQTT function call:

# Process a message as it arrives
def on_message(client, userdata, msg):
if not msg.topic == HOTWORD_DETECTED:
return
payload = json.loads(msg.payload)
model_id = payload["modelId"]
if model_id in HOTWORDS_ON:
    print("camera_on")
elif model_id in HOTWORDS_OFF:
    print("camera_off")
else:
print("Camera snapshot:  un-mapped hotword ")

This is the part that I ended up spending the most amount of time due to SNIPS constantly breaking down. Specifically the services would stop working when the app was updated. After un-installing and re-installing a couple of times I ended up calling the client Python script from the javascript program used to turn on and off the lights. This worked a couple of times until Snips decided not to run any of the services.

So I had to re-install again and redo the configuration from scratch. While Matrix Voice works correctly as a microphone and speaker the SNIPS service has a lot of issues as of November 2019. Python 3 scripts were only added recently.

The plan is to revise this in a future update.

10. Client software

The software is partitioned into a client side and a server side. The complete system makes uses of multiple stacks such as Keras, Matrix IO API, python sockets and OpenCV.

The client software runs on the RPI3. It's purpose is to get a camera snapshot every time the user issues a voice command. Once the utterance is decoded to instruct the camera to take a photo, the client then encapsulates this photo and sends it via a TCP socket to the Jetson NANO server which runs the AI engine.

The server program on the Jetson nano must be started prior to the client.

The client software starts by turning on the camera and establishing a connection with the Jetson Nano server.

Then when the utterance is decoded from the MQTT server running a camera snapshot is taken. The Pickle library is used to encapsulate the frame into a packet and send it to the sever. The message size is also appended to the packet so that the server has an idea what to expect.

To use SNIPS and Matrix Voice as a voice control interface one has to implement mqtt binding for the intents and the hot word that has been selected.

https://docs.snips.ai/articles/console/actions/

11.Sever software

The server script does the following:

a)received the image frames from the TCP client side

b) decodes the TCP frames into images and saves it locally

c) passes the images to the CNN inception V3 network to obtain an object classification category

d) If the class belongs to a person it sends an email to the user with the locally saved frame

Once the server software is initialized the first step is the initialization of the AI engine. The first time the script will download the Inception V3 weights. The next step is to configure the email. I have used the Outlook client with the smtplib Python library but any email provider should do. Once the program is started it asks the user for :

a) the email address

b) the email password

c) the recepient address (for this you can enter the same email as the sender address)

Once the program obtains the data to configure email and then it proceeds to start the main TCP server.and place it in listening mode.

The next step is to define the AI CNN model. This start the Tensorflow engine on the background.

The server then waits for image packets from the client. Once it receives the image it decapsulates it from the Pickle frame, builds an image from them and passes that image to the AI engine.

The AI engine, obtains the image and decodes it into a class by giving a short description.

img = image.load_img("sframe.jpg", target_size=(299, 299))
x = image.img_to_array(img)
x = np.expand_dims(x, axis=0)
x = preprocess_input(x)
y = model.predict(x)
# cnt =  cnt + 1
for index, res in enumerate(decode_predictions(y)[0]):
print('{}. {}: {:.3f}%'.format(index + 1, res[1], 100 * res[2]))
print(decode_predictions(y)[0])				#this is a list
print(type(decode_predictions(y)[0][1]))	#this is a tuple
print("Item detected : " + (decode_predictions(y)[0][0][1]))		# this picks the class description from the first tuple that shows the class with the highest probabililty
decobj = (decode_predictions(y)[0][0][1])
print("sending email to user\r\n")
sendmail(decobj)

This description then is used as the EMAIL subject together with the photo.

The advantage behind using Jetson Nano as opposed to running the complete application on the RPI3 is that it can use multiple neural networks in parallel due to the use of the GPU. In addition by decentralizing the application it becomes more robust and easy to scale.

12. Demo time.

To test the design I initially setup the server side. Make sure to use python3.

After launching the server side python script move over to the RPI and launch the client script.

You will see a lot of frames being transported via the network. If the AI feature extractor determines that a person is present on the image frame, it sends an email to the user.

On RPI activate the python environment with the OpenCV and start the client:

source .profile
workon cv
python clientVideo.py

On Jetson Nano start the server simply use python3 to run the script/

python3 server.py

There is a multitude of augmentations that you can build upon this program.

First and foremost you can improve the FPS by modifying the AI engine to use TensorRT inference model which can speed things up.

Conclusion

This project implemented a voice controlled AI camera using a distributed architecture which allows it to scale to multiple cameras. The main aim of the design was to leverage the Matrix Voice for voice interaction and the Keras/Tensorflow stack for AI inference. Hopefully SNIPS will iron out the stability issues the near future.

Code

# Copyright dhq November 13 2019

# import all necessary components
import smtplib
from email.mime.text import MIMEText
from email.mime.image import MIMEImage
from email.mime.multipart import MIMEMultipart
import pickle
import socket
import struct
from PIL import Image
import numpy as np
import time
import os, sys
import sys
import cv2
import smtplib
import numpy as np

from imutils.video import VideoStream
import datetime
import argparse
import imutils
import time
import cv2
from time import sleep


HOSTNAME = 'smtp-mail.outlook.com'
USERNAME = ''
PASSWORD = ''
RECEPIENT_ADDRESS = ''
SUBJECT = 'Subject: AI camera using (imageNetV3) engine detected: '

""" Get email credentials
	:param none: 
	:returns none : 
"""
def LoginCredentials():
	USERNAME = input('Enter your email address: ')
	PASSWORD = input('Enter your email password: ')
	RECEPIENT_ADDRESS = input('Enter the recepient address: ')
	return USERNAME,PASSWORD,RECEPIENT_ADDRESS
	
	
def sendmail(detectedobject, user, emailpass, rcp_emailaddr):
	EMAIL_BODY = SUBJECT + str(detectedobject)
	
	msg = MIMEMultipart('related')
	msg['Subject'] = SUBJECT
	msg['From'] = USERNAME
	msg['To'] = RECEPIENT_ADDRESS

	html = """\
	<html>
	  <head></head>
		<body>
		  <img src="cid:image1" alt="Logo" style="width:300px;height:400px;"><br>
		   <p><h4 style="font-size:15px;">AI engine camera snapshot.</h4></p>           
		</body>
	</html>
	"""
	# Record the MIME types of text/html.
	part2 = MIMEText(html, 'html')

	# Attach parts into message container.
	msg.attach(part2)

	# This example assumes the image is in the current directory
	fp = open('ELE.JPG', 'rb')
	msgImage = MIMEImage(fp.read())
	fp.close()

	msgImage.add_header('Content-ID', '<image1>')
	msg.attach(msgImage)

	# print(msg.as_string())
	server = smtplib.SMTP(HOSTNAME, 587)
	server.ehlo()
	server.starttls()
	server.login(user, emailpass)	
	server.sendmail(user, [rcp_emailaddr],msg.as_string())
	server.quit()
	print ("Email sent. Logging Out...")
	
	
if __name__ == '__main__':
		try:
			print ('To exit the program press Ctrl-C.')
			USERNAME,PASSWORD,RECEPIENT_ADDRESS = LoginCredentials()
			detectedobject = str("found another lama")
			print ('Sending email \r\n.')
			sendmail(detectedobject, USERNAME, PASSWORD, RECEPIENT_ADDRESS)
			print ('Email sent \r\n.')

		except KeyboardInterrupt:
			print ('Quitting...')

# Copyright dhq November 13 2019

# import all necessary components
import smtplib
from email.mime.text import MIMEText
from email.mime.image import MIMEImage
from email.mime.multipart import MIMEMultipart

import socket
import sys
import cv2
import pickle
import numpy as np
import struct ## new
import zlib

from PIL import Image
import os
import sys

# from skimage.io import imread
# from skimage.transform import resize
# import numpy as np
# from matplotlib import pyplot as plt
# from skimage import data, io
# from matplotlib import pyplot as plt
# from skimage import img_as_ubyte		#convert float to uint8
# from skimage.color import rgb2gray

from tensorflow.python.keras.preprocessing import image
from tensorflow.python.keras.applications.inception_v3 import *
import numpy as np

from imutils.video import VideoStream
import datetime
import argparse
import imutils
import time
import cv2
from time import sleep
from keras.models import load_model


HOST=''
PORT=8485


HOSTNAME = 'smtp-mail.outlook.com'
USERNAME = ''
PASSWORD = ''
RECEPIENT_ADDRESS = ''
SUBJECT = 'Subject: AI camera using (imageNetV3) engine detected: '

""" Get email credentials
	:param none: 
	:returns none : 
"""
def LoginCredentials():
	USERNAME = input('Enter your email address: ')
	PASSWORD = input('Enter your email password: ')
	RECEPIENT_ADDRESS = input('Enter the recepient address: ')
	return USERNAME,PASSWORD,RECEPIENT_ADDRESS
	
	
def sendmail(detectedobject, user, emailpass, rcp_emailaddr):
	EMAIL_BODY = SUBJECT + str(detectedobject)
	
	msg = MIMEMultipart('related')
	msg['Subject'] = SUBJECT
	msg['From'] = USERNAME
	msg['To'] = RECEPIENT_ADDRESS

	html = """\
	<html>
	  <head></head>
		<body>
		  <img src="cid:image1" alt="Logo" style="width:300px;height:400px;"><br>
		   <p><h4 style="font-size:15px;">AI engine camera snapshot.</h4></p>           
		</body>
	</html>
	"""
	# Record the MIME types of text/html.
	part2 = MIMEText(html, 'html')

	# Attach parts into message container.
	msg.attach(part2)

	# This example assumes the image is in the current directory
	fp = open('servpic.jpg', 'rb')
	msgImage = MIMEImage(fp.read())
	fp.close()

	msgImage.add_header('Content-ID', '<image1>')
	msg.attach(msgImage)

	# print(msg.as_string())
	server = smtplib.SMTP(HOSTNAME, 587)
	server.ehlo()
	server.starttls()
	server.login(user, emailpass)	
	server.sendmail(user, [rcp_emailaddr],msg.as_string())
	server.quit()
	print ("Email sent. ")
	
	
def main():
	model = InceptionV3(weights='imagenet')
	print('Deep CNN started')
	
	s=socket.socket(socket.AF_INET,socket.SOCK_STREAM)
	print('Socket created')

	s.bind((HOST,PORT))
	print('Socket bind complete')
	s.listen(10)
	print('Socket now listening')

	conn,addr=s.accept()

	data = b""
	payload_size = struct.calcsize(">L")
	print("payload_size: {}".format(payload_size))
	while True:
		while len(data) < payload_size:
			print("Recv: {}".format(len(data)))
			data += conn.recv(4096)

		print("Done Recv: {}".format(len(data)))
		packed_msg_size = data[:payload_size]
		data = data[payload_size:]
		msg_size = struct.unpack(">L", packed_msg_size)[0]
		print("msg_size: {}".format(msg_size))
		while len(data) < msg_size:
			data += conn.recv(4096)
		frame_data = data[:msg_size]
		data = data[msg_size:]
		print(msg_size)
		frame=pickle.loads(frame_data)
		frame = cv2.imdecode(frame, cv2.IMREAD_COLOR)
		cv2.imwrite("servpic.jpg",frame)
		# cv2.imshow('ImageWindow',frame)
		# cv2.waitKey(1)
		img = image.load_img("servpic.jpg", target_size=(299, 299))
		x = image.img_to_array(img)
		x = np.expand_dims(x, axis=0)
		x = preprocess_input(x)
		y = model.predict(x)
		# cnt =  cnt + 1
		for index, res in enumerate(decode_predictions(y)[0]):
			print('{}. {}: {:.3f}%'.format(index + 1, res[1], 100 * res[2]))
		print(decode_predictions(y)[0])				#this is a list
		print(type(decode_predictions(y)[0][1]))	#this is a tuple
		print("Item detected : " + (decode_predictions(y)[0][0][1]))		# this picks the class description from the first tuple that shows the class with the highest probabililty
		decobj = (decode_predictions(y)[0][0][1])
		print("sending email to user\r\n")
		if(decobj == "")
			sendmail(decobj, USERNAME, PASSWORD, RECEPIENT_ADDRESS)


		
if __name__ == '__main__':
		try:
			print ('To exit the program press Ctrl-C.')
			USERNAME,PASSWORD,RECEPIENT_ADDRESS = LoginCredentials()
			print ('Email configuration done\r\n.')
			main()
		except KeyboardInterrupt:
			print ('Quitting...')

# Copyright dhq November 13 2019

#sends camera frames to Jetson Nano

import cv2
import io
import socket
import struct
import time
import pickle
import zlib

JETSON_HOSTNAME = '192.168.2.13'		#erplace wih your own

client_socket = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
client_socket.connect((JETSON_HOSTNAME, 8485))
connection = client_socket.makefile('wb')

cam = cv2.VideoCapture(0)

cam.set(3, 320);
cam.set(4, 240);

img_counter = 0

encode_param = [int(cv2.IMWRITE_JPEG_QUALITY), 90]

while True:
    ret, frame = cam.read()
    result, frame = cv2.imencode('.jpg', frame, encode_param)
#    data = zlib.compress(pickle.dumps(frame, 0))
    data = pickle.dumps(frame, 0)
    size = len(data)


    print("{}: {}".format(img_counter, size))
    client_socket.sendall(struct.pack(">L", size) + data)
    img_counter += 1

cam.release()

#!/usr/bin/env python

# encoding: utf-8
from __future__ import unicode_literals
import cv2
import numpy as np
import socket
import sys
import pickle
import struct

# from skimage.io import imread
# from skimage.transform import resize
# import numpy as np
# from matplotlib import pyplot as plt
# from skimage import data, io
# from matplotlib import pyplot as plt
# from skimage import img_as_ubyte		#convert float to uint8
# from skimage.color import rgb2gray
from PIL import Image

# import the necessary packages
# from tensorflow.python.keras.preprocessing import image
# from tensorflow.python.keras.applications.inception_v3 import *
# from keras.models import load_model

from imutils.video import VideoStream
import datetime
import argparse
import imutils
import time
from time import sleep

import json
import subprocess

import paho.mqtt.client as mqtt

fromtimestamp = datetime.datetime.fromtimestamp

# MQTT client to connect to the bus
mqtt_client = mqtt.Client()
HOST = "localhost"
PORT = 1883
HOTWORD_DETECTED = "hermes/hotword/default/detected"
HOTWORDS_ON = {"lumos"}

		
# cap=cv2.VideoCapture(0)

# construct the argument parse and parse the arguments
ap = argparse.ArgumentParser()
ap.add_argument("-p", "--picamera", type=int, default=-1,
	help="whether or not the Raspberry Pi camera should be used")
args = vars(ap.parse_args())

# initialize the video stream and allow the cammera sensor to warmup
vs = VideoStream(usePiCamera=args["picamera"] > 0).start()
time.sleep(2.0)


# clientsocket=socket.socket(socket.AF_INET,socket.SOCK_STREAM)
# clientsocket.connect(('192.168.2.9',8089))

encode_param = [int(cv2.IMWRITE_JPEG_QUALITY), 90]


def main():
	img_counter = 0
	client_socket = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
	client_socket.connect(('192.168.2.13', 8485))
	connection = client_socket.makefile('wb')
	print('Deep CNN started')
	
	# loop over the frames from the video stream
	while True:
		# grab the frame from the threaded video stream and resize it
		# to have a maximum width of 400 pixels
		frame = vs.read()
		frame = imutils.resize(frame, width=320)
	 
		# draw the timestamp on the frame
		timestamp = datetime.datetime.now()
		ts = timestamp.strftime("%A %d %B %Y %I:%M:%S%p")
		cv2.putText(frame, ts, (10, frame.shape[0] - 10), cv2.FONT_HERSHEY_SIMPLEX, 0.35, (0, 0, 255), 1)
		
		# show the frame
		cv2.imshow("Frame", frame)
		
		result, frame = cv2.imencode('.jpg', frame, encode_param)
		#    data = zlib.compress(pickle.dumps(frame, 0))
		data = pickle.dumps(frame, 0)
		size = len(data)

		print("{}: {}".format(img_counter, size))
		client_socket.sendall(struct.pack(">L", size) + data)
		img_counter += 1


# Subscribe to the important messages
def on_connect(client, userdata, flags, rc):
    mqtt_client.subscribe(HOTWORD_DETECTED)


# Process a message as it arrives
def on_message(client, userdata, msg):
	if not msg.topic == HOTWORD_DETECTED:
		return

	payload = json.loads(msg.payload)
	model_id = payload["modelId"]
	# if model_id in HOTWORDS_ON:
		# #main()
		# else:
			# print("Unmapped hotword model_id: %s" % model_id)



if __name__=="__main__":
	main()
	mqtt_client.on_connect = on_connect
	mqtt_client.on_message = on_message
	mqtt_client.connect(HOST, PORT)
	mqtt_client.loop_forever()

Voice Controlled AI Camera

Things used in this project

Hardware components

Software apps and online services

Story

1. Introduction

2. Background

3. Hardware setup

5. Setting up the Jetson Nano.

5. Setting up the RPI software

6. Testing the network camera

7. Testing email notifications

8. Testing the AI engine

9. Matrix Voice setup

10. Client software

11.Sever software

12. Demo time.

Conclusion

Schematics

AI camera system conceptual block diagram

Code

TestSendEmailImage.py

ImgServerAI.py

ImgClientTestCam.py

ImgClientMatrixAI.py

Credits

Comments

Embed the widget on your own site

Voice Controlled AI Camera

Voice Controlled AI Camera

Things used in this project

Hardware components

Software apps and online services

Story

1. Introduction

2. Background

3. Hardware setup

5. Setting up the Jetson Nano.

5. Setting up the RPI software

6. Testing the network camera

7. Testing email notifications

8. Testing the AI engine

9. Matrix Voice setup

10. Client software

11.Sever software

12. Demo time.

Conclusion

Schematics

AI camera system conceptual block diagram

Code

TestSendEmailImage.py

ImgServerAI.py

ImgClientTestCam.py

ImgClientMatrixAI.py

Credits

Comments

Related channels and tags