1. Directory structure of the Darknet to Caffe project
2. Edit the yolov3-tiny cfg file
3.Quantize the Caffe model
4.Compiling the Quantized Model
4. Deploying YOLOv3 on the Ultra96 board
The complete tutorial can be downloaded from
Output

LogicTronix [FPGA Design + Machine Learning Company]

Published August 12, 2019 © GPL3+

Yolov3 Tiny Tutorial: Darknet to Caffe for Xilinx DNNDK

This tutorial shows about "how to convert the YoloV3 Tiny" of Darknet into Caffe Framework and then implement with Xilinx DNNDK and Ultra96.

IntermediateProtip3 hours7,303

Yolov3 Tiny Tutorial: Darknet to Caffe for Xilinx DNNDK

Things used in this project

Hardware components

Avnet Ultra96-V1

Software apps and online services

Xilinx DNNDK

Story

This tutorial is an extension to the Yolov3 Tutorial: Darknet to Caffe to Xilinx DNNDK. The flow of the tutorial is same as described in Edge AI tutorials. Here we mainly focus on the necessary adjustments required to convert Yolov3 Tiny variant.

First, download the Yolov3-tinycfg and weights file.

1. Directory structure of the Darknet to Caffe project

Directory structure of the Darknet to Caffe project

Follow the Preparing the Repository step as it is.

Put the downloaded cfg and weights file for yolov3-tiny inside the 0_model_darknet folder.

2. Edit the yolov3-tiny cfg file

On the line 93, replace this:

[maxpool]

size = 2

With this:

[maxpool]

size = 1

Then run the 0_convert.sh file. Before that modify the script file as shown below:

$ python ../yolo_convert.py \
       0_model_darknet/yolov3-tiny.cfg        #path to Darknet cfg file \
       0_model_darknet/yolov3-tiny.weights    #path to Darknet weights file \
       1_model_caffe/v3-tiny.prototxt         #path to Caffe prototxt file \
       1_model_caffe/v3-tiny.caffemodel       #path to Caffe caffemodel file

This script will convert the Darknet model into two caffe files, v3-tiny.prototxt and v3-tiny.caffemodel and store inside 1_model_caffe folder.

You can test the caffe prototxt using the 1_test_caffe.sh script inside example_yolov3 folder.

3.Quantize the Caffe model

To quantize the Caffe model, copyv3-tiny.prototxt and v3-tiny.caffemodel from 1_model_caffe to the2_model_for_qunatize. Then modify the v3-tiny.prototxt file as shown below:

a. Commenting out the first five lines

b. Adding an ImageData layer with the calibration images for the train phases.

name: "Darkent2Caffe"
#####Comment following five lines generated by converter#####
#input: "data"
#input_dim: 1
#input_dim: 3
#input_dim: 416
#input_dim: 416
#####Change input data layer to ImageDate and modify root_folder/source before run DECENT#####
layer {
name: "data"
type: "ImageData"
top: "data"
top: "label"
include {
  phase: TRAIN
}
transform_param {
  mirror: false
  yolo_height:416    #change height according to Darknet model
  yolo_width:416     #change width  according to Darknet model
}
image_data_param {
  source:"/PATH_TO/5_file_for_test/calib.txt"         #change path accordingly
  root_folder:"/PATH_TO/5_file_for_test/calib_data/"  #change path accordingly
  batch_size: 1
  shuffle: false
}
}
##### No changes after below layers#####

Also make sure to change the source and root folder path accordingly to where you have extracted the 5_file_for_test folder. This contains calibration data.

Edit the2_quantize.sh script as follows:

#Assuming "decent" tool is already in the PATH
$ decent quantize -model 2_model_for_quantize/v3-tiny.prototxt   \
                -weights 2_model_for_quantize/v3-tiny.caffemodel \
                -gpu 0 \
                -sigmoided_layers layer15-conv,layer22-conv  \
                -output_dir 3_model_after_quantize \
                -method 1

If you are doing in CPU only mode like inside a virtual machine your script will be the following:

#Assuming "decent" tool is already in the PATH
$ decent-cpu quantize -model 2_model_for_quantize/v3-tiny.prototxt   \
                -weights 2_model_for_quantize/v3-tiny.caffemodel \
                -sigmoided_layers layer15-conv,layer22-conv  \
                -output_dir 3_model_after_quantize \
                -method 1

Layer15-conv and layer22-conv are the output layers in the Yolov3-tiny as opposed to Yolov3 where layer81-conv, layer93-conv and layer105-conv are the output layers.

4.Compiling the Quantized Model

Modify the deploy.prototxt in the 3_model_after_quantize folder as follows:

layer {
name: "data"
type: "Input"
top: "data"
#####Comment following five lines #####
#transform_param {
# mirror: false
# yolo_height: 416
# yolo_width: 416
# }
#####Nothing change to below layers#####
input_param {
shape {
dim: 1
dim: 3
dim: 416
dim: 416
}
}
}

Edit the 3_compile.sh script as follows:

#Assume the dnnc-dpu1.3.0 is installed in your $PATH
$ dnnc-dpu1.3.0 --prototxt=3_model_after_quantize/deploy.prototxt \
              --caffemodel=3_model_after_quantize/deploy.caffemodel \
              --dpu=4096FA \
              --cpu_arch=arm64 \
              --output_dir=4_model_elf \
              --net_name=yolo_tiny \
              --mode=normal \
              --save_kernel

If you are using DDNDK version 3 which contains dpu 1.4 change the dnnc-dpu1.3.0 to dnnc which will use whatever dpu version you have installed. For Ultra96 change the dpu architecture to 2304FA.

For Ultra96

$ dnnc --prototxt=3_model_after_quantize/deploy.prototxt \
              --caffemodel=3_model_after_quantize/deploy.caffemodel \
              --dpu=2304FA \
              --cpu_arch=arm64 \
              --output_dir=4_model_elf \
              --net_name=yolo_tiny \
              --mode=normal \
              --save_kernel

This will generate dpu_yolo_tiny.elf file inside the 4_model_elf file. Copy the dpu_yolo_tiny.elf file to the model folder in the yolov3_deploy folder.

Then setup the board and transfer this yolov3_deploy folder to your target board. Ultra96 in our case.

4. Deploying YOLOv3 on the Ultra96 board

After transferring the yolov3_deploy folder to the board.

Edit the main.cc file which inside yolov3_deploy/src folder.And Make changes as follows:

At Line 239: Change

const vector<string> outputs_node ={"layer81_conv", "layer93_conv", "layer105_conv"};

const vector<string> outputs_node = {"layer15_conv", "layer22_conv"};

At Line 395: Change

DPUKernel *kernel = dpuLoadKernel("yolo");

DPUKernel *kernel = dpuLoadKernel("yolo_tiny");

Then Save the file. Change the directory to yolov3_deploy and run the make command as follows:

make -j

This will create an executable yolo.

Use the following command to run the executable yolo:

#Test image ./yolo coco_test.jpg i

#Test video ./yolo test.video v

The complete tutorial can be downloaded from:

YoloV3-tiny: Darknet to Caffe with Xilinx DNNDK

Output:

Figure: Implemented YoloV3 on Image [Picture Source: DNNDK, Xilinx]

We have a demo video of YoloV3 Tiny-Caffe at YouTube:

YoloV3 Tiny -Caffe-DNNDK Demo on Ultra96 FPGA

LogicTronix [FPGA Design + Machine Learning Company]

41 projects • 161 followers

We are Certified FPGA Design + Machine Learning Company with expertise on Machine Learning, Computer Vision, Embedded & Crypto Development.

Yolov3 Tiny Tutorial: Darknet to Caffe for Xilinx DNNDK

Things used in this project

Hardware components

Software apps and online services

Story

1. Directory structure of the Darknet to Caffe project

2. Edit the yolov3-tiny cfg file

3.Quantize the Caffe model

4.Compiling the Quantized Model

4. Deploying YOLOv3 on the Ultra96 board

The complete tutorial can be downloaded from:

Output:

Credits

LogicTronix [FPGA Design + Machine Learning Company]

Comments

Embed the widget on your own site

Yolov3 Tiny Tutorial: Darknet to Caffe for Xilinx DNNDK

Yolov3 Tiny Tutorial: Darknet to Caffe for Xilinx DNNDK

Things used in this project

Hardware components

Software apps and online services

Story

1. Directory structure of the Darknet to Caffe project

2. Edit the yolov3-tiny cfg file

3.Quantize the Caffe model

4.Compiling the Quantized Model

4. Deploying YOLOv3 on the Ultra96 board

The complete tutorial can be downloaded from:

Output:

Credits

LogicTronix [FPGA Design + Machine Learning Company]

Comments

Related channels and tags