Published March 31, 2022 © MIT

Smart Traffic Camera – Pothole detector using Kria KV260

Potholes are a serious road-safety issue in India: the author cites statistics attributing 30% of deaths to potholes or to a sequence of events triggered by one.

Intermediate | Full instructions provided | Over 2 days | 2,129

Things used in this project

Hardware components

AMD Kria™ KV260 Vision AI Starter Kit

Raspberry Pi Camera Module

Software apps and online services

Story

Potholes are a serious issue in India: the author cites statistics attributing 30% of deaths to potholes or to a sequence of events triggered by a pothole. Potholes are formed by the force of water and abrasion: the road surface cracks, erodes, and eventually breaks open. This makes them a major concern for researchers in road safety and traffic control.

The author's idea is a system that detects potholes, tags each one with location coordinates, and sends the result to a website or to the concerned authorities for repair and maintenance. A camera is attached to the Kria KV260 module, which runs the pothole-detection ML model. From the camera's image/video input, the KV260 identifies potholes and transfers the relevant information to the website/authority.

Timely road repair reduces maintenance costs and prevents damage to vehicles. Potholes also make cars less efficient, because drivers are forced to shift to lower gears, which consumes more fuel. Potholes can damage tires, wheels/rims, suspension, and the exhaust.

Solution behind the scenes: The Kria KV260 is connected to a camera over USB/IAS and captures images/videos. The ML model is pre-trained on a custom pothole image data set and runs on the board. The board identifies potholes and uploads the images to the website together with relevant information such as the coordinates and the amount of damage to the road. The model is trained on Google Colab/AWS, which produces the coefficient values (weights), and is then run on the edge device, the KV260. The author used the opencv and pandas libraries and the TensorFlow framework for the ML model. The resulting images from the board, stored in the database, can later be used to retrain the model for better accuracy.
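The information uploaded for each detection can be pictured as a small JSON record. The sketch below is only an illustration: the field names Location, Potholes, and Time follow the body built in the attached PowerShell script, while the function name `build_report` and the ISO timestamp format are assumptions.

```python
import json
from datetime import datetime, timezone


def build_report(location, pothole_count, when=None):
    """Return the JSON body for one detection report.

    Field names mirror the attached PowerShell script (Location, Potholes, Time).
    build_report itself is a hypothetical helper, not part of the project code.
    """
    when = when or datetime.now(timezone.utc)
    return json.dumps({
        "Location": location,
        "Potholes": int(pothole_count),
        "Time": when.isoformat(),  # assumed timestamp format
    })


# Example with the coordinates hard-coded in the detection script
print(build_report("39.6167, 2.9833", 2))
```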

Block Diagram:

(1) Block Diagram

Procedure/Instructions:

Train ML Model: This step starts with identifying the libraries and framework to use. For the proposed idea the author chose pillow, lxml, Cython, contextlib2, jupyter, matplotlib, pandas, and opencv-python. Install these libraries on the machine, or list them in a script so that they are downloaded automatically on the platform.

Run pipreqs in the project folder to generate "requirements.txt", then install the packages it lists:

$ pipreqs                            # generate requirements.txt from the project's imports
$ pip install -r requirements.txt    # install every package listed in the file
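After installation it can help to confirm that the chosen libraries are actually importable on the board. The following is a minimal sketch, not part of the author's code: the helper name `missing_packages` is hypothetical, and jupyter is left out because it is a tool rather than something the scripts import.

```python
import importlib.util

# pip package name -> module name used in `import` statements
REQUIRED = {
    "pillow": "PIL",
    "lxml": "lxml",
    "Cython": "Cython",
    "contextlib2": "contextlib2",
    "matplotlib": "matplotlib",
    "pandas": "pandas",
    "opencv-python": "cv2",
}


def missing_packages(required=REQUIRED):
    """Return the pip names whose import module cannot be found."""
    return [pip_name for pip_name, module in required.items()
            if importlib.util.find_spec(module) is None]


if __name__ == "__main__":
    missing = missing_packages()
    print("Missing packages:", ", ".join(missing) if missing else "none")
```

Note that the pip name and the import name differ for pillow (`PIL`) and opencv-python (`cv2`), which is why the check uses a mapping rather than a plain list.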

Next comes writing the code for capturing images, training the model, and testing the model. These steps are widely known, so they are not detailed here. The author wrote the code and tested it with pre-captured images, as shown in figures (2) and (3); it works as expected.

Optimizing this model for hardware limitations, power, latency, etc. is discussed in the Research section of this submission.

(2) Test Inputs

(3) Test Outputs

Run ML Model as live-stream: The model has been written and tested with data that can be live, pre-captured, or custom. The next step is to run it as a live stream on the platform, so that the user does not have to invoke the Python script by hand every time. A typical invocation, which runs unbuffered and writes all output to a log file, is:

$ python -u myscript.py >myscript.py.log 2>&1

Another option would be to keep the camera permanently occupied by our code, but this is not advisable: on dedicated hardware the camera is handled through its port address and the code mapped to it, also called the firmware.
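A live-stream run amounts to a loop that keeps reading frames and runs detection on some of them. The sketch below shows one possible shape of that loop; `run_stream`, its parameters, and the choice to process only every Nth frame are assumptions for illustration, and the detector is passed in as a function so the loop itself does not depend on any particular model.

```python
def run_stream(read_frame, detect, on_detection, every_nth=5, max_frames=None):
    """Read frames until the source ends; run detection on every Nth frame.

    read_frame   -> returns (ok, frame), the same shape as cv2.VideoCapture.read
    detect       -> returns the number of potholes found in a frame
    on_detection -> called with (frame_index, count) when count > 0
    Returns the number of frames read.
    """
    index = 0
    while max_frames is None or index < max_frames:
        ok, frame = read_frame()
        if not ok:
            break  # camera closed or stream ended
        if index % every_nth == 0:
            count = detect(frame)
            if count > 0:
                on_detection(index, count)
        index += 1
    return index
```

With OpenCV, `read_frame` could be the `read` method of a `cv2.VideoCapture(0)` object, and `on_detection` could trigger the upload step described next.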

Script for data transfer to server: In the proposed idea, the system detects potholes and holds the location coordinates as pre-configuration. The remaining step is to upload this data to the cloud so that the authority/contractor can be informed and the pothole repaired.

For this task the author wrote a PowerShell script that receives the detected data as parameters and passes it on to the cloud. For example:

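# Pass each value as its own argument; on Linux the PowerShell executable is named pwsh
p = subprocess.run(['powershell.exe', '-File', './Alert.ps1', location, str(num_detections)], stdout=sys.stdout)
print("Alert response is " + str(p.returncode))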

Here location and num_detections are values coming from the Python code; they are passed to the PowerShell script, which uploads them to the server/cloud.

Some readers may have questions about PowerShell. It is a cross-platform shell and scripting language from Microsoft for automation and configuration management; on Linux it is installed separately and started as pwsh. The script needs no additional libraries. The file written by the author is included in the attachment section.
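The attached PowerShell script retries the upload a limited number of times (its MaxRetries and WaitBetweenRetries parameters). The same pattern can be sketched in Python; `send_with_retries` and its `send` callback are hypothetical names, and the actual HTTP call is left to the caller.

```python
import time


def send_with_retries(send, max_retries=3, wait_seconds=1.0, sleep=time.sleep):
    """Call send() until it returns HTTP status 200 or attempts run out.

    send must return an integer status code or raise an exception.
    Returns True on success, False otherwise.
    """
    for attempt in range(1, max_retries + 1):
        try:
            if send() == 200:
                return True
        except Exception as exc:  # a network error counts as a failed attempt
            print(f"Attempt {attempt} failed: {exc}")
        if attempt < max_retries:
            sleep(wait_seconds)  # pause before the next attempt
    return False
```

Passing `sleep` as a parameter keeps the function easy to test without real waiting.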

Create Linux Service for Running Project/Experiment: The last step is to run the whole system as a background service on Linux, so that closing the terminal or logging out does not stop the code running on the Kria platform. The author added this as an extra; the project also runs without it.

Save the following file as "run.sh":

#!/bin/sh
# run experiments one after another; add more lines for more tasks
python -u myscript1.py >myscript1.py.log 2>&1
python -u myscript2.py >myscript2.py.log 2>&1

Assuming your code files and scripts are stored in the experiments folder, the following command starts the custom service and keeps it running in the background:

$ nohup /home/samyak/experiments/run.sh > /home/samyak/experiments/run.sh.log </dev/null 2>&1 &
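nohup keeps the process alive after logout, but it does not restart a script that crashes. A small supervisor loop is one way to add that; the sketch below is an assumption about how this could be done, not part of the author's setup, and `keep_running` is a hypothetical name.

```python
import subprocess
import sys
import time


def keep_running(command, max_restarts=3, delay_seconds=1.0):
    """Run command, restarting it when it exits with a non-zero code.

    Returns the number of times the command was started.
    """
    starts = 0
    while starts <= max_restarts:
        starts += 1
        result = subprocess.run(command)
        if result.returncode == 0:
            break  # clean exit: nothing to restart
        if starts <= max_restarts:
            time.sleep(delay_seconds)  # brief pause before restarting
    return starts


if __name__ == "__main__":
    # myscript1.py is the placeholder script name used in run.sh
    keep_running([sys.executable, "-u", "myscript1.py"])
```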

This completes the proposed project. The author now presents the research behind it and some future improvements planned as add-ons after this competition.

Research: Choosing libraries and a framework can be confusing when the model has to run on embedded hardware, now commonly called an edge device. There are many architectures, techniques, and frameworks available for the same task.

In the author's experience, all the libraries have similar footprints and resource requirements in terms of space, so the focus should instead be on framework selection, where the constraints are large frameworks and their rich runtimes. Given these constraints, TF Lite/TF Lite Micro is the right choice.

For training the model we can use off-loaded data processing, pruning, and data curation to optimize at the code level, and CMSIS-NN for compiler/low-level optimizations. These techniques help reduce inference latency, power, and bandwidth.
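The text names pruning without saying which variant is meant. As an illustration only, the sketch below assumes magnitude pruning, where the weights with the smallest absolute values are set to zero; it works on a plain list of numbers rather than a real model, and `magnitude_prune` is a hypothetical helper.

```python
def magnitude_prune(weights, sparsity):
    """Return a copy of weights with the smallest-magnitude fraction set to 0.0.

    sparsity is the fraction of weights to zero, between 0.0 and 1.0.
    """
    if not 0.0 <= sparsity <= 1.0:
        raise ValueError("sparsity must be between 0.0 and 1.0")
    n_prune = int(len(weights) * sparsity)
    # Indices ordered from smallest to largest absolute value
    order = sorted(range(len(weights)), key=lambda i: abs(weights[i]))
    pruned = list(weights)
    for i in order[:n_prune]:
        pruned[i] = 0.0
    return pruned


print(magnitude_prune([0.8, -0.05, 0.3, 0.01, -0.6, 0.1], 0.5))
# prints [0.8, 0.0, 0.3, 0.0, -0.6, 0.0]
```

Zeroed weights can be stored and multiplied more cheaply, which is where the savings in memory and latency would come from.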

To overcome hardware limitations such as limited memory, processing power, and energy budget, we can work on the noise-to-signal ratio (NSR): irrelevant data causes network congestion and data storage problems without adding value.

Scope of Future Improvement: The author wants to continue this project by implementing a custom TF Lite Micro library for the KV260 and NSR techniques.
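One simple way to keep irrelevant data off the network is to upload only when at least one detection is confident enough. The sketch below assumes the 0.60 value that the attached detection script uses as its drawing threshold; the function names are hypothetical.

```python
def count_reportable(scores, min_score=0.60):
    """Count the detections whose confidence is at or above min_score."""
    return sum(1 for s in scores if s >= min_score)


def should_upload(scores, min_score=0.60):
    """Upload only when at least one detection passes the threshold."""
    return count_reportable(scores, min_score) > 0


print(count_reportable([0.91, 0.42, 0.60, 0.10]))  # prints 2
print(should_upload([0.20, 0.30]))                 # prints False
```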

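# Alert.ps1: posts the detected location and pothole count to the server
param
(
	[string] $Location,
	[int] $Potholes,
	[string] $Url = "http://10.1.2.172/WSVistaWebClient/RESTService.svc/member/",
	[int] $MaxRetries = 3,
	[int] $OneRetryTimeout = 5,
	[int] $OverallTimeout = 60,
	[int] $WaitBetweenRetries = 1
)

# Hashtable keys are plain names; the values come from the first two arguments
$body = @{
 Location = $Location
 Potholes = $Potholes
 Time = Get-Date
} | ConvertTo-Json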

$header = @{
 "Accept"="application/json"
 "connectapitoken"="97fe6ab5b1a640909551e36a071ce9ed"
 "Content-Type"="application/json"
} 

$Success = $false
$RetriesLeft = $MaxRetries
$TimeLeft = [timespan]::fromseconds($OverallTimeout)

do
{
    Write-Host "Pinging url - $Url"

    $StopWatch = [Diagnostics.Stopwatch]::StartNew()
    try
    {
        # Keep the raw response object so StatusCode and StatusDescription stay available
        $Response = Invoke-WebRequest $Url -UseBasicParsing -TimeoutSec $OneRetryTimeout -Method 'Post' -Body $body -Headers $header

        $StatusCode = $Response.StatusDescription
        $StatusCodeInt = $Response.StatusCode
    
        $StatusText = "Ping returned status $StatusCode ($StatusCodeInt)"
        
        $ExceptionOccurred = $false
    } 
    catch 
    {
        if ($_.Exception.Response.StatusCode)
        {
            $StatusCode = $_.Exception.Response.StatusCode
            $StatusCodeInt = $StatusCode.value__
    
            $StatusText = "Ping returned status $StatusCode ($StatusCodeInt)"
        }
        else
        {
            $StatusText = $_.Exception.Message
        }
        $ExceptionOccurred = $true
    }
    $StopWatch.Stop()


    Write-Host $StatusText
    Write-Host 'Time elapsed' $StopWatch.Elapsed
	$TimeLeft-=$StopWatch.Elapsed

    $Success = (($StatusCodeInt -eq 200) -and -not($ExceptionOccurred))
    $RetriesLeft--
	
	Start-Sleep $WaitBetweenRetries
	$TimeLeft-=[timespan]::fromseconds($WaitBetweenRetries)
}
while (($Success -eq $false) -and ($RetriesLeft -ge 0) -and ($TimeLeft -ge 0))

if ($Success) 
{ 
    Write-Host "Ping OK!" -ForegroundColor Green
}
else 
{
    throw "Ping failed: '$Url'"
}

# Import packages
import os
import cv2
import numpy as np
import tensorflow as tf
import sys
import subprocess

# This is needed since the notebook is stored in the object_detection folder.
sys.path.append("..")

# Import utilities
from utils import label_map_util
from utils import visualization_utils as vis_util

location = "39.6167, 2.9833"
# Name of the directory containing the object detection module we're using
MODEL_NAME = 'inference_graph'
# Grab a single frame from the first attached camera as a quick check
cap = cv2.VideoCapture(0)
# Check if the webcam is opened correctly
if not cap.isOpened():
    raise IOError("Cannot open webcam")
# Capture one frame and show it
ret, frame = cap.read()
if ret:
    cv2.imshow('WebCam', frame)
cap.release()
# File name of the test image (inside pothole_testImages) to run detection on
IMAGE_NAME = 'image.png'

# Grab path to current working directory
CWD_PATH = os.getcwd()

# Path to frozen detection graph .pb file, which contains the model that is used
# for object detection.
PATH_TO_CKPT = os.path.join(CWD_PATH,MODEL_NAME,'frozen_inference_graph.pb')

# Path to label map file
PATH_TO_LABELS = os.path.join(CWD_PATH,'training','labelmap.pbtxt')

# Path to image
PATH_TO_IMAGE = os.path.join(CWD_PATH,'pothole_testImages',IMAGE_NAME)

# Number of classes the object detector can identify
NUM_CLASSES = 4

# Load the label map.
# Label maps map indices to category names, so that when our convolution
# network predicts `1`, we know that this corresponds to `pothole`.
# Here we use internal utility functions, but anything that returns a
# dictionary mapping integers to appropriate string labels would be fine
label_map = label_map_util.load_labelmap(PATH_TO_LABELS)
categories = label_map_util.convert_label_map_to_categories(label_map, max_num_classes=NUM_CLASSES, use_display_name=True)
category_index = label_map_util.create_category_index(categories)

# Load the Tensorflow model into memory.
detection_graph = tf.Graph()
with detection_graph.as_default():
    od_graph_def = tf.GraphDef()
    with tf.gfile.GFile(PATH_TO_CKPT, 'rb') as fid:
        serialized_graph = fid.read()
        od_graph_def.ParseFromString(serialized_graph)
        tf.import_graph_def(od_graph_def, name='')

    sess = tf.Session(graph=detection_graph)

# Define input and output tensors (i.e. data) for the object detection classifier

# Input tensor is the image
image_tensor = detection_graph.get_tensor_by_name('image_tensor:0')

# Output tensors are the detection boxes, scores, and classes
# Each box represents a part of the image where a particular object was detected
detection_boxes = detection_graph.get_tensor_by_name('detection_boxes:0')

# Each score represents level of confidence for each of the objects.
# The score is shown on the result image, together with the class label.
detection_scores = detection_graph.get_tensor_by_name('detection_scores:0')
detection_classes = detection_graph.get_tensor_by_name('detection_classes:0')

# Number of objects detected
num_detections = detection_graph.get_tensor_by_name('num_detections:0')

# Load image using OpenCV and
# expand image dimensions to have shape: [1, None, None, 3]
# i.e. a single-column array, where each item in the column has the pixel RGB value
image = cv2.imread(PATH_TO_IMAGE)
image_expanded = np.expand_dims(image, axis=0)

# Perform the actual detection by running the model with the image as input
(boxes, scores, classes, num) = sess.run(
    [detection_boxes, detection_scores, detection_classes, num_detections],
    feed_dict={image_tensor: image_expanded})

# Draw the results of the detection (aka 'visualize the results')

vis_util.visualize_boxes_and_labels_on_image_array(
    image,
    np.squeeze(boxes),
    np.squeeze(classes).astype(np.int32),
    np.squeeze(scores),
    category_index,
    use_normalized_coordinates=True,
    line_thickness=8,
    min_score_thresh=0.60)

# All the results have been drawn on image. Now display the image.
cv2.imshow('Object detector', image)
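# num holds the detection count returned by sess.run; pass each value as its own argument
p = subprocess.run(['powershell.exe', '-File', './Alert.ps1', location, str(int(num[0]))], stdout=sys.stdout)
print("Alert response is " + str(p.returncode))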
# Press any key to close the image
cv2.waitKey(0)

# Clean up
cv2.destroyAllWindows()

import os
import io
import pandas as pd
import tensorflow as tf

from PIL import Image
from object_detection.utils import dataset_util
from collections import namedtuple, OrderedDict

flags = tf.app.flags
flags.DEFINE_string('csv_input', '', 'Path to the CSV input')
flags.DEFINE_string('image_dir', '', 'Path to the image directory')
flags.DEFINE_string('output_path', '', 'Path to output TFRecord')
FLAGS = flags.FLAGS


# TO-DO replace this with label map
def class_text_to_int(row_label):
    if row_label == 'pothole':
        return 1
    elif row_label == 'car':
        return 2
    elif row_label == 'potholeGroup':
        return 3
    elif row_label == 'auto':
        return 4
    else:
        return None


def split(df, group):
    data = namedtuple('data', ['filename', 'object'])
    gb = df.groupby(group)
    return [data(filename, gb.get_group(x)) for filename, x in zip(gb.groups.keys(), gb.groups)]


def create_tf_example(group, path):
    with tf.gfile.GFile(os.path.join(path, '{}'.format(group.filename)), 'rb') as fid:
        encoded_jpg = fid.read()
    encoded_jpg_io = io.BytesIO(encoded_jpg)
    image = Image.open(encoded_jpg_io)
    width, height = image.size

    filename = group.filename.encode('utf8')
    image_format = b'jpg'
    xmins = []
    xmaxs = []
    ymins = []
    ymaxs = []
    classes_text = []
    classes = []

    for index, row in group.object.iterrows():
        xmins.append(row['xmin'] / width)
        xmaxs.append(row['xmax'] / width)
        ymins.append(row['ymin'] / height)
        ymaxs.append(row['ymax'] / height)
        classes_text.append(row['class'].encode('utf8'))
        classes.append(class_text_to_int(row['class']))

    tf_example = tf.train.Example(features=tf.train.Features(feature={
        'image/height': dataset_util.int64_feature(height),
        'image/width': dataset_util.int64_feature(width),
        'image/filename': dataset_util.bytes_feature(filename),
        'image/source_id': dataset_util.bytes_feature(filename),
        'image/encoded': dataset_util.bytes_feature(encoded_jpg),
        'image/format': dataset_util.bytes_feature(image_format),
        'image/object/bbox/xmin': dataset_util.float_list_feature(xmins),
        'image/object/bbox/xmax': dataset_util.float_list_feature(xmaxs),
        'image/object/bbox/ymin': dataset_util.float_list_feature(ymins),
        'image/object/bbox/ymax': dataset_util.float_list_feature(ymaxs),
        'image/object/class/text': dataset_util.bytes_list_feature(classes_text),
        'image/object/class/label': dataset_util.int64_list_feature(classes),
    }))
    return tf_example


def main(_):
    writer = tf.python_io.TFRecordWriter(FLAGS.output_path)
    path = os.path.join(os.getcwd(), FLAGS.image_dir)
    examples = pd.read_csv(FLAGS.csv_input)
    grouped = split(examples, 'filename')
    for group in grouped:
        tf_example = create_tf_example(group, path)
        writer.write(tf_example.SerializeToString())

    writer.close()
    output_path = os.path.join(os.getcwd(), FLAGS.output_path)
    print('Successfully created the TFRecords: {}'.format(output_path))


if __name__ == '__main__':
    tf.app.run()
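The TO-DO above `class_text_to_int` asks for the if/elif chain to be replaced with a label map. A minimal sketch of that idea, using the four labels from the script, could look like this; raising an error for unknown labels is a design choice made here, and `label_to_int` is a hypothetical name.

```python
# Class labels and ids as used in class_text_to_int
LABEL_TO_ID = {"pothole": 1, "car": 2, "potholeGroup": 3, "auto": 4}


def label_to_int(row_label):
    """Return the integer id for a class label, or raise for unknown labels."""
    try:
        return LABEL_TO_ID[row_label]
    except KeyError:
        raise ValueError(f"Unknown class label: {row_label!r}") from None


print(label_to_int("pothole"))  # prints 1
```

Failing loudly on an unknown label avoids writing a `None` class id into the TFRecord.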

import os
import glob
import pandas as pd
import xml.etree.ElementTree as ET


def xml_to_csv(path):
    xml_list = []
    for xml_file in glob.glob(path + '/*.xml'):
        tree = ET.parse(xml_file)
        root = tree.getroot()
        for member in root.findall('object'):
            value = (root.find('filename').text,
                     int(root.find('size')[0].text),
                     int(root.find('size')[1].text),
                     member[0].text,
                     int(member[4][0].text),
                     int(member[4][1].text),
                     int(member[4][2].text),
                     int(member[4][3].text)
                     )
            xml_list.append(value)
    column_name = ['filename', 'width', 'height', 'class', 'xmin', 'ymin', 'xmax', 'ymax']
    xml_df = pd.DataFrame(xml_list, columns=column_name)
    return xml_df


def main():
    for folder in ['train','test']:
        image_path = os.path.join(os.getcwd(), ('Pothole_Images/' + folder))
        xml_df = xml_to_csv(image_path)
        xml_df.to_csv(('images/' + folder + '_labels.csv'), index=None)
        print('Successfully converted xml to csv.')


if __name__ == '__main__':
    main()
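`xml_to_csv` reads the annotation fields by position (for example `member[4][0]`), which breaks if the element order changes. A sketch of the same extraction by tag name follows; it assumes Pascal VOC style tags (`filename`, `size/width`, `size/height`, `object/name`, `bndbox/xmin` and so on), and `parse_voc_objects` and the sample annotation are hypothetical.

```python
import xml.etree.ElementTree as ET

# Made-up annotation in the assumed Pascal VOC layout
SAMPLE_XML = """
<annotation>
  <filename>road1.jpg</filename>
  <size><width>640</width><height>480</height><depth>3</depth></size>
  <object>
    <name>pothole</name>
    <pose>Unspecified</pose>
    <truncated>0</truncated>
    <difficult>0</difficult>
    <bndbox><xmin>10</xmin><ymin>20</ymin><xmax>110</xmax><ymax>220</ymax></bndbox>
  </object>
</annotation>
""".strip()


def parse_voc_objects(xml_text):
    """Return one row per object, in the column order used by xml_to_csv."""
    root = ET.fromstring(xml_text)
    filename = root.findtext("filename")
    width = int(root.findtext("size/width"))
    height = int(root.findtext("size/height"))
    rows = []
    for obj in root.findall("object"):
        box = obj.find("bndbox")
        rows.append((filename, width, height, obj.findtext("name"),
                     int(box.findtext("xmin")), int(box.findtext("ymin")),
                     int(box.findtext("xmax")), int(box.findtext("ymax"))))
    return rows


print(parse_voc_objects(SAMPLE_XML))
# prints [('road1.jpg', 640, 480, 'pothole', 10, 20, 110, 220)]
```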