肿瘤康复网 > 深度学习物体检测实战算法2——YOLO与SSD

深度学习物体检测实战算法2——YOLO与SSD

时间：2020-10-10 14:03:19

相关推荐

YOLO原理介绍

每个网格预测两个bbox框

生成两个bbox

4+1：4指的是一个bbox框的四个位置，即xmin,ymin,xmax,ymax;1指的是1个置信度。

要输出两个bbox的框，所以是(4+1+4+1)+20 20:代表20个类别的预测概率结果。

准确率小的原因：一个bbox只能预测一个物体，当两个物体离得比较近时，只有一个物体能被预测。对小物体的检测不是很友好。适用于大物体。

SSD原理介绍

SSD目标检测

缺少网络和网络参数、图像

from nets.ssd_net import SSD300from keras.preprocessing.image import load_img,img_to_arrayfrom keras.applications.imagenet_utils import preprocess_inputimport osfrom scipy.misc import imreadfrom utils.ssd_utils import BBoxUtilityimport numpy as npimport matplotlib.pyplot as plt"""定义号类别数量以及输出模型预测流程SSD300模型输入以及加载参数读取多个本地路径测试图片，preprocess_input以及保存图像像素值(显示需要)模型预测结果，得到7308个priorbox进行非最大抑制算法处理图片的预测结果显示"""class SSDTest(object):def __init__(self):#定义识别类别self.classes_name = ['Aeroplane','Bicycle','Bird','Boat'.'Bottle','Bus','Car','Cat','Chair','Cow','Diningtable','Dog','Horse','Motorbike','Person','Pottedplant','Sheep','Sofa','Train','Tvmonitor']#定义模型的输入参数 +1 背景self.classes_nums = len(self.classes_name)+1self.input_shape = (300,300,3)def test(self):"""对于输入图片进行预测物体位置:return:"""#SSD300模型输入以及加载参数model = SSD300(self.input_shape,num_classes=self.classes_nums)model.load_weights("./ckpt/weights_SSD300.hdf5",by_name=True)feature = []images_data = []#读取多个本地路径测试图片，preprocess_input以及保存图像像素值(显示需要)for path in os.listdir("./images"):img_path = os.path.join("./images/",path)#1.输入到SSD网络当中，数组image = load_img(img_path,target_size=(self.input_shape[0],self.input_shape[1]))image = img_to_array(image)#2.读取图片二进制数据，matplotlib显示使用images_data.append(imread(img_path))#模型预测结果，得到7308个priorbox#处理inputs = preprocess_input(np.asarray(feature))print(inputs)pred = model.predict(inputs)#(2,7308,33) 2代表图片数量，7308代表每个图片预测的default boxer数量，33：4(位置)+21(预测概率)+8(defaule boxes参数)print(pred.shape)#进行非最大抑制算法处理NMS 21类别bb = BBoxUtility(self.classes_nums)res = bb.detestion_out(pred)#(200,6) (200,6)print(res[0].shape,res[1].shape)#200个候选框，每个候选框位置，类别return res,images_datadef tag_picture(self,images_data,outputs):"""显示预测结果到图片中:param images_data::param outputs::return:"""#1.获取每张图片的预测结果中的值for i,img in enumerate(images_data):# 获取res当中对应的结果label,location,xmin,ymin,xmax,ymaxpre_label = outputs[i][:,0]pre_conf = outputs[i][:,1]pre_xmin = outputs[i][:,2]pre_ymin = outputs[i][:,3]pre_xmax = outputs[i][:,4]pre_ymax = outputs[i][:,5]# print("pre_label:{},pre_conf:{},pre_xmin:{},pre_ymin:{},pre_xmax:{},pre_ymax:{}".# format(pre_label,pre_conf,pre_xmin,pre_ymin,pre_xmax,pre_ymax))#由于检测出的物体还是很多，所以进行显示过滤（90%）top_indices = [i for i,conf in enumerate(pre_conf) if conf>0.6]top_conf = pre_conf[top_indices]top_label_indices = pre_label[top_indices].tolist()top_xmin = pre_xmin[top_indices]top_ymin = pre_ymin[top_indices]top_xmax = pre_xmax[top_indices]top_ymax = pre_ymax[top_indices]print("top_label_indices :{},top_conf:{},top_xmin:{},top_ymin:{},top_xmax:{},top_ymax:{}".format(top_label_indices,top_conf,top_xmin,top_ymin,top_xmax,top_ymax))#matplotlib画图显示结果#定义21种颜色，显示图像#currentAxis增加图中文本显示和标记显示colors = plt.cm.hsv(np.linspace(0,1,21)).tolist()plt.imshow(img/255)currentAxis = plt.gca()for i in range(top_conf.shape[0]):xmin = int(round(top_xmin[i] * img.shape[1]))ymin = int(round(top_ymin[i] * img.shape[0]))xmax = int(round(top_xmax[i] * img.shape[1]))ymax = int(round(top_ymax[i] * img.shape[0]))#获取该图片预测概率，名称，定义显示颜色score = top_conf[i]label = int(top_label_indices[i])label_name = self.classes_name[label-1]display_txt = '{:0.2f},{}'.format(score,label_name)coords = (xmin,ymin),xmax-xmin+1,ymax-ymin+1color = colors[label]#显示方框currentAxis.add_patch(plt.Rectangle(*coords,fill=False,edgecolor=color,linewidth=2))#左上角显示概率以及名称currentAxis.text(xmin,ymin,display_txt,bbox={'facecolor':color,'alpha':0.5})plt.show()return Noneif __name__=='__main__':ssd = SSDTest()outputs,images_data = ssd.test()#显示图片ssd.tag_picture(images_data,outputs)

如果觉得《深度学习物体检测实战算法2——YOLO与SSD》对你有帮助，请点赞、收藏，并留下你的观点哦！

本内容不代表本网观点和政治立场，如有侵犯你的权益请联系我们处理。

网友评论

网友评论仅供其表达个人看法，并不表明网站立场。