Before we start, let me list a few concepts (if anything here is wrong, please point it out in the comments):
Let us first consider a scenario: suppose we use yolov2 to detect cats in a dataset. The dataset is labelled, i.e. it has ground truth. Normally a prediction contains the cat's confidence and its location, and each instance is classified as positive or negative [this decision is based on IOU alone, not on confidence; confidence is used when computing AP, not for deciding whether an instance is a TP or something else]. See Figure 1 below.
- Take the TP in the top-left corner as an example: the "P" is the classifier's predicted label, in this case "positive". We then compare that prediction against the ground-truth label; if the comparison says the prediction is correct, the prefix is "T" (true). If the comparison says the prediction is wrong, the prefix is "F" (false), and the sample becomes an FP. TN and FN follow by analogy.
- Another way to put it:
- TP: a detected sample that is correct; its IOU with a ground-truth box is above the threshold
- FP: a detected sample that is wrong; its IOU is below the threshold
- FN: a ground-truth instance that was not detected
- TN: a negative that is correctly not detected, i.e. background the detector rightly ignores; such samples are not counted in the AP computation. Consider an image where you detect 3 cats (all genuinely cats) but only 2 were labelled: the extra cat has no ground-truth box to match, so under standard VOC evaluation it is actually scored as an FP rather than a TN.
PS: the required threshold differs between datasets. VOC2007 uses an IOU threshold of 50%, while COCO averages over IOU thresholds from 0.50 to 0.95 in steps of 0.05.
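Deciding TP versus FP therefore comes down to comparing IOU against the threshold. A minimal sketch of that check (the corner-coordinate box format and the helper names are illustrative assumptions, not from any particular framework):

```python
def iou(box_a, box_b):
    """IOU of two boxes given as (x1, y1, x2, y2) corner coordinates."""
    # Intersection rectangle
    ix1, iy1 = max(box_a[0], box_b[0]), max(box_a[1], box_b[1])
    ix2, iy2 = min(box_a[2], box_b[2]), min(box_a[3], box_b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0

def is_tp(det_box, gt_box, threshold=0.5):
    """VOC2007-style decision: the detection is a TP if IOU >= threshold, else an FP."""
    return iou(det_box, gt_box) >= threshold
```

For example, two 10x10 boxes overlapping by half horizontally have IOU 50/150 = 1/3, which falls below the 0.5 threshold, so that detection would be scored as an FP.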
This post mainly follows the GitHub project Object-Detection-Metrics; if you want a deeper understanding of the terms above, go read it yourself.
Precision:
TP/(TP+FP) --------------> TP/(all detected instances)
Recall:
TP/(TP+FN) --------------> TP/(all positive instances in the ground truth)
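Since AP is built from these two quantities, it helps to see how the running precision/recall pairs are produced: sort the detections by confidence, mark each as TP or FP, and accumulate. A small sketch (the `(confidence, is_tp)` input format is my own assumption):

```python
def pr_points(detections, num_gt):
    """detections: list of (confidence, is_tp) pairs; num_gt: total ground-truth boxes.
    Returns the (precision, recall) pair after each detection, in confidence order."""
    tp = fp = 0
    points = []
    for _, hit in sorted(detections, key=lambda d: d[0], reverse=True):
        if hit:
            tp += 1
        else:
            fp += 1
        points.append((tp / (tp + fp), tp / num_gt))
    return points
```

Note that recall can only stay flat or rise as we move down the ranked list, while precision can jump up and down; this is what gives the PR curve its sawtooth shape.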
AP:
Here I mainly explain the parts of the above GitHub project that are a bit hard to follow:
- The project describes two ways of computing AP on the VOC datasets: 11-point interpolation and every-point (all-point) interpolation; from 2010 onwards the every-point method is used. The 11-point method is the confusing one, so let me walk through it:
Suppose we run detection on 5 images containing 15 ground-truth instances, and the detector returns 24 instances. The table below lists the 24 detections.
[Table: the 24 detections sorted by confidence]
For each interpolation point on the PR curve, the value is the maximum precision over all recall values greater than or equal to that point. For example, at the 0.0 point the precision is 1.0; at the 0.1 point it is 0.666; at the 0.2 point it is 0.4285 (the 0.3 and 0.4 points also take the value 0.4285). From the 0.5 point onwards the precision is 0. So the AP is:
AP = (1/11) x (1 + 0.666 + 0.4285 + 0.4285 + 0.4285 + 0 + 0 + 0 + 0 + 0 + 0) ≈ 26.84%
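The 11-point interpolation rule can be written down directly: for each of the 11 recall points, take the maximum precision among all measured samples whose recall is at least that value, then average. A sketch (it assumes `recalls` and `precisions` are parallel lists of measured points, e.g. from a running precision/recall computation):

```python
def eleven_point_ap(recalls, precisions):
    """11-point interpolated AP: average, over recall points 0.0, 0.1, ..., 1.0,
    of the maximum precision among samples whose recall >= that point."""
    ap = 0.0
    for t in (i / 10 for i in range(11)):
        # Interpolated precision at recall point t
        p_interp = max((p for r, p in zip(recalls, precisions) if r >= t),
                       default=0.0)
        ap += p_interp / 11
    return ap
```

The `default=0.0` handles recall points the detector never reaches, which is exactly why the 0.5 point and beyond contribute 0 in the example above.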
mAP:
All the APs computed above are for a single class. What if the detection task has to detect not only cats but also dogs, chickens, and ducks? That is where mAP comes in: sum the AP of every class and divide by the number of classes (the 4 in the original formula is the number of classes):
mAP = (AP_cat + AP_dog + AP_chicken + AP_duck) / 4
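As a sketch, with hypothetical per-class AP values (the numbers below are made up for illustration):

```python
def mean_average_precision(ap_per_class):
    """mAP: the mean of the per-class AP values."""
    return sum(ap_per_class.values()) / len(ap_per_class)

# Hypothetical AP values for the four classes in the example above
aps = {'cat': 0.80, 'dog': 0.60, 'chicken': 0.40, 'duck': 0.20}
print(mean_average_precision(aps))
```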
Project practice (computing mAP and the PR curve)
The code in this post is based on the GitHub project above, with some modifications for the yolov3 network.
- First run the detector on the test set; yolov3 writes the detection results into a txt file. See the previous post for the exact commands.
- The evaluation code is:
###########################################################################################
# #
# This sample shows how to evaluate object detections applying the following metrics: #
# * Precision x Recall curve ----> used by VOC PASCAL 2012 #
# * Average Precision (AP) ----> used by VOC PASCAL 2012 #
# #
# Developed by: Rafael Padilla (rafael.padilla@smt.ufrj.br) #
# SMT - Signal Multimedia and Telecommunications Lab #
# COPPE - Universidade Federal do Rio de Janeiro #
# Last modification: May 24th 2018 #
###########################################################################################
import _init_paths
from BoundingBox import BoundingBox
from BoundingBoxes import BoundingBoxes
from Evaluator import *
from utils import *
dt_path='/home/longmao/workspace/compute MAP/Object-Detection-Metrics/' \
'samples/yolov3_compute_mAP/carplate.txt'
gt_path='/home/longmao/darknet/VOCdevkit/VOC2007/ImageSets/Main/test.txt'
def getBoundingBoxes(dt_path, gt_path):
    """Read txt files containing bounding boxes (ground truth and detections)."""
    import os
    # Class representing all bounding boxes (ground truths and detections)
    allBoundingBoxes = BoundingBoxes()
    # Read the ground truths from the YOLO label files.
    # Each line of a label file is "class_id x y width height", where (x, y) is
    # the box centre and (width, height) the box size, all relative to the
    # image dimensions.
    label_path = '/home/longmao/darknet/VOCdevkit/VOC2007/labels'
    with open(gt_path, 'r') as file_para:
        files = file_para.readlines()
    for f in files:
        f = f.strip()
        # The class name is taken from the detection file name
        # (a single-class setup: here "carplate")
        idClass = os.path.splitext(os.path.basename(dt_path))[0]
        nameOfImage = f
        with open(os.path.join(label_path, f) + '.txt', 'r') as a:
            b = a.readlines()
        for c in b:
            splitLine = c.strip().split()
            x = float(splitLine[1])  # relative centre x
            y = float(splitLine[2])  # relative centre y
            w = float(splitLine[3])  # relative width
            h = float(splitLine[4])  # relative height
            bb = BoundingBox(
                nameOfImage,
                idClass,
                x,
                y,
                w,
                h,
                CoordinatesType.Relative,
                imgSize=(1920, 1080),
                bbType=BBType.GroundTruth,
                format=BBFormat.XYWH)
            allBoundingBoxes.addBoundingBox(bb)
    # Read the detections from the single txt file written by yolov3.
    # Each line is "image_name confidence x1 y1 x2 y2", where (x1, y1) is the
    # top-left and (x2, y2) the bottom-right corner in absolute pixel
    # coordinates, and confidence (from 0 to 1) is the detection score.
    with open(dt_path, 'r') as files_para:
        files = files_para.readlines()
    idClass = os.path.splitext(os.path.basename(dt_path))[0]
    for f in files:
        f = f.strip()
        splitLine = f.split(' ')
        nameOfImage = splitLine[0]        # image name
        confidence = float(splitLine[1])  # detection confidence
        x = float(splitLine[2])  # top-left x
        y = float(splitLine[3])  # top-left y
        w = float(splitLine[4])  # bottom-right x (XYX2Y2 format)
        h = float(splitLine[5])  # bottom-right y (XYX2Y2 format)
        bb = BoundingBox(
            nameOfImage,
            idClass,
            x,
            y,
            w,
            h,
            CoordinatesType.Absolute, (1920, 1080),
            BBType.Detected,
            confidence,
            format=BBFormat.XYX2Y2)
        allBoundingBoxes.addBoundingBox(bb)
    return allBoundingBoxes
# getBoundingBoxes(dt_path,gt_path=gt_path)
def createImages(dictGroundTruth, dictDetected):
    """Create representative images with bounding boxes."""
    import numpy as np
    import cv2
    # Define image size
    width = 200
    height = 200
    # Loop through the dictionary with ground-truth detections
    for key in dictGroundTruth:
        image = np.zeros((height, width, 3), np.uint8)
        gt_boundingboxes = dictGroundTruth[key]
        image = gt_boundingboxes.drawAllBoundingBoxes(image)
        detection_boundingboxes = dictDetected[key]
        image = detection_boundingboxes.drawAllBoundingBoxes(image)
        # Show detections together with their ground truth
        cv2.imshow(key, image)
        cv2.waitKey()
# Read txt files containing bounding boxes (ground truth and detections)
boundingboxes = getBoundingBoxes(dt_path,gt_path)
# Uncomment the line below to generate images based on the bounding boxes
# createImages(dictGroundTruth, dictDetected)
# Create an evaluator object in order to obtain the metrics
evaluator = Evaluator()
##############################################################
# VOC PASCAL Metrics
##############################################################
# Plot Precision x Recall curve
evaluator.PlotPrecisionRecallCurve(
    boundingboxes,  # Object containing all bounding boxes (ground truths and detections)
    IOUThreshold=0.3,  # IOU threshold
    method=MethodAveragePrecision.EveryPointInterpolation,  # As the official matlab code
    showAP=True,  # Show Average Precision in the title of the plot
    showInterpolatedPrecision=True)  # Plot the interpolated precision curve
# Get metrics with PASCAL VOC metrics
metricsPerClass = evaluator.GetPascalVOCMetrics(
    boundingboxes,  # Object containing all bounding boxes (ground truths and detections)
    IOUThreshold=0.3,  # IOU threshold
    method=MethodAveragePrecision.EveryPointInterpolation)  # As the official matlab code
print("Average precision values per class:\n")
# Loop through classes to obtain their metrics
for mc in metricsPerClass:
    # Get the metric values for each class
    c = mc['class']
    precision = mc['precision']
    recall = mc['recall']
    average_precision = mc['AP']
    ipre = mc['interpolated precision']
    irec = mc['interpolated recall']
    # Print AP per class
    print('%s: %f' % (c, average_precision))
- Different networks may emit test results differently; some produce one txt file per image, but this post assumes all detection results go into a single txt file. One situation to watch for: if an image yields no detections at all, simply skip it and write nothing for it into the txt file.