When I read the paper3,I get the?contention of the features descriptor?of??bag of visual words of the recognition framework.Then I attempt to know its theory.
Here,I'd like to put the blog which I think is great to give the informations primary-learners want to know.http://blog.csdn.net/wsj998689aa/article/details/47089153
Notes:Bag-of words是SIFT算法在目標(biāo)識(shí)別方面的應(yīng)用
對(duì)于圖像處理而言深纲,關(guān)鍵在于找出“視覺詞匯”構(gòu)建出圖片的檢索字典撼短,然后對(duì)圖片進(jìn)行編碼朽肥。雖然同類圖片不同實(shí)例之間存在差異产喉,但其局部的一些特征時(shí)基本相似的,故由此可以利用SIFT算法提取圖像中局部不變特征來構(gòu)建圖像的視覺詞典彭雾,然后對(duì)圖像進(jìn)行編碼沪铭。其具體步驟如下:
于是便可用一個(gè)相對(duì)較少維度的數(shù)值向量來描述一幅圖像,相比于用SIFT來描述一幅圖像(每個(gè)SIFT矢量為128維峦朗,且每幅圖像通常包含成百上千個(gè)SIFT矢量),用Bag-of-words來描述使得在進(jìn)行圖像間相似度計(jì)算時(shí)效率能大大提高排龄。然后將用bag of words表示的圖片用于進(jìn)行分類器的訓(xùn)練波势。
博文最后還說明了如何實(shí)現(xiàn)BOW來表示一幅圖像,稍晚點(diǎn)試試~