9.8 利用視覺碼本和向量量化創(chuàng)建特征

為了創(chuàng)建一個目標(biāo)識別系統(tǒng)轨蛤，需要從每張圖像中提取特征向量。每張圖像需要有一個識別標(biāo)志豹储，以用于匹配。
我們用視覺碼本的概念來創(chuàng)建圖像識別標(biāo)志宰缤。在訓(xùn)練數(shù)據(jù)集中，碼本實(shí)際上就是一個字典晃洒，用于提出關(guān)于圖像的描述慨灭，我們用向量量化方法將很多特征點(diǎn)進(jìn)行聚類并得出中心點(diǎn)，這些中心點(diǎn)將作為視覺碼本的元素球及。

訓(xùn)練數(shù)據(jù)集

包含3類實(shí)例訓(xùn)練數(shù)據(jù)集氧骤，每一類包含20幅圖像，可以在http://www.vision.caltech.edu/html-files/archive.html 下載吃引。

處理加載數(shù)據(jù)集：

def load_training_data(input_folder):
    training_data = []   # 以list的形式 存儲數(shù)據(jù)集中的圖片信息
    if not os.path.isdir(input_folder):
        raise IOError("The folder " + input_folder + " doesn't exist")
    for root, dirs, files in os.walk(input_folder):
        for filename in (x for x in files if x.endswith('.jpg')):
            filepath = os.path.join(root, filename)   
            # filepath 輸出為 'training_images/airplanes\\0001.jpg'
            filepath = filepath.replace('\\','/')   
            # 替換字符\\ 以方便處理 提取label  此時(shí)filepath 輸出為：'training_images/airplanes/0001.jpg'
            object_class = filepath.split('/')[-2]  
            # 此時(shí) object_class  為：airplanes
            #  將每幅圖像的信息以字典的形式保存在  training_data
            training_data.append({'object_class': object_class,
                                  'image_path': filepath})  
    return training_data

提取圖片的特征：

class FeatureBuilder(object):
    '''
    定義一個從輸入圖像提取特征的方法筹陵，
    用star檢測器獲取關(guān)鍵點(diǎn)刽锤，然后用SIFT提取這些位置的描述信息
    '''
    
    # 提取圖片的特征
    def extract_features(self, img):
        #用Start獲取關(guān)鍵點(diǎn)，
        keypoints = StarFeatureDetector().detect(img)
        # 用SIFT提取關(guān)鍵點(diǎn)的位置信息朦佩，keypoint是list類型并思。
        keypoints, feature_vectors = compute_sift_features(img, keypoints)
        #  feature_vectors 是numpy.ndarray類型
        return feature_vectors

    def get_codewords(self, input_map, scaling_size, max_samples=12):
        #max_samples:定義每類樣本數(shù)據(jù)的最大樣本數(shù)：如果大于最大樣本數(shù)則后面相同樣本的數(shù)據(jù)就跳過
        #input_map是所有樣本數(shù)據(jù)的label和位置路徑信息即訓(xùn)練數(shù)據(jù)，list類型
        keypoints_all = []
        #用 keypoints_all 存儲所有圖片的關(guān)鍵點(diǎn)特征信息
        count = 0
        cur_class = ''
        for item in input_map:
            # item是樣本的 信息 
            #例如：{'image_path': 'training_images/airplanes/0001.jpg', 'object_class': 'airplanes'}
            # 如果大于樣本數(shù)則跳過此樣本  即： continue
            if count >= max_samples:
                if cur_class != item['object_class']:
                    count = 0
                else:
                    continue
            count += 1
            if count == max_samples:
                print("Built centroids for", item['object_class'])
            # cur_class  記錄當(dāng)前樣本的lebel, 然后讀取圖像
            cur_class = item['object_class']
            img = cv2.imread(item['image_path'])
            img = resize_image(img, scaling_size)

            num_dims = 128
            # 獲取樣本圖像的  keypoint 關(guān)鍵點(diǎn)信息
            feature_vectors = self.extract_features(img)
            #  將keypoint 關(guān)鍵點(diǎn)信息  存儲在 keypoints_all中
            keypoints_all.extend(feature_vectors)
        #對 keypoints_all 進(jìn)行聚類
        kmeans, centroids = BagOfWords().cluster(keypoints_all)
        return kmeans, centroids

定義一個類來處理詞袋模型和向量量化

class BagOfWords(object):
    def __init__(self, num_clusters=32):
        self.num_dims = 128
        self.num_clusters = num_clusters
        self.num_retries = 10
    
    # 用kmeans聚類來實(shí)現(xiàn)量化數(shù)據(jù)點(diǎn)
    def cluster(self, datapoints):
        kmeans = KMeans(self.num_clusters,
                        n_init=max(self.num_retries, 1),
                        max_iter=10, tol=1.0)
        #提取中心點(diǎn)
        res = kmeans.fit(datapoints)
        centroids = res.cluster_centers_
        return kmeans, centroids
    
    #  歸一化數(shù)據(jù)
    def normalize(self, input_data):
        sum_input = np.sum(input_data)

        if sum_input > 0:
            return input_data / sum_input
        else:
            return input_data
    
    # 獲得圖像的特征向量
    def construct_feature(self, img, kmeans, centroids):
        #獲取圖像的keypoints和位置信息
        keypoints = StarFeatureDetector().detect(img)
        keypoints, feature_vectors = compute_sift_features(img, keypoints)
        # 用kmeans預(yù)測一幅圖片的label
        labels = kmeans.predict(feature_vectors)
        feature_vector = np.zeros(self.num_clusters)
        # 創(chuàng)建直方圖將其歸一化
        for i, item in enumerate(feature_vectors):
            feature_vector[labels[i]] += 1
        
        feature_vector_img = np.reshape(feature_vector,
                                        ((1, feature_vector.shape[0])))
        return self.normalize(feature_vector_img)

輸入圖像提取特征然后映射到某一類

def compute_sift_features(img, keypoints):
    if img is None:
        raise TypeError('Invalid input image')

    img_gray = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)
    keypoints, descriptors = cv2.xfeatures2d.SIFT_create().compute(img_gray, keypoints)
    return keypoints, descriptors

定義一個

def get_feature_map(input_map, kmeans, centroids, scaling_size):
    feature_map = []
    for item in input_map:
        temp_dict = {}
        temp_dict['object_class'] = item['object_class']

        print("Extracting features for", item['image_path'])

        img = cv2.imread(item['image_path'])
        img = resize_image(img, scaling_size)

        temp_dict['feature_vector'] = BagOfWords().construct_feature(
            img, kmeans, centroids)

        if temp_dict['feature_vector'] is not None:
            feature_map.append(temp_dict)

    return feature_map

resize_image

def resize_image(input_img, new_size):
    h, w = input_img.shape[:2]
    scaling_factor = new_size / float(h)

    if w < h:
        scaling_factor = new_size / float(w)

    new_shape = (int(w * scaling_factor), int(h * scaling_factor))
    return cv2.resize(input_img, new_shape)

Star檢測器

class StarFeatureDetector(object):
    def __init__(self):
        self.detector = cv2.xfeatures2d.StarDetector_create()

    def detect(self, img):
        return self.detector.detect(img)

主文件import

# -*- coding:utf8 -*-
import os
import sys
import argparse
# import cPickle as pickle
import pickle as pickle
import json
import cv2
import numpy as np
from sklearn.cluster import KMeans

在pycharm里編輯輸入信息方便調(diào)試

if __name__ == '__main__':
    data_folder = 'training_images/'
    scaling_size = 200
    codebook_file = 'codebook/9_8.pkl'
    feature_map_file = 'feature_map/9_8.pkl'

    training_data = load_training_data(data_folder)

    # Build the visual codebook
    print("====== Building visual codebook ======")
    kmeans, centroids = FeatureBuilder().get_codewords(training_data, scaling_size)
    if codebook_file:
        with open(codebook_file, 'wb+') as f:
            pickle.dump((kmeans, centroids), f)

    # Extract features from input images
    print("\n====== Building the feature map ======")

    feature_map = get_feature_map(training_data, kmeans, centroids, scaling_size)
    if feature_map_file:
        with open(feature_map_file, 'wb+') as f:
            pickle.dump(feature_map, f)

命令行方式運(yùn)行文件

#  定義命令行輸入方式
def build_arg_parser():
    parser = argparse.ArgumentParser(description='Extract features from a given \
            set of images')

    parser.add_argument("--data-folder", dest="data_folder", required=True,
                        help="Folder containing the training images organized in subfolders")
    parser.add_argument("--codebook-file", dest='codebook_file', required=True,
                        help="Output file where the codebook will be stored")
    parser.add_argument("--feature-map-file", dest='feature_map_file', required=True,
                        help="Output file where the feature map will be stored")
    parser.add_argument("--scaling-size", dest="scaling_size", type=int,
                        default=200, help="Scales the longer dimension of the image down \
                    to this size.")

    return parser


if __name__ == '__main__':
    args = build_arg_parser().parse_args()
    data_folder = args.data_folder
    scaling_size = args.scaling_size

    # Load the training data
    training_data = load_training_data(data_folder)

    # Build the visual codebook
    print("====== Building visual codebook ======")

    kmeans, centroids = FeatureBuilder().get_codewords(training_data, scaling_size)
    if args.codebook_file:
        with open(args.codebook_file, 'wb+') as f:
            pickle.dump((kmeans, centroids), f)

    # Extract features from input images
    print("\n====== Building the feature map ======")

    feature_map = get_feature_map(training_data, kmeans, centroids, scaling_size)
    if args.feature_map_file:
        with open(args.feature_map_file, 'wb+') as f:
            pickle.dump(feature_map, f)

?著作權(quán)歸作者所有,轉(zhuǎn)載或內(nèi)容合作請聯(lián)系作者

人面猴
序言：七十年代末语稠，一起剝皮案震驚了整個濱河市宋彼，隨后出現(xiàn)的幾起案子，更是在濱河造成了極大的恐慌仙畦，老刑警劉巖输涕，帶你破解...
沈念sama閱讀 221,635評論 6贊 515
死咒
序言：濱河連續(xù)發(fā)生了三起死亡事件，死亡現(xiàn)場離奇詭異慨畸，居然都是意外死亡莱坎，警方通過查閱死者的電腦和手機(jī)，發(fā)現(xiàn)死者居然都...
沈念sama閱讀 94,543評論 3贊 399
救了他兩次的神仙讓他今天三更去死
文/潘曉璐我一進(jìn)店門寸士，熙熙樓的掌柜王于貴愁眉苦臉地迎上來檐什，“玉大人，你說我怎么就攤上這事碉京∠嵝冢” “怎么了？”我有些...
開封第一講書人閱讀 168,083評論 0贊 360
道士緝兇錄：失蹤的賣姜人
文/不壞的土叔我叫張陵谐宙，是天一觀的道長烫葬。經(jīng)常有香客問我，道長凡蜻，這世上最難降的妖魔是什么搭综？我笑而不...
開封第一講書人閱讀 59,640評論 1贊 296
?港島之戀（遺憾婚禮）
正文為了忘掉前任，我火速辦了婚禮划栓，結(jié)果婚禮上兑巾，老公的妹妹穿的比我還像新娘。我一直安慰自己忠荞，他們只是感情好蒋歌，可當(dāng)我...
茶點(diǎn)故事閱讀 68,640評論 6贊 397
惡毒庶女頂嫁案：這布局不是一般人想出來的
文/花漫我一把揭開白布。她就那樣靜靜地躺著委煤，像睡著了一般堂油。火紅的嫁衣襯著肌膚如雪。梳的紋絲不亂的頭發(fā)上碧绞，一...
開封第一講書人閱讀 52,262評論 1贊 308
城市分裂傳說
那天府框，我揣著相機(jī)與錄音，去河邊找鬼讥邻。笑死迫靖，一個胖子當(dāng)著我的面吹牛院峡，可吹牛的內(nèi)容都是我干的。我是一名探鬼主播系宜，決...
沈念sama閱讀 40,833評論 3贊 421
雙鴛鴦連環(huán)套：你想象不到人心有多黑
文/蒼蘭香墨我猛地睜開眼照激，長吁一口氣：“原來是場噩夢啊……” “哼！你這毒婦竟也來了蜈首？” 一聲冷哼從身側(cè)響起实抡，我...
開封第一講書人閱讀 39,736評論 0贊 276
萬榮殺人案實(shí)錄
序言：老撾萬榮一對情侶失蹤，失蹤者是張志新（化名）和其女友劉穎欢策，沒想到半個月后吆寨，有當(dāng)?shù)厝嗽跇淞掷锇l(fā)現(xiàn)了一具尸體，經(jīng)...
沈念sama閱讀 46,280評論 1贊 319
?護(hù)林員之死
正文獨(dú)居荒郊野嶺守林人離奇死亡踩寇，尸身上長有42處帶血的膿包…… 初始之章·張勛以下內(nèi)容為張勛視角年9月15日...
茶點(diǎn)故事閱讀 38,369評論 3贊 340
?白月光啟示錄
正文我和宋清朗相戀三年啄清，在試婚紗的時(shí)候發(fā)現(xiàn)自己被綠了。大學(xué)時(shí)的朋友給我發(fā)了我未婚夫和他白月光在一起吃飯的照片俺孙。...
茶點(diǎn)故事閱讀 40,503評論 1贊 352
活死人
序言：一個原本活蹦亂跳的男人離奇死亡辣卒，死狀恐怖，靈堂內(nèi)的尸體忽然破棺而出睛榄，到底是詐尸還是另有隱情荣茫，我是刑警寧澤，帶...
沈念sama閱讀 36,185評論 5贊 350
?日本核電站爆炸內(nèi)幕
正文年R本政府宣布场靴，位于F島的核電站啡莉，受9級特大地震影響，放射性物質(zhì)發(fā)生泄漏旨剥。R本人自食惡果不足惜咧欣，卻給世界環(huán)境...
茶點(diǎn)故事閱讀 41,870評論 3贊 333
男人毒藥：我在死后第九天來索命
文/蒙蒙一、第九天我趴在偏房一處隱蔽的房頂上張望轨帜。院中可真熱鬧魄咕，春花似錦、人聲如沸蚌父。這莊子的主人今日做“春日...
開封第一講書人閱讀 32,340評論 0贊 24
一樁弒父案，背后竟有這般陰謀
文/蒼蘭香墨我抬頭看了看天上的太陽苟弛。三九已至喝滞，卻和暖如春，著一層夾襖步出監(jiān)牢的瞬間嗡午，已是汗流浹背囤躁。一陣腳步聲響...
開封第一講書人閱讀 33,460評論 1贊 272
情欲美人皮
我被黑心中介騙來泰國打工冀痕，沒想到剛下飛機(jī)就差點(diǎn)兒被人妖公主榨干…… 1. 我叫王不留荔睹，地道東北人狸演。一個月前我還...
沈念sama閱讀 48,909評論 3贊 376
代替公主和親
正文我出身青樓，卻偏偏與公主長得像僻他，于是被迫代替她去往敵國和親宵距。傳聞我的和親對象是個殘疾皇子，可洞房花燭夜當(dāng)晚...
茶點(diǎn)故事閱讀 45,512評論 2贊 359