MobileNet is a classification network for mobile devices proposed by Google. In V1, MobileNet applied depthwise separable convolutions (depth-wise separable convolution) and introduced two hyperparameters to control network capacity; the assumption behind this kind of convolution is that cross-channel correlations and spatial correlations can be decoupled. Depthwise separable convolutions save parameters and reach competitive accuracy while keeping the model complexity acceptable for mobile devices. In V2, MobileNet adopts a new unit, the inverted residual with linear bottleneck; the main changes are a linear activation on the bottleneck output and moving the skip connections of residual networks to the low-dimensional bottleneck layers.
Paper: Inverted Residuals and Linear Bottlenecks: Mobile Networks for Classification, Detection and Segmentation
GitHub: https://github.com/xiaochus/MobileNetV2
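To make the depthwise separable idea concrete, here is a minimal sketch of my own (not from the paper or the repository) that factors a standard 3 * 3 convolution into a depthwise step and a pointwise step, using the same Keras 2.1.x API as the implementation below; the input shape and filter count are arbitrary examples.

from keras.models import Model
from keras.layers import Input, Conv2D
from keras.applications.mobilenet import DepthwiseConv2D

# Example input: a 112 x 112 feature map with 32 channels.
inputs = Input(shape=(112, 112, 32))
# Depthwise step: one 3x3 filter per input channel, covering spatial correlations.
x = DepthwiseConv2D((3, 3), padding='same')(inputs)
# Pointwise step: a 1x1 convolution that mixes channels, covering cross-channel correlations.
x = Conv2D(64, (1, 1), padding='same')(x)
model = Model(inputs, x)
# Weight count: 3*3*32 = 288 (depthwise) + 1*1*32*64 = 2048 (pointwise),
# versus 3*3*32*64 = 18432 for a standard 3x3 convolution (biases ignored).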
Network Structure
The overall structure of MobileNetV2 is shown in the figure below. Each row describes a sequence of one or more identical layers (apart from the stride), with each bottleneck repeated n times. All layers in the same sequence have the same number of output channels. The first layer of each sequence uses stride s, and all the other layers use stride 1. All spatial convolutions use 3 * 3 kernels. The expansion factor t is always applied to the input size: if the tensor entering a layer has k channels, then k * t filters are applied in that layer (for example, a bottleneck whose input has 24 channels and t = 6 expands it to 144 channels).
The structure of the bottleneck is shown below; a skip connection is used only when the stride is 1 and the input and output have the same number of channels.
Environment
- OpenCV 3.4
- Python 3.5
- Tensorflow-gpu 1.2.0
- Keras 2.1.3
Implementation
Based on the parameters given in the paper, I implemented the network structure with Keras 2, as shown below:
from keras.models import Model
from keras.layers import Input, Conv2D, GlobalAveragePooling2D, Dropout
from keras.layers import Activation, BatchNormalization, add, Reshape
from keras.applications.mobilenet import relu6, DepthwiseConv2D
from keras.utils.vis_utils import plot_model
from keras import backend as K
def _conv_block(inputs, filters, kernel, strides):
    """Convolution Block
    This function defines a 2D convolution operation with BN and relu6.

    # Arguments
        inputs: Tensor, input tensor of conv layer.
        filters: Integer, the dimensionality of the output space.
        kernel: An integer or tuple/list of 2 integers, specifying the
            width and height of the 2D convolution window.
        strides: An integer or tuple/list of 2 integers,
            specifying the strides of the convolution along the width and height.
            Can be a single integer to specify the same value for
            all spatial dimensions.

    # Returns
        Output tensor.
    """
    channel_axis = 1 if K.image_data_format() == 'channels_first' else -1

    x = Conv2D(filters, kernel, padding='same', strides=strides)(inputs)
    x = BatchNormalization(axis=channel_axis)(x)
    return Activation(relu6)(x)
def _bottleneck(inputs, filters, kernel, t, s, r=False):
    """Bottleneck
    This function defines a basic bottleneck structure.

    # Arguments
        inputs: Tensor, input tensor of conv layer.
        filters: Integer, the dimensionality of the output space.
        kernel: An integer or tuple/list of 2 integers, specifying the
            width and height of the 2D convolution window.
        t: Integer, expansion factor.
            t is always applied to the input size.
        s: An integer or tuple/list of 2 integers, specifying the strides
            of the convolution along the width and height. Can be a single
            integer to specify the same value for all spatial dimensions.
        r: Boolean, whether to use the residual connection.

    # Returns
        Output tensor.
    """
    channel_axis = 1 if K.image_data_format() == 'channels_first' else -1
    # Expand the input channels by the factor t.
    tchannel = K.int_shape(inputs)[channel_axis] * t

    x = _conv_block(inputs, tchannel, (1, 1), (1, 1))

    # Depthwise 3x3 convolution; the stride s controls spatial downsampling.
    x = DepthwiseConv2D(kernel, strides=(s, s), depth_multiplier=1, padding='same')(x)
    x = BatchNormalization(axis=channel_axis)(x)
    x = Activation(relu6)(x)

    # Linear 1x1 projection back to the bottleneck width (no activation).
    x = Conv2D(filters, (1, 1), strides=(1, 1), padding='same')(x)
    x = BatchNormalization(axis=channel_axis)(x)

    if r:
        x = add([x, inputs])

    return x
def _inverted_residual_block(inputs, filters, kernel, t, strides, n):
    """Inverted Residual Block
    This function defines a sequence of 1 or more identical layers.

    # Arguments
        inputs: Tensor, input tensor of conv layer.
        filters: Integer, the dimensionality of the output space.
        kernel: An integer or tuple/list of 2 integers, specifying the
            width and height of the 2D convolution window.
        t: Integer, expansion factor.
            t is always applied to the input size.
        strides: An integer or tuple/list of 2 integers, specifying the strides
            of the convolution along the width and height. Can be a single
            integer to specify the same value for all spatial dimensions.
        n: Integer, layer repeat times.

    # Returns
        Output tensor.
    """
    # The first layer of the sequence uses the given stride and cannot use a
    # residual connection, because its input and output shapes differ.
    x = _bottleneck(inputs, filters, kernel, t, strides)

    # The remaining n - 1 layers use stride 1 and a residual connection.
    for i in range(1, n):
        x = _bottleneck(x, filters, kernel, t, 1, True)

    return x
def MobileNetv2(input_shape, k):
    """MobileNetv2
    This function defines a MobileNetv2 architecture.

    # Arguments
        input_shape: An integer or tuple/list of 3 integers, shape
            of input tensor.
        k: Integer, number of classes.

    # Returns
        MobileNetv2 model.
    """
    inputs = Input(shape=input_shape)
    x = _conv_block(inputs, 32, (3, 3), strides=(2, 2))

    x = _inverted_residual_block(x, 16, (3, 3), t=1, strides=1, n=1)
    x = _inverted_residual_block(x, 24, (3, 3), t=6, strides=2, n=2)
    x = _inverted_residual_block(x, 32, (3, 3), t=6, strides=2, n=3)
    x = _inverted_residual_block(x, 64, (3, 3), t=6, strides=2, n=4)
    x = _inverted_residual_block(x, 96, (3, 3), t=6, strides=1, n=3)
    x = _inverted_residual_block(x, 160, (3, 3), t=6, strides=2, n=3)
    x = _inverted_residual_block(x, 320, (3, 3), t=6, strides=1, n=1)

    x = _conv_block(x, 1280, (1, 1), strides=(1, 1))
    x = GlobalAveragePooling2D()(x)
    x = Reshape((1, 1, 1280))(x)
    x = Dropout(0.3, name='Dropout')(x)
    # A 1x1 convolution over the 1x1 feature map acts as the fully connected classifier.
    x = Conv2D(k, (1, 1), padding='same')(x)

    x = Activation('softmax', name='softmax')(x)
    output = Reshape((k,))(x)

    model = Model(inputs, output)
    plot_model(model, to_file='images/MobileNetv2.png', show_shapes=True)

    return model
if __name__ == '__main__':
    MobileNetv2((224, 224, 3), 1000)
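As a quick sanity check (my addition, not part of the original script), the model can be built and inspected as below; the total parameter count should be close to the roughly 3.4M that the paper reports for the 1.0/224 model. Note that plot_model requires pydot and graphviz, and an existing images/ directory.

model = MobileNetv2((224, 224, 3), 1000)
model.summary()                  # layer-by-layer output shapes
print(model.count_params())      # total number of parameters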
Training
The input size recommended in the paper is 224 * 224, so the training set should preferably use the same size. The file data\convert.py provides an example of upscaling the cifar-100 data to 224.
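As a rough sketch of what such a conversion involves (my own code, not the repository's convert.py; the upscale helper is hypothetical and uses the OpenCV version from the environment list), resizing 32 x 32 cifar-100 images to 224 x 224 looks something like this:

import cv2
import numpy as np
from keras.datasets import cifar100

(x_train, y_train), (x_test, y_test) = cifar100.load_data()

def upscale(images, size=224):
    """Resize a batch of 32x32 images to size x size with bilinear interpolation."""
    return np.stack([cv2.resize(img, (size, size), interpolation=cv2.INTER_LINEAR)
                     for img in images])

x_train_224 = upscale(x_train[:128])  # resize in small batches to limit memory use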
The training dataset should be organized in the following layout:
| - data/
    | - train/
        | - class 0/
            | - image.jpg
            ....
        | - class 1/
            ....
        | - class n/
    | - validation/
        | - class 0/
        | - class 1/
        ....
        | - class n/
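With the directories laid out like this, a Keras generator can feed the network directly. The snippet below is my own sketch of how the data might be loaded (not necessarily what train.py does; the batch size and rescaling are example choices):

from keras.preprocessing.image import ImageDataGenerator

train_gen = ImageDataGenerator(rescale=1. / 255)
val_gen = ImageDataGenerator(rescale=1. / 255)

# The class sub-folders under data/train and data/validation become the labels.
train_flow = train_gen.flow_from_directory(
    'data/train', target_size=(224, 224), batch_size=128, class_mode='categorical')
val_flow = val_gen.flow_from_directory(
    'data/validation', target_size=(224, 224), batch_size=128, class_mode='categorical')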
Run the following command to train the model:
python train.py --classes num_classes --batch batch_size --epochs epochs --size image_size
The trained .h5
weight files are saved in the model folder. If you want to fine-tune an existing model, use the command below. Note that only the number of output classes in the final layer can be changed; the structure of the other layers must stay the same.
python train.py --classes num_classes --batch batch_size --epochs epochs --size image_size --weights weights_path --tclasses pre_classes
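Conceptually, fine-tuning amounts to rebuilding the network with the old number of classes, loading the saved weights, and then swapping in a new classifier head. The sketch below is my own reconstruction of that idea (the rebuild_classifier helper is hypothetical and is not the repository's actual fine-tune code); it assumes the MobileNetv2 function defined above is in scope.

from keras.models import Model
from keras.layers import Conv2D, Activation, Reshape

def rebuild_classifier(weights_path, tclasses, classes):
    """Load weights trained with tclasses outputs and attach a fresh classes-way head."""
    base = MobileNetv2((224, 224, 3), tclasses)
    base.load_weights(weights_path)
    # Reuse everything up to the Dropout layer, then add a new 1x1 classifier.
    x = base.get_layer('Dropout').output
    x = Conv2D(classes, (1, 1), padding='same')(x)
    x = Activation('softmax')(x)
    output = Reshape((classes,))(x)
    return Model(base.input, output)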
Parameters
- --classes, the number of classes in the current training set.
- --size, the image size.
- --batch, the batch size.
- --epochs, the number of epochs.
- --weights, the model weights to fine-tune.
- --tclasses, the number of output classes of the pre-trained model.
Experiments
Due to resource constraints, we ran the experiment on the cifar-100 dataset for a limited number of epochs.
device: Tesla K80
dataset: cifar-100
optimizer: Adam(lr=0.001, beta_1=0.9, beta_2=0.999, epsilon=1e-08)
batch_size: 128
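For reference, these settings translate into Keras calls along the following lines, reusing the generators sketched in the Training section. This is my own reconstruction, not necessarily how train.py is written; the loss and the epoch count are example choices.

from keras.optimizers import Adam

model = MobileNetv2((224, 224, 3), 100)  # cifar-100 has 100 classes
model.compile(optimizer=Adam(lr=0.001, beta_1=0.9, beta_2=0.999, epsilon=1e-08),
              loss='categorical_crossentropy',
              metrics=['accuracy'])
model.fit_generator(train_flow,
                    steps_per_epoch=train_flow.samples // 128,
                    validation_data=val_flow,
                    validation_steps=val_flow.samples // 128,
                    epochs=100)  # example value; the post does not state the exact number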
The results are shown below; although the network has not fully converged, it still achieves decent accuracy.
Dataset | Loss | Top-1 Accuracy | Top-5 Accuracy |
---|---|---|---|
cifar-100 | 0.195 | 94.42% | 99.82% |