In the previous article I worked through the derivation of the error backpropagation algorithm myself and gained a basic understanding of deep learning. This time I will try to build a small Python deep neural network and put it to work on a concrete problem.
Requirements: Python 3, NumPy
Dataset: the MNIST dataset (a dataset that has been "chewed to pieces"; nearly every tutorial takes a crack at it, and it has become something of a canonical example)
The MNIST dataset can be downloaded from http://yann.lecun.com/exdb/mnist/ and consists of four parts:
Training set images: train-images-idx3-ubyte.gz (9.9 MB, 47 MB uncompressed, 60,000 samples)
Training set labels: train-labels-idx1-ubyte.gz (29 KB, 60 KB uncompressed, 60,000 labels)
Test set images: t10k-images-idx3-ubyte.gz (1.6 MB, 7.8 MB uncompressed, 10,000 samples)
Test set labels: t10k-labels-idx1-ubyte.gz (5 KB, 10 KB uncompressed, 10,000 labels)
MNIST comes from the US National Institute of Standards and Technology (NIST). The training set consists of digits handwritten by 250 different people, 50% of them high-school students and 50% staff of the Census Bureau; the test set contains handwritten digits in the same proportions.
We create a DataParser.py file to parse the MNIST data:
import os
import struct
import numpy as np

def load_mnist(path, kind='train'):
    """Load the decompressed MNIST idx files found under `path`."""
    labels_path = os.path.join(path, '%s-labels-idx1-ubyte' % kind)
    images_path = os.path.join(path, '%s-images-idx3-ubyte' % kind)
    with open(labels_path, 'rb') as lbfile:
        # Big-endian header: magic number and label count
        magic, n = struct.unpack('>II', lbfile.read(8))
        labels = np.fromfile(lbfile, dtype=np.uint8)
    with open(images_path, 'rb') as imgfile:
        # Big-endian header: magic number, image count, rows, cols
        magic, num, rows, cols = struct.unpack('>IIII', imgfile.read(16))
        images = np.fromfile(imgfile, dtype=np.uint8).reshape(len(labels), 784)
    return images / 255, labels
Dividing by 255 maps the pixel values onto [0, 1].
Each handwritten image is 28×28 pixels, flattened here into a 784-element row.
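To sanity-check the parser, here is a quick usage sketch (assuming the four decompressed idx files sit in the working directory):

import DataParser as dp

train_images, train_labels = dp.load_mnist('', kind='train')
test_images, test_labels = dp.load_mnist('', kind='t10k')
print(train_images.shape, train_labels.shape)  # (60000, 784) (60000,)
print(test_images.shape, test_labels.shape)    # (10000, 784) (10000,)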
Training plan:
We split training into 60 epochs of 1,000 samples each; at the end of every epoch we measure accuracy on the test set.
Before each evaluation we shuffle the test data so the measurement is not tied to one fixed ordering.
Choosing the network structure:
Since a deep network needs at least two hidden layers, we define the structure as follows:
data layer → fully connected layer 1 → hidden layer 1 → fully connected layer 2 → hidden layer 2 → output layer
The data layer and fully connected layer 1 each have 784 neurons (matching the 28×28 pixels of a handwritten image), fully connected layer 2 has 256, and the output layer has 10 (the probabilities of the digits 0-9).
The activation function is the Sigmoid and the loss is the cross-entropy. The pairing is deliberate: the derivative of the cross-entropy loss carries an x(1−x) factor in its denominator, while the derivative of the Sigmoid is exactly x(1−x); when the chain rule multiplies them the factors cancel, which sidesteps the vanishing-gradient problem that the Sigmoid otherwise causes.
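Concretely, for a single Sigmoid output a = σ(z) with target y:

L = -[y·ln(a) + (1 - y)·ln(1 - a)]
dL/da = (a - y) / (a·(1 - a))
da/dz = a·(1 - a)
dL/dz = (dL/da)·(da/dz) = a - y

So the gradient flowing back to the weights is just the prediction error a - y, with no saturating a·(1 - a) factor left over.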
Now let's implement this network in Python.
The data layer is defined as follows (its forward pass simply emits a sample, and it has nothing to backpropagate):
import numpy as np
import DataParser as dp  # the parser module defined above

class DataLayer:
    def __init__(self):
        # Load the full training split once
        imgs, lbls = dp.load_mnist('', kind='train')
        self.x = imgs
        self.y = lbls

    def forward(self, index):
        # Emit the index-th sample as a column vector, together with its label
        xx = self.x[index]
        return (xx.reshape(xx.size, 1), self.y[index])

    def backward(self, d):
        # Nothing to learn here, so backward is a no-op
        pass
The test data layer is defined as follows (shuffle_data reshuffles the data before each evaluation):
class TestLayer:
    def __init__(self):
        # Load the full test split once
        imgs, lbls = dp.load_mnist('', kind='t10k')
        self.x = imgs
        self.y = lbls

    def shuffle_data(self):
        # Apply one random permutation to images and labels together
        index = np.random.permutation(len(self.x))
        self.x = self.x[index]
        self.y = self.y[index]

    def forward(self, index):
        xx = self.x[index]
        return (xx.reshape(xx.size, 1), self.y[index])
The fully connected layer is as follows (dividing the initial weights by np.sqrt(l_x) keeps the initial pre-activations at a reasonable scale and avoids pathological values):
forward: a matrix multiplication computing z = Wx + b
backward: uses the incoming gradient to update the weights and bias, and passes the gradient on
class FullConnect:
    def __init__(self, l_x, l_y):
        # Dividing by sqrt(l_x) keeps the initial pre-activations near unit scale
        self.weights = np.random.randn(l_y, l_x) / np.sqrt(l_x)
        self.bias = np.random.randn(l_y, 1)
        self.lr = 0  # learning rate, assigned from outside before training

    def forward(self, x):
        # z = Wx + b
        self.x = x
        self.y = np.dot(self.weights, x) + self.bias
        return self.y

    def backward(self, d):
        # Gradients with respect to the weights, bias and input
        self.dw = np.dot(d, self.x.T)
        self.db = d
        self.dx = np.dot(self.weights.T, d)
        # Plain SGD update
        self.weights -= self.lr * self.dw
        self.bias -= self.lr * self.db
        return self.dx
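A quick numeric illustration of why the 1/np.sqrt(l_x) scaling matters (a standalone sketch, not part of the network): without it, 784 pixel-scale inputs produce pre-activations with a standard deviation around 16, which drives the Sigmoid deep into saturation; with it, they stay near unit scale.

import numpy as np

x = np.random.rand(784, 1)  # pixel-scale inputs in [0, 1]
w_raw = np.random.randn(256, 784)
w_scaled = w_raw / np.sqrt(784)
print(np.dot(w_raw, x).std())     # roughly 16: saturates the Sigmoid
print(np.dot(w_scaled, x).std())  # roughly 0.6: keeps the Sigmoid responsive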
The activation function:
class Sigmoid:
    def sigmoid(self, x):
        return 1 / (1 + np.exp(-x))

    def forward(self, x):
        self.x = x
        self.y = self.sigmoid(x)
        return self.y

    def backward(self, d):
        # Chain rule: scale the incoming gradient by sigma'(x) = sigma(x)(1 - sigma(x))
        sig = self.sigmoid(self.x)
        self.dx = d * sig * (1 - sig)
        return self.dx  # pass the gradient upstream
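A minimal finite-difference check (my own sketch, not part of the original code) that Sigmoid.backward really returns d·σ'(x):

layer = Sigmoid()
x = np.array([[0.5], [-1.2]])
eps = 1e-6
layer.forward(x)
analytic = layer.backward(np.ones_like(x))  # with d = 1 this is sigma'(x)
numeric = (layer.sigmoid(x + eps) - layer.sigmoid(x - eps)) / (2 * eps)
print(np.allclose(analytic, numeric))       # True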
The loss function:
class CrossEntropyLoss:
    def forward(self, x, label):
        self.x = x
        # One-hot encode the integer label
        self.label = np.zeros_like(x)
        self.label[label] = 1.0
        # Elementwise cross entropy; nan_to_num guards against log(0)
        self.loss = np.nan_to_num(-self.label * np.log(x) - (1 - self.label) * np.log(1 - x))
        self.loss = np.sum(self.loss) / x.shape[0]
        return self.loss

    def backward(self):
        # dL/dx = (x - label) / (x * (1 - x))
        self.dx = (self.x - self.label) / self.x / (1 - self.x)
        return self.dx
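Chaining the two backward passes reproduces the a - y result derived earlier; a quick check (again just a sketch):

sig = Sigmoid()
loss = CrossEntropyLoss()
z = np.random.randn(10, 1)
a = sig.forward(z)
loss.forward(a, 3)                     # pretend the true digit is 3
d = sig.backward(loss.backward())      # gradient with respect to z
print(np.allclose(d, a - loss.label))  # True: the x(1 - x) factors cancel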
Accuracy is measured as follows (if the index of the largest value in x, i.e. the most probable digit, matches the label we count the prediction as correct, otherwise as wrong):
class Accuracy:
    def forward(self, x, label):
        # Correct when the highest-probability entry sits at the labelled index
        if x[label] > x.max() - 0.0001:
            return 1
        return 0
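The 0.0001 tolerance is essentially a roundabout argmax; barring near-exact ties, an equivalent one-liner would be:

class Accuracy:
    def forward(self, x, label):
        # 1 if the most probable digit matches the label, else 0
        return int(np.argmax(x) == label)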
Now we simply wire these layers together and train:
def train():
    block_size = 1000
    datalayer1 = DataLayer()
    datalayer2 = TestLayer()
    # Assemble the network: FC(784 -> 256) -> Sigmoid -> FC(256 -> 10) -> Sigmoid
    inner_layers = []
    inner_layers.append(FullConnect(784, 256))
    inner_layers.append(Sigmoid())
    inner_layers.append(FullConnect(256, 10))
    inner_layers.append(Sigmoid())
    losslayer = CrossEntropyLoss()
    accuracy = Accuracy()
    epochs = 60
    for layer in inner_layers:
        layer.lr = 0.05  # one learning rate for every layer
    for i in range(epochs):
        print('epochs: ', i)
        losssum = 0
        iters = 0
        # Each epoch consumes the next block_size samples, so 60 epochs of
        # 1000 samples sweep the 60,000-sample training set exactly once.
        for j in range(block_size):
            iters += 1
            x, label = datalayer1.forward(i * block_size + j)
            # Forward pass through every layer
            for layer in inner_layers:
                x = layer.forward(x)
            loss = losslayer.forward(x, label)
            losssum += loss
            # Backward pass in reverse order; each layer updates itself on the way
            d = losslayer.backward()
            for layer in inner_layers[::-1]:
                d = layer.backward(d)
            if j == block_size - 1:
                # End of the epoch: evaluate on the freshly shuffled test set
                accu = 0
                datalayer2.shuffle_data()
                for k in range(10000):
                    x, label = datalayer2.forward(index=k)
                    for layer in inner_layers:
                        x = layer.forward(x)
                    accu += accuracy.forward(x, label)
                print('accuracy: ', accu / 10000)
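Finally, a minimal entry point (assuming everything above lives in a single script next to DataParser.py and the decompressed data files):

if __name__ == '__main__':
    np.random.seed(0)  # optional: fix the seed for repeatable runs
    train()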
With the learning rate set to 0.05, less than two minutes of training pushes accuracy to about 96%. From the output below we can also see a large jump in performance between the first and second epochs.
epochs: 0
accuracy: 0.6473
epochs: 1
accuracy: 0.846
epochs: 2
accuracy: 0.8799
epochs: 3
accuracy: 0.8828
epochs: 4
accuracy: 0.8964
epochs: 5
accuracy: 0.8988
epochs: 6
accuracy: 0.9022
epochs: 7
accuracy: 0.8958
epochs: 8
accuracy: 0.9058
epochs: 9
accuracy: 0.9157
epochs: 10
accuracy: 0.9055
epochs: 11
accuracy: 0.915
epochs: 12
accuracy: 0.9149
epochs: 13
accuracy: 0.9114
epochs: 14
accuracy: 0.9259
epochs: 15
accuracy: 0.9226
epochs: 16
accuracy: 0.9308
epochs: 17
accuracy: 0.9304
epochs: 18
accuracy: 0.9394
epochs: 19
accuracy: 0.9323
epochs: 20
accuracy: 0.9328
epochs: 21
accuracy: 0.9356
epochs: 22
accuracy: 0.94
epochs: 23
accuracy: 0.9456
epochs: 24
accuracy: 0.9431
epochs: 25
accuracy: 0.9374
epochs: 26
accuracy: 0.9466
epochs: 27
accuracy: 0.9468
epochs: 28
accuracy: 0.9465
epochs: 29
accuracy: 0.9389
epochs: 30
accuracy: 0.9471
epochs: 31
accuracy: 0.9473
epochs: 32
accuracy: 0.9524
epochs: 33
accuracy: 0.953
epochs: 34
accuracy: 0.9501
epochs: 35
accuracy: 0.944
epochs: 36
accuracy: 0.9458
epochs: 37
accuracy: 0.9554
epochs: 38
accuracy: 0.9556
epochs: 39
accuracy: 0.9536
epochs: 40
accuracy: 0.9497
epochs: 41
accuracy: 0.9557
epochs: 42
accuracy: 0.9548
epochs: 43
accuracy: 0.9517
epochs: 44
accuracy: 0.9496
epochs: 45
accuracy: 0.9495
epochs: 46
accuracy: 0.9579
epochs: 47
accuracy: 0.9573
epochs: 48
accuracy: 0.9485
epochs: 49
accuracy: 0.9605
epochs: 50
accuracy: 0.9618
epochs: 51
accuracy: 0.9625
epochs: 52
accuracy: 0.9599
epochs: 53
accuracy: 0.9574
epochs: 54
accuracy: 0.961
epochs: 55
accuracy: 0.9621
epochs: 56
accuracy: 0.9597
epochs: 57
accuracy: 0.9565
epochs: 58
accuracy: 0.9591
epochs: 59
accuracy: 0.9604
And that's it: by piecing together material from around the web, I have implemented my first deep neural network. No doubt some parts are less than ideal; onward!