claude_vip - 簡書

claude_vip

IP屬地：江蘇

Voice Conversion
Introduction VC aims to convert the non-linguistic information of the sp...

1531 0 1
ASR Systems
Introduction The ASR system can be categoried as three classes by its ou...

592 0 0

Language Model for ASR
Background Automatic Speech Recognition (ASR) uses both acoustic model (...

466 0 0
語言模型融合 Language Model Fusion
Introduction In the previous articals, we have learnt the CTC loss makes...

778 0 0
Keyword Spotting關(guān)鍵詞偵聽
Introduction Keyword Spotting (KWS) aims at detecting predefined key-wor...

1199 0 0
注意力機制的增強Enhancement of Attention Mechanism
Multi-headed Attention 一個attention head可能權(quán)重大部分在某處坟岔，不能提取豐富的信息，需要多個進行融合钓觉。 Fu...

710 0 0
Query-Key-Value Perspective on Attention Mechanism 怎么用“查詢-鍵-值”理解注意力機制
注意力機制 RNN編碼-解碼模型論文[1]中衫贬，從RNN編碼-解碼模型演進出注意力機制朝氓。RNN編碼-解碼模型中缸逃，編碼器輸入序列少辣，是編碼器RNN在...

3370 0 0

CTC序列模型
背景手寫體識別、語音識別中笤成，輸入數(shù)據(jù)和輸出的識別結(jié)果長度不一致评架、而且可變。直接用神經(jīng)網(wǎng)絡(luò)訓(xùn)練需要預(yù)分割炕泳、調(diào)整纵诞，得到對應(yīng)關(guān)系，這很難做到培遵。CTC...

1125 0 0
Faster R-CNN
網(wǎng)絡(luò)架構(gòu) 可以分為3個部分 Head Region Proposal Network(RPN) Classification Network R...

181 0 0