HoloLens Development Notes - Speech Recognition (Dictation)

HoloLens supports three forms of voice input:

Voice Command
Dictation
Grammar Recognizer

The earlier post, HoloLens Development Notes - Speech Recognition (Voice Command), covered how to use Voice Command. This post covers dictation:

Dictation Recognition

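Dictation is speech to text. On HoloLens it is mostly used wherever text has to be typed, for example when searching in Edge. HoloLens normally has no conventional physical keyboard, so typing on the virtual keyboard means turning your head until the Gaze cursor rests on the letter you want and then confirming that letter with an air-tap Gesture. Doing this for every character is clearly very inconvenient.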

Entering text by converting speech to text is therefore far more efficient.

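The dictation feature converts the user's speech into text input. It also supports hypotheses (interim guesses at what is being said) and event registration. The Start() and Stop() methods enable and disable dictation. Once you are done with the recognizer, call Dispose() to release it. The GC will eventually reclaim its resources anyway, but skipping Dispose() incurs extra performance overhead.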

Keep the following in mind when using dictation:

The Microphone capability must be enabled in your app. Under Edit -> Project Settings -> Player -> Windows Store -> Publishing Settings -> Capabilities, make sure Microphone is checked.
HoloLens must be connected to Wi-Fi, otherwise dictation will not work.
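
The two prerequisites above can be sanity-checked at startup. Here is a minimal sketch, assuming a hypothetical helper component named DictationPreflightCheck (it is not part of the original sample). It only logs warnings, and the network check is a coarse hint, not proof that the speech service is reachable.

```csharp
using UnityEngine;

// Hypothetical helper (not from the original post): warns at startup when the
// prerequisites for dictation do not appear to be met.
public class DictationPreflightCheck : MonoBehaviour
{
    void Start()
    {
        // Microphone.devices lists the capture devices Unity can see.
        // An empty list usually means no microphone is available to the app,
        // for example because the Microphone capability is not declared.
        if (Microphone.devices.Length == 0)
        {
            Debug.LogWarning("No microphone found. Check the Microphone capability.");
        }

        // internetReachability only reports whether a network route exists.
        // It does not guarantee that the speech service can be reached.
        if (Application.internetReachability == NetworkReachability.NotReachable)
        {
            Debug.LogWarning("No network connection. Dictation needs Wi-Fi.");
        }
    }
}
```

Attach it to any GameObject in the scene. If either warning shows up in the log, fix the setting before debugging the recognizer itself.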

DictationRecognizer.cs

using HoloToolkit;
using System.Collections;
using System.Text;
using UnityEngine;
using UnityEngine.UI;
using UnityEngine.Windows.Speech;

public class MicrophoneManager : MonoBehaviour
{
    [Tooltip("A text area for the recognizer to display the recognized strings.")]
    public Text DictationDisplay;

    private DictationRecognizer dictationRecognizer;

    // Use this string to cache the text currently displayed in the text box.
    // Initialized here so the event handlers never hit a null reference.
    private StringBuilder textSoFar = new StringBuilder();

    void Awake()
    {
        /* TODO: DEVELOPER CODING EXERCISE 3.a */

        //Create a new DictationRecognizer and assign it to dictationRecognizer variable.
        dictationRecognizer = new DictationRecognizer();

        //Register for dictationRecognizer.DictationHypothesis and implement DictationHypothesis below
        // This event is fired while the user is talking. As the recognizer listens, it provides text of what it's heard so far.
        dictationRecognizer.DictationHypothesis += DictationRecognizer_DictationHypothesis;

        //Register for dictationRecognizer.DictationResult and implement DictationResult below
        // This event is fired after the user pauses, typically at the end of a sentence. The full recognized string is returned here.
        dictationRecognizer.DictationResult += DictationRecognizer_DictationResult;

        //Register for dictationRecognizer.DictationComplete and implement DictationComplete below
        // This event is fired when the recognizer stops, whether from Stop() being called, a timeout occurring, or some other error.
        dictationRecognizer.DictationComplete += DictationRecognizer_DictationComplete;

        //Register for dictationRecognizer.DictationError and implement DictationError below
        // This event is fired when an error occurs, typically because the network is not connected
        // or the connection drops during recognition.
        dictationRecognizer.DictationError += DictationRecognizer_DictationError;

        // Shut down the PhraseRecognitionSystem, which controls the KeywordRecognizers.
        // Dictation can only be started after keyword recognition has been shut down.
        PhraseRecognitionSystem.Shutdown();

        // Start dictationRecognizer.
        dictationRecognizer.Start();

    }

    /// <summary>
    /// This event is fired while the user is talking. As the recognizer listens, it provides text of what it's heard so far.
    /// </summary>
    /// <param name="text">The currently hypothesized recognition.</param>
    private void DictationRecognizer_DictationHypothesis(string text)
    {
        // Set DictationDisplay text to be textSoFar and new hypothesized text
        // We don't want to append to textSoFar yet, because the hypothesis may have changed on the next event
        DictationDisplay.text = textSoFar.ToString() + " " + text + "...";
    }

    /// <summary>
    /// This event is fired after the user pauses, typically at the end of a sentence. The full recognized string is returned here.
    /// </summary>
    /// <param name="text">The text that was heard by the recognizer.</param>
    /// <param name="confidence">A representation of how confident (rejected, low, medium, high) the recognizer is of this recognition.</param>
    private void DictationRecognizer_DictationResult(string text, ConfidenceLevel confidence)
    {
        // 3.a: Append textSoFar with latest text, followed by a separator
        // so consecutive results do not run together.
        textSoFar.Append(text + ". ");

        // 3.a: Set DictationDisplay text to be textSoFar
        DictationDisplay.text = textSoFar.ToString();
    }

    /// <summary>
    /// This event is fired when the recognizer stops, whether from Stop() being called, a timeout occurring, or some other error.
    /// Typically, this will simply return "Complete". In this case, we check to see if the recognizer timed out.
    /// </summary>
    /// <param name="cause">An enumerated reason for the session completing.</param>
    private void DictationRecognizer_DictationComplete(DictationCompletionCause cause)
    {
        // If Timeout occurs, the user has been silent for too long.
        // With dictation, the default timeout after a recognition is 20 seconds.
        // The default timeout with initial silence is 5 seconds.
        if (cause == DictationCompletionCause.TimeoutExceeded)
        {
            // Stop any capture on the default microphone (an empty device name selects it).
            Microphone.End(string.Empty);

            DictationDisplay.text = "Dictation has timed out. Please press the record button again.";

            // ResetAfterTimeout is expected to live on another component of this GameObject.
            // DontRequireReceiver avoids an error when no such component is attached.
            SendMessage("ResetAfterTimeout", SendMessageOptions.DontRequireReceiver);
        }
    }

    /// <summary>
    /// This event is fired when an error occurs.
    /// </summary>
    /// <param name="error">The string representation of the error reason.</param>
    /// <param name="hresult">The int representation of the hresult.</param>
    private void DictationRecognizer_DictationError(string error, int hresult)
    {
        // 3.a: Set DictationDisplay text to be the error string
        DictationDisplay.text = error + "\nHRESULT: " + hresult;
    }


    // Update is called once per frame.
    void Update()
    {
    }
  
    void OnDestroy()  
    {  
        dictationRecognizer.Stop();  
        dictationRecognizer.DictationHypothesis -= DictationRecognizer_DictationHypothesis;  
        dictationRecognizer.DictationResult -= DictationRecognizer_DictationResult;  
        dictationRecognizer.DictationComplete -= DictationRecognizer_DictationComplete;  
        dictationRecognizer.DictationError -= DictationRecognizer_DictationError;  
        dictationRecognizer.Dispose();  
    }  

}

HoloLens can run only one speech recognizer at a time, so to use dictation you must first shut down the KeywordRecognizer.
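
If the app needs both voice commands and dictation, the two can take turns. The sketch below assumes a hypothetical component named SpeechModeSwitcher, with BeginDictation() and EndDictation() wired to your own UI. It is one possible arrangement, not the approach used in the sample above: it shuts keyword recognition down before dictation starts and asks Unity to restart it once dictation completes.

```csharp
using UnityEngine;
using UnityEngine.Windows.Speech;

// Hypothetical component (not from the original post): hands the speech
// system from keyword recognition to dictation and back again.
public class SpeechModeSwitcher : MonoBehaviour
{
    private DictationRecognizer dictationRecognizer;

    void Awake()
    {
        dictationRecognizer = new DictationRecognizer();
        dictationRecognizer.DictationComplete += OnDictationComplete;
    }

    // Call this when the user wants to dictate, e.g. from a record button.
    public void BeginDictation()
    {
        // Keyword recognition has to be shut down before dictation can run.
        if (PhraseRecognitionSystem.Status == SpeechSystemStatus.Running)
        {
            PhraseRecognitionSystem.Shutdown();
        }

        if (dictationRecognizer.Status != SpeechSystemStatus.Running)
        {
            dictationRecognizer.Start();
        }
    }

    // Call this when the user is done dictating.
    public void EndDictation()
    {
        if (dictationRecognizer.Status == SpeechSystemStatus.Running)
        {
            dictationRecognizer.Stop();
        }
    }

    // Fired after Stop(), a timeout, or an error ends the dictation session.
    private void OnDictationComplete(DictationCompletionCause cause)
    {
        // Ask Unity to bring the phrase recognition system back so that
        // keyword recognizers can listen again.
        PhraseRecognitionSystem.Restart();
    }

    void OnDestroy()
    {
        dictationRecognizer.DictationComplete -= OnDictationComplete;
        dictationRecognizer.Dispose();
    }
}
```

Restarting from the DictationComplete handler, instead of directly after Stop(), means keyword recognition also comes back when the session ends through a timeout.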

DictationRecognizer has two timeouts:

If the recognizer is started and hears nothing for 5 seconds, it times out.
If the recognizer has produced a result but then hears nothing for 20 seconds, it times out.
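
Both timeouts can be changed through properties on DictationRecognizer. A minimal sketch follows. The component name DictationTimeoutExample and the values of 10 and 30 seconds are assumptions chosen only for illustration.

```csharp
using UnityEngine;
using UnityEngine.Windows.Speech;

// Hypothetical component (not from the original post): starts dictation
// with longer silence timeouts than the 5 s / 20 s defaults.
public class DictationTimeoutExample : MonoBehaviour
{
    private DictationRecognizer dictationRecognizer;

    void Start()
    {
        dictationRecognizer = new DictationRecognizer();

        // Seconds of silence allowed before anything has been heard.
        dictationRecognizer.InitialSilenceTimeoutSeconds = 10f;

        // Seconds of silence allowed after a phrase has been recognized.
        dictationRecognizer.AutoSilenceTimeoutSeconds = 30f;

        dictationRecognizer.DictationComplete += OnDictationComplete;
        dictationRecognizer.Start();
    }

    private void OnDictationComplete(DictationCompletionCause cause)
    {
        if (cause == DictationCompletionCause.TimeoutExceeded)
        {
            Debug.Log("Dictation timed out after a period of silence.");
        }
    }

    void OnDestroy()
    {
        dictationRecognizer.DictationComplete -= OnDictationComplete;
        dictationRecognizer.Dispose();
    }
}
```

Like the main script, this assumes keyword recognition is not running when Start() is called.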

Last edited: 2017.12.04 17:12:19
