This document describes the parameter definitions and usage of RunOptions; the TensorFlow version discussed is 1.12.
1. RunOptions Parameters
RunOptions provides configuration parameters that are consumed when Session::Run is called, including:
- trace_level: the level of runtime tracing to collect for this step (NO_TRACE / SOFTWARE_TRACE / HARDWARE_TRACE / FULL_TRACE); the timeline example in Section 4.1 uses FULL_TRACE.
- timeout_in_ms: how long to wait for the operation to complete, in milliseconds.
- inter_op_thread_pool: if session_inter_op_thread_pool was configured when the session was created, this parameter selects which of those thread pools to use. The proto comment notes that -1 means "use the caller's thread", which suits small, simple graphs because it avoids thread-switching overhead. Beware of version differences: in versions before TF 1.10, setting -1 raises an InvalidArgument error.
- output_partition_graphs: boolean flag indicating whether the partition graphs executed in this step should be output to RunMetadata.
- debug_options: debugging-related options.
- report_tensor_allocations_upon_oom: when the allocator runs out of memory, include information about the tensor allocations in the error message; enabling this slows down Session::Run.
- experimental: these fields are unstable, so compatibility must be checked across versions. The two experimental fields in RunOptions are still present as of TensorFlow 2.1. Among them, use_run_handler_pool is recommended for CPU-heavy scenarios such as inference: thread-pool work is scheduled centrally across sessions, which reduces latency. (A Python sketch of how these fields are set follows the proto definition below.)
message RunOptions {
  enum TraceLevel {
    NO_TRACE = 0;
    SOFTWARE_TRACE = 1;
    HARDWARE_TRACE = 2;
    FULL_TRACE = 3;
  }
  TraceLevel trace_level = 1;
  int64 timeout_in_ms = 2;
  int32 inter_op_thread_pool = 3;
  bool output_partition_graphs = 5;
  DebugOptions debug_options = 6;
  bool report_tensor_allocations_upon_oom = 7;

  message Experimental {
    int64 collective_graph_key = 1;
    bool use_run_handler_pool = 2;
  };

  Experimental experimental = 8;

  reserved 4;
}
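As referenced above, here is a minimal Python sketch of how these fields are populated. The field values are arbitrary examples, not recommendations; inter_op_thread_pool and the experimental fields are covered separately in Section 3.

import tensorflow as tf

# Arbitrary example values; every field corresponds to the proto above.
options = tf.RunOptions(
    trace_level=tf.RunOptions.FULL_TRACE,
    timeout_in_ms=60000,
    output_partition_graphs=True,
    report_tensor_allocations_upon_oom=True)

run_metadata = tf.RunMetadata()
# sess.run(fetches, options=options, run_metadata=run_metadata)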
2. RunMetadata Parameters
The RunMetadata fields, like those of RunOptions, are defined in config.proto. RunMetadata is usually used together with the corresponding RunOptions settings to collect tracing information from the execution, such as latency and memory overhead.
message RunMetadata {
  StepStats step_stats = 1;
  CostGraphDef cost_graph = 2;
  repeated GraphDef partition_graphs = 3;
}
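A minimal sketch (the toy graph here is an arbitrary example) of collecting RunMetadata together with RunOptions and inspecting the partition graphs and per-device step stats:

import tensorflow as tf

a = tf.constant([1.0, 2.0])
b = tf.constant([3.0, 4.0])
c = a + b

with tf.Session() as sess:
    options = tf.RunOptions(trace_level=tf.RunOptions.FULL_TRACE,
                            output_partition_graphs=True)
    run_metadata = tf.RunMetadata()
    sess.run(c, options=options, run_metadata=run_metadata)

    # partition_graphs: per-device GraphDefs that were actually executed.
    print(len(run_metadata.partition_graphs))
    # step_stats: per-node timing, grouped by device.
    for dev_stats in run_metadata.step_stats.dev_stats:
        print(dev_stats.device, len(dev_stats.node_stats))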
3. Source Code Analysis
session.h defines the Session::Run() API; the overload that accepts a RunOptions argument is shown below:
virtual Status Run(const RunOptions& run_options,
                   const std::vector<std::pair<string, Tensor> >& inputs,
                   const std::vector<string>& output_tensor_names,
                   const std::vector<string>& target_node_names,
                   std::vector<Tensor>* outputs, RunMetadata* run_metadata);
This section focuses on the source code behind two parameters: inter_op_thread_pool and use_run_handler_pool.
3.1 The inter_op_thread_pool parameter
As described in the earlier document on the NewSession flow, the thread pools created there are stored in the vector thread_pools_.
std::vector<std::pair<thread::ThreadPool*, bool>> thread_pools_;
When Session::Run is called, the parameters are validated first: inter_op_thread_pool must be no less than -1 and smaller than thread_pools_.size(), otherwise an error is returned.
if (run_options.inter_op_thread_pool() < -1 ||
    run_options.inter_op_thread_pool() >=
        static_cast<int32>(thread_pools_.size())) {
  run_state.executors_done.Notify();
  delete barrier;
  return errors::InvalidArgument("Invalid inter_op_thread_pool: ",
                                 run_options.inter_op_thread_pool());
}
For a valid value, TensorFlow uses the specified thread pool for the subsequent computation.
In the TensorFlow 1.12 code, inter_op_thread_pool = -1 is accepted; the computation is then carried out on the caller's (main) thread. Note from the snippet below that when -1 is given but the step involves more than one executor, the default pool thread_pools_[0] is still used.
thread::ThreadPool* pool =
    run_options.inter_op_thread_pool() >= 0
        ? thread_pools_[run_options.inter_op_thread_pool()].first
        : nullptr;

if (pool == nullptr) {
  if (executors_and_keys->items.size() > 1) {
    pool = thread_pools_[0].first;
  } else {
    VLOG(1) << "Executing Session::Run() synchronously!";
  }
}
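To see the parameter end to end, here is a hedged Python sketch (pool sizes and the graph are arbitrary examples): two inter-op thread pools are registered via ConfigProto.session_inter_op_thread_pool at session creation, and RunOptions.inter_op_thread_pool selects one per Run call.

import tensorflow as tf

# Register two inter-op thread pools at session creation time.
config = tf.ConfigProto()
config.session_inter_op_thread_pool.add(num_threads=8)  # pool 0
config.session_inter_op_thread_pool.add(num_threads=2)  # pool 1

a = tf.random_normal([1000, 1000])
b = tf.random_normal([1000, 1000])
c = tf.matmul(a, b)

with tf.Session(config=config) as sess:
    # Run this step on pool 1; -1 would request the caller's thread.
    options = tf.RunOptions(inter_op_thread_pool=1)
    sess.run(c, options=options)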
3.2 use_run_handler_pool
This configuration parameter only takes effect when the global thread pool is used.
Note: an introduction to the global thread pool can be found at http://www.reibang.com/p/e9fd4f0d6bd1
std::unique_ptr<RunHandler> handler;
if (ShouldUseRunHandlerPool() &&
    run_options.experimental().use_run_handler_pool()) {
  handler = GetOrCreateRunHandlerPool(options_)->Get();
}
auto* handler_ptr = handler.get();
The setting mainly affects the RunHandler used during Session::Run(); the class is defined in tensorflow/core/framework/run_handler.h:
class RunHandler {
 public:
  void ScheduleInterOpClosure(std::function<void()> fn);

  ~RunHandler();

 private:
  class Impl;
  friend class RunHandlerPool::Impl;

  explicit RunHandler(Impl* impl);

  Impl* impl_;  // NOT OWNED.
};
When use_run_handler_pool is set, a RunHandler is obtained through GetOrCreateRunHandlerPool. A single global RunHandlerPool is maintained across sessions, which is where the performance benefit comes from.
static RunHandlerPool* GetOrCreateRunHandlerPool(
    const SessionOptions& options) {
  static RunHandlerPool* pool =
      new RunHandlerPool(NumInterOpThreadsFromSessionOptions(options));
  return pool;
}
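A minimal Python sketch of enabling the flag. Based on the checks above, it is assumed here that the session keeps the default global thread pool (no use_per_session_threads and no session_inter_op_thread_pool entries); otherwise ShouldUseRunHandlerPool() will not take this path.

import tensorflow as tf

a = tf.random_normal([1000, 1000])
b = tf.random_normal([1000, 1000])
c = tf.matmul(a, b)

# Default session config: the global inter-op thread pool is used.
with tf.Session() as sess:
    options = tf.RunOptions(
        experimental=tf.RunOptions.Experimental(use_run_handler_pool=True))
    sess.run(c, options=options)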
4. Usage Examples
4.1 timeline
The runtime trace can be saved as a JSON file and analyzed by opening it in chrome://tracing:
import tensorflow as tf
from tensorflow.python.client import timeline

a = tf.random_normal([1, 100])
b = tf.random_normal([1, 100])
res = tf.add(a, b)

with tf.Session() as sess:
    options = tf.RunOptions(trace_level=tf.RunOptions.FULL_TRACE)
    run_metadata = tf.RunMetadata()
    sess.run(res, options=options, run_metadata=run_metadata)

    # Convert the collected step stats into the Chrome trace (JSON) format.
    fetched_timeline = timeline.Timeline(run_metadata.step_stats)
    chrome_trace = fetched_timeline.generate_chrome_trace_format()
    with open('timeline.json', 'w') as f:
        f.write(chrome_trace)
To merge the traces from multiple session.run calls, the TimeLiner class below can be used: after each session.run, convert the trace to its Chrome trace JSON string, pass it to TimeLiner's update_timeline method, and finally call save to write the merged timeline to a JSON file (a usage sketch follows the class definition):
import json


class TimeLiner:
    _timeline_dict = None

    def update_timeline(self, chrome_trace):
        chrome_trace_dict = json.loads(chrome_trace)
        if self._timeline_dict is None:
            # First trace: keep everything, including metadata events.
            self._timeline_dict = chrome_trace_dict
        else:
            # Later traces: append only the timed events.
            for event in chrome_trace_dict['traceEvents']:
                if 'ts' in event:
                    self._timeline_dict['traceEvents'].append(event)

    def save(self, f_name):
        print(f_name)
        with open(f_name, 'w') as f:
            json.dump(self._timeline_dict, f)
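A usage sketch, reusing res, the timeline module, and the FULL_TRACE options from 4.1 (the number of runs and the output file name are arbitrary choices):

many_runs_timeline = TimeLiner()

with tf.Session() as sess:
    options = tf.RunOptions(trace_level=tf.RunOptions.FULL_TRACE)
    for _ in range(5):
        run_metadata = tf.RunMetadata()
        sess.run(res, options=options, run_metadata=run_metadata)
        fetched_timeline = timeline.Timeline(run_metadata.step_stats)
        chrome_trace = fetched_timeline.generate_chrome_trace_format()
        many_runs_timeline.update_timeline(chrome_trace)

many_runs_timeline.save('timeline_merged.json')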