HMaster無法啟動(dòng)(報(bào)錯(cuò): Cannot recoverLease)

一宪卿、報(bào)錯(cuò):

關(guān)鍵字:

Cannot recoverLease

Failed to recover lease

master.HMaster: Failed to become active master

Cannot obtain block length for LocatedBlock

報(bào)錯(cuò)堆棧:

2019-10-31 18:34:48,318 INFO [cloud1:16000.activeMasterManager] balancer.StochasticLoadBalancer: loading config

2019-10-31 18:34:48,389 INFO? [cloud1:16000.activeMasterManager] master.HMaster: Server active/primary master=cloud1,16000,1572518079864, sessionid=0x26e215f68470001, setting cluster-up flag (Was=false)

2019-10-31 18:34:48,392 INFO? [master/cloud1/172.18.49.221:16000] regionserver.HRegionServer: ClusterId : 6f9b6fa5-d0aa-426e-bd0c-e7b12d437a2d

2019-10-31 18:34:48,438 INFO? [cloud1:16000.activeMasterManager] procedure.ZKProcedureUtil: Clearing all procedure znodes: /hbase/flush-table-proc/acquired /hbase/flush-table-proc/reached /hbase/flush-table-proc/abort

2019-10-31 18:34:48,484 INFO? [cloud1:16000.activeMasterManager] procedure.ZKProcedureUtil: Clearing all procedure znodes: /hbase/online-snapshot/acquired /hbase/online-snapshot/reached /hbase/online-snapshot/abort

2019-10-31 18:34:48,533 INFO? [cloud1:16000.activeMasterManager] master.MasterCoprocessorHost: System coprocessor loading is enabled

2019-10-31 18:34:48,555 INFO? [cloud1:16000.activeMasterManager] procedure2.ProcedureExecutor: Starting procedure executor threads=13

2019-10-31 18:34:48,556 INFO? [cloud1:16000.activeMasterManager] wal.WALProcedureStore: Starting WAL Procedure Store lease recovery

2019-10-31 18:34:48,559 INFO? [cloud1:16000.activeMasterManager] util.FSHDFSUtils: Recover lease on dfs file hdfs://mycluster/hbase/MasterProcWALs/state-00000000000000000001.log

2019-10-31 18:34:48,588 INFO? [cloud1:16000.activeMasterManager] util.FSHDFSUtils: Failed to recover lease, attempt=0 on file=hdfs://mycluster/hbase/MasterProcWALs/state-00000000000000000001.log after 29ms

2019-10-31 18:34:52,608 INFO? [cloud1:16000.activeMasterManager] util.FSHDFSUtils: Failed to recover lease, attempt=1 on file=hdfs://mycluster/hbase/MasterProcWALs/state-00000000000000000001.log after 4049ms

2019-10-31 18:35:56,728 INFO? [cloud1:16000.activeMasterManager] util.FSHDFSUtils: Failed to recover lease, attempt=2 on file=hdfs://mycluster/hbase/MasterProcWALs/state-00000000000000000001.log after 68169ms

2019-10-31 18:38:05,209 INFO? [cloud1:16000.activeMasterManager] util.FSHDFSUtils: Failed to recover lease, attempt=3 on file=hdfs://mycluster/hbase/MasterProcWALs/state-00000000000000000001.log after 196650ms

2019-10-31 18:41:17,427 INFO? [cloud1:16000.activeMasterManager] util.FSHDFSUtils: Failed to recover lease, attempt=4 on file=hdfs://mycluster/hbase/MasterProcWALs/state-00000000000000000001.log after 388868ms

2019-10-31 18:45:33,913 INFO? [cloud1:16000.activeMasterManager] util.FSHDFSUtils: Failed to recover lease, attempt=5 on file=hdfs://mycluster/hbase/MasterProcWALs/state-00000000000000000001.log after 645354ms

2019-10-31 18:49:46,673 ERROR [ActiveMasterInitializationMonitor-1572518086672] master.HMaster: Master failed to complete initialization after 900000ms. Please consider submitting a bug report including a thread dump of this process.

2019-10-31 18:50:54,185 INFO? [cloud1:16000.activeMasterManager] util.FSHDFSUtils: Failed to recover lease, attempt=6 on file=hdfs://mycluster/hbase/MasterProcWALs/state-00000000000000000001.log after 965626ms

2019-10-31 18:50:54,185 WARN? [cloud1:16000.activeMasterManager] util.FSHDFSUtils: Cannot recoverLease after trying for 900000ms (hbase.lease.recovery.timeout); continuing, but may be DATALOSS!!!; attempt=6 on file=hdfs://mycluster/hbase/MasterProcWALs/state-00000000000000000001.log after 965626ms

2019-10-31 18:50:54,450 ERROR [cloud1:16000.activeMasterManager] wal.WALProcedureStore: Unable to read state log: FileStatus{path=hdfs://mycluster/hbase/MasterProcWALs/state-00000000000000000001.log; isDirectory=false; length=132; replication=3; blocksize=67108864; modification_time=1572514780827; access_time=1572514780827; owner=cloud; group=supergroup; permission=rw-r--r--; isSymlink=false}

java.io.IOException: Cannot obtain block length for LocatedBlock{BP-2130371621-192.168.120.226-1506852297811:blk_1075210355_1469774; getBlockSize()=132; corrupt=false; offset=0; locs=[172.18.49.224:50010, 172.18.49.222:50010, 172.18.49.223:50010]}

? ? ? ? at org.apache.hadoop.hdfs.DFSInputStream.readBlockLength(DFSInputStream.java:354)

? ? ? ? at org.apache.hadoop.hdfs.DFSInputStream.fetchLocatedBlocksAndGetLastBlockLength(DFSInputStream.java:298)

? ? ? ? at org.apache.hadoop.hdfs.DFSInputStream.openInfo(DFSInputStream.java:235)

? ? ? ? at org.apache.hadoop.hdfs.DFSInputStream.<init>(DFSInputStream.java:228)

? ? ? ? at org.apache.hadoop.hdfs.DFSClient.open(DFSClient.java:1318)

? ? ? ? at org.apache.hadoop.hdfs.DistributedFileSystem$3.doCall(DistributedFileSystem.java:293)

? ? ? ? at org.apache.hadoop.hdfs.DistributedFileSystem$3.doCall(DistributedFileSystem.java:289)

? ? ? ? at org.apache.hadoop.fs.FileSystemLinkResolver.resolve(FileSystemLinkResolver.java:81)

? ? ? ? at org.apache.hadoop.hdfs.DistributedFileSystem.open(DistributedFileSystem.java:289)

? ? ? ? at org.apache.hadoop.fs.FileSystem.open(FileSystem.java:764)

? ? ? ? at org.apache.hadoop.hbase.procedure2.store.wal.ProcedureWALFile.open(ProcedureWALFile.java:67)

? ? ? ? at org.apache.hadoop.hbase.procedure2.store.wal.WALProcedureStore.initOldLog(WALProcedureStore.java:961)

? ? ? ? at org.apache.hadoop.hbase.procedure2.store.wal.WALProcedureStore.initOldLogs(WALProcedureStore.java:924)

? ? ? ? at org.apache.hadoop.hbase.procedure2.store.wal.WALProcedureStore.recoverLease(WALProcedureStore.java:262)

? ? ? ? at org.apache.hadoop.hbase.procedure2.ProcedureExecutor.start(ProcedureExecutor.java:511)

? ? ? ? at org.apache.hadoop.hbase.master.HMaster.startProcedureExecutor(HMaster.java:1224)

? ? ? ? at org.apache.hadoop.hbase.master.HMaster.startServiceThreads(HMaster.java:1136)

? ? ? ? at org.apache.hadoop.hbase.master.HMaster.finishActiveMasterInitialization(HMaster.java:725)

? ? ? ? at org.apache.hadoop.hbase.master.HMaster.access$600(HMaster.java:189)

? ? ? ? at org.apache.hadoop.hbase.master.HMaster$2.run(HMaster.java:1803)

? ? ? ? at java.lang.Thread.run(Thread.java:745)

2019-10-31 18:50:54,452 FATAL [cloud1:16000.activeMasterManager] master.HMaster: Failed to become active master

java.io.IOException: Unable to read state log: FileStatus{path=hdfs://mycluster/hbase/MasterProcWALs/state-00000000000000000001.log; isDirectory=false; length=132; replication=3; blocksize=67108864; modification_time=1572514780827; access_time=1572514780827; owner=cloud; group=supergroup; permission=rw-r--r--; isSymlink=false}

? ? ? ? at org.apache.hadoop.hbase.procedure2.store.wal.WALProcedureStore.initOldLog(WALProcedureStore.java:969)

? ? ? ? at org.apache.hadoop.hbase.procedure2.store.wal.WALProcedureStore.initOldLogs(WALProcedureStore.java:924)

? ? ? ? at org.apache.hadoop.hbase.procedure2.store.wal.WALProcedureStore.recoverLease(WALProcedureStore.java:262)

? ? ? ? at org.apache.hadoop.hbase.procedure2.ProcedureExecutor.start(ProcedureExecutor.java:511)

? ? ? ? at org.apache.hadoop.hbase.master.HMaster.startProcedureExecutor(HMaster.java:1224)

? ? ? ? at org.apache.hadoop.hbase.master.HMaster.startServiceThreads(HMaster.java:1136)

? ? ? ? at org.apache.hadoop.hbase.master.HMaster.finishActiveMasterInitialization(HMaster.java:725)

? ? ? ? at org.apache.hadoop.hbase.master.HMaster.access$600(HMaster.java:189)

? ? ? ? at org.apache.hadoop.hbase.master.HMaster$2.run(HMaster.java:1803)

? ? ? ? at java.lang.Thread.run(Thread.java:745)

Caused by: java.io.IOException: Cannot obtain block length for LocatedBlock{BP-2130371621-192.168.120.226-1506852297811:blk_1075210355_1469774; getBlockSize()=132; corrupt=false; offset=0; locs=[172.18.49.224:50010, 172.18.49.222:50010, 172.18.49.223:50010]}

? ? ? ? at org.apache.hadoop.hdfs.DFSInputStream.readBlockLength(DFSInputStream.java:354)

? ? ? ? at org.apache.hadoop.hdfs.DFSInputStream.fetchLocatedBlocksAndGetLastBlockLength(DFSInputStream.java:298)

? ? ? ? at org.apache.hadoop.hdfs.DFSInputStream.openInfo(DFSInputStream.java:235)

? ? ? ? at org.apache.hadoop.hdfs.DFSInputStream.<init>(DFSInputStream.java:228)

? ? ? ? at org.apache.hadoop.hdfs.DFSClient.open(DFSClient.java:1318)

? ? ? ? at org.apache.hadoop.hdfs.DistributedFileSystem$3.doCall(DistributedFileSystem.java:293)

? ? ? ? at org.apache.hadoop.hdfs.DistributedFileSystem$3.doCall(DistributedFileSystem.java:289)

? ? ? ? at org.apache.hadoop.fs.FileSystemLinkResolver.resolve(FileSystemLinkResolver.java:81)

? ? ? ? at org.apache.hadoop.hdfs.DistributedFileSystem.open(DistributedFileSystem.java:289)

? ? ? ? at org.apache.hadoop.fs.FileSystem.open(FileSystem.java:764)

? ? ? ? at org.apache.hadoop.hbase.procedure2.store.wal.ProcedureWALFile.open(ProcedureWALFile.java:67)

? ? ? ? at org.apache.hadoop.hbase.procedure2.store.wal.WALProcedureStore.initOldLog(WALProcedureStore.java:961)


二金句、原因:

1瓶蚂、可能是由于集群服務(wù)器斷電導(dǎo)致(我的情況就是這樣慈鸠,前一天服務(wù)器全部斷電确镊。)

2拨与、其他未知原因瓦胎。


三猿规、解決辦法:

解決辦法是需要?jiǎng)h掉hdfs上的/hbase/MasterProcWALs和/hbase/WALs目錄下的所有文件(如果擔(dān)心出問題衷快,可以先備份這兩個(gè)文件夾,hdfs dfs -cp /hbase/WALs /hbase/WALs.bak)姨俩,當(dāng)然我也刪了zk中/hbase蘸拔,然后啟動(dòng)hbase即可。


四环葵、參考文章:

1调窍、apache zookeeper - HBase Master won't start - Stack Overflow

最后編輯于
?著作權(quán)歸作者所有,轉(zhuǎn)載或內(nèi)容合作請(qǐng)聯(lián)系作者
  • 序言:七十年代末,一起剝皮案震驚了整個(gè)濱河市张遭,隨后出現(xiàn)的幾起案子邓萨,更是在濱河造成了極大的恐慌,老刑警劉巖菊卷,帶你破解...
    沈念sama閱讀 219,039評(píng)論 6 508
  • 序言:濱河連續(xù)發(fā)生了三起死亡事件缔恳,死亡現(xiàn)場(chǎng)離奇詭異,居然都是意外死亡洁闰,警方通過查閱死者的電腦和手機(jī)歉甚,發(fā)現(xiàn)死者居然都...
    沈念sama閱讀 93,426評(píng)論 3 395
  • 文/潘曉璐 我一進(jìn)店門,熙熙樓的掌柜王于貴愁眉苦臉地迎上來扑眉,“玉大人纸泄,你說我怎么就攤上這事⊙兀” “怎么了聘裁?”我有些...
    開封第一講書人閱讀 165,417評(píng)論 0 356
  • 文/不壞的土叔 我叫張陵,是天一觀的道長(zhǎng)耸弄。 經(jīng)常有香客問我咧虎,道長(zhǎng),這世上最難降的妖魔是什么计呈? 我笑而不...
    開封第一講書人閱讀 58,868評(píng)論 1 295
  • 正文 為了忘掉前任砰诵,我火速辦了婚禮穿撮,結(jié)果婚禮上昆咽,老公的妹妹穿的比我還像新娘。我一直安慰自己,他們只是感情好诡宗,可當(dāng)我...
    茶點(diǎn)故事閱讀 67,892評(píng)論 6 392
  • 文/花漫 我一把揭開白布怀挠。 她就那樣靜靜地躺著乃摹,像睡著了一般云茸。 火紅的嫁衣襯著肌膚如雪。 梳的紋絲不亂的頭發(fā)上妹萨,一...
    開封第一講書人閱讀 51,692評(píng)論 1 305
  • 那天年枕,我揣著相機(jī)與錄音,去河邊找鬼乎完。 笑死熏兄,一個(gè)胖子當(dāng)著我的面吹牛,可吹牛的內(nèi)容都是我干的树姨。 我是一名探鬼主播摩桶,決...
    沈念sama閱讀 40,416評(píng)論 3 419
  • 文/蒼蘭香墨 我猛地睜開眼,長(zhǎng)吁一口氣:“原來是場(chǎng)噩夢(mèng)啊……” “哼帽揪!你這毒婦竟也來了硝清?” 一聲冷哼從身側(cè)響起,我...
    開封第一講書人閱讀 39,326評(píng)論 0 276
  • 序言:老撾萬榮一對(duì)情侶失蹤转晰,失蹤者是張志新(化名)和其女友劉穎芦拿,沒想到半個(gè)月后,有當(dāng)?shù)厝嗽跇淞掷锇l(fā)現(xiàn)了一具尸體查邢,經(jīng)...
    沈念sama閱讀 45,782評(píng)論 1 316
  • 正文 獨(dú)居荒郊野嶺守林人離奇死亡防嗡,尸身上長(zhǎng)有42處帶血的膿包…… 初始之章·張勛 以下內(nèi)容為張勛視角 年9月15日...
    茶點(diǎn)故事閱讀 37,957評(píng)論 3 337
  • 正文 我和宋清朗相戀三年,在試婚紗的時(shí)候發(fā)現(xiàn)自己被綠了侠坎。 大學(xué)時(shí)的朋友給我發(fā)了我未婚夫和他白月光在一起吃飯的照片。...
    茶點(diǎn)故事閱讀 40,102評(píng)論 1 350
  • 序言:一個(gè)原本活蹦亂跳的男人離奇死亡裙盾,死狀恐怖实胸,靈堂內(nèi)的尸體忽然破棺而出,到底是詐尸還是另有隱情番官,我是刑警寧澤庐完,帶...
    沈念sama閱讀 35,790評(píng)論 5 346
  • 正文 年R本政府宣布,位于F島的核電站徘熔,受9級(jí)特大地震影響门躯,放射性物質(zhì)發(fā)生泄漏。R本人自食惡果不足惜酷师,卻給世界環(huán)境...
    茶點(diǎn)故事閱讀 41,442評(píng)論 3 331
  • 文/蒙蒙 一讶凉、第九天 我趴在偏房一處隱蔽的房頂上張望染乌。 院中可真熱鬧,春花似錦懂讯、人聲如沸荷憋。這莊子的主人今日做“春日...
    開封第一講書人閱讀 31,996評(píng)論 0 22
  • 文/蒼蘭香墨 我抬頭看了看天上的太陽勒庄。三九已至,卻和暖如春瘫里,著一層夾襖步出監(jiān)牢的瞬間实蔽,已是汗流浹背。 一陣腳步聲響...
    開封第一講書人閱讀 33,113評(píng)論 1 272
  • 我被黑心中介騙來泰國(guó)打工谨读, 沒想到剛下飛機(jī)就差點(diǎn)兒被人妖公主榨干…… 1. 我叫王不留局装,地道東北人。 一個(gè)月前我還...
    沈念sama閱讀 48,332評(píng)論 3 373
  • 正文 我出身青樓漆腌,卻偏偏與公主長(zhǎng)得像贼邓,于是被迫代替她去往敵國(guó)和親。 傳聞我的和親對(duì)象是個(gè)殘疾皇子闷尿,可洞房花燭夜當(dāng)晚...
    茶點(diǎn)故事閱讀 45,044評(píng)論 2 355