Making Sense of Vision and Touch: Self-Supervised Learning of Multimodal Representations for Cont...

Best Paper Award, ICRA 2019
[pdf] [site] [ppt]

Abstract

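In unstructured environments, contact-rich manipulation tasks usually require both haptic and visual feedback, and manually designing a controller that combines modalities with such different characteristics is non-trivial. Deep reinforcement learning (DRL) has succeeded in learning control policies from high-dimensional inputs, but because of their sample complexity these algorithms are often hard to deploy on real robots. We use self-supervision to learn a compact, multimodal representation of the sensor data, which can then be used to improve the sample efficiency of policy learning. We evaluate our method on a peg insertion task, where it generalizes over different geometries, configurations, and clearances while remaining robust to external perturbations. We present results both in simulation and on a real robot.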

Introduction

Fig. 1: Force sensor readings in the z-axis (height) and visual observations are shown with corresponding stages of a peg insertion task. The force reading transitions from (1) the arm moving in free space to (2) making contact with the box. While aligning the peg, the forces capture the sliding contact dynamics on the box surface (3, 4). Finally, in the insertion stage, the forces peak as the robot attempts to insert the peg at the edge of the hole (5), and decrease when the peg slides into the hole (6).

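The main contributions are:

A multimodal representation learning model from which contact-rich manipulation policies can be learned.

A demonstration of insertion tasks that makes effective use of haptic and visual feedback for hole search, peg alignment, and insertion (see Fig. 1). An ablation study compares the effect of each modality on task performance.

An evaluation of generalization to tasks with different peg geometries, and of robustness to perturbations and sensor noise.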

Multimodal Representation Model

Fig. 2: Neural network architecture for multimodal representation learning with self-supervision. The network takes data from three different sensors as input: RGB images, F/T readings over a 32ms window, and end-effector position and velocity. It encodes and fuses this data into a multimodal representation based on which controllers for contact-rich manipulation can be learned. This representation learning network is trained end-to-end through self-supervision.
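The encode-then-fuse structure described in the caption can be sketched as follows. This is only a toy illustration, not the network from the paper: the linear-plus-ReLU encoders, the concatenation-based fusion, the input sizes (a flattened 8x8 RGB image, 32 six-axis F/T samples, a 6-value position and velocity vector), and all function names are assumptions made for the example.

```python
import random

def random_weights(rows, cols, rng):
    """Hypothetical stand-in for learned parameters."""
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]

def linear(vec, weights):
    """Matrix-vector product: one output value per weight row."""
    return [sum(w * x for w, x in zip(row, vec)) for row in weights]

def encode(features, weights):
    """Toy per-modality encoder: linear layer followed by ReLU."""
    return [max(0.0, y) for y in linear(features, weights)]

def fuse(embeddings, weights):
    """Toy fusion: concatenate modality embeddings, then project linearly."""
    concat = [x for emb in embeddings for x in emb]
    return linear(concat, weights)

rng = random.Random(0)
EMBED_DIM, LATENT_DIM = 8, 16  # assumed sizes, chosen only for illustration

# Assumed toy inputs for the three sensors named in the caption.
image = [rng.random() for _ in range(8 * 8 * 3)]   # flattened RGB image
force = [rng.random() for _ in range(32 * 6)]      # F/T window, 6 axes
proprio = [rng.random() for _ in range(6)]         # position + velocity

embeddings = [
    encode(x, random_weights(EMBED_DIM, len(x), rng))
    for x in (image, force, proprio)
]
z = fuse(embeddings, random_weights(LATENT_DIM, 3 * EMBED_DIM, rng))
print(len(z))  # size of the fused multimodal representation
```

In a real system the encoders would be trained end-to-end with the self-supervised objectives; here the weights are random and serve only to show how data from the three sensors ends up in one vector.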

Policy Learning and Controller Design

Fig. 3: Our controller takes end-effector position displacements from the policy at 20Hz and outputs robot torque commands at 200Hz. The trajectory generator interpolates high-bandwidth robot trajectories from low-bandwidth policy actions. The impedance PD controller tracks the interpolated trajectory. The operational space controller uses the robot dynamics model to transform Cartesian-space accelerations into commanded joint torques. The resulting controller is compliant and reactive.
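The rate mismatch in the caption (20Hz policy actions, 200Hz commands) means each policy action is expanded into 200 / 20 = 10 control steps. A minimal sketch of the first two stages follows, assuming linear interpolation and a plain PD law with made-up gains; the paper's actual interpolation scheme and gains are not given here, and the final operational space mapping to joint torques is omitted because it needs the robot dynamics model.

```python
POLICY_HZ = 20
CONTROL_HZ = 200
STEPS_PER_ACTION = CONTROL_HZ // POLICY_HZ  # 10 control steps per policy action

def interpolate_setpoints(start, displacement, steps=STEPS_PER_ACTION):
    """Linearly interpolate from start to start + displacement.

    Returns `steps` Cartesian setpoints; the last one is the full target.
    Linear interpolation is an assumption made for this sketch.
    """
    return [
        tuple(s + d * (k / steps) for s, d in zip(start, displacement))
        for k in range(1, steps + 1)
    ]

def pd_acceleration(x_des, x, v, kp=100.0, kd=20.0):
    """PD tracking law: a = kp * (x_des - x) - kd * v (gains are hypothetical)."""
    return tuple(kp * (xd - xi) - kd * vi for xd, xi, vi in zip(x_des, x, v))

# One policy action: move 1 cm along x and 2 cm down along z.
setpoints = interpolate_setpoints((0.0, 0.0, 0.1), (0.01, 0.0, -0.02))
accel = pd_acceleration(setpoints[0], (0.0, 0.0, 0.1), (0.0, 0.0, 0.0))
print(len(setpoints), accel)
```

Because the PD law only pulls toward the setpoint with finite stiffness, contact forces deflect the end-effector rather than being fought rigidly, which is the sense in which such a controller is compliant.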

Experiments: Design and Setup

Fig. 4: Simulated Peg Insertion: Ablative study of representations trained on different combinations of sensory modalities. We compare our full model, trained with a combination of visual and haptic feedback and proprioception, with baselines that are trained without vision, or haptics, or either. (b) The graph shows partial task completion rates with different feedback modalities, and we note that both the visual and haptic modalities play an integral role for contact-rich tasks.

Reward Design

Experiments: Results

Real Robot Experiments

Fig. 5: (a) 3D printed pegs used in the real robot experiments and their box clearances. (b) Qualitative predictions: We visualize examples of optical flow predictions from our representation model (using color scheme in [22]). The model predicts different flow maps on the same image conditioned on different next actions indicated by projected arrows.

Fig. 6: Real Robot Peg Insertion: We evaluate our Full Model on the real hardware with different peg shapes, indicated on the x-axis. The learned policies achieve the tasks with a high success rate. We also study transferring the policies and representations from trained pegs to novel peg shapes (last four bars). The robot effectively re-uses previously trained models to solve new tasks.

Last edited: 2019.06.26 11:56:16

