前言
今天我們來(lái)演示MOVICS包的第二個(gè)模塊乐设,在上一篇推文中:MOVICS系列教程(一) GET Module分析后佑力,我們得到了乳腺癌的5個(gè)亞型澈圈,那么此模塊就是為了對(duì)這5種亞型間的分子特征進(jìn)行展示划提。
因?yàn)樾枰弦徊糠值妮敵鼋Y(jié)果挠锥,大家必須跑完上一篇推文的代碼才可以進(jìn)行本篇推文的演示代碼十兢。
主要函數(shù)
同樣的餐济,我們先來(lái)看一下這個(gè)模塊用到的函數(shù):
compSurv(): compare survival outcome and generate a Kalan-Meier curve with pairwise comparison if possible
compClinvar(): compare and summarize clinical features among different identified subtypes
compMut(): compare mutational frequency and generate an OncoPrint with significant mutations
compTMB(): compare total mutation burden among subtypes and generate distribution of Transitions and Transversions
compFGA(): compare fraction genome altered among subtypes and generate a barplot for distribution comparison
compDrugsen(): compare estimated half maximal inhibitory concentration (IC50) for drug sensitivity and generate a boxviolin for distribution comparison
compAgree(): compare agreement of current subtypes with other pre-existed classifications and generate an alluvial diagram and an agreement barplot
主要是通過(guò)比較各亞型間腫瘤的主要特征(生存分析耘擂,臨床特征,突變狀態(tài)絮姆,TMB醉冤,藥敏和一致性)秩霍,來(lái)揭示各亞型間不同的分子特征。
代碼演示
# survival comparison
surv.brca <- compSurv(moic.res = cmoic.brca,
surv.info = surv.info,
convt.time = "m", # convert day unit to month
surv.median.line = "h", # draw horizontal line at median survival
xyrs.est = c(5,10), # estimate 5 and 10-year survival
fig.name = "KAPLAN-MEIER CURVE OF CONSENSUSMOIC")
# survival comparison
surv.brca <- compSurv(moic.res = cmoic.brca,
surv.info = surv.info,
convt.time = "m", # convert day unit to month
surv.median.line = "h", # draw horizontal line at median survival
xyrs.est = c(5,10), # estimate 5 and 10-year survival
fig.name = "KAPLAN-MEIER CURVE OF CONSENSUSMOIC")
# mutational frequency comparison
mut.brca <- compMut(moic.res = cmoic.brca,
mut.matrix = brca.tcga$mut.status, # binary somatic mutation matrix
doWord = TRUE, # generate table in .docx format
doPlot = TRUE, # draw OncoPrint
freq.cutoff = 0.05, # keep those genes that mutated in at least 5% of samples
p.adj.cutoff = 0.05, # keep those genes with adjusted p value < 0.05 to draw OncoPrint
innerclust = TRUE, # perform clustering within each subtype
annCol = annCol, # same annotation for heatmap
annColors = annColors, # same annotation color for heatmap
width = 6,
height = 2,
fig.name = "ONCOPRINT FOR SIGNIFICANT MUTATIONS",
tab.name = "INDEPENDENT TEST BETWEEN SUBTYPE AND MUTATION")
# compare TMB
tmb.brca <- compTMB(moic.res = cmoic.brca,
maf = maf,
rmDup = TRUE, # remove duplicated variants per sample
rmFLAGS = FALSE, # keep FLAGS mutations
exome.size = 38, # estimated exome size
test.method = "nonparametric", # statistical testing method
fig.name = "DISTRIBUTION OF TMB AND TITV")
# compare FGA, FGG, and FGL
fga.brca <- compFGA(moic.res = cmoic.brca,
segment = segment,
iscopynumber = FALSE, # this is a segmented copy number file
cnathreshold = 0.2, # threshold to determine CNA gain or loss
test.method = "nonparametric", # statistical testing method
fig.name = "BARPLOT OF FGA")
# drug sensitivity comparison
drug.brca <- compDrugsen(moic.res = cmoic.brca,
norm.expr = fpkm[,cmoic.brca$clust.res$samID], # double guarantee sample order
drugs = c("Cisplatin", "Paclitaxel"), # a vector of names of drug in GDSC
tissueType = "breast", # choose specific tissue type to construct ridge regression model
test.method = "nonparametric", # statistical testing method
prefix = "BOXVIOLIN OF ESTIMATED IC50")
# customize the factor level for pstage
surv.info$pstage <- factor(surv.info$pstage, levels = c("TX","T1","T2","T3","T4"))
# agreement comparison (support up to 6 classifications include current subtype)
agree.brca <- compAgree(moic.res = cmoic.brca,
subt2comp = surv.info[,c("PAM50","pstage")],
doPlot = TRUE,
box.width = 0.2,
fig.name = "AGREEMENT OF CONSENSUSMOIC WITH PAM50 AND PSTAGE")
總結(jié)
相信你已經(jīng)被上述各種炫酷的圖片吸引住了蚁阳,但是截止到目前為止前域,我們?nèi)匀恢皇菑谋硇蜕险页鋈橄侔┑母鱽喰烷g分子功能的不同。而如果想更深入的挖掘其背后機(jī)制韵吨,就需要找出各亞型間這些差異表達(dá)的基因是哪些,這就是MOVICS第三個(gè)模塊的作用了移宅,Immugent將會(huì)在下一次推文中進(jìn)行介紹归粉,敬請(qǐng)期待!