腳本更新----空間轉(zhuǎn)錄組信號(hào)流COMMOT與空間inferCNV（封裝版）

作者律秃，Evil Genius

針對(duì)visium爬橡、bin模式的Stereo-seq、HD等的腳本更新棒动。

對(duì)于單細(xì)胞而言糙申，大家網(wǎng)上找點(diǎn)代碼分析出個(gè)結(jié)果，已經(jīng)很容易了船惨，但是想分析的很好柜裸，還是很難的。

咱國(guó)內(nèi)的盜版能力粱锐，全世界都出了名了疙挺。

大家拿到盜版，千萬自己要學(xué)習(xí)一下卜范，看看有沒有錯(cuò)誤的地方衔统，我當(dāng)然不鼓勵(lì)盜版，但是也沒辦法海雪，尤其課上的內(nèi)容锦爵，一定要忠于數(shù)據(jù)，忠于實(shí)際情況奥裸，不要瞎用险掀，生物信息，生物在前湾宙，背景內(nèi)容一定要了解清楚再去做樟氢。

我自己看了看，盜版很多內(nèi)容都是閹割后的侠鳄，大家自己要有識(shí)別能力埠啃，改過來，還有參數(shù)的配置伟恶，我上課是示例碴开，配的參數(shù)簡(jiǎn)單，別瞎套用，自己的項(xiàng)目要調(diào)整的潦牛。

大家以后工作了眶掌，一定要忠于數(shù)據(jù)，忠于事實(shí)巴碗，生物信息往往都跟病人朴爬、腫瘤、發(fā)育等等相關(guān)橡淆，千萬不要有任何的僥幸心理召噩，小小的差錯(cuò)很容易造成人間悲劇。

今天我們更新腳本逸爵，隨著方法的更新和認(rèn)識(shí)的提升蚣常，腳本也會(huì)隨時(shí)變動(dòng)。

尤其借鑒一些高分文獻(xiàn)痊银，加深了認(rèn)識(shí)之后，腳本也要隨之更新施绎。

蘭衛(wèi)正在招賢納士溯革，一家好的公司，尤其上市公司谷醉，值得大家關(guān)注致稀。

首先是空間版本的inferCNV，空間區(qū)域注釋大家自己做俱尼。

#! user/bin/R
###zhaoyunfei
###Run inferCNV on one ST sample
###2024/11/19

### Parameters ------------------------------------------------------
option_list = list(
      make_option(c("-s", "--sample_id"),
        type="character",
        default=NULL,
        help="Sample name for ST",
        metavar="character"),
      make_option(c("-m", "--infercnv_mode"),
        type="character",
        default="subcluster",
        help="InferCNV analysis_mode sample|subcluster|cell",
        metavar="character"),
      make_option(c("-c", "--label_column"),
        type="character",
        default="tumor_normal",
        help="InferCNV cell_type_file column name",
        metavar="character"),
          make_option(c("-o", "--outdir"),
        type="character",
        default="./",
        help="the outdir",
        metavar="character"),
      make_option(c("-w", "--window_length"),
        type="integer",
        default="151",
        help="InferCNV smoothing window length",
        metavar="integer"),
      make_option(c("-d", "--sim_method"),
        type="character",
        default="meanvar",
        help="InferCNV hspike simulation method",
        metavar="character"),
      make_option(c("-n", "--num_normal"),
        type="integer",
        default="-1",
        help="Number normal spots to give input to inferCNV",
        metavar="integer"),
      make_option(c("-p", "--pnorm_bayes"),
        type="numeric",
        default="0.5",
        help="Bayes posterior probability cutoff for inferCNV",
        metavar="numeric"),
      make_option(c("-r", "--leiden_resolution"),
        type="numeric",
        default="0.05",
        help="Leiden resolution for clustering",
        metavar="numeric"),
      make_option("--output_prefix",
        type="character",
        default="",
        help="Output prefix",
        metavar="character")
      )

    opt_parser = OptionParser(option_list=option_list);
    opt = parse_args(opt_parser)

sample_id=opt$sample_id
infercnv_mode=opt$infercnv_mode
label_column=opt$label_column
window_length=opt$window_length
sim_method=opt$sim_method
num_normal=opt$num_normal
pnorm_bayes=opt$pnorm_bayes
output_prefix=opt$output_prefix

library(infercnv)
library(Seurat)
library(reticulate)
library(googlesheets4)
library(tidyverse)
library(optparse)
# use_condaenv("inferCNV", required=TRUE)
reticulate::py_config()

公司一般參數(shù)是外置的抖单，就是寫Shell腳本配參數(shù)運(yùn)行主腳本，對(duì)于沒有l(wèi)inux系統(tǒng)的情況遇八，參數(shù)可以內(nèi)置矛绘，比如下面寫進(jìn)主腳本。

sample_id = "PT100000000001"
label_column = "tumor_regions"
infercnv_mode = "sample"
output_prefix = ""
window_length = 151
sim_method = "meanvar"
num_normal = 200
pnorm_bayes = 0.3

主腳本

short_label_column = case_when(label_column=="tumor_regions"~"region",
                   label_column=="tumor_normal"~"TN",
                   TRUE~NA_character_)
normal_label = case_when(label_column=="tumor_regions"~"Ref",
             label_column=="tumor_normal"~"Normal",
             TRUE~NA_character_)

if (!file.exists(outdir)) {dir.create(outdir,recursive = TRUE)}

if (infercnv_mode == "sample") {
    output_dir = str_interp("${outdir}/${output_prefix}_${window_length}_${sim_method}_${num_normal}")
} else if (infercnv_mode == "subcluster") {
    output_dir = str_interp("${outdir}/${output_prefix}subcluster_${window_length}_${sim_method}_${num_normal}")
} else if (infercnv_mode == "cell") {
    output_dir = str_interp("${outdir}/${output_prefix}cell_${window_length}_${sim_method}_${num_normal}")
}

sample_outdir = str_interp("${output_dir}/${sample_id}/")
if (!dir.exists(sample_outdir)) {dir.create(sample_outdir)} 

# Don't rerun sample already done
final_mat_filename = str_interp("${sample_outdir}/infercnv.observations.txt")
stopifnot(!file.exists(final_mat_filename))

print(paste0("[inferCNV] Start running for sample: ", sample_id))

# Read metadata from annotation sheet
obj <- read_st_obj(sample_id)
st_meta = read_subclone_tbl(sample_id)
obj = AddMetaData(obj, st_meta[Cells(obj),])
obj$tumor_normal = ifelse(obj$Filtered_tumor_regions==0, "Normal", "Tumor")
if (label_column == "Filtered_tumor_regions") {
  subclone_color = create_subclone_color(obj$Filtered_tumor_regions)
  subclone_color["0"] = "grey75" 
  subclone_color["Ref"] = NA
} else if (label_column == "tumor_normal") {
  subclone_color = c("Tumor"= as.character(bound.color["Tumor"]), "Normal"=as.character(bound.color["TME"]))
}
Idents(obj) <- label_column
table(Idents(obj))

counts_matrix = GetAssayData(object = obj, slot = "counts")
####gene order file
gene_order = read.delim(file = "gencode_v32_gene_name.txt", header=FALSE, stringsAsFactors = FALSE, sep="\t") 

cell_type_file = obj@meta.data %>% select(!!label_column)
gene_order_file = gene_order %>% column_to_rownames("V1")
Idents(obj) = label_column  

##### select normal spots with low estimated purity #####
if (num_normal > 0) {
    threshold <- sort(obj$Purity[obj$Filtered_tumor_regions==0])[num_normal]
    if (is.na(threshold)) {
    threshold = sort(obj$Purity)[num_normal]
        obj$Filtered_tumor_regions = ifelse(obj$Purity<=threshold, "Ref", obj$Filtered_tumor_regions)
    }
    obj$Filtered_tumor_regions = case_when(
        obj$Purity <= threshold & obj$Filtered_tumor_regions==0 ~ "Ref",
    TRUE ~ obj$Filtered_tumor_regions
    )
    cell_type_file_filtered = obj@meta.data %>% select(!!label_column)

    infercnv_obj = CreateInfercnvObject(raw_counts_matrix=as.matrix(counts_matrix),
                        annotations_file=cell_type_file_filtered,
                        gene_order_file=gene_order_file,
                        ref_group_names=as.vector(normal_label))
} else {
    infercnv_obj = CreateInfercnvObject(raw_counts_matrix=as.matrix(counts_matrix),
                                        annotations_file=cell_type_file,
                                        gene_order_file=gene_order_file,
                                        ref_group_names=as.vector(normal_label))
}
table(obj@meta.data[,label_column])

if (!dir.exists(str_interp("${output_dir}/infercnv_label"))) {dir.create(str_interp("${output_dir}/infercnv_label"))} 

p_HE = SpatialPlot(obj, group.by=label_column, alpha=0, crop=F) + 
    NoLegend() + labs(title=sample_id)
p = SpatialPlot(obj, group.by=label_column, stroke=0, label=T, label.size=2, label.box=F, crop=F, alpha=0.8) + 
    scale_fill_manual(values=subclone_color, na.value=NA) + NoLegend() +
    labs(title="InferCNV label")

ggsave(str_interp("${output_dir}/infercnv_label/infercnv_label_${sample_id}.pdf"), plot=p_HE+p, w=6, h=3.5)

infercnv_obj = infercnv::run(infercnv_obj, 
                 window_length = window_length,
                             analysis_mode = infercnv_mode, 
                             cutoff=0.1,  # use 1 for smart-seq, 0.1 for 10x-genomics
                             out_dir=str_interp("${output_dir}/${sample_id}/"), # dir is auto-created for storing outputs 
                             cluster_by_groups=T,   # cluster
                             sim_method=sim_method,
                             plot_steps=T,
                             denoise=T,
                             HMM=T,
                             BayesMaxPNormal = pnorm_bayes, 
                             mask_nonDE_genes = T,
                             resume_mode=F,
                             num_threads = 8)

print(paste0("[inferCNV] Finished running for sample: ", sample_id))

拿到熱圖

拿到CNV矩陣

至于進(jìn)化樹注釋刃永，之前寫過了货矮，大家可以參加文章流程升級(jí)----空間CNV分析流程升級(jí)

接下來COMMOT腳本，python版本,一樣的封裝類型,比較簡(jiǎn)單

還有 44% 的精彩內(nèi)容

?著作權(quán)歸作者所有,轉(zhuǎn)載或內(nèi)容合作請(qǐng)聯(lián)系作者

禁止轉(zhuǎn)載斯够，如需轉(zhuǎn)載請(qǐng)通過簡(jiǎn)信或評(píng)論聯(lián)系作者囚玫。

支付￥50.00 繼續(xù)閱讀

人面猴
序言：七十年代末，一起剝皮案震驚了整個(gè)濱河市读规，隨后出現(xiàn)的幾起案子抓督，更是在濱河造成了極大的恐慌，老刑警劉巖束亏，帶你破解...
沈念sama閱讀 206,013評(píng)論 6贊 481
死咒
序言：濱河連續(xù)發(fā)生了三起死亡事件铃在，死亡現(xiàn)場(chǎng)離奇詭異，居然都是意外死亡枪汪，警方通過查閱死者的電腦和手機(jī)涌穆，發(fā)現(xiàn)死者居然都...
沈念sama閱讀 88,205評(píng)論 2贊 382
救了他兩次的神仙讓他今天三更去死
文/潘曉璐我一進(jìn)店門怔昨，熙熙樓的掌柜王于貴愁眉苦臉地迎上來，“玉大人宿稀，你說我怎么就攤上這事趁舀。” “怎么了祝沸？”我有些...
開封第一講書人閱讀 152,370評(píng)論 0贊 342
道士緝兇錄：失蹤的賣姜人
文/不壞的土叔我叫張陵矮烹，是天一觀的道長(zhǎng)。經(jīng)常有香客問我罩锐，道長(zhǎng)奉狈，這世上最難降的妖魔是什么？我笑而不...
開封第一講書人閱讀 55,168評(píng)論 1贊 278
?港島之戀（遺憾婚禮）
正文為了忘掉前任涩惑，我火速辦了婚禮仁期，結(jié)果婚禮上，老公的妹妹穿的比我還像新娘竭恬。我一直安慰自己跛蛋，他們只是感情好，可當(dāng)我...
茶點(diǎn)故事閱讀 64,153評(píng)論 5贊 371
惡毒庶女頂嫁案：這布局不是一般人想出來的
文/花漫我一把揭開白布痊硕。她就那樣靜靜地躺著赊级，像睡著了一般。火紅的嫁衣襯著肌膚如雪岔绸。梳的紋絲不亂的頭發(fā)上理逊，一...
開封第一講書人閱讀 48,954評(píng)論 1贊 283
城市分裂傳說
那天，我揣著相機(jī)與錄音盒揉，去河邊找鬼晋被。笑死，一個(gè)胖子當(dāng)著我的面吹牛预烙，可吹牛的內(nèi)容都是我干的墨微。我是一名探鬼主播，決...
沈念sama閱讀 38,271評(píng)論 3贊 399
雙鴛鴦連環(huán)套：你想象不到人心有多黑
文/蒼蘭香墨我猛地睜開眼扁掸，長(zhǎng)吁一口氣：“原來是場(chǎng)噩夢(mèng)啊……” “哼翘县！你這毒婦竟也來了？” 一聲冷哼從身側(cè)響起谴分，我...
開封第一講書人閱讀 36,916評(píng)論 0贊 259
萬榮殺人案實(shí)錄
序言：老撾萬榮一對(duì)情侶失蹤锈麸，失蹤者是張志新（化名）和其女友劉穎，沒想到半個(gè)月后牺蹄，有當(dāng)?shù)厝嗽跇淞掷锇l(fā)現(xiàn)了一具尸體忘伞，經(jīng)...
沈念sama閱讀 43,382評(píng)論 1贊 300
?護(hù)林員之死
正文獨(dú)居荒郊野嶺守林人離奇死亡，尸身上長(zhǎng)有42處帶血的膿包…… 初始之章·張勛以下內(nèi)容為張勛視角年9月15日...
茶點(diǎn)故事閱讀 35,877評(píng)論 2贊 323
?白月光啟示錄
正文我和宋清朗相戀三年，在試婚紗的時(shí)候發(fā)現(xiàn)自己被綠了氓奈。大學(xué)時(shí)的朋友給我發(fā)了我未婚夫和他白月光在一起吃飯的照片翘魄。...
茶點(diǎn)故事閱讀 37,989評(píng)論 1贊 333
活死人
序言：一個(gè)原本活蹦亂跳的男人離奇死亡，死狀恐怖，靈堂內(nèi)的尸體忽然破棺而出，到底是詐尸還是另有隱情哟玷，我是刑警寧澤檩电，帶...
沈念sama閱讀 33,624評(píng)論 4贊 322
?日本核電站爆炸內(nèi)幕
正文年R本政府宣布伙菜，位于F島的核電站，受9級(jí)特大地震影響，放射性物質(zhì)發(fā)生泄漏。R本人自食惡果不足惜腹躁，卻給世界環(huán)境...
茶點(diǎn)故事閱讀 39,209評(píng)論 3贊 307
男人毒藥：我在死后第九天來索命
文/蒙蒙一、第九天我趴在偏房一處隱蔽的房頂上張望南蓬。院中可真熱鬧纺非，春花似錦、人聲如沸赘方。這莊子的主人今日做“春日...
開封第一講書人閱讀 30,199評(píng)論 0贊 19
一樁弒父案，背后竟有這般陰謀
文/蒼蘭香墨我抬頭看了看天上的太陽蒜焊。三九已至，卻和暖如春科贬，著一層夾襖步出監(jiān)牢的瞬間泳梆，已是汗流浹背。一陣腳步聲響...
開封第一講書人閱讀 31,418評(píng)論 1贊 260
情欲美人皮
我被黑心中介騙來泰國(guó)打工榜掌，沒想到剛下飛機(jī)就差點(diǎn)兒被人妖公主榨干…… 1. 我叫王不留优妙，地道東北人。一個(gè)月前我還...
沈念sama閱讀 45,401評(píng)論 2贊 352
代替公主和親
正文我出身青樓憎账，卻偏偏與公主長(zhǎng)得像套硼，于是被迫代替她去往敵國(guó)和親。傳聞我的和親對(duì)象是個(gè)殘疾皇子胞皱，可洞房花燭夜當(dāng)晚...
茶點(diǎn)故事閱讀 42,700評(píng)論 2贊 345

腳本更新----空間轉(zhuǎn)錄組信號(hào)流COMMOT與空間inferCNV（封裝版）

作者律秃，Evil Genius

針對(duì)visium爬橡、bin模式的Stereo-seq、HD等的腳本更新棒动。

對(duì)于單細(xì)胞而言糙申，大家網(wǎng)上找點(diǎn)代碼分析出個(gè)結(jié)果，已經(jīng)很容易了船惨，但是想分析的很好柜裸，還是很難的。

咱國(guó)內(nèi)的盜版能力粱锐，全世界都出了名了疙挺。

我自己看了看，盜版很多內(nèi)容都是閹割后的侠鳄，大家自己要有識(shí)別能力埠啃，改過來，還有參數(shù)的配置伟恶，我上課是示例碴开，配的參數(shù)簡(jiǎn)單，別瞎套用，自己的項(xiàng)目要調(diào)整的潦牛。

大家以后工作了眶掌，一定要忠于數(shù)據(jù)，忠于事實(shí)巴碗，生物信息往往都跟病人朴爬、腫瘤、發(fā)育等等相關(guān)橡淆，千萬不要有任何的僥幸心理召噩，小小的差錯(cuò)很容易造成人間悲劇。

今天我們更新腳本逸爵，隨著方法的更新和認(rèn)識(shí)的提升蚣常，腳本也會(huì)隨時(shí)變動(dòng)。

尤其借鑒一些高分文獻(xiàn)痊银，加深了認(rèn)識(shí)之后，腳本也要隨之更新施绎。

蘭衛(wèi)正在招賢納士溯革，一家好的公司，尤其上市公司谷醉，值得大家關(guān)注致稀。

首先是空間版本的inferCNV，空間區(qū)域注釋大家自己做俱尼。

公司一般參數(shù)是外置的抖单，就是寫Shell腳本配參數(shù)運(yùn)行主腳本，對(duì)于沒有l(wèi)inux系統(tǒng)的情況遇八，參數(shù)可以內(nèi)置矛绘，比如下面寫進(jìn)主腳本。

主腳本

拿到熱圖

拿到CNV矩陣

至于進(jìn)化樹注釋刃永，之前寫過了货矮，大家可以參加文章流程升級(jí)----空間CNV分析流程升級(jí)

接下來COMMOT腳本，python版本,一樣的封裝類型,比較簡(jiǎn)單

推薦閱讀更多精彩內(nèi)容