hello,大家好肋坚,隨著10X單細(xì)胞、10X空間轉(zhuǎn)錄組如火如荼的進(jìn)行中肃廓,我們的分析內(nèi)容和手段也要進(jìn)入深水區(qū)了智厌,很多深入和細(xì)節(jié)的分析需要我們格外注意了,今天我們來(lái)分享兩個(gè)非常好的點(diǎn)亿昏,希望大家能夠深入分析自己的數(shù)據(jù),發(fā)大文章档礁。
首先第一點(diǎn)角钩,Spatial Correlation Analysis,其實(shí)這個(gè)談過(guò)好幾次了呻澜,文章在10X空間轉(zhuǎn)錄組之共定位分析(細(xì)胞類(lèi)型和配受體基因)递礼,10X空間轉(zhuǎn)錄組之基因的空間表達(dá)模式,10X空間轉(zhuǎn)錄組(10X單細(xì)胞)之論細(xì)胞通訊空間分布的重要性等羹幸。這一次我們?cè)谖恼?a target="_blank">Multimodal Analysis of Composition and Spatial Architecture in Human Squamous Cell Carcinoma分享一些很經(jīng)典和值得注意的方法脊髓,大家一定要重點(diǎn)關(guān)注。
We reasoned that genes expressed in adjacent spots in ST were potentially meaningful and that a simple correlation of genes across spots could overlook this adjacency structure within the data(在ST的相鄰斑點(diǎn)中表達(dá)的基因具有潛在的意義栅受,并且各個(gè)斑點(diǎn)之間的基因簡(jiǎn)單相關(guān)可能會(huì)忽略數(shù)據(jù)中的這種鄰接結(jié)構(gòu)将硝,這個(gè)地方已經(jīng)多次強(qiáng)調(diào)過(guò),希望引起大家的重視 ). Thus, we calculated average normalized gene expression(均一化的數(shù)據(jù)) across a ‘‘sliding window’’ of spot groups consisting of a central spot surrounding by its N nearest neighbors(臨近spot), where N = 4 in the original ST data and N = 6 in Visium samples for each spot in the tissue, generating a matrix of genes by average spot group expression across all spots
(重點(diǎn)關(guān)注屏镊,臨近spot平均之后產(chǎn)生新的矩陣). This matrix can be correlated with any ‘‘a(chǎn)nchoring’’ gene of interest (FOXP3 in our case) by calculating pairwise Pearson correlations of the FOXP3 expression vector across all spots and the gene average group expression vectors across spots(這個(gè)地方體現(xiàn)其準(zhǔn)備的價(jià)值). These values reflect if the expression of a gene in the area surrounding the anchoring gene is correlated with the expression of the anchoring gene and termed ‘‘spatial gene correlation’’ with FOXP3 .(空間基因的相關(guān)性)依疼。
關(guān)于空間基因的相關(guān)性分析,多次的強(qiáng)調(diào)過(guò)而芥,因?yàn)榻M織有一個(gè)有序的“實(shí)體”律罢,組織上的細(xì)胞類(lèi)型,基因表達(dá)的分布都有其深刻的生物學(xué)意義棍丐,一定要重點(diǎn)關(guān)注误辑。
第二個(gè)分析點(diǎn),cellphoneDB與NicheNet聯(lián)合進(jìn)行細(xì)胞通訊分析歌逢,這個(gè)方法相當(dāng)經(jīng)典
Ligand-receptor interactions were inferred using a similar approach as previously described (Vento-Tormo et al., 2018)(這個(gè)地方就是cellphoneDB的分析結(jié)果). We first calculated average expression of ligand and receptor pairs across cell type pairs in normalized scRNA-seq data from an aggregate of the seven patient tumor samples containing TSK cells(老套路). We only considered genes with more than 10% of cells demonstrating expression within each cell type considered. We calculated a null distribution for average ligand-receptor by shuffling cell identities in the aggregated data and re-calculating ligand-receptor average pair expression across 1,000 permutations of randomized cell identities. The P value was the number of randomized pairs exceeding the observed data. For bar plots shown in Figures 6B and 6C, in addition to including only ligand-receptor pairs with p < 0.001, we further thresholded individual ligand or receptor expression with a cutoff of average expression > 0.2 (in log space). The 0.2 cutoff was determined by calculating the average log gene expression distribution for all genes across each cell type, and genes expressed at or above this cutoff corresponded with the top 12% or higher of expressed genes for each cell type.(這個(gè)地方就是cellphoneDB的一般流程)巾钉。
For NicheNet analysis, we derived TME cell type signatures by taking the top 100 differentially expressed genes in cells isolated from tumors or normal skin, including B cells, endothelial cells, fibroblasts, Langerhans cells, plasmacytoid DCs, CD1C DCs, CLEC9A DCs, T cells, NK cells, macrophages, and MDSCs(熟悉這個(gè)軟件的同學(xué)應(yīng)該不陌生,需要輸入靶基因列表秘案,但是這個(gè)靶基因的選擇很有講究睛琳,不是簡(jiǎn)單的cluster之間的差異盒蟆。)。 We input these signatures into NicheNet to derive a union set of predicted ligands modulating tumor-specific TME cell type signatures(依據(jù)靶基因預(yù)測(cè)配體). For ligands predicting TSK modulation, we input the top 100 TSK-differentially expressed genes . The top 15% of predicted ligands (配體的挑選)by regulatory potential that also demonstrated significance in our scRNA-seq ligand-receptor interaction analysis .we used the FindAllMarkers function in Seurat to generate average logFC values per cell type compared to other cell types from the scRNAseq data.(千萬(wàn)注意)师骗。
For ligand-receptor spatial transcriptomic proximity analysis, the average value of all ligand-receptor pairs across the leading edge from the eight sections from patients 2, 4, and 10 were calculated first by averaging the ligand and receptor expression among each leading edge spot and its 4-6 nearest neighbors (depending on ST technology), and then taking the average values of all of these groups of five or seven spots across the leading edge. This calculation for each ligand-receptor pair was then performed on 1,000 randomized permutations of spot identities while preserving total number of spots per replicate section to generate a null distribution per patient. P value was calculated by number of randomized permutation calculations that exceeded the true average.(邊界分析)历等。
簡(jiǎn)單總結(jié)一下,cellphoneDB分析配受體辟癌,依據(jù)感興趣的靶基因寒屯,通過(guò)NicheNet分析,挑選高活性的配體黍少,然后再?gòu)腸ellphoneDB里面匹配顯著的配受體對(duì)寡夹,從而達(dá)到分析目的,說(shuō)起來(lái)很簡(jiǎn)單厂置,但真正的操作菩掏,很需要智慧和能力。
生活很好昵济,等你超越