今天跟大家分享的是16年1月發(fā)表在Cell Cycle(IF=3.530)上的一篇perspective前瞻性文章?Reducing GWAS Complexity (https://doi.org/10.1080/15384101.2015.1120928)
一句話總結(jié):用精細(xì)定位來縮小候選功能SNPs的范圍
關(guān)鍵詞:GWAS; SNP; Cancer; chromatin; non-coding DNA; enhancer; fine-mapping
隨著測(cè)序成本的不斷降低和meta分析的應(yīng)用萄凤,復(fù)雜表型相關(guān)的GWAS在大樣本數(shù)據(jù)的處理中越來越強(qiáng)大卤恳,越來越多MAF很低的SNPs被鑒定出來秸歧。這些候選risk SNPs之間由于LD相互關(guān)聯(lián)灾锯,使得給定locus的關(guān)聯(lián)分析變得復(fù)雜,更何況超過90%的risk SNPs位于非編碼的DNA序列中绰垂。
為了解決以上問題室奏,作者結(jié)合染色體的生物特征(chromatin biofeatures)去確認(rèn)GWAS最初發(fā)掘的SNPs的潛在功能。這些SNP被稱為索引SNP (index SNPs)劲装,之后由r2計(jì)算LD后得到的SNPs胧沫,被稱為替代SNP (surrogate SNPs)。許多軟件FunciSNP RegulomeDB, Haploreg, Annovar, IGV以及最新的FunSeq8和motifbreakR9 可以根據(jù)染色體的注釋信息對(duì)risk SNPs進(jìn)行重新定位占业,這會(huì)大大減少SNP的數(shù)目(表1)绒怨。在過去的幾年里,作者把這個(gè)注釋方法成功運(yùn)用到前列腺癌和乳腺癌的risk regions篩選中谦疾。
盡管此方法把候選SNP數(shù)目從50,000篩到了500南蹂,再到平均10個(gè)(中位數(shù)是5)和前列腺癌相關(guān)的risk SNPs,但這10個(gè)SNP仍需要一一進(jìn)行詳細(xì)和全面的濕實(shí)驗(yàn)驗(yàn)證念恍。精細(xì)定位(fine-mapping) 的新方法可以解決這個(gè)問題(來減少實(shí)驗(yàn)量)六剥。
除了p值,有許多新的分析方法來統(tǒng)計(jì)篩選候選SNP峰伙,比如貝葉斯和重新采樣(re-sampling approaches)方法(原文如下疗疟,不太好翻譯,但是很重要)
“These methods include Bayesian and re-sampling approaches that formally incorporate the uncertainty in estimation and ranking, model selection approaches to condition on multiple SNPs in the region using either individual-level data or marginal test statistics, and approaches that formally incorporate prior information, such as SNP annotation, into the final inference.”
最近词爬,作者和他人合作秃嗜,使用多種族人群、單一大型歐洲人群數(shù)據(jù)對(duì)前列腺癌風(fēng)險(xiǎn)區(qū)進(jìn)行了精細(xì)定位顿膨,降低了每個(gè)區(qū)域候選risk SNP的數(shù)量(表1)锅锨。FunciSNP3注釋的功能和2個(gè)精細(xì)作圖研究,進(jìn)一步減少了共同候選SNP的數(shù)量恋沃,平均每個(gè)區(qū)域只有大約2個(gè)SNP(表1)必搞。這意味著,如果沒有功能性的注釋或精細(xì)定位囊咏,每個(gè)位點(diǎn)都有大量的假陽性結(jié)果恕洲,從而導(dǎo)致與risk無關(guān)的功能分析。Ultimately SNPs must be func- tional to be causal, but not all functional SNPs are inevitably involved in risk.
然而梅割,即使有精細(xì)作圖的數(shù)據(jù)在手霜第,當(dāng)多個(gè)功能性SNP位于單一基因座上且其中只有一個(gè)對(duì)疾病有主效作用時(shí),下游分析也會(huì)變得復(fù)雜户辞。也可能是多個(gè)變異位點(diǎn)效應(yīng)互補(bǔ)泌类,亦或是某一特定疾病位點(diǎn)的病因靶組織位于原發(fā)組織的外部。
舉個(gè)例子底燎,除了前列腺上皮細(xì)胞的失調(diào)刃榨,前列腺癌可能還受到免疫細(xì)胞的影響。這可以解釋双仍,為什么現(xiàn)有的前列腺上皮表觀基因組學(xué)數(shù)據(jù)中大約1/3的風(fēng)險(xiǎn)位點(diǎn) (risk loci)?沒有功能枢希。此外,漿液性卵巢癌中精細(xì)作圖的risk SNP 富集在FTSE (fallopian tube serous epithelium)?原生代細(xì)胞系而并非卵巢漿液性上皮細(xì)胞系朱沃,表明該組織是由FTSE起源而來的苞轿,因此作者預(yù)計(jì)多個(gè)候選組織(或所有已知細(xì)胞類型)之間的比較將成為該領(lǐng)域的標(biāo)準(zhǔn)。Thus, in the midst of so much uncertainty, it is essential that biological assays give repeatable and reliable measures of these complex interactions逗物。必須強(qiáng)調(diào)的是呕屎,單獨(dú)的功能注釋顯然不是全面的,其他與染色質(zhì)有關(guān)的未知功能在將來也會(huì)被考慮敬察。最終目標(biāo)是在每個(gè)risk位點(diǎn)鑒定出真正的功能/致病性SNP(具有超過1個(gè)SNP或多個(gè)causal等位基因)秀睛。實(shí)現(xiàn)這一目標(biāo)需要開發(fā)特定的檢測(cè)方法,一旦相應(yīng)的表觀基因組(或其他)的變異功能得到驗(yàn)證莲祸,即可檢測(cè)等位基因?qū)?duì)癌癥進(jìn)程的影響蹂安。The eventual utility from this will be the compilation of more informative nomograms for risk assess- ments and the identification of risk mechanisms.
參考文獻(xiàn):
見原文
擴(kuò)展閱讀:
What is the Perspectives? ?From New Journal of Chemistry
Perspectives are brief reviews giving the personal viewpoint of a leading scientist in their area of research, setting it in the context of the work of others and looking forward to future developments.
In some cases, pairs of collaborating scientists from different disciplines are invited to each give their perspective on their common field of research in order to demonstrate the benefits of collaborative research and facilitate dialogue between communities.
The majority of Perspectives are invited by the editorial office, however suggestions of suitable Perspectives in the form of a brief synopsis and biography of the author, are welcome.