11 篇 Science 連發(fā)妥泉!解析基因組的關(guān)鍵之處椭微,揭示人類健康和疾病的相關(guān)信息

導(dǎo)讀

從兩克重的大黃蜂到重達(dá)數(shù)噸的鯨魚,地球上存在著包括人類在內(nèi)的豐富的物種盲链,在過去的漫長(zhǎng)時(shí)間里蝇率,它們幾乎適應(yīng)了地球上的所有環(huán)境。其中刽沾,哺乳動(dòng)物是最多樣化的一類動(dòng)物本慕,無論是在大小上,還是在形狀上侧漓,均表現(xiàn)出豐富的多樣性锅尘。自生命科學(xué)研究出現(xiàn)以來,了解哺乳動(dòng)物的變異是何時(shí)布蔗、如何以及在何種選擇壓力下發(fā)展起來的一直是人們感興趣的問題藤违。此外,通過研究人的進(jìn)化史纵揍,還可以進(jìn)一步了解人類的健康狀況顿乒,例如,那些在許多物種中保守的基因可能是對(duì)正常功能至關(guān)重要的基因泽谨,因此當(dāng)其發(fā)生改變時(shí)可能導(dǎo)致疾病淆游。

2023 年 4 月 28 日,諸多科學(xué)家們與世界上最大的哺乳動(dòng)物基因組學(xué)比較資源 Zoonomia Project 的國(guó)際合作隔盛,同日在 Science 雜志上發(fā)表了 11 篇研究論文。他們對(duì) 240 種哺乳動(dòng)物物種(占哺乳動(dòng)物家族的 80% 以上)的基因組多樣性進(jìn)行了編目拾稳。其中吮炕,部分研究發(fā)現(xiàn)指出人類基因組中經(jīng)過數(shù)百萬年進(jìn)化后保持不變的部分,提供了可能揭示人類健康和疾病的信息访得。

圖片來源:Science

Zoonomia Project 是由麻省理工學(xué)院龙亲、哈佛大學(xué)等單位的科學(xué)家牽頭的一個(gè)大型國(guó)際研究項(xiàng)目,研究人員通過對(duì)一系列哺乳動(dòng)物基因組進(jìn)行測(cè)序悍抑,然后將數(shù)百個(gè)物種的基因組進(jìn)行整合分析鳄炉,為理解哺乳動(dòng)物、哺乳動(dòng)物進(jìn)化和人類自身打開一扇新的大門搜骡。研究人員對(duì)一系列哺乳動(dòng)物基因組進(jìn)行測(cè)序拂盯,然后將它們進(jìn)行對(duì)比,這是一項(xiàng)巨大的計(jì)算任務(wù)记靡。利用這種比對(duì)谈竿,研究人員確定了基因組的關(guān)鍵區(qū)域团驱,在哺乳動(dòng)物物種和數(shù)百萬年的進(jìn)化中最為保守或不變

作者假設(shè)空凸,雖然這些區(qū)域不產(chǎn)生蛋白質(zhì)嚎花,但可能包含指導(dǎo)蛋白質(zhì)產(chǎn)生時(shí)間和數(shù)量的指令,這些區(qū)域的突變可能在疾病的起源或哺乳動(dòng)物物種的獨(dú)特特征中發(fā)揮重要作用呀洲。通過他們的分析紊选,研究人員也驗(yàn)證了這一假設(shè),并能夠確定至少 10% 的人類基因組是有功能的道逗,大約是蛋白質(zhì)編碼(1%)的十倍兵罢。研究結(jié)果進(jìn)一步揭示了遺傳變異可能在罕見和常見的人類疾病(包括癌癥)中起到因果作用憔辫。

01

如果某些東西對(duì)物種正常的功能很重要趣些,那么它往往會(huì)在進(jìn)化過程中被保存下來,即進(jìn)化約束概念贰您。因此坏平,進(jìn)化約束是衡量基因組中特定區(qū)域在生命進(jìn)化樹上的變化程度。在今日 Science 特刊的一篇研究 Leveraging base-pair mammalian constraint to understand genetic variation and human disease中锦亦,Sullivan 等人觀察到的在許多物種和進(jìn)化過程中保持不變的 DNA 序列舶替,以及在一個(gè)或幾個(gè)譜系中突然開始積累突變的序列,都強(qiáng)有力地表明了功能相關(guān)性和進(jìn)化力量在起作用杠园。研究人員還通過研究髓母細(xì)胞瘤患者顾瞪,發(fā)現(xiàn)了人類基因組進(jìn)化保守位置的突變,他們認(rèn)為這些突變可能導(dǎo)致腦腫瘤生長(zhǎng)更快或抵抗治療抛蚁。結(jié)果表明陈醒,在疾病研究中使用這些數(shù)據(jù)和方法可以更容易地發(fā)現(xiàn)增加疾病風(fēng)險(xiǎn)的遺傳變化。

02

在研究 Evolutionary constraint and innovation across hundreds of placental mammals中瞧甩,研究人員確定了與哺乳動(dòng)物世界中一些特殊特征相關(guān)的基因組部分钉跷,例如非凡的大腦大小、卓越的嗅覺以及在冬季冬眠的能力肚逸。作者使用基因組來證實(shí)爷辙,對(duì)有效種群規(guī)模和多樣性的估計(jì)可以幫助預(yù)測(cè)難以監(jiān)測(cè)和采樣的物種的風(fēng)險(xiǎn)。

03

在另一項(xiàng)研究 A genomic timescale for placental mammal evolution 中表明朦促,甚至在大約 65 萬年前膝晾,即地球被小行星撞擊、恐龍滅絕之前务冕,哺乳動(dòng)物就已經(jīng)開始變異和分化血当。

The timing of placental mammal evolution.

Superordinal mammalian diversification took place in the Cretaceous during periods of continental fragmentation and sea level rise with little phylogenomic discordance (pie charts: left, autosomes; right, X chromosome), which is consistent with allopatric speciation. By contrast, the Paleogene hosted intraordinal diversification in the aftermath of the K-Pg mass extinction event, when clades exhibited higher phylogenomic discordance consistent with speciation with gene flow and incomplete lineage sorting.

04

另一項(xiàng)題為Three-dimensional genome rewiring in loci with human accelerated regions的研究中棒动,使用 Zoonomia 數(shù)據(jù)和實(shí)驗(yàn)分析檢查了 10000 多個(gè)特定于人類的基因缺失竿报,并將其中一些與神經(jīng)元的功能聯(lián)系起來。

Example of HAR enhancer hijacking.image.png

The HAR is nearby and regulates gene A, but not gene B, as the chimpanzee genome folds. An insertion in the human genome brings the HAR closer to gene B, causing expression of gene B. The HAR adapts to being in gene B’s regulatory domain through substitutions to previously conserved nucleotides.

05

一篇題為Comparative genomics of Balto, a famous historic dog, captures lost diversity of 1920s sled dogs的研究中,提供了為什么 1920 年代一只名叫巴爾托的著名雪橇犬能夠在阿拉斯加的惡劣環(huán)境中幸存下來的遺傳解釋枉长。

Balto, famed 20th-century Alaskan sled dog, shares common ancestry with modern Asian and Arctic canine lineages.

In an unsupervised admixture analysis, Balto’s ancestry, representing 20th-century Alaskan sled dogs, is assigned predominantly to four Arctic lineage dog populations. He had no discernable wolf ancestry. The Alaskan sled dogs (a working population) did not fall into a distinct ancestry cluster but shared about a third of their ancestry with Balto in the supervised admixture analysis. Balto and working sled dogs carried fewer constrained and missense rare variants than modern dog breeds.IMAGE CREDIT: K. MORRILL

06

一篇題為 The functional and evolutionary impacts of human-specific deletions in conserved elements的研究中饰豺,Xue 等人則分享了對(duì)基因組結(jié)構(gòu)的研究连茧。在確定了僅跨越少數(shù)堿基的缺失后腹侣,他們分析了這些缺失在多種人類細(xì)胞類型中調(diào)節(jié)基因表達(dá)的能力,并探索了這些缺失是否可能導(dǎo)致獨(dú)特的人類表型撤奸。結(jié)果發(fā)現(xiàn)吠昭,復(fù)雜的認(rèn)知功能再次成為人類進(jìn)化過程中序列變化的主要受益者之一,這些小缺失附近的基因系統(tǒng)地富集了那些在大腦和神經(jīng)元功能中發(fā)揮作用的基因胧瓜。通過實(shí)驗(yàn)證實(shí)了它們?cè)诙喾N細(xì)胞類型中的功能后矢棚,作者還觀察到,許多缺失導(dǎo)致人類細(xì)胞中基因表達(dá)的增加府喳,這是獲取新功能的驅(qū)動(dòng)因素蒲肋。

Human-specific deletions that remove nucleotides from regions highly conserved in other animals (hCONDELs).

We assessed 10,032 hCONDELs across diverse, biologically relevant datasets and identified tissue-specific enrichment (top left). The regulatory impact of hCONDELs was characterized by comparing chimp and human sequences in MPRAs (bottom left). The ability of hCONDELs to either improve or perturb activating and repressing gene-regulatory elements was assessed (top right). The deleted chimpanzee sequence was reintroduced back into human cells, causing a cascade of transcriptional differences for an hCONDEL regulating LOXL2 (bottom right).

07

在一篇題為 Relating enhancer genetic variation across mammals to complex phenotypes using machine learning的研究中,研究人員使用機(jī)器學(xué)習(xí)來識(shí)別與大腦大小相關(guān)的基因組區(qū)域钝满。

Tissue-Aware Conservation Inference Toolkit (TACIT) associates genetic differences between species with phenotypes.

TACIT works by generating open chromatin data from a few species in a tissue related to a phenotype, using the sequences underlying open and closed chromatin regions to train a machine learning model for predicting tissue-specific open chromatin and associating open chromatin predictions across dozens of mammals with the phenotype. [Species silhouettes are from PhyloPic]

08

在題為 Mammalian evolution of human cis-regulatory elements and transcription factor binding sites的研究中兜粘,描述了人類基因組中調(diào)控序列的進(jìn)化。

Mammalian evolution of the human regulatory landscape.

(A) Distribution of human cCREs by the number of genomes they align.
(B) Projection of cCREs by alignments to the other 240 mammalian genomes.
(C) Project of HNF4A sites (constrained, red; unconstrained, blue).
(D) Heritability enrichment for 69 human traits in partitions of TFBSs ordered by evolutionary constraint.
(E) Heritability enrichment for human traits by subsets of TFBSs.

09

在題為 Insights into mammalian TE diversity through the curation of 248 genome assemblies 的研究中弯蚜,檢測(cè)了 248 個(gè)胎盤哺乳動(dòng)物基因組裝配體的轉(zhuǎn)座元件 (transposable element, TE) 含量孔轴,這是迄今真核生物中最大的 de novo TE 管理工作。研究發(fā)現(xiàn)碎捺,盡管哺乳動(dòng)物在總 TE 含量和多樣性方面相似路鹰,但它們?cè)诮?TE 積累方面表現(xiàn)出實(shí)質(zhì)性的差異。哺乳動(dòng)物在任何給定的時(shí)間往往只積累少數(shù)幾種 TE收厨,其中一種 TE 占主導(dǎo)地位晋柱。此外,還發(fā)現(xiàn)了飲食習(xí)慣與 DNA 轉(zhuǎn)座子入侵之間的關(guān)聯(lián)诵叁。

Boxplots depicting the range of recently accumulated TEs among mammals (by proportion of genome).

Five categories of TE were examined: DNA transposons, long interspersed elements (LINEs), long terminal repeat (LTR) retrotransposons, rolling circle (RC) transposons, and short interspersed elements (SINEs). Species with the highest and lowest proportions for each TE type are indicated by a picture of the organism and its common name. With regard to RC and DNA transposons, we found that most mammalian genome assemblies exhibit essentially zero recent accumulation (RC: 240 of 248 mammals had <0.1%; DNA: 210 of 248 mammals had <0.1%).ILLUSTRATIONS: BRITTANY ANN HALE

10

在題為 The contribution of historical processes to contemporary extinction risk in placental mammals 的研究中趣斤,調(diào)查了 240 種哺乳動(dòng)物的單基因組的遺傳變異,發(fā)現(xiàn)由于遺傳負(fù)荷的長(zhǎng)期積累和固定黎休,歷史上種群較小的物種攜帶了比例較大的有害等位基因,有較高的滅絕風(fēng)險(xiǎn)玉凯。

Genomic information can help predict extinction risk in diverse mammalian species.

Across 240 mammals, species with smaller historical Ne had lower genetic diversity, higher genetic load, and were more likely to be threatened with extinction. Genomic data were used to train models that predict whether a species is threatened, which can be valuable for assessing extinction risk in species lacking ecological or census data. [Animal silhouettes are from PhyloPic]

11

在題為 Integrating gene annotation with orthology inference at scale 的研究中势腮,提出了 TOGA(Tool to infer Orthologs from Genome Alignments),這是一種集成了結(jié)構(gòu)基因注釋和同源序列推斷的方法漫仆。研究人員將其應(yīng)用于 488 個(gè)胎盤哺乳動(dòng)物和 501 個(gè)鳥類捎拯,從而創(chuàng)建了迄今最大的比較基因資源。

A different paradigm for orthology inference.

Orthologous, but not paralogous, genes have partially aligning intronic and intergenic regions. TOGA uses this principle to infer orthologous gene loci and integrates orthology inference with gene annotation. Using a reference species, TOGA can be applied to hundreds of aligned query genomes to provide rich comparative genomics resources.

在本期 Science 特刊的一系列論文中盲厌,比較了 240 種哺乳動(dòng)物的基因組署照,其中還包含了許多受威脅或?yàn)l危物種祸泪。這些 DNA 樣本由全球 50 多個(gè)不同的機(jī)構(gòu)收集和提供,這些發(fā)現(xiàn)有助于說明比較基因組學(xué)如何不僅可以闡明某些物種如何取得非凡的壯舉建芙,還可以幫助科學(xué)家更好地了解我們基因組中功能正常的部分以及它們?nèi)绾斡绊懡】岛图膊?/strong>没隘。

參考文獻(xiàn):
1. Bogdan M. Kirilenko et al. Integrating gene annotation with orthology inference at scale. Science (2023).

2. Aryn P. Wilder et al. The contribution of historical processes to contemporary extinction risk in placental mammals. Science (2023).

3. Nicole M. Foley et al. A genomic timescale for placental mammal evolution. Science (2023).

4. Austin B. Osmanski et al. Insights into mammalian TE diversity through the curation of 248 genome assemblies. Science (2023).

5. James R. Xue et al. The functional and evolutionary impacts of human-specific deletions in conserved elements. Science (2023).

6. Matthew J. Christmas and Irene M. Kaplow et al. Evolutionary constraint and innovation across hundreds of placental mammals. Science (2023).

7. Katherine L. Moon et al. Comparative genomics of Balto, a famous historic dog, captures lost diversity of 1920s sled dogs. Science (2023).

8. Gregory Andrews et al. Mammalian evolution of human cis-regulatory elements and transcription factor binding sites. Science (2023).

9. Kathleen C. Keough et al. Three-dimensional genome rewiring in loci with human accelerated regions. Science (2023).

10. Irene M. Kaplow et al. Relating enhancer genetic variation across mammals to complex phenotypes using machine learning. Science (2023).

11. Patrick F. Sullivan et al. Leveraging base-pair mammalian constraint to understand genetic variation and human disease. Science (2023).

參考:https://mp.weixin.qq.com/s/Nxj92VYnRFNg9KH6L7CTBw

?著作權(quán)歸作者所有,轉(zhuǎn)載或內(nèi)容合作請(qǐng)聯(lián)系作者
  • 序言:七十年代末,一起剝皮案震驚了整個(gè)濱河市禁荸,隨后出現(xiàn)的幾起案子右蒲,更是在濱河造成了極大的恐慌,老刑警劉巖赶熟,帶你破解...
    沈念sama閱讀 206,013評(píng)論 6 481
  • 序言:濱河連續(xù)發(fā)生了三起死亡事件瑰妄,死亡現(xiàn)場(chǎng)離奇詭異,居然都是意外死亡映砖,警方通過查閱死者的電腦和手機(jī)间坐,發(fā)現(xiàn)死者居然都...
    沈念sama閱讀 88,205評(píng)論 2 382
  • 文/潘曉璐 我一進(jìn)店門,熙熙樓的掌柜王于貴愁眉苦臉地迎上來邑退,“玉大人竹宋,你說我怎么就攤上這事」霞ⅲ” “怎么了逝撬?”我有些...
    開封第一講書人閱讀 152,370評(píng)論 0 342
  • 文/不壞的土叔 我叫張陵,是天一觀的道長(zhǎng)乓土。 經(jīng)常有香客問我宪潮,道長(zhǎng),這世上最難降的妖魔是什么趣苏? 我笑而不...
    開封第一講書人閱讀 55,168評(píng)論 1 278
  • 正文 為了忘掉前任狡相,我火速辦了婚禮,結(jié)果婚禮上食磕,老公的妹妹穿的比我還像新娘尽棕。我一直安慰自己,他們只是感情好彬伦,可當(dāng)我...
    茶點(diǎn)故事閱讀 64,153評(píng)論 5 371
  • 文/花漫 我一把揭開白布滔悉。 她就那樣靜靜地躺著,像睡著了一般单绑。 火紅的嫁衣襯著肌膚如雪回官。 梳的紋絲不亂的頭發(fā)上,一...
    開封第一講書人閱讀 48,954評(píng)論 1 283
  • 那天搂橙,我揣著相機(jī)與錄音歉提,去河邊找鬼。 笑死,一個(gè)胖子當(dāng)著我的面吹牛苔巨,可吹牛的內(nèi)容都是我干的版扩。 我是一名探鬼主播,決...
    沈念sama閱讀 38,271評(píng)論 3 399
  • 文/蒼蘭香墨 我猛地睜開眼侄泽,長(zhǎng)吁一口氣:“原來是場(chǎng)噩夢(mèng)啊……” “哼礁芦!你這毒婦竟也來了?” 一聲冷哼從身側(cè)響起蔬顾,我...
    開封第一講書人閱讀 36,916評(píng)論 0 259
  • 序言:老撾萬榮一對(duì)情侶失蹤宴偿,失蹤者是張志新(化名)和其女友劉穎,沒想到半個(gè)月后诀豁,有當(dāng)?shù)厝嗽跇淞掷锇l(fā)現(xiàn)了一具尸體窄刘,經(jīng)...
    沈念sama閱讀 43,382評(píng)論 1 300
  • 正文 獨(dú)居荒郊野嶺守林人離奇死亡,尸身上長(zhǎng)有42處帶血的膿包…… 初始之章·張勛 以下內(nèi)容為張勛視角 年9月15日...
    茶點(diǎn)故事閱讀 35,877評(píng)論 2 323
  • 正文 我和宋清朗相戀三年舷胜,在試婚紗的時(shí)候發(fā)現(xiàn)自己被綠了娩践。 大學(xué)時(shí)的朋友給我發(fā)了我未婚夫和他白月光在一起吃飯的照片。...
    茶點(diǎn)故事閱讀 37,989評(píng)論 1 333
  • 序言:一個(gè)原本活蹦亂跳的男人離奇死亡烹骨,死狀恐怖翻伺,靈堂內(nèi)的尸體忽然破棺而出,到底是詐尸還是另有隱情沮焕,我是刑警寧澤吨岭,帶...
    沈念sama閱讀 33,624評(píng)論 4 322
  • 正文 年R本政府宣布,位于F島的核電站峦树,受9級(jí)特大地震影響辣辫,放射性物質(zhì)發(fā)生泄漏。R本人自食惡果不足惜魁巩,卻給世界環(huán)境...
    茶點(diǎn)故事閱讀 39,209評(píng)論 3 307
  • 文/蒙蒙 一急灭、第九天 我趴在偏房一處隱蔽的房頂上張望。 院中可真熱鬧谷遂,春花似錦葬馋、人聲如沸。這莊子的主人今日做“春日...
    開封第一講書人閱讀 30,199評(píng)論 0 19
  • 文/蒼蘭香墨 我抬頭看了看天上的太陽。三九已至集晚,卻和暖如春掠廓,著一層夾襖步出監(jiān)牢的瞬間,已是汗流浹背甩恼。 一陣腳步聲響...
    開封第一講書人閱讀 31,418評(píng)論 1 260
  • 我被黑心中介騙來泰國(guó)打工, 沒想到剛下飛機(jī)就差點(diǎn)兒被人妖公主榨干…… 1. 我叫王不留,地道東北人条摸。 一個(gè)月前我還...
    沈念sama閱讀 45,401評(píng)論 2 352
  • 正文 我出身青樓悦污,卻偏偏與公主長(zhǎng)得像,于是被迫代替她去往敵國(guó)和親钉蒲。 傳聞我的和親對(duì)象是個(gè)殘疾皇子切端,可洞房花燭夜當(dāng)晚...
    茶點(diǎn)故事閱讀 42,700評(píng)論 2 345

推薦閱讀更多精彩內(nèi)容