特约专稿

“端粒到端粒”人类参考基因组:精准医学时代的新起点

展开
  • 中国科学院北京基因组研究所(国家生物信息中心),北京 100101
康禹,研究方向:医学基因组学、基因检测和分析新技术。

收稿日期: 2024-02-19

  网络出版日期: 2024-04-19

T2T human reference genomes mark the new starting point of the precision medicine era

Expand
  • Beijing Institute of Genomics, Chinese Academy of Sciences/China National Center for Bioinformation, Beijing 100101, China

Received date: 2024-02-19

  Online published: 2024-04-19

摘要

DNA测序技术的飞速发展使人类基因组学研究迈进“端粒到端粒(T2T)时代”。这个时代的核心标志是实现每一个人类染色体分子——从端粒到端粒——高质量、高完整度的连续线性组装,成为精准医学质量控制的金标准,从而准确和稳定地识别个体变异信息。越来越多个体基因组的完整组装为我们揭示远高于预期的人类基因组差异,也为各人群采用自己的近缘参考基因组进行变异分析提供了关键的理论依据。近缘参考基因组可以更准确地识别和定位个体的基因变异,而准确的变异数据是精准医学中变异与表型关联研究的关键数据基础。这种以人群为单位的基因组分析和研究新范式,必将显著影响国际和国内未来精准医学发展的格局。

本文引用格式

康禹 . “端粒到端粒”人类参考基因组:精准医学时代的新起点[J]. 自然杂志, 2024 , 46(2) : 88 -94 . DOI: 10.3969/j.issn.0253-9608.2024.02.002

Abstract

Driven by the rapid advancement of DNA sequencing technology, human genome research entered the era of “telomere to telomere (T2T)”. The central hallmark of this era is achieving high-quality and high-continuity linear assembly of every human chromosome molecule, from telomere to telomere, which could be served as the gold standard for quality control in precision medicine. This enables accurate and stable identification of individual variation information. The complete assembly of an increasing number of individual genomes has revealed significantly higher levels of human genome variations than initially anticipated. It has also provided a crucial theoretical foundation for each population to utilize its own reference genome for variation analysis. By employing a closely matched reference genome, it would be more precise in identification and locating individual variations. Dataset of accurate variation information forms the fundamental basis for studying the association between variations and phenotypes in precision medicine. This novel paradigm based on population-based genome analysis and research will undoubtedly have a significant impact on the future development of precision medicine.

参考文献

[1] RHIE A, NURK S, CECHOVA M, et al. The complete sequence of a human Y chromosome [J]. Nature, 2023, 621: 344-354.
[2] YANG C, ZHOU Y, SONG Y, et al. The complete and fully-phased diploid genome of a male Han Chinese [J]. Cell Res, 2023, 33: 745-761.
[3] HE Y, CHU Y, GUO S, et al. T2T-YAO: A telomere-to-telomere assembled diploid reference genome for Han Chinese [J]. Genomics Proteomics Bioinformatics, 2023. DOI: 10.1016/j.gpb.2023.08.001.
[4] 于军.“人类基因组计划”回顾与展望: 从基因组生物学到精准医学[J]. 自然杂志, 2013, 35(5): 326-331.
[5] International Human Genome Sequencing Consortium. Finishing the euchromatic sequence of the human genome [J]. Nature, 2004, 431: 931-945.
[6] WENGER A M, PELUSO P, ROWELL W J, et al. Accurate circular consensus long-read sequencing improves variant detection and
assembly of a human genome [J]. Nat Biotechnol, 2019, 37: 1155-1162.
[7] JAIN M, KOREN S, MIGA K H, et al. Nanopore sequencing and assembly of a human genome with ultra-long reads [J]. Nat Biotechnol, 2018, 36: 338-345.
[8] RHIE A, MCCARTHY S A, FEDRIGO O, et al. Towards complete and error-free genome assemblies of all vertebrate species [J].
Nature, 2021, 592: 737-746.
[9] NURK S, KOREN S, RHIE A, et al. The complete sequence of a human genome [J]. Science, 2022, 376: 44-53.
[10] BLATTNER F R, PLUNKETT G 3RD, BLOCH C A, et al. The complete genome sequence of Escherichia coli K-12 [J]. Science,
1997, 277: 1453-1462.
[11] KIM K E, PELUSO P, BABAYAN P, et al. Long-read, wholegenome shotgun sequence data for five model organisms [J]. Sci
Data, 2014, 1: 140045.
[12] LIAO W W, ASRI M, EBLER J, et al. A draft human pangenome reference [J]. Nature, 2023, 617: 312-324.
[13] GAO Y, YANG X, CHEN H, et al. A pangenome reference of 36 Chinese populations [J]. Nature, 2023, 619: 112–121.
[14] LING Y, JIN Z, SU M, et al. VCGDB: a dynamic genome database of the Chinese population [J]. BMC Genomics, 2014, 15: 265.
[15] WANG J, WANG W, LI R, et al. The diploid genome sequence of an Asian individual [J]. Nature, 2008, 456: 60-65.
[16] JARVIS E D, FORMENTI G, RHIE A, et al. Semi-automated assembly of high-quality diploid human reference genomes [J].
Nature, 2022, 611: 519-531.
[17] WAGNER J, OLSON N D, HARRIS L, et al. Curated variation benchmarks for challenging medically relevant autosomal genes [J].
Nat Biotechnol, 2022, 40: 672-680.

[18] ZHANG X. T2T-YAO reference genome of Han Chinese—New step in advancing precision medicine in China [J]. Genomics ProteomicsBioinformatics, 2023. DOI: 10.1016/j.gpb.2023.09.001.
[19] ZOOK J M, CHAPMAN B, WANG J, et al. Integrating human sequence data sets provides a resource of benchmark SNP and indel
genotype calls [J]. Nat Biotechnol, 2014, 32: 246-251.
[20] AGANEZOV S, YAN S M, SOTO D C, et al. A complete reference genome improves analysis of human genetic variation [J]. Science, 2022, 376: eabl3533.

[21] VOLLGER M R, GUITART X, DISHUCK P C, et al. Segmental duplications and their variation in a complete human genome [J].
Science, 2022, 376: eabj6965.
[22] XIAO J, YU J. T2T-Yao, T2T-Shun, and more [J]. Genomics Proteomics Bioinformatics, 2023. DOI: 10.1016/j.gpb.2023.09.002.
[23] PENNISI E. Upstart DNA sequencers could be a ‘game changer’ [J]. Science, 2022, 376: 1257-1258.

文章导航

/