Home>>

China Focus: High-quality Han Chinese complete genome reference assembled

(Xinhua) 19:02, December 12, 2023

BEIJING, Dec. 12 (Xinhua) -- A team of geneticists has assembled a complete human genome reference for Han Chinese, the first of its kind and which could potentially promote precision medicine in China.

The telomere-to-telomere (T2T) gapless diploid genome sequence of a healthy male individual, named T2T-YAO, contained two complete sets of chromosomes, one from each parent, and the Y-chromosome that passes only from male parents to male offspring.

A complete and accurate reference genome has been a long-standing goal in the biomedical research community since the initiation of the Human Genome Project three decades ago.

A similar work, T2T-CHM13, published in 2022 by the U.S. National Institutes of Health, fulfilled 8 percent of the previously unknown highly repetitive region in the human genome.

However, it was of European-origin and without the Y-chromosome, thus not enough for representing all individuals worldwide.

The scientists from Peking University People's Hospital and Beijing Institute of Genomics (BIG) under the Chinese Academy of Sciences, collected samples from an ancient village in Hongtong County in Shanxi Province in the north of China, a place believed to be a starting point of the countrywide mass migration in around the late 14th century.

The YAO part of the name stems from the sampling point located near the ruins of the capital of the legendary Chinese emperor Yao, while T2T stands for telomere-to-telomere or end-to-end sequence of all chromosomes in the genome.

The quality of YAO sequencing is the best among all currently available human genome assemblies, according to the study published in the journal Genomics, Proteomics & Bioinformatics.

Also, this version with a single set of chromosomes has reached a high quality of fewer than one error per 29.5 million basepairs, which is "generally better than that of T2T-CHM13," said Kang Yu, a scientist from BIG.

GENETIC DISCREPANCY

At the early stage of gene sequencing, scientists found a strong similarity between human genes and those of chimpanzees and even rodents. Also, based on partial human genomes that contain mainly protein-coding genes in previous studies, it had been believed that the genome differences between individuals of different races were only 0.1 percent, which meant that all humans could share a single reference genome.

However, a comparative analysis conducted by the Chinese team has revealed that about 11 percent of YAO's genome is not alignable to that of T2T-CHM13, with about 3,000 different genes in each genome, a discrepancy much wider than previously estimated, said Gao Zhancheng from Peking University People's Hospital, the paper's correspondence author.

The significant discrepancies between the individual genomes of the two human populations in the study are mainly attributed to the mass of non-coding DNA, accounting for nearly 99 percent of the genome, he added.

Recently, some of those non-coding DNA sequences were found to serve important functional roles, such as in the regulation of gene expression, while the functions of other non-coding DNA remain unknown.

Gao and his collaborators have found that YAO is mostly of East Asian origin and admixed with sporadic predicted markers of South Asia, Europe, and America.

The markers from South Asia are a little more than those stemming from Europe and America, revealing greater genetic exchange between the East Asian and South Asian ethnic groups, according to the study.

In addition, the haplotype of Y-chromosome in YAO, a predominant type in China and Asia, has also been identified in ancient DNA samples from a Neolithic site in the nearby Shaanxi Province dating back to approximately 4,000 years ago, which suggests a potential genetic continuity in the region from the earliest days of human habitation in this part of China.

The reference human genome is known as a genetic "navigation map" widely used in human genetics and medical research, and the great genome discrepancies among ethnic groups suggest that YAO is a more appropriate reference genome for Han Chinese.

The YAO genome can provide more accurate gene and mutation information for the Han Chinese population in establishing a technical system and quality benchmark for clinical research such as genetic disease diagnosis, disease risk prediction, cancer studies, and precision medicine in China, commented Cheng Jing from Tsinghua University, an academician of the Chinese Academy of Engineering.

(Web editor: Zhang Wenjie, Wu Chaolan)

Photos

Related Stories