Introduction
Hepatitis B is an infectious disease that is caused by the hepatitis B virus (HBV) and is a major global health threat. HBV infection can lead to serious and long-term adverse outcomes, such as liver cirrhosis (LC) and hepatocellular cancer (HCC) [Reference Dan1]. Hepatitis B vaccines have been available since the mid-1980s, and in recombinant form since the early 1990s. Although HBV infection has declined due to vaccination, especially among the current generation of children in China, many people living with HBV were infected before the widespread availability of the hepatitis B vaccine, during their birth. There is some horizontal transmission of HBV, mostly among adults, but the vast majority of adult infections are self-limited and do not become chronic [Reference Ferrari2]. Chronic HBV infection from perinatal infection is usually asymptomatic for decades, and complications such as cirrhosis and liver cancer emerge after the fourth decade of life. There are nearly 90 million people living with HBV in China, and as these individuals age, the prevalence of complications is increasing [Reference Di Bisceglie3].
HBV has 10 known genotypes (A-J) and several subgenotypes [Reference Talebi and Talaee4]. Although HBV genotype or subgenotype can influence the natural history of HBV infection, antiviral treatment is not dependent on the identification of a specific genotype or subgenotype. Genotype A is associated with chronic active hepatitis; genotype D is associated with acute self-limiting hepatitis; genotype B is associated with liver cancer. Among patients with chronic HBV infection, individuals with type C and D infection have more severe clinical outcomes than those with genotype A and B infection [Reference Lin and Kao5].
In the United States, genotype distribution varies by geographic and demographic factors [Reference Gordon6]. Regardless of the study site, almost all Asian patients were infected with genotypes B or C. Non-Hispanic white patients were almost all infected with genotype A or D; Black patients were mainly infected with genotype A, followed by E and D. Genotype F was the most prominent genotype of Alaska Natives [Reference Livingston7]. Genotype G is predominant in many western countries, especially among people of Mongolian descent [Reference Osiowy8]; genotype H is frequently seen in Mexico [Reference Alvarado-Esquivel9]; genotypes I and J are mainly found in Asians. Correlation between race and genotype is consistent with geographical distributions in Asia, northern Europe, southern Europe and Africa. Differences in dominant HBV genotype among immigrants of various races in the United States and other countries suggest that there is a strong relationship between HBV genotype and race [Reference Roberts10]. There may also be differences in the distribution of HBV genotypes by ethnic group. Studies in China have found differences in the distribution of HBV genotypes among ethnic groups, primarily in Xinjiang, Tibet and among other western China ethnic minorities. However, there are problems with studies on the relationship between ethnicity and HBV genotype, including inconsistent research results, insufficient sample sizes, too few ethnic groups and confounding factors.
The HBsAg carrier prevalence in southwestern China (7.9%) is significantly higher than in other regions. Yunnan province is located in southwestern China, and has the greatest geographical diversity, the highest biodiversity and the most diverse ethnic group mix in China [Reference Sun11]. The distribution of ethnic minorities in Yunnan is characterised by ‘scattered distribution in a large area’ and ‘aggregated distribution in a small area.’ Therefore, Yunnan's population is suitable to study epidemiology, natural history and clinical virology of HBV infection. However, to date, there has been insufficient research on HBV genotypes and subgenotypes in Yunnan [Reference Shen12].
Materials and methods
Ethical review and study population
This study was approved by the ethics committee of Yunnan CDC and carried out in accordance with national ethics regulations. The investigation plan was conducted in line with the ethical guidelines of the Declaration of Helsinki. Participants were informed of the purpose of the study, that participation was voluntary and that their data would be confidential. Participants provided informed consent before entering the study and collecting samples.
From 1 January 2017 to 31 December 2018, we enrolled 765 individuals with acute or chronic HBV infection who had been reported to the hepatitis B surveillance system from one of 16 prefectures (or cities) of Yunnan province. These individuals were from 20 different minority groups. Blood samples from included subjects met sample quality criteria. Ethnicity was self-described. For comparison with Han ethnicity, we used genotype data from 65 Han people obtained in another study that was conducted in the same region [Reference Xu13].
Laboratory testing
The subjects' 765 serum samples were frozen at −20°C; repeated freezing and thawing was avoided.
HBV DNA preparation and amplification
In accordance with the manufacturer's instructions, we extracted HBV-DNA from 200 ml serum samples using QIA amp DNA Blood Mini Kits (Qiagen, Hilden, Germany). S-gene nested PCR amplification primers were designed consistent with protocols from the National Key Laboratory of CDC in the Chinese Center for Disease Control and Prevention and were synthesised by Shanghai Biotechnology Company. The first pair of primers was SF1:5′-cctgtattttcctgctggtggctcc-3′ and SR1:5′-gcagcaaagcccaaaagaccc-3′; the second pair of primers was SF2:5′-gttacaggcggggttttt-3′ and SR2:5′-cccatgaagttaagggagtagc-3′. Nested amplification was conducted with polymerase chain reaction (PCR) amplification kits (Takara, Japan). The first PCR amplification system consisted of Premix Tap 25 μl, 100 pmol/μl SF1 1 μl, 100 pmol/μl SR1 1 μl, sterilised double distilled water 18 μl, and HBV-DNA 5 μl, 50 μl. The second PCR amplification system consisted of Premix Tap 25 μl, 100 pmol/μl SF2 1 μl, 100 pmol/μl SR2 1 μl, sterilised double distilled water 21 μl, and first-round PCR products 2 μl, 50 μl total. PCR amplification conditions were: first, denaturation was carried out at 94°C for 5 min, followed by 35 cycles of amplification (each cycle was denaturation at 94°C for 30 s, annealing at 58°C for 40°Cs, extension at 72°C for 70 s), and extension at 72°C for 10 min. The second round of amplification was identical to the first. PCR products were eluted by 1.2% agarose gel electrophoresis. The first and second PCR products of HBV S gene fragments consisted of 931 and 646 base pairs, respectively.
Sequencing and genotyping of the S gene
We use the following method to determine genotype and subgenotype. After purification of the amplified S gene products, we used an ABIPrism3730X automatic sequencer (Applied BioSystems, Foster City, CA, USA) to conduct bidirectional sequencing. Sequencing results were edited by sequencer software from Gene Codes Corporation, USA. The HBV genetic reference sequences for each genotype, A-J, and subgenotype were from GenBank. Phylogenetic trees were created using the neighbour-joining method (MEGA, V5.2). The resulting HBV S gene sequences were compared with HBV genotype and subgenotype reference strains; the reliability of phylogenetic trees was tested 1000 times by bootstrap.
Quality control
According to the ‘Guidelines for Prevention and Treatment of Viral Hepatitis’ and quality control standards of provincial laboratories, each sample was tested twice, and only samples that were twice positive were considered HBV positive. We used positive and negative controls during DNA extraction and PCR amplification to avoid contamination.
Statistical analysis
Baseline characteristics were described using frequencies and percentages for categorical variables and means and standard deviations for continuous variables. Pearson χ 2 test was used to determine differences in categorical variables, such as the percentage of HBV genotype and gender; continuous variables, such as age, were tested for normal distribution (P = 0.2), with homogeneity of variance (P = 0.439), and then tested for differences by ANOVA. SPSS (version 16.0) was used for statistical analyses. P < 0.05 was considered statistically significant.
Results
The mean age of the 765 HBV-positive subjects from ethnic minorities in Yunnan province was 37.4 ± 15.5 years; the male–female ratio was 1:1.03. We were able to amplify HBV DNA of 87.84% (672/765) and sequence 97% (649/672) of subjects' samples.
Typing of HBV S gene
Phylogenetic analysis of the HBV S gene (Fig. 1) showed that the 649 HBV sequenced strains were in one of four genotypes: 138 (21.3%) were genotype B, 497 (76.6%) were genotype C, 12 (1.8%) were genotype D, and 2 (0.3%) were genotype I. Among the genotype B strains, 1 was subgenotype B1, 118 were B2, 9 were B3, 7 were B4, 2 were B7 and 1 was B8. Among genotype C strains, 394 were C1, 60 were C2, 17 were C5, 24 were C8, 1 was C10 and 1 was C17. Among genotype D strains, 1 was D3 and 11 were C/D recombinant genotypes. Both genotype I strains were I1. Genotype distribution did not vary by age (x 2 = 27.68, P = 0.274) or gender (x 2 = 0.331, P = 0.954) (Tables 1 and 2).
The 649 sequenced HBV strains were distributed in 16 prefectures (cities) of Yunnan province, with the largest percentage being from Wenshen prefecture (20.0%), followed by Honghe prefecture (17.4%); Baoshan prefecture had the fewest strains (0.5%). Regional distributions of the four genotypes had statistically significant differences (x 2 = 246.02, P < 0.001), as shown in Table 3.
Distribution of HBV genotypes and subgenotypes among ethnic groups
There were statistically significant differences in the distribution of HBV genotypes (x 2 = 215.8, P < 0.001) and subgenotypes (x 2 = 625.3, P = 0.011) among ethnic minorities in Yunnan province. The 20 Ethnic groups included Achang, Bai, Bulang, Tibetan, Dai, De'ang, Dulong, Hani, Hui, Jingpo, Lahu, Lisu, Miao, Naxi, Nu, Pumi, Wa, Yao, Yi and Zhuang people. The distribution of HBV subgenotypes by ethnic group is shown in Table 2. HBV genetic diversity was smaller among the Han comparison group, which had genotypes B and C (Table 4). The two genotype I viruses were in Zhuang people. Tibet had a greater proportion of genotype D than other provinces, and both genotypes from Tibetan and Yi ethnic minorities were C/D recombinant (Table 5), homologous with D4. There were no statistically differences by gender (P = 0.129) or age (P = 0.439) among different ethnic groups.
Note: each superscript letter indicates a subset of ethnic groups. At the level of α = 0.05, there is no significant difference between the column proportions of these groups.
Pairwise comparison by the ethnic group showed no statistically significant difference in genotype B. For genotype C, Han was a subset of Lisu and was statistically significantly different from other nationalities. For genotype D, Tibetan was a subset of Naxi, which was statistically significantly different from other nationalities. In genotype I, Zhuang was statistically significantly different from other nationalities.
Comparison of HBV genotype ethnic and geographic distribution
Regardless of geographic location, the predominant HBV genotypes were consistently the same by ethnic group. As an example, the 167 Yi nationality cases (Table 6), were from 14 prefectures and cities in Yunnan. Although there were differences in genotype distribution by region (x 2 = 56.63, P < 0.001), all viruses from Yi nationality were genotype C.
Discussion
This study determined the HBV genotypes and subgenotypes of viruses isolated from subjects with acute or chronic HBV infection who were from any of 20 ethnic groups living in 16 prefectures of Yunnan province. Fourteen of the ethnic groups are unique to Yunnan. We found that these 20 ethnic minority populations had four HBV genotypes (B, C, D and I), most of which (76.6%) were genotype C. There were no statistically significant differences in genotype distribution by gender or age among different ethnic groups, showing that the results are likely stable, representing steady-state distributions.
Our finding that the most common genotype was C is consistent with Jing You's study [Reference Hutcha, Virasakdi and Alan14] across all of Yunnan province. Genotype C is also the most common genotype in China (53%) [Reference Zeng15]. Genotype C HBV inhibits the expression of HBeAg and causes immune escape, but does not affect the replication efficiency of the virus in the host, resulting in more persistent HBV infection and promoting greater deterioration of the liver. This may be the reason that some ethnic groups have greater burdens of chronic liver disease.
A large number of studies have identified the racial and geographic differences in the distribution of HBV genotypes. Some researchers have found that the distribution of HBV genotypes and the course of liver disease are closely related to ethnic background [Reference Bzowej16], and HBV infection rates differ by ethnic group in Yunnan [Reference Shen12]. Biological susceptibility may play a role in the difference in genotype distribution [Reference Nguyen and Thuluvath17]. Based on our analysis of the twenty ethnic minority groups compared with the Han majority in Yunnan, we found differences in the distribution of HBV genotypes. For genotype C, Han was a subset of Lisu, which was statistically different from other nationalities, indicating that the susceptibility of Han and Lisu to genotype C was similar. Susceptibilities of Tibetan and Naxi to D genotype were also similar. Genotypes I were only seen among Zhuang people, identical to findings in 2016 [Reference Xu13]. Genotype I cases were found in Yunnan for the first time in China in 2011 [Reference Kang18], and a small number of cases have been found in recent years that may be caused by foreign carriers. However, all instances were among Zhuang people. Jinhua Hu [Reference Hu19] found that HBV seen in Zhuang nationality in Guangxi is mostly genotype C. Whether Zhuang nationality is more susceptible to genotype I needs further study.
HBV among Han nationality in our study was mainly genotype B, which is consistent with Hu Jinhua's research. HBV in Tibetan and Yi populations are mainly genotype C, which is consistent with what is seen in Sichuan province [Reference Zhang20]. The distribution of C/D recombinant HBV was seen most prominently in Tibet, consistent with the results of Bin Zhou's study [Reference Zhou21]. Most of the HBV-positive samples were from the Diqing area, which may be due to a regional prevalence of HBV genotype D in Diqing.
By comparing ethnic and geographic differences of HBV distribution, we found the uniqueness of ethnic differences. For example, Yi HBV cases were from different regions in Yunnan, but the dominant genotypes were the same, as was the case with Dai and Hani minority groups. Genetic studies have found that DNMT3B gene polymorphism varies by race, nationality, or geographic region [Reference Wu and Lin22]. The frequency of chronic HBV infection in the Chinese population is more than that in European populations, which may be related to NF-kB gene polymorphism [Reference Li, Hou and Cao23]. In general, genetic factors can affect the incidence and prognosis of hepatitis B. Thus, the differences in HBV genotypes among different minorities may be due to human genetic differences.
Although we found ethnic differences in the distribution of HBV genotypes and subgenotypes in Yunnan, the majority of genotypes were C. This is similar to the five ethnic groups in Xinjiang (also C majority) [Reference Feng24] and is consistent with the dominant genotype of the entire province. However, other studies have had different findings. For example, HBV in the Bai nationality in Yunnan is mainly genotype C, while HBV in Bai nationality in Dali is mainly genotype B [Reference Li25]. HBV in Han nationality in Yunnan is mainly genotype B, while the Han in Xinjiang has mainly genotype C HBV [Reference Zhang20]. Previously, it was reported that the C/D recombinant is unique to Tibetans, while we found that Naxi and Yi were also infected with the recombinant virus. This likely indicates that in the continuous development of society, interactions between ethnic groups and regions are increasingly close, leading to the constant change in the epidemiology of HBV infection. The heterogeneity of HBV in the United States is primarily influenced by immigration [Reference Chu26]. This study is primarily descriptive and exploratory. Further study is needed to identify genetic differences or key social factors, such as resource inequality, to more deeply understand reasons for differences in disease burden and find a new breakthrough for hepatitis B prevention and treatment.
This study had strengths and limitations. Strengths include that the success rate of HBV DNA amplification was 87.84%, and the S gene sequencing was able to be completed in 649 cases, with a success rate of 97%. The serum quality was high and the quality control was effective. Our sample size was much larger than in previous similar studies [Reference Shen12], allowing a finer grain analysis of ethnic differences of HBV genotypes. Limitations include that new generation DNA sequencing technology was not used, making it impossible to obtain more information from the samples. We did not have access to special software (e.g., Simplot) to further determine HBV C/D recombinant isolates. This study had only one subject of Pumi nationality whose HBV was successfully typed, limiting representativeness for Pumi people. We plan future research to understand the distribution of HBV genotypes and subgenotypes in ethnic groups in Yunnan province using faster methods with higher sensitivities and specificities that will provide evidence, reference and insight into the prevention and control of hepatitis B.
Conclusions
In summary, this study found ethnic differences in HBV genotype distribution along with special distributions of HBV genotypes in some ethnic groups. Some of the results differ from the results of other researchers. Importantly, the epidemiological characteristics of HBV have changed with societal changes, and there are similarities and differences in the geographic and ethnic distributions of HBV. Prospective systematic research with strategic study design and sampling methods can improve our understanding of HBV evolution. Key factors affecting the prognosis of hepatitis B infection, such as genetic, geographic, or health resource allocation, should be evaluated. Scientific study can help reduce ethnic differences in the burden of hepatitis B and the prevalence of HBV genotype C in Yunnan province, leading to an overall reduction of the burden of chronic hepatitis B infection and its complications.
Supplementary material
The supplementary material for this article can be found at https://doi.org/10.1017/S0950268821002326.
Acknowledgements
We express our gratitude to all our patients and control volunteers for kindly donating their blood and for their agreement to participate in this study.
Author contributions
The authors Z Dong and JR Li wrote this manuscript; MD Hu and ZX Zhao did the data analysis; L Xu and W Yu finished the experiment; WY Kang and QF Li designed the project and reviewed the manuscript.
Conflict of interest
The authors declare no conflicts of interest.
Data availability statement
The data that support the findings of this study are available from Email: [email protected]. Restrictions apply to the availability of these data, which were used under licence for this study. Data are available with the permission of the author.