1. Introduction
Trees have long lifespans ranging from tens to hundreds of years, during which their trunks and branches continue to grow. Somatic mutations occur due to errors in genome replication during cell division and failures in DNA damage repair following physical or chemical disturbances. These mutations arise and accumulate in different branches, leading to genetic differentiation within an individual tree (Tomimoto & Satake, Reference Tomimoto and Satake2023). Thanks to recent developments in genomic analysis technologies such as next-generation sequencing, we have become able to identify genetic patterns caused by somatic mutations (Duan et al., Reference Duan, Yan, Zhu, Zhang, Tao, Ji, Zhang, Wang and Wang2022; Hanlon et al., Reference Hanlon, Otto and Aitken2019; Hofmeister et al., Reference Hofmeister, Denkena and Colomé-Tatché2020; Orr et al., Reference Orr, Padovan, Kainer, Külheim, Bromham, Bustos-Segura, Foley, Haff, Hsieh, Morales-Suarez, Cartwright and Lanfear2020; Perez-Roman et al., Reference Perez-Roman, Borredá, Usach and Talon2022; Plomion et al., Reference Plomion, Aury, Amselem, Leroy, Murat and Duplessis2018; Reusch et al., Reference Reusch, Baums and Werner2021; Schmid-Siegert et al., Reference Schmid-Siegert, Sarkar, Iseli, Calderon, Gouhier-Darimont, Chrast, Cattaneo, Schütz, Farinelli, Pagni, Schneider, Voumard, Jaboyedoff, Fankhauser, Hardtke, Keller, Pannell, Reymond, Robinson-Rechavi, Xenarios and Reymond2017; Schmitt et al., Reference Schmitt, Leroy, Heuertz and Tysklind2022; Wang et al., Reference Wang, Ji, Hu, Hu, Jia and Jiang2019; Zahrdadníková et al., Reference Zahrdadníková, Ficek, Brejová, Vinař and Mičieta2020). The physical structure of the tree, with its branches, exhibited a topology similar to the molecular phylogenetic trees of cells sampled from different branches (Perez-Roman et al., Reference Perez-Roman, Borredá, Usach and Talon2022; Satake et al., Reference Satake, Imai, Fujino, Tomimoto, Ohta, Na’iem, Indrioko, Widiyatno, Mollá–Morales, Nizhynska, Tani, Suyama, Sasaki and Kasahara2023), suggesting that somatic mutations and epimutations accumulate as shoots elongate.
In plant tissues, somatic genetic variations can contribute to the next generation because reproductive organs (flowers and fruits) originate from stem cells in the shoot apical meristems (abbreviated as SAM). Gametes, including eggs and pollen, can undergo genetic diversification through somatic mutations, thereby contributing to genetic variation among offspring (Plomion et al., Reference Plomion, Aury, Amselem, Leroy, Murat and Duplessis2018; Sutherland & Watkins, Reference Sutherland and Watkins1986; Whitham & Slobodchikoff, Reference Whitham and Slobodchikoff1981). This differs from animals with unitary structures, where germ lines differentiate from soma in early developmental stages.
In a previous article (Iwasa et al., Reference Iwasa, Tomimoto and Satake2023), we studied a mathematical model describing how the genetic pattern of a shoot is determined by the behavior of stem cells in the meristem. We evaluated the phylogenetic distance between cells sampled from different portions of a shoot, indicating their genetic difference due to somatic mutations accumulated during shoot elongation. Stem cells in the SAM normally undergo asymmetric cell division, producing successor stem cells and differentiated cells. However, occasionally, a stem cell may fail to leave its successor stem cell. Subsequently, to recover the stem cell number, the vacancy is filled by the duplication of one of the nearest neighbour stem cells. Because cell walls prevent stem cells from exchanging their positions, this leads to the genetic differentiation of cells according to the angle around the shoot and a larger genetic variance accumulated among cells in the body (Iwasa et al., Reference Iwasa, Tomimoto and Satake2023).
In this article, we aim to highlight the significance of somatic mutations (and epimutations) in generating genetic diversity among reproductive organs of the same individual and their evolutionary effects through modifying individual fitness and the population genetic variance. For analysis, we again adopt a cell lineage-based model (Chen et al., Reference Chen, Burianm and Johannes2024).
We first note that there are several different aspects regarding the pattern of genetic variation of cells within a single individual tree. To clarify these differences, we introduce three indexes calculated based on phylogenetic distance, defined as the length of ancestral stem cell lineages between sampled cells (Iwasa et al., Reference Iwasa, Tomimoto and Satake2023). They depict different aspects of within-individual genetic diversity. Subsequently, we discuss how these indexes might contribute to various effects on evolutionary processes. Evaluating the evolutionary effects is closely related to population genetic theories developed for the evolution of sex, recombination, and mutation. In this article, the term ‘tree’ consistently refers to a large woody perennial plant typically having a single main stem or trunk. To indicate the evolutionary history and relationships of cells, we use the term ‘phylogeny.’
2. Background information
In this section, we summarize the basis for the discussion of the accumulation of somatic mutations and their patterns. First, we outline the characteristics of tree structures. Subsequently, we explain phylogenetic distance, defined as the length of ancestral phylogeny between two sampled cells until their common ancestral cell, which provides the basis for all the methods used in the remainder of the article.
2.1 Characteristics of tissue structure of trees
Here, we highlight characteristics of tree tissue structure, some of which are shared with modular animals (e.g., corals and sponges; Vasquez Kuntz et al., Reference Vasquez Kuntz, Kitchen, Conn, Vohsen, Chan, Vermeij, Page, Marhaver and Baums2022).
-
(i) No clear distinction between germ line and soma:
In plant tissues, somatic mutations accumulated during shoot elongation may contribute to reproductive organs and gametes (eggs and pollen), affecting the genetic content of offspring.
-
(ii) Shoot apical meristems on different branches:
In trees, each shoot (trunk or branch) contains a shoot apical meristem (SAM) with a small number of stem cells. The SAMs are physically separated and located at the tips of different branches. This structure facilitates independent accumulation of somatic mutations between branches, leading to their genetic differentiation.
-
(iii) Before-forking portion of ancestral cell lineages:
Stem cells undergo asymmetric cell division, creating both a successor stem cell and a differentiated cell. The differentiated cells then undergo a finite number of duplications, increasing in number and size to form a portion of a shoot. If the stem cells perform asymmetric cell division with a high probability, different stem cell lineages accumulate somatic mutations independently, leading to genetic heterogeneity of stem cells. The ancestral lineages of two different stem cells in the SAM have some distance between their common ancestor and the sampled location as illustrated in Figure 1. This is referred to as the ‘before-forking’ portion (Tomimoto et al., Reference Tomimoto, Satake and Iwasa2023). Its magnitude depends on the probability of failure in asymmetric cell division of stem cells (Iwasa et al., Reference Iwasa, Tomimoto and Satake2023).
-
(iv) Circular genetic differentiation around a shoot:
When a stem cell fails to leave its successor, the stem cell line is replaced by a copy of a neighbouring stem cell, as cell movement is constrained by the cell wall. This produces circular genetic differentiation of stem cells and leads to larger genetic diversity among them than when they are well mixed (Iwasa et al., Reference Iwasa, Tomimoto and Satake2023). Possible roles of the layer structure in the meristem have also been discussed (Klekowski et al., Reference Klekowski, Kazarinova-Fukshansky and Mohr1985; Pineda-Krch & Lehtilä, Reference Pineda-Krch and Lehtilä2002).
-
(v) Distinction between the main and lateral branches:
When a lateral branch is formed at a main branch, the stem cells of the axillary meristem of the lateral branch are sampled from the SAM of the main branch. In many trees, only a small fraction of stem cells in the main branch contribute to the axillary meristem (Tomimoto et al., Reference Tomimoto, Satake and Iwasa2023; Tomimoto & Satake, Reference Tomimoto and Satake2023). This results in an asymmetry between the two stem cell lineages sampled above the forking. As illustrated in Figure 1(D), the main branch maintains multiple ancestral cell lineages just below the forking, while cell lineages from the lateral branch coalesce at the forking. Main and lateral branches at a forking may be distinguished by comparing the genetic diversity among cells of the two branches.
2.2 Phylogenetic distance between cells
We consider the following situation: A shoot consists of cells derived from a small number of stem cells in the SAM located at the tip. A stem cell undergoes asymmetric cell division and produces one successor stem cell and one differentiated cell. The differentiated cells increase in number through a finite number of cell divisions, grow in cell size, form a portion of the shoot, and lift the SAM. This process continues and creates a shoot with the current SAM on the tip and the lower portion of the shoot reflecting an earlier genetic state of stem cells in the SAM (Iwasa et al., Reference Iwasa, Tomimoto and Satake2023).
All cells forming above-ground plant body are ultimately derived from stem cells in the SAM. As the shoot elongates, novel mutations accumulate. Hence, the genetic distance between two cells can be evaluated by the length of the ancestral lineage connecting them. Iwasa et al. (Reference Iwasa, Tomimoto and Satake2023) defined phylogenetic distance $D$ as the physical length along a branch, measured in units of cm or m. The genetic distance, representing the number of genomic differences between cells, is calculated by multiplying $D$ by the mutation rate per physical shoot length. Mutations occur due to errors in cell division and failures of DNA damage repair, with the rates proportional to the number of cell divisions and to the calendar time, respectively. We assume an infinite-site model where the same mutation does not occur in any stem cell, and no back mutation occurs. Comparing trees between fast and slow growing tropical species, Satake et al. (Reference Satake, Imai, Fujino, Tomimoto, Ohta, Na’iem, Indrioko, Widiyatno, Mollá–Morales, Nizhynska, Tani, Suyama, Sasaki and Kasahara2023) concluded that the mutation rate proportional to the number of calendar years was much more important than the other type.
While Iwasa et al. (Reference Iwasa, Tomimoto and Satake2023) discussed $D$ between two cells sampled from the same shoot, in this article, we consider the cases where the sampled cells are on different branches. Figure 1(A) illustrates an example of ancestral cell lineages connecting two sampled cells X and Y, located on different branches spliced from the main branch (trunk). The location labelled as fork[X, Y] represents the forking point between X and Y, where their ancestral cell lineages of cells X and Y converged to the same SAM. These two ancestral stem cells may or may not be identical, because a SAM contains multiple stem cells. If they are different, there is a distance between their common ancestor stem cell and fork[X, Y]. We call this portion as the ‘before forking’ segment (Tomimoto et al., Reference Tomimoto, Satake and Iwasa2023) and denote the length of the path between two ancestral stem cells at fork[X, Y] by $Bf\left(X,Y\right)$ . The physical distance between the common ancestor cell and fork[X, Y] is half of $Bf\left(X,Y\right)$ and was called ‘coalescent length’ between the two ancestral cells at fork[X, Y] (Iwasa et al., Reference Iwasa, Tomimoto and Satake2023)
The before-forking portion may extend to the base of the shoot. However, there is a possibility that their common ancestral stem cell exists in the middle of the shoot. This phenomenon, known as ‘coalescence’, can occur because a stem cell in the SAM sometimes fails to leave its successor stem cell. When such a failure occurs, another stem cell in the SAM duplicates to replenish the stem cell count. Consequently, a stem cell line is replaced by another, and the ancestral cell lineage of a focal stem cell shifts its location (Iwasa et al., Reference Iwasa, Tomimoto and Satake2023).
The phylogenetic distance between X and Y is defined as the length of their stem cell lineage since their common ancestor cell (Figure 1(A)), given by:
where the first term on the right-hand side represents the physical length along the branch between sampled cell X and the forking (see Iwasa et al., Reference Iwasa, Tomimoto and Satake2023 for definition of $L$ ). The second term represents a similar quantity for sampled cell Y. Detailed explanations are provided in Part A of the Supplemental Material (abbreviated as SM). The sum of the first and second terms on the right-hand side of Eq. (1) is the after-forking portion of the phylogenetic distance. The last term indicates the before-forking term, which increases with the genetic diversity of stem cells within the SAM.
Figure 1(B) illustrates the case of three sampled cells from different branches. The ancestral cell lineages of X and Y coalesce below fork[X, Y] by $\frac{1}{2} Bf\left(X,Y\right)$ . Their common ancestral cell lineage then continues downward. The forking between their common ancestral lineage and the ancestral cell lineage of Z is denoted by fork[Z, XY]. In Figure 1(B), the common ancestor of X and Y is above fork[Z, XY]. Coalescence of all three ancestral lineages occurs where the ancestral lineage of Z and the common ancestral lineage of X and Y coalesce. It is located below fork[Z, XY] by length $\frac{1}{2} Bf\left(Z, XY\right)$ . For further details, refer to Part A of the SM.
Figure 1(C) illustrates the case ancestral cell lineages of X and Y do not coalesce above fork[Z, XY]. The ancestral cell lineages have three different ancestral cells at fork[Z, XY]. We observe a coalescent process of these three cells. In Figure 1(C), the ancestral cell lineages of X and Z coalesce first, followed by the coalescence of their common ancestral lineage with the ancestral lineage of Y.
Figure 1(D) illustrates the asymmetry between the main and lateral branches. The ancestral cell lineages of Y and Z do not coalesce within the lateral branch but they coalesce where the lateral branch is spliced because the stem cells of the SAM of the lateral branch were derived from a single stem cell in the SAM of the main shoot. In contrast, the main branch keeps the diversity, as illustrated by that the ancestral cell lineage of X was kept separate from the common ancestral cell lineage of Y and Z.
In Part B of the SM, we delve into the variance in the number of genetic differences between two sampled cells. It originates from the Poisson distribution with a mean proportional to the phylogenetic distance and also from the uncertainty of the ancestral cell lineage. Because this article aims to illustrate the various facets of within-individual genetic variation, our emphasis lies on the mean values of the indexes in the main text.
3. Three indexes
In this section, we will explain three indexes that characterize different aspects of within-individual genetic variation.
3.1 Mean pairwise phylogenetic distance between reproductive organs $\overline{\boldsymbol{D}}$
A tree may have many reproductive organs, producing seeds or flowers, with the latter producing pollen to fertilize the seeds of other trees. Let $N$ be the number of reproductive organs of a tree. A reproductive organ may be an inflorescence, which may later develop into multiple fruits. The size of a reproductive organ determines the number of seeds it produces, which in turn affects the number of offspring. We represent relative sizes of these organs as ${\omega}_1,{\omega}_2,\dots, {\omega}_N$ , which satisfy ${\omega}_1+{\omega}_2+\dots +{\omega}_N=1$ . The phylogenetic distance between organs $i$ and $j$ is denoted as $D\left(i,j\right)$ , which is defined by Eq. (1) (see Figure 2(A)).
Here, we define the mean pairwise phylogenetic distance among all the reproductive organs of an individual tree as follows:
where $D\left(i,i\right)=0$ . This equals to the mean phylogenetic distance between two randomly chosen seeds. The genetic diversity is quantified by $\overline{D}$ given in Eq. (2), multiplied by the mutation rate per unit shoot length.
Cells from the focal tree undergo meiosis, become haploid gametes and experience syngamy with the gametes produced by other trees. Hence, tree structure with a larger $\overline{D}$ realizes genetically more diverse offspring.
3.2 Phylogenetic diversity of multiple stem cells of a tree $\boldsymbol{PD}$
Phylogenetic diversity (abbrev. $PD$ ) is defined as the sum of branch lengths of phylogeny (Figure 2(B)). It was first proposed by Faith (Reference Faith1992) as a measure of the importance of different species (or taxonomic units) in conservation biology, and has been adopted widely to evaluate the diversity of a group of multiple species (microbes; Sogin et al., Reference Sogin, Morrison, Huber, Welch, Huse, Neal, Arrieta and Herndl2006; Lauber et al., Reference Lauber, Hamady, Knight and Fierer2009). In the classical gene-genealogy theory, the sum of branch lengths of molecular phylogeny is calculated when those genomes are sampled from a single well mixed population. Watterson (Reference Watterson1975) defined the expected number of segregating sites, denoted by ${S}_n$ , as the product of mutation rate $m$ and the sum of branch lengths in gene phylogeny.
We apply $PD$ to quantify the genetic diversity of reproductive organs of a single individual tree, treating them as taxonomic units. Note that sampling was conducted in a population with geographic structures characterized by separation into multiple SAMs at different branches and a circular genetic structure within each SAM.
Both mean pairwise phylogenetic distance $\overline{D}$ and phylogenetic diversity $PD$ measure the genetic differences between reproductive organs and evaluate the variability among offspring (i.e., seeds and young trees). However, these two metrics emphasize different aspects of the genetic diversity among offspring. The phylogenetic diversity $PD$ is proportional to the total number of haplotypes included in the whole phylogeny, giving equal weight to genotypes that produce numerous seeds and those that produce a single seed. In contrast, the mean phylogenetic distance between organs $\overline{D}$ considers branches with many seeds more important than those with few sees.
Additionally, even when all the reproductive organs have equal weight ( ${\omega}_1={\omega}_2=\dots ={\omega}_n=1/N$ ), $\overline{D}$ and $PD$ assign different weights to various branches. Figure 3 illustrates a tree with eight reproductive organs ( $N=8$ ). In Figure 3(A), each branch has an importance proportional to the number of ancestral lineages connected two organs. Note that removing a branch results in two disconnected trees. If the number of organs included in these trees are $i$ and $j$ , the weight for the branch is proportional to $ij$ . In Figure 3(A), we indicated the importance of each branch in calculating $\overline{D}$ , with the values normalized to ensure their sum equals 100. In contrast, Figure 3(B) illustrates the equal importance of each branch for $PD$ , with the same value assigned to all branches. Comparing these two, we can conclude that $\overline{D}$ assigns more weight to branches in the central portion than those in the marginal portion of the tree.
3.2.1 Phylogenetic diversity of stem cells sampled at the same position
Figure 1(C) illustrates the case where the ancestral cells of three sampled cells, X, Y and Z, are different cells in the same SAM. We need to evaluate the sum of branch lengths of ancestral lineages of these three cells until the coalescence. In this section, we consider the phylogenetic diversity of more than two stem cells in the same SAM along the shoot.
Let $n$ be the number of stem cells in the SAM. They are arranged in a circular manner, with the nearest neighbours separated by an angle of $2\pi /n$ radians. The number of sampled stem cells is $m$ , and they are separated from their neighbors by ${k}_1,{k}_2,\dots, {k}_m$ ( ${k}_1+\dots +{k}_m=n$ ). If a stem cell fails to leave its successor stem cell, the location is filled by a copy of its nearest neighbor. This occurs at a rate $q$ per unit shoot length. Then the location of the ancestral lineage of a sampled cell shifts by one, either to the left or right. The mean phylogenic diversity of sampled stem cells, denoted by $PD\left({k}_1,\dots, {k}_m;y\right)$ , depends on $y$ , the distance of the sampled location from the base of the shoot.
In Part C of the SM, we derive a differential equation for $PD\left({k}_1,\dots, {k}_m;y\right)$ and obtain the solution considering the boundary conditions. Figure 4 illustrates how $PD\left({k}_1,\dots, {k}_3;y\right)$ increases with $y$ . In the limit of very large $y$ ( $y\to \infty$ ), we have the following solution:
which indicates the total sum of the coalescent lengths of $m$ stem cells if the sampled location is far from the base of the shoot. $PD$ is large when stem cells have a high probability of leaving their successor cells ( $q$ is small). It also depends on the intervals between the sampled stem cells $\left({k}_1,\dots, {k}_m\right)$ . We can rewrite Eq. (3) as follows:
where $\mathit{var}\left({k}_i\right)$ is the variance of the length of intervals: Phylogenetic diversity reaches its maximum when the sampled cells are equally spaced. When the stem cells are sampled with equal intervals ( $\mathit{var}\left({k}_i\right)=0$ ), $PD=\frac{1}{2q}{n}^2\left(1-\frac{1}{m}\right)$ holds.
When we sample just two cells ( $m=2$ ), $PD$ is the same as the phylogenetic distance $D=k\left(n-k\right)/q$ , which was the basis of index $\overline{D}$ calculated in the last section.
3.3 Parent-offspring distances ${\boldsymbol{D}}_{\boldsymbol{PO}}$
The genetic diversity generated by somatic mutations per generation contributes to the genetic variation of the whole population, which controls the speed of adaptation of traits under anthropogenetic environmental changes. The genetic variance of a quantitative trait in the entire population increases by mutations and decreases due to random genetic drift and stabilizing selection (Lande, Reference Lande1976, Reference Lande1981). The rate at which genetic variance is produced per generation is calculated based on the trait change from the parent to offspring, rather than the difference between offspring. Hence, to evaluate the contribution to the genetic variation of the population, we need to know the phylogenetic distance between parent and offspring (Figure 2(C)). However, both $\overline{D}$ and $PD$ only measure the genetic variation among offspring.
We here consider the parent-offspring phylogenetic distance as follows:
where $D\left( parent,i\right)$ is the phylogenetic distance between the parent genome and the ith reproductive organ: $D\left( parent,i\right)=L\left(\mathrm{base}\kern0.17em \mathrm{of}\kern0.17em \mathrm{the}\kern0.17em \mathrm{whole}\kern0.17em \mathrm{shoot},i\right)$ , which is simply equal to the physical length between the base of the whole shoot and the reproductive organs. ${D}_{PO}$ is independent of the genetic diversity of stem cells in the SAM, unlike the two other indexes.
These three indexes capture different aspects of within-individual genetic variation. This can be illustrated by examining a simple case. In Part D of the SM, we analysed the three indexes in a model tree with branching architecture studied in Tomimoto et al. (Reference Tomimoto, Satake and Iwasa2023), which exhibit different dependence on the number of stem cells in the SAM.
4. Discussion: evolutionary impacts
Trees exhibit several characteristic aspects of tissue structure that differ from many animals (unitary organisms). As a result, trees accumulate somatic mutations efficiently, which contribute to the genetic variability of offspring. The selective advantage of somatic mutations expressed in the current generation have been discussed: For example, somatic mutations can lead to diversification in defence chemicals between branches, reducing the harm of herbivores and pathogens (Antolin & Strobeck, Reference Antolin and Strobeck1985; Folse & Roughgarden, Reference Folse and Roughgarden2012; Gill et al., Reference Gill, Chao, Perkins and Wolf1995). In addition, somatic mutations can modify the genome of reproductive organs and the genomic composition of gametes (eggs and pollens), resulting in genetically diverse offspring (Schoen & Schultz, Reference Schoen and Schultz2019; Whitham & Slobodchikoff, Reference Whitham and Slobodchikoff1981). These may have diverse evolutionary effects. In this section, we will address these effects and discuss how three indexes for the genetic diversity of somatic mutations assess different evolutionary effects.
4.1. Generating genetic variations
We may ask whether we can justify the argument that the tissue structure of trees might have evolved as a result of promoting a high rate of mutations in the next generation.
In Part E of the SM, we briefly summarize the conclusions of population genetics theory for the evolution of mutation rates. In the simplest populations (constant environments, no spatial structure), the mutation rate controlled by neutral modifiers evolves towards lower values (Altenberg et al., Reference Altenberg, Liberman and Feldman2017; Karlin, Reference Karlin1975; Karlin & McGregor, Reference Karlin and McGregor1974; Liberman et al., Reference Liberman, Van Cleve and Feldman2011). In populations with various structures, a higher mutation rate can evolve. The key elements of these processes were originally studied in the context of the evolution of sex and recombination. Among them, processes such as environmental fluctuations (Ishii et al., Reference Ishii, Matsuda, Iwasa and Sasaki1989; Sasaki & Iwasa, Reference Sasaki and Iwasa1987), interactions with antagonistic species (Haraguchi & Sasaki, Reference Haraguchi and Sasaki1996, Reference Haraguchi and Sasaki1997; Sasaki & Haraguchi, Reference Sasaki and Haraguchi2000), spatial heterogeneity, and sibling competition (Bell, Reference Bell1982; Douge & Iwasa, Reference Douge and Iwasa2017a, Reference Douge and Iwasa2017b; Maynard Smith, Reference Maynard Smith1978; Williams, Reference Williams1975) have been shown to potentially favour the evolution of a high mutation rate.
Many of these theoretical studies discuss the evolution of the rate of switches between functional alleles that can confer advantages in different environments. However, in most situations, mutations arise due to errors in replication or repair, and they often result in producing many defective and malfunctional genes. It is very difficult for a high mutation rate to be advantageous in general circumstances, except for special mechanisms appearing in host–pathogen interactions (Metzgar & Willis, Reference Metzgar and Willis2000; Rosenberg et al., Reference Rosenberg, Thulin and Harris1998). The observed positive rate of mutation in genomes is considered to have evolved not because of the advantage of higher mutation rate but determined by the high cost of reducing the mutation rate further by investing in more accurate repair or duplication of the genome (Lynch, Reference Lynch2010, Reference Lynch2011).
Therefore, we can conclude that the tissue structure of trees has evolved not because of a high mutation rate but because of other reasons, such as photosynthetic efficiency, competition with neighbouring individuals, defence against pathogens or predators, mechanical strength, etc. Mutations produced by trees that contribute to the next generation are considered as ‘byproducts’ of the tissue structures.
Even if the tissue structure of trees evolved primarily for other reasons, examining the evolutionary effects quantitatively is important. Among many processes that may provide a fitness advantage for having a higher mutation rate, the two most promising ideas are (1) the antagonistic genotype-specific interaction with pathogens and herbivores, and (2) situations characterized by significant spatio-temporal variation and intense sib-competition (refer to Part E of the SM). These promote the advantage of producing a few diverse offspring, rather than having many similar offspring. It is worthy to quantify these fitness effects in the field. They are likely to be important for genes involved in antagonistic interactions with pathogens and herbivorous insects, especially in species with strong spatial variation and intense sib competition. These are plausible for trees with gravity-dispersed seeds (barochory). Additionally, separate from the fitness benefits to the parent tree, there exists (3) an effect of increasing the genetic variance of the whole population, which governs the population’s evolutionary capacity to respond to environmental changes.
4.2. Efficient elimination of deleterious mutations
An analysis of the next-generation sequencing data in trees concluded that somatic mutations behave as neutral variations during branch elongation but suffer deleterious effects when they form the next generations (Ally et al., Reference Ally, Ritland and Otto2010; Satake et al., Reference Satake, Imai, Fujino, Tomimoto, Ohta, Na’iem, Indrioko, Widiyatno, Mollá–Morales, Nizhynska, Tani, Suyama, Sasaki and Kasahara2023), which was different from the expectation of the population genetic theory (Otto & Orive, Reference Otto and Orive1995). This implies that somatic mutations are maintained without receiving strong selection within an individual tree, suggesting a novel theoretical idea for the evolutionary advantage of tree tissue structure: Trees might be able to effectively eliminate harmful deleterious genes, provided that the deleterious mutations work synergistically.
Mutation accumulation experiments suggested that the fitness of a genome including $k$ deleterious genes is represented as $\mathit{exp}\left[-\alpha k-\beta {k}^2\right]$ with $\alpha >0$ and $\beta >0$ (Mukai, Reference Mukai1969), implying that the effect of a few mutations is small, but with sufficiently many mutations, fitness declines rapidly in an accelerating manner (Maynard Smith, Reference Maynard Smith1978). This type of epistasis eliminates deleterious mutations without much reduction in the population’s mean fitness, as emphasized in the theory of the evolution of sex (Hurst & Peck, Reference Hurst and Peck1996; Kondrashov, Reference Kondrashov1988, Reference Kondrashov1993; Kondrashov & Crow, Reference Kondrashov and Crow1988).
We can compare a tree that accumulate somatic mutations over hundreds of years with a large genet of a clonal plant that extends through many small individuals (ramets). In the clonal plant, deleterious mutations are expressed each time a ramet is formed, as shown in seagrass (Yu et al., Reference Yu, Boström, Franzenburg, Bayer, Dagan and Reusch2020). In contrast, in tree, the mutations accumulate without being exposed to selection and receive strong selection only during the reproductive stages. Different branches of a tree may accumulate different numbers of deleterious mutations. If the mutations interact synergistically during reproduction, a small fraction of seeds may carry many deleterious mutations, leading to significantly reduced fitness, while others may not exhibit as much deleteriousness. This, coupled with strong sib competition, could efficiently remove deleterious mutations and improve the reproductive success of the parent tree compared with the corresponding clonal plant.
At this moment, empirical evidence for the extent of synergism was not decisive: Synergism was supported in experiments with Drosophila (Mukai, Reference Mukai1969) but not detected in a study with E. coli (Elena & Lensky, Reference Elena and Lensky1997). Empirical studies on the accumulation of somatic mutations in trees and clonal plants yielded varying conclusions: Ally et al. (Reference Ally, Ritland and Otto2010) reported a decline in fertility, Bobiwash et al. (Reference Bobiwash, Schultz and Schoen2013) estimated the accumulation of deleterious mutations in a long-lived clonal shrub. In contrast, Cruzan et al. (Reference Cruzan, Streisfeld and Schwoch2022) found within-individual selection in perennial herbs resulting in the production of advantageous mutations in some shoots, while Alejano et al. (Reference Alejano, Domínguez-Delmás, García-González, Wazny, Vázquez-Piqué and Fernández-Martínez2019) observed no correlation between the fitness of offspring and the age of the parent tree. More empirical studies are needed.
4.3 Indexes measuring different aspects of novel genetic variation
In this article, we discussed three indexes for the genetic diversity of somatic mutations within an individual tree. They quantify the amount of genetic diversity in different aspects.
The additive genetic variance of a population is crucial in determining its response to environmental changes. Somatic mutations in trees may increase the genetic variance of the population. To assess this effect, ${D}_{PO}$ is important as it represents the magnitude of parent-offspring distance produced per generation.
In contrast, to measure the contribution to the fitness of the parental tree, we need to count the number of surviving offspring relative to other trees in the same population. Both $\overline{D}$ and $PD$ measure the genetic diversity of offspring produced by each individual tree, rather than their effect on the entire population.
If we compare the two indexes, $\overline{D}$ and $PD$ , the phylogenetic distance $PD$ is proportional to the total number of haplotypes included in the whole phylogeny. It assigns equal importance to genotypes with different numbers of seeds. In contrast, the mean phylogenetic distance between organs $\overline{D}$ considers a branch with many seeds more important than another with only a few seeds.
In situations where the majority of seeds can survive and engage in intense sibling competition, the number of genetically distinct offspring becomes crucial, rendering multiple copies of the same genotype irrelevant. In such cases, $PD$ (phylogenetic diversity) holds more significance than $\overline{D}$ (mean pairwise distance). However, if a large fraction of offspring die early in their life before they start to compete with each other, a genotype existing as a single copy is likely to be eliminated stochastically. In such cases, producing many offspring of the same genotype makes sense to realize at least one individual can join in the sib competition. In such scenarios, $\overline{D}$ , which considers the relative number of seeds, may be more appropriate than $PD$ . For further argument, refer to Part F of the SM.
In a species-rich environment such as tropical rain forests, the specialist herbivores and pathogens are abundant near the parent tree (Janzen, Reference Janzen1970). Having the offspring with traits different from the parent might be more important than having offspring that differ among themselves. In such a case, ${D}_{PO}$ might be the most suited in evaluating the fitness contribution to the parent.
In short, the choice of the most suitable index for quantifying the genetic variation caused by somatic mutations should depend on which aspect of the many evolutionary effects one intends to evaluate.
To enhance our understanding of the role of tree tissue structure in accumulating somatic mutations, we require both theoretical studies to explore the evolutionary impacts of somatic mutations accumulated in trees and empirical research in the field that takes into account the success of all life stages, genetic structure, as well as both temporal and spatial variations.
Acknowledgements
We thank the following people for their very helpful comments: R. Hayashi, R. Imai, S.P Otto, E. Sasaki, H. Tachida, T. Yahara and R. Yamaguchi.
Conflict of interest
The authors declare that they have no conflict of interest.
Author contribution
YI, ST and AS conceptualized the study, developed the methodology, conducted formal analysis, created visualizations and wrote the article.
Data availability statement
The authors declare that the data supporting the findings of this study are available within the paper, and its supplementary information files. Because the research is entirely theoretical, no empirical data are available.
Supplementary material
The supplementary material for this article can be found at http://doi.org/10.1017/qpb.2024.9.
Financial support statement
This work was done in support of a Grant-in-Aid for Scientific Research for JSPS Fellows: 23KJ1722 to S.T. and Gants-in-aid for Scientific Research: 21H04781, 23H04965 and 23H04966 to A.S. (Japan Society for the Promotion of Sciences)
Comments
Editor, Quantitative Plant Biology
February 20, 2024
Dear Editor,
Attached is the manuscript that we submitted for the publication in Quantitative Plant Biology.
Title: Genetic diversity within a tree and alternative indexes for different evolutionary effects.
Authors:
Yoh Iwasa (email: [email protected])
Sou Tomimoto (email: [email protected])
Akiko Satake (email: [email protected])
Address:
Department of Biology, Faculty of Science, Kyushu University,
744 Motooka, Nishi-ku, Fukuoka 819-0395, Japan
Corresponding author: Yoh Iwasa
We prepared this manuscript in response to an invitation letter from Alison Paskins to Akiko Satake.
Trees, living for centuries, accumulate somatic mutations in their growing trunks and branches, causing genetic divergence within a single tree. Stem cell lineages in a shoot apical meristem accumulate mutations independently and diverge from each other. In plants, somatic mutations can alter the genetic composition of reproductive organs and gametes, impacting future generations.
To evaluate the genetic variation among a tree’s reproductive organs, we consider three indexes: mean pairwise phylogenetic distance (D ̅), phylogenetic diversity (PD; sum of branch lengths in molecular phylogeny), and parent-offspring phylogenetic distance (D_PO). Because mutants generally contain many non-functional genes, natural selection does not lead to an increase in the mutation rate.
The tissue architecture of trees evolved primarily for reasons such as photosynthetic efficiency and mechanical strength, but as a secondary consequence, it facilitated the accumulation of somatic mutations. Somatic mutations have various evolutionary effects, including enhancing fitness under strong sib competition and intense host-pathogen interactions, efficiently eliminating deleterious mutations through epistasis, and increasing genetic variance in the population.
We conclude that choosing appropriate indexes to measure the genetic diversity of somatic mutations depends on the specific aspect of evolutionary influence being assessed.
This work is not under consideration for publication in another journal or book. Our submission for publication has been approved by all authors and institutions. All persons entitled to authorship have been so named. All authors have seen and agreed to the submitted version of the manuscript.
Best wishes,
Sincerely yours,
Yoh Iwasa