1 Introduction
Perturbation theory and Sturm–Liouville operators have a rich history of interdependence; perturbation-theoretic ideas are often tested on these well-known ordinary differential operators, and advances in special classes of operators (such as one-dimensional Schrödinger operators with various potentials) often require new perturbation results. In particular, the works of Aronszajn and Donoghue, responsible for well-known results on the spectra of rank-one self-adjoint perturbation problems, were motivated by applications to Sturm–Liouville differential equations [Reference Aronszajn5, Reference Donoghue16].
However, the current formulation of most perturbation results makes them only suitable for a select class of Sturm–Liouville operators—those for which any limit-circle endpoints are regular—despite the fact that many classical operators do not fall into this category. The current manuscript aims to expand the classical setup in which all self-adjoint extensions of a regular Sturm–Liouville operator are represented by an additive perturbation, to the case where the Sturm–Liouville operator can have more singular limit-circle endpoints. This leads to many new possibilities for the spectral analysis of such operators, as will be explored later.
In order to give a frame of reference, we briefly present a slight variation of the well-known regular endpoint setup employed in [Reference Simon49, Section 11.6] that yields a rank-one perturbation problem.
Example 1.1 Let $\ell [\cdot ]$ be a Sturm–Liouville operator defined by equation (2.2) on $L^2[(a,b),w(x)]$ with the endpoint a regular and the endpoint b in the limit-point case. For simplicity, we will assume that the symmetric operator $\{\ell ,\mathcal {D}_{\scriptstyle \mathrm {min}}\}$ is positive for the moment. Note that any second-order differential equation with real coefficients on $[a,b]$ that are sufficiently differentiable can be expressed as a Sturm–Liouville operator [Reference Naimark45, Section 1.5]. Such an operator requires a single boundary condition at $x=a$ which can be written, for $f\in L^2[(a,b),w(x)]$ and $0\leq {\vartheta }<\pi $ , as
Hence, ${\vartheta }=0$ corresponds to the Dirichlet boundary condition and ${\vartheta }=\pi /2$ to the Neumann boundary condition. Let $\mathcal {D}_{\scriptstyle \mathrm {max}}$ denote the maximal domain associated with $\ell $ , those functions in $L^2[(a,b),w(x)]$ that $\ell $ maps back into $L^2[(a,b),w(x)]$ (see Definition 2.1). Then, the operator $\ell _{{\vartheta }}$ defined by acting via $\ell $ on the domain
is easily seen to be self-adjoint for any fixed ${\vartheta }$ .
Let $f\in \mathcal {D}_{{\vartheta }}$ be real-valued. Then, integration by parts yields
where equation (1.1) and the fact that $x=b$ is in the limit-point case were used to simplify expressions and $\tau $ stands for the quadratic form defined by the first two summands in the middle line. Let $\boldsymbol {A}$ be the ${\vartheta }=\pi /2$ Neumann boundary condition operator, and let $\delta _a$ be the Dirac delta function with singularity at $x=a$ . Then,
corresponds to $\ell _{{\vartheta }}=\boldsymbol {A}_{-\cot ({\vartheta })}$ via equation (1.2). Since $x=a$ is a regular endpoint, the distribution $\delta _a$ gives rise to a well-defined linear functional acting on the maximal domain $\mathcal {D}_{\scriptstyle \mathrm {max}}$ thanks to Sobolev embedding [Reference Simon49, Section 11.6]. However, $\delta _a$ is a distribution and clearly does not belong to $\mathcal {H}=L^2[(a,b),w(x)]$ . We conclude that the perturbation in (1.3) is singular form-bounded in the sense of [Reference Albeverio and Kurasov2], i.e., $\delta _a\in \mathcal {H}_{-1}(\boldsymbol {A})\setminus \mathcal {H}$ . See Section 2.3 for a definition of the scale of Hilbert spaces $\mathcal {H}_{s}(\boldsymbol {A})$ for $s\in {\mathbb R}$ and Remark 3.7 for an explanation of how the case of “infinite coupling” is handled. We direct readers interested in very general $p(x)$ or $w(x)$ to [Reference Kurasov36].
One of the contributions of the current manuscript is to extend equation (1.3) to semibounded Sturm–Liouville differential equations with general limit-circle endpoint(s) (see Section 3). In general, such endpoints are much more difficult to handle than regular ones due to the fact that functions in the maximal domain may blow up near the endpoints. This results in more complicated boundary conditions to ensure self-adjointness, as typical Dirichlet and Neumann boundary conditions are not rigorously defined. Recent analysis of boundary conditions and the m-function in the limit-circle case, in general and for examples, can be found in [Reference Eckhardt, Gesztesy, Nichols and Teschl18, Reference Everitt and Kalf20, Reference Gesztesy, Littlejohn and Nichols26, Reference Gesztesy and Zinchenko29, Reference Gesztesy and Zinchenko30, Reference Kurasov and Luger38], and further historical results can be found in [Reference Brown and Hinton9, Reference Bulla and Gesztesy11, Reference Fulton25, Reference Marletta and Zettl44, Reference Niessen and Zettl46, Reference Titchmarsh52]. A direct approach, adapted from the above setup for regular endpoints, is therefore not successful. Instead, a boundary triple [Reference Bruk10, Reference Calkin12–Reference Derkach and Malamud14, Reference Gorbachuk and Gorbachuk31, Reference Kochubei33] and a compatible boundary pair [Reference Arlinskii4, Reference Lyantse and Storozh43] are constructed for such an operator with limit-circle endpoint(s). This framework is an application of methods developed in the recent book [Reference Behrndt, Hassi and de Snoo7], and we invite the reader to read the rich history of contributions to these fields in its well-written introduction. A suitable amount of background on these objects for our task can be found for convenience in Section 2.2. Some knowledge of semibounded forms associated with semibounded self-adjoint operators is assumed, although a few facts can also be found in Section 2.2 and their explicit link with the perturbation setup is described in Remark 3.7. Further comments on general connections between the boundary triple and boundary pair framework and setting up perturbation problems are made in Section 3.3.
The boundary triple and boundary pair framework introduces one difference from the classical perturbation theory that should be commented upon. The adapted rank-two perturbation setup in Theorem 3.6 does not rely on $2\times 2$ Hermitian matrices as a parameter, which we denote $\Theta $ , but rather $2\times 2$ self-adjoint relations. This divergence from the classical theory arises naturally, and we usually assume that we are in the special case where the linear relation is a matrix. The price to pay for this assumption is that we are unable to claim that some spectral results apply to all self-adjoint extensions. However, there is a great deal of information able to be discerned from these matrix-valued spectral measures that scalar-valued spectral measures do not capture. In particular, separated (or local) boundary conditions, which correspond to $\Theta $ being a diagonal matrix, have simple eigenvalues when there are two limit-circle endpoints [Reference Zettl55], but the richer context of connected (or nonlocal) boundary conditions can give rise to eigenvalues of multiplicity two.Footnote 1 This discrepancy is a main motivation for the investigation and our attempts to go deeper than classical results like [Reference Zettl55, Theorem 10.6.3].
Boundary triples naturally define a self-adjoint extension with the domain being functions in the kernel of the map $\Gamma _0$ . This self-adjoint extension is not the unperturbed operator ${\boldsymbol {A}}_0$ in our setup though. Indeed, the domain of ${\boldsymbol {A}}_0$ is transversal with the natural self-adjoint extension defined by the boundary triple. Fortunately, this difference is not an impediment to the analysis of the operators, as they are intimately related. More details on how the two methods start from “opposite” self-adjoint extensions can be found in Remark 4.7. Comparing and contrasting the strengths of these two methods is one of the central objectives of the paper.
Another advantage of using boundary triples is that “infinite coupling” in the perturbation parameter is easily handled, and corresponds to the cases where $\Theta $ is a $2\times 2$ self-adjoint relation but not a Hermitian matrix. Such cases technically fall outside the definition of form-bounded perturbations, but can be given rigorous meaning by appealing to the semibounded forms associated with the relevant self-adjoint extensions (see Remark 3.7 for more information).
The newly expanded perturbation setup is also utilized to gain new insight about the classical Jacobi differential equation in Section 4, and builds upon some of the recent results in [Reference Frymark22]. Notably, the Weyl-m function for important self-adjoint extensions was determined in [Reference Frymark22] and utilized as a starting point for the spectral analysis here. Our attention is focused on this operator for many reasons: It is extremely well studied [Reference Bailey, Everitt and Zettl6, Reference Bochner8, Reference Everitt19, Reference Frymark and Liaw23, Reference Ismail32, Reference Krall34, Reference Szegö51], contains many interesting examples as special choices of its parameters, and is limit-circle at both endpoints. It is also a change of variables away from the hypergeometric differential equation (a fact which is exploited heavily), and every second-order ordinary differential equation with at most three regular singular points can be transformed into the hypergeometric differential equation. Analysis of the Jacobi equation thus practically covers all semibounded Sturm–Liouville equations with two limit-circle endpoints. It should be noted that such Sturm–Liouville equations, including Jacobi, are guaranteed to have only discrete spectrum [Reference Bailey, Everitt and Zettl6, Reference Zettl55].
This leaves many opportunities for further investigation into the absolutely continuous spectrum of self-adjoint extensions, which may occur when a Sturm–Liouville operator has, e.g., one limit-circle endpoint and one limit-point endpoint.
In cases where the perturbation parameter $\Theta $ is a Hermitian $2\times 2$ matrix or a special linear relation, several spectral properties are analyzed. The matrix-valued weights of point masses for the unperturbed operator, multiplicity of eigenvalues, and eigenvectors from the space $L^2(\mu ^{\Theta })$ , where $\mu ^{\Theta }$ is the matrix-valued spectral measure of the perturbed operator, are all determined explicitly. (Note that the definition of the space $L^2(\mu ^{\Theta })$ becomes clear through its inner product given in equation (4.13).) Most of these properties are obtained by looking at explicit Weyl-m functions that are features of the boundary triple framework and spectral results from [Reference Gesztesy and Tsekanovskii28]. Theorem 4.9 refines some observations on multiplicity/degeneracy of eigenvalues from [Reference Zettl55]. Perturbation results from [Reference Liaw and Treil39, Reference Liaw and Treil41] further assist with the analysis of the spectrum and allow for the tracing of eigenfunctions through the perturbation in Corollary 4.18.
The spectral analysis of a wide array of operators, including one-dimensional Schrödinger operators with $L^1_{\scriptstyle \mathrm {loc}}$ potentials, can benefit from our results. Largely unexplored applications include higher-order operators.
1.1 Roadmap of results
Figure 1 informally summarizes several key results in this paper. While some of these results are obtained via boundary triples, other are proved using perturbation theory. It is this merging of theories that allows us to obtain such a complete picture of the situation. Clearly, some of the symbols are not yet defined. And so, we refer the reader to the respective theorems, equations, and corollaries to find the exact meaning of the symbols. The solid arrows in this diagram do not necessarily represent explicit maps from the objects that they originate at to the object that they terminate at. Rather, they represent the flow of influence and information.
1.2 Notation
Notation such as $\mathbf {L}_{\scriptstyle \mathrm {max}}=\{\ell ,\mathcal {D}_{\scriptstyle \mathrm {max}}\}$ is used to say that the operator $\mathbf {L}_{\scriptstyle \mathrm {max}}$ acts via $\ell $ on the domain $\mathcal {D}_{\scriptstyle \mathrm {max}}$ . Operators are often written in bold to help distinguish them from linear relations where there may be confusion.
The symbol ${\boldsymbol {\alpha }}$ is used to denote the rank-one perturbation parameter, whereas ${\alpha }$ and $\beta $ are reserved as parameters for the Jacobi differential operator in Section 4.
Finally, the word “singular” is used in two ways in this paper: to signify when a perturbation vector is taken from outside of the Hilbert space natural to the problem (see Section 2.3) and when an endpoint of a Sturm–Liouville equation is in the limit-circle or limit-point case (see the end of Section 2.1). With regard to endpoints, the limit-point case is not of interest, as it does not require a boundary condition.
Note that $\infty $ is used as a subscript in Sections 3 and 4 to refer to specific self-adjoint extensions that are related to infinite coupling from the perturbation-theoretic viewpoint. The most specific information can be found in Remarks 3.5 and 3.7.
2 Background
Consider the classical Sturm–Liouville differential equation
where $p(x),w(x)>0$ a.e. on $(a,b)$ and $q(x)$ real-valued a.e. on $(a,b)$ , with $a,b\in \mathbb {R}\cup \{\pm \infty \}$ . Furthermore, let $1/p(x),q(x),w(x)\in L^1_{\scriptstyle \mathrm {loc}}[(a,b),dx]$ . Additional details about Sturm–Liouville theory can be found, e.g., in [Reference Akhiezer and Glazman1, Reference Bailey, Everitt and Zettl6, Reference Everitt19, Reference Gesztesy and Zinchenko30, Reference Zettl55]. The differential expression can be viewed as a linear operator, mapping a function f to the function $\ell [f]$ via
This unbounded operator acts on the Hilbert space $L^2[(a,b),w]$ , endowed with the inner product $ \langle f,g\rangle :=\int _a^b f(x)\overline {g(x)}w(x)dx. $ In this setting, the eigenvalue problem $\ell [f](x)=\lambda f(x)$ can be considered. However, the operator acting via $\ell [\,\cdot \,]$ on $L^2[(a,b),w]$ is, in general, not self-adjoint a priori; additional boundary conditions may be required to ensure this property.
We assume throughout the manuscript that both a and b are nonoscillatory (see, e.g., [Reference Zettl55] for more information).
2.1 Extension theory
We begin by presenting the classical self-adjoint extension theory for symmetric operators as applied to ordinary differential operators. The classical references [Reference Akhiezer and Glazman1, Reference Naimark45] should be consulted for further details. Let $\ell $ be a Sturm–Liouville differential expression. It is important to reiterate that the analysis of self-adjoint extensions does not involve changing the differential expression associated with the operator at all, merely the domain of definition by applying boundary conditions.
Definition 2.1 [Reference Naimark45, Section 17.2]
The maximal domain of $\ell [\,\cdot \,]$ is given by
The designation of “maximal” is appropriate in this case because $\mathcal {D}_{\scriptstyle \mathrm {max}}(\ell )$ is the largest possible subset of functions from the Hilbert space on which the application of $\ell $ makes sense and that is mapped back into the Hilbert space by this application. For $f,g\in \mathcal {D}_{\scriptstyle \mathrm {max}}(\ell )$ and $a<a_0\le b_0<b$ , the sesquilinear form associated with $\ell $ is given by
Each of these values on the left-hand side can be seen to exist individually (and not just as a difference) by taking limits.
Theorem 2.1 [Reference Naimark45, Section 17.2]
The limits $[f,g](b):=\lim _{x\to b^-}[f,g](x)$ and $[f,g](a):=\lim _{x\to a^+}[f,g](x)$ exist and are finite for $f,g\in \mathcal {D}_{\scriptstyle \mathrm {max}}(\ell )$ .
The equation (2.3) is Green’s formula for $\ell [\,\cdot \,]$ , and in the case of Sturm–Liouville operators, it can be explicitly computed using integration by parts to be the modified Wronskian
Definition 2.2 [Reference Naimark45, Section 17.2]
The minimal domain of $\ell [\,\cdot \,]$ is given by
The maximal and minimal operators associated with the expression $\ell [\,\cdot \,]$ are then defined as $\mathbf {L}_{\scriptstyle \mathrm { min}}=\{\ell ,\mathcal {D}_{\scriptstyle \mathrm {min}}\}$ and $\mathbf {L}_{\scriptstyle \mathrm {max}}=\{\ell ,\mathcal {D}_{\scriptstyle \mathrm { max}}\}$ , respectively. By [Reference Naimark45, Section 17.2], these operators are adjoints of one another, i.e., $(\mathbf { L}_{\scriptstyle \mathrm { min}})^*=\mathbf {L}_{\scriptstyle \mathrm {max}}$ and $(\mathbf {L}_{\scriptstyle \mathrm {max}})^*=\mathbf { L}_{\scriptstyle \mathrm {min}}$ . The operator $\mathbf {L}_{\scriptstyle \mathrm {min}}$ is thus symmetric.
Note that the self-adjoint extensions of a symmetric operator coincide with those of the closure of the symmetric operator [Reference Dunford and Schwartz17, Theorem XII.4.8], so without loss of generality we assume that $\mathbf {L}_{\scriptstyle \mathrm {min}}$ is closed.
Definition 2.3 (Variation of [Reference Naimark45, Section 14.2])
Define the positive defect space and the negative defect space, respectively, by
The dimensions dim $(\mathcal {D}_+)=m_+$ and dim $(\mathcal {D}_-)=m_-$ , called the positive and negative deficiency indices of $\mathbf { L}_{\scriptstyle \mathrm {min}}$ , respectively, will play an important role. They are usually conveyed as the pair $(m_+,m_-)$ . The symmetric operator $\mathbf {L}_{\scriptstyle \mathrm {min}}$ has self-adjoint extensions if and only if its deficiency indices are equal [Reference Naimark45, Section 14.8.8].
The classical von Neumann formula then says that
The decomposition can be made into an orthogonal direct sum by using the graph norm (see [Reference Fleeman, Frymark and Liaw21]).
If the operator $\mathbf {L}_{\scriptstyle \mathrm {min}}$ has any self-adjoint extensions, then the deficiency indices of $\mathbf {L}_{\scriptstyle \mathrm {min}}$ have the form $(m,m)$ , where $0\leq m\leq 2$ [Reference Naimark45, Section 14.8.8]. Hence, Sturm–Liouville expressions that generate self-adjoint operators must have deficiency indices $(0,0)$ , $(1,1)$ , or $(2,2)$ . If a differential expression is either in the limit-circle case or regular at the endpoint a, it requires a boundary condition at a. If it is in the limit-point case at the endpoint a, it does not require a boundary condition. The analogous statements are true at the endpoint b.
Sturm–Liouville differential expressions are rather well understood (see, e.g., [Reference Bailey, Everitt and Zettl6, Reference Everitt19] for encyclopedic references), so the deficiency indices are well known in almost all cases of interest. Representative examples are the Jacobi operator, with deficiency indices $(2,2)$ , the Laguerre operator, with indices $(1,1)$ , and the Hermite operator, with indices $(0,0)$ . The sources [Reference Bailey, Everitt and Zettl6, Reference Gesztesy and Zinchenko30, Reference Niessen and Zettl46, Reference Weyl54, Reference Zettl55] can also be consulted for more information on the classification of endpoints.
2.2 Boundary triples and pairs
Boundary triples and pairs provide additional ways to tackle boundary value problems, and are intimately connected. Most of the material from this subsection is taken from the excellent open-source book [Reference Behrndt, Hassi and de Snoo7], which should be consulted for more details and the history of contributions. The presentation here is a bit simpler, as it is formulated for operators instead of the more general linear relations.
Definition 2.4 [Reference Behrndt, Hassi and de Snoo7, Definition 2.1.1]
Let S be a closed symmetric operator in a Hilbert space $\mathcal {H}$ . Then, $\{\mathcal {G},\Gamma _0,\Gamma _1\}$ is a boundary triple for $S^*$ if $\mathcal {G}$ is a Hilbert space and $\Gamma _0,\Gamma _1:\operatorname {\mathrm {dom}} S^*\to \mathcal {G}$ are surjective linear maps and the Green’s identity
holds for all $f,g\in \operatorname {\mathrm {dom}} S^*$ .
Notice that when S is a Sturm–Liouville differential operator, the left-hand side of equation (2.5) is just the sesquilinear form given in equation (2.3) when $a_0=a$ and $b_0=b$ .
Definition 2.5 [Reference Behrndt, Hassi and de Snoo7, Definition 1.4.9]
Let S be a symmetric operator in $\mathcal {H}$ , and let ${\lambda }\in \mathbb {C}$ . The space
is called the defect subspace of S at the point ${\lambda }\in \mathbb {C}$ .
The usual positive and negative defect spaces from Definition 2.3 are then simply $\mathfrak {N}_i(\mathbf {L}_{\scriptstyle \mathrm { max}})$ and $\mathfrak {N}_{-i}(\mathbf {L}_{\scriptstyle \mathrm {max}})$ , respectively. It should be clear that boundary triples are usually not unique. Indeed, given a self-adjoint extension H of a closed symmetric operator S, the adjoint $S^*$ can be decomposed into a direct sum of $\operatorname {\mathrm {dom}}(H)$ and a defect space. This allows maps $\Gamma _0$ and $\Gamma _1$ to be defined so that $\operatorname {\mathrm {dom}}(H)=\ker (\Gamma _0)$ and $\{\mathcal {G},\Gamma _0,\Gamma _1\}$ form a boundary triple (see [Reference Behrndt, Hassi and de Snoo7, Theorem 2.4.1]). The structure also allows the definition of a Weyl m-function, an important object for spectral theory.
Definition 2.6 [Reference Behrndt, Hassi and de Snoo7, Definitions 2.3.1 and 2.3.4]
Let S be a closed symmetric operator in a complex Hilbert space $\mathcal {H}$ , let $\{\mathcal {G},\Gamma _0,\Gamma _1\}$ be a boundary triple for $S^*$ , and let $A_0=\ker \Gamma _0$ . Then,
where $\upharpoonright $ denotes the restriction, is called the Weyl m-function associated with the boundary triple $\{\mathcal {G},\Gamma _0,\Gamma _1\}$ .
Another way that boundary triples are useful is that they provide a description of all self-adjoint extensions of closed symmetric operators with finite deficiency indices in terms of linear relations.
Definition 2.7 [Reference Behrndt, Hassi and de Snoo7, Section 1.1]
Let $\mathfrak {H}$ and $\mathfrak {K}$ be Hilbert spaces over $\mathbb {C}$ . A linear subspace of $\mathfrak {H}\times \mathfrak {K}$ is called a linear relation H from $\mathfrak {H}$ to $\mathfrak {K}$ , and the elements $\widehat {h}\in H$ will, in general, be written as pairs $\{h,h'\}$ with components $h\in \mathfrak {H}$ and $h'\in \mathfrak {K}$ . If $\mathfrak {H}=\mathfrak {K}$ , then we will just say H is a linear relation in $\mathfrak {H}$ .
Let $\{\mathbb {C}^{m},\Gamma _0,\Gamma _1\}$ be a boundary triple for a symmetric minimal operator $\boldsymbol {A}_{\scriptstyle \mathrm {min}}$ , defined on $\mathcal {D}_{\scriptstyle \mathrm {min}}$ , with deficiency indices $(m,m)$ . Then, self-adjoint extensions $\boldsymbol {A}_{ {}_{\scriptstyle \Theta }}\subset \boldsymbol {A}_{\scriptstyle \mathrm {max}}$ are in one-to-one correspondence with the self-adjoint relations $\Theta $ in $\mathbb {C}^{m}$ via
The self-adjoint linear relation can be made more practical using a decomposition such as equation (4.17). In the special case that ${\vartheta }$ is an $m\times m$ matrix, the boundary condition for the domain of $\boldsymbol {A}_{ {}_{\scriptstyle \Theta }}$ can be written as
Let S be a semibounded self-adjoint operator with lower bound $m(S)$ . Recall that there is a natural way to identify S with a closed semibounded form $\mathfrak {t}$ in $\mathcal {H}$ with lower bound $m(\mathfrak {t})=m(S)$ via the First and Second Representation Theorems (see [Reference Behrndt, Hassi and de Snoo7, Theorem 5.1.18] and [Reference Behrndt, Hassi and de Snoo7, Theorem 5.1.23]). Let ${\varphi }\in \operatorname {\mathrm {dom}} (S)$ , $\psi \in \operatorname {\mathrm {dom}}(\mathfrak {t})$ , and $\gamma <m(S)$ , where
Equivalently, if ${\varphi },\psi \in \operatorname {\mathrm {dom}}(\mathfrak {t}_S)$ , then
The space $\operatorname {\mathrm {dom}}(\mathfrak {t}_S)$ endowed with the inner product
is a Hilbert space, denoted by $\mathcal {H}_{ {}_{\scriptstyle \mathfrak {t}_{S-\gamma }}}$ .
More general semibounded symmetric operators S (i.e., the minimal operator of a Sturm–Liouville expression) also have a form associated with them via
The semibounded self-adjoint operator $S_F$ associated with the closure of the form $\mathfrak {t}_S$ in equation (2.6) is called the Friedrichs extension of S (see [Reference Behrndt, Hassi and de Snoo7, Definition 5.3.2]).
Definition 2.8 [Reference Behrndt, Hassi and de Snoo7, Definition 1.7.6]
Let S be a closed symmetric operator in $\mathcal {H}$ . If H and K are closed extensions such that $\operatorname {\mathrm {dom}}(S)\subset \operatorname {\mathrm {dom}}(H),\operatorname {\mathrm {dom}}(K)\subset \operatorname {\mathrm {dom}}(S^*)$ , then they are called disjoint if $\operatorname {\mathrm {dom}}(H)\cap \operatorname {\mathrm {dom}}(K)=\operatorname {\mathrm {dom}}(S)$ and transversal if they are disjoint and $\operatorname {\mathrm {dom}}(H)\cup \operatorname {\mathrm {dom}}(K)=\operatorname {\mathrm {dom}}(S^*)$ .
Finally, we are able to state the definition of boundary pair. These pairs will help access the semibounded closed forms associated with extensions when a boundary triple has already been constructed.
Definition 2.9 [Reference Behrndt, Hassi and de Snoo7, Definition 5.6.1]
Let S be a closed symmetric operator in $\mathcal {H}$ , and let $S_1$ be a semibounded self-adjoint extension of S with lower bound $m(S_1)$ such that $S_1$ and the Friedrichs extension $S_F$ are transversal. Let $\mathfrak {t}_{ {}_{\scriptstyle S_1}}$ be the closed form associated with $S_1$ , and let
be the corresponding Hilbert space. A pair $\left \{\mathcal {G},\Lambda \right \}$ is called a boundary pair for S corresponding to $S_1$ if $\mathcal {G}$ is a Hilbert space and $\Lambda \in \mathbf {B}\left (\mathcal {H}_{ {}_{\scriptstyle \mathfrak {t}_{S_1-\gamma }}},\mathcal {G}\right )$ satisfies
In order to formulate when boundary triples and pairs are related, it is necessary to first define what it means to extend an operation in our context.
Definition 2.10 [Reference Behrndt, Hassi and de Snoo7, Definition 5.6.3]
Let S be a closed semibounded operator in $\mathcal {H}$ , and let $\left \{\mathcal {G},\Gamma _0,\Gamma _1\right \}$ be a boundary triple for $S^*$ with $A_0=\ker (\Gamma _0)$ . Let $S_1$ be a semibounded self-adjoint extension of S such that $S_1$ and $S_F$ are transversal, so that $\operatorname {\mathrm {dom}}(S^*)\subset \operatorname {\mathrm {dom}}(\mathfrak {t}_{ {}_{\scriptstyle S_1}})$ . Then, an operator $\Lambda \in \mathbf {B}(\mathcal {H}_{ {}_{\scriptstyle \mathfrak {t}_{S_1-\gamma }}},\mathcal {G})$ is said to be an extension of $\Gamma _0$ if
Definition 2.11 [Reference Behrndt, Hassi and de Snoo7, Definition 5.6.4]
Let S be a closed semibounded operator in $\mathcal {H}$ , and let $\left \{\mathcal {G},\Gamma _0,\Gamma _1\right \}$ be a boundary triple for $S^*$ with
Let $S_1$ be a semibounded self-adjoint extension of S such that $S_1$ and $S_F$ are transversal, and let $\left \{\mathcal {G},\Lambda \right \}$ be a boundary pair for S corresponding to $S_1$ . Then, $\left \{\mathcal {G},\Gamma _0,\Gamma _1\right \}$ and $\left \{\mathcal {G},\Lambda \right \}$ are said to be compatible if $\Lambda $ is an extension of $\Gamma _0$ and the self-adjoint relations $A_1$ and $S_1$ coincide.
Lemma 5.6.5 of [Reference Behrndt, Hassi and de Snoo7] adds that in the special case when the defect indices of S are finite and $\Lambda \in \mathbf { B}(\mathcal {H}_{ {}_{\scriptstyle \mathfrak {t}_{S_1-\gamma }}},\mathcal {G})$ is an extension of $\Gamma _0$ , then $A_0=\ker (\Gamma _0)=S_F$ and $\left \{\mathcal {G},\Lambda \right \}$ is a boundary pair for S corresponding to $S_1$ . If, in this case, also $A_1$ and $S_1$ coincide, then $\left \{\mathcal {G},\Gamma _0,\Gamma _1\right \}$ and $\left \{\mathcal {G},\Lambda \right \}$ are compatible.
As will be shown in Section 3, there is a natural way to create a boundary triple for Sturm–Liouville operators. A compatible boundary pair will then help set up a perturbation problem by accessing the form domain $\mathfrak {t}_{S_1-\gamma }$ and defining a bounded map $\Lambda $ that acts like $\Gamma _0$ when applied to functions from $\operatorname {\mathrm {dom}}(S^*)$ .
2.3 Finite rank perturbation theory
Let $\boldsymbol {A}$ and $\boldsymbol {T}$ be (possibly unbounded) self-adjoint operators on a separable Hilbert space $\mathcal {H}$ . In general perturbation theory, we are interested in the following question: If we know the properties of operator $\boldsymbol {A}$ well, what can we say about the formal operator $\boldsymbol {A}+\boldsymbol {T}$ ? In our application to Sturm–Liouville operators, both $\boldsymbol {A}$ and $\boldsymbol {T}$ will be unbounded, although $\boldsymbol {A}$ will be bounded from below and $\boldsymbol {T}$ will be of finite rank, i.e., has finite-dimensional range.
On the side, we note that the theory of rank-one perturbations has been flourishing for several decades. Most prominent are the results by Aronszajn [Reference Aronszajn5] and Donoghue [Reference Donoghue16] on the mutual singularity of singular parts, by Aleksandrov [Reference Aleksandrov3] on averaging/disintegration of the spectral measures, and by Poltoratskiĭ [Reference Poltoratskiĭ47] on the existence of nontangential boundary values almost everywhere with respect to the singular parts. Recently, progress has been made on the finer properties of the singular parts by one of the current authors and Treil (see [Reference Liaw, Treil, Pereyra, Marcantognini, Stokolos and Urbina40] for a survey). An application of rank-one perturbations leads to the famous Simon–Wolff criterion [Reference Simon and Wolff50] restricting the singular parts of the discrete random Schrödinger operator.
Returning to our agenda of rigorously defining finite rank perturbations, by the closed graph theorem, we know that the unbounded operators $\boldsymbol {A}$ and $\boldsymbol {T}$ are only defined on a dense subset of $\mathcal {H}$ . If the operators are completely unrelated, it can happen that the intersection of their domains is empty. In this case, $\boldsymbol {A}+\boldsymbol {T}$ would be meaningless. In our application to Sturm–Liouville operators, it turns out that $\boldsymbol {T}$ is relatively bounded with respect to $\boldsymbol {A}$ , so that the domain of $\boldsymbol {A}+\boldsymbol {T}$ equals that of $\boldsymbol {A}$ . Moreover, the fact that $\boldsymbol {T}$ is of finite rank allows us to formulate this perturbation problem rather concretely. To do so, we introduce a scale of Hilbert spaces associated with $\boldsymbol {A}$ . The definition of these finite rank singular form-bounded perturbations roughly follows that of [Reference Albeverio and Kurasov2]. See also [Reference Simon48, Reference Simon49].
Consider the nonnegative operator $|\boldsymbol {A}|=(\boldsymbol {A}^*\boldsymbol {A})^{1/2}$ , whose domain coincides with the domain of $\boldsymbol {A}$ . Alternatively, if $\boldsymbol {A}$ is bounded from below, the shifted operator $\boldsymbol {A}+kI$ , $k\in \mathbb {R}$ sufficiently large, will provide a nonnegative operator. We introduce a scale of Hilbert spaces.
Definition 2.12 [Reference Albeverio and Kurasov2, Section 1.2.2]
For $s\geq 0$ , define the space $\mathcal {H}_s(A)$ to consist of ${\varphi }\in \mathcal {H}$ for which the s-norm
is bounded. The space $\mathcal {H}_s(\boldsymbol {A})$ equipped with the norm $||\cdot ||_s$ is complete. The adjoint spaces, formed by taking the linear bounded functionals on $\mathcal {H}_s(\boldsymbol {A})$ , are used to define these spaces for negative indices, i.e., $\mathcal {H}_{-s}(\boldsymbol {A}):=\mathcal {H}_s^*(\boldsymbol {A})$ . The corresponding norm in the space $\mathcal {H}_ {-s}(\boldsymbol {A})$ is thus defined by (2.7) as well. The collection of these $\mathcal {H}_s(\boldsymbol {A})$ spaces will be called the scale of Hilbert spaces associated with the self-adjoint operator $\boldsymbol {A}$ .
It is not difficult to see that the spaces satisfy the nesting properties
and that for every two $s,t$ with $s<t$ , the space $\mathcal {H}_t(\boldsymbol {A})$ is dense in $\mathcal {H}_s(\boldsymbol {A})$ with respect to the norm $||\cdot ||_s$ . Indeed, the operator $(\boldsymbol {A}+1)^{t/2}$ defines an isometry from $\mathcal {H}_s(\boldsymbol {A})$ to $\mathcal {H}_{s-t}(\boldsymbol {A})$ . For $\psi \in \mathcal {H}_s(\boldsymbol {A})$ and ${\varphi }\in \mathcal {H}_{-s}(\boldsymbol {A})$ , we define the duality pairing
where the brackets on the right-hand side denote the inner product of $\mathcal {H}$ .
Throughout the literature of other fields, similar constructions occur under different names. For instance, the pairing of $\mathcal {H}_1(\boldsymbol {A})$ , $\mathcal {H}$ , and $\mathcal {H}_{-1}(\boldsymbol {A})$ is sometimes referred to as a Gelfand triple or rigged Hilbert space. Furthermore, when ${\boldsymbol {A}}$ is the derivative operator, these scales are simply Hilbert–Sobolev spaces. When $\boldsymbol {A}$ is a general differential operator, they are closely related. More details about Hilbert scales can be found in [Reference Krein and Petunin35].
It is worth noting that these Hilbert scales are related to those generated by so-called left-definite theory [Reference Littlejohn and Wellman42]. This theory employs powers of a semibounded self-adjoint differential operator to create a continuum of operators whereupon spectral properties can be studied. The theory can be applied to self-adjoint extensions of self-adjoint operators, which can be viewed as finite rank perturbations (see, e.g., [Reference Fleeman, Frymark and Liaw21, Reference Frymark and Liaw23] and the references therein).
Rank-one perturbations of a given operator ${\boldsymbol {A}}$ arise most commonly when the vectors ${\varphi }$ are bounded linear functionals on the domain of the operator $\boldsymbol {A}$ ; so, many applications are focused on $\mathcal {H}_{-2}(\boldsymbol {A})$ . Here, we discuss the case ${\varphi }\in \mathcal {H}_{-1}(\boldsymbol {A})$ for the sake of simplicity, the so-called form-bounded singular case. However, references usually contain information on extensions to ${\varphi }\in \mathcal {H}_{-2}(\boldsymbol {A})$ , and information on the case when ${\varphi }\notin \mathcal {H}_{-2}(\boldsymbol {A})$ can be found in [Reference Dijksma, Kurasov and Shondin15, Reference Kurasov37].
The formulation of finite rank perturbations also uses the Hilbert scale $\mathcal {H}_s({\boldsymbol {A}}).$ Our applications, again, fall into the category of form-bounded, i.e., from $\mathcal {H}_{-1}(\boldsymbol {A})$ , finite rank perturbations.Footnote 2 To define rank n form-bounded perturbations of a self-adjoint operator ${\boldsymbol {A}}$ on a Hilbert space $\mathcal {H}$ , consider a coordinate mapping $\mathbf {B}:\mathbb {C}^n\to \operatorname {\mathrm {ran}}(\mathbf {B})\subset \mathcal {H}_{-1}\left ({\boldsymbol {A}}\right )$ that acts via multiplication by the row vector
Formally, the mapping $\mathbf {B}^*:\operatorname {\mathrm {ran}}(\mathbf {B})\to \mathbb {C}^n$ acts by
We say formally, because the inner products occurring in $\mathbf {B}^*$ are not defined on all of $\operatorname {\mathrm {ran}}(\mathbf {B})$ . However, in accordance with the definition of $\mathcal {H}_{-1}\left ({\boldsymbol {A}}\right )$ , they do make sense as a duality pairing on the quadratic form space of the unperturbed operator ${\boldsymbol {A}}$ , which is all we require. Abusing notation slightly, we use the same notation $\mathbf { B}^*$ for the operator restricted to this form domain.
The quadratic form sense now gives rigorous meaning to the finite rank form-bounded singular perturbation
where $\Theta :{\mathbb C}^n\to {\mathbb C}^n$ is an $n\times n$ matrix; in Remark 3.7, we discuss the extension of this definition to the case when $\Theta $ is a linear relation.
3 Formulating singular perturbations
We apply the theory of boundary triples and boundary pair framework to set up a singular self-adjoint perturbation that can describe all self-adjoint extensions of a semibounded symmetric Sturm–Liouville operator with limit-circle endpoints. Afterward, in Section 3.3, we discuss how this setup could in principle be expanded to a wider range of semibounded symmetric operators.
3.1 One limit-circle endpoint
Let $\ell [\cdot ]$ be a Sturm–Liouville operator as in Definition 2.2 on the Hilbert space $\mathcal {H}=L^2[(a,b),w(x)]$ such that the minimal operator $\mathbf {L}_{\scriptstyle \mathrm {min}}$ is semibounded. Furthermore, assume that the endpoint a is in the limit-circle case and the endpoint b is in the limit-point case. Hence, one boundary condition must be imposed on the maximal domain in order to create a domain for a self-adjoint extension of the minimal operator $\mathbf {L}_{\scriptstyle \mathrm {min}}$ . In order to determine these extensions, it is necessary to first look at solutions.
Definition 3.1 [Reference Behrndt, Hassi and de Snoo7, Definition 6.10.3]
Let $(\mathbf {L}_{\scriptstyle \mathrm {max}}-\lambda _0)f=0$ with $\lambda _0\in \mathbb {R}$ be nonoscillatory at the endpoint a, and let u and v be real solutions of $(\mathbf {L}_{\scriptstyle \mathrm {max}}-\lambda _0)f=0$ . Then, u is said to be principal at a if $1/(pu^2)$ is not integrable at a and v is said to be nonprincipal at a if $1/(pv^2)$ is integrable at a, where p comes from the definition of $\ell $ in equation (2.2).
Let $u_a$ and $v_a$ be such a principal solution and a nonprincipal solution, respectively, corresponding to the expression $\ell $ and ${\lambda }_0\in \mathbb {R}$ at the endpoint a. A principal solution exists—due to the endpoint a being in the limit-circle case—and it is uniquely determined up to real multiples. Hence, the second linearly independent solution is automatically nonprincipal [Reference Behrndt, Hassi and de Snoo7, Corollary 6.10.5]. We also assume that these solutions are normalized in the sense that $[u_a,v_a](a)=1$ , where the sesquilinear form is defined via equation (2.4). These solutions define two operations on functions from the maximal domain via their action through the sesquilinear form. For $f\in \mathcal {D}_{\scriptstyle \mathrm {max}}$ , let
Note that the simplifications of the sesquilinear forms were only recently verified for limit-circle endpoints (see [Reference Gesztesy, Littlejohn and Nichols26, Theorem 4.5]), and that the operations are well defined due to Theorem 2.1. A description of all possible self-adjoint extensions of $\mathbf {L}_{\scriptstyle \mathrm {min}}$ is readily available in the literature (see, e.g., [Reference Gesztesy, Littlejohn and Nichols26, Theorem 3.11(ii)]), via a one-variable parameterization. The boundary condition must be of the form, for $0\leq {\vartheta }<\pi $ ,
The choice ${\vartheta }=0$ corresponds to the important Friedrich’s extension via [Reference Marletta and Zettl44, Theorem 13], whereas the choice ${\vartheta }=\pi /2$ is a Krein-type extension under certain conditions (see [Reference Behrndt, Hassi and de Snoo7, Section 5.4]). Define $\mathbf { L}_{{\vartheta }}$ to be the operator that acts via $\ell $ on the domain
The operations defined in equation (3.1) naturally form a boundary triple.
Proposition 3.1 [Reference Behrndt, Hassi and de Snoo7, Proposition 6.12.1]
Assume that the endpoint a is in the limit-circle case and that the endpoint b is in the limit-point case. Let $v_a$ be solution of the equation $(\mathbf {L}_{\scriptstyle \mathrm {max}}-{\lambda }_0)f=0$ , ${\lambda }_0\in \mathbb {R}$ , which is nonprincipal at a. Then, $\{\mathbb {C},\Gamma _0,\Gamma _1\}$ , where
is a boundary triple for $\mathcal {D}_{\scriptstyle \mathrm {max}}$ .
Let $u_b$ be a solution of $(\mathbf {L}_{\scriptstyle \mathrm {max}}-\lambda _0)f=0$ which is principal at b, where ${\lambda }_0$ is the same real number used in the choice of $u_a$ and $v_a$ above. We then choose $a_0$ and $b_0$ such that $v_a$ does not vanish on $(a,a_0)$ and $u_b$ does not vanish on $(b_0,b)$ and consider $[c,d]\subset (a,b)$ with
Let ${\varphi }$ be a real solution of $(\mathbf {L}_{\scriptstyle \mathrm {max}}-{\lambda }_0)f=0$ that is nonoscillatory at a and does not vanish on $(a,a_0)$ , such as $v_a$ above. We introduce the first-order differential operator that acts on functions $f\in AC(a,a_0)$ via
Note that when the potential $q=0$ in the definition of $\ell $ , it may be convenient to choose $\lambda _0=0$ and ${\varphi }=1$ so that $N_{{\varphi }}f=\sqrt {p}f'$ . Next, define the subset (closed under addition and scalar multiplication) of $\mathcal {H}$
By its definition, set $\mathfrak {D}$ contains those functions from the Hilbert space which have some regularity on the interior of the interval, and which are mapped “into $L^2$ near the endpoints” by the first-order operators $N_{v_a}$ and $N_{v_b}$ . Moreover, by [Reference Behrndt, Hassi and de Snoo7, Lemma 6.12.3], we have $\mathcal {D}_{\scriptstyle \mathrm {max}}\subset \mathfrak {D}$ .
For $f,g\in \mathfrak {D}$ , define the form
Observe that
and the form $\mathfrak {t}$ is a densely defined closed semibounded form in $\mathcal {H}$ by [Reference Behrndt, Hassi and de Snoo7, Corollary 6.12.2]. Hence, there is a semibounded self-adjoint operator corresponding to $\mathfrak {t}$ via the First Representation Theorem [Reference Behrndt, Hassi and de Snoo7, Theorem 5.1.18], say $\mathbf {S}_1$ . Since $\mathcal {D}_{\scriptstyle \mathrm {max}}\subset \mathfrak {D}$ , it follows that $\mathbf {S}_1$ is transversal with $\mathbf {L}_0$ , the Friedrichs extension of $\mathbf {L}_{\scriptstyle \mathrm {min}}$ [Reference Behrndt, Hassi and de Snoo7, Theorem 5.3.8]. We extend one of the mappings from the boundary triple in Proposition 3.1 to correspond to this larger domain.
Define $\Lambda :\mathfrak {D}\to \mathbb {C}$ as
The conditions that $f\in AC(a,a_0)$ and $N_{v_a}f\in L^2(a,c)$ from the definition of $\mathfrak {D}$ are enough to ensure that the operation $\Lambda $ is well defined on its domain [Reference Behrndt, Hassi and de Snoo7, Theorem 6.10.9(i)].
Lemma 3.2 [Reference Behrndt, Hassi and de Snoo7, Lemma 6.12.5]
Let $v_a$ and $u_b$ be solutions of $(\mathbf {L}_{\scriptstyle \mathrm {max}}-{\lambda }_0)f=0$ , ${\lambda }_0=0$ , which are nonprincipal at a and principal at b, respectively. Assume that the endpoint a is in the limit-circle case and the endpoint b is in the limit-point case, and let $\{\mathbb {C},\Gamma _0,\Gamma _1\}$ be the boundary triple defined in Proposition 3.1. Then, $\{\mathbb {C},\Lambda \}$ is a boundary pair for $\mathbf {L}_{\scriptstyle \mathrm {min}}$ corresponding to $\mathbf {S}_1$ which is compatible with the boundary triple $\{\mathbb {C},\Gamma _0,\Gamma _1\}$ . Moreover, one has
Notice that the lemma shows that $\mathfrak {t}$ is the semibounded closed form associated with the semibounded self-adjoint operator $\mathbf { L}_{\pi /2}$ , and thereby identifies $\mathbf {L}_{\pi /2}=\mathbf {S}_1$ .
It remains only to transcribe the above boundary pair into the language of perturbation theory, appealing to equation (3.2). Define the weighted distribution $\widetilde {\delta }_a$ that mimics the operation $\Lambda $ via
on the domain $\mathfrak {D}$ . The starting self-adjoint operator that will be perturbed is thus $\mathbf {L}_{\pi /2}$ and corresponds to $\boldsymbol {A}_0$ in the following theorem.
Theorem 3.3 Let ${\vartheta }\in [0,\pi /2)$ and ${\boldsymbol {\alpha }}=-\cot ({\vartheta })$ in order to define the operator $\boldsymbol {A}_{{\alpha }}=\mathbf { L}_{{\vartheta }}$ on the domain $\operatorname {\mathrm {dom}}(\boldsymbol {A}_{{\boldsymbol {\alpha }}})=\operatorname {\mathrm {dom}}(\mathbf {L}_{{\vartheta }})$ as in equation (3.3). Then, $\boldsymbol {A}_{{\boldsymbol {\alpha }}}$ can be written as the singular rank-one perturbation:
The case ${\boldsymbol {\alpha }}=\infty $ is understood to be $\mathbf {L}_0$ , because $\vartheta = 0$ . In particular, $\widetilde {\delta }_a\in \mathcal {H}_{-1}(\boldsymbol {A})$ and every self-adjoint extension of the minimal operator $\mathbf {L}_{\scriptstyle \mathrm { min}}$ can be written as $\boldsymbol {A}_{{\alpha }}$ for some ${\boldsymbol {\alpha }}$ .
Proof It is only necessary to show that $\widetilde {\delta }_a\in \mathcal {H}_{-1}(\boldsymbol {A}_0)$ and that every self-adjoint extension of $\mathbf { L}_{\scriptstyle \mathrm {min}}$ can be realized as $\boldsymbol {A}_{{\boldsymbol {\alpha }}}$ for some ${\boldsymbol {\alpha }}$ . The statement $\widetilde {\delta }_a\in \mathcal {H}_{-1}(\boldsymbol {A}_0)$ follows immediately from [Reference Behrndt, Hassi and de Snoo7, Lemma 6.12.4] and the fact that both $\Lambda $ and $\mathfrak {t}$ , the semibounded closed form associated with $\boldsymbol {A}_0$ , are defined on the same domain: $\mathfrak {D}$ . Equation (3.2) and [Reference Gesztesy, Littlejohn and Nichols26, Theorem 3.11(ii)] then show that every self-adjoint extension of $\mathbf {L}_{\scriptstyle \mathrm {min}}$ can be realized by equation (3.5) as ${\boldsymbol {\alpha }}$ varies.
Remark 3.4 We briefly comment that the case ${\boldsymbol {\alpha }}=\infty $ is rigorous in the sense of semibounded forms as it represents the linear relation $\{0\}\times \mathbb {C}$ , but this “infinite coupling” is not technically form-bounded. As this is the only relevant linear relation in this case, we postpone further discussion until Remark 3.7, where the situation is more complicated. The reader should also consult [Reference Albeverio and Kurasov2, Reference Gesztesy and Simon27] for more on infinite coupling in the context of perturbation theory.
3.2 Two limit-circle endpoints
The results of Section 3.1 can easily be extended to the case of Sturm–Liouville expressions $\ell $ on the Hilbert space $\mathcal {H}=L^2[(a,b),w(x)]$ such that the minimal operator $\mathbf {L}_{\scriptstyle \mathrm {min}}$ is semibounded and both endpoints are in the limit-circle case. Due to the close connection with the previous subsection, we recall only a few major differences in the setup and refer the reader to [Reference Behrndt, Hassi and de Snoo7, Section 6.11] for more details about creating a boundary pair compatible with the natural boundary triple.
Let $v_a$ and $v_b$ be solutions of $(\mathbf {L}_{\scriptstyle \mathrm {max}}-\lambda _0)f=0$ , ${\lambda }_0\in \mathbb {R}$ , which are nonprincipal at a and b, respectively. The natural boundary triple $\{\mathbb {C}^2,\Gamma _0,\Gamma _1\}$ for $\mathbf {L}_{\scriptstyle \mathrm {max}}$ defined by these solutions is given in [Reference Behrndt, Hassi and de Snoo7, Proposition 6.11.1] as
Let $a_0$ and $b_0$ be such that $v_a$ does not vanish on $(a,a_0)$ , $v_b$ does not vanish on $(b_0,b)$ and $c,d$ are as in equation (3.4). The closed semibounded form associated with the operator $\mathbf {L}_0$ (boundary condition $\Gamma _1f=0$ above) is then given by
for $f,g\in \mathfrak {D}$ , with
The mapping $\Gamma _0$ is thus extended as
and $\{\mathbb {C}^2,\Lambda \}$ is found to be a boundary pair for $\mathbf {L}_{\scriptstyle \mathrm {min}}$ compatible with the boundary triple $\{\mathbb {C}^2,\Gamma _0,\Gamma _1\}$ (see [Reference Behrndt, Hassi and de Snoo7, Lemma 6.11.5]).
Transcribing the above boundary pair into the language of perturbation theory is analogous to the previous subsection, but requires a more careful consideration of the involved mappings. Begin by defining the weighted distributions $\widetilde {\delta }_a$ and $\widetilde {\delta }_b$ that mimic the operation $\Lambda $ via
on the domain $\mathfrak {D}$ .
Identify the operator $\boldsymbol {A}_0$ as the self-adjoint extension of $\mathbf {L}_{\scriptstyle \mathrm {min}}$ that acts via $\ell [\cdot ]$ on the domain
with notation chosen so as to connect to the unperturbed operator in the perturbation setup. As in Section 3.1, we abuse notation and denote the other natural self-adjoint extension as
Remark 3.5 There is an ordering of the semibounded self-adjoint extensions of symmetric operators that is inherited from their form domains and plays an important role here. Namely, let $H_1$ and $H_2$ be two semibounded self-adjoint operators with lower bound a. Let $x<a$ . By [Reference Behrndt, Hassi and de Snoo7, Proposition 5.2.7], if $H_1\leq H_2$ , then
Of course, these domains are simply the domains of the semibounded closed forms associated with $H_1$ and $H_2$ , denoted $\mathfrak {t}_1$ and $\mathfrak {t}_2$ , respectively. The Friedrichs extension, $S_F$ , of a semibounded symmetric operator, S, is in this sense the self-adjoint extension with the smallest form domain; if H is another semibounded self-adjoint extension of S, then $H\leq S_F$ , or equivalently,
after an appropriate shift [Reference Behrndt, Hassi and de Snoo7, Proposition 5.3.6]. In our example, the Friedrichs extension of $\mathbf {L}_{\scriptstyle \mathrm {min}}$ is easily identified as $\boldsymbol {A}_{\infty }$ by [Reference Behrndt, Hassi and de Snoo7, Corollary 6.11.9].
Setting up the perturbation, we would like the form domain of the unperturbed operator to be as large as possible for two reasons: the perturbation will be singular form-bounded and all self-adjoint extensions can be defined from it. This self-adjoint extension with the largest form domain is transversal (see Definition 2.8) with the Friedrichs extension [Reference Behrndt, Hassi and de Snoo7, Corollary 5.3.9]. If H and $S_F$ are transversal, then every semibounded self-adjoint extension of S, $H'$ satisfies
for an appropriate shift of the operators. The operator $\boldsymbol {A}_0$ is such a transversal operator by construction in our example, and hence plays the role of the unperturbed operator. The effect of this choice of unperturbed operator can be seen later in Remark 3.7, where the form interpretations of the perturbation are all well defined.
The weighted distributions $\widetilde {\delta }_a$ and $\widetilde {\delta }_b$ are immediately seen to be in $\mathcal {H}_{-1}(\boldsymbol {A}_0)$ by [Reference Behrndt, Hassi and de Snoo7, Theorem 6.11.4].
Leaning on our discussion in Section 2.3, we define a coordinate mapping $\mathbf {B}:\mathbb {C}^2\to \operatorname {\mathrm {ran}}(\mathbf { B})\subset \mathcal {H}_{-1}\left ({\boldsymbol {A}}_0\right )$ to act via multiplication by the row vector $ \begin {pmatrix} \widetilde {\delta }_a, \widetilde {\delta }_b \end {pmatrix}. $ A simple calculation yields that on $\mathcal {H}_1(\boldsymbol {A}_0)$ the operator $\mathbf {B}^*$ is given by
For more general information on the role these operations play in the finite rank perturbation theory of self-adjoint operators, please consult, e.g., [Reference Frymark and Liaw24, Reference Liaw and Treil41].
Theorem 3.6 Let $\Theta $ be a self-adjoint linear relation in $\mathbb {C}^2$ . Define $\boldsymbol {A}_{ {}_{\scriptstyle \Theta }}$ as the singular rank-two perturbation:
Then, every self-adjoint extension of the minimal operator $\mathbf {L}_{\scriptstyle \mathrm {min}}$ can be written as $\boldsymbol {A}_{ {}_{\scriptstyle \Theta }}$ for some $\Theta $ .
Remark 3.7 We note that the perturbation in equation (3.10) should be interpreted in terms of semibounded forms for the relevant self-adjoint extensions; this provides a rigorous interpretation of the many ways in which “infinite coupling” can occur in this perturbation setup as expressed by linear relations. In order for the perturbation to be well defined, some care should be taken so that $\operatorname {\mathrm {ran}}(\mathbf {B}^*)\subset \operatorname {\mathrm {dom}}(\Theta )$ . In particular, the closed semibounded form $\mathfrak {t} _{ {}_{\scriptstyle \Theta }}$ in $\mathcal {H}$ is given by
and can be expressed in terms of $\mathfrak {t}$ and $\Lambda $ from equations (3.7) and (3.9), respectively, via [Reference Behrndt, Hassi and de Snoo7, Theorem 6.11.6]:
-
(i) If $\Theta $ is a $2\times 2$ symmetric matrix, then
$$ \begin{align*} \mathfrak{t} _{ {}_{\scriptstyle \Theta}}[f,g]=\mathfrak{t}[f,g]+\langle\Theta\Lambda f,\Lambda g\rangle ,~~\operatorname{\mathrm{dom}}(\mathfrak{t}_{ {}_{\scriptstyle \Theta}})=\mathfrak{D}. \end{align*} $$ -
(ii) If $\Theta =\Theta _{\scriptstyle \mathrm {op}}\widehat {\oplus }\Theta _{\scriptstyle \mathrm {mul}}$ (componentwise direct sum) with respect to the decomposition $\mathbb {C}^2=\operatorname {\mathrm {dom}}(\Theta _{\scriptstyle \mathrm {op}})+\operatorname {\mathrm {mul}}(\Theta )$ and $\dim \operatorname {\mathrm {dom}}(\Theta _{\scriptstyle \mathrm {op}})=1$ , then
$$ \begin{align*} \mathfrak{t} _{ {}_{\scriptstyle \Theta}}[f,g]=\mathfrak{t}[f,g]+\langle\Theta_{\scriptstyle \mathrm{op}}\Lambda f,\Lambda g\rangle ,~~\operatorname{\mathrm{dom}}(\mathfrak{t}_{ {}_{\scriptstyle \Theta}})=\left\{h\in\mathfrak{D}~:~\Lambda h\in\operatorname{\mathrm{dom}}(\Theta_{\scriptstyle \mathrm{ op}})\right\}\!. \end{align*} $$ -
(iii) If $\Theta =\{0\}\times \mathbb {C}^2$ , then $\boldsymbol {A}_{ {}_{\scriptstyle \Theta }}=\boldsymbol {A}_{\infty }$ coincides with the Friedrichs extension of $\mathbf {L}_{\scriptstyle \mathrm {min}}$ and
$$ \begin{align*} \mathfrak{t} _{ {}_{\scriptstyle \Theta}}[f,g]=\mathfrak{t}[f,g],~~\operatorname{\mathrm{dom}}(\mathfrak{t}_{ {}_{\scriptstyle \Theta}})=\left\{h\in\mathfrak{D}~:~\Lambda h=0\right\}\!. \end{align*} $$
This interpretation also adds additional clarity to the regular endpoint example in equation (1.2) and the special case of Theorem 3.3 when ${\boldsymbol {\alpha }}=\infty $ .
Proof The boundary triple given in equation (3.6) immediately implies that all self-adjoint extensions of $\mathbf {L}_{\scriptstyle \mathrm { min}}$ are in a one-to-one correspondence with self-adjoint relations in $\mathbb {C}^2$ via
The densely defined closed semibounded form $\mathfrak {t}_{ {}_{\scriptstyle \Theta }}$ corresponding to $\boldsymbol {A}_{ {}_{\scriptstyle \Theta }}$ in equation (3.11) also exactly matches the action of the perturbation in equation (3.10).
3.3 General perturbation setups with boundary triples/pairs
The current manuscript deals with semibounded Sturm–Liouville differential operators, but there are only a few impediments to achieving a perturbation characterization like that of Theorem 3.6 for other classes of closed symmetric semibounded relations. In practice, these obstacles may be very difficult to overcome in different settings, but we outline the theory here anyway.
Let S be a closed symmetric semibounded relation in a Hilbert space $\mathcal {H}$ , and define a boundary triple $\left \{\mathcal {G},\Gamma _0,\Gamma _1\right \}$ for $S^*$ . Denote $A_0=\ker \Gamma _0$ and $A_1=\ker \Gamma _1$ . Let $S_1$ be a semibounded self-adjoint extension of S such that $S_1$ and the Friedrichs extension of S, $S_F$ , are transversal (see Remark 3.5 for more information).
If $\Lambda \in \mathbf {B}(\mathcal {H}_{ {}_{\scriptstyle \mathfrak {t}_{S_1-\gamma }}},\mathcal {G})$ , $\gamma <m(S_1)$ , is an extension of $\Gamma _0$ , $\operatorname {\mathrm {dom}}\mathfrak {t}_{ {}_{\scriptstyle S_F}}=\ker \Lambda $ , and $A_1=S_1$ , then $A_0=S_F$ and the boundary triple $\left \{\mathcal {G},\Gamma _0,\Gamma _1\right \}$ is compatible with the boundary pair $\left \{\mathcal {G},\Lambda \right \}$ by [Reference Behrndt, Hassi and de Snoo7, Lemma 5.6.5]. The identification of self-adjoint extensions of S and self-adjoint relations $\Theta $ in $\mathcal {G}$ via
thus immediately leads to a potential perturbation characterization like that of Theorem 3.6.
The first obstacle to such a characterization is explicitly determining the Friedrichs extension of the symmetric operator, but this has been accomplished for various operators. For instance, let $\Omega \subset \mathbb {R}^n$ be a bounded $C^2$ domain and $\boldsymbol {A}_D$ the self-adjoint Dirichlet realization of $-\Delta +V$ in $L^2(\Omega )$ , with $V\geq 0$ . Then, [Reference Behrndt, Hassi and de Snoo7, Corollary 8.5.4] shows that $\boldsymbol {A}_D$ is the Friedrichs extension of the associated minimal domain. Additionally, a way of obtaining the Friedrichs extension of powers of Sturm–Liouville operators is thought to lie with the so-called general left-definite theory from [Reference Littlejohn and Wellman42].
The second obstacle is showing that $\Lambda \in \mathbf {B}(\mathcal {H}_{ {}_{\scriptstyle \mathfrak {t}_{S_1-\gamma }}},\mathcal {G})$ , which means that $\Lambda \in \mathcal {H}_{-1}(S_1)$ . Fortunately, [Reference Behrndt, Hassi and de Snoo7, Theorem 5.6.6] says that if it is known that $\ker \Gamma _0=S_F$ and $\ker \Gamma _1=S_1$ , then $\Lambda \in \mathbf {B}(\mathcal {H}_{ {}_{\scriptstyle \mathfrak {t}_{S_1-\gamma }}},\mathcal {G})$ . Hence, if the boundary triple is carefully set up and we know the Friedrichs extension, then this obstacle is easily overcome. It may occasionally be of practical importance to determine the form $\mathfrak {t}_{S_1-\gamma }$ explicitly, but this is a separate problem.
Finally, the last obstacle is a bit more esoteric in the sense that such a characterization may not be useful unless there are sufficient perturbation-theoretic techniques to take advantage of. Sections 2.3, 4.4, and 4.5 provide references and spectral results that prove to be helpful when the deficiency indices of the relation S are finite and a finite rank perturbation is formulated. On the other hand, it is also convenient to work with Weyl m-functions which come straight from the boundary triple to gain spectral information. We pursue this path in most of Section 4 (from the beginning to Corollary 4.12).
4 Jacobi differential operator with varying boundary conditions
4.1 Boundary triple and boundary pair construction
Let $0<{\alpha },\beta <1$ , and consider the classical Jacobi differential expression given by
on the maximal domain
where the Hilbert space
We denote the inner product on this space by $\langle \cdot , \cdot \rangle _{\alpha , \beta }$ . This maximal domain defines the associated minimal domain given in Definition 2.2 with defect indices $(2,2)$ . The operator $\mathbf {L}_{\scriptstyle \mathrm {min}}$ will be used to denote the expression (4.1) acting on the minimal domain. The specified values of ${\alpha },\beta $ will ensure that the differential expression is in the limit-circle nonoscillating case at both endpoints, and so are assumed throughout. If either parameter is outside of the range, it changes the endpoint classification and results will need to be slightly modified accordingly. It is worth pointing out that the case ${\alpha }=\beta =0$ describes the Legendre differential equation.
The Jacobi polynomials $P_n^{({\alpha },\beta )}(x)$ , $n\in \mathbb {N}_0$ , form a complete orthogonal set in $L^2_{{\alpha },\beta }(-1,1)$ for which $P_n^{({\alpha },\beta )}(x)$ solves the eigenvalue equation of the symmetric expression given in equation (4.1), that is,
for each $n\in \mathbb {N}_0$ . Both solutions to the equation $\ell _{{\alpha },\beta }[f]=\lambda f$ , ${\lambda }\in \mathbb {C}$ , are easily given in terms of hypergeometric functions, with the special case ${\lambda }=0$ given by $\widetilde {w}_1$ and $\widetilde {w}_2$ below.
The goal of the current subsection is to formulate all self-adjoint extensions of $\mathbf {L}_{\scriptstyle \mathrm {min}}$ via a rank-two perturbation, which is accomplished in Theorem 4.2. In order to make this characterization rigorous, some of the material in this subsection may feel heavy in explicit expressions. Readers who are mainly interested in the spectral results may prefer to skip to Theorem 4.2, and return to the contents in this subsection later.
Define the Gauss hypergeometric series (or function) as
where $(x)^k=x(x+1)(x+2)\cdots (x+k-1)$ denotes the Pochhammer function, or rising factorial, on the disk $|z|<1$ . Notice that this notation is usually written as a subscript in the hypergeometric community. Define
The associated sesquilinear form is defined, for $f,g\in \mathcal {D}_{\scriptstyle \mathrm {max}}^{({\alpha },\beta )}$ , via equation (2.3). Integration by parts easily yields the explicit expression
Note that the dependence of the sesquilinear form on the parameters ${\alpha }$ and $\beta $ is suppressed in the definition for the sake of notation. Theorem 2.1 also says that the sesquilinear form is both well defined and finite for all $f,g\in \mathcal {D}_{\scriptstyle \mathrm {max}}^{({\alpha },\beta )}$ .
Two operations are then generated via these particular solutions:
Explicitly, we have
If these operations are used to define two mappings
then it can easily be shown that $\{\mathbb {C}^2,\Gamma _0,\Gamma _1\}$ is a boundary triple for $\mathcal {D}^{({\alpha },\beta )}_{\scriptstyle \mathrm {max}}$ , as in [Reference Frymark22]. Additionally, the self-adjoint extension $\boldsymbol {A}_0$ will be chosen to correspond to the notation of Section 3.2 and have the domain
The slight abuse of notation here, with ${\boldsymbol {A}}_0$ chosen to represent the boundary conditions given by $\ker (\Gamma _1)$ , seems counterintuitive given the boundary triple but is a natural choice as ${\boldsymbol {A}}_0$ will act as the unperturbed operator in our additive perturbation. Accordingly, another notation (albeit slightly abused, see Section 1.2) must be used to denote the self-adjoint operator with the boundary conditions given by $\ker (\Gamma _0)$ , also inspired by perturbation theory:
The explicit matrix-valued Weyl m-functions for these two self-adjoint extensions can be written with the help of some special functions:
and our spectral parameter ${\lambda }$ is now given via the formula
due to the natural breakdown given in equation (4.3). Note that this differs from [Reference Frymark22, Section 3] only by substituting $\mu =-z-1$ , which will be useful in the proof of Theorem 4.6. Otherwise, the functions $c_j$ , for $j=1,\dots ,4$ , are the same as in [Reference Frymark22]. The Weyl m-function associated with the extension ${\boldsymbol {A}}_{\infty }$ can be written as
and is naturally associated with the boundary triple $\{\mathbb {C}^2,\Gamma _0,\Gamma _1\}$ for $\mathcal {D}^{({\alpha },\beta )}_{\scriptstyle \mathrm {max}}$ . Likewise, the Weyl m-function associated with the extension ${\boldsymbol {A}}_0$ is
Remark 4.1 The function $M_0$ is determined by the choice of $\Gamma _0$ . As we will see in Theorem 4.4, the operations of $\Gamma _0$ fix the cyclic functions that define the spectral measure of our unperturbed operator ${\boldsymbol {A}}_0$ .
Interested readers should consult [Reference Frymark22, Section 3] for additional details concerning the underlying solutions $\widetilde {w}_1$ and $\widetilde {w}_2$ , the implemented boundary triples, and the associated Weyl m-functions.
It is worth pointing out that these two m-functions can be obtained from one another by switching the operations $\Gamma _0$ and $\Gamma _1$ . The effect of this switch is described in [Reference Behrndt, Hassi and de Snoo7, Corollary 2.5.4]; explicitly, we have $M_0=-(M_{\infty })^{-1}$ and $M_{\infty }=-(M_0)^{-1}$ . In a general setup such as that of Section 3.2, the Weyl m-functions are $2\times 2$ matrices, so if one m-function is known, it is usually not too difficult to compute the other. This will be of great help later.
Notice that the boundary triple $\{\mathbb {C}^2,\Gamma _0,\Gamma _1\}$ for $\mathcal {D}^{({\alpha },\beta )}_{\scriptstyle \mathrm {max}}$ given in equation (4.4) already matches that of Section 3.2. Recall the standard conditions on principal and nonprincipal solutions $u_a$ and $v_a$ near a limit-circle endpoint a, respectively, from Section 3.2 and the normalizations already imposed on our functions. Given this, the statement in [Reference Behrndt, Hassi and de Snoo7, Lemma 6.10.10] says that, for $f\in \mathcal {D}_{\scriptstyle \mathrm { max}}^{({\alpha },\beta )}$ , we have
and holds analogously at the other limit-circle endpoint b. This extra minus sign is already spoken for in the definition of $\widetilde {w}_1$ in the current example.
Next, we set up a quadratic form $\mathfrak {t}$ (see equation (3.7)) which will act on a domain $\mathfrak {D}$ (see equation (3.8)). The function $\widetilde {w}_2$ is necessarily broken down into its actions near each endpoint as $\widetilde {w}_{2(-1)}$ and $\widetilde {w}_{2(1)}$ . Fortunately, these functions are flexible in the sense that choosing values c and d, in accordance with (3.4), can be anything in our example. Additionally, $\widetilde {w}_2$ was chosen specifically so that ${\lambda }_0=0$ . We arbitrarily let $c=-3/4$ and $d=3/4$ and notice that
where $p(x)$ is from equation (4.1) in accordance with Definition 2.1. Then, for all $f\in AC(-1,-3/4]$ , we calculate that
and, for $f\in AC[3/4,1)$ ,
A closed semibounded form is thus defined as
for $f,g\in \mathfrak {D}$ , where
The mapping $\Gamma _0$ can now be extended to the form domain $\mathfrak {D}$
so that $\{\mathbb {C}^2,\Lambda \}$ is found to be a boundary pair for $\mathbf {L}_{\scriptstyle \mathrm {min}}$ compatible with the boundary triple $\{\mathbb {C}^2,\Gamma _0,\Gamma _1\}$ given in equation (4.4).
Finally, the explicit perturbation problem can be set up using this boundary pair structure. Define two weighted distributions that mimic the operation $\Lambda $ via
on the domain $\mathfrak {D}$ . The inner product is always assumed to be that of the Hilbert space $L^2_{{\alpha },\beta }(-1,1)$ (see equation (4.2)). These distributions are immediately seen to be in $\mathcal {H}_{-1}(\boldsymbol {A}_0)$ by [Reference Behrndt, Hassi and de Snoo7, Theorem 6.11.4]. In accordance with Section 3.2, a coordinate mapping $\mathbf { B}:\mathbb {C}^2\to \operatorname {\mathrm {ran}}(\mathbf {B})\subset \mathcal {H}_{-1}\left (\mathbf {A}_0\right )$ can be defined to act via multiplication by the row vector
Its adjoint $\mathbf {B}^*:\operatorname {\mathrm {ran}}(\mathbf {B})\to \mathbb {C}^2$ is given by
Note that although the inner product in these operations come from $L^2_{{\alpha },\beta }(-1,1)$ , the function f is naturally restricted to the domain $\mathcal {H}_1(\boldsymbol {A}_0)$ , as was discussed in Remark 3.7.
Theorem 4.2 Let $\Theta $ be a self-adjoint linear relation in $\mathbb {C}^2$ , and let ${\boldsymbol {A}}_0$ be the self-adjoint operator given in equation ( 4.5 ). Define ${\boldsymbol {A}}_{ {}_{\scriptstyle \Theta }}$ as the singular rank-two perturbation:
Then, every self-adjoint extension of the minimal operator $\mathbf {L}_{\scriptstyle \mathrm {min}}$ that acts via equation ( 4.1 ) can be written as ${\boldsymbol {A}}_{ {}_{\scriptstyle \Theta }}$ for some $\Theta $ . In particular,
Proof The theorem follows as a direct application of Theorem 3.6 to the Jacobi differential operator, with the setup given in the preceding discussion.
4.2 Spectral theory of the unperturbed operator $\boldsymbol {A}_0$
Realizing all self-adjoint extensions of the Jacobi differential operator through this perturbation setup allows for the tools of self-adjoint perturbation theory to be implemented. Next, we focus our attention on the matrix-valued spectral properties for the unperturbed operator ${\boldsymbol {A}}_0$ so that this perturbation theory can be carried out.
The spectral properties of ${\boldsymbol {A}}_0$ can be determined by analyzing $M_0$ . Sturm–Liouville operators with two limit-circle endpoints have only discrete spectrum (see, e.g., [Reference Bailey, Everitt and Zettl6, Reference Weyl54, Reference Zettl55]), which are given by first-order poles of the m-function [Reference Behrndt, Hassi and de Snoo7, Theorem 3.6.1(iv)]. Explicitly, we will apply Theorem 4.4.
The functions $c_j$ , for $j=1,\dots ,4$ , given in (4.6), can be analytically continued to wherever the Gamma functions do not have poles. By the restrictions of the parameters $\alpha , \beta \in (0,1)$ , and the fact that the Gamma function does not have roots, we observe that all eigenvalues of ${\boldsymbol {A}}_0$ are caused by the poles of the Gamma function $\Gamma (-z)$ in the denominator of $c_3$ . Of course, it is well known that the Gamma function has simple poles at the nonpositive integers; when $z=n$ , $n\in \mathbb {N}_0$ . By (4.7), the specific relation providing us with the location of the eigenvalues is
The functions $c_1$ and $c_4$ can be further simplified in this special case.
Lemma 4.3 For $z=n \in {\mathbb N}_0$ , the functions $c_1(n)$ and $c_4(n)$ reduce to
Proof Using the recursive definition of the gamma function $\Gamma (z+1) = z\Gamma (z)$ , we can simplify the factors of
from (4.6) for $z=n \in {\mathbb N}_0$ . First, we examine the factors containing ${\alpha }$ ,
Then, those containing $\beta $ ,
Substituting these two into the expression for $c_4$ yields the desired result
The computation for $c_1 (n)$ for $n \in {\mathbb N}_{0}$ follows from simply making identical substitutions with the roles of ${\alpha }$ and $\beta $ swapped, so we omit it.
For a matrix-valued Borel measure $\mu : \mathcal {B}\to {\mathbb C}^{d\times d}_+$ (where $\mathcal {B}$ and ${\mathbb C}^{d\times d}_+$ denote the Borel sigma algebra of ${\mathbb R}$ and the set of positive definite $d\times d$ matrices, respectively), the space $L^2(\mu )$ arises from the inner product:
The following well-known result will allow us to extract spectral information from the Weyl m-function.
Theorem 4.4 [Reference Gesztesy and Tsekanovskii28, Theorem 5.5(i)]
For a matrix-valued Herglotz function with representation
we have
Moreover, recall that a set $\{u_1,\ldots ,u_k\}\subset \mathcal {H}_{-1}(\boldsymbol {A}_0)$ is called cyclic for operator $\boldsymbol {A}_0$ , if
For now, let $\mu $ be the matrix-valued spectral measure of $\boldsymbol {A}_0$ with respect to the cyclic set $\{\widetilde {\delta }_{1}, \widetilde {\delta }_{-1}\}$ . Via Theorem 4.4 and Remark 4.1, the measure $\mu $ (with this cyclic set) corresponds to the Weyl m-function stated in equation (4.9). The point spectrum of a self-adjoint operator, e.g., $\boldsymbol {T}$ , will be denoted by $\sigma _p(\boldsymbol {T})$ . While a detailed analysis of the perturbed operator ${\boldsymbol {A}}_{\Theta }$ through its m-function $M_{\Theta }$ will start in Section 4.3, it is necessary to introduce the following proposition which will be used to determine the multiplicity of eigenvalues even in this unperturbed setting.
Proposition 4.5 [Reference Derkach and Malamud14, Proposition 1]
Let $\{\mathbb {C}^2,\Gamma _0,\Gamma _1\}$ be the boundary triple defined above, let ${\vartheta }$ be a Hermitian $2 \times 2$ matrix, and let $z\in \rho ({\boldsymbol {A}}_{\infty })$ . Then,
and $\dim \operatorname {\mathrm {\ker}}({\boldsymbol {A}}_{\Theta }-z)=\dim \operatorname {\mathrm {\ker}}(\Theta -M_{\infty })$ .
The point masses of the matrix-valued weight of $\mu $ and multiplicity of eigenvalues for $\boldsymbol {A}_0$ can then be determined using the m-function from Theorem 4.4 and Proposition 4.5. We will abuse notation slightly by not taking limits with respect to ${\lambda }=(-z-1)(-z+{\alpha }+\beta )$ but with respect to the corresponding z within the calculation, as this is how $M_0$ is defined.
Theorem 4.6 The matrix-valued weights of the point masses are
In particular, we verify the well-known fact that the multiplicity of each eigenvalue is one.
Proof By Theorem 4.4, these matrix-valued weights are obtained by evaluating the limit of $\varepsilon M(\lambda +i\varepsilon )$ as $\varepsilon \searrow 0$ . The relations (4.7), (4.12), and (4.14), together with the analyticity of the Gamma function away from its poles, yield
Using the well-known residues of the Gamma function, we obtain
proving the formula for the point masses.
It remains to show that the multiplicity of each eigenvalue equals one. Proposition 4.5 says that this is the case only if $\dim \operatorname {\mathrm {Ker}}(M_{\infty }({\lambda }_n))=1$ for each eigenvalue ${\lambda }_n$ . It is enough to show that $M_{\infty }$ does not have rank zero or, equivalently, that it does not have nullity 2. Upon inspection, this occurs if and only if $1/c_2(n)=0$ , which is impossible because the Gamma function has no zeros. We conclude that the multiplicity of each eigenvalue must be one.
4.3 Perturbation theory for the Jacobi differential operator
The spectral measure of the unperturbed operator ${\boldsymbol {A}}_0$ is now concrete thanks to Theorem 4.6 and it is possible to explore spectral properties of other self-adjoint extensions. This will be done with a combination of techniques arising from the theory of boundary triples and finite rank self-adjoint perturbation theory.
As we have seen in the previous subsection, the main tool for analyzing spectral properties through the theory of boundary triples is the Weyl m-function. The linear relation $\Theta $ from Theorem 4.2, which identifies a self-adjoint extension, can also be used to calculate the m-function for the extension. In particular, if $\Theta $ is a $2\times 2$ Hermitian matrix, [Reference Behrndt, Hassi and de Snoo7, equation (3.8.7)] says that, for ${\lambda }\in \rho ({\boldsymbol {A}}_{\infty })\cap \rho ({\boldsymbol {A}}_{ {}_{\scriptstyle \Theta }})$ , we have
Let $\Theta $ be a $2\times 2$ Hermitian matrix, and write
The use of equation (4.8) in (4.15) yields
Many commonly studied special cases of boundary conditions, such as separated and periodic, fall into the category where $\Theta $ is a $2\times 2$ Hermitian matrix (see, e.g., [Reference Behrndt, Hassi and de Snoo7, Example 6.3.6] and [Reference Frymark22, Section 5.1] for some examples).
Remark 4.7 It is worth briefly describing how $\Theta $ here corresponds to the perturbation in Theorem 4.2 and clarifying some points of possible confusion. The theory of boundary triples has a natural self-adjoint extension whose domain is given by functions in the intersection of the maximal domain and the $\ker (\Gamma _0)$ . This extension is ${\boldsymbol {A}}_{\infty }$ in our example. Hence, $M_{\infty }$ is the starting point of many formulas that come from boundary triples. In contrast, the unperturbed operator in our construction is ${\boldsymbol {A}}_0$ , necessary by Remark 3.5, so we would expect to start with $M_0$ to determine spectral properties. In this sense, perturbation theory and boundary triples are starting with “opposite” extensions, so to speak.
Fortunately, this discrepancy is not a problem. A quick sanity check uncovers that by plugging $\Theta =\left \{\mathbb {C}^2,0\right \}$ , the $0$ matrix, into equation (4.15), the m-function should be $-M_{\infty }^{-1}=M_0$ . Plugging this $\Theta $ into equation (4.10) for our perturbation also yields ${\boldsymbol {A}}_0$ .
For linear relations that are not $2\times 2$ matrices, there is a similar relationship that can be exploited. If $\Theta $ is a self-adjoint relation in $\mathbb {C}^2$ , then there exist $2\times 2$ matrices $\mathcal {A}$ and $\mathcal {B}$ such that
and
by [Reference Behrndt, Hassi and de Snoo7, Corollary. 1.10.9]. The domain of the self-adjoint extension ${\boldsymbol {A}}_{ {}_{\scriptstyle \Theta }}$ given in equation (4.11) then has the special form
Finally, a general analog of equation (4.15) can be written for linear relations (see [Reference Behrndt, Hassi and de Snoo7, equation (3.8.4)]:
The case where $\Theta $ is a linear relation is unfortunately not covered in the following theorems due to the calculational complexity introduced by the above formulas. Indeed, the case where $\Theta $ is a Hermitian $2 \times 2$ matrix is much more applicable. If there arises a situation where the spectrum of an extension is needed for a particular linear relation, then the following results can still serve as a road map for extracting such information.
An eigenvalue ${\lambda }_n^{\Theta }\in \rho ({\boldsymbol {A}}_{\infty })$ of $\boldsymbol {A}_{ {}_{\scriptstyle \Theta }}$ is said to be degenerate if for the corresponding $z_n\in \mathbb {R}$ such that ${\lambda }_n^{\Theta }=(-z_n-1)(-z_n+{\alpha }+\beta )$ we have
Note that, for ${\lambda }_n^{\Theta }\in \rho ({\boldsymbol {A}}_{\infty })$ , these values are all well defined thanks to $1/c_2(z_n)$ not having a pole and that each of the two possible choices of $z_n$ necessarily yields the same ${\lambda }_n$ . Degenerate eigenvalues thus correspond to the case where $\Theta -M_{\infty }$ results in the zero matrix. An operator ${\boldsymbol {A}}_{ {}_{\scriptstyle \Theta }}$ is called nondegenerate if all of its eigenvalues are nondegenerate.
Corollary 4.8 If $\Theta $ is a $2\times 2$ Hermitian matrix and ${\lambda }_n^{\Theta }$ is a degenerate eigenvalue of $\boldsymbol {A}_{ {}_{\scriptstyle \Theta }}$ , then it has multiplicity two.
Proof The result immediately follows from Proposition 4.5 and the above observation that degenerate eigenvalues correspond to the case where $\Theta -M_{\infty }$ is the zero matrix.
When eigenvalues are nondegenerate, we now show that their multiplicity must be one, sharpening results mentioned in [Reference Zettl55, Chapter 10.11].
Theorem 4.9 If $\Theta $ is a $2\times 2$ Hermitian matrix or $\Theta =\{0\}\times \mathbb {C}^2$ , then nondegenerate eigenvalues of the self-adjoint extension $\boldsymbol {A}_{ {}_{\scriptstyle \Theta }}$ defined by equation ( 4.10 ) have multiplicity one.
Proof Any self-adjoint extension of $\mathbf {L}_{\scriptstyle \mathrm {min}}$ is given by a self-adjoint relation $\Theta $ in $\mathbb {C}^2$ via Theorem 3.6. We cover the two distinct cases in the hypotheses of the theorem separately:
-
(i) $\Theta $ is a $2\times 2$ Hermitian matrix or
-
(ii) $\Theta =\{0\}\times \mathbb {C}^2$ .
Case $(i)$ : The standard formula for the inverse of a $2\times 2$ matrix yields the following simplification of equation (4.16):
where ${\lambda }= (-z-1)(-z+{\alpha }+\beta )$ by equation (4.7),
and
By Theorem 4.4, the operator $\boldsymbol {A}_{ {}_{\scriptstyle \Theta }}$ has eigenvalues at those ${\lambda }$ that are poles of $M_{ {}_{\scriptstyle \Theta }}({\lambda })$ .
Now, recall the definitions of $c_1(z)$ , $c_2(z)$ and $c_4(z)$ and that the Gamma function does not have roots on the real line. Furthermore, observe that at the poles of the Gamma functions that arise in the denominators of $c_1(z)$ , $c_2(z)$ , and $c_4(z)$ , the expression of $M_{ {}_{\scriptstyle \Theta }}({\lambda })$ remains bounded.
Therefore, the poles of $M_{ {}_{\scriptstyle \Theta }}({\lambda })$ can only occur at those points where the denominator of $K(z)$ vanishes:
This denominator is just $\det L(z)$ , so we conclude that the rank of $\Theta -M_{\infty }(\lambda )$ is one for a ${\lambda }$ whose corresponding value of z satisfies equation (4.19). The eigenvalue ${\lambda }$ was also assumed to be nondegenerate, so the nullity of $\Theta -M_{\infty }(\lambda )$ is thus one as well. Proposition 4.5 then says that the multiplicity of ${\lambda }$ is one. All eigenvalues are discrete because the operator has two limit-circle endpoints [Reference Bailey, Everitt and Zettl6, Reference Zettl55].
Case $(ii)$ : This occurs when $\boldsymbol {A}_{ {}_{\scriptstyle \Theta }}$ is the Friedrichs extension of $\mathbf {L}_{\scriptstyle \mathrm {min}}$ , as the form domain of this self-adjoint extension (see Remark 3.7) is given by
The linear relation $\Theta $ thus chooses the transversal extension to the starting operator $\boldsymbol {A}_0$ due to the way our boundary pair was defined.
While the previous theorem limits the multiplicity of nondegenerate eigenvalues, it does not say anything about the frequency with which degenerate eigenvalues occur. The following corollary sheds some light on this question.
Corollary 4.10 Let ${\lambda }\in (-{\alpha }-\beta ,\infty )\backslash [\sigma ({\boldsymbol {A}}_0)\cup \sigma ({\boldsymbol {A}}_{\infty })]$ . Then, ${\lambda }$ is a degenerate eigenvalue for some ${\boldsymbol {A}}_{ {}_{\scriptstyle \Theta }}$ where $\Theta $ is a Hermitian $2\times 2$ matrix.
Proof The lowest eigenvalue of ${\boldsymbol {A}}_0$ is $-{\alpha }-\beta $ , and it is easy to see that any ${\lambda }>-{\alpha }-\beta $ corresponds to two values of z via the relationship ${\lambda }=(-z-1)(-z+{\alpha }+\beta )$ . For ${\lambda }\in \rho ({\boldsymbol {A}}_0)\cup \rho ({\boldsymbol {A}}_{\infty })$ , it suffices to take one such value of z and generate a Hermitian matrix $\Theta $ via the equations in (4.18). Choosing such a $\Theta $ will yield an ${\boldsymbol {A}}_{ {}_{\scriptstyle \Theta }}$ with ${\lambda }$ as a degenerate eigenvalue by definition.
Of course, not every choice of $\Theta $ yields a degenerate eigenvalue ${\lambda }$ ; there are many ways to satisfy equation (4.19).
4.4 Eigenfunctions and the spaces $L^2(\mu ^{\Theta })$
Let us first consider the unperturbed operator $\boldsymbol {A}_0$ and its spectral representation given by the operator $\boldsymbol {M}$ on $L^2(\mu )$ (the operator that multiplies by the independent variable $\boldsymbol {M}:L^2(\mu )\to L^2(\mu )$ with $\boldsymbol {M}f(x) = xf(x)$ ). The magnitude of the eigenvector does not matter, so we simply dropped the multiplicative constant from Theorem 4.6 and normalized the first component to equal one.
Corollary 4.11 The eigenvector of $\boldsymbol {A}_0$ corresponding to eigenvalue ${\lambda }_n$ takes the following form in the spectral representation:
where $\chi _{ {}_{\scriptstyle \{\lambda _n\}}}$ denotes the characteristic function at $x={\lambda }_n$ . We obtain an explicit formulation of the space $L^2(\mu )=\operatorname {\mathrm {clos}} \operatorname {\mathrm {span}} \left \{f_n:n\in {\mathbb N}_0\right \}.$
Amusingly, although we only know the location of the eigenvalues implicitly, the directions of the eigenfunctions do not depend on $\lambda _n$ but merely on n; so these directions are stated explicitly.
Proof The direction of $f_n$ can be obtained by simply reading off the direction of Ran $\,\mu \{\lambda _n\}$ from the representation in Theorem 4.6. Since the operator $\boldsymbol {A}_0$ has only eigenvalues $\lambda _n$ , the eigenfunction corresponding to $\lambda _n$ takes on this direction at the locations of the eigenvalues and is zero away from $\lambda _n$ . Of course, the eigenfunction is unique only up to multiplication by a nonzero scalar.
Operator $\boldsymbol {A}_0$ is densely defined on the Hilbert space $L^2_{{\alpha },\beta }(-1,1)$ . Therefore, the span of the eigenfunctions in the spectral representation is complete in the space $L^2(\mu )$ .
Next, we apply these ideas to the perturbed operators $\boldsymbol {A}_{ {}_{\scriptstyle \Theta }}$ . Recall that by the spectral theorem the operator $\boldsymbol {A}_{ {}_{\scriptstyle \Theta }}$ is unitarily equivalent to the multiplication by the independent variable on a vector-valued space $L^2(\mu ^{\Theta })$ with inner product defined analogously to equation (4.13). Here, $\mu ^{\Theta }$ is the matrix-valued spectral measure of $\boldsymbol {A}_{ {}_{\scriptstyle \Theta }}$ with respect to the cyclic set $\{\widetilde {\delta }_{1}, \widetilde {\delta }_{-1}\}$ . Note that $\mu ^{\Theta }$ is defined by fixing this cyclic set; this is analogous to the definition of $\mu $ that is given after Theorem 4.4. The definition of $M_{ {}_{\scriptstyle \Theta }}$ through equation (4.15) and the definition of ${\boldsymbol {A}}_{ {}_{\scriptstyle \Theta }}$ in equation (4.10) are also fixed through this same choice of the cyclic vectors. Consequently, $\mu ^{\Theta }$ and $M_{ {}_{\scriptstyle \Theta }}$ indeed align (i.e., they are related via Theorem 4.4).
The space $L^2(\mu ^{\Theta })$ is vector-valued, and Theorem 4.9 informs us that the eigenspace is one-dimensional in the nondegenerate case (i.e., when (4.18) does not hold). Noting that simple minor adjustments suffice for degenerate eigenvalues, we focus on the nondegenerate case in the next corollary and discuss the degenerate case in Remark 4.14. So for now, assume that all eigenvalues of $\boldsymbol {A}_{ {}_{\scriptstyle \Theta }}$ are simple. In the language of the spectral representation, this means that $\mu ^{\Theta }\{x\}$ has rank one whenever $x=\lambda _n^{\Theta }$ (for all $n\in {\mathbb N}_0$ ) and rank zero for all other x. Here, we express the eigenvectors in the spectral representation in terms of the eigenvalues $\lambda ^{\Theta }_n$ , which are implicitly given by equation (4.19). It is worth noting that the main expression of (4.19) is thought to be transcendental, and therefore exact locations of eigenvalues may not be described explicitly.
Carrying out the analogous proof to that of Corollary 4.11, while replacing the use of Theorem 4.6 by Theorem 4.9, yields the following result.
Corollary 4.12 Let $\Theta $ be a Hermitian $2\times 2$ matrix corresponding to nondegenerate operator $\boldsymbol {A}_{ {}_{\scriptstyle \Theta }}$ . The eigenvector $f^{\Theta }_n\in L^2(\mu ^{\Theta })$ of $\boldsymbol {A}_{ {}_{\scriptstyle \Theta }}$ corresponding to eigenvalue ${\lambda }^{\Theta }_n$ takes the following form in the spectral representation:
The space $L^2(\mu ^{\Theta })=\operatorname {\mathrm {clos}} \operatorname {\mathrm {span}} \left \{f_n^{\Theta }:n\in {\mathbb N}_0\right \}.$
Remark 4.13 At first sight, it seems surprising that only two of the entries of $\Theta $ occur in the eigenvector. Upon reflection of how we arrived at the corollary, it becomes clear that this is because the eigenvalue is located exactly where the other column of the matrix is co-linear by Theorem 4.9.
Remark 4.14 If $\lambda _n^{\Theta }$ is a degenerate eigenvalue, then the corresponding eigenspace in the spectral representation is spanned by $\chi _{ {}_{\scriptstyle \{\lambda ^{\Theta }_n\}}}(x)\begin {pmatrix} 1\\0\end {pmatrix}\in L^2(\mu ^{\Theta })$ and $ \chi _{ {}_{\scriptstyle \{\lambda ^{\Theta }_n\}}}(x)\begin {pmatrix} 0\\1\end {pmatrix}\in L^2(\mu ^{\Theta })$ .
From [Reference Liaw and Treil41, Lemma 2.5], we know that $\operatorname {\mathrm {ran}}(\mathbf {B})$ is cyclic for $\boldsymbol {A}_{ {}_{\scriptstyle \Theta }}$ . Some additional information about the location of the eigenvalues can be obtained by applying the following theorem to perturbations of ${\boldsymbol {A}}_0$ .
Theorem 4.15 [Reference Liaw and Treil41, Theorem 6.3]
Let $\Theta _t=\Theta _0 + t\Theta $ , where $\Theta $ , $\Theta _0$ are Hermitian $2\times 2$ matrices and $\Theta>0$ . Let $\operatorname {\mathrm {tr}}\mu ^{\Theta _0+t\Theta }$ be the scalar spectral measures of $\boldsymbol {A}_{ {}_{\scriptstyle \Theta }}$ . For an arbitrary singular Radon measure $\nu $ on ${\mathbb R}$ ,
for all except maybe countably many $t\in {\mathbb R}$ .
Setting $\nu = \operatorname {\mathrm {tr}}\mu ^{\widetilde \Theta }$ for any Hermitian $2\times 2$ matrix $\widetilde \Theta $ allows us to restrict the relative location of eigenvalues of the operators $\mathbf {A}_{ {}_{\scriptstyle \Theta }}$ along certain lines in the space of perturbation parameters.
Proposition 4.16 Let $\boldsymbol {A}_{ {}_{\scriptstyle \Theta }}$ , $\Theta _0$ , $\Theta $ , and $\Theta _t$ be defined as in Theorem 4.15. For any Hermitian $2\times 2$ matrix $\widetilde \Theta $ , the spectra $\sigma (\boldsymbol {A}_{ {}_{\scriptstyle \widetilde \Theta }})\cap \sigma (\boldsymbol {A}_{ {}_{\scriptstyle \Theta _t}})=\varnothing $ for all but possibly countably many $t\in {\mathbb R}$ .
Proof In Theorem 4.15, we choose $\nu = \operatorname {\mathrm {tr}}\mu ^{\widetilde \Theta }$ . Since the $\boldsymbol {A}_{ {}_{\scriptstyle \Theta }}$ has only eigenvalues, $ \nu $ is a singular Radon measure. By the conclusion of Theorem 4.15, we obtain that $\operatorname {\mathrm {tr}}\mu ^{\widetilde \Theta }\perp \operatorname {\mathrm {tr}}\mu ^{\Theta _t}_{\scriptstyle \mathrm {s}}$ for all but possibly countably many t. Since the eigenvalues of $\boldsymbol {A}_{ {}_{\scriptstyle \widetilde \Theta }}$ and $\boldsymbol {A}_{ {}_{\scriptstyle \Theta _t}}$ are discrete (see, e.g., [Reference Bailey, Everitt and Zettl6, Reference Wang and Zettl53, Reference Zettl55]), the set of eigenvalues equals these operators’ spectra, respectively.
4.5 Image of $f_0$ under $V_{ {}_{\scriptstyle \Theta }}$
We include a simple yet instructive application of the Representation Theorem 4.17 onto the first eigenfunction $f_0$ from Corollary 4.12. In principle, we can replace $f_0$ by any given function $f\in L^2(\mu ).$ We chose $f_0$ just to keep expressions simple.
We recall [Reference Liaw and Treil41, Theorem 5.1], which provides us with a formula for the spectral representation of $\boldsymbol {A}_{ {}_{\scriptstyle \Theta }}$ , i.e., for the unitary operator $V_{ {}_{\scriptstyle \Theta }}: L^2(\mu )\to L^2(\mu ^{\Theta })$ intertwining $\boldsymbol {A}_{ {}_{\scriptstyle \Theta }}$ and $\boldsymbol {M}$ for $\Theta $ a Hermitian $2\times 2$ matrix
which maps each column of $\boldsymbol {B}$ to itself.
Theorem 4.17 The spectral representation $V_{ {}_{\scriptstyle \Theta }}$ takes the form
for $\mathbf {e}\in {\mathbb C}^d$ and compactly supported $h\in C^1({\mathbb R})$ .
Notice that, for $h\in C^1({\mathbb R}),$ the integral does not have a singularity by the Mean Value Theorem.
Corollary 4.18 When $\Theta $ is such that $\boldsymbol {A}_{ {}_{\scriptstyle \Theta }}$ is nondegenerate and $\sigma \left (\boldsymbol {A}_0\right )\cap \sigma \left (\boldsymbol {A}_{ {}_{\scriptstyle \Theta }}\right )=\varnothing $ , we have
In particular, the image under $V_{ {}_{\scriptstyle \Theta }}$ of the first eigenvector of $\boldsymbol {A}$ is being smeared out over all the eigenvectors of $\boldsymbol {A}_{ {}_{\scriptstyle \Theta }}$ .
Remark 4.19 For the interpretation of this function in $L^2(\mu ^{\Theta })$ , only the values of $(V_{ {}_{\scriptstyle \Theta }} f_0)(t)$ at $t=\lambda ^{\Theta }_n$ matter. Furthermore, a scalar factor will occur from taking the orthogonal projection of the vector in $(V_{ {}_{\scriptstyle \Theta }} f_0)(\lambda _n^{\Theta })$ onto the direction of the vector in $f_n^{\Theta }$ from Corollary 4.12.
Proof The location of the zeroth ( $n=0$ ) eigenfunction $f_0$ from Corollary 4.11 is obtained from (4.7): $\lambda _0 = -\alpha -\beta $ . Moreover, Lemma 4.3 in conjunction with Corollary 4.11 informs us that
In order to apply Theorem 4.17, we need to use a compactly, once differentiable function. To that end, let $\mathbf {e} = \begin {pmatrix} \alpha \\\beta \end {pmatrix}$ and where is some $h(x)\in C^1$ which equals $1$ at $s_0 = -\alpha -\beta $ and drops to zero appropriately to the left and right of $s_0$ . This necessitates two assumptions about the function $h(x)$ : Without loss of generality (because the spectrum of A is discrete), we assume that
and
In $L^2(\mu )$ , we have $\widetilde f_0(x) = h(x) \mathbf {e}$ . So, we obtain
upon realizing (4.21) and using Corollary 4.12.
Recall that $\mu $ is the matrix-valued atomic measure given in Theorem 4.6. Now, by Condition (4.20), function $h(x)$ is only nonzero at the atom of the zeroth eigenvalue, when $x = x_0 =-\alpha -\beta $ . Therefore, we obtain
The lemma follows from $ f_0(x) = \widetilde f(x)/\alpha $ and the linearity of $V_{ {}_{\scriptstyle \Theta }}$ .
Remark 4.20 The condition $\sigma (\boldsymbol {A}_0)\cap \sigma (\boldsymbol {A}_{ {}_{\scriptstyle \Theta }})=\varnothing $ is a bit stronger than necessary. In the proof, we only use (4.20) and (4.21). Proposition 4.16 provides us with sufficient (but by no means necessary) conditions for many $\Theta $ .
Remark 4.21 We note that it is also possible to start with any function $h\mathbf {e}\in L^2(\mu ^{\Theta _1})$ , with $h\in C^1({\mathbb R})$ compactly supported, by using $V_{ {}_{\scriptstyle \Theta _2-\Theta _1}}:L^2(\mu ^{\Theta _1})\to L^2(\mu ^{\Theta _2})$ in Theorem 4.17.
Acknowledgment
The authors would like to thank Pavel Kurasov and Annemarie Luger for many insightful conversations regarding the construction of the perturbation and the use of boundary triples and pairs.