We use cookies to distinguish you from other users and to provide you with a better experience on our websites. Close this message to accept cookies or find out how to manage your cookie settings.
To save content items to your account,
please confirm that you agree to abide by our usage policies.
If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account.
Find out more about saving content to .
To save content items to your Kindle, first ensure [email protected]
is added to your Approved Personal Document E-mail List under your Personal Document Settings
on the Manage Your Content and Devices page of your Amazon account. Then enter the ‘name’ part
of your Kindle email address below.
Find out more about saving to your Kindle.
Note you can select to save to either the @free.kindle.com or @kindle.com variations.
‘@free.kindle.com’ emails are free but can only be saved to your device when it is connected to wi-fi.
‘@kindle.com’ emails can be delivered even when you are not connected to wi-fi, but note that service fees apply.
So far we have only dealt with waves in unbounded spaces. The item now on the agenda is extending the wave theory to encompass the passage of waves through lenses.
When a light wave enters a lens, it is redirected and distorted by the lens surfaces, and truncated by the diaphragm as well as by the lens edges. The calculation of the wave arriving in the image space requires, from the point of view of an uncompromising physicist, the solution of a boundary value problem so complicated that its exact solution is hopelessly intractable except for very simple cases, such as diffraction by a solid homogeneous sphere [18] or the reflection of a plane wave by the exterior of a perfectly conducting paraboloid [26]. Courageous approximations are clearly needed to arrive at a theory that can be used in the daily practice of lens design.
One simplifying feature of the problem is the linear relation between the field in the object space and the field in the image space. The field in the image space created by the sum of several input fields is the sum of the image fields created by the individual inputs. There are, of course, exceptions to this rule. Non-linear media may display frequency doubling; absorption of the input wave may cause thermal effects that change the lens characteristics, etc. Such effects can better be dealt with on a case to case basis after the linear theory is worked out in detail.
In this chapter we show how to calculate the paraxial properties of a lens when its construction parameters are specified. Rather than using the eikonal techniques of the previous chapter, we base this work on Snell's law and simple geometry. The results will be quite similar to what we found before; in particular we will rederive the linear equations connecting the ray parameters in the object and image space.
Why then did we bother with the heavy machinery of the previous chapters at all? There are two important reasons. First, in the current chapter we assume right from the start that all angles with the axis are small. Extending this work to larger angles leads to a thicket of thorny mathematics that firmly refuses to provide any general insights. Secondly, the eikonal functions introduced in the previous chapters play, as we shall see later, an indispensable role in the diffraction theory of image formation, where they appear again and again as phase functions in wave propagation integrals. Snell's law is not a suitable tool to clarify the relations between geometrical optics and the wave theory.
Notation and sign rules
An unambiguous notation is needed to keep track of the many parameters involved. The rules we shall use in this book are listed below.
(i) In the object space the light travels from left to right.
(ii) A lower case r denotes the radius of curvature of a lens surface. Its reciprocal 1/r is the curvature R.
The vast majority of lenses are used for image formation. Camera lenses, eye glasses, binoculars, and all sorts of other lens systems are useful because they form, each in their own way, an image of an object. To begin a discussion of the image forming process we choose a fixed source point P0 in the object space of a lens that we wish to study, and follow the rays emerging from this point all the way through the lens to the image space. One might hope that the rays emerging into the image space pass through a single point again, but this is rarely the case. Usually each of the emerging rays intersects the nominal image plane at a slightly different point. It is the task of the lens designer to bring, for a specified set of conditions, these intersection points as close together as possible.
Although the emerging rays do not usually pass through one point, the ray patterns in the image space created by a single source point in the object space are not completely arbitrary. Fermat's principle imposes an important restriction. To analyze this restriction we choose in the image space any convenient Cartesian coordinate system (x, y, z) and introduce the path function E(x, y) from the source point P0 to points (x, y, 0) in the z = 0 coordinate plane.
A ray can be specified by one of its points and its direction at that point. In a homogeneous medium we can, for instance, select a ray by specifying the coordinates (x, y) of its point of intersection with the plane z = 0, and the first two components (L, M) of the unit vector in the direction of the ray. The third direction cosine N is not needed because it is the square root of (1 – L2 – M2). The number of parameters needed to specify a ray is clearly four: x, y, L, and M.
Now consider a ray as it approaches a lens. It enters the lens, travels through the lens along a possibly tortuous path, and leaves the lens to enter the image space. Each of the four parameters needed to give a complete description of the ray in the image space is determined by the four parameters that specify the ray in the object space. It seems to follow that four functions of four variables are needed to characterize the lens fully: x′, y′, L′, and M′, each as a function of x, y, L, and M. In this chapter we shall see that this conclusion is incorrect; on account of Fermat's principle one single function of four variables is all that is required. As a result, Fermat's principle puts a severe constraint on the imaging processes a lens can carry out.
Many books on modern optics confine the treatment of lens theory to the paraxial approximation. Aberrations are treated casually as an afterthought, and it usually remains unexplained whether they are due to the laws of physics or due to the limited art of the lens designer. The reader is often left with the notion that lenses must be designed to obey the laws of Gaussian optics as closely as possible. This is regrettable, because it has been known since the eighteenth century that paraxial optics used with finite heights and angles leads to projective geometry, a valuable branch of mathematics which is, however, a poor representation of the behavior of actual lenses.
The development of Fourier optics over the last forty years has brought lens theory and physics much closer together, but again many insights are lost because most authors, in spite of the ubiquity of high aperture lenses in the laboratory, are content with the small angle approximation when dealing with the theory of image formation. Either a clear and convincing demonstration should be given that the small angle approximation can be used with impunity for very large angles, or the theory should be developed honestly, without the small angle approximation. This honest theory exists, but is buried in books and papers providing so much detail that beginners are apt to get lost in the mathematical intricacies.
This appendix gives a recipe for calculating the third order aberrations of a lens when its construction data are known. As an example we use the 4 diopter eye glass corrected for astigmatism described in section 30.6. This is a thin lens with a focal length of 250 mm, and the exit pupil 25 mm to the right of the lens.
The first step is to trace two paraxial rays through the system, as shown in table A2.1. The first ray, called the object ray, comes from the axial point of the object and must be chosen such that its direction in the image space is +1. For the eye glass used as an example the object is located at infinity, so the incident ray must be parallel to the axis, and 250 mm below it to obtain the direction +1 in the image space. The first two columns of numbers in table A2.1 show the data for this ray. The unit of length used is the dm (100 mm); this avoids very large and very small numbers. Lines 1, 2, 3, and 6 contain the radii, refractive indices, and surface spacings of the lens. The incident ray is specified by h = –2.5 dm on line 7 and ψ=0 on line and 10. Filling in lines 11 through 15 of the first column yields the direction of the ray after the first surface, and lines 8 and 9 provide the height of the ray at the next surface. These two numbers, 2.0951 for the new direction and –2.5 for the new height, are transferred to the next column, which is then completed to find the height and direction of the ray as it emerges from the second surface. In our case there are only two surfaces; if there are more surfaces the process is repeated till the last surface is reached. Line 16 must be completed as well.
A concentric system is constructed of spherical surfaces, refracting or reflecting, that all have a common center. A solid sphere is a simple example; two other examples are shown in fig. 29.1. A concentric lens is insensitive to rotations around its center; this high degree of symmetry determines the special properties of concentric systems.
For any incident ray the plane containing the ray and the center of the system divides the lens into two symmetric halves. There is no reason for the ray to prefer one of these halves over the other; so the ray will remain in the symmetry plane as it traverses the lens. It follows that every ray travels in a plane containing the center.
As long as the lens is not afocal the angle eikonal W(L, M, L′, M′) can be used. We choose any straight line through the center as the axis, and locate both the (x, y) reference plane in the object space and the (x′, y′) reference plane in the image space right in the center of the system. Then the angle eikonal is the optical distance from the projection P of the center onto the ray in the object space to the projection P′ of the center onto the ray in the image space. On account of the spherical symmetry of the lens a rotation of the entire ray around the center has no effect on the value of the eikonal function.
This appendix describes a ray tracing scheme that can be used to trace a meridional ray by hand. It is a slight modification of one of T. Smith's ray tracing procedures [74]. In this calculation a ray approaching a surface is specified by the sine of the angle ψ with the axis, and the perpendicular distance h from the vertex of the surface to the ray. The calculation yields the sine of the angle ψ′ with the axis after refraction, and the perpendicular distance h′ from the vertex to the refracted ray. An additional calculation determines the change in h′ as the ray travels to the next surface.
Table A3.1 shows a ray traversing a cemented doublet which happens to have a unit focal length. The radii, refractive indices, and thicknesses are found on lines 1, 3, 4, and 5. The ray enters the lens parallel to the axis; so ψ = 0 for the first surface, as shown on line 11. The entrance height is 0.125, corresponding to an F/4 aperture; it is shown on line 6.
The change in direction is calculated in two steps. Lines 12 through 15 yield an approximate value sinθ for sin ψ′. Paraxially this value would be correct, but to obtain exact results a correction must be made. This correction is calculated in lines 18 through 26, followed by lines 16 and 17. The change from h to h′ is generated at the same time (lines 7 and 8).