GEOMETRICAL OPTICS: Thin Lens Equation Derivation

Of all the various equation derivations one may encounter as an introductory physics student, the ones regarding ray diagrams are as counterintuitive as any. Whether a system at hand consists of thin lenses or mirrors, radiant energy is diagrammed as rays for the sake of simplicity. Never, ever forget that these diagrams are grossly oversimplified depictions of outwardly diverging packets of energy ( photons ) and momentum. In order to justify usage of such visually oversimplified diagrams, an assumption is made that the rays pictured are those that have traveled a considerable distance from the source that omitted them. Under such circumstances, a spherical wavefront will have expanded so much that any small portion of the front that is encountered far away will essentially consist of a planar ( flat ) wavefront. As symmetry would have it, the rays at the boundaries of such an imaginary wavefront are parallel to one another: 

Fortunately, the laws of physics enable parallel rays to converge again if they move through an appropriate medium; thus, a lens that bends rays inward towards a central axis is either called a convex or converging lens. The degree to which a convex lens bends such rays is both a function of a lens’ shape and its optical characteristics, most notable of which is its index of refraction ( n ). In-depth discussions of a material’s refractive index take place during studies of optical scattering. For now, let’s consider a system in which a convex lens focuses light from a distant object:

There are two distances in the diagram that are of importance to us. First, consider the focal length ( f ) on either side of the lens. As we will see, a parallel ray of light emitted from the top of an object of height ( ho ) positioned left of the lens will be bent inward by the lens so that it passes through the focal length ( fi ) on the system’s image side. Maximum symmetry between an object’s height and the image’s inverted height ( – hi ) exist when the object is placed at a distance that is twice that of the focal length:

do = 2fo

di = 2fi

Prior to visually confirming this claim to be true, let’s look at a ray that passes from the top of the object and through the center of the lens at hand:

As can be seen, the ray in question moves through the lens undeflected until it reaches the “ top “ of the image that has been formed. As a consequence, we now have two triangles that are related by an angle ( θ ) that they share with the central axis:

Since the distance that separates the object and image are known, as well as the heights associated with each, we use the tangent ( θ ) function to establish a relationship between the left and right side of the system:

tan ( θ ) = ( opp / adj )

tan ( θ ) = ( ho / do ) = ( – hi / di )

We are now ready to add the ray that is bent inward by the lens to the diagram:

Let’s now take a very close look at the right-hand side of the system ( rightward of the lens ). The ray that dissects the central axis at the image’s focal point ( fi ) creates a new set of symmetrical triangles. This newer pair of triangles share an angle ( ɸ ) with the central axis:

We will now make use of the tan ( ɸ ) function as it relates the two newer triangles, but we must be somewhat creative in our approach. The adjacent side of the first triangle in question is the focal length of the image; however, the adjacent side of the inverted triangular “ twin “ of the pair will have an adjacent side defined in the following manner:

adj = ( di – f )

Taking this new derivation into account, the tangent ( ɸ ) and previous tangent ( θ ) relationship can be used to confirm the symmetry of the right-triangle pairs associated with angles ( θ ) and ( ɸ ):

Note: At this point, whether we keep the subscripts attributed to the focal length in question can be omitted for the sake of simplicity.

Alas, we have finally arrived ( almost ) at our destination:

( ho / – hi ) = [ f / ( di – f ) ]

And,

( ho / – hi ) = ( do / di )

We can now set these equations equal to one another:

( do / di ) = [ f / ( di – f ) ]

We now multiply both sides of the equation by [ ( 1 / do ) / ( 1 / di ) ] to get all of our variables onto the right-side of the equation:

[ ( 1 / do ) / ( 1 / di ) ]( do / di ) = [ ( 1 / do ) / ( 1 / di ) ][ f / ( di – f ) ]

1 = [ ( f / do ) / [ ( di – f ) / di

And,

1 = [ ( f / do ) / [ 1 – ( f / di ) ] ]

Multiplying both sides of this expression by the denominator of the fraction on the right yields another intermediate expression, but it contains the focal length variable on both sides:

[ 1 – ( f / di ) ] = ( f / do )

The next step requires us to multiply both sides of the equation by ( 1 / f ):

( 1 / f )[ 1 – ( f / di ) ] = ( 1 / f )( f / do )

( 1 / f ) – ( 1 / di ) = ( 1 / do )

( 1 / f ) = ( 1 / do ) + ( 1 / di )

The lattermost equation is the Gaussian Lens Equation. We must realize that the diagram above will change if the object’s location within the system is changed. These changes will cause changes to occur in related dimensions of interest, namely the ratios of the object and image heights as well as ratios of the distances that separate the image and object from the lens:

Magnification ( M ) = ( hi / ho ) = ( – di / do )

After indulging in some additional algebraic back flips, we’ll end up with the Thin-Lens Equation:

( 1 / do ) + ( 1 / di ) = ( n1 – 1 )[ ( 1 / R1 ) – ( 1 / R2 ) ]

And,

( 1 / f ) = ( n1 – 1 )[ ( 1 / R1 ) – ( 1 / R2 ) ]

The ( n ) term references the previously mentioned index of refraction, which shall be expanded upon at a more appropriate time.

Published by George Tafari

In 2004, I became history's second African American student to earn a degree in physics ( chemistry minor ) from the College of Charleston in beautiful Charleston, South Carolina. Keep it 7!!! X

Leave a comment