The additional perihelion precession of Mercury due to general relativity can be calculated by a method that is no more difficult than solving for the Newtonian orbit. This method relies on linearizing the relativistic orbit equation, is simpler than standard textbook methods, and is closely related to Newton's theorem on revolving orbits. The main result is accurate for all values of for near-circular orbits.
I. INTRODUCTION
An important and early success of general relativity was an explanation of the anomalous precession of the perihelion of Mercury by 43 arcseconds per century.1 The same analysis also applies to the recent observation of the precession of a star orbiting the supermassive black hole at the centre of our Milky Way galaxy.2 It is of interest to derive this result in the classroom, but standard derivations typically either apply perturbation theory to the orbit equation,3–5 or rely on an approximate factorization of the relativistic energy equation6–8 (formally related to Einstein's original calculation).1 However, the perturbation method requires finding a particular integral of an inhomogenous second-order differential equation (which is usually supplied to students) and extracting its nonperiodic component, while the energy approach requires evaluating a nontrivial integral via nonobvious changes of variables, which is again usually supplied to students.
Alternative derivations in the literature similarly tend to have challenging features for introductory students. For example, a method based on the Runge–Lenz vector is very elegant but requires sophisticated mathematical machinery.9 Furthermore, while approaches based on the small eccentricities of near-circular orbits are more elementary in character, they require ad hoc assumptions about the typical orbital radius10,11 and/or angular momentum12 to obtain the correct result.
The purpose of this note is to point out a particularly simple method for calculating the precession in an introductory course. This is based on linearizing a quadratic term in the relativistic orbit equation. The resulting approximate orbit equation is as easy to solve as the Newtonian orbit equation, is able to estimate the precession for all values of while avoiding ad hoc assumptions, and is related to Newton's method for estimating the precession of near-circular orbits.13,14
II. NEWTONIAN ORBITS
III. PRECESSION OF RELATIVISTIC ORBITS
IV. DISCUSSION
The linearization method presented here shows that the relativistic precession of Mercury's orbit may be obtained with no more mathematical sophistication than is required for solving the Newtonian orbit equation. In particular, no higher-order perturbative solutions3–5 or nontrivial integrals6–8 need be evaluated. This makes the method very suitable for introductory courses in general relativity. For teachers and advanced students, the additional points discussed below may also be of interest.
The simplicity and generality of condition (15) may be compared to the relatively complicated and restrictive condition required in a related approach for near-circular orbits by Lemmon and Mondragon,12,21 which linearizes the orbit equation about and makes an ad hoc identification of the relativistic angular momentum j with the angular momentum h of a Newtonian orbit. This approach not only requires comparing relativistic and non-relativistic orbits having unequal perihelion distances but moreover, unlike Eq. (14), is accurate only to second order in .
It is concluded that the linearization method provides a simple and useful approach for calculating the general relativistic contribution to the precession of Mercury's orbit. The corresponding orbit equation (7) is no more difficult to solve than the Newtonian orbit equation (1), making the method particularly suitable for introductory courses; the precession is accurately estimated for any value of for near-circular orbits ( ); and the natural expansion about the average inverse radius keeps the basic idea of Newton's method for calculating precession intact while removing the need for ad hoc assumptions.
ACKNOWLEDGMENTS
The author thanks the referees for several valuable suggestions.