The additional perihelion precession of Mercury due to general relativity can be calculated by a method that is no more difficult than solving for the Newtonian orbit. This method relies on linearizing the relativistic orbit equation, is simpler than standard textbook methods, and is closely related to Newton's theorem on revolving orbits. The main result is accurate for all values of G M / c 2 for near-circular orbits.

An important and early success of general relativity was an explanation of the anomalous precession of the perihelion of Mercury by 43 arcseconds per century.1 The same analysis also applies to the recent observation of the precession of a star orbiting the supermassive black hole at the centre of our Milky Way galaxy.2 It is of interest to derive this result in the classroom, but standard derivations typically either apply perturbation theory to the orbit equation,3–5 or rely on an approximate factorization of the relativistic energy equation6–8 (formally related to Einstein's original calculation).1 However, the perturbation method requires finding a particular integral of an inhomogenous second-order differential equation (which is usually supplied to students) and extracting its nonperiodic component, while the energy approach requires evaluating a nontrivial integral via nonobvious changes of variables, which is again usually supplied to students.

Alternative derivations in the literature similarly tend to have challenging features for introductory students. For example, a method based on the Runge–Lenz vector is very elegant but requires sophisticated mathematical machinery.9 Furthermore, while approaches based on the small eccentricities of near-circular orbits are more elementary in character, they require ad hoc assumptions about the typical orbital radius10,11 and/or angular momentum12 to obtain the correct result.

The purpose of this note is to point out a particularly simple method for calculating the precession in an introductory course. This is based on linearizing a quadratic term in the relativistic orbit equation. The resulting approximate orbit equation is as easy to solve as the Newtonian orbit equation, is able to estimate the precession for all values of G M / c 2 while avoiding ad hoc assumptions, and is related to Newton's method for estimating the precession of near-circular orbits.13,14

The outline of this paper is as follows: Necessary elements of Newtonian orbits are briefly reviewed in Sec. II. The linearization approach is presented in Sec. III, and underlying assumptions and comparisons with other methods are discussed in Sec. IV.

Newton's orbit equation for a particle moving in a given plane about a gravitational centre of mass M is generally solved by writing it in the Binet form3 
(1)
where r = 1 / u is the orbital distance, ϕ is the orbital angle relative to the perihelion, G is Newton's gravitational constant, and h = r 2 ( d ϕ / d t ) is the conserved specific angular momentum. The general solution is found by adding the constant solution u 0 = G M / h 2 to any solution of the homogenous equation d 2 u / d ϕ 2 + u = 0 to give
(2)
for some constant e, with e 0 to ensure perihelion at ϕ = 0. This represents a conic section with e <1 for a bound orbit, corresponding to an ellipse with focus at M and eccentricity e.
The parameters u0 and e in Eq. (2) are fully determined by the perihelion distance r min and aphelion distance r max of the orbit via 1 / r min = u 0 ( 1 + e ) and 1 / r max = u 0 ( 1 e ), yielding
(3)
Thus, observations of the perihelion and aphelion are sufficient to fix the orbit.15 One also has the useful relation
(4)
where a is the semi-major axis of an elliptical orbit.
In general relativity, the motion of a body orbiting a much larger mass M in a given plane, such as Mercury orbiting the Sun, is well-modeled by the equation for a geodesic orbit around a stationary black hole of mass M,3–8 
(5)
where r = 1 / u and ϕ are the orbital distance and the angle relative to perihelion in Schwarzschild coordinates, respectively, j = r 2 ( d ϕ / d τ ) is the conserved specific angular momentum with respect to proper time, and c is the speed of light. The quadratic term on the right side has no analogue in the Newtonian orbit equation (1) and is responsible for a non-Newtonian precession of bound orbits. The aim of this note is to give a simple method for estimating this precession.
The central idea is to linearize the quadratic term in Eq. (5) about the average value u ¯ of u, under the assumption that ( u u ¯ ) 2 u ¯ 2, which is equivalent to | r u ¯ 1 | r. This is clearly a good approximation for near-circular orbits and is closely related to Newton's theorem on revolving orbits, as discussed below. In particular, linearization gives
(6)
Substituting this result into Eq. (5) yields the approximate orbit equation
(7)
with the constants
(8)
Just as for the solution of the Newtonian orbit equation (1), the general solution of Eq. (7) is found by adding the constant solution, u 0 = L / K, to any solution of the homogenous equation d 2 u / d ϕ 2 + K u = 0, to give
(9)
for a suitable constant e, with e [ 0 , 1 ) for bound orbits.16 Noting that the cosine term varies between 1 and –1, it follows that the parameters u0 and e in Eq. (9) are related to the observed perihelion and aphelion distances of the orbit as per Eqs. (3) and (4) for Newtonian orbits.15 
However, whereas the Newtonian orbit in Eq. (2) returns to perihelion at ϕ = 2 π, the relativistic orbit in Eq. (9) returns to perihelion at K ϕ = 2 π, i.e., at
(10)
to first order in G M u ¯ / c 2, using the approximation ( 1 x ) 1 / 2 1 + x / 2. Thus, in the relativistic case, there is a precession of the perihelion per orbit by an amount
(11)
To determine the average value u ¯ for use in Eq. (11), note that the orbit equations (5) and (7) are with respect to ϕ, so that an average over ϕ is both appropriate and natural. Then, since K ϕ in Eq. (9) ranges over 0– 2 π from the initial perihelion to the next, the cosine term is zero on average, giving u ¯ = u 0.17 Hence, applying Eqs. (3) and (4) gives
(12)
and substituting into Eq. (11) gives
(13)
to first order in G M / ( a c 2 ). This is precisely the same expression for the relativistic precession calculated via standard methods in the literature, and for Mercury's 88-day orbit about the Sun ( M = M = 1.99 × 10 30 kg, r min = 4.60 × 10 10 m, r max = 6.98 × 10 10 m18) evaluates to the well-known observed value of 43 arcseconds per century.3–8 Equation (13) further predicts the much larger precession of 12 arcminutes per orbit for the star S2, which orbits the supermassive black hole at the centre of our galaxy every 16 years ( M = 4.26 × 10 6 M , r min = 1.8 × 10 13 m, r max = 29.4 × 10 13 m), which is again consistent with observations.2,19

The linearization method presented here shows that the relativistic precession of Mercury's orbit may be obtained with no more mathematical sophistication than is required for solving the Newtonian orbit equation. In particular, no higher-order perturbative solutions3–5 or nontrivial integrals6–8 need be evaluated. This makes the method very suitable for introductory courses in general relativity. For teachers and advanced students, the additional points discussed below may also be of interest.

First, it is worth noting that while Eq. (13) can be shown to be accurate to first order in G M / ( a c 2 ) for all values of e2,20 the general prediction of the linearization method in Eq. (11),
(14)
is, conversely, accurate to first order in e2 for all values of G M / ( a c 2 ). In particular, substituting Eq. (9) into the central assumption ( u u ¯ ) 2 u ¯ 2 used in Eq. (6) and recalling u 0 = u ¯ gives the equivalent condition
(15)
for the validity of Eq. (14) independent of the value of G M / ( a c 2 ), corresponding to near-circular orbits. For Mercury, one has e 2 0.04, so that the method accurately predicts the precession.

The simplicity and generality of condition (15) may be compared to the relatively complicated and restrictive condition required in a related approach for near-circular orbits by Lemmon and Mondragon,12,21 which linearizes the orbit equation about u c = j 2 / ( G M ) and makes an ad hoc identification of the relativistic angular momentum j with the angular momentum h of a Newtonian orbit. This approach not only requires comparing relativistic and non-relativistic orbits having unequal perihelion distances but moreover, unlike Eq. (14), is accurate only to second order in G M / ( j c ).

Second, there are two weak constraints, K >0 and L >0, on the applicability of the linearization method, implicit in the form of the general solution in Eq. (9). These constraints correspond to the existence of solutions with 0 < r < , i.e., to bound orbits, and from Eqs. (8) and (12) they can be written in the form
(16)
where rS is the Schwarzschild radius of the central mass M and v is a measure of orbital speed. These constraints trivially hold for planetary orbits in our solar system, where r S 3 km for the Sun and v 0.0002 c for Mercury.18 
Third, as mentioned in Sec. III, the linearization method is closely related to Newton's theorem on revolving orbits: If r ( ϕ ) describes an orbit corresponding to a central force per unit mass f(r) and angular momentum h, then r ( k ϕ ) describes an orbit corresponding to a central force f ( r ) + α / r 3 and angular momentum H = h / k, with α = ( k 2 1 ) H 2.13,14,22 This theorem follows from the Newtonian orbit equation for the latter case3,14
(17)
which simplifies to
(18)
with solution u = 1 / r ( k ϕ ) following from the definition of r ( ϕ ). The linear term on the right side of Eq. (17), which corresponds to an inverse-cube force, plays the same formal role here as the linear term obtained via linearization in Eq. (6). Thus the simple connection between the forms of the elliptical and precessing orbits in Eqs. (2) and (9) may be viewed as a special case of Newton's theorem on revolving orbits.
Furthermore, Newton used his theorem to estimate the precession of near-circular orbits perturbed by an arbitrary additional force, by approximating the perturbing force by an inverse-cube force.13,14 However, while this is analogous to linearization of the quadratic term in Eq. (5), previous attempts to apply Newton's method to the relativistic precession of Mercury10,11 have relied on an ad hoc assumption that is avoided in the linearization approach. In particular, the relativistic orbit equation in Eq. (5) corresponds to a perturbing force 3 j 2 G M / ( c 2 r 4 ) [cf. Eq. (17)], which is approximated via the Taylor series expansion 1 / r 1 / R ( r R ) / R 2 about some fixed value r = R using 1 / r 4 = ( 1 / r ) ( 1 / r 3 ) 2 / ( R r 3 ) 1 / ( R 2 r 2 ), i.e., by the sum of an inverse-cube term and an inverse-square term. The latter can be absorbed into the Newtonian gravitational force, and applying Newton's theorem for revolving orbits then leads to the corresponding precession10,11
(19)
to first order in G M / ( R c 2 ). Comparing with Eq. (13), it follows that one should choose R = a ( 1 e 2 ) (to at least first order in e2) to obtain the correct prediction for near-circular orbits. However, this choice is purely ad hoc. Rowlands simply makes this choice ab initio, without any physical motivation,10 while Nguyen instead chooses R = a / ( 1 e 2 ), leading to a predicted precession of only 39 arcseconds per century for Mercury.11 An alternative choice, the average orbital distance
(20)
again leads to an incorrect prediction.23 In contrast, the linearization method leads naturally and directly to the correct result in Eq. (13) and its generalization in Eq. (14).

It is concluded that the linearization method provides a simple and useful approach for calculating the general relativistic contribution to the precession of Mercury's orbit. The corresponding orbit equation (7) is no more difficult to solve than the Newtonian orbit equation (1), making the method particularly suitable for introductory courses; the precession is accurately estimated for any value of G M / ( a c 2 ) for near-circular orbits ( e 2 1); and the natural expansion about the average inverse radius u ¯ keeps the basic idea of Newton's method for calculating precession intact while removing the need for ad hoc assumptions.

The author thanks the referees for several valuable suggestions.

1.
A.
Einstein
, “
Erklärung der perihelbewegung des merkur aus der allgemeinen relativitätstheorie
,”
Sitzungsber. K. Preuss. Akad. Wiss.
47
,
831
839
(
1915
), https://nms.kcl.ac.uk/gerard.watts/Einstein-1915.pdf.
2.
GRAVITY Collaboration
, “
Detection of the Schwarzschild precession in the orbit of the star S2 near the Galactic centre massive black hole
,”
Astron. Astrophys.
636
,
L5–L19
(
2020
).
3.
S. T.
Thornton
and
J. B.
Marion
,
Classical Dynamics of Particles and Systems
, 5th ed. (
Brooks/Cole
,
Belmont
,
2004
), Secs. 8.4 and 8.9, pp.
291
295
and 312–316.
4.
W.
Rindler
,
Relativity: Special, General, and Cosmological
, 2nd ed. (
Oxford U. P.
,
Oxford
,
2006
), Sec. 11.9, pp.
241
245
.
5.
P. J.
O'Donnell
,
Essential Dynamics and Relativity
(
CRC Press
,
Boca Raton, FL
,
2014
), Sec. 5.8, pp.
127
129
.
6.
S.
Weinberg
,
Gravitational and Cosmology
(
Wiley
,
New York
,
1972
), Sec. 8.6, pp.
194
201
.
7.
J. B.
Hartle
,
Gravity: An Introduction to Einstein's General Relativity
(
Addison-Wesley
,
San Francisco, CA
,
2003
), pp.
216
217
.
8.
M. J. W.
Hall
,
General Relativity: An Introduction to Black Holes, Gravitational Waves, and Cosmology
(
Morgan & Claypool/IOP Publishing
,
San Rafael, CA
,
2018
), pp.
5-13
5-14
.
9.
M. G.
Stewart
, “
Precession of the perihelion of Mercury's orbit
,”
Am. J. Phys.
73
,
730
734
(
2005
).
10.
P.
Rowlands
, “
A simple approach to the experimental consequences of general relativity
,”
Phys. Educ.
32
,
49
55
(
1997
).
11.
H. D.
Nguyen
, “
Rearing its ugly head: The cosmological constant and Newton's greatest blunder
,”
Am. Math. Mon.
115
,
415
430
(
2008
).
12.
T. J.
Lemmon
and
A. R.
Mondragon
, “
Alternative derivation of the relativistic contribution to perihelic precession
,”
Am. J. Phys.
77
,
890
893
(
2009
).
13.
I.
Newton
,
Mathematical Principles of Natural Philosophy
, translated by
A.
Motte
and
F.
Cajori
(
University of California Press
,
Berkeley, CA
,
1962
), Book I, Sec. IX, pp.
135
147
.
14.
S.
Chandrasekhar
,
Newton's Principia for the Common Reader
(
Clarendon
,
Oxford
,
1995
), pp.
184
199
.
15.
Note also in both Eqs. (2) and (9) that 1 / u 0 is equal to the orbital distance at the angle halfway from perihelion to aphelion (corresponding to the semilatus rectum for elliptical orbits).
16.
One can alternatively obtain this result by dividing Eq. (7) by K to give d 2 u d ( K ϕ ) 2 + u = L / K, which is equivalent to the Newtonian orbit equation (1) with ϕ replaced by K ϕ.
17.
The same result is obtained if one averages over many orbits, corresponding to u ¯ = lim Φ 1 Φ 0 Φ d ϕ u ( ϕ ).
18.
Planetary data are available at, e.g., the NASA website <https://nssdc.gsfc.nasa.gov/planetary/factsheet/> (accessed on July 2022).
19.
The given values of r min and r max = r min ( 1 + e ) / ( 1 e ) for the orbit of S2 follow directly from the parameters r min = 120 AU and e = 0.885 given in: GRAVITY Collaboration
, “
Detection of the gravitational redshift in the orbit of the star S2 near the Galactic centre massive black hole
,”
Astron. Astrophys.
615
,
L15
(
2018
).
20.
K.-H.
Lo
,
K.
Young
, and
B. Y. P.
Lee
, “
Advance of perihelion
,”
Am J. Phys.
81
,
695
702
(
2013
).
21.
The self-consistency condition in Eq. (31) of Ref. 12 is e + 3 ( G M j c ) 2 ( e + 2 ) 1.
22.
M. M.
D'Eliseo
, “
The first-order orbital equation
,”
Am. J. Phys.
75
,
352
355
(
2007
).
23.
Similarly, choosing R to be the time average of r between successive perihelia P and P and using r 2 d ϕ d τ = j gives R = P P r d τ / P P d τ = 0 2 π d ϕ [ u 0 ( 1 + e cos ϕ ) ] 3 / 0 2 π d ϕ [ u 0 ( 1 + e cos ϕ ) ] 2 = a ( 1 + 1 2 e 2 ), thus also leading to an incorrect prediction.