Skip to article frontmatterSkip to article content

Chapter 9 - Gravity

9.1Overview

In previous chapters, we have so far learned about Newton’s Theory of Classical Mechanics, which allowed us to model the motion of an object based on the forces acting on the object. In this chapter, we present the theories that describe the force of gravity itself. We will see several theories of gravity and focus primarily on Newton’s Universal Theory of Gravity.

9.2Kepler’s Laws

Although humans have long been fascinated by the motion of objects in the sky, it was Johannes Kepler, in the early seventeenth century, that was the first to write down quantitative rules that described the motion of planets around the Sun. His theory was based on the extensive and detailed observations recorded by Tycho Brahe in the late sixteenth century.

Kepler proposed three laws that describe all of the data that Tycho Brahe had collected about planetary motion:

  1. The path of a planet around the Sun is described by an ellipse with the Sun at one of its foci.
  2. All planets move in such a way that the area swept by a line connecting the planet and the Sun in a given period of time is constant.
  3. The ratio between the orbital periods, TT, of two planets squared is equal to the ratio of the semi-major axes, ss, of their orbits cubed:
(T1T2)2=(s1s2)3\left(\frac{T_1}{T_2}\right)^2=\left(\frac{s_1}{s_2}\right)^3

We examine these three laws in more detail in the sections that follow. It should also be noted that, even though Kepler’s Laws were derived for planets orbiting the Sun, they apply to any body that is orbiting any other body under the influence of gravity[22].

9.2.1Kepler’s First Law

Kepler noticed that the motion of all planets followed the path of an ellipse with the Sun located at one of its foci. Figure 1 shows a diagram of an ellipse, along with its two foci, its semi-major axis, ss, its semi-minor axis, bb, and its eccentricity, and the distance between a focus and the centre of the ellipse, cc. We define the eccentricity, ee of the ellipse as the ratio e=cse=\frac{c}{s}. The eccentricity is a measure of how far a focus is from the centre of the ellipse. A larger eccentricity thus corresponds to a “flatter” ellipse. Note that a circle is just a special case of an ellipse, with both foci located at the centre of the circle. The eccentriciy of a circle is thus 0.

A ellipse, showing its two foci, its semi-major axis, $s$, its semi-minor axis, $b$, and the distance between a focus and the centre of the ellipse, $c$.

Figure 1:A ellipse, showing its two foci, its semi-major axis, ss, its semi-minor axis, bb, and the distance between a focus and the centre of the ellipse, cc.

The sun is located at one of the foci. The point of closest approach to the Sun is called the “perihelion” of the orbit (or “perigee” if the orbit is around the Earth), and the point furthest from the Sun is called the “aphelion” of the orbit (or “apogee” if the orbit is around the Earth), as shown in Figure 2.

The orbit of the Earth around the Sun, showing the perihelion and aphelion, and the orbit of the Moon around the Earth, showing the perigee and the apogee. (Not to scale.)

Figure 2:The orbit of the Earth around the Sun, showing the perihelion and aphelion, and the orbit of the Moon around the Earth, showing the perigee and the apogee. (Not to scale.)

9.2.2Kepler’s Second Law

Kepler’s Second Law is really a statement about the speed of a planet in an elliptical orbit. It states that the area swept by a line connecting the planet and the Sun in a given period of time is fixed. This is illustrated in Figure 4, which shows the elliptical orbit of a planet around the Sun located at one of the foci, and the area swept out when the planet goes from AA to BB and from CC to DD.

Illustration of Kepler's Second Law, showing the area that is "swept" by a planet in a fixed period of time.

Figure 4:Illustration of Kepler’s Second Law, showing the area that is “swept” by a planet in a fixed period of time.

Kepler’s Second Law states that the two areas that are shown by the greyed out sections in the figure are the same if the planet took the same amount of time to travel between points AA and BB as it did to travel between points CC and DD. Because the points CC and DD are further away from the Sun than points AA and BB, the distance between points CC and DD must be smaller than the distance between points AA and BB for the two areas to be the same. This, in turn, implies that the planet must be moving slower between CC and DD than between points AA and BB. The speed of a planet is thus greatest at the perihelion and smallest at the aphelion. As we will see in a later chapter, Kepler’s Second Law is equivalent to the statement that the angular momentum of the planet about the Sun is conserved.

9.2.3Kepler’s Third Law

Kepler’s Third Law is quantitative and relates the orbital periods (TT) and the semi-major axes (ss) between any two planets in orbit around the Sun:

(T1T2)2=(s1s2)3\left(\frac{T_1}{T_2}\right)^2=\left(\frac{s_1}{s_2}\right)^3

We can re-arrange this relation so that all of the quantities related to one planet are on the same side of the equal sign:

T12s13=T22s23=constant\frac{T_1^2}{s_1^3}=\frac{T_2^2}{s_2^3}=\text{constant}

In other words, the ratio between the orbital period squared and the semi-major axis cubed is a constant, independent of the particular planet. In Example 9.2, we will use Newton’s Universal Theory of Gravity to evaluate the constant.

9.3Newton’s Universal Theory of Gravity

Newton supposedly gained insight into the gravitational force by observing an apple falling from a tree and concluding that if it is the same force that makes apples fall at sea level and at the top of a mountain, perhaps that force can be exerted all the way up to the moon. It is rather remarkable that Newton was able to make the connection between falling apples and the motion of the moon around the Earth to find a single theory to describe both situations.

We should be clear that the theory of gravity is a different theory than Newton’s “Laws of Motion” (Newton’s Three Laws). The Laws of Motion introduce the concepts of force and inertial mass, and prescribe how to use those concepts in order to model motion using kinematics. Newton’s Universal Theory of Gravity is a theory that describes the force of gravity that two bodies with (gravitational) mass exert on each other.

Newton’s Universal Theory of Gravity states that if two bodies, with masses M1M_1 and M2M_2, located at positions r1\vec r_1 and r2\vec r_2, respectively, are separated by a distance, rr, then M2M_2 will exert an attractive force on M1M_1, F12\vec F_{12}, given by:

F12=GM1M2r2r^21\vec F_{12}=-G\frac{M_1M_2}{r^2}\hat r_{21}

where r^21\hat r_{21} is the unit vector from M2M_2 to M1M_1:

r21=r2r1r^21=1rr21\begin{align*} \vec r_{21} &= \vec r_2 - \vec r_1\\ \hat r_{21} &= \frac{1}{r} \vec r_{21} \end{align*}

as shown in Figure 5. F12\vec F_{12} should be read as “the force on body 1 from body 2”. G=6.67×1011Nm2/kg2G=6.67\times 10^{-11} {\rm Nm^2/kg^2} is Newton’s Universal Constant of Gravity. It should be noted that Newton’s theory for the force of gravity written in this form only applies to either point masses (separated by a distance rr) or spherical bodies whose centres are separated by a distance rr that is larger than the radius of either sphere.

Illustration of the vectors involved in Newton's Universal Theory of Gravity.

Figure 5:Illustration of the vectors involved in Newton’s Universal Theory of Gravity.

Originally, Newton argued that the force of gravity would be proportional to the masses of the bodies, and inversely proportional to the square of the distance between them:

F12M1M2r2F_{12}\propto \frac{M_1M_2}{r^2}

and GG is simply the constant of proportionality.

Because of Newton’s Third Law, body 1 exerts a force on body 2 that is equal in magnitude but opposite in direction:

F12=F21\vec F_{12} = -\vec F_{21}

9.3.1Weight and apparent weight

You have probably seen images of astronauts floating around the International Space Station (ISS) or other orbiting vessels, and heard of the term “weightlessness” to describe their motion. The ISS is in orbit at an altitude of approximately 400km400 {\rm km}, where the force of Earth’s gravity is far from negligible (in Example 9.3 we showed that one needs to go to an altitude of 2640km2640 {\rm km} for the force to be reduced by half of that at the surface of the Earth). The contradiction between being weightless and the fact that weight is not zero is resolved by understanding that the popular term “weightless” is imprecise from a physics perspective.

The correct term to use from a physics perspective is to say that the apparent weight of the astronauts is zero when they are floating around. Weight is the magnitude of the force of gravity exerted by the Earth. Apparent weight is, for example, the force that one measures when standing on a spring scale, which is equal to the normal force exerted by the spring scale on the person. Apparent weight could also be determined by the tension in a string from which a person is suspended. The apparent weight is the sum of the forces exerted on a person minus the force of gravity. If gravity is the only force exerted on a person (or object), that person’s apparent weight is zero, which is what is popularly called being weightless.

One way to feel weightless is when you are in free-fall (e.g. the first few seconds of a parachute jump from an airplane). One can think of being in orbit as continuously falling towards the centre of the Earth, but with an initial velocity in a direction such that you never actually collide with the Earth. The feeling of weightlessness will occur any time that the only force exerted on you is the force of gravity. If you are in a spacecraft in any orbit and the only force on the spacecraft is from gravity (i.e. no rockets or wings are exerting any forces), then everything in the spacecraft will have the same acceleration, since gravity is the only force acting on anything in the spacecraft, and it will appear that everything is just floating. To an outside observer, it would be clear that the spacecraft and its contents are all accelerating.

9.3.1.1Effects of Earth’s rotation

Earth’s rotation affects the apparent weight of objects near the Earth’s surface. Consider a person standing on a spring scale at the North pole (top free-body diagram in Figure 6). The only two forces exerted on the person are their weight, Fg\vec F_g, and the normal force, N\vec N, exerted by the spring scale. Since the person is not accelerating, the normal force and the weight have the same magnitude and opposite directions. The scale will thus read the actual weight of the person[23].

The apparent weight, given by the normal force, is different at the Earth's equator because a person's acceleration is non-zero as they rotate with the Earth.

Figure 6:The apparent weight, given by the normal force, is different at the Earth’s equator because a person’s acceleration is non-zero as they rotate with the Earth.

Consider, instead, a person standing on a spring scale at the equator (Figure 6). That person is accelerating because they are undergoing uniform circular motion as they rotate along with the Earth. Again, the only forces acting on the person are their weight and the normal force exerted by the scale. The sum of the forces must now be equal to the person’s mass, mm, times the radial acceleration, ara_r, that is necessary for them to follow the surface of the Earth as the Earth rotates about its axis. Newton’s Second Law allows us to find the magnitude of the normal force acting on the person:

F=FgN=mar=mv2RN=Fgmv2R=GMmR2mv2R=m(GMR2v2R)=m(gv2R)\begin{align*} \sum F &= F_g-N=ma_r=m\frac{v^2}{R}\\ \therefore N &= F_g - m\frac{v^2}{R}\\ &=G\frac{Mm}{R^2} - m\frac{v^2}{R}\\ &=m\left(G\frac{M}{R^2} - \frac{v^2}{R} \right)\\ &=m\left(g - \frac{v^2}{R} \right) \end{align*}

where MM is the mass of the Earth, RR is the radius of the Earth, and vv is the speed at the surface of the Earth due to the Earth’s rotation. In the last line, we used the result from Example 9.3 where we determined the value of gg in terms of the mass and radius of the Earth.

We see that the normal force is reduced compared to what it would be if the Earth were not rotating (v=0v=0) or if one is standing at one of the poles. Your apparent weight, which you can measure by standing on a spring scale, is thus smaller at the equator than it is at the poles. The quantity in parenthesis can be thought of as a modified or “effective” value of gg at the equator.

If you are circling the Earth a distance RR from the centre of the Earth at a constant speed vv, it is possible for your apparent weight to be zero. Imagine standing on a scale in an aircraft that is circling the Earth and measuring your apparent weight with the spring scale. As the speed of the aircraft increases, your apparent weight, NN, decreases according to the formula that we just found:

N=m(GMR2v2R)N=m\left(G\frac{M}{R^2} - \frac{v^2}{R} \right)

At a certain speed, vv, your apparent weight is zero and you feel weightless:

GMR2=v2Rv=GMR\begin{align*} G\frac{M}{R^2} &= \frac{v^2}{R}\\ \therefore v&= \sqrt{G\frac{M}{R} } \end{align*}

This speed corresponds to a centripetal acceleration that is exactly equal to that due to gravity. This makes sense, since gravity is the only force that is acting on you (the normal force is now zero), which is exactly what we call being in orbit.

Earth’s rotation has some interesting consequences for stationary objects. At any position on Earth that is not at the equator or the poles, the sum of the forces on any stationary object (meaning stationary relative to the Earth’s surface) cannot be zero. This is because the object must rotate along with the Earth, so the net force on the object must point toward the centre of the circle about which that location on Earth is rotating.

Take, for example, a plumb line, which consists of a mass hanging from a string. The two forces acting on the mass are gravity and tension. The force of gravity must point towards the centre of the Earth. We would expect that the force of tension, and therefore the string, would point directly away from the centre of the Earth. However, we find that if the plumb line is located at some angle θ from the equator (but not at the equator or poles), as in Figure 7, then the string will point slightly away from the centre of the Earth. In order for the mass to remain stationary relative to the ground, it must rotate along with the Earth (radius RR) around a circle of radius RcosθR\cos\theta. Thus, the tension from the string cannot point away from the centre of the Earth, because the net force must point towards the centre of the circle of radius RcosθR\cos\theta.

Away from the equator and poles, a plumb line (right) does not point towards the centre of the Earth, because the net force on the mass must provide the acceleration towards the centre of the circle (of radius $R\cos\theta$) about which the plumb line rotates due to the Earth's rotation. Note that the deflection of the plumb line is highly exaggerated.

Figure 7:Away from the equator and poles, a plumb line (right) does not point towards the centre of the Earth, because the net force on the mass must provide the acceleration towards the centre of the circle (of radius RcosθR\cos\theta) about which the plumb line rotates due to the Earth’s rotation. Note that the deflection of the plumb line is highly exaggerated.

9.3.2The gravitational field

The gravitational force exerted on a mass mm by a mass MM can be written as:

F(r)=GMmr2r^\vec F(\vec r) = -G\frac{Mm}{r^2}\hat r

if we define a coordinate system with the origin located at the centre of mass MM so that r\vec r is the position of mm relative to MM. We can define the “gravitational field”, g(r)\vec g(\vec r), at position, r\vec r, due to the presence of mass MM as the gravitational force per unit mass exerted by MM:

g(r)=F(r)m=GMr2r^\boxed{\vec g(\vec r) = \frac{\vec F(\vec r)}{m} = - \frac{GM}{r^2}\hat r}

The word “field” is just a mathematical term for a function that depends on position. Since g(r)\vec g(\vec r) is a vector, we would refer to it as a “vector field”.

Defining the gravitational field makes it easy to calculate the force of gravity from MM on any mass mm:

Fg=mg(r)\vec F_g = m\vec g(\vec r)

At the surface of the Earth, the magnitude of the gravitational field is given by:

g(R)=GMR2=9.81N/kgg(R_\oplus)=\frac{GM}{R_\oplus^2}=9.81 {\rm N/kg}

where RR_\oplus is the radius of the Earth. Of course, this also corresponds to the acceleration of objects in free-fall near the surface of the Earth, which we can find from Newton’s Second Law:

F=Fg=mamg(R)=maa=g(R)\begin{align*} \sum \vec F &= \vec F_g = m\vec a\\ m\vec g(R_\oplus)&= m\vec a\\ \therefore \vec a &= \vec g(R_\oplus) \end{align*}

but we see here why it more precise to refer to gg as the “magnitude of the gravitational field at the surface of the Earth” rather than “the acceleration due to Earth’s gravity”. It is also worth noting that the two are only equal if the gravitational mass (on the left of the equation in the second line) is the same as the inertial mass (on the right of the equation). The gravitational mass is the mass that appears in the gravitational force defined by Newton, whereas the inertial mass is the mass that appears with the acceleration in Newton’s Second Law.

Suppose that there are two large mass bodies, M1M_1 and M2M_2, and a smaller mass body, mm. We can calculate the net gravitational force on mm by summing the gravitational force vectors from M1M_1 and M2M_2 separately. If the gravitational fields from M1M_1 and M2M_2 are given by g1(r)\vec g_1(\vec r) and g2(r)\vec g_2(\vec r), respectively, then the total gravitational force on mm is given by:

F=mg1(r)+mg2(r)=m(g1(r)+g2(r))=mg(r)\begin{align*} \vec F &= m\vec g_1(\vec r) + m\vec g_2(\vec r)=m(\vec g_1(\vec r)+\vec g_2(\vec r))\\ &=m \vec g(\vec r) \end{align*}

where we have introduced the total gravitational field:

g(r)=g1(r)+g2(r)\vec g(\vec r) = \vec g_1(\vec r)+\vec g_2(\vec r)

In other words, if there are multiple bodies that result in a non-negligible force of gravity, we can calculate their gravitational fields independently and sum them together to define a net gravitational field, g(r)\vec g(\vec r), that models the net force of gravity from all of the bodies. The net gravitational force on a new body of mass mm' is then simply given by mg(r)m'\vec g(\vec r), and we do not need to add any more vectors together. For example, when calculating the motion of satellites that can be influenced by the force of gravity from both the Earth and the Moon, we simply need to calculate the net gravitational field from the Earth and Moon, and the motion of any satellite can then be modelled using that net gravitational field.

9.3.3Gauss’s Law

Newton’s Universal Theory of Gravity postulates that the force of gravity between two bodies decreases as the squared of the distance between those two bodies. Using the terminology of a field, we would say that the strength of the gravitational field from an object decreases as the inverse of the square of the distance to that object. This is an example of a what we generally call an “inverse-square law”. The electric force between two charges is also given by an inverse-square law, and we now understand that these forces behave as if they were “transmitted” by waves or particles.

If a force is given by an inverse-square law, then Gauss’s law gives a way to determine the strength of the field that is associated with that force. In the case of gravity, Gauss’s law states that:

g(r)dA=4πGMenc\oint \vec g(\vec r) \cdot d\vec A = 4\pi G M^{enc}

where the integral on the left is an integral over a “closed surface” that completely surrounds the mass for which we want to determine the gravitational field. To evaluate the integral, imagine taking a closed surface, SS, that surrounds the mass. The vector dAd\vec A is defined as the vector that goes with a small element of that surface and points outwards from the closed surface. The magnitude of the vector is equal to the area of that small surface (dAdA), as illustrated in Figure 9. You can then take the scalar product of dAd\vec A with the gravitational field, g(r)\vec g(\vec r), at that point on the surface. If you sum all of those scalar products, you get the value of the integral on the left. Gauss’s law states that the value of that integral is equal 4πG4 \pi G times the total mass that is enclosed by the surface.

We will go into more detail about Gauss’s law when we cover electromagnetism, but it is worth seeing how to use it in a simple scenario. Figure 9 shows a spherical body of mass MM and radius RR for which we would like to determine the value of the gravitational field at a distance rr from the centre of the body.

Example of a spherical Gaussian surface, $S$, of radius $r$ centred about a body of mass $M$ and radius $R$. An element of the surface, $d\vec A$ is also shown along with the gravitational field, $\vec g(\vec r)$, at that point.

Figure 9:Example of a spherical Gaussian surface, SS, of radius rr centred about a body of mass MM and radius RR. An element of the surface, dAd\vec A is also shown along with the gravitational field, g(r)\vec g(\vec r), at that point.

To do so, we draw a “Gaussian surface”, SS, that is a sphere with a radius rr, and centred about the body. At any point on the surface, the area element vector dAd\vec A points away from the centre of the spherical surface. The gravitational field vector, g(r)\vec g(\vec r) will always point towards the centre of the spherical surface, as illustrated. Furthermore, by symmetry, the magnitude of g(r)\vec g(\vec r) is constant along the whole Gaussian surface. Thus, the scalar product g(r)dA=g(r)dA\vec g(\vec r) \cdot d\vec A=-g(r)dA everywhere along the surface (it is negative because the two vectors are anti-parallel). The integral is thus given by:

g(r)dA=g(r)dA\oint \vec g(\vec r) \cdot d\vec A = -g(r)\oint dA

where we factored g(r)g(r) out of the integral, since the magnitude of g(r)\vec g(\vec r) is constant for all of the area elements dAdA on the sphere. Remember that an integral is a sum. The integral dA\oint dA thus means “sum all of the area elements dAdA over the entire surface SS”. Thus, the integral is the total area of the spherical surface SS[24]:

g(r)dA=g(r)dA=g(r)(4πr2)\oint \vec g(\vec r) \cdot d\vec A = -g(r)\oint dA =-g(r)(4\pi r^2)

Inserting this into Gauss’s law, we find:

g(r)dA=4πGMencg(r)(4πr2)=4πGMencg(r)=GMr2\begin{align*} \oint \vec g(\vec r) \cdot d\vec A &= 4\pi G M^{enc}\\ -g(r)(4\pi r^2) &= 4\pi G M^{enc}\\ \therefore g(r) &= - \frac{GM}{r^2} \end{align*}

where Menc=MM^{enc}=M is the total mass enclosed by the Gaussian surface (in this case, the entire mass MM is enclosed). This is of course the result that we expected and obtained earlier from Newton’s formulation. Note that Gauss’s law is only easy to use if the system is highly symmetric (e.g. spherically symmetric), and that it does not give the direction of the field vector, which must be obtained from symmetry arguments.

We can also use Gauss’s law to determine the gravitational field inside of the body of mass MM and radius RR. This is illustrated in Figure 11, which shows a spherical Gaussian surface of radius rr that is inside of the body of mass MM.

Example of a spherical Gaussian surface, $S$, of radius $r$ centred inside a body of mass $M$ and radius $R$.

Figure 11:Example of a spherical Gaussian surface, SS, of radius rr centred inside a body of mass MM and radius RR.

The gravitational field inside of the body of mass MM is also symmetric and constant in magnitude across the whole surface, %? so that the integral is the same as before:

g(r)dA=g(r)(4πr2)\oint \vec g(\vec r) \cdot d\vec A=-g(r)(4\pi r^2)

However, in order to use Gauss’s law, we need to determine the mass of the body that is enclosed within the spherical surface, which will be less than MM. If we assume that the mass density, ρ, of the object is constant (the body is made of a uniform material), then the density is simply the mass of the object over its volume:

ρ=M43πR3\rho = \frac{M}{\frac{4}{3}\pi R^3}

The amount of mass enclosed by the spherical surface of radius rr is the density multiplied by the volume of a sphere of radius rr:

Menc=ρ43πr3=Mr3R3M^{enc} = \rho \frac{4}{3}\pi r^3 = M\frac{r^3}{R^3}

Applying Gauss’s law, we can now find the magnitude of the gravitational field inside of the spherical body at a distance rr from the centre:

g(r)dA=4πGMencg(r)(4πr2)=4πGMr3R3g(r)=GMR3r\begin{align*} \oint \vec g(\vec r) \cdot d\vec A &= 4\pi G M^{enc}\\ -g(r)(4\pi r^2) &= 4\pi G M\frac{r^3}{R^3}\\ \therefore g(r) &= - \frac{G M}{R^3}r \end{align*}

And we find that, inside a uniform spherical body of mass MM, the gravitational field increases linearly with radius as one moves out from the centre. At the centre of the body, the gravitational field is zero.

9.4Gravitational potential energy

Consider a large spherical body of mass MM with a coordinate system whose origin coincides with the centre of the spherical body (for example, the large body could be the Earth). The force, F(r)\vec F(\vec r) on a body of mass mm (for example, a satellite), located at a position r\vec r is then given by:

F(r)=GMmr2r^=GMmr3r\vec F(\vec r) = - G\frac{Mm}{r^2}\hat r=- G\frac{Mm}{r^3}\vec r

where in the second equality, we use the fact that the unit vector in the direction of r\vec r is simply the vector r\vec r divided by its magnitude. We can write the force out in Cartesian coordinates:

r=xx^+yy^+zz^r=x2+y2+z2=(x2+y2+z2)12F(x,y,z)=GMm(x2+y2+z2)32(xx^+yy^+zz^)\begin{align*} \vec r &= x\hat x + y \hat y + z\hat z\\ r &= \sqrt{x^2+y^2+z^2} =(x^2+y^2+z^2)^\frac{1}{2} \\ \therefore \vec F(x,y,z) &= - G\frac{Mm}{(x^2+y^2+z^2)^\frac{3}{2} }(x\hat x + y \hat y + z\hat z) \end{align*}

Mathematically, this is equivalent to the force that we considered in Example 8.2 of Chapter 8.1, which we showed was a conservative force. The force of gravity in Newton’s theory is thus a conservative force, for which we can determine a potential energy function.

In order to determine the gravitational potential energy function for the mass mm in the presence of a mass MM, we calculate the work done by the force of gravity on the mass mm over a path where the integral for work will be “easy” to evaluate, namely a straight line. Figure 12 shows such a path in the radial direction, rr, over which it will be easy to calculate the work done by the force of gravity from mass MM when mass mm moves from being a distance rAr_A to a distance rBr_B from the centre of mass MM.

Calculating the work done on a mass $m$ by the force of gravity exerted by mass $M$ when mass $m$ moves from a distance $r_A$ to a distance $r_B$ from the centre of mass $M$.

Figure 12:Calculating the work done on a mass mm by the force of gravity exerted by mass MM when mass mm moves from a distance rAr_A to a distance rBr_B from the centre of mass MM.

The work done by the force of gravity on mm in going from rAr_A to rBr_B is given by:

W=rArBF(r)dr=rArB(GMmr2r^)dr=rArBGMmr2dr=[GMmr]rArB=GMmrBGMmrA\begin{align*} W &= \int_{r_A}^{r_B}\vec F(r) \cdot d\vec r = \int_{r_A}^{r_B} \left(- G\frac{Mm}{r^2}\hat r \right)\cdot d\vec r =\int_{r_A}^{r_B} - G\frac{Mm}{r^2}dr\\ &=\left[G\frac{Mm}{r} \right]_{r_A}^{r_B} =G\frac{Mm}{r_B} - G\frac{Mm}{r_A} \end{align*}

The difference in potential energy in going from position AA to position BB is given by the negative of the work done by the force:

ΔU=U(rB)U(rA)=W=GMmrAGMmrB\Delta U = U(r_B) - U(r_A) = -W = G\frac{Mm}{r_A} - G\frac{Mm}{r_B}

By inspection, we can identify the potential energy function for gravity:

U(r)=GMmr+C\boxed{U(r) = -G\frac{Mm}{r} + C}

which is determined only up to a constant, CC.

A particularly useful choice of constant is C=0C=0. This corresponds to choosing the potential energy to be zero only when rr goes to infinity. That is, the potential energy of mass mm is zero only when it is infinitely far away from mass MM. The choice of constant CC corresponds to the (arbitrary) value of the potential energy when mass mm is infinitely far from mass MM. When mass mm is not infinitely far away, it has negative potential energy (if C=0C=0). This is not a problem! Remember, the only thing that is meaningful is a difference in potential energy, so the specific value of the potential energy has no meaning. The kinetic energy of an object, on the other hand, has to be positive.

Recall that if there are no other forces acting on an object, that object will move in such a way to reduce its potential energy. If the object of mass mm is located at some distance rr from the object of mass MM, the force of gravity will attract mm so that rr decreases. As rr decreases in magnitude, the potential energy becomes more negative (larger in magnitude, but further away from zero), and the potential energy of mm will indeed decrease as it accelerates due to the force of gravity.

9.4.1Mechanical energy with gravity

Unless noted otherwise, we will continue our discussion of gravitational potential energy with the particular choice of constant C=0C=0:

U(r)=GMmr\boxed{U(r) = -G\frac{Mm}{r}}

Furthermore, we will assume that MM is a large body, such as the Earth, which we can consider as fixed, and focus our discussion on describing the motion of mass mm (e.g. a satellite). If MM is much bigger than mm, they will both experience a force of gravity from each other of the same magnitude (Newton’s Third Law), but because MM is so much larger, its acceleration will be much smaller (Newton’s Second Law). Thus, it is a good approximation to assume that MM is stationary and that only mm moves when M>>mM>>m.

We can define the total mechanical energy of mass mm when it has a speed vv (relative to MM) and is located at a distance rr from the centre of mass MM:

E=U+K=GMmr+12mv2E = U + K = -G\frac{Mm}{r}+\frac{1}{2}mv^2

where the kinetic energy term is always positive. If gravity is the only force exerted on mass mm, then the mechanical energy, EE, as defined above, will be a constant. The mechanical energy of an object can give us insight into the possible motion of the object.

Imagine launching a rocket straight upwards from the surface of the Earth; once all of the fuel has burnt up, the rocket’s mechanical energy becomes constant as the rocket engine stops doing work on the rocket. As soon as the engine stops providing thrust, the rocket will start to slow down as the force of gravity attracts the rocket back to Earth. If the rocket is going fast enough, it will be able to completely escape the Earth’s gravitational pull and travel to infinity (we assume that there are no other planets or the Sun, just the Earth exists!). If, on the other hand, the rocket’s speed is too low, it will eventually stop and fall back to Earth. This is the same thing that happens to you when you try to jump vertically. If you could jump hard enough, you would be able to escape the Earth’s gravitational pull!

In terms of mechanical energy, we can ask ourselves if the mechanical energy of the rocket is large enough to escape the Earth’s gravitational pull. Specifically, we can ask ourselves what the value of the rocket’s kinetic energy would be when it reaches infinity. The kinetic energy of the rocket is given by:

K=EUK = E - U

If the rocket is infinitely far from the Earth, then its potential energy is zero, and the kinetic energy is equal to EE.

If the mechanical energy, EE, is negative, it is not possible for the rocket to ever make it to infinity because its kinetic energy would have to be negative. In other words, if the mechanical energy is negative, then the object of mass mm can never escape the gravitational pull of object MM. We say that mm is “gravitationally bound” to MM.

If the mechanical energy, EE, is exactly zero, then the object’s kinetic energy will become zero just as it reaches infinity. In other words, it will just barely be able to escape the gravitational pull from mass MM. The condition for this to happen is:

E=0K=U12mv2=GMmrvesc=2GMr\begin{align*} E &= 0\\ K & = -U\\ \frac{1}{2}mv^2 &= G\frac{Mm}{r}\\ \therefore v_{esc} &= \sqrt{\frac{2GM}{r}} \end{align*}

which we can interpret as a condition for the speed of the rocket. If at some distance rr from MM, the rocket has the speed given by the condition above, then it will have enough kinetic energy to escape the gravitational pull of MM. We call this speed the “escape velocity”.

Finally, if the mechanical energy is greater than zero, then the rocket will have enough energy to escape the gravitational pull of MM and have a non-zero speed when it reaches infinity.

9.4.1.1Types of orbits

The mechanical energy of a body of mass mm determines whether it is gravitationally bound to (i.e. cannot escape) the body of mass MM. The path (orbit) that mm will take depends on its velocity with respect to MM. Clearly, if the velocity of mm is directed at the centre of MM, then mm will just collide with MM. In all other cases, the orbit that mm will take depends on the mechanical energy of mm as well as the speed of mm at the point of closest approach to MM (see Figure 13). The velocity of mm at the point of closest approach will always be perpendicular to the line joining the centres of mm and MM. The different possible orbits are:

  1. A circular orbit of radius RR (where RR is the distance of closest approach) if the mechanical energy is negative (i.e. it is bound) and the speed is exactly equal to the value necessary for the gravitational force to provide the required centripetal acceleration for uniform circular motion:
F=GMmR2=mv2Rvcirc=GMR\begin{align*} \sum F = G\frac{Mm}{R^2} &= m\frac{v^2}{R}\\\therefore v_{circ}=\sqrt{\frac{GM}{R}} \end{align*}
  1. An elliptical orbit if the mechanical energy is negative and the speed at the point of closest approach is different than that required for a circular orbit.
  2. A parabolic orbit if the mechanical energy is exactly zero.
  3. A hyperbolic orbit if the mechanical energy is bigger than zero.

The possible orbits are illustrated in Figure 13, and are curves in the family of “conic sections”, as they can be found by the intersection of a plane and a cone. All conic sections have at least one “focus” point (ellipses have two) that corresponds to the location of MM.

The different possible orbits of $m$ due to the gravitational force of $M$ depend on the mechanical energy, $E$, of $m$. The orbits are drawn in a frame of reference where $M$ is at rest.

Figure 13:The different possible orbits of mm due to the gravitational force of MM depend on the mechanical energy, EE, of mm. The orbits are drawn in a frame of reference where MM is at rest.

9.5Einstein’s Theory of General Relativity

Newton’s Universal Theory of Gravity was extremely successful at describing the motion of planets in the solar system, and allowed for high precision astronomy. For example, precision measurements of Uranus’s orbit showed that it appeared to be inconsistent with Newton’s theory, unless the gravitational influence of another planet was included in the model. This led to the discovery of the planet Neptune.

However, some issues with Newton’s theory were uncovered. The orbit of Mercury was shown to be different than what Newton’s theory could describe, but searches for another planet (Vulcan) were unsuccessful. In addition, Albert Einstein’s theory of Special Relativity, published in 1905, was found to be incompatible with Newton’s theory of gravity. One of the consequences of Special Relativity is that nothing can propagate faster than the speed of light. Newton’s Universal Theory of Gravity implies that the gravitational force is transmitted instantaneously. In Newton’s theory, if the Sun suddenly disappeared, Earth would immediately “fall out” of its orbit, and we would immediately know that the Sun has disappeared. This would violate Special Relativity because there cannot be a mechanism that would allow us to know that the Sun has disappeared faster than it would take light to propagate from the Sun. In other words, for the 8min8 {\rm min} that are required for light to travel from the Sun to the Earth, we cannot know that the Sun has disappeared: only when we literally see the Sun disappear would the Earth be “allowed” to fall out of its orbit.

Einstein’s Theory of General Relativity is a theory developed by Einstein in order to describe gravity in a way that is consistent with Special Relativity and the propagation of light. Einstein was famous for his “thought experiments,” which allow us to think about some of the implications of a theory, even if the experiments would be very difficult to carry out in practice. One such thought experiment is to consider what someone would observe in an accelerating frame of reference.

Consider an observer in an elevator, as illustrated in Figure 14. If the elevator is stationary at the surface of the Earth (left panel), and the observer is standing on a scale, they could measure their weight, mgmg, on the scale. The two forces on the observer are their weight and the normal force, which would be equal in magnitude since the observer is not accelerating. The normal force, read out by the scale, would thus correspond to their weight. To be more precise, the normal force would be equal to mGgm_Gg, where mGm_G is the gravitational mass of the observer (that mass which is related to the force of gravity experienced by a mass).

If the elevator was instead placed in empty space, and the elevator was accelerating upwards with an acceleration of gg (right panel), the observer would still be able to measure their weight by stepping on the scale. The only force on the observer is the normal force from the scale, which must be equal to its mass times their acceleration N=mIa=mIgN=m_Ia=m_Ig, since the observer is accelerating with the elevator. In this case, it is the inertial mass of the observer, mIm_I, that comes into play, so the normal force read on the scale is mIgm_Ig.

Left: A person standing on a scale in an elevator at rest at the surface of the Earth. Right: A person in an elevator that is accelerating in empty space with the same acceleration as that due to gravity at the Earth's surface. The curvature of the light beam is exaggerated.

Figure 14:Left: A person standing on a scale in an elevator at rest at the surface of the Earth. Right: A person in an elevator that is accelerating in empty space with the same acceleration as that due to gravity at the Earth’s surface. The curvature of the light beam is exaggerated.

Einstein postulated that it would be impossible for the observer to distinguish whether they are at rest on the surface of the Earth, or in empty space accelerating with an acceleration of gg. In other words, he postulated that the inertial and gravitational masses are exactly equivalent. This is what is called the “Equivalence Principle”.

This simple statement has dramatic implications. Special Relativity requires that light will travel in a straight line in empty space. If a beam of light enters and then exits the elevator, the observer on Earth and the one accelerating in empty space must observe the same thing, since they cannot distinguish between being on Earth or accelerating in space. The observer in space, who is accelerating, will observe that the beam of light bends as it crosses the elevator (the beam travels in a straight line as observed in an inertial reference frame, so the person in the accelerating elevator would see it follow a parabolic path). The observer on Earth must thus observe the same thing, namely that light will follow a curved path in the presence of a gravitational field.

But...light must travel in a straight line in empty space. That means that if the path of a beam of light is curved near Earth, it must be because space itself is curved in the presence of a gravitational field! In other words, Einstein’s Theory of General Relativity describes how the presence of mass (or energy) results in a curvature of space (and time).

Imagine a ladybug on the side of a basketball. If the ladybug starts moving in what it believes to be a straight line, it will actually move in a curved path along the surface of the ball, as in Figure 15. This is like the curved path of light that we observe. If we didn’t know the ball was there, we would just think that the bug was moving along a curved path. In the same way, if an observer is not aware of the curvature of spacetime, it appears that light follows a curved path.

Left: A ladybug perceives itself to be moving in a straight line. Center: The basketball is curved, so the ladybugs follow curved paths. Right: What an observer would see if they didn't know the basketball was there.

Figure 15:Left: A ladybug perceives itself to be moving in a straight line. Center: The basketball is curved, so the ladybugs follow curved paths. Right: What an observer would see if they didn’t know the basketball was there.

Now imagine there’s a second ladybug. Both bugs start at the middle of the ball and start moving towards the top of the ball in what they think is a straight line (as shown in the center panel of Figure 15). When the bugs start moving, they are parallel to each other, so if the ball was not curved, the ladybugs would never meet. However, because it is curved, the ladybugs will eventually cross paths. If you were not aware that the ball was there, you would have to conclude that there was some force attracting the bugs to each other, just like if you were unaware that spacetime was curved, you would conclude that massive bodies moving towards each other are attracted by a gravitational force.

Objects that are moving in a gravitational field are actually following Newton’s First Law (they are moving at constant velocity in a straight line and no force is exerted on them). It is strange and unexpected, but high precision measurements confirm that this correctly describes everything that we have measured!

Einstein’s theory was able to describe the orbit of Mercury, and the prediction that gravity leads to light following a curved path was confirmed by Eddington within five years of Einstein’s theory being published. Another implication of the theory is that time goes by slower in the presence of a gravitational field. Clocks on Earth run slower than clocks in orbit (where the gravitational field is weaker). This effect is taken into account when using GPS to determine your position on Earth, since this is based on comparing the time that it takes signals to arrive to your position on Earth from different satellites. This is also somewhat reasonably well described in the movie “Interstallar”, where time is seen to pass much slower for a set of astronauts in the vicinity of a black hole, where the gravitational field is strong.

9.6Summary

Kepler was the first to synthesize a large amount of data to quantitatively describe gravity with his three laws:

  1. The path of a planet around the Sun is described by an ellipse with the Sun at once of its foci.
  2. Planets move in such a way that the area swept by a line connecting the planet and the Sun in a given period of time is constant, independent of the location of the planet.
  3. The ratio between the orbital periods, TT, squared of two planets is equal to the ratio of the semi-major axes, ss, of their orbits cubed:
(T1T2)2=(s1s2)3\left(\frac{T_1}{T_2}\right)^2=\left(\frac{s_1}{s_2}\right)^3

Newton described the attractive force of gravity exerted between two bodies of mass M1M_1 and M2M_2 (which must be point masses) as:

F12=GM1M2r2r^21\vec F_{12}=-G\frac{M_1M_2}{r^2}\hat r_{21}

where F12\vec F_{12} is the force on body 1 from body 2, rr is the distance between the two bodies, and r21\vec r_{21} is the vector from body 2 to body 1. The motion of a body under the influence of only this force will satisfy all of Kepler’s Laws, if the body is gravitationally bound.

The gravitational field, g(r)\vec g(\vec r), from a body of mass MM, is defined as the gravitational force that another body would experience per unit mass:

g(r)=F(r)m=GMr2r^\vec g(\vec r)=\frac{\vec F(\vec r)}{m}=-G\frac{M}{r^2}\hat r

The field can be used to determine the corresponding gravitational force, Fg\vec F_g, that a body of mass mm would experience if located at a position r\vec r relative to the body of mass MM:

Fg=mg(r)F_g = m \vec g(\vec r)

When describing the motion of objects near the surface of the Earth, it is thus more precise to refer to g=9.8N/kgg=9.8 {\rm N/kg} as the magnitude of the Earth’s gravitational field at the surface of the Earth, then to refer to g=9.8m/s2g=9.8 {\rm m/s^2} as the acceleration due to Earth’s gravity. The two are only equal if gravitational mass (the mm in the above equation) and inertial mass (the mm in Newton’s Second Law) are the same.

Gauss’s law, which applies to all inverse-square force laws, can be used to determine the magnitude of the gravitational field from a body of mass MM, even if it is not a point mass:

g(r)dA=4πGMenc\oint \vec g(\vec r) \cdot d\vec A = 4\pi G M^{enc}

Since the force described by Newton’s theory is conservative, we can define a potential energy function. The gravitational potential energy of a mass mm located a distance rr away from a mass MM is:

U(r)=GMmr+CU(r) = -G\frac{Mm}{r} + C

A convenient choice of the constant is C=0C=0, as this corresponds to the gravitational potential energy being equal to zero when mm is infinitely far away from MM.

The mechanical energy, EE, of an object of mass mm that is located at a distance rr from an object of mass MM, if gravity is the only conservative force exerted on mm, is given by:

E=K+U=12mv2GMmrE = K + U = \frac{1}{2}mv^2 - G\frac{Mm}{r}

where we have explicitly chosen C=0C=0, and vv is the speed of mm relative MM (considered to be at rest). Furthermore, if no non-conservative forces do work on the body of mass mm, the mechanical energy, EE, is constant.

If the mechanical energy of mm is negative, it is gravitationally bound to MM. Depending on the mechanical energy of mm and its velocity at the point of closest approach to MM, the orbit of mm will be described by one of four conic sections (circle, ellipse, parabola, hyperbola).

Einstein’s Theory of General Relativity describes gravitation as the bending of space and time caused by the presence of mass and energy. In Einstein’s theory, objects follow straight (inertial) paths and do not feel a force of gravity. The curvature of space is what results in their apparent motion not being a straight line. Einstein’s theory is based on the Equivalence Principle (inertial and gravitational mass are exactly equal) and the properties of how light propagates according to the Theory of Special Relativity. \end{chapterSummary}

9.7Thinking about the material

9.8Sample problems and solutions

9.8.1Problems

9.8.2Solutions