Curvature Tensors

This post discusses the concept of curvature in manifolds and various tensor fields which measure this curvature.

Curvature Tensors
An image generated by Google Gemini using the prompt "generate an artistic depiction of the Riemann curvature tensor".

Consider the following figure, which depicts the parallel transport of vectors along a closed loop on a sphere:

Figure 1: The parallel transport of a vector pointing north (located on the bottom right) around a closed loop on the surface of a sphere. The vector proceeds in a parallel manner toward the north pole, then toward the equator, then back toward its starting point. The vector was originally pointing northward, but now is pointing westward.

If the vector on the bottom right is parallel transported toward the north pole, then toward the equator, then along the equator, returning to its original point, the direction of the vector changes: it was originally pointing "northward", but now is pointing "westward". This change in direction is due to the curvature of the sphere.

However, when the same process is performed on a cylinder, the resultant vector is the same as the original vector, as in the following figure:

Figure 2: The parallel transport of a vector along a closed "rectangular" loop on the surface of a cylinder. The transported vector arrives precisely where it started and there is no displacement.

This is because a cylinder is an example of a "flat" manifold; it has no intrinsic curvature, although, when embedded in 3-dimensional space, it can be considered to have extrinsic curvature.

In fact, this observation can be used to define and measure curvature on arbitrary manifolds. The idea is as follows: parallel transport vectors along closed, "rectangular" loops, as in the following figure:

Figure 3: The parallel transport of a vector along a closed loop resembling a rectangle. The solid arrow represents the original vector, and the dashed arrow represents the result of parallel transporting the original vector around the loop. The difference is an indication of the degree of curvature at the given point from which the vectors emanate. Each black dot represents a point (the "corners" of the rectangle), and each thin line indicates a curve between the points. Together, these four curves comprise a "rectangular" closed loop.

Measure the displacement vector as the difference between the original vector and the transported vector. Next, contract this rectangle, so that it becomes arbitrarily small. Define curvature as an appropriate limit involving the displacement vector.

To make this precise, we need the notion of a parametric family of curves, i.e. we need to be able to trace the outline of each rectangle using curves.

Definition (Parametric Family of Curves). A parametric family of curves is continuous map Γ:J×IM, where I,JR are intervals and M is a smooth manifold. The primary curves are the curves Γs:IM, one for each sJ, defined as Γs(t)=Γ(s,t) for every tI. The transverse curves are the curves Γ(t):JM, one for each tI, defined as Γ(t)(s)=Γ(s,t) for every sJ.

A parametric family of curves thus represents a sort of "grid" of "horizontal" and "vertical" curves that can be overlayed on a manifold. They can be used to "trace" the "rectangles" described above.

Just as we can define vector fields along curves, we can likewise define vector fields along families of curves.

Definition (Vector Field along a Family of Curves). A vector field along a parametric family of curves Γ:J×IM is a continuous map V:J×ITM such that V(s,t)TΓ(s,t)M for all (s,t)J×I.

We denote the velocity vectors of the primary curves as

sΓ(s,t)=(Γs)(t)TΓ(s,t)M

and we denote the velocity vectors of the transverse curves as

tΓ(s,t)=(Γ(t))(s)TΓ(s,t)M.

Thus, the velocity vectors are examples of vector fields along Γ.

We likewise use the following notation:

  • Vs:ITM,Vs(t)=V(s,t),
  • V(t):JTM,Vt(s)=V(s,t).

Likewise, just as we can define the covariant derivative of a vector field along a curve, we can also define the covariant derivative of a vector field along a parametric family of curves using only a slightly modified definition.

Definition (Covariant Derivative along a Family of Curves). Given a family of curves Γ:J×IM, the primary covariant derivative of a vector field V:J×ITM along Γ is defined as

DtV(s,t)=DtVs(t),

where DtVs(t) is the covariant derivative along the primary curve Γs, and the transverse covariant derivative of a V along Γ is defined as

DsV(s,t)=DsV(t)(s),

where DsV(t)(s) is the covariant derivative along the transverse curve Γ(t).

Thus, given smooth coordinates (xi) in a neighborhood of Γ(s,t), the coordinate expressions are as follows:

DtV(s,t)=DtVs(t)=Dt(Vsi(t)i|Γs(t))=V˙si(t)i|Γs(t)+Vsi(t)Dti|Γs(t)=Vit(s,t)i|Γ(s,t)+Vi(s,t)Dti|Γ(s,t).

DsV(s,t)=DsV(t)(s)=Ds((V(t))ii|Γ(t)(s))=((V(t))i)(s)i|Γ(t)(s)+(V(t))i(s)Dsi|Γ(t)(s)=Vis(s,t)i|Γ(s,t)+Vi(s,t)Dsi|Γ(s,t).

Likewise, we can also define parallel transport for families of curves.

Definition (Parallel Transport along a Family of Curves). Given a Riemannian (or semi-Riemannian) manifold M and a smooth family of curves Γ:I×IM defined on an open interval IR such that 0I, the parallel transport along the primary curve Γs from time t1 to time t2 is defined as

Ps1,t1s1,t2=Pt1,t2Γs1:TΓ(s1,t1)MTΓ(s2,t2)M.

Likewise, the parallel transport along the transverse curve Γ(t) from time s1 to time s2 is defined as

Ps1,t1s2,t1=Ps1,s2Γ(t1):TΓ(s1,t1)MTΓ(s2,t1)M.

We are now prepared to define the curvature measure proposed in the introduction.

Definition (Displacement). Given a Riemannian (or semi-Riemannian) manifold M and a smooth family of curves Γ:I×IM defined on an open interval IR such that 0I, the displacement along Γ at the point p=Γ(0,0) is defined for each zTpM as follows:

Dp(z)=limδ,ε0Pδ,00,0Pδ,εδ,0P0,εδ,εP0,00,ε(z)zδε.

The displacement is the limiting ratio of the displacement vector resulting from parallel transport around the "rectangular" loop and the area δε of the rectangular interval (0,δ)×(0,ε) in the domain of Γ.

The limit limx,yaf(x,y)=L is interpreted as follows: for all ε>0, there exists a δ>0 such that f(x,y)L<ε whenever both xa<δ and ya<δ. This is equivalent to limxalimyaf(x,y).

This process is depicted in the following figure:

Figure 4: The parallel transport around a closed, "rectangular" loop using the family of curves Γ.

Let Z denote the vector field along Γ defined by first parallel transporting the tangent vector Z(0,0)=z "vertically" (according to the figure) along the primary curve Γ0 and then, for each t, parallel transporting "horizontally" (according to the figure) along the transverse curve Γ(t). Then, Z represents the process of transporting a vector z around the closed, rectangular loop as depicted in the figure. Z can thus be defined as follows:

Z(s,t)=P0,ts,t(P0,00,t(z)).

This process is analogous to a derivative, and indeed bears a resemblance to the formula for the covariant derivative of a vector field V along a curve γ in terms of parallel transport:

DtV(t0)=limt10Pt,0γV(t0+t1)V(t0)t1.

This formula then implies that

DsDtZ(0,0)=limδ0Pδ,00,0(DtZ(δ,0))DtZ(0,0)δ.

Furthermore, the same formula implies that

DtZ(δ,0)=limε0Pδ,εδ,0(Z(δ,ε))Z(δ,0)ε

and

DtZ(0,0)=limε0P0,ε0,0(Z(0,ε))Z(0,0)ε.

Now, recall that limits in finite-dimensional normed vector spaces are computed component-wise, meaning that limhaf(h)=(limhafj(h))ej for any basis (ej). It thus follows that, for any linear map A:VW between finite-dimensional normed vector spaces V and W and any basis (ej) for V,

A(limhaf(h))=A((limhafj(h))ej)=(limhafj(h))A(ej)=limha(fj(h)A(ej))=limhaA(fj(h)ej)=limhaA(f(h)).

Thus, limits "commute" with the application of linear maps. Since parallel transport is a linear map, it follows that

Pδ,00,0(DtZ(δ,0))=Pδ,00,0(limε0Pδ,εδ,0(Z(δ,ε))Z(δ,0)ε)=limε0Pδ,00,0(Pδ,εδ,0(Z(δ,ε))Z(δ,0)ε)=limε0Pδ,00,0(Pδ,εδ,0(Z(δ,ε)))Pδ,00,0(Z(δ,0))ε.

Then, substituting, we obtain

DsDtZ(0,0)=limδ0[limε0Pδ,00,0(Pδ,εδ,0(Z(δ,ε)))Pδ,00,0(Z(δ,0))ε][limε0P0,ε0,0(Z(0,ε))Z(0,0)ε]δ=limδ0limε0[Pδ,00,0(Pδ,εδ,0(Z(δ,ε)))Pδ,00,0(Z(δ,0))εP0,ε0,0(Z(0,ε))Z(0,0)ε]δ=limδ0limε0[Pδ,00,0(Pδ,εδ,0(Z(δ,ε)))Pδ,00,0(Z(δ,0))εP0,ε0,0(Z(0,ε))Z(0,0)ε]δ=limδ,ε0Pδ,00,0(Pδ,εδ,0(Z(δ,ε)))Pδ,00,0(Z(δ,0))P0,ε0,0(Z(0,ε))+Z(0,0)δε.

Next, recall that the parallel transport map is a linear isomorphism, whose inverse is parallel transport along the inverse curve (i.e. curve going "in the opposite direction" with the upper and lower indices swapped). Also note that Z is parallel along Γ0 and along Γ(t) for every t by construction, and hence the result of parallel transport along such curves to an index is the same as the value of Z at the respective index. It then follows that

z=Z(0,0)=P0,ε0,0(P0,00,ε(Z(0,0)))=P0,ε0,0(Z(0,ε)),

and likewise

z=Z(0,0)=P0,εδ,ε(Pδ,ε0,ε(Z(0,0)))=P0,εδ,ε(Z(0,ε)).

It also follows that

Z(δ,ε)=P0,εδ,ε(Z(0,ε))=P0,εδ,ε(P0,00,ε(z)).

If we substitute this into the limit obtained previously, we obtain

DsDtZ(0,0)=limδ,ε0Pδ,00,0Pδ,εδ,0P0,εδ,εP0,00,ε(z)zδε.

Thus, we have obtained an equivalent expression for displacement:

Dp(z)=DsDtZ(0,0),

where Γ is an appropriately defined family of curves, Z is an appropriately defined vector field along Γ, and Z(0,0)=z.

Definition (Displacement [Alternative]). Given a Riemannian (or semi-Riemannian) manifold M and a smooth family of curves Γ:I×IM defined on an open interval IR such that 0I, the displacement along Γ at the point p=Γ(0,0) is defined for every tangent vector zTpM as

Dp(z)=DsDtZ(0,0),

where Z is the vector field along Γ defined as

Z(s,t)=P0,ts,t(P0,00,t(z)).

Although this is a useful and intuitive characterization of curvature, it is not yet complete since it is restricted to vector fields along families of curves. Instead, we seek a characterization of curvature which is applicable to arbitrary vector fields. The displacement operation indicates that curvature is somehow related to second covariant derivatives.

It is often instructive to consider the special case of the Euclidean connection ¯. Recall that the Euclidean connection can be expressed in terms of smooth coordinates as

¯XY=X(Yi)i.

We then compute the following in smooth coordinates (noting that ¯Xi=0 for any X in the Euclidean connnection):

¯X¯YZ=¯X(Y(Zi)i)=Y(Zi)¯Xi+X(Y(Zi))i(Product rule)=X(Y(Zi))i.

Likewise, transposing X and Y, it follows that

¯Y¯XZ=Y(X(Zi))i.

Again, it also follows that

¯[X,Y]Z=[X,Y](Zi)i=(XYYX)(Zi)i=X(Y(Zi))iY(X(Zi))i=¯X¯YZ¯Y¯XZ.

Thus, for the Euclidean connection,

¯X¯YZ¯Y¯XZ¯[X,Y]Z=0.

Therefore, for the Euclidean connection, it is not sufficient to consider ¯X¯YZ alone, but the additional terms in the expression above must also be considered, and, in particular, they obey the equation above.

We therefore generalize this formula and propose the following characterization of curvature for the Levi-Civita connection on arbitrary Riemannian (or semi-Riemannian) manifolds.

Definition (Riemann Curvature Endomorphism). For any Riemannian (or semi-Riemannian) manifold M and smooth vector fields X and Y on M, the (Riemann) curvature endomorphism determined by X and Y is the map R(X,Y):X(M)X(M), where R:X(M)×X(M)×X(M)X(M) defined for all ZX(M) as

R(X,Y)Z=XYZYXZ[X,Y]Z.

Thus, for the Euclidean connection, R(X,Y)Z=0 for all vector fields X,Y, and Z.

The following theorem holds (but its proof would require a detour, so it is omitted):

Theorem. A Riemannian (or semi-Riemannian) manifold is flat if and only if its curvature tensor vanishes identically.

Thus, the relation above holds for all flat manifolds (i.e. those locally isometric to Euclidean space). This verifies that the curvature endomorphism characterizes flatness, and likewise provides a concrete measure of the deviation from flatness for non-flat manifolds.

Next, we want to confirm that the Riemann curvature endomorphism is multilinear over C(M), which will allows us to derive a corresponding tensor field from this endomorphism.

First, note that

R(fX,Y)Z=fXYZYfXZ[fX,Y]Z=fXYZY(fXZ)(fXYY(fX))Z=fXYZfYXZYfXZ(fXYfYXYfX)Z=fXYZfYXZYfXZ(f[X,Y]YfX)Z=fXYZfYXZYfXZf[X,Y]Z+YfXZ=fXYZfYXZf[X,Y]Z=fR(X,Y)Z.

A similar calculation shows that R(X,fY)=fR(X,Y)Z. Finally, note that

R(X,Y)(fZ)=XY(fZ)YX(fZ)[X,Y](fZ)=X(fYZ+YfZ)Y(fXZ+XfZ)(f[X,Y]Z+[X,Y]fZ)=X(fYZ)+X(YfZ)Y(fXZ)Y(XfZ)f[X,Y]Z[X,Y]fZ=(fXYZ+XfYZ)+(YfXZ+XYfZ)(fYXZ+YfXZ)(XfYZ+YXfZ)f[X,Y]Z(XYfZYXfZ)=fXYZfYXZf[X,Y]Z=fR(X,Y)Z.

Tensors and Multilinear Maps

Tensor fields of type (k+1,l) can be characterized by maps that are multilinear over C(M), either with signature

X(M)××X(M)k+1 times×X(M)×X(M)l timesC(M)

or with signature

X(M)××X(M)k times×X(M)×X(M)l timesX(M).

In this section we will review these results, which are crucial for formulating a curvature tensor based on the curvature endomorphism.

Linear Maps

For any vector spaces V and W,

[V,W]L(W,V;R),

where [V,W] denotes the vector space of linear maps with signature VW and L(W,V;R) denotes the vector space of multilinear maps with signature W×VR.

This isomorphism is witnessed by the following map:

φ(A)(ω,v)=ω(Av).

This is indeed an isomorphism, since it has a trivial kernel. Given a basis (EiW) for W with corresponding dual basis (εWj), the inverse mapping can be defined explicitly:

φ1(L)(v)=L(εWi,v)EiW,

We then compute

φ(φ1(L))(ω,v)=ω(L(εWi,v)EiW)=L(εWi,v)ω(EiW)=ωiL(εWi,v)=L(ωiεWi,v)=L(ω,v).

Likewise, we also compute

φ1(φ(A))(v)=εWi(Av)EiW=[Av]iEiW=Av.

If φ1(L)=A, then it follows that

A(EjV)=φ1(L)(EjV)=L(εWi,EjV)EiW=LjiEiW,

where Lji are the components of the tensor, and likewise if φ(A)=L, then it follows that

Lji=L(εWi,EjV)=φ(A)(εWi,EjV)=εWi(A(EjV))=[A(EjV)]i.

Thus, if only A is known, the tensor components Lji can be computed in this manner.

Multilinear Maps

Moreover, for any vector spaces V1,,Vk and W,

L(V1,,Vk;W)L(W,V1,,Vk;R).

This isomorphism is witnessed by the following map:

φ(A)(ω,v1,,vk)=ω(A(v1,,vk)).

This is indeed an isomorphism, since it has a trivial kernel. Given a basis (EiW) for W with corresponding dual basis (εWi), the inverse mapping can be defined explicitly:

φ1(L)(v1,,vk)=L(εWi,v1,,vk)EiW.

Multilinear Maps on a Vector Space

This can be restricted to multilinear maps "on" a vector space V, i.e. either covariant maps

L(V,,Vk times;V)L(V,V,,Vk times;R)

as witnessed by the map

φ(A)(ω,v1,,vk)=ω(A(v1,,vk)),

or contravariant maps

L(V,,Vk times;V)L(V,,Vk+1 times;R)

as witnessed by the map

φ(A)(ω1,,ωk+1)=ωk+1(A(ω1,,ωk)),

or maps of mixed variance

L(V,,Vk times,V,,Vl times;V)L(V,,Vk+1 times,V,,Vl times;R)

as witnessed by the map

φ(A)(ω1,,ωk+1,v1,,vl)=ωk+1(A(ω1,,ωk,v1,,vl)).

This is indeed an isomorphism, since it has a trivial kernel. Given a basis (EiW) for W with corresponding dual basis (εWi), the inverse mapping can be defined explicitly:

φ1(L)(ω1,,ωk,v1,,vl)=L(ω1,,ωk,εWik+1,v1,,vl)Eik+1W.

If φ1(L)=A, then it follows that

A(εWi1,,εWik,Ej1,,Ejl)=φ1(L)(εWi1,,εWik,Ej1,,Ejl)=L(εWi1,,εWik+1,Ej1,,Ejl)Eik+1W=Lj1,,jli1,,ik+1Eik+1W,

and likewise if φ(A)=L, then it follows that

Lj1,,jli1,,ik+1=L(εWi1,,εWik+1,Ej1,,Ejl)=φ(A)(εWi1,,εWik+1,Ej1,,Ejl)=εWik+1(A(εWi1,,εWik,Ej1,,Ejl))Eik+1W=[A(εWi1,,εWik,Ej1,,Ejl)]ik+1Eik+1W.

Thus, if only A is known, the tensor components Lj1,,jki1,,ik+1 can be computed in this manner.

The Characterization Lemma

Given any tensor field F:MT(k,l)TM, smooth covector fields ω1,,ωk, and smooth vector fields X1,,Xl, it is possible to define a corresponding function F~(ω1,,ωk,X1,,Xl):UR on any open subset UM on which each of the covector and vector fields are defined, as follows:

F~(ω1,,ωk,X1,,Xl)(p)=Fp(ωp1,,ωpk,X1|p,,Xl|p).

It can be shown that this function is smooth. Thus, F~ represents a map with signature

F~:X(M)××X(M)k times×X(M)×X(M)l timesC(M).

This map is multilinear over C(M). Moreover, it can be shown that any multilinear map over C(M) with this signature induces a corresponding tensor field.

Then, given any multilinear map A with signature

A:X(M)××X(M)k times×X(M)×X(M)l timesX(M),

such a map induces a multilinear map φ(A) with signature

φ(A):X(M)××X(M)k+1 times×X(M)×X(M)l timesC(M)

defined as

φ(A)(ω1,,ωk+1,X1,,Xl)=ωk+1(A(ω1,,ωk,X1,,Xl)).

This map is a isomorphism since it has a trivial kernel. Given a coordinate frame (j) with corresponding dual frame (dxj), the inverse mapping can be defined explicitly:

φ1(L)(ω1,,ωk,X1,,Xl)=L(ω1,,ωk,dxik+1,j1,,jl)ik+1.

If φ1(L)=A, then it follows that

A(dxi1,,dxik,j1,,jl)=φ1(L)(dxi1,,dxik,j1,,jl)=L(dxi1,,dxik+1,j1,,jl)ik+1=Lj1,,jli1,,ik+1ik+1,

and if φ(A)=L, then it follows that

Lj1,,jli1,,ik+1=L(dxi1,,dxik+1,j1,,jl)=φ(A)(dxi1,,dxik+1,j1,,jl)=dxik+1(A(dxi1,,dxik,j1,,jl))=[A(dxi1,,dxik,j1,,jl)]ik+1.

Thus, if only A is known, the tensor components Lj1,,jli1,,ik+1 can be computed in this manner.

Thus, since such multilinear maps induce tensor fields, they are often conflated together with tensor fields, and referred to as "tensors" themselves.

The Curvature Tensor

The curvature map has signature

R:X(M)×X(M)×X(M)X(M).

As indicated in the previous section, there is a corresponding map

φ(R):X(M)×X(M)×X(M)×X(M)C(M)

defined as

φ(R)(ω,X,Y,Z)=ω(R(X,Y,Z)).

Thus, as indicated in the previous section, there exists a corresponding tensor field F=φ(R).

Note that we will adopt the convention that the final index is the contravariant index of the corresponding tensor.

Given a coordinate frame (j) with corresponding dual frame (dxi), an inverse mapping can be computed explicitly as

φ1(F)(X,Y,Z)=F(X,Y,Z,dxi)i.

Note that dxi appeared as the last argument, since we are adopting a different convention regarding the ordering of indices. We can now compute the components of this tensor F; if φ1(F)=R, then

R(i,j,k)=φ1(F)(i,j,k)=F(i,j,k,dxl)l=Fijkll,

and likewise if φ(R)=F, then

Fijkl=F(i,j,k,dxl)=φ(R)(i,j,k,dxl)=dxl(R(X,Y)Z)=[R(X,Y)Z]l.

Note that we use the same notation R to refer to the tensor F, so that

Rijkll=R(i,j,k),

or, equivalently,

Rijkl=dxl(R(i,j,k)).

Thus, this tensor can be written in terms of its components as

R=Rijkldxidxjdxkl.

We refer to this tensor field as the (1,3)-curvature tensor.

Next, we will compute the coordinate expression for this tensor. Recall that the coordinate expression for the covariant derivative is

XY=X(Yj)j+XiYjΓijkk.

Note that

jk=Γjkll,

ik=Γikll,

and

[i,j]k=0.

Then, we compute

R(i,j,k)=ijkjik[i,j]k=i(Γjkll)j(Γikll)=Γjklil+iΓjkll(Γikjl+jΓikll)=Γjklil+iΓjkllΓikjljΓikll=ΓjklΓilmm+iΓjkllΓikΓjlmmjΓikll=ΓjkmΓimll+iΓjkllΓikmΓjmlljΓikll(transposing dummy indices m and l)=(iΓjkljΓikl+ΓjkmΓimlΓikmΓjml)l.

It thus follows that

Rijkl=iΓjkljΓikl+ΓjkmΓimlΓikmΓjml.

It is often more convenient to work with a covariant tensor rather than a tensor of mixed variance. Thus, we define a map

Rm:X(M)×X(M)×X(M)×X(M)C(M)

called the Riemann curvature tensor, as

Rm(X,Y,Z,W)=R(X,Y,Z),Wg,

where ,g denotes the metric.

This map represents a (0,4) tensor field by the characterization lemma. This is obtained from the (1,3)-curvature tensor by lowering the final index, i.e.

Rijkl=glmRijkl.

This covariant tensor is thus expressed in coordinates as

Rijkl=glm(iΓjkmjΓikm+ΓjkpΓiplΓikpΓjpm),

where we have renamed index l to m and index m to p. Thus, this tensor has coordinate expression

Rijkldxidxjdxkdxl.

The Ricci Curvature Tensor

The Ricci curvature tensor Rc is the trace (contraction) of the Riemann curvature tensor on the first and last indices, and thus has components Rij defined as

Rij=gkmRkijm.

The Ricci curvature can be given a geometric interpretation in terms of sectional curvature.

The Scalar Curvature Function

The scalar curvature is a function (or a (0,0)-tensor) which is the contraction of the Ricci curvature tensor. It is thus defined as

S=gijRij=gijgkmRkijm.

The scalar curvature can likewise be given a geometric interpretation in terms of sectional curvature.

The Ricci and scalar curvature are useful since the Riemann curvature tensor is such a complicated construct. They summarize key information about curvature while being simpler to employ.