Curvature is tensorial: Difference between revisions

Latest revision as of 17:36, 6 January 2012

This article gives the statement, and possibly proof, that a map constructed in a certain way is tensorial.

Statement

Let $\nabla$ be a connection on a vector bundle $E$ over a differential manifold $M$ . The Riemann curvature tensor of $\nabla$ is given as a map $\Gamma (T M) \otimes \Gamma (T M) \otimes \Gamma (E) \to \Gamma (E)$ defined by:

$R (X, Y) Z = \nabla_{X} \nabla_{Y} Z - \nabla_{Y} \nabla_{X} Z - \nabla_{[X, Y]} Z$

We claim that $R$ is a tensorial map in each of the variables $X, Y, Z$ .
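As a sanity check of the claim, it may help to look at one special case. The setup below is an assumption made purely for illustration and is not part of the original statement: $E$ is taken to be the trivial line bundle $M \times \mathbb{R}$, sections are functions $s$, and the connection is $\nabla_X s = X s + A(X) s$ for a fixed 1-form $A$. Under these assumptions the derivative terms cancel and $R(X, Y)$ acts as multiplication by a function:

```latex
% Assumption for illustration: E = M \times \mathbb{R} is the trivial line bundle,
% sections are functions s, and \nabla_X s = X s + A(X) s for a fixed 1-form A.
\begin{align*}
\nabla_X \nabla_Y s &= XYs + (X A(Y))\, s + A(Y)\, Xs + A(X)\, Ys + A(X) A(Y)\, s \\
\nabla_X \nabla_Y s - \nabla_Y \nabla_X s &= [X,Y]\, s + \bigl(X A(Y) - Y A(X)\bigr) s \\
R(X,Y)\, s &= \bigl(X A(Y) - Y A(X) - A([X,Y])\bigr) s = dA(X,Y)\, s
\end{align*}
```

Since $dA$ is a 2-form, the right side is visibly $C^{\infty}$ -linear in $X$, $Y$ and $s$ in this special case. The general proof has to produce the same cancellations without a formula for $\nabla$.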

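Related facts

* Curvature is antisymmetric in first two variables
* Curvature is antisymmetric in last two variables
* Curvature is symmetric in the pairs of first and last two variables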

Facts used

Fact no.	Name	Statement with symbols
1	Any connection is $C^{\infty}$ -linear in its subscript argument	$\nabla_{f A} = f \nabla_{A}$ for any $C^{\infty}$ -function $f$ and vector field $A$ .
2	The Leibniz-like axiom that is part of the definition of a connection	For a function $f$ , a vector field $A$ , a section $B$ of $E$ , and a connection $\nabla$ , we have $\nabla_{A} (f B) = (A f) B + f \nabla_{A} B$ .
3	Corollary of Leibniz rule for Lie bracket (in turn follows from Leibniz rule for derivations)	For a function $f$ and vector fields $X, Y$ : $f [X, Y] = [f X, Y] + (Y f) X$ and $f [X, Y] = [X, f Y] - (X f) Y$ .
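The two identities in Fact (3) can be spot-checked numerically. The sketch below is only an illustration under stated assumptions: it works on $\mathbb{R}^2$, approximates derivatives by central differences, and uses sample fields `X`, `Y` and a sample function `f` that are hypothetical choices, not taken from the article.

```python
import math

H = 1e-5  # step size for central finite differences (an arbitrary choice)

def directional(X, g, p):
    """Approximate (X g)(p) for a scalar function g and a vector field X on R^2."""
    vx, vy = X(p)
    x, y = p
    dgdx = (g((x + H, y)) - g((x - H, y))) / (2 * H)
    dgdy = (g((x, y + H)) - g((x, y - H))) / (2 * H)
    return vx * dgdx + vy * dgdy

def bracket(X, Y):
    """Lie bracket [X, Y], computed componentwise as X(Y^i) - Y(X^i)."""
    def result(p):
        return tuple(
            directional(X, lambda q, i=i: Y(q)[i], p)
            - directional(Y, lambda q, i=i: X(q)[i], p)
            for i in range(2)
        )
    return result

def scale(g, X):
    """The vector field g X for a scalar function g."""
    return lambda p: tuple(g(p) * c for c in X(p))

# Hypothetical sample data, chosen only for illustration.
X = lambda p: (p[1], p[0] ** 2)
Y = lambda p: (math.sin(p[0]), p[0] * p[1])
f = lambda p: math.exp(p[0]) + p[1] ** 2

def fact3_residuals(p):
    """Largest componentwise error in each of the two identities of Fact (3) at p."""
    lhs = scale(f, bracket(X, Y))(p)          # f [X, Y]
    Yf = directional(Y, f, p)
    Xf = directional(X, f, p)
    b1 = bracket(scale(f, X), Y)(p)           # [f X, Y]
    b2 = bracket(X, scale(f, Y))(p)           # [X, f Y]
    r1 = max(abs(lhs[i] - (b1[i] + Yf * X(p)[i])) for i in range(2))
    r2 = max(abs(lhs[i] - (b2[i] - Xf * Y(p)[i])) for i in range(2))
    return r1, r2
```

Calling `fact3_residuals((0.3, -0.7))` should return two values close to zero, up to finite-difference error. A numerical check of this kind is evidence for the identities at sample points, not a proof.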

Proof

To prove tensoriality in a variable, it suffices to show $C^{\infty}$ -linearity in that variable. This is because $C^{\infty}$ -linearity allows multiplication by bump functions, which, together with a local frame, shows that the value of the map at a point depends only on the values of its arguments at that point.
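The reduction from tensoriality to $C^{\infty}$ -linearity can be spelled out as follows. The notation is assumed for this sketch and does not appear in the original: $T$ is any $C^{\infty}$ -linear map on sections of $E$, $e_1, \dots, e_k$ is a local frame of $E$ near a point $p$, $\varphi$ is a smooth bump function with $\varphi(p) = 1$ supported in the domain of the frame, and $s = \sum_i s^i e_i$ near $p$.

```latex
% Assumed notation (not in the original): T is a C^\infty-linear map on sections,
% e_1, ..., e_k a local frame near p, \varphi a bump function with \varphi(p) = 1
% supported in the frame's domain, and s = \sum_i s^i e_i near p.
\begin{align*}
\varphi^2 s &= \sum_{i=1}^{k} (\varphi s^i)\,(\varphi e_i)
  && \text{(both factors extend by zero to all of } M\text{)} \\
\varphi^2\, T(s) = T(\varphi^2 s) &= \sum_{i=1}^{k} (\varphi s^i)\, T(\varphi e_i)
  && \text{(}C^\infty\text{-linearity)} \\
T(s)(p) &= \sum_{i=1}^{k} s^i(p)\, T(\varphi e_i)(p)
  && \text{(evaluate at } p\text{, using } \varphi(p) = 1\text{)}
\end{align*}
```

The last line shows that $T(s)(p)$ depends only on the components $s^i(p)$, that is, only on $s(p)$. Applying this to each of the three arguments of $R$ in turn gives tensoriality once $C^{\infty}$ -linearity is known.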

The proofs for $X$ and $Y$ are analogous, and rely on manipulation of the Lie bracket $[f X, Y]$ and the property of a connection being $C^{\infty}$ -linear in the subscript vector field. These proofs do not involve any explicit use of $Z$ . The proof for $Z$ relies simply on repeated application of the product rule (Fact (2)), and the fact that $X Y - Y X = [X, Y]$ as operators on functions.

Tensoriality in the first variable

Given: $f : M \to \mathbb{R}$ is a $C^{\infty}$ -function.

To prove: $R (f X, Y) = f R (X, Y)$ , or more explicitly, $\nabla_{f X} \nabla_{Y} - \nabla_{Y} \nabla_{f X} - \nabla_{[f X, Y]} = f (\nabla_{X} \nabla_{Y} - \nabla_{Y} \nabla_{X} - \nabla_{[X, Y]})$ .

We start out with the left side:

$\nabla_{f X} \nabla_{Y} - \nabla_{Y} \nabla_{f X} - \nabla_{[f X, Y]}$

Each step below is obtained from the previous one via some manipulation explained alongside.

Step no.	Current status of left side	Facts/properties used	Specific rewrites
1	$f \nabla_{X} \nabla_{Y} - \nabla_{Y} (f \nabla_{X}) - \nabla_{[f X, Y]}$	Fact (1): $\nabla$ is $C^{\infty}$ -linear in its subscript argument.	$\nabla_{f X} \to f \nabla_{X}$
2	$f \nabla_{X} \nabla_{Y} - (Y f) \nabla_{X} - f \nabla_{Y} \nabla_{X} - \nabla_{[f X, Y]}$	Fact (2)	$\nabla_{Y} (f \nabla_{X}) \to (Y f) \nabla_{X} + f \nabla_{Y} \nabla_{X}$ . To understand this more clearly, imagine an input $Z$ to the whole expression, so that the rewrite becomes $\nabla_{Y} (f \nabla_{X} (Z)) \to (Y f) \nabla_{X} (Z) + f \nabla_{Y} \nabla_{X} (Z)$ . In the notation of Fact (2), $A = Y$ , $f = f$ , and $B = \nabla_{X} (Z)$ .
3	$f (\nabla_{X} \nabla_{Y} - \nabla_{Y} \nabla_{X}) - \nabla_{(Y f) X} - \nabla_{[f X, Y]}$	Fact (1)	$(Y f) \nabla_{X} \to \nabla_{(Y f) X}$
4	$f (\nabla_{X} \nabla_{Y} - \nabla_{Y} \nabla_{X}) - \nabla_{(Y f) X + [f X, Y]}$	$\nabla$ is additive in its subscript argument	$\nabla_{(Y f) X} + \nabla_{[f X, Y]} = \nabla_{(Y f) X + [f X, Y]}$
5	$f (\nabla_{X} \nabla_{Y} - \nabla_{Y} \nabla_{X}) - \nabla_{f [X, Y]}$	Fact (3)	$[f X, Y] + (Y f) X \to f [X, Y]$
6	$f (\nabla_{X} \nabla_{Y} - \nabla_{Y} \nabla_{X} - \nabla_{[X, Y]})$	Fact (1)	$\nabla_{f [X, Y]} \to f \nabla_{[X, Y]}$

Tensoriality in the second variable

Given: $f : M \to \mathbb{R}$ is a $C^{\infty}$ -function.

To prove: $R (X, f Y) = f R (X, Y)$ , or more explicitly, $\nabla_{X} \nabla_{f Y} - \nabla_{f Y} \nabla_{X} - \nabla_{[X, f Y]} = f (\nabla_{X} \nabla_{Y} - \nabla_{Y} \nabla_{X} - \nabla_{[X, Y]})$ .

We start out with the left side:

$\nabla_{X} \nabla_{f Y} - \nabla_{f Y} \nabla_{X} - \nabla_{[X, f Y]}$

Each step below is obtained from the previous one via some manipulation explained alongside.

Step no.	Current status of left side	Facts/properties used	Specific rewrites
1	$\nabla_{X} (f \nabla_{Y}) - f \nabla_{Y} \nabla_{X} - \nabla_{[X, f Y]}$	Fact (1)	$\nabla_{f Y} \to f \nabla_{Y}$ .
2	$(X f) \nabla_{Y} + f (\nabla_{X} \nabla_{Y}) - f \nabla_{Y} \nabla_{X} - \nabla_{[X, f Y]}$	Fact (2)	$\nabla_{X} (f \nabla_{Y}) \to (X f) \nabla_{Y} + f (\nabla_{X} \nabla_{Y})$ . To make this more concrete, imagine an input $Z$ . Then, the rewrite becomes $\nabla_{X} (f \nabla_{Y} (Z)) \to (X f) \nabla_{Y} (Z) + f (\nabla_{X} \nabla_{Y} (Z))$ . This comes from setting $A = X$ , $f = f$ , $B = \nabla_{Y} Z$ in Fact (2).
3	$f (\nabla_{X} \nabla_{Y} - \nabla_{Y} \nabla_{X}) - \nabla_{[X, f Y]} + \nabla_{(X f) Y}$	Fact (1)	$(X f) \nabla_{Y} \to \nabla_{(X f) Y}$
4	$f (\nabla_{X} \nabla_{Y} - \nabla_{Y} \nabla_{X}) - \nabla_{[X, f Y] - (X f) Y}$	$\nabla$ is additive in its subscript argument.	$\nabla_{[X, f Y]} - \nabla_{(X f) Y} \to \nabla_{[X, f Y] - (X f) Y}$ .
5	$f (\nabla_{X} \nabla_{Y} - \nabla_{Y} \nabla_{X}) - \nabla_{f [X, Y]}$	Fact (3)	$[X, f Y] - (X f) Y \to f [X, Y]$
6	$f (\nabla_{X} \nabla_{Y} - \nabla_{Y} \nabla_{X} - \nabla_{[X, Y]})$	Fact (1)	$\nabla_{f [X, Y]} \to f \nabla_{[X, Y]}$

Tensoriality in the third variable

Given: A $C^{\infty}$ -function $f : M \to \mathbb{R}$ .

To prove: $R (X, Y) (f Z) = f R (X, Y) Z$ . More explicitly, $\nabla_{X} \nabla_{Y} (f Z) - \nabla_{Y} \nabla_{X} (f Z) - \nabla_{[X, Y]} (f Z) = f (\nabla_{X} \nabla_{Y} - \nabla_{Y} \nabla_{X} - \nabla_{[X, Y]}) Z$ .

We start out with the left side:

$\nabla_{X} \nabla_{Y} (f Z) - \nabla_{Y} \nabla_{X} (f Z) - \nabla_{[X, Y]} (f Z)$

Each step below is obtained from the previous one via some manipulation explained alongside.

Step no.	Current status of left side	Facts/properties used	Specific rewrites
1	$\nabla_{X} ((Y f) Z + f \nabla_{Y} Z) - \nabla_{Y} ((X f) Z + f \nabla_{X} Z) - f \nabla_{[X, Y]} Z - ([X, Y] f) Z$	Fact (2)	$\nabla_{Y} (f Z) \to (Y f) Z + f \nabla_{Y} Z$ , $\nabla_{X} (f Z) \to (X f) Z + f \nabla_{X} Z$ , and $\nabla_{[X, Y]} (f Z) \to ([X, Y] f) Z + f \nabla_{[X, Y]} Z$
2	$(X Y f) Z + (Y f) \nabla_{X} Z + (X f) \nabla_{Y} Z + f \nabla_{X} \nabla_{Y} Z - (Y X f) Z - (X f) \nabla_{Y} Z - (Y f) \nabla_{X} Z - f \nabla_{Y} \nabla_{X} Z - f \nabla_{[X, Y]} Z - ([X, Y] f) Z$	Fact (2), together with additivity of $\nabla_{X}$ and $\nabla_{Y}$ in the section argument	$\nabla_{X} ((Y f) Z) \to (X Y f) Z + (Y f) \nabla_{X} Z$ , etc.
3	$f (\nabla_{X} \nabla_{Y} - \nabla_{Y} \nabla_{X} - \nabla_{[X, Y]}) Z + ((X Y - Y X - [X, Y]) f) Z$	--	The terms $(Y f) \nabla_{X} Z$ and $(X f) \nabla_{Y} Z$ each occur with both signs and cancel; the remaining six terms are regrouped.
4	$f (\nabla_{X} \nabla_{Y} - \nabla_{Y} \nabla_{X} - \nabla_{[X, Y]}) Z$	Definition of the Lie bracket: $[X, Y] = X Y - Y X$ as operators on functions	$((X Y - Y X - [X, Y]) f) Z \to 0$
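The three tensoriality statements can also be spot-checked numerically. The sketch below rests on assumptions that are not in the article: the bundle is the trivial rank-2 bundle over $\mathbb{R}^2$, the connection is $\nabla_X s = X s + (X^1 A_1 + X^2 A_2) s$ for hypothetical matrix-valued functions `A1`, `A2`, derivatives are approximated by central differences, and the fields `X`, `Y`, the section `Z` and the function `f` are arbitrary sample choices.

```python
import math

H = 1e-4  # step size for central finite differences (an arbitrary choice)

def d(X, g, p):
    """Approximate the derivative at p of a tuple-valued function g along X."""
    vx, vy = X(p)
    x, y = p
    gxp, gxm = g((x + H, y)), g((x - H, y))
    gyp, gym = g((x, y + H)), g((x, y - H))
    return tuple(
        vx * (a - b) / (2 * H) + vy * (c - e) / (2 * H)
        for a, b, c, e in zip(gxp, gxm, gyp, gym)
    )

# Hypothetical connection coefficients (2x2 matrices depending on the point).
def A1(p):
    x, y = p
    return ((x * y, math.sin(y)), (1.0, x))

def A2(p):
    x, y = p
    return ((math.cos(x), y), (x + y, x * y * y))

def nabla(X, s):
    """Covariant derivative (nabla_X s)(p) = (X s)(p) + (X^1 A1 + X^2 A2)(p) s(p)."""
    def result(p):
        ds, sp = d(X, s, p), s(p)
        vx, vy = X(p)
        a1, a2 = A1(p), A2(p)
        return tuple(
            ds[i] + sum((vx * a1[i][j] + vy * a2[i][j]) * sp[j] for j in range(2))
            for i in range(2)
        )
    return result

def bracket(X, Y):
    """Lie bracket [X, Y], componentwise X(Y^i) - Y(X^i)."""
    return lambda p: tuple(a - b for a, b in zip(d(X, Y, p), d(Y, X, p)))

def curvature(X, Y, s):
    """R(X, Y) s = nabla_X nabla_Y s - nabla_Y nabla_X s - nabla_[X, Y] s."""
    def result(p):
        a = nabla(X, nabla(Y, s))(p)
        b = nabla(Y, nabla(X, s))(p)
        c = nabla(bracket(X, Y), s)(p)
        return tuple(a[i] - b[i] - c[i] for i in range(2))
    return result

def scale(g, V):
    """Multiply a vector field or section V by the scalar function g."""
    return lambda p: tuple(g(p) * c for c in V(p))

# Hypothetical sample data, chosen only for illustration.
X = lambda p: (p[1], p[0] ** 2)
Y = lambda p: (math.sin(p[0]), p[0] * p[1])
Z = lambda p: (math.exp(p[1]), p[0] + p[1])
f = lambda p: 1.0 + p[0] ** 2 + math.sin(p[1])

def tensoriality_residuals(p):
    """Errors in R(fX,Y)Z = R(X,fY)Z = R(X,Y)(fZ) = f R(X,Y)Z at the point p."""
    target = tuple(f(p) * c for c in curvature(X, Y, Z)(p))
    candidates = (
        curvature(scale(f, X), Y, Z)(p),
        curvature(X, scale(f, Y), Z)(p),
        curvature(X, Y, scale(f, Z))(p),
    )
    return tuple(max(abs(u - v) for u, v in zip(c, target)) for c in candidates)
```

Calling `tensoriality_residuals((0.3, -0.7))` should give three values that are small relative to the finite-difference error. This only tests the identities at sample points for one connection; the tables above remain the actual proof.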

@@ Line 2: / Line 2: @@
 ==Statement==
-Let <math>\nabla</math> be a [[connection]] on a [[vector bundle]] <math>E</math> over a [[differential manifold]] <math>M</math>. The '''Riemann curvature tensor''' of <math>\nabla</math> is given as a map <math>\Gamma(TM) \otimes \Gamma(TM) \otimes \Gamma(E) \to \Gamma(E)</math> defined by:
+Let <math>\nabla</math> be a [[connection]] on a [[vector bundle]] <math>E</math> over a [[differential manifold]] <math>M</math>. The [[fact about::Riemann curvature tensor]] of <math>\nabla</math> is given as a map <math>\Gamma(TM) \otimes \Gamma(TM) \otimes \Gamma(E) \to \Gamma(E)</math> defined by:
 <math>R(X,Y)Z = \nabla_X\nabla_Y Z - \nabla_Y \nabla_X Z - \nabla_{[X,Y]} Z</math>
-We claim that <math>R</math> is a [[tensorial map]] in each of the variables <math>X,Y,Z</math>.
+We claim that <math>R</math> is a [[fact about::tensorial map]] in each of the variables <math>X,Y,Z</math>.
-==Facts used==
+==Related facts==
-* [[Leibniz rule for derivations]]: This states that for a vector field <math>X</math> and functions <math>f,g</math>, we have:
+* [[Curvature is antisymmetric in first two variables]]
+* [[Curvature is antisymmetric in last two variables]]
+* [[Curvature is symmetric in the pairs of first and last two variables]]
-<math>X(fg) = (Xf)(g) + f(Xg)</math>
+==Facts used==
-* [[Corollary of Leibniz rule for Lie bracket]]: This states that for a function <math>f</math> and vector fields <math>X,Y</math>:
-<math>f[X,Y] = [fX,Y] + (Yf)X</math>
+{| class="sortable" border="1"
+! Fact no. !! Name !! Statement with symbols
-<math>f[X,Y] = [X,fY] - (Xf)Y</math>
+|-
+| 1 || Any connection is <math>C^\infty</math>-linear in its subscript argument || <math>\nabla_{fA} = f\nabla_A</math> for any <math>C^\infty</math>-function <math>f</math> and vector field <math>A</math>.
-* The Leibniz rule axiom that's part of the definition of a [[connection]], namely:
+|-
+| 2 || The Leibniz-like axiom that is part of the definition of a connection || For a function <math>f</math> and vector fields <math>A,B</math>, and a connection <math>\nabla</math>, we have <math>\nabla_A(fB) = (Af)(B) + f\nabla_A(B)</math>
-<math>\nabla_X(fZ) = (Xf)(Z) + f\nabla_X(Z)</math>
+|-
+| 3 || [[uses::Corollary of Leibniz rule for Lie bracket]] (in turn follows from [[uses::Leibniz rule for derivations]]|| For a function <math>f</math> and vector fields <math>X,Y</math>:
+<br><math>\! f[X,Y] = [fX,Y] + (Yf)X</math><br><math>\! f[X,Y] = [X,fY] - (Xf)Y</math>
+|}
 ==Proof==
@@ Line 32: / Line 35: @@
 ===Tensoriality in the first variable===
-Let <math>f:M \to \R</math> be a scalar function. We will show that:
+'''Given''': <math>f:M \to \R</math> is a <math>C^\infty</math>-function.
-<math>R(fX,Y) = f R(X,Y)</math>
+'''To prove''': <math>\! R(fX,Y) = f R(X,Y)</math>, or more explicitly, <math>\! \nabla_{fX}\nabla_Y - \nabla_Y \nabla_{fX} - \nabla_{[fX,Y]} = f(\nabla_X\nabla_Y - \nabla_Y\nabla_X - \nabla_{[X,Y]}</math>
 We start out with the left side:
@@ Line 40: / Line 43: @@
 <math>\nabla_{fX}\nabla_Y - \nabla_Y \nabla_{fX} - \nabla_{[fX,Y]}</math>
-Now by the definition of a [[connection]], <math>\nabla</math> is <math>C^\infty</math>-linear in its subscript argument. Thus, the above expression can be written as:
+Each step below is obtained from the previous one via some manipulation explained along side.
-<math>f\nabla_X\nabla_Y - \nabla_Y (f \nabla_X) - \nabla_{[fX,Y]}</math>
-Now applying the Leibniz rule for connections, we get:
-<math>f\nabla_X\nabla_Y - (Yf)\nabla_X - f \nabla_Y\nabla_X - \nabla_{[fX,Y]}</math>
-We can rewrite <math>(Yf)\nabla_X = \nabla_{(Yf)X}</math> and we then get:
-<math>f(\nabla_X\nabla_Y - \nabla_Y\nabla_X) - \nabla_{(Yf)X + [fX,Y]}</math>
+{| class="sortable" border="1"
+! Step no. !! Current status of left side !! Facts/properties used !! Specific rewrites
-By the corollary stated above, we have:
+|-
+| 1 || <math>f\nabla_X\nabla_Y - \nabla_Y (f \nabla_X) - \nabla_{[fX,Y]}</math> || Fact (1): <math>\nabla</math> is <math>C^\infty</math>-linear in its subscript argument. || <math>\nabla_{fX} \to f\nabla_X</math>
-<math>(Yf)X + [fX,Y] = f[X,Y]</math>
+|-
+| 2 || <math>f\nabla_X\nabla_Y - (Yf)\nabla_X - f \nabla_Y\nabla_X - \nabla_{[fX,Y]}</math> || Fact (2) || <math>\nabla_Y(f \nabla_X) \to (Yf)\nabla_X + f\nabla_Y\nabla_X</math>. To understand this more clearly imagine an input <math>Z</math> to the whole expression, so that the rewrite becomes <math>\nabla_Y(f \nabla_X(Z)) \to (Yf)\nabla_X(Z) + f\nabla_Y\nabla_X(Z)</math>. In the notation of fact (3), <math>A = Y</math>, <math>f = f</math>, and <math>B = \nabla_X(Z)</math>.
-which, substituted back, gives:
+|-
+| 3 || <math>f(\nabla_X\nabla_Y - \nabla_Y\nabla_X) - \nabla_{(Yf)X} - \nabla_{[fX,Y]}</math> || Fact (1) || <math>(Yf)\nabla_X \to \nabla_{(Yf)X}</math>
-<math>f(\nabla_X\nabla_Y - \nabla_Y\nabla_X - \nabla_{[X,Y]})</math>
+|-
+| 4 || <math>f(\nabla_X\nabla_Y - \nabla_Y\nabla_X) - \nabla_{(Yf)X + [fX,Y]}</math> || <math>\nabla</math> is additive in its subscript argument || <math>\nabla_{(Yf)X} + \nabla_{[fX,Y]} = \nabla_{(Yf)X + [fX,Y]}</math>
+|-
+| 5 || <math>f(\nabla_X\nabla_Y - \nabla_Y\nabla_X) - \nabla_{f[X,Y]}</math> || Fact (3) || <math>[fX,Y] + (Yf)X \to f[X,Y]</math>
+|-
+| 6 || <math>f(\nabla_X\nabla_Y - \nabla_Y\nabla_X - \nabla_{[X,Y]})</math> || Fact (1) || <math>\nabla_{f[X,Y]} \to f\nabla_{[X,Y]}</math>
+|}
 ===Tensoriality in the second variable===
-Let <math>f:M \to \R</math> be a scalar function. We will show that:
+'''Given''': <math>f:M \to \R</math> is a <math>C^\infty</math>-function.
-<math>R(X,fY) = f R(X,Y)</math>
+'''To prove''': <math>\! R(X,fY) = f R(X,Y)</math>, or more explicitly, <math>\nabla_X\nabla_{fY} - \nabla_{fY}\nabla_X - \nabla_{[X,fY]} = f (\nabla_X\nabla_Y - \nabla_Y\nabla_X - \nabla_{[X,Y]}</math>.
 We start out with the left side:
@@ Line 70: / Line 71: @@
 <math>\nabla_X\nabla_{fY} - \nabla_{fY}\nabla_X - \nabla_{[X,fY]}</math>
-Applying the Leibniz rule and the property of a connection being <math>C^\infty</math> in its subscript variable yields:
+Each step below is obtained from the previous one via some manipulation explained along side.
-<math>(Xf)\nabla_Y + f(\nabla_X\nabla_Y - \nabla_Y\nabla_X) - \nabla_{[X,fY]}</math>
+{| class="sortable" border="1"
+! Step no. !! Current status of left side !! Facts/properties used !! Specific rewrites
-which simplifies to:
+|-
+| 1 || <math>\nabla_X(f\nabla_Y) - f\nabla_Y\nabla_X - \nabla_{[X,fY]}</math> || Fact (1) || <math>\nabla_{fY} \to f\nabla_Y</math>.
-<math>f(\nabla_X\nabla_y - \nabla_Y\nabla_X) - \nabla_{[X,fY] - (Xf)Y}</math>
+|-
+| 2 || <math>(Xf)\nabla_Y + f(\nabla_X\nabla_Y) - f\nabla_Y\nabla_X - \nabla_{[X,fY]}</math> || Fact (2) || <math>\nabla_X(f\nabla_Y) \to (Xf)\nabla_Y + f(\nabla_X\nabla_Y)</math>. To make this more concrete, imagine an input <math>Z</math>. Then, the rewrite becomes <math>\nabla_X(f\nabla_Y(Z)) \to (Xf)\nabla_Y(X) + f(\nabla_X\nabla_Y(Z))</math>. This comes setting <math>A = X</math>, <math>f = f</math>, <math>B = \nabla_YZ</math> in Fact (3).
-We now use the corollary stated above:
+|-
+| 3 || <math>f(\nabla_X\nabla_Y - \nabla_Y\nabla_X) - \nabla_{[X,fY]} + \nabla_{(Xf)Y}</math> || Fact (1) || <math>(Xf)\nabla_Y \to \nabla_{(Xf)Y}</math>
-<math>f[X,Y] = [X,fY] - (Xf)Y</math>
+|-
+| 4 || <math>f(\nabla_X\nabla_Y - \nabla_Y\nabla_X) - \nabla_{[X,fY] - (Xf)Y}</math> || <math>\nabla</math> is additive in its subscript argument. || <math>\nabla_{[X,fY]} - \nabla_{(Xf)Y} \to \nabla_{[X,fY] - (Xf)Y}</math>.
-substituting this gives:
+|-
+| 5 || <math>f(\nabla_X\nabla_Y - \nabla_Y\nabla_X) - \nabla_{f[X,Y]}</math> || Fact (3) || <math>[X,fY] - (Xf)Y \to f[X,Y]</math>
-<math>f (\nabla_X\nabla_Y - \nabla_Y\nabla_X - \nabla_{[X,Y]}</math>
+|-
+| 6 || <math>f(\nabla_X\nabla_Y - \nabla_Y\nabla_X - \nabla_{[X,Y]})</math> || Fact (1) || <math>\nabla_{f[X,Y]} \to f\nabla_{[X,Y]}</math>
-which is <math>f R(X,Y)</math>
+|}
 ===Tensoriality in the third variable===
-Let <math>f:  M \to \R</math> be a scalar function. We will show that:
+'''Given''': A <math>C^\infty</math>-function <math>f:M \to \R</math>.
-<math>R(X,Y) (fZ) = f R(X,Y) Z</math>
+'''To prove''': <math>\! R(X,Y) (fZ) = f R(X,Y) Z</math>. More explicitly, <math>\! \nabla_X\nabla_Y(fZ) - \nabla_Y\nabla_X(fZ) - \nabla_{[X,Y]}(fZ)  = f (\nabla_X\nabla_Y - \nabla_Y\nabla_X - \nabla_{[X,Y]})Z + ((XY - YX - [X,Y])f)Z</math>.
 We start out with the left side:
@@ Line 98: / Line 99: @@
 <math>\nabla_X\nabla_Y(fZ) - \nabla_Y\nabla_X(fZ) - \nabla_{[X,Y]}(fZ)</math>
-Now we apply the Leibniz rule for connnections on each term:
+Each step below is obtained from the previous one via some manipulation explained along side.
-<math>\nabla_X( (Yf)(Z) + f \nabla_YZ) - \nabla_Y ((Xf)Z + f \nabla_XZ) - f \nabla_{[X,Y]}Z - ([X,Y]f) Z</math>
-We again apply the Leibniz rule to the first two term groups:
-<math>(XYf)(Z) + (Yf) \nabla_XZ + (Xf) \nabla_YZ + f \nabla_X\nabla_YZ - (YXf)Z - (Xf) \nabla_YZ - (Yf) \nabla_XZ -f \nabla_Y\nabla_XZ - f \nabla_{[X,Y]}Z - ([X,Y]f) Z</math>
-After cancellations we are left with the following six terms:
-<math>f (\nabla_X\nabla_Y - \nabla_Y\nabla_X - \nabla_{[X,Y]})Z + ((XY - YX - [X,Y])f)Z</math>
-But since <math>[X,Y] = XY - YX</math>, the last three terms vanish, and we are left with:
-<math>f R(X,Y)Z</math>
+{| class="sortable" border="1"
+! Step no. !! Current status of left side !! Facts/properties used !! Specific rewrites
+|-
+| 1 || <math>\! \nabla_X( (Yf)(Z) + f \nabla_YZ) - \nabla_Y ((Xf)Z + f \nabla_XZ) - f \nabla_{[X,Y]}Z - ([X,Y]f) Z</math> || Fact (2) || <math>\nabla_Y(fZ) \to (Yf)(Z) + f\nabla_YZ</math> and <math>\nabla_X(fZ) \to (Xf)Z + f\nabla_XZ</math>
+|-
+| 2 || <math>\! (XYf)(Z) + (Yf) \nabla_XZ + (Xf) \nabla_YZ + f \nabla_X\nabla_YZ - (YXf)Z - (Xf) \nabla_YZ - (Yf) \nabla_XZ -f \nabla_Y\nabla_XZ - f \nabla_{[X,Y]}Z - ([X,Y]f) Z</math> || Fact (2) || <math>\nabla_X((Yf)Z) \to X((Yf)Z) + (Yf)\nabla_XZ</math>, etc.
+|-
+| 3 || <math>f (\nabla_X\nabla_Y - \nabla_Y\nabla_X - \nabla_{[X,Y]})Z + ((XY - YX - [X,Y])f)Z</math> || -- || cancellations
+|-
+| 4 || <math>f (\nabla_X\nabla_Y - \nabla_Y\nabla_X - \nabla_{[X,Y]})Z + ((XY - YX - [X,Y])f)Z</math>|| use <math>[X,Y] = XY - YX</math>, definition || cancellation
+|}