Orthogonal

The Dual Pythagorean Theorem [Extra]

Main page | Extra
Plus, Minus | The Dual Pythagorean Theorem | Geometry and Motion | Geometry and Waves | Riemannian Electromagnetism | Riemannian Thermodynamics | Riemannian General Relativity | Riemannian Quantum Mechanics | Glossary | The Clockwork Rocket excerpt | The Eternal Flame excerpt | The Arrows of Time excerpt | Videos
Orthogonal contents
Back to home page | Site Map | Side-bar Site Map

Dual Vectors

Dual Vectors

Suppose we have an n-dimensional vector space, V, and a linear function f from V to the real numbers, R.

The kernel of f, written ker f, is the subspace of V on which f is zero. The dimension of ker f must be at least n–1 (and unless f is the zero function, it will be exactly n–1). For example, if n = 2 and f ≠ 0 then f will be zero on a line through the origin of V. If n = 3 and f ≠ 0 then f will be zero on a plane through the origin, and so on. This follows from a basic theorem of linear algebra known as the Rank-Nullity Theorem.

Linear function on a two-dimensional vector space

The diagram shows a two-dimensional example; here the function f is zero on a line through the origin spanned by the vector k.

If we pick a vector such as v, for which f(v) is non-zero, there will be a line in V passing, not through the origin but through the tip of v, on which f is uniformly equal to f(v). We form this line by adding arbitrary multiples of k to v:

f(v + s k) = f(v) + s f(k) = f(v) = 1

The last equality only follows because we happen to have chosen v such that f(v) = 1. But all the sets on which f takes on a constant value will be of this form: lines parallel to the vector k. And just as we draw different vectors on this diagram as arrows of various lengths and directions, we can draw different linear functions as stacks of lines of various spacings and orientations.

Larger vectors are drawn with larger arrows, but in this scheme a larger function would be drawn, not with greater distances between the lines, but with the lines packed closer together. For example, the function g = 2 f would be drawn with an extra line between every pair of lines in the stack for f, because it reaches 1 when f is just 1/2.

The value of f for any vector effectively counts the number of gaps between the stack of lines that the vector’s arrow crosses. For example, the arrow for w crosses two complete gaps in the stack for f, so f(w) = 2. We also need to account for arrows that cross the gaps backwards to yield a negative value for f (such as p), or those that don’t cross a whole number of gaps (such as q), so this is more of an appeal to geometric intuition than a rigorous mathematical scheme, but you can probably already see how it fits in with the examples on the main page, where we discuss how many furrows in a field or contour lines on a map are crossed when we move a given distance in a certain direction.

If V has more than two dimensions, instead of a stack of lines in V we will have a stack of planes, or hyperplanes, of dimension n–1.

Now, let’s consider the set of all linear functions from V to R. We can turn this set itself into an n-dimensional vector space in its own right, which is called the dual space to V, and written V*. We call elements of V*, such as f, dual vectors.

To make a set into a real vector space we need to be able to add its elements together, and multiply them by real numbers. We can do this to the set of linear functions on V in a pretty obvious way; if f and g are in V*, v is in V, and s is in R, we define the sum f + g and the scalar multiple s f by stating their values as functions:

(f + g)(v) = f(v) + g(v)
(s f)(v) = s f(v)

As with any vector space, we can choose a basis for V*, consisting of a set of n linear functions in terms of which we can write any function in V*. We will write the elements of this basis as {e¹, e², ... eⁿ}, and write:

f = f₁ e¹ + f₂ e² + ... + f_n eⁿ

where f₁ etc. are the components of f with respect to this basis. In these notes, while we usually label the components of vectors with superscripts, we will label the components of dual vectors with subscripts – and while we usually label the individual vectors in a basis with subscripts, we will label the individual dual vectors in a basis with superscripts.

If we have a particular basis for V, {e₁, e₂, ... e_n}, we will say that a basis {e¹, e², ... eⁿ} for V* is dual to the basis for V if the following condition holds:

eⁱ(e_j) = δⁱ_j

where δⁱ_j, known as the Kronecker delta symbol, is 1 if i=j and 0 if i≠j. Geometrically, what this means is that the function eⁱ for a particular i appears as a stack of lines or planes in V such that the vector e_i crosses exactly one gap between them, and all the other basis vectors e_j lie within the line or plane that passes through the origin, and don’t cross the stack at all. For example, in the diagram on the left the vector e₂ lies within the line e¹ = 0, and the vector e₁ lies within the line e² = 0.

Given our basis {e₁, e₂, ... e_n} for V, we can find the components of any linear function f in V* with respect to the dual basis just by feeding each of the vectors e_i in the original basis to f. Because the two bases are duals, only the particular basis function eⁱ will be non-zero at e_i, and only the component that multiplies it, f_i, will remain in the sum.

f(e_i) = (f₁ e¹ + f₂ e² + ... + f_n eⁿ)(e_i) = f_i

If the components of a vector v in V are vⁱ with respect to some chosen basis for V and the components of a dual vector f in V* are f_i with respect to the dual basis, then:

f(v) = f(vⁱ e_i) = f_i vⁱ

where we have used the Einstein summation convention to abbreviate sums over repeated indices (e.g. vⁱ e_i = v¹ e₁ + v² e₂ + ... + vⁿ e_n).

Suppose we have a dot product on V. Then for any vector w in V, we can define a linear function f_w in V* by:

f_w(v) = v · w

Equally, given any linear function f in V*, we can find a vector w_f such that f(v) = v · w_f for every v in V. We choose an orthonormal basis {e₁, e₂, ... e_n} for V, then we set:

w_f = f_i e_i

where we’re using the Einstein summation convention again, and the components f_i are with respect to the basis of V* dual to our orthonormal basis of V. Then:

v · w_f = v · [f_i e_i] = f_i [v · e_i] = f_i vⁱ = f(v)

So we can identify any element of V with a unique element of V*, and vice versa.

The vector f_i e_i that we associate with f this way will be orthogonal to any vector k that lies in the kernel of f, since the dot product of k with f_i e_i is just f(k) = 0. Since all the lines (or planes, etc.) in the stack we use to visualise f are parallel to the one through the origin that is the kernel of f, the vector f_i e_i will be perpendicular to the whole stack.

We can put a dot product on V*, by declaring that any basis dual to an orthonormal basis for V is itself orthonormal. This lets us define the “length” or magnitude of a dual vector f via its square:

|f|² = f · f = (f₁ e¹ + f₂ e² + ... + f_n eⁿ) · (f₁ e¹ + f₂ e² + ... + f_n eⁿ) = (f₁)² + (f₂)² + ... + (f_n)²

Here the f_i are components taken with respect to a basis {e¹, e², ... eⁿ} of V* that is dual to an orthonormal basis for V.

According to our geometric interpretation of f, each component f_i = f(e_i) here is a count of the number of “gaps in the stack” that the basis vector e_i crosses. So these are like the counts of cycles of the wave crossed in the diagram on the right, when we move one metre along the x and y axes.

But now suppose we cross the stack perpendicularly, with a unit vector in the direction of f_i e_i – explicitly, f_i e_i / |f_i e_i| — which we know is orthogonal to the stack. The count from such a direct crossing is:

f(f_i e_i / |f_i e_i|)
= f_i f(e_i) / |f_i e_i|
= f_i f_i / |f_i e_i|
= |f|² / |f|
= |f|

This is another statement of the Dual Pythagorean Theorem! Crossing the stack directly with a unit vector gives a count, |f|, whose square we have seen is just the sum of squares of the counts obtained by crossing the stack with a unit vector in each of n orthogonal directions.

Main page | Extra
Plus, Minus | The Dual Pythagorean Theorem | Geometry and Motion | Geometry and Waves | Riemannian Electromagnetism | Riemannian Thermodynamics | Riemannian General Relativity | Riemannian Quantum Mechanics | Glossary | The Clockwork Rocket excerpt | The Eternal Flame excerpt | The Arrows of Time excerpt | Videos
Orthogonal contents
Back to home page | Site Map | Side-bar Site Map

Orthogonal / The Dual Pythagorean Theorem [Extra] / created Wednesday, 6 April 2011