This articleneeds additional citations forverification.(November 2017) |
Inmathematics,anordered basisof avector spaceof finitedimensionnallows representing uniquely any element of the vector space by acoordinate vector,which is asequenceofnscalarscalledcoordinates.If two different bases are considered, the coordinate vector that represents a vectorvon one basis is, in general, different from the coordinate vector that representsvon the other basis. Achange of basisconsists of converting every assertion expressed in terms of coordinates relative to one basis into an assertion expressed in terms of coordinates relative to the other basis.[1][2][3]
Such a conversion results from thechange-of-basis formulawhich expresses the coordinates relative to one basis in terms of coordinates relative to the other basis. Usingmatrices,this formula can be written
where "old" and "new" refer respectively to the initially defined basis and the other basis,andare thecolumn vectorsof the coordinates of the same vector on the two bases.is thechange-of-basis matrix(also calledtransition matrix), which is the matrix whose columns are the coordinates of the newbasis vectorson the old basis.
A change of basis is sometimes called achange of coordinates,although it excludes manycoordinate transformations. For applications inphysicsand specially inmechanics,a change of basis often involves the transformation of anorthonormal basis,understood as arotationinphysical space,thus excludingtranslations. This article deals mainly with finite-dimensional vector spaces. However, many of the principles are also valid for infinite-dimensional vector spaces.
Change of basis formula
editLetbe a basis of afinite-dimensional vector spaceVover afieldF.[a]
Forj= 1,...,n,one can define a vectorwjby its coordinatesover
Let
be thematrixwhosejth column is formed by the coordinates ofwj.(Here and in what follows, the indexirefers always to the rows ofAand thewhile the indexjrefers always to the columns ofAand thesuch a convention is useful for avoiding errors in explicit computations.)
Settingone has thatis a basis ofVif and only if the matrixAisinvertible,or equivalently if it has a nonzerodeterminant.In this case,Ais said to be thechange-of-basis matrixfrom the basisto the basis
Given a vectorletbe the coordinates ofoverandits coordinates overthat is
(One could take the same summation index for the two sums, but choosing systematically the indexesifor the old basis andjfor the new one makes clearer the formulas that follows, and helps avoiding errors in proofs and explicit computations.)
Thechange-of-basis formulaexpresses the coordinates over the old basis in terms of the coordinates over the new basis. With above notation, it is
In terms of matrices, the change of basis formula is
whereandare the column vectors of the coordinates ofzoverandrespectively.
Proof:Using the above definition of the change-of basis matrix, one has
Asthe change-of-basis formula results from the uniqueness of the decomposition of a vector over a basis.
Example
editConsider theEuclidean vector spaceand a basis consisting of the vectorsandIf onerotatesthem by an angle oft,one has anew basisformed byand
So, the change-of-basis matrix is
The change-of-basis formula asserts that, ifare the new coordinates of a vectorthen one has
That is,
This may be verified by writing
In terms of linear maps
editNormally, amatrixrepresents alinear map,and the product of a matrix and a column vector represents thefunction applicationof the corresponding linear map to the vector whose coordinates form the column vector. The change-of-basis formula is a specific case of this general principle, although this is not immediately clear from its definition and proof.
When one says that a matrixrepresentsa linear map, one refers implicitly tobasesof implied vector spaces, and to the fact that the choice of a basis induces anisomorphismbetween a vector space andFn,whereFis the field of scalars. When only one basis is considered for each vector space, it is worth to leave this isomorphism implicit, and to workup toan isomorphism. As several bases of the same vector space are considered here, a more accurate wording is required.
LetFbe afield,the setof then-tuplesis aF-vector space whose addition and scalar multiplication are defined component-wise. Itsstandard basisis the basis that has as itsith element the tuple with all components equal to0except theith that is1.
A basisof aF-vector spaceVdefines alinear isomorphismby
Conversely, such a linear isomorphism defines a basis, which is the image byof the standard basis of
Letbe the "old basis" of a change of basis, andthe associated isomorphism. Given a change-of basis matrixA,one could consider it the matrix of anendomorphismofFinally, define
(wheredenotesfunction composition), and
A straightforward verification shows that this definition ofis the same as that of the preceding section.
Now, by composing the equationwithon the left andon the right, one gets
It follows that, forone has
which is the change-of-basis formula expressed in terms of linear maps instead of coordinates.
Function defined on a vector space
editAfunctionthat has a vector space as itsdomainis commonly specified as amultivariate functionwhose variables are the coordinates on some basis of the vector on which the function isapplied.
When the basis is changed, theexpressionof the function is changed. This change can be computed by substituting the "old" coordinates for their expressions in terms of the "new" coordinates. More precisely, iff(x)is the expression of the function in terms of the old coordinates, and ifx=Ayis the change-of-base formula, thenf(Ay)is the expression of the same function in terms of the new coordinates.
The fact that the change-of-basis formula expresses the old coordinates in terms of the new one may seem unnatural, but appears as useful, as nomatrix inversionis needed here.
As the change-of-basis formula involves onlylinear functions,many function properties are kept by a change of basis. This allows defining these properties as properties of functions of a variable vector that are not related to any specific basis. So, a function whose domain is a vector space or a subset of it is
- a linear function,
- apolynomial function,
- acontinuous function,
- adifferentiable function,
- asmooth function,
- ananalytic function,
if the multivariate function that represents it on some basis—and thus on every basis—has the same property.
This is specially useful in the theory ofmanifolds,as this allows extending the concepts of continuous, differentiable, smooth and analytic functions to functions that are defined on a manifold.
Linear maps
editConsider alinear mapT:W→Vfrom avector spaceWof dimensionnto a vector spaceVof dimensionm.It is represented on "old" bases ofVandWby am×nmatrixM.A change of bases is defined by anm×mchange-of-basis matrixPforV,and ann×nchange-of-basis matrixQforW.
On the "new" bases, the matrix ofTis
This is a straightforward consequence of the change-of-basis formula.
Endomorphisms
editEndomorphismsare linear maps from a vector spaceVto itself. For a change of basis, the formula of the preceding section applies, with the same change-of-basis matrix on both sides of the formula. That is, ifMis thesquare matrixof an endomorphism ofVover an "old" basis, andPis a change-of-basis matrix, then the matrix of the endomorphism on the "new" basis is
As everyinvertible matrixcan be used as a change-of-basis matrix, this implies that two matrices aresimilarif and only if they represent the same endomorphism on two different bases.
Bilinear forms
editAbilinear formon a vector spaceVover afieldFis a functionV×V→ Fwhich islinearin both arguments. That is,B:V×V→ Fis bilinear if the maps and are linear for every fixed
The matrixBof a bilinear formBon a basis(the "old" basis in what follows) is the matrix whose entry of theith row andjth column is.It follows that ifvandware the column vectors of the coordinates of two vectorsvandw,one has
wheredenotes thetransposeof the matrixv.
IfPis a change of basis matrix, then a straightforward computation shows that the matrix of the bilinear form on the new basis is
Asymmetric bilinear formis a bilinear formBsuch thatfor everyvandwinV.It follows that the matrix ofBon any basis issymmetric.This implies that the property of being a symmetric matrix must be kept by the above change-of-base formula. One can also check this by noting that the transpose of a matrix product is the product of the transposes computed in the reverse order. In particular,
and the two members of this equation equalif the matrixBis symmetric.
If thecharacteristicof the ground fieldFis not two, then for every symmetric bilinear form there is a basis for which the matrix isdiagonal.Moreover, the resulting nonzero entries on the diagonal are defined up to the multiplication by a square. So, if the ground field is the fieldof thereal numbers,these nonzero entries can be chosen to be either1or–1.Sylvester's law of inertiais a theorem that asserts that the numbers of1and of–1depends only on the bilinear form, and not of the change of basis.
Symmetric bilinear forms over the reals are often encountered ingeometryandphysics,typically in the study ofquadricsand of theinertiaof arigid body.In these cases,orthonormal basesare specially useful; this means that one generally prefer to restrict changes of basis to those that have anorthogonalchange-of-base matrix, that is, a matrix such thatSuch matrices have the fundamental property that the change-of-base formula is the same for a symmetric bilinear form and the endomorphism that is represented by the same symmetric matrix. TheSpectral theoremasserts that, given such a symmetric matrix, there is an orthogonal change of basis such that the resulting matrix (of both the bilinear form and the endomorphism) is a diagonal matrix with theeigenvaluesof the initial matrix on the diagonal. It follows that, over the reals, if the matrix of an endomorphism is symmetric, then it isdiagonalizable.
See also
edit- Active and passive transformation
- Covariance and contravariance of vectors
- Integral transform,the continuous analogue of change of basis.
- Chirgwin-Coulson weights— application in computational chemistry
Notes
edit- ^Although a basis is generally defined as a set of vectors (for example, as a spanning set that is linearly independent), thetuplenotation is convenient here, since the indexing by the first positive integers makes the basis anordered basis.
References
edit- ^Anton (1987,pp. 221–237)
- ^Beauregard & Fraleigh (1973,pp. 240–243)
- ^Nering (1970,pp. 50–52)
Bibliography
edit- Anton, Howard (1987),Elementary Linear Algebra(5th ed.), New York:Wiley,ISBN0-471-84819-0
- Beauregard, Raymond A.; Fraleigh, John B. (1973),A First Course In Linear Algebra: with Optional Introduction to Groups, Rings, and Fields,Boston:Houghton Mifflin Company,ISBN0-395-14017-X
- Nering, Evar D. (1970),Linear Algebra and Matrix Theory(2nd ed.), New York:Wiley,LCCN76091646
External links
edit- MIT Linear Algebra Lecture on Change of Basis,from MIT OpenCourseWare
- Khan Academy Lecture on Change of Basis,from Khan Academy