Quantcast
Loading [MathJax]/jax/output/HTML-CSS/jax.js
  • Register
PhysicsOverflow is a next-generation academic platform for physicists and astronomers, including a community peer review system and a postgraduate-level discussion forum analogous to MathOverflow.
W3Counter Web Stats

Welcome to PhysicsOverflow! PhysicsOverflow is an open platform for community peer review and graduate-level Physics discussion.

Please help promote PhysicsOverflow ads elsewhere if you like it.

News

PO is now at the Physics Department of Bielefeld University!

New printer friendly PO pages!

Migration to Bielefeld University was successful!

Please vote for this year's PhysicsOverflow ads!

Please do help out in categorising submissions. Submit a paper to PhysicsOverflow!

... see more

Tools for paper authors

Submit paper
Claim Paper Authorship

Tools for SE users

Search User
Reclaim SE Account
Request Account Merger
Nativise imported posts
Claim post (deleted users)
Import SE post

Users whose questions have been imported from Physics Stack Exchange, Theoretical Physics Stack Exchange, or any other Stack Exchange site are kindly requested to reclaim their account and not to register as a new user.

Public β tools

Report a bug with a feature
Request a new functionality
404 page design
Send feedback

Attributions

(propose a free ad)

Site Statistics

208 submissions , 166 unreviewed
5,138 questions , 2,258 unanswered
5,413 answers , 23,081 comments
1,470 users with positive rep
822 active unimported users
More ...

  Is partial derivative a vector or dual vector?

+ 7 like - 0 dislike
9288 views

The textbook(Introduction to the Classical Theory of Particles and Fields, by Boris Kosyakov) defines a hypersurface by F(x) = c, where FC[M4,R]. Differentiating gives dF = (μF)dxμ = 0. The text then says dxμ is a covector and μF a vector. I learnt from another book that dxμ are 4 dual vectors(in Minkowski space), μ indexes dual vector themselves, not components of a single dual vector. So I think μF should also be 4 vectors, each being the directional derivative along a coordinate axis. But this book later states that (μF)dxμ=0 describes a hyperplane Σ with normal μF spanned by vectors dxμ, and calls Σ a tangent plane (page 33-34). This time, it seems to treat μF as a single vector and dxμ as vectors. But I think dxμ should span a cotangent space.

I need some help to clarify these things.

[edit by Ben Crowell] The following appears to be the text the question refers to, from Appendix A (which Amazon let me see through its peephole):

Elie Cartan proposed to use differential coordinates dxi as a convenient basis of 1-forms. The differentials dxi transform like covectors [...] Furthermore, when used in the directional derivative dxiF/xi, dxi may be viewed as a linear functional which takes real values on vectors F/xi. The line elements dxi are called [...] 1-forms.

This post imported from StackExchange Physics at 2014-11-11 14:50 (UTC), posted by SE-user elflyao
asked Oct 31, 2014 in Mathematics by elflyao (35 points) [ no revision ]
Most voted comments show all comments
@MBN: "μF is a function not a vector." You mean it's a function in the sense that it varies over spacetime? Sure, that's certainly true, but it's a vector-valued function. "Perhaps you mean the fourtuple of the μF's?" What I would normally mean by this is that when the index μ is unbound, and no coordinate system has been specified, the index notation μF indicates the entire gradient covector, as in abstract index notation. This is similar to "the function f(x)," where x is unbound. Kosyakov's "vectors F/xi," with [...]

This post imported from StackExchange Physics at 2014-11-11 14:50 (UTC), posted by SE-user Ben Crowell
plural "vectors," suggests that he thinks of something like F/x0 as a vector. This doesn't make much sense to me, since it clearly can't have the transformation properties of a vector. It's a single real number, and, e.g., under a change of coordinates x07x0, F/x0(1/7)F/x0, which is what a component of a covector would do. A component of a vector would get 7 times bigger.

This post imported from StackExchange Physics at 2014-11-11 14:50 (UTC), posted by SE-user Ben Crowell
@BenCrowell: OK, then it makes sense. To me it looked like one partial derivative.

This post imported from StackExchange Physics at 2014-11-11 14:50 (UTC), posted by SE-user MBN
Wasn't this question already answered? I see a lot of great answers below...

This post imported from StackExchange Physics at 2014-11-11 14:50 (UTC), posted by SE-user QuantumBrick
I compared Kosyakov's definitions with Wald, General Relativity, pp. 15 and 20ff. There are some inconsistencies. Wald defines F as the set of smooth scalar fields on a manifold M, and defines a tangent vector vVp as a map v:FR that is linear and obeys the Leibniz rule at p, so that it can be interpreted as a directional derivative at p. This makes a partial derivative a vector as a matter of definition, and it also means that for FF, F/xi is a real number, not a vector as Kosyakov describes it.

This post imported from StackExchange Physics at 2014-11-11 14:50 (UTC), posted by SE-user Ben Crowell
Most recent comments show all comments
After writing an answer and then succeeding in getting a look at what Kosyakov wrote, I'm just as confused as elflyao. I would be interested in hearing from others who might have broader experience or be able to explain whether Kosyakov has an unusual point of view.

This post imported from StackExchange Physics at 2014-11-02 19:45 (UTC), posted by SE-user Ben Crowell
cross-posted here: physicsforums.com/threads/…

This post imported from StackExchange Physics at 2014-11-11 14:50 (UTC), posted by SE-user Ben Crowell

6 Answers

+ 7 like - 0 dislike

Below follows a handful of excerpts from the book Introduction to the Classical Theory of Particles and Fields (2007) by B. Kosyakov.

Controversial/misleading/wrong statements are marked in red. We agree with OP that the statements marked in red are opposite standard terminology/conventions. Some (not all) correct statements are marked in green.

1.2 Affine and Metric Structures

[...] Let e1, , en and e1, , en be two arbitrary bases. Each vector of the latter basis can be expanded in terms of vectors of the former basis: ei = ej Lji.

[...] Thus, linear functionals form the dual vector space V. If V is n-dimensional, so is V. Indeed, let e1, , en be a basis in V. Then any ωV is specified by n real numbers ω1=ω(e1), , ωn=ω(en), and the value of ω on a=aiei is given by ω(a) = ωiai. We see that V is isomorphic to V. That is why we sometimes refer to linear functionals as covectors. A closer look at (1.52) shows that a vector a can be regarded as a linear functional on V. One can show (Problem 1.2.3) that changing the basis (1.37) implies the transformation of ωi according to the same law: ωi = ωj Lji. We will usually suppress the argument of ω(a), and identify ω with its components ωi. [...]

1.3 Vectors, Tensors, and n-Forms

[...] A simple generalization of vectors and covectors are tensors. Algebraically, a tensor T of rank (m,n) is a multilinear mapping T:V××Vm times×V××Vn timesR. We have already encountered examples of tensors in the previous section: a scalar is a rank (0,0) tensor, a vector is a rank (1,0) tensor, a covector is a rank (0,1) tensor, the metric gij is a rank (0,2), while gij is a rank (2,0) tensor, and the Kronecker delta δij is a rank (1,1) tensor. Just as four vectors can be regarded as objects which transform according to the law aμ = Λμν aν, where Λμν is the Lorentz transformation matrix relating the two frames of reference, so tensors of rank (m,n) can be described in terms of Lorentz group representations by the requirement that their transformation law be Tμ1μmν1νn = Λμ1α1Λμmαm Tα1αmβ1βn Λβ1ν1Λβnνn.

[...]The differential operator μ = xμ transforms like a covariant vector. To see this, we use the chain rule for differentiation: xμ = xνxμxν, and note that, for linear coordinate transformations xμ=Λμν xν+aμ xμxν = Λμν. We will always use the shorthand notation μ, and treat this differential operator as an ordinary vector. [...]

1.4 Lines and Surfaces

[...] We define a hypersurface Mn1 by F(x) = C, where F is an arbitrary smooth function M4R. Differentiating (1.176) gives (μF)dxμ = 0. One may view dxμ as a covector, and μF as a vector. Indeed, dxμ transforms like a covector under linear coordinate transformations xμ=Λμν xν+aμ, dxμ = xμxνdxν = Λμν dxν, and μF transforms like a vector: Fxμ = Fxνxνxμ = FxνΛνμ. In Minkowski space, vectors and covectors can be converted to each other according to (1.121). For this reason, we will often regard dxμ as vectors. [...]

A. Differential Forms

[...] Elie Cartan proposed to use differential coordinates dxi as a convenient basis of one forms. The differentials dxi transform like covectors under a local coordinate change, dxj = xjxidxi. [If the coordinate change is specialized to Euclidean transformations xj=Lji xi+cj, then xj/xi reduces to Lji, an orthogonal matrix with constant entries, and (A.1) becomes (1.53), the transformation law for covectors.] [...]

Notes:

  1. The corrected eq. (1.114) reads Tμ1μmν1νn = Λμ1α1Λμmαm Tα1αmβ1βn (Λ1)β1ν1(Λ1)βnνn.

  2. The corrected eq. (1.179) reads Fxμ = Fxνxνxμ = Fxν(Λ1)νμ.

  3. To explain why (A.1) does not becomes (1.53), let e1, , en, be a (dual) basis in V. In light of (1.53), in order for a covector ω=ωieiV to be independent of the choice of basis, the dual basis must transform as ei = Mij ej, where M = L1. Identifying the dual bases eidxi, the above eq.(*) becomes (A.1). Moreover, in the sentence below eq. (A.1), the L matrix should be replaced with the M matrix in two places.

  4. Finally, let us answer OP's title question: A partial derivative μF (of a scalar function F) is a component of a cotangent vector dF=(μF)dxμ, while the un-applied partial derivative μ is a local basis element of a tangent vector. Both μF and μ transform as covectors.

This post imported from StackExchange Physics at 2014-11-11 14:51 (UTC), posted by SE-user Qmechanic
answered Nov 3, 2014 by Qmechanic (3,120 points) [ no revision ]
well point 4. is nicely put, but i dont like so much the red/green tags, as this is convention than anything else (of course any book errata excepted)

This post imported from StackExchange Physics at 2014-11-11 14:51 (UTC), posted by SE-user Nikos M.
@NikosM.: Qmechanic's points 1 and 2 are not matters of convention; Kosyakov has made mistakes in those spots.

This post imported from StackExchange Physics at 2014-11-11 14:51 (UTC), posted by SE-user Ben Crowell
+ 6 like - 0 dislike

I took a quick look at pages 59 and 60 of "Gravitation", section 2.6 "Gradients and Directional Derivatives", to see if there's anything there we can use to clarify this issue.

In this section, the gradient of f is df, the directional derivative along the vector v is vf and the following relationship holds:

vf=df,v

Then assuming a set of basis forms dxμ and dual basis vectors eμ we have

μfeμf=df,eμ=fxμ

So, according to MTW in this section, μf are the components of df on this basis.

Thus, it must be that, per the 2nd equation in the question,

df=(μf)dxμ

which is just the expansion of the form df on the basis forms dxμ

As to why Kosyakov would identify this as a contraction of a form and a vector I haven't a clue.

This post imported from StackExchange Physics at 2014-11-11 14:51 (UTC), posted by SE-user Alfred Centauri
answered Nov 2, 2014 by Alfred Centauri (110 points) [ no revision ]
+ 4 like - 0 dislike

I believe this is just imprecise use of language by the author - there is nothing mysterious happening, it is just not well stated:

As stated in the question, for a hypersurface Σ defined by

F(x)=cR

we find that

dF=0

must hold on Σ. This is crucial - it means that the 1-form dF acting upon tangent vectors of Σ must vanish identically:

vTxΣ:dF(v)=(μF)vμ=0

But we can recognize (μF)vμ as the scalar product of the vectors v and g(dF,˙), the latter being the usual dual of dF with components μF. Since TxΣTxM4 naturally, this means that dF=0 indeed sweeps out a hypersurface in the tangent space that has, in sloppy diction, the gradient as its normal (although it is really its dual).

This post imported from StackExchange Physics at 2014-11-11 14:51 (UTC), posted by SE-user ACuriousMind
answered Nov 2, 2014 by ACuriousMind (910 points) [ no revision ]
I think the crucial point is in the sentence "But we can recognize...," and it's here that I don't follow you. Since the gradient dF is a 1-form, its dual μF is a vector. If so, then we can't take the scalar product of the vector μF with a vector vμ. I don't see any reason for taking duals anywhere at all. Even if we didn't have a metric, and therefore couldn't take duals, we could simply have (μF)vμ, the scalar product of a covector with a vector.

This post imported from StackExchange Physics at 2014-11-11 14:51 (UTC), posted by SE-user Ben Crowell
@BenCrowell: Yes, indeed. That will still define a hypersurface in the tangent space, but we will not have a "normal vector" to describe it. I agree that there is no need to take duals - but I think Kosyakov implicitly does exactly that when he talks of μF being a normal vector. Your nomenclature seems a bit unorthodox to me though - a scalar product is between two vectors or two covectors (and usually induced by the metric) - applying a covector to a vector is not a scalar product.

This post imported from StackExchange Physics at 2014-11-11 14:51 (UTC), posted by SE-user ACuriousMind
OK, by "scalar product" I simply meant a product that transforms as a scalar. So taking "scalar product" in your answer to mean g(,), I don't understand why one would describe (μF)vμ as a scalar product of vμ with μ. That might indicate that we take the gradient, raise its index, lower its index, and then contract. I don't see the point of raising an index and then immediately lowering it again. Or we could raise the gradient's index, lower vμ's index, and contract. Again, why raise or lower at all?

This post imported from StackExchange Physics at 2014-11-11 14:51 (UTC), posted by SE-user Ben Crowell
@BenCrowell: Because it yields the geometric interpretation of μF as the normal vector to the hyperplane of tangent vectors of Σ. I don't think there's anything deeper than that here.

This post imported from StackExchange Physics at 2014-11-11 14:51 (UTC), posted by SE-user ACuriousMind
+ 3 like - 0 dislike

On a manifold without extra structure, dF makes sense only as a covector  (= 1-form = a section of the cotangent space). For example, like any 1-form, one can integrate dF along a path (but unlike for a general 1-form, the integral only depends on the endpoints). To interpret dF as a vector, one needs a metric that can be used to identify vectors and covectors.

You can see that dF must be a covector from your formula, since as a linear combination of the covectors dxμ,  dF is itself a covector - with components dFμ.  The term dFμ(x) itself is just a number, and dFμ a noncovariant function. 

Physicists using the Einstein summation convention refer to a tensor by its indices, then dFμ is what the mathematicians call dF - a covector. (But the physicists notation gets confusing when it is applied to dxμ, which looks like a vectoraccording to its indeices, but in fact is for each fixed μ a covector.)

answered Nov 9, 2014 by Arnold Neumaier (15,787 points) [ revision history ]
+ 3 like - 1 dislike

Hmm I guess the that the simplest way to think about it is the following. If you have a differentiable manifold X(which for the shake of argument let us suppose it enjoys all nice properties we like) then it is possible to define the tangent and co-tangent space at this point such  that the partial derivatives i|pXform an orthonormal basis in the tangent space TpX and the 1-forms dxiare elements of the co-tangent space TpX. Both of these quantities form a real vector space and they are isomorphic to each other. By the way, your first formula gives zero because F is constant. F is a 4-dimensional plane In general we have 

dF=iiFdxi

You can understand the partial derivative as a vector if you really think of what it is doing upon acting on a scalar. It gives it a direction! Draw a graph of a smooth scalar function and take the tangent at a point. Intuitively you have drawn a vector (dir. derivative as you mention) from the point of contact to some direction.  Finally if your space is Minkowski then you have i,j=0,1,2,3. Do not get confused on the notation. Each element dxidenotes a cotangent vector. For example in 4d Minkowski  you have dx0,dx1,dx2,dx3. You can find some nice explanations in Nakahara's book Geometry Topology and Physics and also in almost any General Relativity textbook. Also check out this.

Hope it helps.

answered Nov 8, 2014 by conformal_gk (3,625 points) [ no revision ]

dFμ cannot have the meaning of a vector unless one has a (pseudo-)Riemannian manifold! See my answer.

This should be included in "all the nice properties". To my best knowledge we, physicists, almost always discuss (pseudo-)Riemannian manifolds. To a physicists, I think, my answer is quite right. I guess a mathematician would not like it though. In the end of the day, it seems we are talking about simple GR nor I said anywhere that dFμ is a vector. I said it for the partial derivative. As for the metric, well, M4 is clearly a metric space, so not sure what is the target of your comment although your answer is very nice!

But the question itself distinguishes between vectors and covectors, hence assumes no metric. Otherwise there wouldn't be any confusion to begin with.

In GR many times we call the partial derivatives as vectors and and the one-forms as co-vectors in the sense that they belong in the cotangent bundle of the specified point. Furthermore the whole question assumes Minkowski 4, at least this is what I got.  I completely agree with your answer and in specific that sometimes we mix notation but I think that, at least intuitively, there is no error in my answer ) 

+ 1 like - 0 dislike

I believe you are confused because you are mixing up related but slightly different quantities.

Yes, a partial derivative is a vector and yes, a vector is an object with an upper index.

The above statement may seem contradictory, but in fact it is not for the following reason. A vector is an abstract quantity that is an element of a "vector space". In this case, the vector space that is being discussed is the tangent space. On a vector space, one can choose a basis, any basis. Once a basis has been chosen, any other vector in the vector space can be described by simply prescribing a set of numbers. For instance in R2 (rather the corresponding affine space), one can choose a basis of vectors as ˆx and ˆy. Once this has been done any other vector can be described simply by 2 numbers. For instance the numbers (1,2) really implies that we are talking about the vector ˆx+2ˆy.

How does the discussion above apply here?

On the tangent space, a natural choice of basis are the set of partial derivatives μ={0,1,2,3} (assuming we are in M4. Each partial derivative is in itself a vector.

Now, once this basis has been chosen, every other vector can be described by a set of 4 numbers vμ=(v0,v1,v2,v3) which corresponds to the vector vμμ. It is this sense, that the bold statement above is true. Often, since the basis of partial derivatives is obvious, one simply describes a vector as an object with an upper index vμ.

Next, let us discuss co-vectors (quantities with a lower index). These are lements of the dual vector space (which is the space of linear functions on the vector space) of the tangent space. Given the partial derivative basis on the tangent space, one then has a natural basis in the cotangent space denoted by dxμ={dx0,dx1,dx2,dx3}. Note that each differential itself is a covector. This natural basis is defined by the relation dxμ(ν)=δμν. As before, once this natural basis has been chosen, any element of the cotangent space can be described by 4 numbers, namely vμ={v0,v1,v2,v3} which corresponds to the covector vμdxμ.

In summary, μ for each μ corresponds to a 4-dimensional vector whereas vμ for each μ corresponds to 4 components of a single vector. Similarly, dxμ for each μ corresponds to a 4-dimensional covector whereas vμ for each μ corresponds to 4 components of a single covector.

PS 1 - Sometimes people like to use bases other than μ and dxμ on the tangent and cotangent spaces respectively. These are known as non-coordinate bases.

PS 2 - Just to be clear, μ is a vector, but μF is a function

This post imported from StackExchange Physics at 2014-11-11 14:51 (UTC), posted by SE-user Prahar
answered Nov 3, 2014 by prahar21 (545 points) [ no revision ]

Your answer

Please use answers only to (at least partly) answer questions. To comment, discuss, or ask for clarification, leave a comment instead.
To mask links under text, please type your text, highlight it, and click the "link" button. You can then enter your link URL.
Please consult the FAQ for as to how to format your post.
This is the answer box; if you want to write a comment instead, please use the 'add comment' button.
Live preview (may slow down editor)   Preview
Your name to display (optional):
Anti-spam verification:
If you are a human please identify the position of the character covered by the symbol in the following word:
pysicOverflow
Then drag the red bullet below over the corresponding character of our banner. When you drop it there, the bullet changes to green (on slow internet connections after a few seconds).
Please complete the anti-spam verification




user contributions licensed under cc by-sa 3.0 with attribution required

Your rights
...