What is the symbol of a differential operator?

4410 views

I find Wikipedia's discussion of symbols of differential operators a bit impenetrable, and Google doesn't seem to turn up useful links, so I'm hoping someone can point me to a more pedantic discussion.

Background

I think I understand the basic idea on $\mathbb{R}^n$, so for readers who know as little as I do, I will provide some ideas. Any differential operator on $\mathbb{R}^n$ is (uniquely) of the form $\sum p_{i_1,\dotsc,i_k}(x)\frac{\partial^k}{\partial x_{i_1}\dots\partial x_{i_k}}$, where $x_1,\dotsc,x_n$ are the canonical coordinate functions on $\mathbb{R}^n$, the $p_{i_1,\dotsc,i_k}(x)$ are smooth functions, and the sum ranges over (finitely many) possible indexes (of varying length). Then the symbol of such an operator is $\sum p_{i_1,\dotsc,i_k}(x)\xi^{i_1}\dotso\xi^{i_k}$, where $\xi^1,\dotsc,\xi^n$ are new variables; the symbol is a polynomial in the variables $\{\xi^1,\dotsc,\xi^n\}$ with coefficients in the algebra of smooth functions on $\mathbb{R}^n$.

Ok, great. So symbols are well-defined for $\mathbb{R}^n$. But most spaces are not $\mathbb{R}^n$ — most spaces are formed by gluing together copies of (open sets in) $\mathbb{R}^n$ along smooth maps. So what happens to symbols under changes of coordinates? An affine change of coordinates is a map $y_j(x)=a_j+\sum_jY_j^ix_i$ for some vector $(a_1,\dotsc,a_n)$ and some invertible matrix $Y$. It's straightforward to describe how the differential operators change under such a transformation, and thus how their symbols transform. In fact, you can forget about the fact that indices range $1,\dotsc,n$, and think of them as keeping track of tensor contraction; then everything transforms as tensors under affine coordinate changes, e.g. the variables $\xi^i$ transform as coordinates on the cotangent bundle.

On the other hand, consider the operator $D = \frac{\partial^2}{\partial x^2}$ on $\mathbb{R}$, with symbol $\xi^2$; and consider the change of coordinates $y = f(x)$. By the chain rule, the operator $D$ transforms to $(f'(y))^2\frac{\partial^2}{\partial y^2} + f''(y) \frac{\partial}{\partial y}$, with symbol $(f'(y))^2\psi^2 + f''(y)\psi$. In particular, the symbol did not transform as a function on the cotangent space. Which is to say that I don't actually understand where the symbol of a differential operator lives in a coordinate-free way.

Why I care

One reason I care is because I'm interested in quantum mechanics. If the symbol of a differential operator on a space $X$ were canonically a function on the cotangent space $T^\ast X$, then the inverse of this Symbol map would determine a "quantization" of the functions on $T^\ast X$, corresponding to the QP quantization of $\mathbb{R}^n$.

But the main reason I was thinking about this is from Lie algebras. I'd like to understand the following proof of the PBW theorem:

Let $L$ be a Lie algebra over $\mathbb{R}$ or $\mathbb{C}$, $G$ a group integrating the Lie algebra, $\mathrm{U}L$ the universal enveloping algebra of $L$ and $\mathrm{S}L$ the symmetric algebra of the vector space $L$. Then $\mathrm{U}L$ is naturally the space of left-invariant differential operators on $G$, and $\mathrm{S}L$ is naturally the space of symbols of left-invariant differential operators on $G$. Thus the map Symbol defines a canonical vector-space (and in fact coalgebra) isomorphism $\mathrm{U}L\to\mathrm{S}L$.

This post imported from StackExchange MathOverflow at 2017-04-14 08:57 (UTC), posted by SE-user Theo Johnson-Freyd

asked Oct 30, 2009 in Mathematics by Theo Johnson-Freyd (290 points) [ no revision ]
retagged Apr 14, 2017

Your comment on this question:

To answer, leave an answer instead. Comments are usually for non-answers.
To mask links under text, please type your text, highlight it, and click the "link" button. You can then enter your link URL.
To alert a user, please use the "@" command and remove spaces from the username, example, the user "John Doe" should be pinged as "@JohnDoe", while the user "Johndoe" should be pinged as "@Johndoe". The post author is always automatically pinged (unless you are the post author).
Please consult the FAQ for as to how to format your post.

Live preview (may slow down editor) Preview

Your name to display (optional):

Email me at this address if a comment is added after mine:

Privacy: Your email address will only be used for sending these notifications.

Anti-spam verification:

[captcha placeholder]

Please complete the anti-spam verification

7 Answers

One way to understand the symbol of a differential operator (or more generally, a pseudodifferential operator) is to see what the operator does to "wave packets" - functions that are strongly localised in both space and frequency.

Suppose, for instance, that one is working in $\mathbb R^n$, and one takes a function $\psi$ which is localised to a small neighbourhood of a point $x_0$, and whose Fourier transform is localised to a small neighbourhood of $\xi_0/\hbar$, for some frequency $\xi_0$ (or more geometrically, think of $(x_0,\xi_0)$ as an element of the cotangent bundle of $\mathbb R^n$). Such functions exist when $\hbar$ is small, e.g. $\psi(x) = \eta( (x-x_0)/\epsilon ) e^{i \xi_0 \cdot (x-x_0) / \hbar}$ for some smooth cutoff $\eta$ and some small $\epsilon$ (but not as small as $\hbar$).

Now apply a differential operator $L$ of degree $d$ to this wave packet. When one does so (using the chain rule and product rule as appropriate), one obtains a bunch of terms with different powers of $1/\hbar$ attached to them, with the top order term being $1/\hbar^d$ times some quantity $a(x_0,\xi_0)$ times the original wave packet. This number $a(x_0,\xi_0)$ is the principal symbol of $a$ at $(x_0,\xi_0)$. (The lower order terms are related to the lower order components of the symbol, but the precise relationship is icky.)

Basically, when viewed in a wave packet basis, (pseudo)differential operators are diagonal to top order. (This is why one has a pseudodifferential calculus.) The diagonal coefficients are essentially the principal symbol of the operator. [While on this topic: Fourier integral operators (FIO) are essentially diagonal matrices times permutation matrices in the wave packet basis, so they have a symbol as well as a relation (the canonical relation of the FIO, which happens to be a Lagrangian submanifold of phase space).]

One can construct wave packets in arbitrary smooth manifolds, basically because they look flat at small scales, and one can define the inner product $\xi_0\cdot(x-x_0)$ invariantly (up to lower order corrections) in the asymptotic limit when $x$ is close to $x_0$ and $(x_0,\xi_0)$ is in the cotangent bundle. This gives a way to define the principal symbol on manifolds, which of course agrees with the standard definition.

This post imported from StackExchange MathOverflow at 2017-04-14 08:57 (UTC), posted by SE-user Terry Tao

answered Nov 1, 2009 by Terry Tao (330 points) [ no revision ]

Your comment on this answer:

Live preview (may slow down editor) Preview

Your name to display (optional):

Email me at this address if a comment is added after mine:

Privacy: Your email address will only be used for sending these notifications.

Anti-spam verification:

[captcha placeholder]

Please complete the anti-spam verification

The principal symbol of a differential operator $\sum_{|\alpha| \leq m} a_\alpha(x) \partial_x^\alpha$ is by definition the function $\sum_{|\alpha| = m} a_\alpha(x) (i\xi)^\alpha$ Here $\alpha$ is a multi-index (so $\partial_x^\alpha$ denotes $\alpha_1$ derivatives with respect to $x_1$, etc.) At this point, the vector $\xi = (\xi_1, \ldots, \xi_n)$ is merely a formal variable. The power of this definition is that if one interprets $(x,\xi)$ as variables in the cotangent bundle in the usual way -- i.e. $x$ is any local coordinate chart, then $\xi$ is the linear coordinate in each tangent space using the basis $dx^1, \ldots, dx^n$, then the principal symbol is an invariantly defined function on $T^*X$, where $X$ is the manifold on which the operator is initially defined, which is homogeneous of degree $m$ in the cotangent variables.

Here is a more invariant way of defining it: fix $(x_0,\xi_0)$ to be any point in $T^*X$ and choose a function $\phi(x)$ so that $d\phi(x_0) = \xi_0$. If $L$ is the differential operator, then $L( e^{i\lambda \phi})$ is some complicated sum of derivatives of $\phi$, multiplied together, but always with a common factor of $e^{i\lambda \phi}$. The `top order part' is the one which has a $\lambda^m$, and if we take only this, then its coefficient has only first derivatives of $\phi$ (lower order powers of $\lambda$ can be multiplied by higher derivatives of $\phi$). Hence if we take the limit as $\lambda \to \infty$ of $\lambda^{-m} L( e^{i\lambda \phi})$ and evaluate at $x = x_0$, we get something which turns out to be exactly the principal symbol of $L$ at the point $(x_0, \xi_0)$.

There are many reasons the principal symbol is useful. There is indeed a `quantization map' which takes a principal symbol to any operator of the correct order which has this as its principal symbol. This is not well defined, but is if we mod out by operators of one order lower. Hence the comment in a previous reply about this being an isomorphism between filtered algebras.

In special situations, e.g. on a Riemannian manifold where one has preferred coordinate charts (Riemann normal coordinates), one can define a total symbol in an invariant fashion (albeit depending on the metric). There are also other ways to take the symbol, e.g. corresponding to the Weyl quantization, but that's another story.

In microlocal analysis, the symbol captures some very strong properties of the operator $L$. For example, $L$ is called elliptic if and only if the symbol is invertible (whenever $\xi \neq 0$). We can even talk about the operator being elliptic in certain directions if the principal symbol is nonvanishing in an open cone (in the $\xi$ variables) about those directions. Another interesting story is wave propagation: the characteristic set of the operator is the set of $(x,\xi)$ where the principal symbol $p(L)$ vanishes. If its differential (as a function on the cotangent bundle) is nonvanishing there, then the integral curves of the Hamiltonian flow associated to $p(L)$, i.e. for the Hamiltonian vector field determined by $p(L)$ using the standard symplectic structure on $T^*X$, ``carries'' the singularities of solutions of $Lu = 0$. This is the generalization of the classical fact that singularities of solutions of the wave equation propagate along light rays.

This post imported from StackExchange MathOverflow at 2017-04-14 08:57 (UTC), posted by SE-user Rafe Mazzeo

answered Nov 1, 2009 by Rafe Mazzeo (120 points) [ no revision ]

Your comment on this answer:

Live preview (may slow down editor) Preview

Your name to display (optional):

Email me at this address if a comment is added after mine:

Privacy: Your email address will only be used for sending these notifications.

Anti-spam verification:

[captcha placeholder]

Please complete the anti-spam verification

I think you have misunderstood the definition of "symbol." You should only take the term of highest order in the vector fields. Then the symbol is well defined. (EDIT: well, I guess I should have read Wikipedia first. I stick by my assertion that the symbol map one should consider is the leading order one).

More to the point, the symbol map isn't from differential operators to functions on the cotangent bundle, it's from the associated graded of differential operators for the order filtration to functions on the cotangent bundle. So, on operators of order less than n, you can do the operation you described to the highest order term, and you get something coordinate independent.

This post imported from StackExchange MathOverflow at 2017-04-14 08:57 (UTC), posted by SE-user Ben Webster

answered Oct 30, 2009 by Ben Webster (150 points) [ no revision ]

That's what I thought. But then the "proof" of PBW (which I got from TWF) fails, unless there's something special going on for left-invariant things.

This post imported from StackExchange MathOverflow at 2017-04-14 08:57 (UTC), posted by SE-user Theo Johnson-Freyd

commented Oct 31, 2009 by Theo Johnson-Freyd (290 points) [ no revision ]

it doesn't really fail, because there's no proof there. Baez just says there's a map and calls it "symbol." But there is no one symbol map. However, there is a unique G-invariant isomorphism of SL-> UL which sends a homogenous function to a differential operator with that element as principal symbol. You can't blame Baez for calling that map "symbol."

This post imported from StackExchange MathOverflow at 2017-04-14 08:57 (UTC), posted by SE-user Ben Webster

commented Oct 31, 2009 by Ben Webster (150 points) [ no revision ]

The phrases 'principal symbol' (the highest degree part) and 'total symbol' (for every part) are pretty useful for distinguishing between the two.

This post imported from StackExchange MathOverflow at 2017-04-14 08:57 (UTC), posted by SE-user Greg Muller

commented Nov 2, 2009 by Greg Muller (70 points) [ no revision ]

It must also be remembered that the total symbol of (say, for concreteness) a scalar linear partial differential operator doesn't live in general on the cotangent bundle, but on the bundle of jets of scalar-valued maps of the same order as the order of the operator. This can also be seen from the extension of the chain rule to higher-order derivatives. There the notion of a total symbol becomes coordinate-invariant.

This post imported from StackExchange MathOverflow at 2017-04-14 08:57 (UTC), posted by SE-user Pedro Lauridsen Ribeiro

commented Sep 7, 2011 by Pedro Lauridsen Ribeiro (580 points) [ no revision ]

Your comment on this answer:

Live preview (may slow down editor) Preview

Your name to display (optional):

Email me at this address if a comment is added after mine:

Privacy: Your email address will only be used for sending these notifications.

Anti-spam verification:

[captcha placeholder]

Please complete the anti-spam verification

The D-module course notes of Dragan Milicic contain a detailed construction of the symbol map -- they can be found on his webpage www.math.utah.edu/~milicic. There may be several versions linked there -- the 2007-2008 course should be most thorough. Start reading in Chapter 1, section 5. This goes through the construction of the filtration Ben mentions, then constructs the graded module and symbol map explicitly. Of course, this section only covers the (complex) affine case you already describe. The coordinate-free generalization for say smooth quasi-projective varieties over the complex numbers is done in Chapter 2, section 3. Basically, you look at the sheaf of differential operators on your variety, construct a degree filtration of that sheaf, then the corresponding graded sheaf is isomorphic to the direct image of the sheaf of regular functions on the cotangent bundle via the symbol map.

In the case G in your question is an algebraic group, the sheaf of differential operators on G are formed by localizing UL, and the pushforward of regular functions on the cotangent bundle is the localization of SL. Then UL and SL can be recovered by pulling these sheaves back to the identity element in G. I don't think the isomorphism the way you have stated it is true as-is. I think the content of any statement along these lines (as it relates to the proof of the PBW theorem) probably has to do with the construction of the filtration by top degree being well-defined.

This post imported from StackExchange MathOverflow at 2017-04-14 08:57 (UTC), posted by SE-user S Kitchen

answered Oct 31, 2009 by S Kitchen (40 points) [ no revision ]

Your comment on this answer:

Live preview (may slow down editor) Preview

Your name to display (optional):

Email me at this address if a comment is added after mine:

Privacy: Your email address will only be used for sending these notifications.

Anti-spam verification:

[captcha placeholder]

Please complete the anti-spam verification

The original questioner already knows this, but anyone else who is interested in this question should check out the conversation at John Baez's blog.

This post imported from StackExchange MathOverflow at 2017-04-14 08:57 (UTC), posted by SE-user David Speyer

answered Nov 1, 2009 by David Speyer (110 points) [ no revision ]

Your comment on this answer:

Live preview (may slow down editor) Preview

Your name to display (optional):

Email me at this address if a comment is added after mine:

Privacy: Your email address will only be used for sending these notifications.

Anti-spam verification:

[captcha placeholder]

Please complete the anti-spam verification

The definition of symbol as presented in Wikipedia is not invariant — only the highest order terms. Some textbooks call those higher order terms symbols (Wikipedia suggests the name principal symbol), hence the Ben's answer, which refers to that definition.

The highest order terms are clearly most important for the properties of the differential equations, e.g. their positiveness allows to prove the existence of solutions (it's related to the fact that positively definite linear operators are invertible in linear algebra).

As for "Thus the map Symbol defines a canonical vector-space (and in fact coalgebra) isomorphism UL → SL.", this statement should be proved by induction order-by-order in a fixed coordinate system. It should be true in any coordinate system, but the homomorphism depends on it.

There is however, a canonical coordinate system given by the exp map, and this, I think (not sure here), is the canonical map referred to in the question.

As for quantum mechanics, while I have only some general knowledge, I think "inverse of this Symbol map would determine a "quantization" of the functions on T*X, corresponding to the QP quantization of ℝn" is true, but somewhat too optimistic. Yes, there are quantizations, but they are canonical to all terms only when you restrict yourself to linear change of coordinates (or when you do some additional constructions).

I risk being viewed as stepping on a slippery stone here, but my limited understanding is that you're actually hitting a fundamental question here — even some quantum field theory play nice with diffeomorphisms, but it's far from a simple exercise. Moreover, the theory that includes them as the degrees of freedom would be called quantum gravity and is the Holy Grail of high-energy physics rather then a theorem of Lie group geometry (though the latter is extensively used for the former).

This post imported from StackExchange MathOverflow at 2017-04-14 08:57 (UTC), posted by SE-user Ilya Nikokoshev

answered Oct 30, 2009 by Ilya Nikokoshev (20 points) [ no revision ]

I think the accepted terminology these days is such that a 'quantum field theory' does not include gravity. A theory of quantum gravity is indeed a primary goal of research in hep-th and gr-qc, but it is questionable whether it is in the form of a quantum field theory. The current thinking is that quantum field theories are effective theories of a more fundamental theory (perhaps a string/M theory, perhaps something else).

This post imported from StackExchange MathOverflow at 2017-04-14 08:57 (UTC), posted by SE-user José Figueroa-O'Farrill

commented Nov 2, 2009 by José Figueroa-O'Farrill (2,315 points) [ no revision ]

Another comment is that Chern-Simons theory is an example of a quantum field theory which "plays nicely" with diffeomorphisms, but it is not a gravity theory because it has no metric degreees of freedom. In other words, whereas a quantum gravity theory presumably should "play nicely" with diffeomorphisms, not every theory which does is a theory of gravity.

This post imported from StackExchange MathOverflow at 2017-04-14 08:57 (UTC), posted by SE-user José Figueroa-O'Farrill

commented Nov 2, 2009 by José Figueroa-O'Farrill (2,315 points) [ no revision ]

Agreed, but the last paragraph is quite vague anyway... I'll rewrite it still.

This post imported from StackExchange MathOverflow at 2017-04-14 08:57 (UTC), posted by SE-user Ilya Nikokoshev

commented Nov 2, 2009 by Ilya Nikokoshev (20 points) [ no revision ]

Your comment on this answer:

Live preview (may slow down editor) Preview

Your name to display (optional):

Email me at this address if a comment is added after mine:

Privacy: Your email address will only be used for sending these notifications.

Anti-spam verification:

[captcha placeholder]

Please complete the anti-spam verification

I think that one shouldn't insist on the invariance of symbols. A symbol is just a realization of a differential operator on the cotangent bundle. If the symbol were inavariant under some transformation it would restrict the corresponding operator to some subset which may be less interesting. As a finite dimensional example, the subspace of linear transformations of a finite dimensional vector space invariant under the group of unitary transformations would be the dull subspace of multiples of the unit operator.

This post imported from StackExchange MathOverflow at 2017-04-14 08:57 (UTC), posted by SE-user David Bar Moshe

answered Oct 31, 2009 by David Bar Moshe (4,355 points) [ no revision ]

Your comment on this answer:

Live preview (may slow down editor) Preview

Your name to display (optional):

Email me at this address if a comment is added after mine:

Privacy: Your email address will only be used for sending these notifications.

Anti-spam verification:

[captcha placeholder]

Please complete the anti-spam verification

Your answer

Please use answers only to (at least partly) answer questions. To comment, discuss, or ask for clarification, leave a comment instead.
To mask links under text, please type your text, highlight it, and click the "link" button. You can then enter your link URL.
Please consult the FAQ for as to how to format your post.
This is the answer box; if you want to write a comment instead, please use the 'add comment' button.

Live preview (may slow down editor) Preview

Your name to display (optional):

Email me at this address if my answer is selected or commented on:

Privacy: Your email address will only be used for sending these notifications.

Anti-spam verification:

If you are a human please identify the position of the character covered by the symbol $\varnothing$ in the following word:
p$\hbar$y$\varnothing$icsOverflow
Then drag the red bullet below over the corresponding character of our banner. When you drop it there, the bullet changes to green (on slow internet connections after a few seconds).

Please complete the anti-spam verification

News

Tools for paper authors

Tools for SE users

Public \(\beta\) tools

Most popular tags

Site Statistics

What is the symbol of a differential operator?

Background

Why I care

Your comment on this question:

Live Preview

Preview

7 Answers

Your comment on this answer:

Live Preview

Preview

Your comment on this answer:

Live Preview

Preview

Your comment on this answer:

Live Preview

Preview

Your comment on this answer:

Live Preview

Preview

Your comment on this answer:

Live Preview

Preview

Your comment on this answer:

Live Preview

Preview

Your comment on this answer:

Live Preview

Preview

Your answer

Live Preview

Preview

News

Tools for paper authors

Tools for SE users

Public \(\beta\) tools

Most popular tags

Related questions

Site Statistics

What is the symbol of a differential operator?

Background

Why I care

Your comment on this question:

Live Preview

Preview

7 Answers

Your comment on this answer:

Live Preview

Preview

Your comment on this answer:

Live Preview

Preview

Your comment on this answer:

Live Preview

Preview

Your comment on this answer:

Live Preview

Preview

Your comment on this answer:

Live Preview

Preview

Your comment on this answer:

Live Preview

Preview

Your comment on this answer:

Live Preview

Preview

Your answer

Live Preview

Preview