Quantcast
  • Register
PhysicsOverflow is a next-generation academic platform for physicists and astronomers, including a community peer review system and a postgraduate-level discussion forum analogous to MathOverflow.

Welcome to PhysicsOverflow! PhysicsOverflow is an open platform for community peer review and graduate-level Physics discussion.

Please help promote PhysicsOverflow ads elsewhere if you like it.

News

PO is now at the Physics Department of Bielefeld University!

New printer friendly PO pages!

Migration to Bielefeld University was successful!

Please vote for this year's PhysicsOverflow ads!

Please do help out in categorising submissions. Submit a paper to PhysicsOverflow!

... see more

Tools for paper authors

Submit paper
Claim Paper Authorship

Tools for SE users

Search User
Reclaim SE Account
Request Account Merger
Nativise imported posts
Claim post (deleted users)
Import SE post

Users whose questions have been imported from Physics Stack Exchange, Theoretical Physics Stack Exchange, or any other Stack Exchange site are kindly requested to reclaim their account and not to register as a new user.

Public \(\beta\) tools

Report a bug with a feature
Request a new functionality
404 page design
Send feedback

Attributions

(propose a free ad)

Site Statistics

205 submissions , 163 unreviewed
5,082 questions , 2,232 unanswered
5,353 answers , 22,789 comments
1,470 users with positive rep
820 active unimported users
More ...

  In spontaneous symmetry breaking, why can we be sure the VEV is near the classical one, given the convexity of effective potential?

+ 6 like - 0 dislike
8266 views

For concreteness let's nail the context to be Mexican hat potential. To deal with spontaneous symmetry breaking(SSB), one usually starts with assuming the vacuum expectation value(VEV) to be near the classical one, i.e., a minimum of the Mexican hat, and then quantum correct it by perturbation theory. However, as a nonperturbative result, the exact effective potential must be convex, this means our Mexican hat becomes flat-bottomed after the full quantum corrections. Then it means the VEV can be any where in this flat bottom(it can even be 0!), possibly very very far away from the classical one, so why is it reasonable at all to start the calculation from the classical VEV as if it must be approximately correct?

Update: After the long discussion, what's clear to me now is that, the physical vacuum must be chosen from a list of superselected vacua, and superpositions of vacua in this list are not viable choices, so the VEV probably cannot be freely chosen from the flat bottom. However, what's not yet clarified is, how do we know in general, that the superselected vacua must be represented by the edge of the flat bottom?

Update: After talking to people, it kinda surprised me that this is not a widely known result even to field theorists, so I'll just put the standard proof here for reference: 

Consider an Euclidean scalar theory, we have the free energy functional that generates connected diagrams $W[\rho]$, defined as 

$$\exp W[\rho]= \int \mathcal{D}\phi \exp\{-\int d^n x [L-\rho \phi] \}.$$

We can define a new positive measure for notational convenience

$$\mathcal{D}\mu= \mathcal{D}\phi \exp\{-\int d^n x L\}$$

Now we can write

$$\exp W[\rho]= \int \mathcal{D}\mu \exp\{\int d^n x \rho \phi \}.$$

Now Holder's inequality asserts 

$$\int \mathcal{D}\mu F^\alpha G^\beta \leq [\int \mathcal{D}\mu F]^\alpha[\int \mathcal{D}\mu G]^\beta,$$ 

where $F$ and $G$ are arbitrary positive functionals, $\alpha$ and $\beta$ are positive numbers satisfying $\alpha+\beta=1$.

Now apply Holder's inequality to $F=\exp\{ \int \rho_1\phi \}$ and $G=\exp\{ \int \rho_2\phi \}$ then we have 

$$\exp W[\rho_1+\rho_2]\leq \exp \{W[\rho_1]+W[\rho_2]\}$$

And this means $W$ is a convex functional. Since Legendre transform preserves convexity, we conclude $\Gamma[\langle \phi \rangle]$ is a convex functional. 

But then I'm not yet sure what the counter part of this theorem is in Minkowski QFT.

And I copy one of my comments under Ron's post since it might be relevant, but still quite possibly very flawed:

 I think if somehow we can make the analogy toward Gibbs free energy of phase transitions, we may still sensibly argue VEV should be at the edge: in this analogy, superselected vacua should correspond to pure phases, while maybe linear superpositions of superselected vacuua should correspond to coexisting phases. Then when one tries to goes across the phase transition lines, it would be expected that we first start by hitting the coexisting phase with 0 mixture, then gradually to the ones with finite mixture, then enter the zone of 0 mixture again, for example: water-->water vapour mixture-->vapour; it would be just very weird to suddenly hit a point of finite mixture at the very beginning, then at some point in the middle it becomes pure again.

This is not yet a well thought of argument, for example the immediate flaw I can think of would be that the correspondence to Gibbs free energy can't be exact. To be exact, one must have a path integral over periodic boundary conditions, but the effective potential we are talking about really is derived from the partition function with vacuum as fixed boundary conditions, not integrated over.

asked Feb 4, 2015 in Theoretical Physics by Jia Yiyang (2,640 points) [ revision history ]
edited Sep 18, 2015 by Jia Yiyang
I liked it the other way better.
@RonMaimon, ok, reverted.
"as a nonperturbative result, the exact effective potential must be convex, this means our Mexican hat becomes flat-bottomed after the full quantum corrections." Why? What's the reason?
As another comment: the classical VEV is a free parameter of the theory, only determined experimentally through muon decay and muon mass (given by Zeeman transitions in muonion). I guess that in principle the classical VEV might be zero, but muons would be stable, there wouldn't be Higgs mechanism and so on. As for the real (fully quantum) VEV, should it be close to the classical one? Why? What are the implications if they were very distinct and in particular if the real one is zero? Would that make the perturbative approach not justified?
@drake, thanks for the reply. Somehow I didn't get any notification for some of the comments in this post. @dimension10, @polarkernel, has this happened before?

@JiaYiyang This is a known bug, see here

@dimension10, ok, thanks for the explanation.

1 Answer

+ 4 like - 0 dislike
The analogous non perturbative thermodynamic theorem is the convexity of the free energy, and the analogous phenomenon is the Maxwell construction. When you have a first order phase transition, the region of coexistence is the flat part of the free-energy function, where the unperturbed pure-phase free energy has the wrong convexity property. The behavior there is completely different from the pure-phase region. The two phases are segregated in bulk, with a phase boundary. The interpolating construction fixes up the nonperturbative free energy to be convex, and there is an additional free energy cost in the middle part from the phase-boundary, it just scales as the area and not the volume, and vanishes in the thermodynamic limit.

In field theory, for scalar $\phi^4$, it's not an analogy, it's the exact same thing. The intermediate region in the phase diagram as a function of the field parameter $\hat\phi$ in the effective potential consists of the two phases separated by domain wall. The domain wall has energy, just with subleading scaling as the area, so it doesn't contribute, so that you have a Maxwell style convex free energy graph which is only qualitatively similar to the perturbation theory free energy at the far ends, and it has a totally flat bottom.

In the Mexican hat case of broken continuous symmetry, the regions of intermediate $\hat{\phi}$ are also described by various superpositions/probabilistic-mixtures of non-uniform fields (depending on whether you are quantum or Euclidean), and this restores convexity, although now, there are field gradients all over. The parameter in the effective potential $\hat{\phi}$ is a constraint telling you what proportion of different vacua are included. This constraint is not physical, and at an actual value, at the minimum value of the effective potential, you are close to the edge. This is where you have the empty vacua we are always interested in perturbing around. The flattened out middle part doesn't describe the same thing.
answered Feb 5, 2015 by Ron Maimon (7,730 points) [ no revision ]
Thanks! I suppose only the last paragraph is directly addressing my question? Then it looks quite succinct, let me think about it and see if I can formulate some reasonable questions about it.
Ok, so you are saying in the middle of the bottom region, the quantum state corresponding to that VEV is not translation-invariant, hence cannot be interpreted as vacuum? How can I see this is the case?
@JiaYiyang: It's not for sure non-translationally invariant--- the Maxwell construction states can be translation invariant (despite intuition) by a probabilistic superposition of all possible positions for the domain wall, there's no gravity to determine the position of the wall in the box. In the infinite volume limit, the Maxwell construction is a superposition of different non-homogenous states separated by domain walls, and each different configuration of domain wall can't globally fluctuate into another one. That's the statement that each vacuum is a separate superselection sector at infinite volume. The fixed superselection sectors are not homogenous and translation invariant. I only know what they look like for $\phi^4$, and I really am not sure what they look like for the case of the Mexican hat, I was just vaguely pretending to simulate the statistical theory in my head, and I would guess for the circle it's a superposition of soups of flabby vortex lines with net statistical preference for one vaccum or another, depending on the value of $\hat\phi$, or some flabby field gradients which are mostly vaccum, but with some filaments separating different phases for higher-d mexican hats.

I'm confused again, then why is a VEV in the intermediate region of the bottom not good?  I mean, the superposed state is translation-invariant and has the minimal energy,  it looks like a good enough definition for a vacuum?

 I only know what they look like for ϕ4, and I really am not sure what they look like for the case of the Mexican hat

emm, are we talking about the same thing? Isn't Mexican hat described by $V(\phi)=-\frac{1}{2}\mu^2 \phi^2+\lambda \phi^4$? What the "$\phi^4$" are you having in mind?

(Ok, I got silly, I forgot Mexican hat had two directions in field space, I'll change the question to the simplest $\phi^4$)

@Jia: No need to change the question --- it's pretty much the same idea in all the cases.

@Ron, so what do you think about the first question:

I'm confused again, then why is a VEV in the intermediate region of the bottom not good?  I mean, the superposed state is translation-invariant and has the minimal energy,  it looks like a good enough definition for a vacuum?  

@JiaYiyang: It doesn't have minimum energy really, as you aren't taking the subleading-scaling energy in the strings/domain-walls or whatever gradients into account. Only the energy density is the same as a real edge vacuum.
@RonMaimon, emm, ok. Now I see the logic is consistent if I take your words, I'll read on some statistical field theory material to learn and verify what you said, e.g. domain wall, scaling etc. (Any suggestion of reference?)

Wait, after some thought, I'm not quite convinced that 

 It doesn't have minimum energy really... Only the energy density is the same as a real edge vacuum.

For simplicity let's consider scalar phi 4,suppose we've found the vacuum states $|VAC\pm\rangle$ corresponding to, say, $\langle \phi \rangle=\pm 1$, which are the two degenerate edge vacua. Then all linear combinations $\sin \theta |VAC+\rangle+\cos\theta |VAC-\rangle$ would have the same energy(not energy density), but now the VEV becomes $\sin ^2 \theta - \cos^2\theta$(cross terms vanish by superselection?) Now VEV can take any value in $[-1, 1]$, without any change in energy(again, not energy density), and the state surely is still translation-invariant since $|VAC\pm\rangle$ individually are. What's wrong with this simple deduction?  

@JiaYiyang: Yeah, I think you might be right, the intermediate vacua in the middle are probabilistic superpositions of the different edge vacua, not Maxwell construction things with domain-walls and intermediate gradients--- there's no global obstruction to making the superposition global. But it's still nonphysical stuff in the middle region, but I'll update the answer after thinking about it.
@JiaYiyang I'm trying to follow the discussion, but I'm getting lost. How the vacuum state is going to be a linear combinations of VAC+ and VAC- if the the potential wall between them is infinite (in the infinite volume limit) ? This is QFT, not QM. Each VEV defines a superselection sector. In the case of a continuous symmetry, it's the same thing plus the fact that each vacuum contains an infinite number of goldstones of any other vacuum (in the infinite volume limit) and every pair of vacuum states is perpendicular each other. Probably I'm totally lost (?)
@drake,an infinite potential prevents $VAC+$ and $VAC-$ transiting to each other, but how can it prevent the linear combination being an initial condition?
@JiaYiyang: The initial condition of a superposition of global vacua is, relative to any observer, just one or another of the vacua with a probability as the square of the amplitude. It's not a real superposition, as you can never have interference. The two vacua can't talk to each other, because they are separate superselection sectors, no local perturbation can take you from one to the other. I'll put that in the modified answer--- I was ignoring these ridiculous superposed vacua, I screwed up.
@JiaYiyang Well, it prevents the transition from any state built on VAC+  to any state built on VAC- : they are separated superselection sectors. There aren't physical superpositions of both sectors for the same reason that there aren't superposition of states with different electric charge, or superpositions of bosons and fermions.
@drake, @RonMaimon, then I need elaboration. On the surface the situation seems quite "symmetric", I mean, I can start with $\{VAC+\rangle, |VAC-\rangle\}$ as a set of superselected states, but I can also start with, say, $\{VAC+\rangle+|VAC-\rangle ,|VAC+\rangle-|VAC-\rangle\}$ as a set of superselected states, I don't see why I should prefer one over the other. Weinberg in Vol2 of his QFT argues in presence of external perturbation such as $\phi^3(x)$, the first choice is prefered. However, this reasoning is rather dubious to me, after all the inability to transit between superselected states has been the key in discussions regarding SSB, but physically turning on perturbations such as $\phi^3(x)$ trespasses superselections(Haag theorem), so Weinberg's argument looks like a double standard to me. Though I don't know if some argument along the same line can be made while circumventing Haag theorem.
@JiaYiyang--- the rotated vacuum states are not symmetrical with the unrotated ones with a fixed value of the average $\phi$--- the local operator $\phi$ (or a smeared version) ha a definite eigenstate only in the proper one-value vacuum, and $\phi$ is a local observable you can actually measure, while the rotated operator whose eigenvalus are superpositions of vacua with different values of $\phi$ is not a local operator, it can only be measured globally using the whole space. There is no symmetry in quantum superposing vacua, only the usual vacua are the proper things, the preferred basis is from the local field operators you have in the theory, which defines which local measurements are allowed.
@RonMaimon, ok, now I'm almost convinced. Indeed $\{VAC+\rangle, |VAC-\rangle\}$ is a superselcted pair while $\{VAC+\rangle+|VAC-\rangle ,|VAC+\rangle-|VAC-\rangle\}$ is not, I was wrong in my last comment. Now the only thing that still puzzles me a bit is, in general how can I see the superselected vacua must be represented by the edges of the bottom of effective potential? Or equivalently, any vacuum represented by the intermediate region must not be part of a superselected basis?
@JiaYiyang: The effective action is not just a function of the global real parameter $\hat\phi$, it is a functional of the classical parameter field $\hat\phi(x)$, it's the functional Legendre transform of the free energy as a functional of $J(x)$. The global parameter $\hat\phi$ is not the thing that you vary to get local correlation functions, its just the global value. You vary $\hat\phi(x)$ locally. Under local variations, the effective potential is infinitely differentiable, and the one-particle-irreducible (1PI) correlation functions which obey cluster decomposition are those associated with one or another edge vacuum. There are two conditions--- first the global minimum energy condition $\delta \Gamma = 0$, saying the the variation of the effective potential is minimum at the global value of $\hat\phi$, and second the local condition $g(x) * g(x')* ... \delta_x\delta_y... \Gamma$, where the g's are test function and * is convolution, smeared local variations must converge to the appropriate smeared 1PI correlation functions obeying cluster decomposition. The cluster decomposition property of the correlation functions is what ensures you are looking at the S-matrix of one specific vacuum not the unphysical superposition of many vacua. I don't know how to write this formally very well, though, I'll think about it.

I'm still kinda fixated on this question. I think if somehow we can make the analogy toward Gibbs free energy of phase transitions, we may still sensibly argue VEV should be at the edge: in this analogy, superselected vacua should correspond to pure phases, while maybe linear superpositions of superselected vacuua should correspond to coexisting phases. Then when one tries to goes across the phase transition lines, it would be expected that we first start by hitting the coexisting phase with 0 mixture, then gradually to the ones with finite mixture, then enter the zone of 0 mixture again, for example: water-->water vapour mixture-->vapour; it would be just very weird to suddenly hit a point of finite mixture at the very beginning, then at some point in the middle it becomes pure again.

This is not yet a well thought of argument, for example the immediate flaw I can think of would be that the correspondence to Gibbs free energy can't be exact. To be exact, one must have a path integral over periodic boundary conditions, but the effective potential we are talking about really is derived from the partition function with vacuum as fixed boundary conditions, not integrated over.

Your answer

Please use answers only to (at least partly) answer questions. To comment, discuss, or ask for clarification, leave a comment instead.
To mask links under text, please type your text, highlight it, and click the "link" button. You can then enter your link URL.
Please consult the FAQ for as to how to format your post.
This is the answer box; if you want to write a comment instead, please use the 'add comment' button.
Live preview (may slow down editor)   Preview
Your name to display (optional):
Privacy: Your email address will only be used for sending these notifications.
Anti-spam verification:
If you are a human please identify the position of the character covered by the symbol $\varnothing$ in the following word:
p$\hbar$ysicsOverf$\varnothing$ow
Then drag the red bullet below over the corresponding character of our banner. When you drop it there, the bullet changes to green (on slow internet connections after a few seconds).
Please complete the anti-spam verification




user contributions licensed under cc by-sa 3.0 with attribution required

Your rights
...