This answer is just an expanding version of Kconrad answer's. I am posting it here because this argument support the variational method for finite state space and also touch in the observation made by Kconrad about a technical issue about boundary values of the variational approach.
Proposition: Suppose Ω non empty finite set and let M denote the set of the probability measures on Ω then
supμ∈M[h(μ)−∫ΩUdμ]=logZ
moreover, the supremum is attained for the measure
μ given by
μ({ω})=1Ze−U(ω).
Proof:
Let be
n the cardinality of
Ω. Define the function
f:Rn+→R by
f(x1,…,xn)=−n∑i=1[xilogxi+Kixi],
where
Ki∈R for all
i∈{1,…,n}. Consider the function
g:Rn+→R given by
g(x1,…,xn)=n∑i=1xi.
We fix an enumeration for
Ω and let be
Ki=U(ωi). So the following optimization problem
supμ∈M[h(μ)−∫ΩUdμ]
can be solved by finding a maximum for
f restricted to
g−1(1). Note that for any critical point
(x1,…,xn) of
f in
(0,∞)n∩g−1(1), it follows from the Lagrange Multipliers Theorem's that
∇f(x1,…,xn)=λ∇g(x1,…,xn)
for some
λ∈R, i.e.,
−(logxi+1+Ki)=λ, for all i=1,…,n.
So for any pairs of index
i,j∈{1,…,n}, we have
logxi+Ki=logxj+Kj
taking the exponentials it follows that
xieKi=xjeKj.
Using that
∑ni=1xi=1 and the above identities, we have
xie−Ki=[1−∑j∈{1,…,n}∖{i}xj]e−Ki
=e−Ki−∑j∈{1,…,n}∖{i}xje−Ki
So
xie−Ki=e−Ki−xi∑j∈{1,…,n}∖{i}e−Kj.
Explicting
xi, we show that all critical points of
f in
(0,∞)n∩g−1(1) are given by (here there is just one)
xi=e−Ki∑nj=1e−Kj.
The image of
f at this point is given by
−n∑i=1[(e−Ki∑nj=1e−Kj)log(e−Ki∑nj=1e−Kj)+Ki(e−Ki∑nj=1e−Kj)]=log(n∑j=1e−Kj)
to see that
(x1,…,xn) is local maximum we can compute the Hessian and check that it is negative definite at this point.
To show that the point is global maximum point, we can compare the image of
f at this point, with the value of
f in any point of the set
∂(0,∞)n∩g−1(1).
The restriction of
f to this set is given by
f(x1,…,xn)=−∑i∈{1,…,n}∖I[xilogxi+Kixi]
Where
I⊂{1,…,n} is a index set such that
|I|≥1 e
xi=0 para todo
i∈I .
We define
fI which is a function of
n−|I| variables. It is maximum point can be determined in the same way and therefore we have that the max of
fI is
log(∑j∈{1,…,n}∖Ie−Kj)
which is less than
log(n∑j=1e−Kj).
Repeating this argument at most
n times we conclude that maximum of
f restricted to
g−1(1)∩Rn+, is not attained in the boundary.
This post imported from StackExchange MathOverflow at 2015-08-19 09:28 (UTC), posted by SE-user Leandro