Trace inequality#Von Neumann's trace inequality
{{Short description|Concept in Hlibert spaces mathematics}}
In mathematics, there are many kinds of inequalities involving matrices and linear operators on Hilbert spaces. This article covers some important operator inequalities connected with traces of matrices.E. Carlen, Trace Inequalities and Quantum Entropy: An Introductory Course, Contemp. Math. 529 (2010) 73–140 {{doi|10.1090/conm/529/10428}}R. Bhatia, Matrix Analysis, Springer, (1997).B. Simon, Trace Ideals and their Applications, Cambridge Univ. Press, (1979); Second edition. Amer. Math. Soc., Providence, RI, (2005).M. Ohya, D. Petz, Quantum Entropy and Its Use, Springer, (1993).
Basic definitions
Let denote the space of Hermitian matrices, denote the set consisting of positive semi-definite Hermitian matrices and denote the set of positive definite Hermitian matrices. For operators on an infinite dimensional Hilbert space we require that they be trace class and self-adjoint, in which case similar definitions apply, but we discuss only matrices, for simplicity.
For any real-valued function on an interval one may define a matrix function for any operator with eigenvalues in by defining it on the eigenvalues and corresponding projectors as
given the spectral decomposition
=Operator monotone=
{{Main|Operator monotone function}}
A function defined on an interval is said to be operator monotone if for all and all with eigenvalues in the following holds,
where the inequality means that the operator is positive semi-definite. One may check that is, in fact, not operator monotone!
=Operator convex=
A function is said to be operator convex if for all and all with eigenvalues in and , the following holds
Note that the operator has eigenvalues in since and have eigenvalues in
A function is {{visible anchor|operator concave}} if is operator convex;=, that is, the inequality above for is reversed.
={{anchor|Joint_convexity_function_2016_10}}Joint convexity=
A function defined on intervals is said to be {{visible anchor|jointly convex}} if for all and all
with eigenvalues in and all with eigenvalues in and any the following holds
A function is {{visible anchor|jointly concave}} if − is jointly convex, i.e. the inequality above for is reversed.
=Trace function=
Given a function the associated trace function on is given by
where has eigenvalues and stands for a trace of the operator.
Convexity and monotonicity of the trace function
Löwner–Heinz theorem
For , the function is operator monotone and operator concave.
For , the function is operator monotone and operator concave.
For , the function is operator convex. Furthermore,
: is operator concave and operator monotone, while
: is operator convex.
The original proof of this theorem is due to K. Löwner who gave a necessary and sufficient condition for {{mvar|f}} to be operator monotone.{{cite journal | last=Löwner | first=Karl | title=Über monotone Matrixfunktionen | journal=Mathematische Zeitschrift | publisher=Springer Science and Business Media LLC | volume=38 | issue=1 | year=1934 | issn=0025-5874 | doi=10.1007/bf01170633 | pages=177–216 | s2cid=121439134 | language=de}} An elementary proof of the theorem is discussed in and a more general version of it in.W.F. Donoghue, Jr., Monotone Matrix Functions and Analytic Continuation, Springer, (1974).
{{anchor|Klein2016_10}}Klein's inequality
For all Hermitian {{mvar|n}}×{{mvar|n}} matrices {{mvar|A}} and {{mvar|B}} and all differentiable convex functions
with derivative {{math|f ' }}, or for all positive-definite Hermitian {{mvar|n}}×{{mvar|n}} matrices {{mvar|A}} and {{mvar|B}}, and all differentiable
convex functions {{mvar|f}}:(0,∞) → , the following inequality holds,
{{Equation box 1
|indent =:
|equation =
|cellpadding= 6
|border
|border colour = #0073CF
|bgcolor=#F9FFF7}}
In either case, if {{mvar|f}} is strictly convex, equality holds if and only if {{mvar|A}} = {{mvar|B}}.
A popular choice in applications is {{math|f(t) {{=}} t log t}}, see below.
=Proof=
Let so that, for ,
:,
varies from to .
Define
:.
By convexity and monotonicity of trace functions, is convex, and so for all ,
:,
which is,
:,
and, in fact, the right hand side is monotone decreasing in .
Taking the limit yields,
:,
which with rearrangement and substitution is Klein's inequality:
:
Note that if is strictly convex and , then is strictly convex. The final assertion follows from this and the fact that is monotone decreasing in .
Golden–Thompson inequality
{{main|Golden–Thompson inequality}}
In 1965, S. Golden {{cite journal | last=Golden | first=Sidney | title=Lower Bounds for the Helmholtz Function | journal=Physical Review | publisher=American Physical Society (APS) | volume=137 | issue=4B | date=1965-02-22 | issn=0031-899X | doi=10.1103/physrev.137.b1127 | pages=B1127–B1128| bibcode=1965PhRv..137.1127G }} and C.J. Thompson {{cite journal | last=Thompson | first=Colin J. | title=Inequality with Applications in Statistical Mechanics | journal=Journal of Mathematical Physics | publisher=AIP Publishing | volume=6 | issue=11 | year=1965 | issn=0022-2488 | doi=10.1063/1.1704727 | pages=1812–1813| bibcode=1965JMP.....6.1812T }} independently discovered that
For any matrices ,
:
This inequality can be generalized for three operators:{{cite journal | last=Lieb | first=Elliott H | title=Convex trace functions and the Wigner-Yanase-Dyson conjecture | journal=Advances in Mathematics | volume=11 | issue=3 | year=1973 | issn=0001-8708 | doi=10.1016/0001-8708(73)90011-x | doi-access=free | pages=267–288| url=http://www.numdam.org/item/RCP25_1973__19__A4_0/ }} for non-negative operators ,
:
Peierls–Bogoliubov inequality
Let be such that Tr eR = 1.
Defining {{math|g {{=}} Tr FeR}}, we have
:
The proof of this inequality follows from the above combined with Klein's inequality. Take {{math|f(x) {{=}} exp(x), A{{=}}R + F, and B {{=}} R + gI}}.D. Ruelle, Statistical Mechanics: Rigorous Results, World Scient. (1969).
Gibbs variational principle
Let be a self-adjoint operator such that is trace class. Then for any with
:
with equality if and only if
Lieb's concavity theorem
The following theorem was proved by E. H. Lieb in. It proves and generalizes a conjecture of E. P. Wigner, M. M. Yanase, and Freeman Dyson.{{cite journal | last1=Wigner | first1=Eugene P. | last2=Yanase | first2=Mutsuo M. | title=On the Positive Semidefinite Nature of a Certain Matrix Expression | journal=Canadian Journal of Mathematics | publisher=Canadian Mathematical Society | volume=16 | year=1964 | issn=0008-414X | doi=10.4153/cjm-1964-041-x | pages=397–406| s2cid=124032721 }} Six years later other proofs were given by T. Ando {{cite journal | last=Ando | first=T. | title=Concavity of certain maps on positive definite matrices and applications to Hadamard products | journal=Linear Algebra and Its Applications | publisher=Elsevier BV | volume=26 | year=1979 | issn=0024-3795 | doi=10.1016/0024-3795(79)90179-4 | pages=203–241| doi-access=free }} and B. Simon, and several more have been given since then.
For all matrices , and all and such that and , with the real valued map on given by
:
F(A,B,K) = \operatorname{Tr}(K^*A^qKB^r)
- is jointly concave in
- is convex in .
Here stands for the adjoint operator of
Lieb's theorem
For a fixed Hermitian matrix , the function
:
is concave on .
The theorem and proof are due to E. H. Lieb, Thm 6, where he obtains this theorem as a corollary of Lieb's concavity Theorem.
The most direct proof is due to H. Epstein;{{cite journal | last=Epstein | first=H. | title=Remarks on two theorems of E. Lieb | journal=Communications in Mathematical Physics | publisher=Springer Science and Business Media LLC | volume=31 | issue=4 | year=1973 | issn=0010-3616 | doi=10.1007/bf01646492 | pages=317–325| bibcode=1973CMaPh..31..317E | s2cid=120096681 | url=http://projecteuclid.org/euclid.cmp/1103859039 }} see M.B. Ruskai papers,{{cite journal | last=Ruskai | first=Mary Beth | title=Inequalities for quantum entropy: A review with conditions for equality | journal=Journal of Mathematical Physics | publisher=AIP Publishing | volume=43 | issue=9 | year=2002 | issn=0022-2488 | doi=10.1063/1.1497701 | pages=4358–4375| arxiv=quant-ph/0205064 | bibcode=2002JMP....43.4358R | s2cid=3051292 }}{{cite journal | last=Ruskai | first=Mary Beth | title=Another short and elementary proof of strong subadditivity of quantum entropy | journal=Reports on Mathematical Physics | publisher=Elsevier BV | volume=60 | issue=1 | year=2007 | issn=0034-4877 | doi=10.1016/s0034-4877(07)00019-5 | pages=1–12| arxiv=quant-ph/0604206 | bibcode=2007RpMP...60....1R | s2cid=1432137 }} for a review of this argument.
Ando's convexity theorem
T. Ando's proof of Lieb's concavity theorem led to the following significant complement to it:
For all matrices , and all and with , the real valued map on given by
:
is convex.
{{anchor|Joint_convexity_2016_10}}Joint convexity of relative entropy
For two operators define the following map
:
For density matrices and , the map is the Umegaki's quantum relative entropy.
Note that the non-negativity of follows from Klein's inequality with .
=Statement=
The map is jointly convex.
=Proof=
For all , is jointly concave, by Lieb's concavity theorem, and thus
:
is convex. But
:
and convexity is preserved in the limit.
Jensen's operator and trace inequalities
The operator version of Jensen's inequality is due to C. Davis.C. Davis, A Schwarz inequality for convex operator functions, Proc. Amer. Math. Soc. 8, 42–44, (1957).
A continuous, real function on an interval satisfies Jensen's Operator Inequality if the following holds
:
for operators with and for self-adjoint operators with spectrum on .
See,{{cite journal | last1=Hansen | first1=Frank | last2=Pedersen | first2=Gert K. | title=Jensen's Operator Inequality | journal=Bulletin of the London Mathematical Society | volume=35 | issue=4 | date=2003-06-09 | issn=0024-6093 | doi=10.1112/s0024609303002200 | pages=553–564|arxiv=math/0204049| s2cid=16581168 }} for the proof of the following two theorems.
=Jensen's trace inequality=
Let {{mvar|f}} be a continuous function defined on an interval {{mvar|I}} and let {{mvar|m}} and {{mvar|n}} be natural numbers. If {{mvar|f}} is convex, we then have the inequality
:
for all ({{mvar|X}}1, ... , {{mvar|X}}n) self-adjoint {{mvar|m}} × {{mvar|m}} matrices with spectra contained in {{mvar|I}} and
all ({{mvar|A}}1, ... , {{mvar|A}}n) of {{mvar|m}} × {{mvar|m}} matrices with
:
Conversely, if the above inequality is satisfied for some {{mvar|n}} and {{mvar|m}}, where {{mvar|n}} > 1, then {{mvar|f}} is convex.
=Jensen's operator inequality=
For a continuous function defined on an interval the following conditions are equivalent:
- is operator convex.
- For each natural number we have the inequality
:
for all bounded, self-adjoint operators on an arbitrary Hilbert space with
spectra contained in and all on with
- for each isometry on an infinite-dimensional Hilbert space and
every self-adjoint operator with spectrum in .
- for each projection on an infinite-dimensional Hilbert space , every self-adjoint operator with spectrum in and every in .
Araki–Lieb–Thirring inequality
{{distinguish|text=the Lieb–Thirring inequality}}
E. H. Lieb and W. E. Thirring proved the following inequality in E. H. Lieb, W. E. Thirring, Inequalities for the Moments of the Eigenvalues of the Schrödinger Hamiltonian and Their Relation to Sobolev Inequalities, in Studies in Mathematical Physics, edited E. Lieb, B. Simon, and A. Wightman, Princeton University Press, 269–303 (1976). 1976: For any and
In 1990 {{cite journal | last=Araki | first=Huzihiro | title=On an inequality of Lieb and Thirring | journal=Letters in Mathematical Physics | publisher=Springer Science and Business Media LLC | volume=19 | issue=2 | year=1990 | issn=0377-9017 | doi=10.1007/bf01045887 | pages=167–170| bibcode=1990LMaPh..19..167A | s2cid=119649822 }} H. Araki generalized the above inequality to the following one: For any and
for and
for
There are several other inequalities close to the Lieb–Thirring inequality, such as the following:Z. Allen-Zhu, Y. Lee, L. Orecchia, Using Optimization to Obtain a Width-Independent, Parallel, Simpler, and Faster Positive SDP Solver, in ACM-SIAM Symposium on Discrete Algorithms, 1824–1831 (2016). for any and
and even more generally:L. Lafleche, C. Saffirio, Strong Semiclassical Limit from Hartree and
Hartree-Fock to Vlasov-Poisson Equation, arXiv:2003.02926 [math-ph]. for any and
The above inequality generalizes the previous one, as can be seen by exchanging by and by with and using the cyclicity of the trace, leading to
Additionally, building upon the Lieb-Thirring inequality the following inequality was derived: V. Bosboom, M. Schlottbom, F. L. Schwenninger, On the unique solvability of radiative transfer equations with polarization, in Journal of Differential Equations, (2024). For any and all with , it holds that
Effros's theorem and its extension
If is an operator convex function, and and are commuting bounded linear operators, i.e. the commutator , the perspective
:
is jointly convex, i.e. if and with (i=1,2), ,
:
Ebadian et al. later extended the inequality to the case where and do not commute .{{cite journal | last1=Ebadian | first1=A. | last2=Nikoufar | first2=I. | last3=Eshaghi Gordji | first3=M. | title=Perspectives of matrix convex functions | journal=Proceedings of the National Academy of Sciences | publisher=Proceedings of the National Academy of Sciences USA| volume=108 | issue=18 | date=2011-04-18 | issn=0027-8424 | doi=10.1073/pnas.1102518108| pmc=3088602 | pages=7313–7314| bibcode=2011PNAS..108.7313E | doi-access=free }}
See also
- {{annotated link|Lieb–Thirring inequality}}
- {{annotated link|Schur–Horn theorem}}
- {{annotated link|Trace identity}}
- {{annotated link|von Neumann entropy}}
References
{{reflist}}
- [http://www.scholarpedia.org/article/Matrix_and_Operator_Trace_Inequalities#Gibbs_variational_principle Scholarpedia] primary source.