Generalizations of Pauli matrices

{{Short description|Families of matrices in mathematics, physics, and quantum information}}

In mathematics and physics, in particular quantum information, the term generalized Pauli matrices refers to families of matrices which generalize the (linear algebraic) properties of the Pauli matrices. Here, a few classes of such matrices are summarized.

Multi-qubit Pauli matrices (Hermitian)

This method of generalizing the Pauli matrices refers to a generalization from a single 2-level system (qubit) to multiple such systems. In particular, the generalized Pauli matrices for a group of N qubits is just the set of matrices generated by all possible products of Pauli matrices on any of the qubits.{{cite journal |last1=Brown |first1=Adam R. |last2=Susskind |first2=Leonard |title=Second law of quantum complexity |journal=Physical Review D |date=2018-04-25 |volume=97 |issue=8 |pages=086015 |doi=10.1103/PhysRevD.97.086015|arxiv=1701.01107 |bibcode=2018PhRvD..97h6015B |s2cid=119199949 }}

The vector space of a single qubit is V_1 = \mathbb{C}^2 and the vector space of N qubits is V_N = \left(\mathbb{C}^2\right)^{\otimes N}\cong \mathbb{C}^{2^N}. We use the tensor product notation

:\sigma_a^{(n)} = I^{(1)} \otimes \dotsm \otimes I^{(n-1)} \otimes \sigma_a \otimes I^{(n+1)} \otimes \dotsm \otimes I^{(N)}, \qquad a = 1, 2, 3

to refer to the operator on V_N that acts as a Pauli matrix on the nth qubit and the identity on all other qubits. We can also use a = 0 for the identity, i.e., for any n we use \sigma_0^{(n)} = \bigotimes_{m=1}^N I^{(m)}. Then the multi-qubit Pauli matrices are all matrices of the form

:\sigma_{\,\vec a} := \prod_{n=1}^N \sigma_{a_n}^{(n)} = \sigma_{a_1} \otimes \dotsm \otimes \sigma_{a_N}, \qquad \vec{a} = (a_1, \ldots, a_N) \in \{0, 1, 2, 3\}^{\times N},

i.e., for \vec{a} a vector of integers between 0 and 4. Thus there are 4^N such generalized Pauli matrices if we include the identity I = \bigotimes_{m=1}^N I^{(m)} and 4^N - 1 if we do not.

= Notations =

In quantum computation, it is conventional to denote the Pauli matrices with single upper case letters

:I \equiv \sigma_0, \qquad X \equiv \sigma_1, \qquad Y \equiv \sigma_2, \qquad Z \equiv \sigma_3.

This allows subscripts on Pauli matrices to indicate the qubit index. For example, in a system with 3 qubits,

:X_1 \equiv X \otimes I \otimes I, \qquad Z_2 \equiv I \otimes Z \otimes I.

Multi-qubit Pauli matrices can be written as products of single-qubit Paulis on disjoint qubits. Alternatively, when it is clear from context, the tensor product symbol \otimes can be omitted, i.e. unsubscripted Pauli matrices written consecutively represents tensor product rather than matrix product. For example:

:XZI \equiv X_1Z_2 = X \otimes Z \otimes I.

Higher spin matrices (Hermitian)

The traditional Pauli matrices are the matrix representation of the \mathfrak{su}(2) Lie algebra generators J_x, J_y, and J_z in the 2-dimensional irreducible representation of SU(2), corresponding to a spin-1/2 particle. These generate the Lie group SU(2).

For a general particle of spin s=0,1/2,1,3/2,2,\ldots, one instead utilizes the 2s+1-dimensional irreducible representation.

{{main article|Spin (physics)#Higher spins}}

Generalized Gell-Mann matrices (Hermitian)

This method of generalizing the Pauli matrices refers to a generalization from 2-level systems (Pauli matrices acting on qubits) to 3-level systems (Gell-Mann matrices acting on qutrits) and generic d-level systems (generalized Gell-Mann matrices acting on qudits).

= Construction =

Let E_{jk} be the matrix with 1 in the {{math|jk}}-th entry and 0 elsewhere. Consider the space of d\times d complex matrices, \Complex^{d\times d}, for a fixed d.

Define the following matrices,

: f_{k,j}^{\,\,\,\,\, d} =

\begin{cases}E_{kj} + E_{jk} & {\text{for }}k < j,\\

-i(E_{jk} - E_{kj})&{\text{for }} k > j.\end{cases}

and

: h_{k}^{\,\,\, d} =

\begin{cases}I_d & {\text{for }} k = 1,\\

h_{k}^{\,\,\, d-1} \oplus 0 &{\text{for }} 1 < k < d, \\

\sqrt{\tfrac{2}{d(d - 1)}} \left( h_1^{d-1} \oplus (1 - d)\right) = \sqrt{\tfrac{2}{d(d - 1)}} \left( I_{d-1} \oplus (1 - d)\right) &{\text{for }} k = d

\end{cases}

The collection of matrices defined above without the identity matrix are called the generalized Gell-Mann matrices, in dimension d.{{Cite journal | doi = 10.1016/S0375-9601(03)00941-1| title = The Bloch vector for N-level systems| journal = Physics Letters A| volume = 314| issue = 5–6| pages = 339–349| year = 2003| last1 = Kimura | first1 = G. |arxiv = quant-ph/0301152 |bibcode = 2003PhLA..314..339K | s2cid = 119063531}}{{Cite journal

| doi = 10.1088/1751-8113/41/23/235303

| issn = 1751-8121

| last = Bertlmann | first = Reinhold A.

| author2 = Philipp Krammer

| title = Bloch vectors for qudits

| journal = Journal of Physics A: Mathematical and Theoretical

| volume = 41 | issue = 23 | pages = 235303

| date = 2008-06-13

| bibcode = 2008JPhA...41w5303B

| arxiv = 0806.1174 | s2cid = 118603188

}} The symbol ⊕ (utilized in the Cartan subalgebra above) means matrix direct sum.

The generalized Gell-Mann matrices are Hermitian and traceless by construction, just like the Pauli matrices. One can also check that they are orthogonal in the Hilbert–Schmidt inner product on \Complex^{d\times d}. By dimension count, one sees that they span the vector space of d\times d complex matrices, \mathfrak{gl}(d,\Complex). They then provide a Lie-algebra-generator basis acting on the fundamental representation of \mathfrak{su}(d).

In dimensions d = 2 and 3, the above construction recovers the Pauli and Gell-Mann matrices, respectively.

Sylvester's generalized Pauli matrices (non-Hermitian)

A particularly notable generalization of the Pauli matrices was constructed by James Joseph Sylvester in 1882.Sylvester, J. J., (1882), Johns Hopkins University Circulars I: 241-242; ibid II (1883) 46;

ibid III (1884) 7–9. Summarized in The Collected Mathematics Papers of James Joseph Sylvester (Cambridge University Press, 1909) v III .

[http://quod.lib.umich.edu/u/umhistmath/aas8085.0003.001/664?rgn=full+text;view=pdf;q1=nonions online] and [http://quod.lib.umich.edu/u/umhistmath/AAS8085.0004.001/165?cite1=Sylvester;cite1restrict=author;rgn=full+text;view=pdf further].

These are known as "Weyl–Heisenberg matrices" as well as "generalized Pauli matrices".{{Cite journal|last=Appleby|first=D. M.|date=May 2005|title=Symmetric informationally complete–positive operator valued measures and the extended Clifford group|url=http://aip.scitation.org/doi/10.1063/1.1896384|journal=Journal of Mathematical Physics|language=en|volume=46|issue=5|pages=052107|arxiv=quant-ph/0412001|bibcode=2005JMP....46e2107A|doi=10.1063/1.1896384|issn=0022-2488}}{{Cite journal|last1=Howard|first1=Mark|last2=Vala|first2=Jiri|date=2012-08-15|title=Qudit versions of the qubit π / 8 gate|url=https://link.aps.org/doi/10.1103/PhysRevA.86.022316|journal=Physical Review A|language=en|volume=86|issue=2|pages=022316|arxiv=1206.1598|bibcode=2012PhRvA..86b2316H|doi=10.1103/PhysRevA.86.022316|s2cid=56324846|issn=1050-2947}}

= Framing =

The Pauli matrices \sigma _1 and \sigma _3 satisfy the following:

:

\sigma_1^2 = \sigma_3^2 = I, \quad \sigma_1 \sigma_3 = - \sigma_3 \sigma_1 = e^{\pi i} \sigma_3 \sigma_1.

The so-called Walsh–Hadamard conjugation matrix is

:

W = \frac{1}{\sqrt{2}}

\begin{bmatrix}

1 & 1 \\ 1 & -1

\end{bmatrix}.

Like the Pauli matrices, W is both Hermitian and unitary. \sigma_1, \; \sigma_3 and W satisfy the relation

:\; \sigma_1 = W \sigma_3 W^* .

The goal now is to extend the above to higher dimensions, d.

= Construction: The clock and shift matrices =

{{see also |Generalized Clifford algebra}}

Fix the dimension d as before. Let \omega = \exp(2 \pi i / d), a root of unity. Since \omega^d = 1 and \omega \neq 1, the sum of all roots annuls:

:1 + \omega + \cdots + \omega ^{d-1} = 0 .

Integer indices may then be cyclically identified mod {{mvar|d}}.

Now define, with Sylvester, the shift matrix

:

\Sigma _1 =

\begin{bmatrix}

0 & 0 & 0 & \cdots & 0 & 1\\

1 & 0 & 0 & \cdots & 0 & 0\\

0 & 1 & 0 & \cdots & 0 & 0\\

0 & 0 & 1 & \cdots & 0 & 0\\

\vdots & \vdots & \vdots & \ddots &\vdots &\vdots\\

0 & 0 & 0 & \cdots & 1 & 0\\

\end{bmatrix}

and the clock matrix,

:

\Sigma _3 =

\begin{bmatrix}

1 & 0 & 0 & \cdots & 0\\

0 & \omega & 0 & \cdots & 0\\

0 & 0 & \omega^2 & \cdots & 0\\

\vdots & \vdots & \vdots & \ddots & \vdots\\

0 & 0 & 0 & \cdots & \omega^{d-1}

\end{bmatrix}.

These matrices generalize \sigma_1'' and \sigma_3, respectively.

Note that the unitarity and tracelessness of the two Pauli matrices is preserved, but not Hermiticity in dimensions higher than two. Since Pauli matrices describe quaternions, Sylvester dubbed the higher-dimensional analogs "nonions", "sedenions", etc.

These two matrices are also the cornerstone of quantum mechanical dynamics in finite-dimensional vector spacesWeyl, H., "Quantenmechanik und Gruppentheorie", Zeitschrift für Physik, 46 (1927) pp. 1–46, {{doi|10.1007/BF02055756}}.Weyl, H., The Theory of Groups and Quantum Mechanics (Dover, New York, 1931){{Cite journal | last1 = Santhanam | first1 = T. S. | last2 = Tekumalla | first2 = A. R. | doi = 10.1007/BF00715110 | title = Quantum mechanics in finite dimensions | journal = Foundations of Physics | volume = 6 | issue = 5 | pages = 583 | year = 1976 |bibcode = 1976FoPh....6..583S | s2cid = 119936801 }} as formulated by Hermann Weyl, and they find routine applications in numerous areas of mathematical physics.For a serviceable review, see Vourdas A. (2004), "Quantum systems with finite Hilbert space", Rep. Prog. Phys. 67 267. {{doi|10.1088/0034-4885/67/3/R03}}. The clock matrix amounts to the exponential of position in a "clock" of d hours, and the shift matrix is just the translation operator in that cyclic vector space, so the exponential of the momentum. They are (finite-dimensional) representations of the corresponding elements of the Weyl-Heisenberg group on a d-dimensional Hilbert space.

The following relations echo and generalize those of the Pauli matrices:

:\Sigma_1^d = \Sigma_3^d = I

and the braiding relation,

:\Sigma_3 \Sigma_1 = \omega \Sigma_1 \Sigma_3 = e^{2\pi i / d} \Sigma_1 \Sigma_3 ,

the Weyl formulation of the CCR, and can be rewritten as

:\Sigma_3 \Sigma_1 \Sigma_3^{d-1} \Sigma_1^{d-1} = \omega ~.

On the other hand, to generalize the Walsh–Hadamard matrix W, note

:

W = \frac{1}{\sqrt{2}}

\begin{bmatrix}

1 & 1 \\ 1 & \omega^{2-1}

\end{bmatrix}

=

\frac{1}{\sqrt{2}}

\begin{bmatrix}

1 & 1 \\ 1 & \omega^{d-1}

\end{bmatrix}.

Define, again with Sylvester, the following analog matrix,{{Cite journal|doi=10.1080/14786446708639914|title=Thoughts on inverse orthogonal matrices, simultaneous sign-successions, and tessellated pavements in two or more colours, with applications to Newton's rule, ornamental tile-work, and the theory of numbers|year=1867|last1=Sylvester|first1=J.J.|journal=The London, Edinburgh, and Dublin Philosophical Magazine and Journal of Science|volume=34|issue=232|pages=461–475}} still denoted by W in a slight abuse of notation,

:

W =

\frac{1}{\sqrt{d}}

\begin{bmatrix}

1 & 1 & 1 & \cdots & 1\\

1 & \omega^{d-1} & \omega^{2(d-1)} & \cdots & \omega^{(d-1)^2}\\

1 & \omega^{d-2} & \omega^{2(d-2)} & \cdots & \omega^{(d-1)(d-2)}\\

\vdots & \vdots & \vdots & \ddots & \vdots\\

1 & \omega & \omega ^2 & \cdots & \omega^{d-1}

\end{bmatrix}~.

It is evident that W is no longer Hermitian, but is still unitary. Direct calculation yields

:\Sigma_1 = W \Sigma_3 W^* ~,

which is the desired analog result. Thus, W, a Vandermonde matrix, arrays the eigenvectors of \Sigma_1, which has the same eigenvalues as \Sigma_3.

When d = 2^k, W^* is precisely the discrete Fourier transform matrix, converting position coordinates to momentum coordinates and vice versa.

= Definition =

The complete family of d^2 unitary (but non-Hermitian) independent matrices \{\sigma_{k,j}\}_{k,j=1}^d is defined as follows:

{{Equation box 1

|indent =::

|equation =

\sigma_{k,j}:=

\left(\Sigma_1\right)^k \left(\Sigma_3\right)^j = \sum_{m=0}^{d-1} |m+k\rangle \omega^{jm} \langle m|.

| cellpadding= 6

| border

| border colour = #0073CF

| bgcolor=#F9FFF7

}}

This provides Sylvester's well-known trace-orthogonal basis for \mathfrak{gl}(d,\Complex), known as "nonions" \mathfrak{gl}(3,\Complex), "sedenions" \mathfrak{gl}(4,\Complex), etc...{{Cite journal | last1 = Patera | first1 = J. | last2 = Zassenhaus | first2 = H. | doi = 10.1063/1.528006 | title = The Pauli matrices in n dimensions and finest gradings of simple Lie algebras of type An−1 | journal = Journal of Mathematical Physics | volume = 29 | issue = 3 | pages = 665 | year = 1988 |bibcode = 1988JMP....29..665P }}Since all indices are defined cyclically mod {{mvar|d}}, \mathrm{tr}\Sigma_1^j \Sigma_3^k \Sigma_1^m \Sigma_3^n = \omega^{km} d ~\delta_{j+m,0} \delta_{k+n,0} .

This basis can be systematically connected to the above Hermitian basis.{{Cite journal | last1 = Fairlie | first1 = D. B. | last2 = Fletcher | first2 = P. | last3 = Zachos | first3 = C. K. | doi = 10.1063/1.528788 | title = Infinite-dimensional algebras and a trigonometric basis for the classical Lie algebras | journal = Journal of Mathematical Physics | volume = 31 | issue = 5 | pages = 1088 | year = 1990 |bibcode = 1990JMP....31.1088F }} (For instance, the powers of \Sigma_3, the Cartan subalgebra,

map to linear combinations of the h_{k}^{\,\,\, d} matrices.) It can further be used to identify \mathfrak{gl}(d,\Complex), as d \to \infty, with the algebra of Poisson brackets.

= Properties =

With respect to the Hilbert–Schmidt inner product on operators, \langle A, B \rangle_\text{HS} = \operatorname{Tr}(A^* B), Sylvester's generalized Pauli operators are orthogonal and normalized to \sqrt{d}:

:\langle \sigma_{k,j}, \sigma_{k',j'} \rangle_{\text{HS}} = \delta_{k k'}\delta_{j j'} \| \sigma_{k,j}\|^2_{\text{HS}} = d \delta_{k k'}\delta_{j j'}.

This can be checked directly from the above definition of \sigma_{k,j}.

See also

{{portal|Physics}}

Notes

{{Reflist}}

{{DEFAULTSORT:Generalizations Of Pauli Matrices}}

Category:Linear algebra

Category:Mathematical physics