memo

Householder Transformation

Definition 1 Householder matrix

$v \in \mathbb{R}^{n}$,
- $v \neq 0$,

\[\begin{eqnarray} P_{v} & := & I - \frac{2}{v^{\mathrm{T}}v} v v ^{\mathrm{T}} \nonumber \\ & = & I - \frac{2}{\norm{v}^{2}} v v ^{\mathrm{T}} \nonumber \\ & = & I - 2 \bar{v} \bar{v}^{\mathrm{T}} \nonumber \\ \bar{v} & := & \frac{v}{\norm{v}} \nonumber \end{eqnarray}\]

$P_{v}$ is called a Householder reflection, Householder matrix, or Householder transformation. The vector $v$ is called the Householder vector.

■
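As a small illustration of Definition 1, the following sketch builds $P_{v}$ explicitly as a list of rows. The function name `householder_matrix` is hypothetical and chosen only for this memo; forming the full matrix costs $O(n^{2})$ and is done here only to make the definition concrete.

```python
def householder_matrix(v):
    """Return P_v = I - (2 / v^T v) v v^T as a list of rows (illustrative helper)."""
    vtv = sum(c * c for c in v)
    if vtv == 0.0:
        raise ValueError("v must be nonzero")
    n = len(v)
    return [[(1.0 if i == j else 0.0) - 2.0 * v[i] * v[j] / vtv
             for j in range(n)]
            for i in range(n)]


# For v = (1, -1) the reflection swaps the two coordinates.
print(householder_matrix([1.0, -1.0]))  # [[0.0, 1.0], [1.0, 0.0]]
```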

Remark

$P_{v}$ is a reflection across the hyperplane $\mathrm{span}(v)^{\perp}$.

\[\begin{eqnarray} P_{v}x & = & x - \frac{2}{\norm{v}^{2}} v v^{\mathrm{T}} x \nonumber \\ & = & x - \frac{ 2 (v^{\mathrm{T}} x) }{ \norm{v}^{2} } v \nonumber \\ & = & x - 2 \langle \bar{v}, x \rangle \bar{v} \end{eqnarray}\]

$\langle \bar{v}, x \rangle$ is the signed length of the projection of $x$ onto $\bar{v}$, and $\langle \bar{v}, x \rangle \bar{v}$ is the projection of $x$ onto $\mathrm{span}(v)$. Subtracting this projection twice reflects $x$ across the hyperplane orthogonal to $v$.
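The last expression for $P_{v}x$ can be evaluated in $O(n)$ operations without forming $P_{v}$. A minimal sketch, assuming vectors are plain Python lists (the name `reflect` is hypothetical):

```python
def reflect(v, x):
    """Apply P_v to x as x - 2 * (v.x / v.v) * v, without forming P_v."""
    vtv = sum(c * c for c in v)
    if vtv == 0.0:
        raise ValueError("v must be nonzero")
    scale = 2.0 * sum(a * b for a, b in zip(v, x)) / vtv
    return [xi - scale * vi for xi, vi in zip(x, v)]


# Reflecting across the plane orthogonal to e_3 flips the third component.
print(reflect([0.0, 0.0, 1.0], [1.0, 2.0, 3.0]))  # [1.0, 2.0, -3.0]
```

Vectors orthogonal to $v$ are left unchanged, and $v$ itself is mapped to $-v$.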

Suppose that $Px \in \mathrm{span}(e_{1})$, where $P := P_{v}$ and $(e_{i})_{i=1,\ldots,n}$ is the standard basis. Since

\[x - Px = 2 \langle \bar{v}, x \rangle \bar{v},\]

$v \in \mathrm{span}(x, e_{1})$ whenever $\langle \bar{v}, x \rangle \neq 0$. Conversely, setting $v := x + \alpha e_{1}$ gives

\[\begin{eqnarray} v^{\mathrm{T}}x & = & x^{\mathrm{T}}x + \alpha x^{1} \nonumber \\ v^{\mathrm{T}}v & = & v^{\mathrm{T}} \left( x + \alpha e_{1} \right) \nonumber \\ & = & v^{\mathrm{T}} x + \alpha v^{\mathrm{T}} e_{1} \nonumber \\ & = & x^{\mathrm{T}}x + \alpha x^{1} + x^{1} \alpha + \alpha^{2} \nonumber \\ & = & x^{\mathrm{T}}x + 2 \alpha x^{1} + \alpha^{2} \nonumber \end{eqnarray}\]

where $x = (x^{1}, \ldots, x^{n})^{\mathrm{T}}$. Therefore,

\[\begin{eqnarray} Px & = & x - 2 \frac{ x^{\mathrm{T}}x + \alpha x^{1} }{ x^{\mathrm{T}}x + 2 \alpha x^{1} + \alpha^{2} } \left( x + \alpha e_{1} \right) \nonumber \\ & = & x \left( 1 - 2 \frac{ x^{\mathrm{T}}x + \alpha x^{1} }{ x^{\mathrm{T}}x + 2 \alpha x^{1} + \alpha^{2} } \right) - 2 \alpha \frac{ x^{\mathrm{T}}x + \alpha x^{1} }{ x^{\mathrm{T}}x + 2 \alpha x^{1} + \alpha^{2} } e_{1} . \label{householder_reflection_equation_01} \end{eqnarray}\]

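The coefficient of $x$ vanishes exactly when $x^{\mathrm{T}}x + 2 \alpha x^{1} + \alpha^{2} = 2 (x^{\mathrm{T}}x + \alpha x^{1})$, that is, when $\alpha^{2} = x^{\mathrm{T}}x$. We therefore set $\alpha = \pm \norm{x}_{2}$. Then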

\[\begin{eqnarray} v^{\mathrm{T}}x & = & \norm{x}_{2}^{2} \pm \norm{x}_{2} x^{1} \nonumber \\ v^{\mathrm{T}}v & = & \norm{x}_{2}^{2} \pm 2 \norm{x}_{2} x^{1} + \norm{x}_{2}^{2} \nonumber \\ & = & 2 \norm{x}_{2}^{2} \pm 2 \norm{x}_{2} x^{1} \nonumber . \end{eqnarray}\]

From $\eqref{householder_reflection_equation_01}$,

\[\begin{eqnarray} Px & = & x \left( 1 - 2 \frac{ \norm{x}_{2}^{2} \pm \norm{x}_{2}x^{1} }{ 2 \norm{x}_{2}^{2} \pm 2 \norm{x}_{2}x^{1} } \right) \mp 2 \norm{x}_{2} \frac{ \norm{x}_{2}^{2} \pm \norm{x}_{2}x^{1} }{ 2 \norm{x}_{2}^{2} \pm 2 \norm{x}_{2}x^{1} } e_{1} \nonumber \\ & = & \mp \norm{x}_{2} e_{1} . \nonumber \end{eqnarray}\]

■
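The result $Px = \mp \norm{x}_{2} e_{1}$ can be checked numerically. The sketch below forms $v = x + \alpha e_{1}$ with $\alpha = \pm \norm{x}_{2}$ and applies the reflection directly. The function name `reflect_to_e1` and its `sign` parameter are hypothetical, and the code assumes $v \neq 0$, that is, $x \neq -\alpha e_{1}$.

```python
import math


def reflect_to_e1(x, sign=-1.0):
    """Form v = x + alpha * e_1 with alpha = sign * ||x||_2; return (v, P_v x)."""
    alpha = sign * math.sqrt(sum(c * c for c in x))
    v = list(x)
    v[0] += alpha
    vtv = sum(c * c for c in v)  # assumed nonzero, i.e. x != -alpha * e_1
    scale = 2.0 * sum(a * b for a, b in zip(v, x)) / vtv
    return v, [xi - scale * vi for xi, vi in zip(x, v)]


v, Px = reflect_to_e1([3.0, 4.0], sign=-1.0)  # alpha = -5
print(v, Px)  # [-2.0, 4.0] [5.0, 0.0]
v, Px = reflect_to_e1([3.0, 4.0], sign=+1.0)  # alpha = +5
print(v, Px)  # [8.0, 4.0] [-5.0, 0.0]
```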

There are various algorithms to compute a Householder vector $v$ for a given $x$ such that $P_{v}x = \norm{x}_{2} e_{1}$, where $P_{v} = I - \beta v v^{\mathrm{T}}$ is orthogonal and $\beta := 2 / v^{\mathrm{T}}v$. Here we describe one of them. Since $\alpha = - \norm{x}_{2}$ makes $P_{v}x = \norm{x}_{2} e_{1}$ a nonnegative multiple of $e_{1}$, we take $\alpha = -\norm{x}_{2}$. In that case,

\[v^{1} = x^{1} - \norm{x}_{2} .\]

If $x^{1} \approx \norm{x}_{2}$, that is, $x \approx x^{1}e_{1}$ with $x^{1} > 0$, catastrophic cancellation would occur in this subtraction. To avoid the cancellation when $x^{1} > 0$, we instead compute $v^{1}$ by the following equivalent formula:

\[\begin{eqnarray} v^{1} & = & x^{1} - \norm{x}_{2} \nonumber \\ & = & \frac{ (x^{1})^{2} - \norm{x}_{2}^{2} }{ x^{1} + \norm{x}_{2} } \nonumber \\ & = & \frac{ - ( (x^{2})^{2} + \cdots + (x^{n})^{2} ) }{ x^{1} + \norm{x}_{2} } . \nonumber \end{eqnarray}\]
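The effect of the cancellation is easy to observe in double precision. The sketch below compares the direct subtraction with the rewritten formula for a vector that is almost parallel to $e_{1}$. Both function names are hypothetical, and the stable variant assumes $x^{1} > 0$.

```python
import math


def v1_naive(x):
    """Direct evaluation of x^1 - ||x||_2 (suffers from cancellation)."""
    return x[0] - math.sqrt(sum(c * c for c in x))


def v1_stable(x):
    """Evaluation as -sigma / (x^1 + ||x||_2); assumes x^1 > 0."""
    sigma = sum(c * c for c in x[1:])
    mu = math.sqrt(x[0] * x[0] + sigma)
    return -sigma / (x[0] + mu)


x = [1.0, 1e-9]
print(v1_naive(x))   # 0.0: all significant digits are lost
print(v1_stable(x))  # close to -5e-19, the correctly rounded magnitude
```

In the naive version the squared tail $10^{-18}$ is absorbed when it is added to $1$, so the computed norm equals $x^{1}$ exactly and the difference is zero.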

Algorithm 2

    \begin{algorithm}
    \caption{Computing Householder Vector}
    \begin{algorithmic}
    \PROCEDURE{ComputeHouseholderVector}{$x$}
        \STATE $n := \mathrm{length}(x)$
        \STATE $\sigma := (x^{2:n})^{\mathrm{T}}x^{2:n}$
        \COMMENT{$2(n - 1)$ flops}
        \STATE $
            v :=
            \left(
                \begin{array}{c}
                    1
                    \\
                    x^{2:n}
                \end{array}
            \right)
        $
        \IF{$\sigma = 0$}
            \COMMENT{$x$ is contained in span $e_{1}$}
            \STATE $\beta := 0$
        \ELSE
            \STATE $\mu := \sqrt{(x^{1})^{2} + \sigma}$
            \COMMENT{norm of $x$}
            \IF{ $x^{1} \le 0$ }
                \STATE $v^{1} \leftarrow x^{1} - \mu$
            \ELSE
                \COMMENT{to avoid catastrophic cancellation}
                \STATE $v^{1} \leftarrow -\sigma/(x^{1} + \mu)$
            \ENDIF
            \STATE $\beta := 2 (v^{1})^{2} / (\sigma + (v^{1})^{2})$
            \STATE $v \leftarrow v / v^{1}$
            \COMMENT{$n$ flops}
        \ENDIF
        \RETURN $(v, \beta)$
    \ENDPROCEDURE
    \end{algorithmic}
    \end{algorithm}

■
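A direct transcription of Algorithm 2 into Python might look as follows. This is a sketch under the assumption that $x$ is a plain list of floats. The names `compute_householder_vector` and `apply_householder` are hypothetical. As in the pseudocode, $\sigma = 0$ returns $\beta = 0$, so $x$ is then left unchanged even when $x^{1} < 0$.

```python
import math


def compute_householder_vector(x):
    """Return (v, beta) with v[0] == 1 such that (I - beta v v^T) x = ||x||_2 e_1."""
    sigma = sum(c * c for c in x[1:])        # squared norm of x^{2:n}
    if sigma == 0.0:                         # x already lies in span(e_1)
        return [1.0] + [float(c) for c in x[1:]], 0.0
    mu = math.sqrt(x[0] * x[0] + sigma)      # ||x||_2
    if x[0] <= 0.0:
        v1 = x[0] - mu
    else:
        v1 = -sigma / (x[0] + mu)            # avoids catastrophic cancellation
    beta = 2.0 * v1 * v1 / (sigma + v1 * v1)
    v = [1.0] + [c / v1 for c in x[1:]]      # normalise so that v[0] == 1
    return v, beta


def apply_householder(v, beta, x):
    """Return (I - beta v v^T) x without forming the matrix."""
    scale = beta * sum(a * b for a, b in zip(v, x))
    return [xi - scale * vi for xi, vi in zip(x, v)]


x = [3.0, 4.0]
v, beta = compute_householder_vector(x)      # v = [1, -2], beta close to 0.4
print(apply_householder(v, beta, x))         # approximately [5, 0] = ||x||_2 e_1
```

Normalising so that $v^{1} = 1$ means only $v^{2:n}$ needs to be stored, which is what the bidiagonalization algorithm below relies on.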

\[\begin{eqnarray} Q & := & Q_{1} \cdots Q_{r} \nonumber \\ Q_{j} & := & I - \beta_{j} v^{(j)}(v^{(j)})^{\mathrm{T}} \nonumber \\ v^{(j)} & := & ( \underbrace{0, \ldots, 0,}_{j - 1} 1, v^{j+1,(j)}, \cdots, v^{n,(j)} )^{\mathrm{T}} \nonumber . \end{eqnarray}\]

The algorithm requires approximately $3n$ flops, so its time complexity is $O(n)$.
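The factored form $Q = Q_{1} \cdots Q_{r}$ allows $Qx$ to be computed from the stored pairs $(v^{(j)}, \beta_{j})$ without forming $Q$. A minimal sketch, assuming each $v^{(j)}$ is stored as a full-length list including its leading zeros (the name `apply_factored_q` is hypothetical):

```python
def apply_factored_q(vs, betas, x):
    """Return Q x for Q = Q_1 ... Q_r with Q_j = I - beta_j v_j v_j^T."""
    y = [float(c) for c in x]
    # Q x = Q_1 (Q_2 (... (Q_r x))), so the last factor is applied first.
    for v, beta in zip(reversed(vs), reversed(betas)):
        scale = beta * sum(a * b for a, b in zip(v, y))
        y = [yi - scale * vi for yi, vi in zip(y, v)]
    return y


# Q_1 swaps coordinates 1 and 2, Q_2 swaps coordinates 2 and 3.
vs = [[1.0, -1.0, 0.0], [0.0, 1.0, -1.0]]
betas = [1.0, 1.0]  # beta_j = 2 / (v_j^T v_j) = 1 for both vectors
print(apply_factored_q(vs, betas, [1.0, 2.0, 3.0]))  # [3.0, 1.0, 2.0]
```

Each factor costs $O(n)$, so applying $Q$ this way costs $O(rn)$ rather than the $O(n^{2})$ of a dense matrix-vector product.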

Proposition 3

Let $P_{v}$ and $P_{v^{\prime}}$ be Householder matrices. Then the following hold.

(1)

\[P_{v}^{\mathrm{T}} = P_{v} .\]

(2)

\[P_{v}^{-1} = P_{v}^{\mathrm{T}} = P_{v} .\]

(3) $P_{v}^{2} = I$.

(4) $P_{v}P_{v^{\prime}}$ is orthogonal.

proof

proof of (1)

\[\begin{eqnarray} P_{v}^{\mathrm{T}} & = & (I - 2 \bar{v}\bar{v}^{\mathrm{T}})^{\mathrm{T}} \nonumber \\ & = & I - 2 (\bar{v}^{\mathrm{T}})^{\mathrm{T}}\bar{v}^{\mathrm{T}} \nonumber \\ & = & I - 2 \bar{v}\bar{v}^{\mathrm{T}} \nonumber \end{eqnarray}\]

proof of (2)

\[\begin{eqnarray} P_{v} P_{v}^{\mathrm{T}} & = & (I - 2 \bar{v}\bar{v}^{\mathrm{T}}) (I - 2 \bar{v}\bar{v}^{\mathrm{T}}) \nonumber \\ & = & I - 2 \bar{v}\bar{v}^{\mathrm{T}} - 2 \bar{v}\bar{v}^{\mathrm{T}} + 4 (\bar{v}\bar{v}^{\mathrm{T}}) (\bar{v}\bar{v}^{\mathrm{T}}) \nonumber \\ & = & I - 4 \bar{v}\bar{v}^{\mathrm{T}} + 4 \bar{v} (\bar{v}^{\mathrm{T}} \bar{v}) \bar{v}^{\mathrm{T}} \nonumber \\ & = & I - 4 \bar{v}\bar{v}^{\mathrm{T}} + 4 \bar{v} \bar{v}^{\mathrm{T}} \quad (\because \norm{\bar{v}} = 1) \nonumber \\ & = & I \nonumber \end{eqnarray}\]

proof of (3)

By (1) and (2), $P_{v}^{2} = P_{v}P_{v}^{\mathrm{T}} = I$.

proof of (4)

This follows from the fact that the product of orthogonal matrices is orthogonal.

$\Box$
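Properties (1) to (3), and the orthogonality of a product of two Householder matrices, can be sanity-checked numerically. The sketch below uses small list-of-rows helpers. All function names are hypothetical, and the tolerance `1e-12` is an arbitrary choice for this example.

```python
def householder_matrix(v):
    """Return P_v = I - (2 / v^T v) v v^T as a list of rows."""
    vtv = sum(c * c for c in v)
    n = len(v)
    return [[(1.0 if i == j else 0.0) - 2.0 * v[i] * v[j] / vtv
             for j in range(n)] for i in range(n)]


def matmul(A, B):
    """Product of two matrices stored as lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]


def transpose(A):
    return [list(col) for col in zip(*A)]


def is_close(A, B, tol=1e-12):
    return all(abs(a - b) <= tol for ra, rb in zip(A, B) for a, b in zip(ra, rb))


I3 = [[1.0 if i == j else 0.0 for j in range(3)] for i in range(3)]
P = householder_matrix([1.0, 2.0, 2.0])
R = householder_matrix([0.0, 1.0, -3.0])

print(is_close(transpose(P), P))          # (1) symmetric
print(is_close(matmul(P, P), I3))         # (2), (3) P is its own inverse
PR = matmul(P, R)
print(is_close(matmul(PR, transpose(PR)), I3))  # product is orthogonal
```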

Algorithm 4 Householder Bidiagonalization

$A \in \mathbb{R}^{m \times n}$.
$m \ge n$,
$U \in \mathbb{R}^{m \times m}$
- orthogonal
$V \in \mathbb{R}^{n \times n}$
- orthogonal

The algorithm finds orthogonal matrices $U$ and $V$ that satisfy

\[\begin{eqnarray} U^{\mathrm{T}} A V = \left( \begin{array}{ccccc} d_{1} & f_{1} & 0 & \cdots & 0 \\ 0 & d_{2} & f_{2} & \ddots & \vdots \\ \vdots & \ddots & \ddots & \ddots & 0 \\ 0 & \cdots & 0 & d_{n-1} & f_{n-1} \\ 0 & \cdots & \cdots & 0 & d_{n} \\ \hline 0 & \cdots & \cdots & \cdots & 0 \\ \vdots & & & & \vdots \\ 0 & \cdots & \cdots & \cdots & 0 \end{array} \right) . \end{eqnarray}\]

For example, for $m = n = 4$ the reduction alternates Householder reflections applied from the left ($U_{j}$) and from the right ($V_{j}$), where $\times$ denotes an entry that may be nonzero:

\[\begin{eqnarray} A & := & \left( \begin{array}{cccc} \times & \times & \times & \times \\ \times & \times & \times & \times \\ \times & \times & \times & \times \\ \times & \times & \times & \times \end{array} \right) \nonumber \\ A_{1}^{(1)} & := & U_{1}A \nonumber \\ & = & \left( \begin{array}{cccc} \times & \times & \times & \times \\ 0 & \times & \times & \times \\ 0 & \times & \times & \times \\ 0 & \times & \times & \times \end{array} \right) \nonumber \\ A_{2}^{(1)} & := & A_{1}^{(1)} V_{1} \nonumber \\ & = & \left( \begin{array}{cccc} \times & \times & 0 & 0 \\ 0 & \times & \times & \times \\ 0 & \times & \times & \times \\ 0 & \times & \times & \times \end{array} \right) \nonumber \\ A_{1}^{(2)} & := & U_{2} A_{2}^{(1)} \nonumber \\ & = & \left( \begin{array}{cccc} \times & \times & 0 & 0 \\ 0 & \times & \times & \times \\ 0 & 0 & \times & \times \\ 0 & 0 & \times & \times \end{array} \right) \nonumber \\ A_{2}^{(2)} & := & A_{1}^{(2)} V_{2} \nonumber \\ & = & \left( \begin{array}{cccc} \times & \times & 0 & 0 \\ 0 & \times & \times & 0 \\ 0 & 0 & \times & \times \\ 0 & 0 & \times & \times \end{array} \right) \nonumber \\ A_{1}^{(3)} & := & U_{3} A_{2}^{(2)} \nonumber \\ & = & \left( \begin{array}{cccc} \times & \times & 0 & 0 \\ 0 & \times & \times & 0 \\ 0 & 0 & \times & \times \\ 0 & 0 & 0 & \times \end{array} \right) \nonumber \end{eqnarray}\]

    \begin{algorithm}
    \caption{Computing Householder Bidiagonalization}
    \begin{algorithmic}
    \REQUIRE $A \in \mathbb{R}^{m \times n}$
    \PROCEDURE{ComputeHouseholderBidiagonalization}{$A$}
        \FOR{$j = 1$ \TO $n$}
            \STATE $(v, \beta) \leftarrow \mathrm{ComputeHouseholderVector}(A_{j}^{j:m})$
            \STATE $A_{j:n}^{j:m} \leftarrow (I_{m-j+1} - \beta vv^{\mathrm{T}}) A_{j:n}^{j:m}$
            \STATE $A_{j}^{(j+1):m} \leftarrow v^{2:(m-j+1)}$
            \IF{$j \le n - 2$}
                \STATE $(v, \beta) \leftarrow \mathrm{ComputeHouseholderVector}((A_{(j+1):n}^{j})^{\mathrm{T}})$
                \STATE $A_{(j+1):n}^{j:m} \leftarrow A_{(j+1):n}^{j:m}(I_{n-j} - \beta vv^{\mathrm{T}})$
                \STATE $A_{(j+2):n}^{j} \leftarrow (v^{2:(n-j)})^{\mathrm{T}}$
            \ENDIF
        \ENDFOR
        \RETURN $A$
    \ENDPROCEDURE
    \end{algorithmic}
    \end{algorithm}

■
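The bidiagonalization can be sketched in pure Python as follows. This version is a simplification made for illustration: it returns only the bidiagonal matrix $B = U^{\mathrm{T}} A V$ and does not store the Householder vectors in the zeroed part of $A$ as Algorithm 4 does. The names `house` and `bidiagonalize` are hypothetical, and $A$ is assumed to be a list of rows with $m \ge n$.

```python
import math


def house(x):
    """Householder vector (v, beta) with v[0] == 1, following Algorithm 2."""
    sigma = sum(c * c for c in x[1:])
    if sigma == 0.0:
        return [1.0] + [float(c) for c in x[1:]], 0.0
    mu = math.sqrt(x[0] * x[0] + sigma)
    v1 = x[0] - mu if x[0] <= 0.0 else -sigma / (x[0] + mu)
    beta = 2.0 * v1 * v1 / (sigma + v1 * v1)
    return [1.0] + [c / v1 for c in x[1:]], beta


def bidiagonalize(A):
    """Return the upper bidiagonal B = U^T A V for an m x n matrix A with m >= n."""
    B = [[float(c) for c in row] for row in A]
    m, n = len(B), len(B[0])
    for j in range(n):
        # Left reflection: zero column j below the diagonal.
        v, beta = house([B[i][j] for i in range(j, m)])
        for c in range(j, n):
            s = beta * sum(v[i - j] * B[i][c] for i in range(j, m))
            for i in range(j, m):
                B[i][c] -= s * v[i - j]
        if j <= n - 3:  # 0-based form of the condition j <= n - 2
            # Right reflection: zero row j to the right of the superdiagonal.
            v, beta = house(B[j][j + 1:])
            for r in range(j, m):
                s = beta * sum(B[r][c] * v[c - j - 1] for c in range(j + 1, n))
                for c in range(j + 1, n):
                    B[r][c] -= s * v[c - j - 1]
    return B


A = [[1.0, 2.0, 3.0], [4.0, 5.0, 6.0], [7.0, 8.0, 10.0], [2.0, 1.0, 0.0]]
for row in bidiagonalize(A):
    print(["%.4f" % c for c in row])
```

Entries outside the diagonal and superdiagonal come out as rounding-level values rather than exact zeros. Because $U$ and $V$ are orthogonal, the Frobenius norm of $B$ equals that of $A$ up to rounding, which gives a cheap consistency check.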

Reference

Householder transformation - Wikipedia
s93.pdf
G. H. Golub and C. F. Van Loan. Matrix Computations. Johns Hopkins University Press, Baltimore, MD, USA, fourth edition, 2012