矩阵差分方程

矩阵差分方程是一种差分方程，其中某时刻的变量向量（或矩阵）与之前时刻的值通过矩阵相关。^[1]^[2]方程的阶是变量向量任意两个指示值之间的最大时差。例如

\mathbf {x} _{t}=\mathbf {Ax} _{t-1}+\mathbf {Bx} _{t-2}

是二阶矩阵差分方程，其中 $x$ 是 $n \times 1$ 变量向量， $A$ 、 $B$ 是 $n \times n$ 矩阵。该方程齐次，因为方程末尾没有常数项向量。同一个方程也可写成

\mathbf {x} _{t+2}=\mathbf {Ax} _{t+1}+\mathbf {Bx} _{t}

或

\mathbf {x} _{n}=\mathbf {Ax} _{n-1}+\mathbf {Bx} _{n-2}

最常见的矩阵差分方程都是一阶的。

非齐次一阶情形及稳态

非齐次一阶矩阵差分方程如：

\mathbf {x} _{t}=\mathbf {Ax} _{t-1}+\mathbf {b}

与一个加性常向量 $b$ 。该系统的稳态是 $x$ 向量的值 $x *$ ，一旦达到就不会偏离。 $x *$ 可通过置 $x t = x t -1 = x *$ 、解 $x *$ 以得

\mathbf {x} ^{*}=[\mathbf {I} -\mathbf {A} ]^{-1}\mathbf {b}

其中 $I$ 是 $n \times n$ 单位矩阵，假定 $[I - A]$ 可逆。非齐次方程可用偏离稳态的齐次方程重写：

\left[\mathbf {x} _{t}-\mathbf {x} ^{*}\right]=\mathbf {A} \left[\mathbf {x} _{t-1}-\mathbf {x} ^{*}\right]

一阶情形的稳定性

一阶矩阵差分方程 $[x t - x *] = A [x t -1 - x *]$ 是稳定的，即当且仅当转移矩阵 $A$ 的所有特征值（无论实复）绝对值都小于1时， $x t$ 才逐渐收敛到稳态 $x *$ 。

解一阶情形

假定方程齐次形式为 $y t = Ay t -1$ ，然后可从初始条件 $y 0$ 开始迭代。 $y 0$ 是 $y$ 的初值，必须得知才能求解：

{\begin{aligned}\mathbf {y} _{1}&=\mathbf {Ay} _{0}\\\mathbf {y} _{2}&=\mathbf {Ay} _{1}=\mathbf {A} ^{2}\mathbf {y} _{0}\\\mathbf {y} _{3}&=\mathbf {Ay} _{2}=\mathbf {A} ^{3}\mathbf {y} _{0}\end{aligned}}

以此类推，由数学归纳法，用 $t$ 表示的解为

\mathbf {y} _{t}=\mathbf {A} ^{t}\mathbf {y} _{0}

此外，若 $A$ 可对角化，就可用它的特征值和特征向量重写 $A$ ，得到解

\mathbf {y} _{t}=\mathbf {PD} ^{t}\mathbf {P} ^{-1}\mathbf {y} _{0},

其中 $P$ 是 $n \times n$ 矩阵，列是 $A$ 的特征向量（假设特征值互异）； $D$ 是 $n \times n$ 对角矩阵，对角元是 $A$ 的特征值。这个解就是上述稳定性结果的依据：当且仅当 $A$ 的特征值绝对值都小于1， $A t$ 才会随时间收缩到零矩阵。

从一阶矩阵系统中提取单一标量变量的动力特性

从 $n$ 维系统 $y t = Ay t -1$ 开始，可以提取其中一个状态变量（如 $y 1$ ）的动态变化。上述 $y t$ 的求解方程表明， $y 1, t$ 的解是根据 $A$ 的 $n$ 个特征值求得的。因此，描述 $y 1$ 变化的方程本身必须有涉及特征值的解。这种描述直观地产生了 $y 1$ 的演化方程，即

y_{1,t}=a_{1}y_{1,t-1}+a_{2}y_{1,t-2}+\dots +a_{n}y_{1,t-n}

其中参数 $a i$ 来自 $A$ 的特征方程式：

\lambda ^{n}-a_{1}\lambda ^{n-1}-a_{2}\lambda ^{n-2}-\dots -a_{n}\lambda ^{0}=0.

因此， $n$ 维一阶线性系统中的每个标量变量都根据一元 $n$ 阶差分方程演化，与矩阵差分防尘具有相同的稳定性。

高阶情形的解与稳定性

可用分块矩阵将高阶矩阵差分方程转换到一阶，可以求解时滞超过一个周期的高阶方程，并分析其稳定性。例如，假设有二阶方程

\mathbf {x} _{t}=\mathbf {Ax} _{t-1}+\mathbf {Bx} _{t-2}

变量向量 $x$ 尺寸为 $n \times 1$ ， $A$ 、 $B$ 尺寸为 $n \times n$ 。则可以叠加为下列形式

{\begin{bmatrix}\mathbf {x} _{t}\\\mathbf {x} _{t-1}\\\end{bmatrix}}={\begin{bmatrix}\mathbf {A} &\mathbf {B} \\\mathbf {I} &\mathbf {0} \\\end{bmatrix}}{\begin{bmatrix}\mathbf {x} _{t-1}\\\mathbf {x} _{t-2}\end{bmatrix}}

其中 $I$ 是 $n \times n$ 单位矩阵， $0$ 是 $n \times n$ 零矩阵。然后将当前变量和一度滞后变量的 $2 n \times 1$ 叠加向量表示为 $z t$ ，将 $2 n \times 2 n$ 分块矩阵表示为 $L$ ，就得到了之前的解

\mathbf {z} _{t}=\mathbf {L} ^{t}\mathbf {z} _{0}

与之前一样，当且仅当矩阵 $L$ 的所有特征值的绝对值都小于1时，叠加方程与原二阶方程才稳定。

非线性矩阵差分方程：黎卡提方程

在LQG控制中，会出现一个当前和未来成本矩阵反向演化的非线性矩阵方程，下面用 $H$ 表示。这个方程也被称为离散动力黎卡提方程，当据线性矩阵差分方程演化的变量向量受外源向量的控制，以优化二次损失函数时，就会产生这个方程。黎卡提方程形式如下：

\mathbf {H} _{t-1}=\mathbf {K} +\mathbf {A} '\mathbf {H} _{t}\mathbf {A} -\mathbf {A} '\mathbf {H} _{t}\mathbf {C} \left[\mathbf {C} '\mathbf {H} _{t}\mathbf {C} +\mathbf {R} \right]^{-1}\mathbf {C} '\mathbf {H} _{t}\mathbf {A}

其中 $H$ 、 $K$ 、 $A$ 尺寸为 $n \times n$ ； $C$ 尺寸为 $n \times k$ ； $R$ 尺寸为 $k \times k$ ， $n$ 是受控向量元素数， $k$ 是控制向量元素数。参数矩阵 $A$ 、 $C$ 来自线性方程，参数矩阵 $K$ 、 $R$ 来自二次损失函数。详见此处。

一般来说，该方程无法根据 $t$ 分析求解 $H t$ ，而是通过迭代黎卡提方程，求出 $H t$ 的值序列。不过，已经证明^[3]，若 $R = 0$ 、 $n = k + 1$ ，则可将黎卡提方程简化为标量有理差分方程分析求解；对任意 $k$ 、 $n$ ，若转移矩阵 $A$ 可逆，则黎卡提方程就可根据矩阵特征值进行分析求解，尽管特征值可能要用数值计算才能找到。^[4]

在大多数情况下， $H$ 随时间的演化是稳定的，也就是说 $H$ 会收敛到特定的常矩阵 $H *$ ，其他矩阵都有理时也可能是无理的。参见隨機控制#離散時間系統。

另见

参考文献

^ Cull, Paul; Flahive, Mary; Robson, Robbie. Difference Equations: From Rabbits to Chaos. Springer. 2005. ch. 7. ISBN 0-387-23234-6.
^ Chiang, Alpha C. Fundamental Methods of Mathematical Economics 3rd. McGraw-Hill. 1984: 608–612. ISBN 9780070107809.
^ Balvers, Ronald J.; Mitchell, Douglas W. Reducing the dimensionality of linear quadratic control problems (PDF). Journal of Economic Dynamics and Control. 2007, 31 (1): 141–159 [2023-10-15]. doi:10.1016/j.jedc.2005.09.013. （原始内容存档 (PDF)于2022-01-18）.
^ Vaughan, D. R. A nonrecursive algebraic solution for the discrete Riccati equation. IEEE Transactions on Automatic Control. 1970, 15 (5): 597–599. doi:10.1109/TAC.1970.1099549.
^ Martin, C. F.; Ammar, G. The geometry of the matrix Riccati equation and associated eigenvalue method. Bittani; Laub; Willems (编). The Riccati Equation. Springer-Verlag. 1991. ISBN 978-3-642-63508-3. doi:10.1007/978-3-642-58223-3_5.

[1] Cull, Paul; Flahive, Mary; Robson, Robbie. Difference Equations: From Rabbits to Chaos. Springer. 2005. ch. 7. ISBN 0-387-23234-6.

[2] Chiang, Alpha C. Fundamental Methods of Mathematical Economics 3rd. McGraw-Hill. 1984: 608–612. ISBN 9780070107809.

[3] Balvers, Ronald J.; Mitchell, Douglas W. Reducing the dimensionality of linear quadratic control problems (PDF). Journal of Economic Dynamics and Control. 2007, 31 (1): 141–159 [2023-10-15]. doi:10.1016/j.jedc.2005.09.013. （原始内容存档 (PDF)于2022-01-18）.

[4] Vaughan, D. R. A nonrecursive algebraic solution for the discrete Riccati equation. IEEE Transactions on Automatic Control. 1970, 15 (5): 597–599. doi:10.1109/TAC.1970.1099549.

[5] Martin, C. F.; Ammar, G. The geometry of the matrix Riccati equation and associated eigenvalue method. Bittani; Laub; Willems (编). The Riccati Equation. Springer-Verlag. 1991. ISBN 978-3-642-63508-3. doi:10.1007/978-3-642-58223-3_5.

[1]

[2]

[3]

[4]

[5]