矩陣差分方程

矩陣差分方程是一種差分方程，其中某時刻的變量向量（或矩陣）與之前時刻的值通過矩陣相關。^[1]^[2]方程的階是變量向量任意兩個指示值之間的最大時差。例如

\mathbf {x} _{t}=\mathbf {Ax} _{t-1}+\mathbf {Bx} _{t-2}

是二階矩陣差分方程，其中 $x$ 是 $n \times 1$ 變量向量， $A$ 、 $B$ 是 $n \times n$ 矩陣。該方程齊次，因為方程末尾沒有常數項向量。同一個方程也可寫成

\mathbf {x} _{t+2}=\mathbf {Ax} _{t+1}+\mathbf {Bx} _{t}

或

\mathbf {x} _{n}=\mathbf {Ax} _{n-1}+\mathbf {Bx} _{n-2}

最常見的矩陣差分方程都是一階的。

非齊次一階情形及穩態

非齊次一階矩陣差分方程如：

\mathbf {x} _{t}=\mathbf {Ax} _{t-1}+\mathbf {b}

與一個加性常向量 $b$ 。該系統的穩態是 $x$ 向量的值 $x *$ ，一旦達到就不會偏離。 $x *$ 可通過置 $x t = x t -1 = x *$ 、解 $x *$ 以得

\mathbf {x} ^{*}=[\mathbf {I} -\mathbf {A} ]^{-1}\mathbf {b}

其中 $I$ 是 $n \times n$ 單位矩陣，假定 $[I - A]$ 可逆。非齊次方程可用偏離穩態的齊次方程重寫：

\left[\mathbf {x} _{t}-\mathbf {x} ^{*}\right]=\mathbf {A} \left[\mathbf {x} _{t-1}-\mathbf {x} ^{*}\right]

一階情形的穩定性

一階矩陣差分方程 $[x t - x *] = A [x t -1 - x *]$ 是穩定的，即若且唯若轉移矩陣 $A$ 的所有特徵值（無論實復）絕對值都小於1時， $x t$ 才逐漸收斂到穩態 $x *$ 。

解一階情形

假定方程齊次形式為 $y t = Ay t -1$ ，然後可從初始條件 $y 0$ 開始迭代。 $y 0$ 是 $y$ 的初值，必須得知才能求解：

{\begin{aligned}\mathbf {y} _{1}&=\mathbf {Ay} _{0}\\\mathbf {y} _{2}&=\mathbf {Ay} _{1}=\mathbf {A} ^{2}\mathbf {y} _{0}\\\mathbf {y} _{3}&=\mathbf {Ay} _{2}=\mathbf {A} ^{3}\mathbf {y} _{0}\end{aligned}}

以此類推，由數學歸納法，用 $t$ 表示的解為

\mathbf {y} _{t}=\mathbf {A} ^{t}\mathbf {y} _{0}

此外，若 $A$ 可對角化，就可用它的特徵值和特徵向量重寫 $A$ ，得到解

\mathbf {y} _{t}=\mathbf {PD} ^{t}\mathbf {P} ^{-1}\mathbf {y} _{0},

其中 $P$ 是 $n \times n$ 矩陣，列是 $A$ 的特徵向量（假設特徵值互異）； $D$ 是 $n \times n$ 對角矩陣，對角元是 $A$ 的特徵值。這個解就是上述穩定性結果的依據：若且唯若 $A$ 的特徵值絕對值都小於1， $A t$ 才會隨時間收縮到零矩陣。

從一階矩陣系統中提取單一純量變量的動力特性

從 $n$ 維系統 $y t = Ay t -1$ 開始，可以提取其中一個狀態變量（如 $y 1$ ）的動態變化。上述 $y t$ 的求解方程表明， $y 1, t$ 的解是根據 $A$ 的 $n$ 個特徵值求得的。因此，描述 $y 1$ 變化的方程本身必須有涉及特徵值的解。這種描述直觀地產生了 $y 1$ 的演化方程，即

y_{1,t}=a_{1}y_{1,t-1}+a_{2}y_{1,t-2}+\dots +a_{n}y_{1,t-n}

其中參數 $a i$ 來自 $A$ 的特徵方程式：

\lambda ^{n}-a_{1}\lambda ^{n-1}-a_{2}\lambda ^{n-2}-\dots -a_{n}\lambda ^{0}=0.

因此， $n$ 維一階線性系統中的每個純量變量都根據一元 $n$ 階差分方程演化，與矩陣差分防塵具有相同的穩定性。

高階情形的解與穩定性

可用分塊矩陣將高階矩陣差分方程轉換到一階，可以求解時滯超過一個周期的高階方程，並分析其穩定性。例如，假設有二階方程

\mathbf {x} _{t}=\mathbf {Ax} _{t-1}+\mathbf {Bx} _{t-2}

變量向量 $x$ 尺寸為 $n \times 1$ ， $A$ 、 $B$ 尺寸為 $n \times n$ 。則可以疊加為下列形式

{\begin{bmatrix}\mathbf {x} _{t}\\\mathbf {x} _{t-1}\\\end{bmatrix}}={\begin{bmatrix}\mathbf {A} &\mathbf {B} \\\mathbf {I} &\mathbf {0} \\\end{bmatrix}}{\begin{bmatrix}\mathbf {x} _{t-1}\\\mathbf {x} _{t-2}\end{bmatrix}}

其中 $I$ 是 $n \times n$ 單位矩陣， $0$ 是 $n \times n$ 零矩陣。然後將當前變量和一度滯後變量的 $2 n \times 1$ 疊加向量表示為 $z t$ ，將 $2 n \times 2 n$ 分塊矩陣表示為 $L$ ，就得到了之前的解

\mathbf {z} _{t}=\mathbf {L} ^{t}\mathbf {z} _{0}

與之前一樣，若且唯若矩陣 $L$ 的所有特徵值的絕對值都小於1時，疊加方程與原二階方程才穩定。

非線性矩陣差分方程：黎卡提方程

在LQG控制中，會出現一個當前和未來成本矩陣反向演化的非線性矩陣方程，下面用 $H$ 表示。這個方程也被稱為離散動力黎卡提方程，當據線性矩陣差分方程演化的變量向量受外源向量的控制，以優化二次損失函數時，就會產生這個方程。黎卡提方程形式如下：

\mathbf {H} _{t-1}=\mathbf {K} +\mathbf {A} '\mathbf {H} _{t}\mathbf {A} -\mathbf {A} '\mathbf {H} _{t}\mathbf {C} \left[\mathbf {C} '\mathbf {H} _{t}\mathbf {C} +\mathbf {R} \right]^{-1}\mathbf {C} '\mathbf {H} _{t}\mathbf {A}

其中 $H$ 、 $K$ 、 $A$ 尺寸為 $n \times n$ ； $C$ 尺寸為 $n \times k$ ； $R$ 尺寸為 $k \times k$ ， $n$ 是受控向量元素數， $k$ 是控制向量元素數。參數矩陣 $A$ 、 $C$ 來自線性方程，參數矩陣 $K$ 、 $R$ 來自二次損失函數。詳見此處。

一般來說，該方程無法根據 $t$ 分析求解 $H t$ ，而是通過迭代黎卡提方程，求出 $H t$ 的值序列。不過，已經證明^[3]，若 $R = 0$ 、 $n = k + 1$ ，則可將黎卡提方程簡化為純量有理差分方程分析求解；對任意 $k$ 、 $n$ ，若轉移矩陣 $A$ 可逆，則黎卡提方程就可根據矩陣特徵值進行分析求解，儘管特徵值可能要用數值計算才能找到。^[4]

在大多數情況下， $H$ 隨時間的演化是穩定的，也就是說 $H$ 會收斂到特定的常矩陣 $H *$ ，其他矩陣都有理時也可能是無理的。參見隨機控制#離散時間系統。

相關的黎卡提方程^[5]是

\mathbf {X} _{t+1}=-\left[\mathbf {E} +\mathbf {B} \mathbf {X} _{t}\right]\left[\mathbf {C} +\mathbf {A} \mathbf {X} _{t}\right]^{-1}

其中 $X, A, B, C, E$ 全都是 $n \times n$ 方陣。這個方程可以顯式求解。假設 $\mathbf {X} _{t}=\mathbf {N} _{t}\mathbf {D} _{t}^{-1}$ ，在 $t = 0$ 時 $N 0 = X 0$ 、 $D 0 = I$ 顯然成立。然後將其用於差分方程，得出

{\begin{aligned}\mathbf {X} _{t+1}&=-\left[\mathbf {E} +\mathbf {BN} _{t}\mathbf {D} _{t}^{-1}\right]\mathbf {D} _{t}\mathbf {D} _{t}^{-1}\left[\mathbf {C} +\mathbf {AN} _{t}\mathbf {D} _{t}^{-1}\right]^{-1}\\&=-\left[\mathbf {ED} _{t}+\mathbf {BN} _{t}\right]\left[\left[\mathbf {C} +\mathbf {AN} _{t}\mathbf {D} _{t}^{-1}\right]\mathbf {D} _{t}\right]^{-1}\\&=-\left[\mathbf {ED} _{t}+\mathbf {BN} _{t}\right]\left[\mathbf {CD} _{t}+\mathbf {AN} _{t}\right]^{-1}\\&=\mathbf {N} _{t+1}\mathbf {D} _{t+1}^{-1}\end{aligned}}

因此通過歸納法，形式 $\mathbf {X} _{t}=\mathbf {N} _{t}\mathbf {D} _{t}^{-1}$ 對所有 $t$ 都成立。那麼 $N$ 、 $D$ 的演化可寫為

{\begin{bmatrix}\mathbf {N} _{t+1}\\\mathbf {D} _{t+1}\end{bmatrix}}={\begin{bmatrix}-\mathbf {B} &-\mathbf {E} \\\mathbf {A} &\mathbf {C} \end{bmatrix}}{\begin{bmatrix}\mathbf {N} _{t}\\\mathbf {D} _{t}\end{bmatrix}}\equiv \mathbf {J} {\begin{bmatrix}\mathbf {N} _{t}\\\mathbf {D} _{t}\end{bmatrix}}

因此可歸納

{\begin{bmatrix}\mathbf {N} _{t}\\\mathbf {D} _{t}\end{bmatrix}}=\mathbf {J} ^{t}{\begin{bmatrix}\mathbf {N} _{0}\\\mathbf {D} _{0}\end{bmatrix}}

另見

參考文獻

^ Cull, Paul; Flahive, Mary; Robson, Robbie. Difference Equations: From Rabbits to Chaos. Springer. 2005. ch. 7. ISBN 0-387-23234-6.
^ Chiang, Alpha C. Fundamental Methods of Mathematical Economics 3rd. McGraw-Hill. 1984: 608–612. ISBN 9780070107809.
^ Balvers, Ronald J.; Mitchell, Douglas W. Reducing the dimensionality of linear quadratic control problems (PDF). Journal of Economic Dynamics and Control. 2007, 31 (1): 141–159 [2023-10-15]. doi:10.1016/j.jedc.2005.09.013. （原始內容存檔 (PDF)於2022-01-18）.
^ Vaughan, D. R. A nonrecursive algebraic solution for the discrete Riccati equation. IEEE Transactions on Automatic Control. 1970, 15 (5): 597–599. doi:10.1109/TAC.1970.1099549.
^ Martin, C. F.; Ammar, G. The geometry of the matrix Riccati equation and associated eigenvalue method. Bittani; Laub; Willems (編). The Riccati Equation. Springer-Verlag. 1991. ISBN 978-3-642-63508-3. doi:10.1007/978-3-642-58223-3_5.

[1] Cull, Paul; Flahive, Mary; Robson, Robbie. Difference Equations: From Rabbits to Chaos. Springer. 2005. ch. 7. ISBN 0-387-23234-6.

[2] Chiang, Alpha C. Fundamental Methods of Mathematical Economics 3rd. McGraw-Hill. 1984: 608–612. ISBN 9780070107809.

[3] Balvers, Ronald J.; Mitchell, Douglas W. Reducing the dimensionality of linear quadratic control problems (PDF). Journal of Economic Dynamics and Control. 2007, 31 (1): 141–159 [2023-10-15]. doi:10.1016/j.jedc.2005.09.013. （原始內容存檔 (PDF)於2022-01-18）.

[4] Vaughan, D. R. A nonrecursive algebraic solution for the discrete Riccati equation. IEEE Transactions on Automatic Control. 1970, 15 (5): 597–599. doi:10.1109/TAC.1970.1099549.

[5] Martin, C. F.; Ammar, G. The geometry of the matrix Riccati equation and associated eigenvalue method. Bittani; Laub; Willems (編). The Riccati Equation. Springer-Verlag. 1991. ISBN 978-3-642-63508-3. doi:10.1007/978-3-642-58223-3_5.

[1]

[2]

[3]

[4]

[5]