Papers
Topics
Authors
Recent
Assistant
AI Research Assistant
Well-researched responses based on relevant abstracts and paper content.
Custom Instructions Pro
Preferences or requirements that you'd like Emergent Mind to consider when generating responses.
Gemini 2.5 Flash
Gemini 2.5 Flash 165 tok/s
Gemini 2.5 Pro 57 tok/s Pro
GPT-5 Medium 39 tok/s Pro
GPT-5 High 37 tok/s Pro
GPT-4o 106 tok/s Pro
Kimi K2 185 tok/s Pro
GPT OSS 120B 445 tok/s Pro
Claude Sonnet 4.5 37 tok/s Pro
2000 character limit reached

Gradient of Characteristic Poly Coefficients

Updated 14 November 2025
  • Gradient of characteristic polynomial coefficients defines the matrix of partial derivatives, generalizing the classical adjugate via principal minors.
  • It unifies fundamental identities including the Cayley–Hamilton theorem and Newton identities into a comprehensive framework for matrix invariants.
  • The derived algebraic branching programs (ABPs) offer efficient computational tools, halving width and reducing size compared to previous methods.

The gradient of the coefficient of the characteristic polynomial, as developed in "On the gradient of the coefficient of the characteristic polynomial" (Ikenmeyer, 7 Nov 2025), generalizes the adjugate matrix and reveals deep structural identities relating to polynomial invariants of a matrix. The work introduces a unified framework based on the bivariate Cayley–Hamilton theorem, connecting principal minors, Newton identities, and novel algebraic branching program (ABP) constructions for the efficient computation of all coefficients of the characteristic polynomial over arbitrary commutative rings. The systematic paper of the matrix gradient and its consequences provides powerful combinatorial and algorithmic tools for linear algebra and computational complexity.

1. Definitions and Basic Properties

Let X=(Xij)1i,jnX=(X_{ij})_{1\leq i,j\leq n} be an n×nn\times n matrix over a commutative ring, and define the characteristic polynomial as

pX(λ)=det(λIX)=λnc1(X)λn1+c2(X)λn2+(1)ncn(X),p_X(\lambda) = \det(\lambda I - X) = \lambda^n - c_1(X)\lambda^{n-1} + c_2(X)\lambda^{n-2} - \cdots + (-1)^n c_n(X),

where ck(X)c_k(X) denotes the sum of all k×kk\times k principal minors of XX:

ck(X)=S{1,,n},S=kdetXS,S.c_k(X) = \sum_{S\subset\{1,\ldots,n\},\,|S|=k} \det X_{S,S}.

The gradient Xck(X)\nabla_X c_k(X) is the n×nn\times n matrix of partial derivatives:

[Xck(X)]ij=ck(X)Xij.[\nabla_X c_k(X)]_{ij} = \frac{\partial c_k(X)}{\partial X_{ij}}.

For k=nk=n, cn(X)=det(X)c_n(X)=\det(X), and Xdet(X)\nabla_X \det(X) is the cofactor matrix; its transpose is the classical adjugate. For k=n1k=n-1, cn1(X)=tr(X)c_{n-1}(X)=\operatorname{tr}(X) and Xcn1(X)=I\nabla_X c_{n-1}(X)=I.

2. The Bivariate Cayley–Hamilton Theorem

The bivariate Cayley–Hamilton theorem, a direct extension of the classical form, states that for 0dn10\leq d\leq n-1,

(Xcd+1(X))T=i=0d(1)icdi(X)Xi.(\nabla_X c_{d+1}(X))^T = \sum_{i=0}^d (-1)^i c_{d-i}(X) X^i.

For d=n1d = n-1, this recovers the identity

(Xdet(X))T=i=0n1(1)icn1i(X)Xi,(\nabla_X \det(X))^T = \sum_{i=0}^{n-1} (-1)^i c_{n-1-i}(X) X^i,

leading to the classical Cayley–Hamilton theorem:

i=0n(1)icni(X)Xi=0.\sum_{i=0}^n (-1)^i c_{n-i}(X) X^i = 0.

The proof proceeds via combinatorial expansions, comparing terms based on cycle covers and their unions, with involutive cancellation yielding equality between both sides.

3. Combinatorial Formulas for the Gradient Entries

Two principal forms describe the entries of Xck(X)\nabla_X c_k(X):

  • Minor-Sum Form:

[Xck]ij=S[n],S=k,i,jS(1)r+sdetXS{i},S{j},[\nabla_X c_k]_{ij} = \sum_{\substack{S\subset[n],\,|S|=k,\,i,j\in S}} (-1)^{r+s} \det X_{S\setminus\{i\},\,S\setminus\{j\}},

where rr (resp. ss) is the position of ii (resp. jj) in the ordered set SS.

  • Permutation-Sum Form:

ck(X)=I[n],I=kπ:IIsgn(π)aIXa,π(a),c_k(X) = \sum_{I\subset[n],\,|I|=k} \sum_{\pi: I \to I} \operatorname{sgn}(\pi) \prod_{a\in I} X_{a,\pi(a)},

yielding

ck(X)Xij=I[n],I=k,i,jIπ:II,π(i)=jsgn(π)aI{i}Xa,π(a).\frac{\partial c_k(X)}{\partial X_{ij}} = \sum_{\substack{I\subset[n],\,|I|=k,\,i,j\in I}} \sum_{\pi:I\to I,\,\pi(i)=j} \operatorname{sgn}(\pi) \prod_{a\in I\setminus\{i\}} X_{a,\pi(a)}.

These representations interpolate between minor-based and permutation-based expansions, offering insights into both algebraic and combinatorial structure.

4. Low-Dimensional Illustrative Cases

For explicit computation and verification:

  • n=2n=2:
    • c1=tr(X)c_1 = \operatorname{tr}(X), c1=I2\nabla c_1 = I_2
    • c2=det(X)c_2 = \det(X), c2=[x22x21 x12x11]\nabla c_2 = \begin{bmatrix} x_{22} & -x_{21} \ -x_{12} & x_{11} \end{bmatrix}
  • n=3n=3:

c1=x11+x22+x33c_1 = x_{11} + x_{22} + x_{33}, c1=I3\nabla c_1 = I_3 c2=i<j(xiixjjxijxji)c_2 = \sum_{i<j}(x_{ii}x_{jj} - x_{ij}x_{ji}) For example, c2x12=x21x31\frac{\partial c_2}{\partial x_{12}} = -x_{21} - x_{31}.

The cases d=1d=1 and d=2d=2 in the bivariate formula correspond to classical Newton identities and the trace version of Cayley–Hamilton.

5. Algebraic Branching Programs for Characteristic Coefficients

The matrix recurrence derived from the bivariate Cayley–Hamilton identity enables the efficient construction of ABPs computing ck(X)c_k(X):

  • Applying the identity as a matrix recurrence in dd yields a homogeneous ABP of width O(n2)O(n^2) and size O(n2d2)O(n^2 d^2) simultaneously computing cn,0,cn,1,,cn,dc_{n,0},c_{n,1},\dots,c_{n,d} (Corollary 2.7).
  • A refined block recurrence, applied to the last row of (cn,d+1)T(\nabla c_{n,d+1})^T, yields an ABP of width at most n2/2+O(n)n^2/2 + O(n) and size approximately n2d/2d3/6+n^2 d /2 - d^3/6 + \ldots, totaling for d=nd=n around n3/3n^3/3 (Theorem 4.1).
  • This improves substantially over the Mahajan–Vinay 1997 ABP for characteristic polynomial coefficients, which has width n2n^2 and size n3\sim n^3. The new construction halves the width and reduces the total size by roughly a factor of three.
  • The ABP constructions in this framework eschew the ad hoc combinatorics of "clow sequences," instead exploiting gradient-based algebraic identities for a unified approach.

6. Unified Perspective and Algorithmic Significance

The systematic paper of the gradient matrix Xck(X)\nabla_X c_k(X) and its transpose yields a unified recurrence encapsulating the identities underlying the Cayley–Hamilton theorem, Newton’s formulas, and principal minor relationships. The approach demonstrates that all coefficients of the characteristic polynomial, not just the determinant, admit ABPs of polynomial size over arbitrary commutative rings. This yields the currently smallest known ABPs for these invariants, built using purely algebraic (rather than combinatorial) mechanisms. The framework directly resolves the open question of Mahajan–Vinay (1999) concerning the necessity of clow sequences in these constructions, establishing their non-essentiality.

7. Connections and Extensions

The combinatorial arguments in the gradient-based approach exhibit structural similarities with earlier constructions by Straubing and Zeilberger, manifesting involutive cancellations and recursion patterns familiar from those works. The recurrence unifies previously independent results (Cayley–Hamilton theorem, Girard–Newton identities, and ABP existence for characteristic coefficients) under a single algebraic and algorithmic principle. This methodological shift offers new avenues for investigating algebraic complexity and invariant theory in matrix analysis over arbitrary commutative rings.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (1)
Forward Email Streamline Icon: https://streamlinehq.com

Follow Topic

Get notified by email when new papers are published related to Gradient of the Coefficient of the Characteristic Polynomial.