Search results
Results from the WOW.Com Content Network
The higher-dimensional chain rule is a generalization of the one-dimensional chain rule. If k , m , and n are 1, so that f : R → R and g : R → R , then the Jacobian matrices of f and g are 1 × 1 .
The chain rule applies in some of the cases, but unfortunately does not apply in matrix-by-scalar derivatives or scalar-by-matrix derivatives (in the latter case, mostly involving the trace operator applied to matrices). In the latter case, the product rule can't quite be applied directly, either, but the equivalent can be done with a bit more ...
In matrix calculus, Jacobi's formula expresses the derivative of the determinant of a matrix A in terms of the adjugate of A and the derivative ... by the chain rule, ...
Similar to the entropy or mutual information, the Fisher information also possesses a chain rule decomposition. In particular, if X and Y are jointly distributed random variables, it follows that: [ 22 ]
Reverse accumulation traverses the chain rule from outside to inside, or in the case of the computational graph in Figure 3, from top to bottom. The example function is scalar-valued, and thus there is only one seed for the derivative computation, and only one sweep of the computational graph is needed to calculate the (two-component) gradient.
Composable differentiable functions f : R n → R m and g : R m → R k satisfy the chain rule, namely () = (()) for x in R n. The Jacobian of the gradient of a scalar function of several variables has a special name: the Hessian matrix , which in a sense is the " second derivative " of the function in question.
The chain rule has a particularly elegant statement in terms of total derivatives. It says that, for two functions f {\displaystyle f} and g {\displaystyle g} , the total derivative of the composite function f ∘ g {\displaystyle f\circ g} at a {\displaystyle a} satisfies
For the example below, there are four sides: A, B, C and the final result ABC. A is a 10×30 matrix, B is a 30×5 matrix, C is a 5×60 matrix, and the final result is a 10×60 matrix. The regular polygon for this example is a 4-gon, i.e. a square: The matrix product AB is a 10x5 matrix and BC is a 30x60 matrix.