Multivariate Chain Rule

The (univariate) chain rule can be extended for multivariate functions, however, with each new argument, the complexity of the computation increases.

Definition

Chain Rule (Bivariate) ^definition-bivariate

Let $f (x, y)$ be a bivariate function: $R^{2} \to R$

Univariate dependency: If $x = g (t)$ and $y = h (t)$ , i.e. $f$ implicitly depends only on $t$ . Then the chain rule is:

$\frac{df}{d t} = \frac{\partial f}{\partial x} \frac{d x}{d t} + \frac{\partial f}{\partial y} \frac{d y}{d t}$

Multivariate dependency: However, if $x$ and $y$ depend on more than one variable, for example $x = g (r, s, t, \dots)$ and $y = h (r, s, t, \dots)$ then $f$ now depends on multiple variables. Then $f$ can be rewritten as $f (g (r, s, t, \dots), h (r, s, t, \dots))$ . Now the chain rule is, essentially, using a monovarietal chain rule, twice:

$\frac{\partial f}{\partial r} = \frac{\partial f}{\partial x} \frac{\partial x}{\partial r} + \frac{\partial f}{\partial y} \frac{\partial y}{\partial r} \frac{\partial f}{\partial s} = \frac{\partial f}{\partial x} \frac{\partial x}{\partial s} + \frac{\partial f}{\partial y} \frac{\partial y}{\partial s} \dots$

Chain Rule (Multivariate) ^definition-multivariate

Let $f (x_{1}, x_{2}, \dots x_{n})$ be a multivariate function: $R^{n} \to R$

Univariate dependency: If $x_{1} = g_{1} (t), x_{2} = g_{2} (t), \dots, x_{n} = g_{n} (t)$ , i.e. $f$ implicitly depends only on $t$ . Then the chain rule is:

$\frac{df}{d t} = \frac{\partial f}{\partial x _{1}} \frac{d x _{1}}{d t} + \frac{\partial f}{\partial x _{2}} \frac{d x _{2}}{d t} + \dots + \frac{\partial f}{\partial x _{n}} \frac{d x _{n}}{d t}$

Multivariate dependency: However, if $x_{i}$ depend on more than one variable, i.e. $x_{i} = g_{i} (t_{1}, t_{2}, t_{3}, \dots t_{n})$ then $f$ now depends on multiple variables. Then $f$ can be rewritten as $f (g_{1} (t_{1}, t_{2}, \dots, t_{m}), g_{2} (t_{1}, t_{2}, \dots, t_{m}), \dots g_{n} (\dots t_{m}))$ . The chain rule has a matrix-like structure:

$\frac{\partial f}{\partial t _{1}} = \frac{\partial f}{\partial x _{1}} \frac{\partial x _{1}}{\partial t _{1}} + \frac{\partial f}{\partial x _{2}} \frac{\partial x _{2}}{\partial t _{2}} + \dots + \frac{\partial f}{\partial x _{n}} \frac{\partial x _{n}}{\partial t _{m}} \frac{\partial f}{\partial t _{2}} = \frac{\partial f}{\partial x _{1}} \frac{\partial x _{1}}{\partial t _{2}} + \frac{\partial f}{\partial x _{2}} \frac{\partial x _{2}}{\partial t _{2}} + \dots + \frac{\partial f}{\partial x _{n}} \frac{\partial x _{n}}{\partial t _{m}} \dots$
In matrix notation:
$1 \times m [\frac{\partial f}{\partial t _{1}} \frac{\partial f}{\partial t _{2}} \dots \frac{\partial f}{\partial t _{m}}] = 1 \times n [\frac{\partial f}{\partial x _{1}} \frac{\partial f}{\partial x _{2}} \dots \frac{\partial f}{\partial x _{n}}] n \times m \frac{\partial x _{1}}{\partial t _{1}} \frac{\partial x _{2}}{\partial t _{1}} ⋮ \frac{\partial x _{n}}{\partial t _{1}} \frac{\partial x _{1}}{\partial t _{2}} \frac{\partial x _{2}}{\partial t _{2}} ⋮ \frac{\partial x _{n}}{\partial t _{2}} \dots \dots ⋱ \dots \frac{\partial x _{1}}{\partial t _{m}} \frac{\partial x _{2}}{\partial t _{m}} ⋮ \frac{\partial x _{n}}{\partial t _{m}}$
The last matrix is also known as the Jacobi Matrix

Functional Chain Rule (Multivariate) ^definition-functional-multivariate

Dependency Graphs

One way to visualise the multivariate chain rule is using (what I like to call) dependency graphs, where dependencies of variables are represented as lines. Then we need to simply trace all paths that end with that dependency.

For example, take $f (u, v, w)$ to be multivariate function mapping $R^{3} \to R$ and let $u = u (x, y, z)$ (where $u$ is a function, not multiplying!), $v = v (x, y, z)$ and $w = w (x, y, z)$ . The dependency graph would look like

|300 %%🖋 Edit in Excalidraw, and the dark exported image%%

Then to find, say $\frac{\partial f}{\partial x}$ , we simply need to trace all paths that start at $f$ and end at $x$ . Any type we traverse an edge $(a, b)$ we multiply by $\frac{\partial a}{\partial b}$ and any time we take a new path, we add: |300 %%🖋 Edit in Excalidraw, and the dark exported image%%

Examples

1: 1-variable Dependency Chain Rule

If $z = x^{2} - y^{2}$ , $x = sin (t)$ , $y = cos (t)$ . Find $\frac{d z}{d t}$ at $t = \frac{π}{6}$

Solution

$\frac{d z}{d t} = \frac{\partial z}{\partial x} \frac{d x}{d t} + \frac{\partial z}{\partial y} \frac{d y}{d t}$

$\frac{\partial z}{\partial x} = 2 x = 2 sin (t)$

$\frac{\partial z}{\partial y} = - 2 y = - 2 cos (t)$

$\frac{d x}{d t} = cos (t)$

$\frac{d y}{d t} = - sin (t)$

Hence,
$\frac{d z}{d t} = \frac{\partial z}{\partial x} \frac{d x}{d t} + \frac{\partial z}{\partial y} \frac{d y}{d t} = 2 sin (t) cos (t) + - 2 cos (t) \times - sin (t) = 4 sin (t) cos (t)$
Now, to find it at $t = \frac{π}{6}$
$\frac{d z}{d t}_{t = π /6} = 4 sin (\frac{π}{6}) cos (\frac{π}{6}) = 4 \times \frac{1}{2} \times \frac{3}{2} = 3$

2: 2-variable dependancy Chain Rule

If $z = e^{x} sinh (y)$ , $x = s t^{2}$ , $y = s^{2} t$ . Find $\frac{\partial z}{\partial s}$

Solution

$\frac{\partial z}{\partial s} = \frac{\partial z}{\partial x} \frac{\partial x}{\partial s} + \frac{\partial z}{\partial y} \frac{\partial y}{\partial s}$

$\frac{\partial z}{\partial x} = e^{x} sinh (y) = e^{s t^{2}} sinh (s^{2} t)$

$\frac{\partial z}{\partial y} = e^{x} cosh (y) = e^{s t^{2}} cosh (s^{2} t)$

$\frac{\partial x}{\partial s} = t^{2}$

$\frac{\partial y}{\partial s} = 2 s t$

Hence, $\frac{\partial z}{\partial s} = e^{s t^{2}} (sinh (s^{2} t) t^{2} + 2 s t cosh (s^{2} t))$

Questionably Accurate Notes

Explorer

Multivariate Chain Rule

Definition

Dependency Graphs

Examples

Table of Contents

Related Concepts

See Also:

Questionably Accurate Notes

Explorer

Multivariate Chain Rule

Definition §

Dependency Graphs §

Examples §

Table of Contents

Related Concepts

See Also:

Definition

Dependency Graphs

Examples