The (univariate) chain rule can be extended for multivariate functions, however, with each new argument, the complexity of the computation increases.
Definition
Chain Rule (Bivariate) ^definition-bivariate
Let be a bivariate function:
- Univariate dependency: If and , i.e. implicitly depends only on . Then the chain rule is:
- Multivariate dependency: However, if and depend on more than one variable, for example and then now depends on multiple variables. Then can be rewritten as . Now the chain rule is, essentially, using a monovarietal chain rule, twice:
Chain Rule (Multivariate) ^definition-multivariate
Let be a multivariate function:
- Univariate dependency: If , i.e. implicitly depends only on . Then the chain rule is:
- Multivariate dependency: However, if depend on more than one variable, i.e. then now depends on multiple variables. Then can be rewritten as . The chain rule has a matrix-like structure:
In matrix notation:
The last matrix is also known as the Jacobi Matrix
Functional Chain Rule (Multivariate) ^definition-functional-multivariate
Dependency Graphs
One way to visualise the multivariate chain rule is using (what I like to call) dependency graphs, where dependencies of variables are represented as lines. Then we need to simply trace all paths that end with that dependency.
For example, take to be multivariate function mapping and let (where is a function, not multiplying!), and . The dependency graph would look like
%%🖋 Edit in Excalidraw, and the dark exported image%%
Then to find, say , we simply need to trace all paths that start at and end at . Any type we traverse an edge we multiply by and any time we take a new path, we add:
%%🖋 Edit in Excalidraw, and the dark exported image%%
Examples
1: 1-variable Dependency Chain Rule
If , ,. Find at
Solution
Hence,
Now, to find it at
2: 2-variable dependancy Chain Rule
If , , . Find
Solution
Hence,