Applying integration by substitution only works over single integrals. To do it in higher dimensions, we need to involve an extra component: the Jacobian.
Definition
Integration Substitution (Bivariate) ^definition
Let be a bivariate scalar function, . Let be a mapping that transforms coordinate systems, i.e. but it represents with a different coordinate system. Let the new variables be . Then . , i.e. and .
Then, for a double integral we have:
- is the pre-image of under :
- is the Jacobi matrix. is the Jacobian.
- The most common substitution is switching to polar coordinates
Integration Substitution (Trivariate) ^definition
Let be a trivariate scalar function, . Let be a mapping that transforms coordinate systems, i.e. but it represents with a different coordinate system. Let the new variables be . Then . , i.e. and
Then, for a triple integral we have:
- is the pre-image of under :
- is the Jacobi matrix. is the Jacobian.
- The most common substitutions are cylindrical coordinates () and spherical coordinates ()
Integration Substitution (Multivariate) ^definition
Let be a multivariate scalar function, . Let be a mapping that transforms coordinate systems, i.e. but it represents with a different coordinate system,
Then, for a given integral (single,double,triple, etc.) we have:
- is the pre-image of under : #TODO
Common Substitutions
Polar Substitution
Cylindrical Substitution
Spherical Substitution
Derivation
Let’s start by assuming we have a function in Cartesian coordinates, and we need to compute its double integral over some domain . To make things simple, let’s assume that , i.e. is only defined in .
And let’s say we want to switch this to a different coordinate system instead of , represented by the conversion:
We can represent this as a mapping (which is just a function, really). Let .
Applying to this new mapping essentially results in a functional composition i.e. we have . The rules of functional composition state that we need the range of and to be the domain of (which is ).
So we need , that is, has some domain which must map to . is known as the pre-image of
%%🖋 Edit in Excalidraw, and the light exported image%%
This requires to be an injective mapping, since we don’t want two different pairs and to both map to the same . We also require to be a smooth function (#tosee why?)
Now, consider a small section of . We take it to be a rectangle with bottom left corner () and side lengths and .
%%🖋 Edit in Excalidraw, and the dark exported image%%
What happens to this rectangle after the mapping ? Under the mapping , the bottom and left edges now become parametric curves:
- The bottom left corner now maps to
- The bottom edge now becomes a parametric curve . Because the original edge had being constant (to the value ), the transformed curve also has constant. Hence it is only a single-parameter equation (and hence a curve, and not a parametric surface!).
- Similarly, the left edge is now a parametric edge .
%%🖋 Edit in Excalidraw, and the dark exported image%%
Hence, we can construct tangent vectors to these parametric curves. Recall the formula for a tangent vector:
So, we have two tangent vectors, and , at the bottom left corner
Note that we are now in the Cartesian coordinate space, and we are treating and as parameters!
We can approximate the area of the transformed rectangle by using the cross product of the tangent vectors, scaled by . This is because the geometry of cross product produces a parallelogram, and the magnitude is equivalent to the area of the paralellogram.
%%🖋 Edit in Excalidraw, and the dark exported image%%
The cross product expands to
The Jacobian reappears!
When we integrate, this approximation becomes closer and closer to the actual , until they are equal. Then we have:
So we finally have: