The microfacet theory is a fundamental component of physically based rendering. Unfortunately, it is often perceived as being rather complicated. There are several reasons for this. The first one is that most papers only focus on a single component (such as the distribution of normals or the masking function), while glossing over the rest. The second problem is the lack of the geometrical interpretations of mathematical expressions, which inhibits understanding at the intuitive level. In the course of any serious work on the extensions or applications of the theory (such as Smith’s approximation, multiple scattering, bump roughness, microsurface transformations, or area lighting), the absence of a solid grasp of the fundamentals will manifest itself through inefficiency (or, worse yet, ineffectiveness). This article aims to address these issues by collecting the fragments from the literature and explaining how they fit together.
Microsurface and Macrosurface
Pick an object, perhaps one that sits on your desk or lives in your head. Focus on a tiny fragment of its surface. If this fragment is sufficiently small, or is sufficiently far away, it will look effectively flat (but not necessarily smooth). Therefore, to a negligible degree of error, the fragment may be replaced by a first-order approximation1 – its orthogonal projection onto the averaged tangent plane. In the literature, this projected surface is referred to as the macrosurface (see Fig. 1 below). It is characterized by the area $\small A$ and the unit normal vector2 $\small \bm{n}$. Fixing the origin and two (linearly independent, but not necessarily unit or orthogonal) tangent vectors is sufficient to complete the parameterization of the macrosurface.
In contrast, the surface fragment overlaid onto the macrosurface is called3 the microsurface $\small \mathbb{M^2}$. It is composed of infinitesimal elements called microfacets. In the microfacet theory, a microsurface is not modeled explicitly, but rather represented by a statistical model.
Distribution of Normals
Let $\small \bm{m}$ denote the unit normal vector of the microsurface. Since it may correspond to several distinct microfacets, we must define $\small dA(\bm{m}) = D(\bm{m}) A d\Omega(\bm{m})$ as the differential area of the portion of the microsurface perpendicular to $\small \bm{m}$, where $\small d\Omega$ is the differential solid angle centered on $\small \bm{m}$, and $\small D$ is the distribution of normals4 (abbreviated as the NDF) associated with the microsurface5. The domain of this distribution, along with the microsurface normals themselves, is typically restricted to the unit hemisphere $\small \mathbb{H^2}$ (with $\small \bm{n}$ pointing at the zenith), which requires $\small \mathbb{M^2}$ to be a height field. However, this restriction is not strictly necessary; we shall demonstrate this by letting the microsurface normals potentially cover the entire unit sphere $\small \mathbb{S^2}$.
A valid microsurface and, thus, a valid NDF, must obey the signed (orthogonally) projected area constraint
$$ \tag{1a} \begin{aligned} &\bm{v} \cdot \bm{n} A \cr = &\bm{v} \cdot \int_{\bm{p} \in \mathbb{M^2}} \bm{m}(\bm{p}) dA(\bm{p}) \cr = &\bm{v} \cdot \int_{\bm{m} \in \mathbb{S^2}} \bm{m} dA(\bm{m}) \cr = &\bm{v} \cdot \int_{\bm{m} \in \mathbb{S^2}} \bm{m} D(\bm{m}) A d\Omega(\bm{m}) \end{aligned} $$
for all $\small \bm{v}$. Expressed this way, it is clear that the constraint is geometric (coordinate-independent, basis-independent) in nature:
$$ \tag{1b} \bm{n} = \int_{\bm{m} \in \mathbb{S^2}} \bm{m} D(\bm{m}) d\Omega(\bm{m}). $$
This is the definition of the macrosurface normal $\small \bm{n}$ for a given microsurface. In turn, substitution $\small \bm{v} = \bm{n}$ in Eqn. 1a yields the macrosurface area $\small A$.
Eqn. 1 tells us a few things. Geometrically, it says that the signed projected areas of the microsurface and the macrosurface must coincide in any given direction. More specifically, by substituting a constant NDF (or, from the definition), we can see that it is measured in units of reciprocal solid angle.
In the special case of $\small \bm{v} = \bm{n}$, we obtain
$$ \tag{1c} 1 = \int_{\bm{m} \in \mathbb{S^2}} (\bm{n} \cdot \bm{m}) D(\bm{m}) d\Omega(\bm{m}). $$
In order for $\small (\bm{n} \cdot \bm{m}) D(\bm{m})$ to be a valid probability density function, it must be non-negative for all $\small \bm{m}$. By definition, $D \ge 0$, while $\small (\bm{n} \cdot \bm{m}) \ge 0$ if and only if $\small \mathbb{M^2}$ is a height field.
One of the simplest examples of a valid microsurface is a box6, with the flipped bottom face playing the role of the macrosurface. It is instructive to analyze its two-dimensional counterpart – a rectangle (see Fig. 2 above).
This trivial example clearly demonstrates that a microsurface does not have to be smooth (continuously differentiable); however, that ensures the continuity of the NDF, which is desirable, unless the goal is to model a flat surface.
Linearity of Eqn. 1 allows us to consider the individual microfacets separately and to sum up their contributions. As we apply it to the box along some $\small \bm{v}$, observe that the signed projected areas of the opposite faces cancel each other, while the top remains equivalent to the bottom. Another consequence is that an application of a linear transformation (that transforms a box into a parallelepiped, in our case) also generates a valid combination of the microsurface and the macrosurface. We can see why that is the case by picturing Eqn. 1 geometrically and (passively) transforming the coordinate axes7 rather than (actively transforming) the surface itself (see Fig. 3 below). Of course, while the surface lines remain unchanged, the signed projected areas (and, thus, the values of the NDF) do not stay the same.
Erasing any part of the box leads to a signed projected area mismatch for certain angles. Therefore, the constraint may seem to imply that the microsurface must be continuous, but, unfortunately, that is not the case. The issue lies in the translation invariance of Eqn. 1; in particular, it means that the orthogonally projected area of an object is independent of its location. This property may seem innocuous at first, but, coupled with the linearity, it spells disaster: we may freely translate different microfacets in different directions without affecting the value of the integral.
Masking Function
Since the signed projected area is an incomplete description of a real surface (as opposed to a microfacet soup or salad8), we may additionally specify its visible (orthogonally) projected area
$$ \tag{2a} \begin{aligned} &\bm{v} \cdot \int_{\bm{p} \in \mathbb{M^2}} \bm{m}(\bm{p}) V(\bm{v}, \bm{p}) dA(\bm{p}) \cr = &\bm{v} \cdot \int_{\bm{m} \in \mathbb{S^2}} \bm{m} G_1(\bm{v}, \bm{m}) dA(\bm{m}) \cr = &\bm{v} \cdot \int_{\bm{m} \in \mathbb{S^2}} \bm{m} G_1(\bm{v}, \bm{m}) D(\bm{m}) A d\Omega(\bm{m}), \end{aligned} $$
which depends on the relative positions of the microfacets (see Fig. 4 below). It is defined in terms of the dimensionless masking function $\small G_1(\bm{v}, \bm{m})$ that gives the fraction of the differential area $\small dA(\bm{m})$ of the portion of the microsurface perpendicular to $\small \bm{m}$ that happens to be visible along $\small \bm{v}$. In other words, it is the average visibility (along $\small \bm{v}$) of the microfacets with the normal $\small \bm{m}$. The masking function is closely related to the binary visibility function $\small V(\bm{v}, \bm{p})$ that outputs 0 if the point $\small \bm{p}$ is occluded along $\small \bm{v}$, and 1 otherwise. Both functions take self-occlusion into account: $\small V = G_1 = 0$ if $\small (\bm{v} \cdot \bm{m}) \le 0$.
The masking and visibility functions possess an important property called stretch invariance, or, more generally, invariance under linear transformations. We have already seen that, in the context of the microfacet theory, a linearly transformed surface remains valid; however, unlike the NDF, the masking function is dimensionless: it only encodes visibility, and so it remains unaffected by a transformation of the coordinate axes. Of course, this implies that the transformation has been applied to everything, including the view vector $\small \bm{v}$.
Inherent Assumptions
Eqn. 1 and 2 are closely related. For a valid microsurface, the values of the integrals are the same if $\small \bm{v} = \bm{n}$:
$$ \tag{2b} \begin{aligned} 1 = &\int_{\bm{m} \in \mathbb{S^2}} (\bm{n} \cdot \bm{m}) D(\bm{m}) d\Omega(\bm{m}) \cr = &\int_{\bm{m} \in \mathbb{S^2}} (\bm{n} \cdot \bm{m}) G_1(\bm{n}, \bm{m}) D(\bm{m}) d\Omega(\bm{m}). \end{aligned} $$
In the special case of a height field, $\small G_1(\bm{n}, \bm{m}) = 1$.
For a continuous surface, the visible projected area is greater or equal to the signed projected area. This inequality stems from self-occlusion, which eliminates the (formerly negative) contribution of back-facing microfacets. The two types of projected areas coincide if and only if the view angle is sufficiently steep, or the microsurface – sufficiently thin, so that the latter does not extend outside the volume swept by the macrosuface translated along the view vector. The difference of areas can be reduced by flattening or tiling the microsurface (see Fig. 5 above and Fig. 6 below); both methods effectively reduce its height relative to the dimensions of the macrosurface.
Of course, as the view angle tends to 90 degrees, the signed projected area tends to zero, while the visible projected area becomes proportional to the height of the microsurface. This leads to one of the key assumptions of the microfacet theory: a microsurface is infinitesimally thick. This limitation is particularly apparent at grazing angles, and should be familiar to those who have practical experience with bump and normal mapping.
If we want to posit that Eqn. 1a and 2a are equivalent, we must also prevent the signed projected area from turning negative by restricting the view angles to the range where $\small (\bm{v} \cdot \bm{n}) > 0$. In other words, a microsurface is always facing the observer. If that is not the case, we must recompute the statistics for the undersurface.
It is easy to see that reversing the directions of both $\small \bm{n}$ and $\small \bm{m}$ allows us to reuse the distribution of normals. As for the view vector, there are a couple of options: we can either reverse the direction of $\small \bm{v}$, or reflect it across the macrosurface. Since flipping a surface upside down alters visibility, neither method yields exact results; however, the first one is preferable, since it guarantees that self-occlusion is properly accounted for. In practice, the resulting approximation is applicable to complex, unstructured surfaces; you should not expect it to be accurate for a simple repeating pattern.
Distribution of Visible Normals
With the assumption (that all projected surface areas are the same) in place, we can combine Eqn. 1a and 2a into
$$ \tag{3a} 1 = \int_{\bm{m} \in \mathbb{S^2}} D_{vis}(\bm{v}, \bm{n}, \bm{m}) d\Omega(\bm{m}), $$
where
$$ \tag{3b} D_{vis}(\bm{v}, \bm{n}, \bm{m}) = \frac{(\bm{v} \cdot \bm{m})}{(\bm{v} \cdot \bm{n})} G_1(\bm{v}, \bm{m}) D(\bm{m}) $$
is called the distribution of visible normals9 (abbreviated as the VNDF). Its meaning becomes clear in the context of Eqn. 2a: $\small ((\bm{v} \cdot \bm{m})/(\bm{v} \cdot \bm{n}))^{-1} D_{vis}(\bm{v}, \bm{n}, \bm{m}) A d\Omega(\bm{m})$ is the differential area of the portion of the microsurface perpendicular to $\small \bm{m}$ that happens to be visible along $\small \bm{v}$. Like $\small (\bm{n} \cdot \bm{m}) D(\bm{m})$, the VNDF is a probability density function; but because it is always non-negative, this property is not limited to height fields.
General BSDF
A VNDF can be used to construct a bidirectional scattering distribution function $\small f_s$. By definition, a BSDF is a ratio of the differential outgoing radiance to the differential incident irradiance, the latter being the product of the incident radiance and the projected differential solid angle $\small d\Omega_n(\bm{l}) = \vert \bm{n} \cdot \bm{l} \vert d\Omega(\bm{l})$:
$$ \tag{4a} f_s(\bm{v}, \bm{n}, \bm{l}) = \frac{dL(\bm{v})}{dE_n(\bm{l})} = \frac{dL(\bm{v})}{L(\bm{l}) d\Omega_n(\bm{l})}. $$
In other words, the outgoing radiance is a weighted average of the incident radiance over the surface of the unit sphere (or the two-sided unit disk, if you prefer):
$$ \tag{4b} L(\bm{v}) = \int_{\bm{l} \in \mathbb{S^2}} f_s(\bm{v}, \bm{n}, \bm{l}) L(\bm{l}) d\Omega_n(\bm{l}). $$
In order for a BSDF to be physically meaningful, it must satisfy three requirements10:
$$ \tag{4c} \begin{aligned} \textit{non-negativity: } &f_s \ge 0; \cr \textit{reciprocity: } &\frac{f_s(\bm{v}, \bm{n}, \bm{l})}{f_s(\bm{l}, \bm{n}, \bm{v})} = \frac{\eta_v^2}{\eta_l^2}; \cr \textit{energy conservation: } &\int_{\bm{v} \in \mathbb{S^2}} f_s(\bm{v}, \bm{n}, \bm{l}) d\Omega_n(\bm{v}) \le 1; \end{aligned} $$
where $\small \eta_v$ and $\small \eta_l$ are the real11 indices of refraction (abbreviated as the IOR) associated with the directions of exitance (along the view vector $\small \bm{v}$) and incidence (along the light vector $\small \bm{l}$), respectively. In particular, reciprocity is a direct consequence of the law of refraction (also valid for reflection)
$$ \tag{5a} \eta_v \Vert \bm{v} \times \bm{n} \Vert = \eta_l \Vert \bm{l} \times \bm{n} \Vert, $$
which leads to the compression of projected solid angles
$$ \tag{5b} \eta_v^2 d\Omega_n(\bm{v}) = \eta_l^2 d\Omega_n(\bm{l}), $$
and, thus, a change in energy density (provided $\small \eta_v \neq \eta_l$). Note that the relationship between the ordinary solid angles is not quite as simple:
$$ \tag{5c} \eta_v^2 d\Omega(\bm{v}) \neq \eta_l^2 d\Omega(\bm{l}). $$
This provides a strong motivation to utilize the projected solid angle measure. Failure to do so consistently often results in complicated expressions that may be difficult to interpret and are prone to errors.
The properties of a valid BSDF are by no means obvious. In particular, ensuring reciprocity is a non-trivial task. A typical approach represents the BSDF as a sum of two components: reflection (the BRDF $\small f_r$) and transmission (the BTDF $\small f_t$). Its principal advantage lies in the fact that the reflection component is always symmetric: since $\small \bm{v}$ and $\small \bm{l}$ always point away from the surface, $\small \eta_v = \eta_l$ and, therefore, $\small f_r(\bm{v}, \bm{n}, \bm{l}) = f_r(\bm{l}, \bm{n}, \bm{v})$.
Smooth Specular BSDF
In order to be perfectly clear, we shall illustrate these properties using a concrete example. Consider a perfectly smooth, planar surface. Its BSDF (sometimes referred to as the smooth specular BSDF) can be expressed in terms of the Dirac delta “function” $\small \delta$ defined as a projected solid angle measure by the equation
$$ \tag{6} f(\bm{v}) = \int_{\bm{l} \in \mathbb{S^2}} f(\bm{l}) \delta_{\Omega_n}(\bm{v} - \bm{l}) d\Omega_n(\bm{l}) $$
valid for any function $\small f: \mathbb{S^2} \to \mathbb{R}$. Note its similarity to Eqn. 4b.
Now, according to the law of reflection, the view and the light angles must be the same:
$$ \tag{7a} \sin{\theta_v} = \Vert \bm{v} \times \bm{n} \Vert = \Vert \bm{l} \times \bm{n} \Vert, \quad \cos{\theta_v} = \vert \bm{v} \cdot \bm{n} \vert = \vert \bm{l} \cdot \bm{n} \vert. $$
On the other hand, we need to distinguish between the IORs associated with the exterior (above the surface) and the interior (below the surface) of the object:
$$ \tag{7b} \eta_v = \eta_l, \quad \eta_t = \begin{cases} \eta_{int} &\text{if } \eta_v = \eta_{ext}, \cr \eta_{ext} &\text{otherwise}. \end{cases} $$
Using this formalism, the smooth specular BRDF can be expressed as
$$ \tag{7c} f_r(\bm{v}, \bm{n}, \bm{l}) = F(\theta_v, \eta_v/\eta_t) \delta_{\Omega_n}\negmedspace\left( \bm{n} - \frac{\bm{v} + \bm{l}}{(\bm{v} + \bm{l}) \cdot \bm{n}} \right), $$
where $\small 0 \le F \le 1$ is the dimensionless Fresnel reflectance. This BRDF is non-negative, symmetric, and energy-conserving, which can be verified by substituting Eqn. 7c into 4c. In particular,
$$ \tag{7d} \int_{\bm{l} \in \mathbb{S^2}} f_r(\bm{v}, \bm{n}, \bm{l}) d\Omega_n(\bm{l}) = F(\theta_v, \eta_v/\eta_t) \le 1. $$
The trigonometric form of the Fresnel term is
$$ \tag{8a} F(\theta_v, \eta_v/\eta_t) = (1 - \alpha) R_s(\theta_v, \eta_v/\eta_t) + \alpha R_p(\theta_v, \eta_v/\eta_t), $$
and its mutually perpendicular components are defined as follows:
$$ \tag{8b} \begin{aligned} R_s(\theta_v, \eta_v/\eta_t) &= \left| \frac{\cos{\theta_v} / \cos{\theta_t} - \sin{\theta_t} / \sin{\theta_v}}{\cos{\theta_v} / \cos{\theta_t} + \sin{\theta_t} / \sin{\theta_v}} \right|^2 = \left| \frac{\sin(2 \theta_v) - \sin(2 \theta_t)}{\sin(2 \theta_v) + \sin(2 \theta_t)} \right|^2, \cr R_p(\theta_v, \eta_v/\eta_t) &= \left| \frac{\cos{\theta_t} / \cos{\theta_v} - \sin{\theta_t} / \sin{\theta_v}}{\cos{\theta_t} / \cos{\theta_v} + \sin{\theta_t} / \sin{\theta_v}} \right|^2 = \left| \frac{\sin(\theta_v - \theta_t)}{\sin(\theta_v + \theta_t)} \right|^2. \end{aligned} $$
The value of $\small \alpha$ depends on the polarization state of the incident light: $\small \alpha = 0$ corresponds to the s-polarized light, $\small \alpha = 1$ – to the p-polarized light, and $\small \alpha = 1/2$ – to the unpolarized (natural) light.
The angle of refraction (or transmission) $\small \theta_t$ can be determined from the view angle $\small \theta_v$ and the relative IOR $\small \eta_v/\eta_t$ using the law of refraction and basic trigonometry:
$$ \tag{8c} \sin{\theta_t} = \frac{\eta_v}{\eta_t} \sin{\theta_v}, \quad \cos{\theta_t} = \sqrt{1 - \sin^2{\theta_t} }. $$
If you have a physics background, the labeling of the angles and the indices of refraction may appear unconventional, since Eqn. 8a-8c are typically expressed in terms of the direction of incidence rather than exitance. The present notation is motivated by the definition of the BSDF given by Eqn. 4a-4b, where the view vector is fixed. This alteration is enabled by the symmetry of the Fresnel equations, which makes Eqn. 8 invariant under the exchange of the subscripts $\small v \text{ and } t$. Intuitively, the Fresnel reflectance (which is a ratio of two irradiance values) depends only on the path taken by light, and is unaffected by the reversal of its direction.
The definition of the transmission component of the smooth specular BSDF is marginally more complicated. Let
$$ \tag{9a} \begin{aligned} \sin{\theta_v} &= \Vert \bm{v} \times \bm{n} \Vert, & \cos{\theta_v} &= \vert \bm{v} \cdot \bm{n} \vert, \cr \sin{\theta_t} &= \Vert \bm{l} \times \bm{n} \Vert, & \cos{\theta_t} &= \vert \bm{l} \cdot \bm{n} \vert. \end{aligned} $$
By extension, we shall associate
$$ \tag{9b} \eta_t = \eta_l. $$
After taking the reciprocity and the conservation of energy into account, the smooth specular BTDF can be expressed as
$$ \tag{9c} f_t(\bm{v}, \bm{n}, \bm{l}) = \frac{\eta_v^2}{\eta_l^2} \big( 1 - F(\theta_v, \eta_v/\eta_t) \big) \delta_{\Omega_n}\big( \eta_v (\bm{v} \times \bm{n}) + \eta_l (\bm{l} \times \bm{n}) \big). $$
The first term is responsible for the compression of projected solid angles, which can be readily verified by placing the surface inside a white furnace (by setting $\small L(\bm{l}) = 1$ for all $\small \bm{l}$) and evaluating Eqn. 4b.
Is this BTDF reciprocal? Naive substitution of Eqn. 9c into 4c leads to
$$ \tag{9d} \frac{f_t(\bm{v}, \bm{n}, \bm{l})}{f_t(\bm{l}, \bm{n}, \bm{v})} = \frac{\eta_v^4}{\eta_l^4} \frac{ \delta_{\Omega_n}\big( \eta_v (\bm{v} \times \bm{n}) + \eta_l (\bm{l} \times \bm{n}) \big) }{ \delta_{\Omega_n}\big( \eta_l (\bm{l} \times \bm{n}) + \eta_v (\bm{v} \times \bm{n}) \big) }. $$
Unfortunately, Eqn. 9d fails to make it apparent that these two Dirac delta “functions” are not the same: their purpose is to measure two different projected solid angles (each associated with the vector in the last slot), and their ratio is precisely the same as the missing factor of $\small \eta_l^2 / \eta_v^2$. To see why that is the case, we must recall that Eqn. 7c and 9c are only valid in the context of Eqn. 6, which is an integral over the domain of the directions of incidence. If we use the reciprocal of a BSDF, the directions of incidence and exitance are interchanged, and Eqn. 6 must also be modified accordingly.
A better way to verify reciprocity is by combining Eqn. 4c, 5b, and 6:
$$ \tag{10a} \int_{\bm{v’} \in \mathbb{S^2}} f_s(\bm{v’}, \bm{n}, \bm{l}) d\Omega_n(\bm{v’}) = \int_{\bm{l’} \in \mathbb{S^2}} f_s(\bm{l’}, \bm{n}, \bm{v}) d\Omega_n(\bm{l’}). $$
If $\small \bm{v}$, $\small \bm{n}$, and $\small \bm{l}$ do not define a valid light path, both integrals vanish. Otherwise, substitution of Eqn. 9c results in identical values. In particular,
$$ \tag{10b} \begin{aligned} \int_{\bm{l} \in \mathbb{S^2}} f_t(\bm{l}, \bm{n}, \bm{v}) d\Omega_n(\bm{l}) = \int_{\bm{l} \in \mathbb{S^2}} \frac{\eta_{l}^2}{\eta_v^2} f_t(\bm{v}, \bm{n}, \bm{l}) d\Omega_n(\bm{l}) = \big( 1 - F(\theta_v, \eta_v/\eta_t) \big) \le 1. \end{aligned} $$
Since Eqn. 7d and 10b sum up to 1, we have just demonstrated that the full BSDF is energy-conserving.
If you find reciprocity daunting, you are not alone. For instance, Eqn. 18 and 21 in Bruce Walter’s widely cited paper about microfacet models for refraction are missing a factor of $\small \eta_v^2 / \eta_l^2$.
Rough Surface BSDF
We can model a rough microsurface by treating it as locally planar1. The amount of radiance scattered by the microsurface can then be expressed as a weighted average of the contributions of its visible microfacets (see Fig. 6 above).
To better understand this approach, recall that, by definition, the radiance is the amount of power transmitted in a certain direction, per unit solid angle associated with this direction, per unit area perpendicular to this direction. If the source of (scattered) light is very small (or very far away), it will appear point-like, and the variation of the view vector across its surface can be safely neglected. Thus,
$$ \tag{11a} L(\bm{v}) = \frac{\bm{v} \cdot \int_{\bm{p} \in \mathbb{M^2}} \bm{m}(\bm{p}) L(\bm{v}, \bm{p}) V(\bm{v}, \bm{p}) dA(\bm{p})} {\bm{v} \cdot \int_{\bm{p} \in \mathbb{M^2}} \bm{m}(\bm{p}) V(\bm{v}, \bm{p}) dA(\bm{p})}, $$
which is normalized with respect to the visible projected area to convert the intensity into the radiance.
The general expression of the outgoing radiance term in the numerator of Eqn. 11a is given by the spatially varying version of Eqn. 4b:
$$ \tag{11c} L(\bm{v}, \bm{p}) = \int_{\bm{l} \in \mathbb{S^2}} f_s(\bm{v}, \bm{m}(\bm{p}), \bm{l}) L(\bm{l}, \bm{p}) d\Omega_m(\bm{l}). $$
Unfortunately, Eqn. 11c is recursive. In our case, this means that light scattered by one microfacet can illuminate another in a process known as multiple scattering. For simplicity, and at the cost of correctness, this effect is often neglected, with only the distant illumination (and, thus, only single scattering) taken into account:
$$ \tag{11d} L(\bm{v}, \bm{p}) \approx \int_{\bm{l} \in \mathbb{S^2}} f_s(\bm{v}, \bm{m}(\bm{p}), \bm{l}) L(\bm{l}) V(\bm{l}, \bm{p}) d\Omega_m(\bm{l}). $$
These approximations allow us to rewrite Eqn. 11a as
$$ \tag{11e} L(\bm{v}) \approx \frac{\bm{v} \cdot \int_{\bm{p} \in \mathbb{M^2}} \bm{m}(\bm{p}) V(\bm{v}, \bm{p}) \int_{\bm{l} \in \mathbb{S^2}} f_s(\bm{v}, \bm{m}(\bm{p}), \bm{l}) L(\bm{l}) V(\bm{l}, \bm{p}) d\Omega_m(\bm{l}) dA(\bm{p})} {\bm{v} \cdot \int_{\bm{p} \in \mathbb{M^2}} \bm{m}(\bm{p}) V(\bm{v}, \bm{p}) dA(\bm{p})}. $$
Rough Specular BSDF
Substitution of the definition of the smooth specular BSDF given by Eqn. 7c and 9c allows us to evaluate the inner integral of Eqn. 11e analytically. Starting with the reflection component,
$$ \tag{12} L_r(\bm{v}) = \frac{\bm{v} \cdot \int_{\bm{p} \in \mathbb{M^2}} \bm{m}(\bm{p}) F(\theta_v, \eta_v/\eta_t) L(\bm{r}) V(\bm{r}, \bm{p}) V(\bm{v}, \bm{p}) dA(\bm{p})} {\bm{v} \cdot \int_{\bm{p} \in \mathbb{M^2}} \bm{m}(\bm{p}) V(\bm{v}, \bm{p}) dA(\bm{p})}, $$
where
$$ \tag{13a} \bm{r} = \bm{R}(\bm{v}, \bm{m}) = -\bm{v} + 2 (\bm{v} \cdot \bm{m}) \bm{m} $$
is the reflected view vector that points along the direction of incidence (see Fig. 7 below).
According to the law of reflection, $\small \vert \bm{r} \cdot \bm{m} \vert = \vert \bm{v} \cdot \bm{m} \vert = \Vert \bm{r} + \bm{v} \Vert / 2$. If we assume that the microfacet is front-facing (and, thus, not self-occluding), then $\small (\bm{v} \cdot \bm{m}) > 0$, which allows us to solve Eqn. 13a for $\small \bm{m}$:
$$ \tag{13b} \bm{m} = \frac{\bm{r} + \bm{v}}{\Vert \bm{r} + \bm{v} \Vert}. $$
In the absence of multiple scattering, we only consider distant sources of illumination; those located in the lower hemisphere are guaranteed to be occluded by the microsurface12, which reduces the set of potentially useful directions of incidence to those with $\small (\bm{r} \cdot \bm{n}) > 0$.
The transmission component of a microfacet BSDF can be defined analogously:
$$ \tag{14} L_t(\bm{v}) = \frac{\eta_t^2}{\eta_v^2} \frac{\bm{v} \cdot \int_{\bm{p} \in \mathbb{M^2}} \bm{m}(\bm{p}) \big( 1 - F(\theta_v, \eta_v/\eta_t) \big) L(\bm{t}) V(\bm{t}, \bm{p}) V(\bm{v}, \bm{p}) dA(\bm{p})} {\bm{v} \cdot \int_{\bm{p} \in \mathbb{M^2}} \bm{m}(\bm{p}) V(\bm{v}, \bm{p}) dA(\bm{p})}, $$
where
$$ \tag{15a} \bm{t} = \bm{T}(\bm{v}, \bm{m}, \eta_v/\eta_t) = -\frac{\eta_v}{\eta_t}\bm{v} + \left( \frac{\eta_v}{\eta_t}(\bm{v} \cdot \bm{m}) - \mathrm{sgn}(\bm{v} \cdot \bm{m}) \sqrt{1 - \frac{\eta_v^2}{\eta_t^2} \Vert \bm{v} \times \bm{m} \Vert^2 } \right) \bm{m} $$
is the refracted (or transmitted) view vector that points along the direction of incidence (see Fig. 7 above). The derivation of Eqn. 15a is provided in Appendix A. We must caution that, in certain cases, the value of the expression inside the square root is a negative number. This invalidates the refracted direction and implies that the light has been totally internally reflected by the surface.
Eqn. 15a can be solved for $\small \bm{m}$ if we assume that $\small (\bm{v} \cdot \bm{m}) > 0$. The derivation is provided in Appendix A; here, we simply quote the result:
$$ \tag{15b} \bm{m} = \mathrm{sgn}\negmedspace\left(\frac{\eta_v}{\eta_t} + \bm{v} \cdot \bm{t}\right) \frac{(\eta_v / \eta_t) \bm{v} + \bm{t}}{\Vert (\eta_v / \eta_t) \bm{v} + \bm{t} \Vert}. $$
The sign function ensures that $\small (\bm{v} \cdot \bm{m}) > 0$. As for the set of potentially useful directions of incidence, distant light sources are always occluded by the microsurface unless $\small (\bm{t} \cdot \bm{n}) < 0$.
Shadowing-Masking Function
Upon close examination of Eqn. 12 and 14, the visibility terms are the only ones that explicitly depend on the position $\small \bm{p}$. Intuitively, that is because visibility is a non-local property – it connects a point to the entire surface.
We can evaluate both of these integrals numerically by breaking the microsurface down into individual microfacets and sorting them by their orientation. For a fixed view vector $\small \bm{v}$, a group of microfacets with the same normal $\bm{m}$ will also share the values of $\small F$ and $\small L$. The remaining factor is a surface integral that represents the total visible area of the microfacets within each group.
The statistical method of evaluation of Eqn. 12 and 14 mirrors Eqn. 2a. We must introduce the dimensionless shadowing-masking function $\small G_2(\bm{v}, \bm{m}, \bm{l})$ that gives the fraction of the differential area $\small dA(\bm{m})$ of the portion of the microsurface perpendicular to $\small \bm{m}$ that happens to be visible along both $\small \bm{v} \text{ and } \bm{l}$. In other words, it is the average visibility (along both $\small \bm{l} \text{ and } \bm{v}$) of the microfacets with the normal $\small \bm{m}$:
$$ \tag{16a} \begin{aligned} &\int_{\bm{p} \in \mathbb{M^2}} \bm{m}(\bm{p}) V(\bm{l}, \bm{p}) V(\bm{v}, \bm{p}) dA(\bm{p}) \cr = &\int_{\bm{m} \in \mathbb{S^2}} \bm{m} G_2(\bm{v}, \bm{m}, \bm{l}) dA(\bm{m}) \cr = &\int_{\bm{m} \in \mathbb{S^2}} \bm{m} G_2(\bm{v}, \bm{m}, \bm{l}) D(\bm{m}) A d\Omega(\bm{m}). \end{aligned} $$
Once projected onto $\small \bm{v} \text{ or } \bm{l}$, Eqn. 16a also represents the visible projected area, except that now the visibility is bidirectional. Similarly, it takes self-occlusion into account: $\small V = G_2 = 0$ if $\small (\bm{v} \cdot \bm{m}) \le 0 \text{ or } \small (\bm{l} \cdot \bm{m}) \le 0$.
A valid shadowing-masking function also has the following properties:
$$ \tag{16b} \begin{aligned} \textit{directional symmetry: } &G_2(\bm{v}, \bm{m}, \bm{l}) = G_2(\bm{l}, \bm{m}, \bm{v}); \cr \textit{directional correlation: } &G_2(\bm{v}, \bm{m}, \bm{v}) = G_1(\bm{v}, \bm{m}); \cr \textit{height correlation: } &G_2(\bm{v}, \bm{m}, \bm{n}) = G_1(\bm{v}, \bm{m}). \end{aligned} $$
The last expression is only applicable to height fields13. Essentially, it says that the visibility of a microfacet depends on its altitude. The second property is more general; it implies that a bidirectional average cannot be factored into the product of unidirectional averages:
$$ \tag{16c} G_2(\bm{v}, \bm{m}, \bm{l}) \ne G_1(\bm{v}, \bm{m}) G_1(\bm{l}, \bm{m}). $$
As an example, consider a number of evenly spaced boxes – a square wave. For $\small \bm{m} = \bm{n}$, the unidirectional visibility will approach 50% as the view angle increases. Thus, for shallow angles, $\small G_1(\bm{v}, \bm{n}) G_1(\bm{l}, \bm{n}) \approx 1/4$, while the correct value of $\small G_2(\bm{v}, \bm{n}, \bm{l}) \approx 1/2$.
Microfacet Specular BSDF
In order to arrive at the expression of a microfacet BSDF, we must convert Eqn. 12 and 14 into a statistical form. The numerator can be rewritten in terms of the shadowing-masking function defined by Eqn. 16a. As for the denominator, according to the microfacet theory, the visible projected area of the microsurface is identical to the signed projected area of the macrosurface (see Eqn. 1-3). Therefore,
$$ \tag{18a} \begin{aligned} L_r(\bm{v}) &= \frac{\bm{v} \cdot \int_{\bm{m} \in \mathbb{S^2}} \bm{m} F(\theta_v, \eta_v/\eta_t) L(\bm{r}) G_2(\bm{v}, \bm{m}, \bm{r}) D(\bm{m}) d\Omega(\bm{m})} {\bm{v} \cdot \bm{n}}, \cr L_t(\bm{v}) &= \frac{\eta_t^2}{\eta_v^2} \frac{\bm{v} \cdot \int_{\bm{m} \in \mathbb{S^2}} \bm{m} \big( 1 - F(\theta_v, \eta_v/\eta_t) \big) L(\bm{t}) G_2(\bm{v}, \bm{m}, -\bm{t}) D(\bm{m}) d\Omega(\bm{m})} {\bm{v} \cdot \bm{n}}. \end{aligned} $$
The second equation required a minor modification: we had to reverse the refracted view vector $\small \bm{t}$ because the microsurface is one-sided; refer to the section about Inherent Assumptions for details. This formulation is non-reciprocal: an equivalent (proper but verbose) expression is $\small G_2(\mathrm{sgn}(\bm{n} \cdot \bm{v}) \bm{v}, \bm{m}, \mathrm{sgn}(\bm{n} \cdot \bm{t}) \bm{t})$.
Comparison of Eqn. 18a with 4b, which serves as a definition of a BSDF, reveals that the domain of integration is not the same. Therefore, we must perform a change of variables from the microsurface normal to the direction of incidence. This can be accomplished using the Jacobian determinant of the transformation:
$$ \tag{18b} J(\bm{m}, \bm{r}) = \frac{\partial \Omega(\bm{m})}{\partial \Omega(\bm{r})}, \quad J(\bm{m}, \bm{t}) = \frac{\partial \Omega(\bm{m})}{\partial \Omega(\bm{t})}. $$
Since a determinant can be negative, we shall utilize its absolute value:
$$ \tag{18c} \begin{aligned} L_r(\bm{v}) &= \frac{\bm{v} \cdot \int_{\bm{r} \in \mathbb{S^2}} \bm{m} F(\theta_v, \eta_v/\eta_t) L(\bm{r}) G_2(\bm{v}, \bm{m}, \bm{r}) D(\bm{m}) \big\vert J(\bm{m}, \bm{r}) \big\vert d\Omega(\bm{r})} {\bm{v} \cdot \bm{n}}, \cr L_t(\bm{v}) &= \frac{\eta_t^2}{\eta_v^2} \frac{\bm{v} \cdot \int_{\bm{t} \in \mathbb{S^2}} \bm{m} \big( 1 - F(\theta_v, \eta_v/\eta_t) \big) L(\bm{t}) G_2(\bm{v}, \bm{m}, -\bm{t}) D(\bm{m}) \big\vert J(\bm{m}, \bm{t}) \big\vert d\Omega(\bm{t})} {\bm{v} \cdot \bm{n}}. \end{aligned} $$
Technically, the Jacobian of Eqn. 18b describes the rate at which one solid angle changes relative to the other. The notation has been purposefully chosen to mimic the one-dimensional partial14 derivative $\small \partial x / \partial t$. Similarly,
$$ \tag{18d} d\Omega(\bm{m}) = \left\vert \frac{\partial \Omega(\bm{m})}{\partial \Omega(\bm{r})} \right\vert d\Omega(\bm{r}) = \big\vert J(\bm{m}, \bm{r}) \big\vert d\Omega(\bm{r}) $$
is completely analogous to the one-dimensional change of variables $\small dx = (\partial x / \partial t) dt$.
Intuitively, the absolute value of the Jacobian is just a ratio of two differential solid angles, each corresponding to a projected differential area. Unfortunately, making sense of this statement (and making it more precise) requires a background in differential geometry. For those readers who are interested in the details, we provide a brief introduction to Jacobians and differential solid angles in Appendix B.
We can determine the expression of the Jacobian in Eqn. 18d using the geometric properties of differential solid angles. Taking $\small \bm{v}$ as a constant, substitution of Eqn. 13b and utilization of Eqn. B18-B21 yields
$$ \tag{19a} d\Omega(\bm{m}) = d\Omega(\bm{r} + \bm{v}) = \frac{\big\vert \bm{m} \cdot d\bm{A}(\bm{r}) \big\vert}{\Vert \bm{r} + \bm{v} \Vert^2} = \frac{\vert \bm{m} \cdot \bm{r} \vert}{\Vert \bm{r} + \bm{v} \Vert^2} d\Omega(\bm{r}). $$
Thus, according to the definitions of the absolute value of the Jacobian and the law of reflection (Eqn. 7a),
$$ \tag{19b} \big\vert J(\bm{m}, \bm{r}) \big\vert = \frac{\vert \bm{r} \cdot \bm{m} \vert}{\Vert \bm{r} + \bm{v} \Vert^2} = \frac{1}{4 |\bm{r} \cdot \bm{m}|}. $$
Similarly, we can use Eqn. 15b to show that
$$ \tag{20a} d\Omega(\bm{m}) = d\Omega\big( (\eta_v / \eta_t) \bm{v} + \bm{t} \big) = \frac{\big\vert \bm{m} \cdot d\bm{A}(\bm{t}) \big\vert}{\Vert (\eta_v / \eta_t) \bm{v} + \bm{t} \Vert^2} = \frac{\big\vert \bm{m} \cdot \bm{t} \big\vert}{\Vert (\eta_v / \eta_t) \bm{v} + \bm{t} \Vert^2} d\Omega(\bm{t}). $$
Therefore, according to the definitions of the absolute value of the Jacobian and the law of refraction (Eqn. 5a and A1-A4),
$$ \tag{20b} \begin{aligned} \big\vert J(\bm{m}, \bm{t}) \big\vert &= \frac{\big\vert \bm{t} \cdot \bm{m} \big\vert}{\Vert (\eta_v / \eta_t) \bm{v} + \bm{t} \Vert^2} \cr &= \frac{\big\vert \bm{t} \cdot \bm{m} \big\vert}{\big( (\eta_v / \eta_t) \cos{\theta_v} - \cos{\theta_t} \big)^2} \cr &= \frac{\big\vert \bm{t} \cdot \bm{m} \big\vert}{\big( \sqrt{(\eta_v / \eta_t)^2 - \Vert \bm{t} \times \bm{m} \Vert^2} + (\bm{t} \cdot \bm{m}) \big)^2}. \end{aligned} $$
Having finally obtained the last piece of the puzzle, we can proceed with the assembly of a microfacet BSDF. Begin by substituting Eqn. 19b and 20b into 18c:
$$ \tag{21a} \begin{aligned} L_r(\bm{v}) &= \frac{\int_{\bm{r} \in \mathbb{S^2}} (\bm{v} \cdot \bm{m}) F(\theta_v, \eta_v/\eta_t) L(\bm{r}) G_2(\bm{v}, \bm{m}, \bm{r}) D(\bm{m}) d\Omega(\bm{r})} {4 |\bm{r} \cdot \bm{m}| (\bm{v} \cdot \bm{n})}, \cr L_t(\bm{v}) &= \frac{\eta_t^2}{\eta_v^2} \frac{\int_{\bm{t} \in \mathbb{S^2}} (\bm{v} \cdot \bm{m}) \big( 1 - F(\theta_v, \eta_v/\eta_t) \big) L(\bm{t}) G_2(\bm{v}, \bm{m}, -\bm{t}) D(\bm{m}) |\bm{t} \cdot \bm{m}| d\Omega(\bm{t})} {\big( (\eta_v / \eta_t) \bm{v} + \bm{t} \big)^2 (\bm{v} \cdot \bm{n})}. \end{aligned} $$
Recall that the microfacet theory requires $\small (\bm{v} \cdot \bm{n}) > 0$. In addition, the single scattering approximation permits us to neglect the directions of incidence that fail to satisfy either $\small (\bm{r} \cdot \bm{n}) > 0 \text{ or } (\bm{t} \cdot \bm{n}) < 0$. Finally, due to the law of reflection and self-occlusion, $\small (\bm{v} \cdot \bm{m}) = (\bm{r} \cdot \bm{m}) > 0 \text{ and } (\bm{t} \cdot \bm{m}) < 0$. We may use these properties to simplify and rearrange Eqn. 21a:
$$ \tag{21b} \begin{aligned} L_r(\bm{v}) &= \frac{\int_{\bm{r} \in \mathbb{H_{+}^2}} F(\theta_v, \eta_v/\eta_t) L(\bm{r}) G_2(\bm{v}, \bm{m}, \bm{r}) D(\bm{m}) |\bm{r} \cdot \bm{n}| d\Omega(\bm{r})} {4 (\bm{v} \cdot \bm{n}) (\bm{r} \cdot \bm{n})}, \cr L_t(\bm{v}) &= \frac{\eta_t^2}{\eta_v^2} \frac{\int_{\bm{t} \in \mathbb{H_{-}^2}} (\bm{v} \cdot \bm{m}) (\bm{t} \cdot \bm{m}) \big( 1 - F(\theta_v, \eta_v/\eta_t) \big) L(\bm{t}) G_2(\bm{v}, \bm{m}, -\bm{t}) D(\bm{m}) |\bm{t} \cdot \bm{n}| d\Omega(\bm{t})} {\big( (\eta_v / \eta_t) \bm{v} + \bm{t} \big)^2 (\bm{v} \cdot \bm{n}) (\bm{t} \cdot \bm{n})}. \end{aligned} $$
To facilitate comparison with Eqn. 4b, we shall utilize the projected solid angle notation:
$$ \tag{21c} \begin{aligned} L_r(\bm{v}) &= \frac{\int_{\bm{r} \in \mathbb{H_{+}^2}} F(\theta_v, \eta_v/\eta_t) L(\bm{r}) G_2(\bm{v}, \bm{m}, \bm{r}) D(\bm{m}) d\Omega_n(\bm{r})} {4 (\bm{v} \cdot \bm{n}) (\bm{r} \cdot \bm{n})}, \cr L_t(\bm{v}) &= \frac{\eta_t^2}{\eta_v^2} \frac{\int_{\bm{t} \in \mathbb{H_{-}^2}} (\bm{v} \cdot \bm{m}) (\bm{t} \cdot \bm{m}) \big( 1 - F(\theta_v, \eta_v/\eta_t) \big) L(\bm{t}) G_2(\bm{v}, \bm{m}, -\bm{t}) D(\bm{m}) d\Omega_n(\bm{t})} {\big( (\eta_v / \eta_t) \bm{v} + \bm{t} \big)^2 (\bm{v} \cdot \bm{n}) (\bm{t} \cdot \bm{n})}, \end{aligned} $$
which makes extracting the expressions of the BRDF and the BTDF particularly easy:
$$ \tag{21d} \begin{aligned} f_r(\bm{v}, \bm{n}, \bm{l}) &= \frac{ F(\theta_v, \eta_v/\eta_t)) G_2(\bm{v}, \bm{m}, \bm{l}) D(\bm{m})} {4 (\bm{v} \cdot \bm{n}) (\bm{l} \cdot \bm{n})}, \cr f_t(\bm{v}, \bm{n}, \bm{l}) &= \frac{\eta_l^2}{\eta_v^2} \frac{(\bm{v} \cdot \bm{m}) (\bm{l} \cdot \bm{m})} {\big( (\eta_v / \eta_l) \bm{v} + \bm{l} \big)^2} \frac{ \big( 1 - F(\theta_v, \eta_v/\eta_t) \big) G_2(\bm{v}, \bm{m}, -\bm{l}) D(\bm{m})} { (\bm{v} \cdot \bm{n}) (\bm{l} \cdot \bm{n})}. \end{aligned} $$
This concludes our derivation of a microfacet specular BSDF.
Appendix A: Derivation of Refracted View Vector
Eqn. 15a can be derived as follows. Let us define two unit vectors
$$ \tag{A1} \begin{aligned} \bm{x} &= \frac{\bm{m} \times (\bm{v} \times \bm{m})}{\Vert \bm{m} \times (\bm{v} \times \bm{m}) \Vert} = \frac{\bm{v} - (\bm{v} \cdot \bm{m}) \bm{m}}{\Vert \bm{v} \times \bm{m} \Vert}, \cr \bm{y} &= \mathrm{sgn}(\bm{v} \cdot \bm{m}) \bm{m}, \end{aligned} $$
such that
$$ \tag{A2} \begin{aligned} \sin{\theta_v} &= \bm{v} \cdot \bm{x} = \frac{1 - (\bm{v} \cdot \bm{m})^2}{\Vert \bm{v} \times \bm{m} \Vert} = \Vert \bm{v} \times \bm{m} \Vert \ge 0, \cr \cos{\theta_v} &= \bm{v} \cdot \bm{y} = \vert \bm{v} \cdot \bm{m} \vert \ge 0. \end{aligned} $$
In this coordinate system (see Fig. 7 above),
$$ \tag{A3} \begin{aligned} \bm{v} &= \sin{\theta_v} \bm{x} + \cos{\theta_v} \bm{y}, \cr \bm{t} &= -(\sin{\theta_t} \bm{x} + \cos{\theta_t} \bm{y}), \end{aligned} $$
where
$$ \tag{A4} \begin{aligned} \sin{\theta_t} &= \Vert \bm{t} \times \bm{m} \Vert \ge 0, \cr \cos{\theta_t} &= \vert \bm{t} \cdot \bm{m} \vert \ge 0, \end{aligned} $$
by convention.
Expansion of Eqn. A3 via substitution and simplification using the law of refraction yields Eqn. 15a:
$$ \tag{A5} \bm{t} = -\frac{\eta_v}{\eta_t}\bm{v} + \left( \frac{\eta_v}{\eta_t}(\bm{v} \cdot \bm{m}) - \mathrm{sgn}(\bm{v} \cdot \bm{m}) \sqrt{1 - \frac{\eta_v^2}{\eta_t^2} \Vert \bm{v} \times \bm{m} \Vert^2 } \right) \bm{m}. $$
This expression has the following geometric interpretation: we reverse the direction of $\small \bm{v}$, shorten it by a factor of $\small \eta_v / \eta_t$, and stretch it along $\small \bm{m}$ until the resulting vector is of length 1.
Eqn. A3 can be solved for $\small \bm{m}$ if we assume that $\small (\bm{v} \cdot \bm{m}) > 0$, so that $\small \bm{y} = \bm{m}$. We can subsequently eliminate $\small \bm{x}$ by rescaling $\small \bm{t}$ by a factor of $\small \sin{\theta_v} / \sin{\theta_t}$:
$$ \tag{A6} \bm{v} + \frac{\sin{\theta_v}}{\sin{\theta_t}} \bm{t} = \left( \cos{\theta_v} - \frac{\sin{\theta_v}}{\sin{\theta_t}} \cos{\theta_t} \right) \bm{m}. $$
After applying the law of refraction once more and performing a series of trivial algebraic manipulations, we readily obtain Eqn. 15b:
$$ \tag{A7} \bm{m} = \frac{\eta_v \bm{v} + \eta_t \bm{t}}{\eta_v \cos{\theta_v} - \eta_t \cos{\theta_t}} = \mathrm{sgn}\negmedspace\left(\frac{\eta_v}{\eta_t} + \bm{v} \cdot \bm{t}\right) \frac{(\eta_v / \eta_t) \bm{v} + \bm{t}}{\Vert (\eta_v / \eta_t) \bm{v} + \bm{t} \Vert}. $$
Appendix B: Jacobians and Differential Solid Angles
Consider a surface parameterized by a pair of coordinates (such as the Cartesian or the spherical coordinates):
$$ \tag{B1} \bm{p} = \bm{p}(x,y). $$
The change in the surface position is given by the chain rule:
$$ \tag{B2} d\bm{p} = \frac{\partial \bm{p}}{\partial x} dx + \frac{\partial \bm{p}}{\partial y} dy. $$
Let us define a local coordinate frame as follows:
$$ \tag{B3} \bm{i} = \frac{\partial \bm{p}}{\partial x}, \quad \bm{j} = \frac{\partial \bm{p}}{\partial y}, \quad \bm{k} = \bm{i} \times \bm{j}. $$
Clearly, Eqn. B2 is a sum of two linearly independent vectors:
$$ \tag{B4} d\bm{p} = \bm{i} dx + \bm{j} dy. $$
They form a tiny parallelogram that can be represented by the vector differential area
$$ \tag{B5} d\bm{A} = (\bm{i} dx) \times (\bm{j} dy) = (\bm{i} \times \bm{j}) dx dy = \bm{k} dx dy. $$
Its magnitude is the scalar differential area
$$ \tag{B6} dA = \Vert d\bm{A} \Vert = dx dy, $$
where we have made a simplifying assumption that the basis vectors are mutually orthogonal unit vectors.
Now, let us perform a change of variables
$$ \tag{B7} x = x(s,t), \quad y = y(s,t). $$
According to the chain rule,
$$ \tag{B8} d\bm{p} = \frac{\partial \bm{p}}{\partial s} ds + \frac{\partial \bm{p}}{\partial t} dt. $$
The transformed basis vectors are
$$ \tag{B9} \bm{i’} = \frac{\partial \bm{p}}{\partial s}, \quad \bm{j’} = \frac{\partial \bm{p}}{\partial t}, \quad \bm{k’} = \bm{i’} \times \bm{j’}. $$
Eqn. B8 can thus be written as
$$ \tag{B10} d\bm{p} = \bm{i’} ds + \bm{j’} dt. $$
The expression of the vector differential area mirrors Eqn. B5:
$$ \tag{B11} d\bm{A} = (\bm{i’} ds) \times (\bm{j’} dt) = (\bm{i’} \times \bm{j’}) ds dt = \bm{k’} ds dt. $$
If the transformed basis is not orthonormal, the expression of the scalar differential area must account for the length of the vectors and the angle between them:
$$ \tag{B12} dA = \Vert d\bm{A} \Vert = \Vert \bm{k’} \Vert ds dt = \Vert \bm{i’} \times \bm{j’} \Vert ds dt. $$
Now, if we want to replace Eqn. B6 with B12 in an area integral, they must be the same15. The ratio of the areas of the parallelograms formed by the basis vectors (see Fig. 8 above) is given by the absolute value of the Jacobian determinant
$$ \tag{B13} \vert J \vert = \frac{dx dy}{ds dt} = \frac{\Vert \bm{k’} \Vert}{\Vert \bm{k} \Vert} = \Vert \bm{i’} \times \bm{j’} \Vert. $$
The Jacobian is also frequently used in triple integrals, where its absolute value corresponds to the ratio of the volumes of two parallelepipeds.
Once Eqn. B7 that relates the sets of coordinates has been specified, the expression of the Jacobian can be determined algebraically, by calculating the determinant of the matrix of partial derivatives. However, in our case, a geometric (coordinate-independent, basis-independent) approach is both simpler and more insightful.
First, we must point out two obvious properties of the vector differential area. It remains unchanged if the surface is translated along a constant vector $\small \bm{q}$:
$$ \tag{B14} d\bm{A}(\bm{p} + \bm{q}) = d\bm{A}(\bm{p}). $$
Furthermore, the area scales quadratically with the dimensions of the surface. For a constant $\small \lambda$,
$$ \tag{B15} d\bm{A}(\lambda \bm{p}) = \lambda^2 d\bm{A}(\bm{p}). $$
For a unit vector $\small \bm{m}$ (which is effectively a point on the surface of a unit sphere) and a variable $\small \lambda(\bm{m})$, the total differential in the spherical coordinates is
$$ \tag{B16} d(\lambda \bm{m}) = \left( \frac{\partial \lambda}{\partial \theta} \bm{m} + \lambda \frac{\partial \bm{m}}{\partial \theta} \right) d\theta + \left( \frac{\partial \lambda}{\partial \phi} \bm{m} + \lambda \frac{\partial \bm{m}}{\partial \phi} \right) d\phi. $$
$\small \bm{m}, \partial \bm{m} / \partial \theta, \partial \bm{m} / \partial \phi$ (often called $\small \bm{r}, \bm{\theta}, \bm{\phi}$) form an orthonormal basis on the surface of a unit sphere. Thus,
$$ \tag{B17} \begin{aligned} \bm{m} \cdot d\bm{A}(\lambda \bm{m}) &= \bm{m} \cdot \left( \frac{\partial \lambda}{\partial \theta} \bm{m} + \lambda \frac{\partial \bm{m}}{\partial \theta} \right) \times \left( \frac{\partial \lambda}{\partial \phi} \bm{m} + \lambda \frac{\partial \bm{m}}{\partial \phi} \right) d\theta d\phi \cr &= \bm{m} \cdot \lambda^2 \left( \frac{\partial \bm{m}}{\partial \theta} \times \frac{\partial \bm{m}}{\partial \phi} \right) d\theta d\phi = \bm{m} \cdot \lambda^2 d\bm{A}(\bm{m}). \end{aligned} $$
Now, recall that, by definition, the solid angle subtended by an object is the surface area of its projection onto the unit sphere. If we consider an infinitesimal surface element and its vector area $\small d\bm{A}$ located at the point $\small \bm{p}$, the associated differential solid angle is
$$ \tag{B18} d\Omega(\bm{p}) = \left\vert \frac{\bm{p}}{\Vert \bm{p} \Vert} \cdot \frac{d\bm{A}(\bm{p})}{\Vert \bm{p} \Vert^2} \right \vert. $$
In particular, for an area element located on the surface of a unit sphere,
$$ \tag{B19} d\bm{A}(\bm{m}) = \bm{m} d\Omega(\bm{m}). $$
As an angular quantity, the solid angle is scale-invariant:
$$ \tag{B20} d\Omega(\lambda \bm{m}) = \left\vert \frac{\lambda \bm{m}}{\Vert \lambda \bm{m} \Vert} \cdot \frac{d\bm{A}(\lambda \bm{m})}{\Vert \lambda \bm{m} \Vert^2} \right \vert = d\Omega(\bm{m}). $$
However, it is not translation-invariant due to the projection onto the unit sphere:
$$ \tag{B21} \begin{aligned} d\Omega(\bm{p} + \bm{q}) &= \left\vert \frac{\bm{p} + \bm{q}}{\Vert \bm{p} + \bm{q} \Vert} \cdot \frac{d\bm{A}(\bm{p} + \bm{q})}{\Vert \bm{p} + \bm{q} \Vert^2} \right \vert \cr &= \left\vert \frac{\bm{p} + \bm{q}}{\Vert \bm{p} + \bm{q} \Vert} \cdot \frac{d\bm{A}(\bm{p})}{\Vert \bm{p} + \bm{q} \Vert^2} \right \vert. \end{aligned} $$
Acknowledgements
Writing this article would be difficult, if not impossible, without the hard work of the authors of many excellent papers about the microfacet theory and its applications, a few of which are listed below. They contain practical details, helpful illustrations, and different approaches and perspectives, which makes the perusal of these papers a great way to expand your tool set and build a more complete mental model.
References and Suggestions for Further Reading
- Veach, E. Robust Monte Carlo Methods for Light Transport Simulation, Chapter 5 (1998).
- Walter, B., Marschner, S. R., Li, H., & Torrance, K. E. Microfacet Models for Refraction Through Rough Surfaces (2007).
- Heitz, E. Understanding the Masking-Shadowing Function in Microfacet-Based BRDFs (2014).
-
We shall treat all surfaces as one-sided unless stated otherwise. ↩︎
-
We use the superscript 2 to indicate that a surface is a two-dimensional manifold. ↩︎
-
Not to be confused with the normal distribution (a.k.a. the Gaussian distribution) in probability theory. ↩︎
-
If the microsurface is convex, we can interpret $\small DA$ as a Jacobian of the transformation from the surface to the unit hemisphere. ↩︎
-
Using the tools of calculus, we can decompose an arbitrary surface into a (possibly infinite) number of sufficiently small boxes. ↩︎
-
This requires the transformation to be invertible. If the transformation was nonlinear, the transformed axes would vary from point to point, forming vector fields. ↩︎
-
In technical terms, such a surface has an extremely short auto-correlation distance. ↩︎
-
After comparing Eqn. 1c with 3a and taking the definitions into account, it seems more natural to define $\small D_{vis} = G_1 D$. Nevertheless, we stick with Eqn. 3b to conform to the existing body of literature. ↩︎
-
“Energy conservation” refers to the total energy of the system. The kinetic energy of a photon is only conserved if the integral is equal to 1. Otherwise, by definition, a collision is called inelastic. ↩︎
-
We will not consider absorptive or magnetic media in this article. ↩︎
-
We do not care about the literal edge cases, since a microsurface is assumed to be a fragment of a larger surface. ↩︎
-
Height correlation is not limited to height fields. For arbitrary surfaces, the correlation is weaker, and it is not clear whether it is possible to express this property in a compact form. ↩︎
-
The derivative is partial because the microsurface normal implicitly depends on the view vector. ↩︎
-
The same is also true for Eqn. B5 and B11. Intuitively, both the orientation and the size (but not the shape) of the surface elements must be independent of the choice of coordinates (the tessellation scheme). ↩︎