Learning a model of facial shape and expression from 4D scans
1/16/23About 5 min
3. Model formulation
FLAME is described by a function:
— coefficients describing shape — coefficients describing pose — coefficients describing expression — template mesh in the “zero pose” — “zero pose” — shape blendshape function to account for identity related shape variation — corrective pose blendshapes to correct pose deformations that cannot be explained solely by LBS — expression blendshapes that capture facial expressions — A standard skinning function is applied to rotate the vertices of around joints , linearly smoothed by blendweights
— a sparse matrix defining how to compute joint locations from mesh vertices
Shape blendshapes
— shape coefficients — orthonormal shape basis, which will be learned below with PCA
Pose blendshapes
— a function from a face / head / eye pose vector to a vector containing the concatenated elements of all the corresponding rotation matrices — -th element of and- vector
— vertex offsets from the rest pose activated by — pose space, a matrix containing all pose blendshapes
Expression blendshapes
— expression coefficients — orthonormal expression basis
Template shape
4. Temporal registration
4.1. Initial model
Shape
Pose
Expression
4.2. Single-frame registration
Model-only
estimate the model coefficients
— measures the scan-to-mesh distance of the scan vertices and the closest point in the surface of the model — scan vertices — weight controls the influence of the data term — a Geman-McClure robust penalty function — a landmark term, measuring the L2-norm distance between image landmarks and corresponding vertices on the model template, projected into the image using the known camera calibration
— regularizes the pose coefficients , shape coefficients , and expression coefficients to be close to zero by penalizing their squared values
Coupled
allow the optimization to leave the model space by optimizing
— template mesh — measures the scan-to-mesh distance from the scan to the aligned mesh — constrains to be close to the current statistical model by penalizing edge differences between and the model as
E_R = \frac{1}{N} \sum_{k = 1}^N \lambda_k \norm{U\pqty{\vb{v}_k}}^2
E\pqty{\vb{T}, \vec{\beta}, \vec{\theta}, \vec{\psi}} = E_D + E_C + \lambda_T E_T + E_R + E_P
E_T = \sum_{l = 0}^3 \sum_{v = 1}^V \norm{\Gamma\pqty{I_l^{\pqty{v}}} - \Gamma\pqty{\hat{I}_l{\pqty{v}}}}_F2