As a part of DEMM, the Probabilistic Collocation Method (PCM) [199,159], is used for applying DEMM in the case of black-box models. The coefficients of the polynomial chaos expansion are obtained using the model outputs at selected collocation points. Then, the next order polynomial chaos expansion is employed, and the solution process is repeated. If the estimates of the pdfs of the output metrics of concern, are approximately equal, the expansion is assumed to have converged; the higher order approximation is used to estimate the pdfs of output metrics.
In DEMM/PCM the collocation points are selected following the orthogonal
collocation method suggested by Villadsen and Michelsen [208].
The collocation points correspond to the roots of the polynomial of one
degree higher than the order of the polynomial chaos expansion. For
one-dimensional problems, this method gives the same results as Galerkin's
method [208], and hence is regarded as an ``optimal method''.
This approach is adapted for multi-dimensional cases in
DEMM [198]. For example, in order to solve for a two dimensional
second order polynomial chaos expansion, the roots of the third order
Hermite polynomial,
,
and 0 are used, hence the
possible collocation points are (0,0), (
,0), (0,
),
(0,
), (
,
), (
,
),
(
,0), (
,
)
and (
,
). There
are nine possible collocation points, but from
Equations 3.10 and 3.10 with
=2 and
taking terms only up to
,
there are only six unknowns.
Similarly, for higher dimension systems and higher order approximations, the
number of available collocation points is always greater than the number of
collocation points needed, which introduces a problem of selecting the
appropriate collocation points. In the absence of selection criteria at
present, the collocation points are typically selected at random from the
set of available points.
In principle, any choice of collocation points from the available ones, should give adequate estimates of the polynomial chaos expansion coefficients. However, different combinations of collocation points may result in substantially different estimates of the pdfs of output metrics; this poses the problem of the optimal selection of collocation points. Furthermore, in a computational setting, some of the collocation points could be outside the range of the algorithmic or numerical applicability of the model, and model results at these points cannot be obtained. These issues are addressed in the following section, where an algorithm for collocation point selection method is described.
Another shortcoming of the standard collocation technique suggested in DEMM
is that sometimes the roots of the Hermite polynomials do not correspond to
high probability regions and these regions are therefore not adequately
represented in the polynomial chaos expansion.
For example, for a third order
approximation, the roots of the fourth order Hermite polynomial, namely
are used as the
collocation points. However, the origin corresponds to the region of
highest probability for a normal random variable. Hence, the exclusion
of the origin as a collocation point could potentially lead to a poor
approximation.
The limitations of DEMM/PCM are shown in the context of a case study involving a human Physiologically Based Pharmacokinetic (PBPK) Model, and the results are presented in Section 5.1.4.
A simple heuristic technique is used to select the required number of points
from the large number of potential candidates: for each term of the series
expansion, a ``corresponding'' collocation point is selected. For example,
the collocation point corresponding to the constant is the origin; i.e., all
the standard normal variables (
's) are set to value zero. For terms
involving only one variable, the collocation points are selected by setting
all other
's to zero value, and by letting the corresponding variable
take values as the roots of the higher order Hermite polynomial. For terms
involving two or more random variables, the values of the corresponding
variables are set to the values of the roots of the higher order polynomial,
and so on. If more points ``corresponding'' to a set of terms are available
than needed, the points which are closer to the origin are preferred, as
they fall in regions of higher probability. Further, when there is
still an unresolved choice, the collocation points are selected such that
the overall distribution of the collocation points is more symmetric with
respect to the origin. If still, more points are available, the collocation
point is selected randomly.
The advantage of this method is that the behavior of the model is captured reasonably well at points corresponding to regions of high probability. Furthermore, singularities can be avoided in the resultant linear equations for the unknown coefficients, as the collocation points are selected to correspond to the terms of the polynomial chaos expansion. Thus, the pdfs of the output metrics are likely to be approximated better than by a random choice of collocation points, while following a technique similar to the ``orthogonal collocation'' approach.
The ECM method was applied to a more complex model, the two-dimensional atmospheric photochemical plume model, the Reactive Plume Model, version 4, (RPM-IV), as described in Section 5.2.3. For that case study, the ECM method failed to converge for a third order polynomial chaos approximation. This is attributed to the inherent instability of the collocation based approaches, as described by Atkinson [6].
The regression method has the advantage that it is more robust than the collocation method, but the method requires a higher number of model simulations to obtain estimates of output uncertainty. The advantages of this method are demonstrated by comparing the results of this method with those of ECM for a the case study involving RPM-IV, an atmospheric photochemical plume model, described in Section 5.2.3.
Based on preliminary case studies (presented in Chapter 5), the regression based method is found to be robust and the method resulted in good approximations of model outputs. Hence this method is recommended for use in general. The higher computational burden, compared to the ECM, is offset by the robustness this method provides for many cases. Throughout the rest of this document, the ECM method is denoted as ``ECM'' and the regression based method is denoted simply as ``SRSM''. Unless otherwise specified, the term SRSM in the case studies refers to a regression based SRSM.