Model Diagnostics and Evaluation in Pumas

1 Introduction

The PumasUtilities package provides access to the plotting functions available for Pumas. All of the available plotting functions build upon the AlgebraOfGraphics.jl plotting ecosystem and therefore, are interoperable with any plots created using Makie.jl. You can freely combine individual plots from Pumas-specific functions with those of CairoMakie.jl, or any other plotting functions that build upon it such as AlgebraOfGraphics.jl.

The objective of this topic is to demonstrate how to extract model results and perform graphical assessments of the model using in-built Pumas functions.

While not the focus of this topic, customized diagnostics plots can still be generated using AlgebraOfGraphics.jl and CairoMakie.jl using Pumas model output. How to obtain Pumas model output appropriate for customized plotting is demonstrated as part of this topic.

In this topic, it is already assumed that a Pumas model has been fitted (mod_fit) to the warfarin concentrations over time where the analysis dataset was named, examp_df_pumas, and the dependent variable was named, conc.

Warfarin Population Pharmacokinetic Model

The model is a 1-compartment model with linear elimination and first-order absorption, log-normally distributed inter-individual variability on clearance (CL) and volume of distribution of the central compartment (VC), and a proportional residual error model.

# Read in as a Pumas dataset
examp_df_pumas = read_pumas(
    dataset("pumas/warfarin_pumas");
    observations = [:conc],
    covariates = [:wtbl, :age, :sex],
)

# Define the Pumas model
mod_code = @model begin
    @param begin
        # Definition of fixed effect parameters
        θCL ∈ RealDomain(; lower = 0.0)
        θVC ∈ RealDomain(; lower = 0.0)
        θKA ∈ RealDomain(; lower = 0.0)
        # Random effect parameters
        # Variance-covariance matrix for inter-individual variability
        Ω ∈ PSDDomain(2)
        # Residual unexplained variability
        σpro ∈ RealDomain(; lower = 0.0)
    end
    @random begin
        # Sampling random effect parameters
        η ~ MvNormal(Ω)
    end
    @covariates wtbl age sex
    @pre begin
        # Derived variables
        # Covariates
        # None

        # Individual PK parameters
        CL = θCL * ((wtbl / 70)^0.75) * exp(η[1])
        VC = θVC * ((wtbl / 70)^1.0) * exp(η[2])
        KA = θKA
    end
    @init begin
        # Define initial conditions
        Depot = 0.0
        Central = 0.0
    end
    @vars begin
        # Concentrations in compartments
        centc := Central / VC
    end
    @dynamics begin
        # Differential equations
        Depot' = -KA * Depot
        Central' = KA * Depot - CL * centc
    end
    @derived begin
        # Definition of derived variables
        # Individual-predicted concentration
        ipre := @.(Central / VC)
        # Dependent variable
        """
        Warfarin Concentration (mg/L)
        """
        conc ~ @.Normal(ipre, sqrt((ipre * σpro)^2))
    end
end

# Define the initial estimates
init_params = (θCL = 1, θVC = 10, θKA = 1, Ω = [
    0.09 0.01
    0.01 0.09
], σpro = 0.3)

FittedPumasModel

Dynamical system type:          Matrix exponential

Number of subjects:                             32

Observation records:         Active        Missing
    conc:                       251             47
    Total:                      251             47

Number of parameters:      Constant      Optimized
                                  0              7

Likelihood approximation:                     FOCE
Likelihood optimizer:                         BFGS

Termination Reason:                      NoXChange
Log-likelihood value:                    -458.8034

----------------
       Estimate
----------------
θCL    0.13799
θVC    8.3908
θKA    0.63283
Ω₁,₁   0.058535
Ω₂,₁   0.012552
Ω₂,₂   0.015931
σpro   0.23487
----------------

2 Obtaining Model Predictions

There are several variables that need to be calculated in order to generate standard goodness-of-fit diagnostics. These variables include:

Empirical Bayes estimates (ebes): individual random effect parameters
Population-predictions (pred): model-based predictions corresponding to observation time-points using parameters derived from the fixed effects (population-typical values and covariate effects)
Individual-predictions (ipred): model-based predictions corresponding to observation time-points using parameters derived from the fixed effects (population-typical values and covariate effects) and ebes
Residuals (wres, iwres, npdes, eiwres): population weighted residuals (or conditional weighted residuals), individual weighted residuals, normalized prediction distribution errors, and expected simulated-based individual weighted residuals, respectively.

To obtain these variables, the FittedPumasModel is passed to the inspect function and the corresponding output is shown below.

Tip

nsim is a keyword argument specifying the number of simulations to be performed to obtain simulation-based residual diagnostics (npde and eiwres).

# Return model predictions
# Specifying 100 simulations for demonstrate purposes only
mod_pred = inspect(mod_fit; nsim = 100)

[ Info: Calculating predictions.
[ Info: Calculating weighted residuals.
[ Info: Calculating empirical bayes.
[ Info: Calculating NPDEs and EWRES
[ Info: Evaluating dose control parameters.
[ Info: Evaluating individual parameters.
[ Info: Done.

FittedPumasModelInspection

Likelihood approximation used for weighted residuals: FOCE

The FittedPumasModelInspection object returned by the inspect function will be passed to all subsequent goodness-of-fit diagnostic functions in Pumas.

However, as shown above, we cannot directly view the contents of the object nor use it to generate customized goodness-of-fit diagnostic plots with AlgebraOfGraphics.jl. Such that, the object, mod_pred, can be converted to a DataFrame:

mod_pred_df = DataFrame(mod_pred)

The first 10 rows of the DataFrame are printed below. Of note:

The DataFrame closely resembles the input analysis dataset for the Pumas model
The variables required for goodness-of-fit diagnostics have been added for each dependent variable represented in the model
Individual parameters for CL, VC, and KA have been calculated
The amounts in each of the model (Depot and Central) compartments are returned


id	time	evid	conc	amt	cmt	rate	duration	ss	ii	route	wtbl	age	sex	tad	dosenum	conc_pred	conc_ipred	η₁	η₂	conc_wres	conc_iwres	wres_approx	conc_npde	conc_ewres	CL	VC	KA	Depot	Central

1	0	1	missing	100	1	0	0	0	0	NCA.NullRoute	66.7	50	M	0	1	missing	missing	0.733	0.205	missing	missing	missing	missing	missing	0.277	9.81	0.633	missing	missing
1	0.5	0	0	0	missing	0	0	0	0	missing	66.7	50	M	0.5	1	3.38	2.74	0.733	0.205	-4.05	-4.26	FOCE	-2.58	0.114	0.277	9.81	0.633	72.9	26.9
1	1	0	1.9	0	missing	0	0	0	0	missing	66.7	50	M	1	1	5.81	4.71	0.733	0.205	-2.49	-2.54	FOCE	-1.55	-0.00825	0.277	9.81	0.633	53.1	46.2
1	2	0	3.3	0	missing	0	0	0	0	missing	66.7	50	M	2	1	8.8	7.07	0.733	0.205	-2.38	-2.27	FOCE	-1.41	-0.0495	0.277	9.81	0.633	28.2	69.4
1	3	0	6.6	0	missing	0	0	0	0	missing	66.7	50	M	3	1	10.3	8.2	0.733	0.205	-1.02	-0.832	FOCE	0.0251	-0.045	0.277	9.81	0.633	15	80.5
1	6	0	9.1	0	missing	0	0	0	0	missing	66.7	50	M	6	1	11.3	8.77	0.733	0.205	-0.162	0.162	FOCE	0.496	-0.0189	0.277	9.81	0.633	2.24	86
1	9	0	10.8	0	missing	0	0	0	0	missing	66.7	50	M	9	1	11	8.24	0.733	0.205	0.903	1.32	FOCE	1.04	0.018	0.277	9.81	0.633	0.336	80.8
1	12	0	8.6	0	missing	0	0	0	0	missing	66.7	50	M	12	1	10.5	7.6	0.733	0.205	0.0527	0.563	FOCE	0.44	-0.141	0.277	9.81	0.633	0.0503	74.5
1	24	0	5.6	0	missing	0	0	0	0	missing	66.7	50	M	24	1	8.61	5.42	0.733	0.205	-0.663	0.144	FOCE	-0.44	0.0499	0.277	9.81	0.633	2.53e-5	53.1
1	36	0	4	0	missing	0	0	0	0	missing	66.7	50	M	36	1	7.05	3.86	0.733	0.205	-0.863	0.155	FOCE	-0.772	0.0921	0.277	9.81	0.633	1.28e-8	37.9

2.1 Interpolation and Extrapolation of Predictions

Predictions at time-points other than those in the original analysis dataset can be generated using the predict function and passing the FittedPumasModel, the Pumas Population object (from read_pumas), and a vector of times to generate predictions.

Note: The intent here is to take the existing dosing regimens and individual parameters in the analysis population and interpolate/extrapolate the individual-predictions beyond those that were observed.

Prediction
  Subjects: 32
  Predictions: conc
  Covariates: wtbl, age, sex

The Prediction object returned by predict can be passed to a set of goodness-of-fit diagnostic functions in Pumas.

However, as shown above, we cannot directly view the contents of the object nor use it to generate customized goodness-of-fit diagnostic plots with AlgebraOfGraphics.jl. Such that, the object, mod_extrap, can be converted to a DataFrame:

mod_extrap_df = DataFrame(mod_extrap)

The first 10 rows of the DataFrame are printed below. Of note:

The DataFrame closely resembles the input analysis dataset for the Pumas model
All time-points specified in obstimes in predict have been added to the DataFrame
The observed dependent variable for the model, conc, is set to missing at all records where interpolation or extrapolation has been performed
The individual-predictions (ipred) and population-predictions (pred) have been added for each dependent variable represented in the model as interpolation/extrapolation times
Individual parameters for CL, VC, and KA have been calculated
The amounts in each of the model (Depot and Central) compartments are returned


id	time	evid	conc	amt	cmt	rate	duration	ss	ii	route	wtbl	age	sex	tad	dosenum	conc_pred	conc_ipred	η₁	η₂	Depot	Central

1	0	0	missing	0	missing	0	0	0	0	missing	66.7	50	M	0	1	0	0	0.733	0.205	100	0
1	0	1	missing	100	1	0	0	0	0	NCA.NullRoute	66.7	50	M	0	1	0	0	0.733	0.205	100	0
1	0.5	0	0	0	missing	0	0	0	0	missing	66.7	50	M	0.5	1	3.38	2.74	0.733	0.205	72.9	26.9
1	1	0	1.9	0	missing	0	0	0	0	missing	66.7	50	M	1	1	5.81	4.71	0.733	0.205	53.1	46.2
1	2	0	3.3	0	missing	0	0	0	0	missing	66.7	50	M	2	1	8.8	7.07	0.733	0.205	28.2	69.4
1	3	0	6.6	0	missing	0	0	0	0	missing	66.7	50	M	3	1	10.3	8.2	0.733	0.205	15	80.5
1	4	0	missing	0	missing	0	0	0	0	missing	66.7	50	M	4	1	11	8.68	0.733	0.205	7.96	85.2
1	5	0	missing	0	missing	0	0	0	0	missing	66.7	50	M	5	1	11.3	8.81	0.733	0.205	4.23	86.5
1	6	0	9.1	0	missing	0	0	0	0	missing	66.7	50	M	6	1	11.3	8.77	0.733	0.205	2.24	86
1	7	0	missing	0	missing	0	0	0	0	missing	66.7	50	M	7	1	11.3	8.63	0.733	0.205	1.19	84.6

3 Convergence Trace

The previous topic of this Module, Interpreting Results from Pumas Fits, discussed how to evaluate if a model has successfully converged and explained the cases why a model optimization may terminate. Pumas allows graphical evaluation of how the loglikelihood and gradient norm changed over the iterations using the convergence_trace function:

# Plot the log-likelihood and gradient norm over each iteration
convergence_trace(mod_fit)

Tip

Additional keyword arguments can be passed to convergence_trace to assist with formatting and styles. The keyword arguments and options leverage CairoMakie.jl functionality.

Type ?convergence_trace in the Julia REPL to see how to modify the colors and styles for the lines.

4 Goodness-of-Fit Diagnostics

Pumas has a suite of functions that generate standard goodness-of-fit diagnostic plots.

To generate a standard 2 x 2 panel, the goodness_of_fit function can be used to plot the following diagnostics. Note: each subpanel of the plot is constructed from separate functions of which can also be used to generate the subpanels separately:

Observed versus population-predicted (observations_vs_predictions)
Observed versus individual-predicted (observations_vs_ipredictions)
Residuals versus time
- If nsim was not passed to inspect, then default is population weighted residuals versus time (wresiduals_vs_time)
- If nsim was passed to inspect, then default is normalized prediction distribution errors versus time (npde_vs_time)
Residuals versus predictions
- If nsim was not passed to inspect, then default is individual weighted residuals versus individual-predictions (iwresiduals_vs_ipredictions)
- If nsim was passed to inspect, then default is normalized prediction distribution errors versus population-predictions (npde_vs_predictions)
- Note: An additional variation of residuals versus predictions is available but not included in the result of goodness_of_fit (population weighted residuals versus population predictions; wresiduals_vs_predictions)

In our example, nsim was passed to inspect and therefore, diagnostics displaying normalized prediction distribution errors are presented by default.

# Generate 2 x 2 panel of goodness-of-fit diagnostics
goodness_of_fit(
    mod_pred, # FittedPumasModelInspection
    observations = [:conc],

    # Legend options
    legend = (; position = :bottom, framevisible = false),
)

Tip

Additional keyword arguments can be passed to goodness_of_fit to assist with formatting and styles. The keyword arguments and options leverage AlgebraOfGraphics.jl functionality.

Type ?goodness_of_fit in the Julia REPL to see how to modify the colors and styles for the points, and LOESS and linear regression fits.

5 Individual- and Population-Predicted Time-Courses

Individual- and population-predicted time-courses can be evaluated against the observed data for each individual in the analysis population using the subject_fits function.

Objects from either inspect or predict can be passed to subject_fits, where the latter provides individual- and population-predictions at interpolated or extrapolated time-points and may be useful in sparse sampling scenarios.

fig_id_conc = subject_fits(
    mod_pred; # FittedPumasModelInspection

    # Separate the individuals into their own panels
    separate = true,

    # Observation type to be plotted
    observations = [:conc],

    # Legend options
    legend = (; position = :bottom, framevisible = false),

    # Labels for the legend options
    labels = (;
        data = "Observed",
        pred = "Population Predicted",
        ipred = "Individual Predicted",
    ),

    # Options to print plots over several pages to avoid crowding
    paginate = true,
)

fig_id_conc = subject_fits(
    mod_extrap; # Prediction

    # Separate the individuals into their own panels
    separate = true,

    # Observation type to be plotted
    observations = [:conc],

    # Legend options
    legend = (; position = :bottom, framevisible = false),

    # Labels for the legend options
    labels = (;
        data = "Observed",
        pred = "Population Predicted",
        ipred = "Individual Predicted",
    ),

    # Options to print plots over several pages to avoid crowding
    paginate = true,
)

Tip

Additional keyword arguments can be passed to subject_fits to assist with formatting and styles. The keyword arguments and options leverage AlgebraOfGraphics.jl functionality.

Type ?subject_fits in the Julia REPL to see how to modify the colors and styles for the points, and individual- and population-predicted lines.

6 Empirical Bayes Estimates Distributions

Pumas plotting functions are available to make assessments of normality of empirical Bayes estimates (EBE) distributions.

6.1 Assessment of Normality

Histograms of EBEs from the FittedPumasModel can be generated using the empirical_bayes_dist function.

# Plot histograms of each of the EBEs from the model output
testplot = empirical_bayes_dist(
    mod_pred,  # FittedPumasModelInspection
)

Tip

Additional keyword arguments can be passed to empirical_bayes_dist to assist with formatting and styles. The keyword arguments and options leverage CairoMakie.jl functionality.

Type ?empirical_bayes_dist in the Julia REPL to see how to modify the colors and styles for the histogram and the vertical line aesthetics.

6.2 Correlation of EBEs

Pairwise correlation plots assessing the correlation between EBEs is best achieved using the PairPlots.jl package and the pairplot function. This package uses the Makie plotting library similar to the in-built Pumas plotting functions.

A pairwise plot can be generated using pairplot. This example code below is just to demonstrate one methods for creating pairwise plots in Julia and it is highly recommended to use help queries where possible, i.e., ?pairplot, and review the PairsPlot.jl Guide. Note: this requires the DataFrame format of the inspect output.

# Create a DataFrame storing just the EBE information based on the
# FittedPumasModelInspection
ebes = @chain mod_pred_df begin
    # Select only columns that contain η
    select(r"^η")
    # Exclude any missing values
    dropmissing
end

# Construct the pairwise plot
pairplot(
    # Specify the input DataFrame
    ebes => (
        # Specify the elements that should be presented on the off-diagonals
        # Scatterplot with correlation and trendline
        PairPlots.Scatter(
            marker = '∘',
            markersize = 24,
            alpha = 0.5,
            color = ColorSchemes.tab10.colors[1],
        ),
        PairPlots.TrendLine(color = :red),
        PairPlots.PearsonCorrelation(fontsize = 14, color = :blue),

        # Specify elements that should be presented on the diagonals
        # Histogram
        PairPlots.MarginHist(color = ColorSchemes.tab10.colors[1]),
    ),
    fullgrid = false,
)

Additional comments:

The customization options for the in-built Pumas function, empirical_bayes_dist, are lean. For more layered graphics such as modifying the statistical summary of the histogram or adding a distribution density line, it is recommended to explore AlgebraOfGraphics.jl - Density and CairoMakie.jl - density
There is currently no in-built function that generates a Quantile-Quantile plot. However, AlgebraOfGraphics.jl - Statistical Visualizations and CairoMakie.jl - qqplot provide functionality for generating this plot and can be used for additional assessment of normality of EBE distributions.

7 EBEs versus Covariates

Graphical assessments of EBE versus covariate relationships can be obtained using empirical_bayes_vs_covariates. By default:

Continuous covariates provide a x-y scatter plot and print the Pearson correlation coefficient
Categorical covariates produce a violin plot of the distribution of EBEs for a category

# Plot EBEs versus covariates
empirical_bayes_vs_covariates(
    mod_pred, # FittedPumasModelInspection
)

In the example, the function could determine which covariates were continuous and which were categorical based on the variable types in the Population object when the initial analysis dataset was constructed (the first 10 rows are shown below). Here, sex is a variable with categories M (male) and F (female) and was retained in this state through the Pumas model fitting process and generation of the FittedPumasModelInspection object.

10×10 DataFrame

Row	id	time	evid	amt	cmt	conc	pca	wtbl	age	sex
	Int64	Float64	Int64	Float64?	Int64?	Float64?	Float64?	Float64	Int64	String1
1	1	0.0	1	100.0	1	missing	missing	66.7	50	M
2	1	0.5	0	missing	missing	0.0	missing	66.7	50	M
3	1	1.0	0	missing	missing	1.9	missing	66.7	50	M
4	1	2.0	0	missing	missing	3.3	missing	66.7	50	M
5	1	3.0	0	missing	missing	6.6	missing	66.7	50	M
6	1	6.0	0	missing	missing	9.1	missing	66.7	50	M
7	1	9.0	0	missing	missing	10.8	missing	66.7	50	M
8	1	12.0	0	missing	missing	8.6	missing	66.7	50	M
9	1	24.0	0	missing	missing	5.6	44.0	66.7	50	M
10	1	36.0	0	missing	missing	4.0	27.0	66.7	50	M

By default, empirical_bayes_vs_covariates treats all covariates as continuous unless otherwise specified by the categorical keyword argument.

Tip

Additional keyword arguments can be passed to empirical_bayes_vs_covariates to assist with formatting and styles. The keyword arguments and options leverage CairoMakie.jl functionality.

Type ?empirical_bayes_vs_covariates in the Julia REPL to see how to modify the colors and styles for the scatter, violin plots, and horizontal line aesthetics.

8 Residual Distributions

In-built Pumas functions are available for assessing the normality of residual distributions similar to empirical_bayes_dist. These include wresiduals_dist and npde_dist for population/individual weighted residuas and normalized prediction distribution errors, respectively.

# Plot histograms of weighted residuals from the model output
wresiduals_dist(
    mod_pred, # FittedPumasModelInspection
)

# Plot histograms of normalized prediction distribution errors from the model output
npde_dist(
    mod_pred, # FittedPumasModelInspection
)

Tip

Additional keyword arguments can be passed to wresiduals_dist and npde_dist to assist with formatting and styles. The keyword arguments and options leverage AlgebraOfGraphics.jl functionality.

Type ?wresiduals_dist and ?npde_dist in the Julia REPL to see how to modify the colors and styles for the histogram and the vertical line aesthetics.

9 Summary

The Pumas and PumasUtilities packages have several functions for generating goodness-of-fit diagnostics using FittedPumasModel output. The plotting functions accommodate modifications of the aesthetics, however, for full customization it is recommended to generate customized plots using AlgebraOfGraphics.jl and the inspect output converted to a DataFrame output.